
Data Engineer

No. of Positions: 4

Location: Remote

Tentative Start Date: October 24, 2021

Work From: Any Location

Rate: $12 - $14 (Hourly)

Experience: 2 to 5 Years

Job Applicants: 4
Job Views: 344
Job Category: Information Technology & Services
Duration: 3-6 Months
Key Skills: PySpark, Python, Spark, Hadoop, Hive, HDFS

Skills Required: PySpark, Python, SQL, Spark SQL, Hive, HDFS, Spark

Roles and Responsibilities:

  • Develop and maintain applications with PySpark.
  • Contribute to the overall design and architecture of the applications developed and deployed.
  • Tune performance: executor sizing and other environment parameters, code optimization, partition tuning, etc.
  • Interact with business users to understand requirements and troubleshoot issues.
  • Implement projects based on functional specifications.

Must-Have Skills:

  • Good experience in PySpark, including DataFrame core functions and Spark SQL.
  • Good experience with SQL databases, including the ability to write fairly complex queries.
  • Excellent experience in Big Data programming for data transformation and aggregation.
  • Good grasp of ETL architecture, including business-rules processing and data extraction from the data lake into data streams for business consumption.
  • Exposure to Scala and Apache Airflow is desirable.
  • Good customer communication skills.
  • Good analytical skills.

Copyright © Cosette Network Private Limited. All Rights Reserved.