Jr Data Engineer
Location : Ahmedabad, India
Experience : 1 Year
Rate: $7 / Hourly
Availability : 1 Week
Work From : Offsite
Category : Information Technology & Services
Project Experience Store-Sales-Project Duration: 2 Weeks It is the ETL POC project in Which we have stored data files which first go for cleaning and after that, the Transformed file send to the respective user's Mail Responsibilities:
• Design and implement flow.
• Convert Unstructured Data to structured
• Discuss and resolve bugs and implement new features. Environment: Visual Studio Code, Git, Docker
Technology: Python, MySQL and Postgres Database, Docker, Apache Airflow Streaming Data To S3
Duration: 1 Month It is the Real Time pipeline (POC Project) in which there are multiple sources (Mongo, Postgres, MySQL, files) from which continuous data are shared, which will be handled by CDC Debezium. After that, the Kafka broker stores that data in respective topics. From topics that data will be structured through Ksql or spark and finally send that data to the S3 Bucket