A n k i t
D A T A E N G I N E E R
Highly qualified Data Engineer and trainer with 5 years of professional experience in building Big Data ETL Pipeline , Data Science , Cloud Automation hands-on expertise in streaming and distributed systems for Big Data using spark Coming with solid Software Engineering and programming skills, and experience with ML workflows.
S K I L L S
Programming Languages :- PYTHON | JAVA SCRIPT |SHELL SCRIPT
- Bigdata Technologies :- KAFKA | PYSPARK | Airflow | ETL | Datawarehouse
Cloud:- AWS ( Lambda , Step Function , Glue) | GOOGLE CLOUD | TERRAFORM
- Database: SQL (Postgres, MySQL, POSTGRACE) | NoSQL ( MongoDB , Dynamo Db ) | S3 Bucket
- Web Technologies: HTML | CSS | REST API | DJANGO | FLASK
CI /CD : Git , TeamCity , Visual Studio Code , Jira
- Others :- DATA ANALYTICS | AUTOMATION | ML
W O R K E X P E R I E N C E
Deltacubes Technology
Sr. DATA ENGINEER
- Design and implement data engineering, ingestion and curation functions on AWS cloud using AWS native and custom programming
- Establish, communicate, and advocate best practices and design patterns related to Kafka consumption
- Utilized PySpark to Collected and interpreted large data
- Built and run ETL scripts using python, scrapy , kafka , AWS ,Airflflow
Feb , 2021- Present
- Analyze, re-architect and re-platform on-premise data warehouses to data platforms on AWS cloud using AWS or 3rd party services.
Intelegencia
DATA ENGINEER ( Temporary migration project of sight machine )
Aug ,2020- Feb ,2021
- Develop end to end scalable data cloud-native micro-services, messaging-systems RabbitMQ/Kafka
- Build Data Pipeline for Asian Paint factories using Apache Flink and SM API
- Supported the database design, development, implementation, information storage and retrieval, data flow and analysis activities
- Translated a set of requirements and data into a usable database schema by creating or recreating ad hoc queries, scripts and macros, updates existing queries
- Used Kafka with python for real time data streaming from IOT Device
Apollo TeleHealth Services
DATA ENGINEER
- Managing 7+ Projects across different states in India.
Apr,2019- Aug,2020
- Automated 50+ analytical report using python, pandas, Kafka, MySQL, PandaSql and Selenium.
- Published 10+ clinical and project impact report
- Manipulating, cleansing & processing data using Python, Excel and SQL. Responsible for loading, extracting and validation of client data.
- Analyzing raw data, drawing conclusions & developing recommendations.
- Writing MySQL scripts to manipulate data for data loads and extracts.
- Database Management and Data Modeling
- Using Web scrapping Techniques to get data from multiple Websites.
- Data entry, data auditing, creating data reports & monitoring all data for accuracy.
- Creating interacting dashboards using python flask/django and HTML/CSS.
- Supplying qualitative and quantitative data to colleagues & clients.
- Designing complex dashboards that take advantage of all Tableau functions, including data blending, actions, parameters, and more.
Mildtrix Business Solutions
Software Engineer Trainee
Jan,2018- Mar ,2019
- Worked alongside product managers to re-architect a multi-page web app into a single page web-app built in React
- Built RESTful APIs that served data to our Javascript front-end .
- Built a full-stack web app digi khata accounting and inventory management system
- Used: Javascript, Python, SQL, HTML/CSS, Django to build multiple school
- Build Fees club an online portal to manage school fee management system
E D U C A T I O N
MCA ( Distance )
Suresh Gyan Vihar University
BCA
Ranchi University
XII (Senior Secondary), Science
D.A.V. Bariatu Public school
X (Secondary)
D.A.V. Bariatu Public school
C E R T I F I C A T I O N S
Software Engineering for Data Scientists in Python
2019 - 2021
72%
2015 - 2018
65%
2013 - 2015
71%
2012 - 2013
86%
Financial Forecasting in Python Data Scientist with Python
Advanced Deep Learning with Keras in Python
;• STANSYS
BASE SAS certifification Excel and Adv excel SQL and UNIX
Certifified in AWS Cloud Practitioner By AWS Google Cloud : Create and Manage Cloud Resources Docker essentials by IBM Developer Skills Network Tableau analyst by tableau.