2013-2017
B.tech(Compute Science and Engineering)
Rajasthan Technical Univesity
B.tech(Compute Science and Engineering)
Rajasthan Technical Univesity
(Machine Learning and Artificial Intelligence)
IIIt - Bangalore
Experience Dream11 | Mumbai, Maharashtra SDE-2, Data Engineer | 07/2022 Developed Streaming Data Platform Created a self-serve Data Streaming Platform to automate company-wide Kafka streaming pipelines using Flink. Designed end to end RestAPIs. Dash-boarding and Monitoring of events, user adoption and churn via Looker ML Re-architecting and optimizing Spark ETL and redshift queries Features: Implemented multiple Kafka, elastic search, Redshift, Athena operators for such as sink, source and transform, alerting and monitoring, edit and update pipelines, data preview. Impact : Reduced approx. 100hrs of manual efforts. Technologies: Java Spring Boot, Kafka, Flink, RestAPIs, Redshift, Apache Spark, AWS, AWS S3, Python, Postman, Github Microsoft | Hyderabad, Telangana Data Engineer | 07/2017 - 07/2022 Usage, Compute and Capacity Planning of Azure Data Centers Redesigned and revamped the legacy product for Azure data centers’ usage, compute and planning with updated SLA’s, monitoring, optimizing and scoping of business logic. Impact: Response rate improved by 20%, compute and capacity made available to all users 20% in advance before hitting the bottleneck. Technologies: SQL Server, Cosmos, Python, Azure Data Factory, Power BI Predicting Requirement Estimation and Sales Pricing Lead an offshore team of 5 into design and architecture discussion with customer and cross time zone teams. Along with translating business requirements into features and tasks while mentoring and supporting them on their various tasks. Established coding, testing, ADO, CI/CD and documentation standards for enhanced quality delivery. Developing various reusable scripts for data parsing/processing, data modeling and designing parameterized/ scalable scheduled ADF Pipelines. Impact: The overall process helped in enhanced customer satisfaction and got another deal signed with the same customer with immediate revenue generation of approx. $400k. Technologies: Azure Databricks(ADB), Azure Delta Lake, Azure Data Factory(ADF), Azure DevOps (ADO), Spark (Pyspark, SparkSql), Python, SQL Server Timeseries Forecasting and Prediction Designed and implemented solution for data preparation and data wrangling for the training of models, predicting weather condition and corresponding resource’s demands (like, gas, engineers) for next two years Created pipelines for automated incremental data load and processing. Technologies: Azure Databricks, Python (PySpark, Pandas), Azure Data Factory, Kubernetes Predictive Rule Engine for Insurance Premium Curating rules and algorithm for Descriptive model for rule engine to predict insurance premiums based on various factors Developed automation testing as per sonar-lint standards of code refactoring. Technologies: Python (Pandas, Behave, Flask), Kubernetes Garima Jain 9521105272 | jain95garima@gmail.com | Mumbai, Maharashtra GDPR Compliance Analysis Designed an Intelligent System for detecting and analyzing the GDPR compliance issues related to critical personal information (bank account number, govt ID, etc.) Curated audio and text extraction from video, designed and trained the model for the same. Technologies: Python, Audio and Text Recognition, SciKit Learn, Power BI