Work History
2021-09 – Current: Sr. Technical Lead, HCL Technologies
SaaS Data Engineering (Sep 2021 – Present)
Description: Extract, enrich, and validate data from multiple sources and generate metrics in Hive so that other teams can build BI reporting dashboards and gain insight into software usage within the organization. Sources include, but are not limited to, Zoom, Slack, and ServiceNow.
Technologies: AWS, Hive, PySpark, Python, Airflow, Spark SQL
● Understand business requirements from the client and help capture them in Jira.
● Develop PySpark code to read data from S3 and load it into Hive tables.
● Create Hive views on top of extracted data for business needs.
● Analyze source data and implement efficient solutions.
● Automate processes and perform file-level processing using Unix shell scripts.
● Follow agile methodologies for smooth execution of all phases of development.