- Developed the algorithm and implemented the solution for a recommender system used in a media-domain product.
- Developed the reporting system for a data analysis platform, designed with future advanced-analytics requirements in mind, including machine learning and mathematical formulations.
- Developed a data analysis platform for my current project using a microservice architecture. Within it, I single-handedly built the Data Integration (DI) solution on Apache Airflow with a layered architecture. The DI pipeline exports data from Amazon Redshift to an AWS S3 bucket, migrates the load files to GCS (Google Cloud Storage), and loads them into the LNRR dataset tables using the load-truncate approach. Data extracted from the LN tables during this process is inserted into the appropriate tables of the OD dataset, with rules that avoid duplicates. Tables in the downstream datasets [IN, TG, VW] are then refreshed through SQL operations with the new data available in OD. Developed most of the metrics shown in the front end, which cover the radio domain. Received a spot award for this work. Currently managing the project.
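
The load-truncate and duplicate-avoidance steps described above can be sketched roughly as follows. This is a minimal illustration only, not the project's actual code: it uses an in-memory SQLite database as a stand-in for the warehouse, and the table names (`staging_plays`, `od_plays`), the columns, and the `play_id` key are all hypothetical.

```python
import sqlite3


def create_tables(conn):
    """Create a hypothetical staging table and a hypothetical target table."""
    conn.executescript(
        """
        CREATE TABLE staging_plays (play_id INTEGER, station TEXT, played_at TEXT);
        CREATE TABLE od_plays (play_id INTEGER, station TEXT, played_at TEXT);
        """
    )


def load_truncate(conn, rows):
    """Replace the staging table's contents with the latest extract."""
    with conn:  # one transaction: empty the table, then reload it
        conn.execute("DELETE FROM staging_plays")  # SQLite has no TRUNCATE
        conn.executemany(
            "INSERT INTO staging_plays (play_id, station, played_at) VALUES (?, ?, ?)",
            rows,
        )


def merge_new_rows(conn):
    """Copy staged rows into the target, skipping keys that already exist there."""
    with conn:
        conn.execute(
            """
            INSERT INTO od_plays (play_id, station, played_at)
            SELECT DISTINCT s.play_id, s.station, s.played_at
            FROM staging_plays AS s
            WHERE NOT EXISTS (
                SELECT 1 FROM od_plays AS t WHERE t.play_id = s.play_id
            )
            """
        )


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_tables(conn)
    # First batch, then a second batch that overlaps on play_id 2.
    load_truncate(conn, [(1, "A", "2020-01-01"), (2, "B", "2020-01-01")])
    merge_new_rows(conn)
    load_truncate(conn, [(2, "B", "2020-01-01"), (3, "C", "2020-01-02")])
    merge_new_rows(conn)
    print(conn.execute("SELECT play_id FROM od_plays ORDER BY play_id").fetchall())
```

In this sketch the staging table only ever holds the most recent batch, while the target table accumulates rows and ignores any key it has already seen, so re-running a batch does not create duplicates.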
Completed a Big Data assignment in which I:
1. Set up an Apache Hadoop environment on my machine, including Apache Hadoop 2.7.3 (HDFS), HiveServer2, Apache NiFi, and the ELK stack (Elasticsearch, Logstash, Kibana).
2. Built a data flow in Apache NiFi that ingests data from Facebook into Hadoop.
3. Inserted the same data into Elasticsearch.