DevOps Engineer [SRE]
Location : Bengaluru, India
Experience : 5.2 Years
Rate: $25 / Hourly
Availability : 2 Weeks
Work From : Any
Category : Information Technology & Services
PROJECT DETAILS: -
Project name: NSA-AIP [Artificial Intelligence Platform]
Account: visa-credit cards-banking
Designation: Senior DevOps Engineer [SRE]
Tools used: Jenkins/CloudBees CI, Bitbucket/Git/GitHub, Docker, Kubernetes, SonarQube, Nexus IQ, Checkmarx, Shell, Bash, JFrog, Splunk, Prometheus, Grafana
Bigdata Tools: Kafka, Flink, Hadoop, Spark, HDFS, Redis, Nomad (HashiCorp), S3, PySpark, TensorFlow, PyTorch
Operating systems: Apple Mac, Red Hat, Ubuntu, CentOS
Languages: Python, Go, Java, C++, Scala, Groovy
AI/ML: Neural networks, AI, ML
Environment: Linux, Windows, Apple Mac, AWS cloud
Company Name: VISA, Bangalore, payroll
xxxxxx Limited 22/4/2022 to present
Job Responsibilities: -
∙ SRE activities: worked between development and operations as a software developer with additional operations responsibilities
∙ Application deployments with continuous integration and continuous delivery (CI/CD) pipelines, operating system patching, and maintenance
∙ Continuous integration and continuous delivery, monitoring, incident response, alerts, automation and tooling, capacity planning, collaboration
∙ CI/CD pipelines for continuous build and deployment, with JUnit and SonarQube plugins integrated in Jenkins for automated testing and code quality checks, using SonarQube, Checkmarx, and Nexus IQ
∙ SRE has emerged as a crucial discipline that bridges the gap between software development and IT operations, with a focus on achieving high availability, scalability, and resilience of services
∙ Took part in on-call rotations, available 24/7 to respond promptly to incidents and emergencies. This requires a deep understanding of the system’s architecture and a cool-headed approach to problem-solving under pressure
∙ An incident postmortem brings teams together to take a deeper look at an incident and figure out what happened, why it happened, how the team responded, and what can be done to prevent repeat incidents and improve future responses
∙ Site reliability engineers spend no more than half their time performing manual IT operations and system administration tasks—analyzing logs, performance tuning, applying patches, testing production environments, responding to incidents, conducting postmortems—and spend the rest of their time developing code that automates those tasks.
∙ Infrastructure optimization, including right-sizing cloud resources and adding or removing resources such as central processing unit (CPU) and random-access memory (RAM) as needed
visa-credit cards-banking
∙ Experience with Jenkins CI/CD pipelines that push all microservice builds to the Docker registry and deploy them to Kubernetes; created and managed Pods using Kubernetes
∙ Created Jenkins CI/CD pipelines for continuous build and deployment, and integrated JUnit and SonarQube plugins in Jenkins for automated testing and code quality checks
∙ Used Docker and Kubernetes to manage microservices for continuous integration and continuous delivery
∙ Deployed Prometheus with Grafana to monitor the Kubernetes cluster and configured alerts that fire when defined conditions are met
∙ Good experience with Helm chart deployments and writing manifest files to deploy K8s objects/microservices
∙ Experienced in infrastructure and application monitoring using Prometheus, Grafana, Splunk, Nagios, and CloudWatch; integrated various exporters for Prometheus to scrape metrics for visualization in Grafana
∙ Experience installing, configuring, and managing the Jenkins CI tool on Linux machines, and using Jenkins Pipelines to push all microservice builds to the Docker registry (ECR) and deploy them to Kubernetes; built and managed Pods using Kubernetes
∙ Experience using Kubernetes to deploy, scale, load balance, and manage Pods across multiple namespaces and versions using Helm charts
∙ Built end-to-end CI/CD pipelines in Jenkins: integrating SCM, compiling source code, running tests, and pushing build artifacts to Nexus
∙ Managed Git configuration, branching, and merging; resolved conflicts and pushed changes
∙ Managed Kubernetes deployments and objects for high availability and scalability using the Horizontal Pod Autoscaler and resource management
∙ Set up the Nginx Ingress controller to manage the ingress/egress routing rules for Kubernetes
∙ Strong experience in managing the Kubernetes environment for scalability, availability, and zero downtime
∙ Experienced in building and deploying Java and Python applications to different environments such as QA, UAT, and Production
∙ Experience working on AWS and its services such as VPC, EC2, IAM, ECS, EBS, RDS, S3, Lambda, ELB, Auto Scaling, Route 53, API Gateway, CloudWatch, CloudTrail, SQS, and SNS
∙ Experience writing Python, YAML, Groovy, Shell, and Bash scripts to automate tasks
∙ Experience creating Terraform templates, using modules, to provision AWS services such as VPC, NAT, S3, RDS, EKS, ECR, EC2, EBS, ELB, ASG, SQS, SNS, and IAM
∙ Experience using build automation tools such as Maven to build deployable artifacts (JAR, WAR) from source code
∙ Strong CI/CD experience using Jenkins for end-to-end build automation and deployments
finance department
∙ Designed, deployed, and managed scalable and highly available systems on AWS
∙ Proficient in capacity planning and architecture design for continuous integration and deployment (CI/CD) infrastructure on AWS and VMware cloud
∙ In-depth knowledge of AWS resources such as EC2, VPC, EBS, AMI, and S3 buckets, creation of subnets, and other operational tasks
∙ Strong experience with infrastructure as code and CI/CD implementations
∙ Performed capacity planning and architecture design for AWS and VMware infrastructure
∙ Worked with CI/CD tools such as GitHub, Ansible, Jenkins, Docker, and Kubernetes
∙ Understanding needs and requirements from the clients, finalizing the design and solution with technical team
∙ Leading project planning sessions, Coordinating staff and internal resources
∙ Managing project progress and adapting work as required, ensuring projects meet deadlines
∙ Deploying the code to test environments using Jenkins as the automated deployment tool
∙ Implementing automation tools and frameworks for automated code deployment (CI/CD)
∙ Responsible for creating Artifacts using Maven and deploying those artifacts into Tomcat
∙ Responsible for Managing AWS IAM Users, Policies, Groups etc.
UBS (BANK-DOMAIN)