No. of Positions: 1
Location: New Delhi
Tentative Start Date: July 15, 2022
Work From: Offsite
Rate: $12 - $16 (Hourly)
Experience: 3 to 5 Years
Role Description:
Cloud operations engineers may create processes designed to measure system effectiveness and identify areas for improvement. They may also create processes intended to provide environment security, as well as automated processes to provide information on current specifications. Cloud operations engineers should also stay abreast of new technologies in the field and provide recommendations to organizational management on new solutions. These engineers could oversee the selection of orchestration tooling, as well as compliance audits and reporting. They may be responsible for identifying, correcting, and enhancing important software tools. These professionals could also seek ways to enhance systems operations, with a focus on automation and minimizing cost.
Responsibilities:
Responsible for supporting Cloud service measurement, monitoring, and reporting.
Sound knowledge of engineering and administration to support an exceptionally large, distributed, clustered Splunk environment consisting of search heads, indexers, deployers, deployment servers, heavy/universal forwarders, and the Splunk Enterprise Security premium app.
Part of a 24/7 team providing the first line of Operational Support including event response and recovery efforts.
Deploy and maintain critical applications on cloud-native microservices and serverless platforms.
Improve overall operational quality through common practices and working with engineering, QA, IaaS, and DevOps teams.
Responsible for supporting efforts that improve operational performance and availability of HB Production environments.
Provide technical support for day-to-day operations of critical Cloud Services as part of an operational support rotation. This will require participation in our On-Call rotation.
Able to work in shifts on a rotational basis.
Provide input into the monitoring of systems, applications, and supporting data.
Assist with service deployments to staging & production environments.
Assist with creating and updating runbooks & SOPs.
Desired Skills & Competencies:
• Excellent written and verbal communication skills.
• Experience working with and supporting production-level services within public cloud environments.
• 3 to 5 years of hands-on experience building & supporting large-scale environments.
• AWS and Azure Cloud knowledge (Level = Expert)
• Linux (Level = Expert)
• Scripting and configuration languages (Shell / Bash / YAML) (Level = Expert)
• Terraform (Level = Intermediate)
• AWS - Hands-on experience with RDS, Elasticsearch, EKS, ECR, Load Balancer (preferably ALB), CloudFormation, EMR, CF, IAM, Lambda, Pinpoint, Amplify, CodeBuild, S3, ACM.
• Experience using modern monitoring and alerting tools (Prometheus, Grafana, AWS CloudWatch, Opsgenie, etc.), preferably Splunk Cloud Platform.
• Expert networking knowledge (Switches, VLANs, Firewalls, MPLS, and Security) to assist the Network team in troubleshooting.
• Familiarity with the tools (Jenkins, TeamCity, etc.) and processes used to support a Continuous Integration and Continuous Deployment environment.
• Expert in defining cloud disaster recovery strategies.
• Expert in documenting SOPs and the dos and don'ts of cloud infrastructure.
• Sound knowledge of compliance regulations and standards such as HIPAA, PCI DSS, GDPR, ISO/IEC 27001, NIST, and NERC.