FIND_THE_RIGHTJOB.
JOB_REQUIREMENTS
Hires in
Not specified
Employment Type
Not specified
Company Location
Not specified
Salary
Not specified
Role Proficiency:
Act under guidance of Lead II/Architect understands customer requirements and translate them into design of new DevOps (CI/CD) components. Capable of managing at least 1 Agile Team
Outcomes:
Measures of Outcomes:
Outputs Expected:
Automated components :
Configured components:
Scripts:
Onboard users:
Mentoring:
Stakeholder Management:
Training/SOPs :
Measure Process Efficiency/Effectiveness:
Stakeholder Management:
Skill Examples:
Knowledge Examples:
Site Reliability Engineer (SRE) - Cloud-Native Services (Final) About the Role We are seeking a highly motivated and experienced Site Reliability Engineer (SRE) to join our core engineering team. This role is critical to maintaining the reliability, performance, and scalability of our modern, cloud-native application ecosystem. The ideal candidate will possess a strong blend of software engineering skills and deep operational knowledge, dedicated to reducing toil and driving system improvements through automation, with a specific focus on security compliance and SLA-driven operational excellence. ________________________________________ Key Responsibilities Deployment Automation (CI/CD): Develop, maintain, and automate robust deployment pipelines using tools like Jenkins, SonarQube, and Maven/ANT, ensuring fast, reliable, and safe production rollouts. Compliance & Security Remediation: Own the process of analyzing, prioritizing, and remediating vulnerabilities/findings identified in security scanning tools (Qualys, Sentinel One, Wiz). Ensure continuous compliance across multiple regions and distinct environments (NPE/PROD), specifically maintaining or exceeding the target compliance rate of 95%. Infrastructure Management: Manage, scale, and secure cloud infrastructure using CloudFormation and other IaC best practices. This includes implementing and automating AMI Rehydration processes. Incident Response & Management: Act as a primary responder during critical events, participating in on-call rotations using PagerDuty. Triage and resolve incidents originating from multiple geographies and business units quickly and effectively to ensure resolution within defined Service Level Agreements (SLAs). Conduct thorough post-mortems. Operational Excellence & Toil Reduction: Maintain and improve service availability, latency, and efficiency. Design, develop, and implement solutions to automate repetitive manual tasks. Observability: Implement and manage comprehensive monitoring, logging, and ing solutions (SLOs/SLIs) using tools like ELK, Splunk, and DataDog. ________________________________________ Required Qualifications Experience & Methodology 5+ years of experience working as an SRE with full project lifecycle experience. Experience in configuring, building, and supporting applications and operations in a public cloud environment (AWS, GCP, Azure). Strong exposure to Agile and Scaled Agile based development models. Demonstrated ability to work effectively in a fast-paced, high-volume, deadline-driven environment. Technical Skills Cloud Infrastructure: Good knowledge of cloud infrastructure (cloud services, security, IAM, VPC), and provisioning tools like CloudFormation, Terraform, or Ansible. Containerization & Orchestration: Expertise with Kubernetes (EKS), and container scheduler services such as ECS or GKE/Docker. Compliance & Platform: Demonstrated knowledge of the compliance process and remediation experience with Qualys, Sentinel One, and Wiz Reports, and practical experience with AMI Rehydration. Programming & Scripting: Excellent coding skills in at least one high-level language (e.g., Python, Go, Java) and scripting languages such as Unix Shells, Perl, Shell, bash, ksh. Experience in one or more of the following: .NET based app development; Java based app development. CI/CD & SCM Ecosystem: Extensive experience with continuous integration tools and Source Code Management (SCM): CI Tools: Jenkins, SonarQube, JIRA, Nexus, Confluence, Maven/ANT, Gradle. SCM: Experience performing source code control management using Bitbucket/GIT (branching, merging, tagging, etc.). Configuration Management: Experience in automation using Chef, Puppet or another SCM tool. Monitoring & Logging: Experience with tools like Elastic Search, ELK, Data Dog, PagerDuty, AppDynamics, Splunk, etc.
Aws,CI,CD,Scripting
Similar jobs
Tata Consultancy Services (TCS)
Hyderabad, Pakistan
6 days ago
Qualitest
Hyderabad, Pakistan
6 days ago
Tata Consultancy Services (TCS)
Hyderabad, Pakistan
6 days ago
Tata Consultancy Services (TCS)
Hyderabad, Pakistan
6 days ago
Capgemini Engineering
Hyderabad, Pakistan
6 days ago
Lloyds Technology Centre
Hyderabad, Pakistan
11 days ago
© 2025 Qureos. All rights reserved.