Qureos

FIND_THE_RIGHTJOB.

Site Reliability Officer

JOB_REQUIREMENTS

Hires in

Not specified

Employment Type

Not specified

Company Location

Not specified

Salary

Not specified

This role is ideal for candidates eager to grow in a collaborative environment focused on system resilience and continuous improvement. Accountabilities Assist in monitoring system performance, availability, and reliability across services. Support the development and maintenance of automation scripts to reduce manual operational tasks (toil). Learn and contribute to the implementation of safe deployment strategies and basic change management practices. Participate in incident response efforts by helping investigate and document issues, sometimes in an on-call capacity, depending on the incident. Collaborate with team members to improve observability through dashboards, logs, and alerting tools. Contribute to writing and maintaining technical documentation and playbooks. Support capacity monitoring and learn how to identify potential scaling or cost issues. Work within an agile team environment, actively participating in team meetings and reviews. Stay up to date with trends in SRE, DevOps, and cloud computing through training and mentorship. Education & Experience 4+ years relevant experience in a related field Bachelor s degree in Computer Science, Information Technology, Engineering, or a related field (or recent graduate/final-year student). Basic knowledge of scripting or programming in languages such as Python, Go, or Bash. Familiarity with Linux/Unix operating systems and system administration concepts. Understanding of networking, cloud services (e.g., AWS, Azure, or GCP), and version control (e.g., Git) is a plus. Interest in learning about CI/CD pipelines, monitoring tools, and infrastructure as code (IaC). Strong analytical and problem-solving mindset with attention to detail. Good communication and teamwork skills, with a willingness to learn from peers and mentors.

© 2025 Qureos. All rights reserved.