Qureos

FIND_THE_RIGHTJOB.

Cloud Engineer

JOB_REQUIREMENTS

Hires in

Not specified

Employment Type

Not specified

Company Location

Not specified

Salary

Not specified

Summary of the Job

An Cloud Engineer needs to have excellent AWS/Cloud services support experience.Development and integration services enable retailers, hospitality, and service organisations to create engaging and seamless customer experiences. Using the combination of cloud platform, endpoint integration application and development capabilities. You will work closely with our existing development on Infrastructure teams to fully establish and drive forward Cloud Ops SaaS and application support and best practices in the company.

Experience in a similar role with an emphasis on a high-pressure 24x7x365 ops monitoring and infrastructure support environment. Establishing and reviewing configuration management, automating our infrastructure, implementing continuous integration, and complimenting the team in DevOps best practices to help achieve a continuously deployable system. You will be part of a 24x7 shift-based team monitoring and resolving incidents both on our SaaS platform and deployed applications when required.

You will be part of a team set up for success, responding to customer generated issues raised to the Cloud Ops support team via the Service Desk, and will be working collaboratively to find resolutions for the customer.

You will assist in identifying production issues and implementing integrations that meet customer needs and help put in place the processes to build scalable, efficient cloud infrastructure.

You’ll be involved in implementing and improving monitoring for automated system health checks using tools such as Datadog and Splunk, as well as be helping to implement observability into the company to ensure real-time resolution and pre-emptive identification of issues.

Key Responsibilities

Reporting to the Cloud Architect, you will be expected to:

  • Implement and maintain monitoring and alerting
  • Perform root cause analysis for production errors
  • Perform Anomalies detections
  • Good understanding of Change detection
  • Investigate and resolve technical issues
  • Design procedures/searches for system troubleshooting and maintenance
  • Build and maintain highly available production systems
  • Support the design of cloud infrastructure that is secure, scalable, and highly available on AWS
  • Work collaboratively with software engineering to define infrastructure and deployment requirements
  • Provision, configure, and maintain cloud infrastructure defined as code
  • Ensure configuration and compliance with configuration management tools
  • Troubleshoot problems across a wide array of services and functional areas
  • Build and maintain operational tools for deployment, monitoring, and analysis of AWS infrastructure and systems
  • Monitor deployments for updates and fixes
  • Build the automation scripts to be used for updates and fixes
  • Perform infrastructure cost analysis and optimisation
  • Build environments adhering to strict security requirements
  • Migrate legacy environments to the Cloud

Skills & Experience | Essential

  • Cloud Platform Expertise
    • Understanding of major cloud platforms: In-depth knowledge of at least one major cloud platform such as Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform (GCP).
    • Cloud-native tools: Familiarity with tools specific to the platform, like AWS CloudFormation, Azure Resource Manager (ARM), or Google Cloud Deployment Manager.
  • Infrastructure as Code (IaC)
    • Automation of infrastructure: Expertise in IaC tools like Terraform, Ansible, or Chef to automate the provisioning and management of cloud resources.
  • Containerisation and Orchestration
    • Docker and Kubernetes: Proficiency in Docker for containerization and Kubernetes for container orchestration.
    • CI/CD pipelines: Knowledge of integrating containers into Continuous Integration/Continuous Deployment pipelines.
  • Monitoring and Logging
    • Monitoring tools: Experience with monitoring solutions like DataDog, Nagios, or AWS CloudWatch to ensure cloud services are operating optimally.
    • Log management: Skills in using logging tools like Splunk, to collect, analyse, and act on log data.
  • Networking and Security
  • Scripting and Automation
  • Disaster Recovery and Backup Management
  • Cost Management and Optimization
  • Incident and Problem Management
  • Collaboration and Communication
  • Experience in working in a 24x7x365 high up-time, high pressure environment
  • Holding one or more of the following AWS qualifications:
    • AWS Cloud Practitioner
    • AWS SysOps Administrator
    • AWS DevOps Engineer

Similar jobs

No similar jobs found

© 2025 Qureos. All rights reserved.