ibex. is seeking a
Multi-Cloud DevOps Engineer
to join our growing team. This pivotal role offers the chance to leverage cutting-edge technologies, champion automation, and significantly enhance infrastructure resilience across diverse cloud environments. If you're a proactive problem-solver with a passion for optimizing complex systems and deploying robust solutions across multiple cloud providers, we'd love to connect with you!
Responsibilities
-
Continuous Integration & Deployment (CI/CD): Design, implement, and continuously improve CI/CD pipelines tailored for multi-cloud and Kubernetes-centric platforms. This includes building and maintaining Docker container images and managing all related dependencies.
-
Multi-Cloud Environment Management: Configure, automate, and maintain critical environments across various cloud infrastructures, including AWS, Azure, Google Cloud Platform (GCP), on-premises Kubernetes clusters, VMware, and containerized setups. A strong focus on process automation is key to boosting efficiency and consistency.
-
Distributed System Management: Deploy and manage large-scale Kubernetes clusters (e.g., Azure AKS, AWS EKS, Google GKE), MySQL clusters, storage solutions (e.g., OpenEBS), ELK stacks, and Redis, ensuring high availability and resilience across cloud boundaries.
-
Automation & Configuration: Utilize powerful tools like Terraform, Ansible, and Puppet to automate environment provisioning and streamline configuration management across our diverse multi-cloud infrastructure.
-
Application Modernization: Collaborate closely with development teams to facilitate the smooth transition of legacy monolithic applications to microservices architectures, leveraging both on-premises and public cloud platforms (Azure, AWS, GCP).
-
Monitoring & Observability: Develop comprehensive dashboards and integrate various components to enhance visibility into multi-cloud infrastructure and application performance.
-
Operational Efficiency: Implement automated solutions for repetitive operational tasks using APIs, CLI tools, and custom scripting, thereby improving efficiency and minimizing manual effort across all platforms.
Qualifications
-
This role requires 5 to 7 years of relevant experience in DevOps, Site Reliability Engineering, or a similar capacity with a significant focus on multi-cloud solutions.
-
A bachelor's degree in a computer-related field or similar experience is preferred.
-
Cloud Platforms: Extensive practical experience with production workloads on at least two of the leading public cloud providers: Microsoft Azure, Amazon Web Services (AWS), and Google Cloud Platform (GCP).
-
Container Orchestration: Expertise with Kubernetes (on-premises or managed services like Azure AKS, AWS EKS, Google GKE) and Docker.
-
Configuration Management & IaC: Skilled in using tools such as Ansible and particularly Terraform for provisioning and managing infrastructure as code across multi-cloud environments.
-
Monitoring & Logging: Experience with industry-standard monitoring and logging tools including Grafana, Prometheus, and the ELK stack.
-
CI/CD Tools: Hands-on experience with robust build systems (e.g., Dockerfiles) and leading CI/CD tools such as GitLab CI/CD or Jenkins.
-
Source Control: Proficient in Git for version control.
-
Scripting: Strong proficiency in scripting languages including Bash and Python.
-
Operating Systems: Demonstrated proficiency in Unix-based environments (e.g., RedHat, Ubuntu).
-
Virtualization & Containerization: Practical experience with VMware and Docker containers.
-
API Interaction: Proven ability to interact with REST APIs for automation and integration.
Nice-to-Have Skills
-
Service Mesh: Deployed and operated service meshes (e.g. Istio, App Mesh, Anthos) in multi-cloud Kubernetes environments.
-
Cloud Cost Management: Understanding of cloud spending optimization across multiple providers (e.g., billing models, reserved instances, basic FinOps concepts).
-
Advanced Observability: Experience with distributed tracing tools (e.g., Jaeger, OpenTelemetry) or more advanced logging aggregation strategies.
-
Cloud Networking Fundamentals: Basic understanding of multi-cloud networking principles (e.g., VPNs, peering, network security groups).
-
Cloud Security Fundamentals: Familiarity with basic security best practices across cloud platforms (e.g., IAM, security groups).
-
Distributed Systems Components: Experience with MySQL clusters, Redis, or similar distributed storage solutions.
-
PowerShell Scripting: Additional proficiency in PowerShell.
-
VMware CLI/API: Experience with VMware CLI or API for automation.