Qureos

Find The RightJob.

Sr. Staff Platform Engineer

At Sumerge, our Platform Engineering team is at the core of our operational excellence, setting up robust infrastructure and integrating cutting-edge DevOps tools across diverse projects and client environments. The team specializes in deploying and managing container orchestration platforms such as Kubernetes and Red Hat OpenShift, and configuring databases such as MongoDB and event-streaming platforms like Kafka to ensure seamless, scalable, and secure operations.

By automating and optimizing our development pipelines, our platform engineers design, implement, maintain, and continuously improve the organization's platforms and infrastructure, ensuring high availability, performance, security, scalability, and operational efficiency.

Join us to be at the forefront of technological innovation and play a pivotal role in driving the success of our digital transformation initiatives.

Responsibilities
    • Design, implement, and maintain CI/CD pipelines for automated build, test, and deployment.
    • Support frequent, reliable, and secure application releases.
    • Provision and manage cloud, on-premises, or hybrid infrastructure.
    • Implement Infrastructure as Code (IaC) using tools such as Terraform, ARM, or CloudFormation.
    • Ensure scalability, availability, and cost optimization of environments.
    • Automate system provisioning, configuration, and operational tasks.
    • Maintain configuration management tools (Ansible, Chef, Puppet, etc.).
    • Design, deploy, upgrade, patch, and maintain container orchestration platforms (Kubernetes, OpenShift or other orchestration platform operations).
    • Manage cluster lifecycle, node scaling, and infrastructure optimization, configure and support OpenShift data foundation (ODF)
    • Configure monitoring and logging and provide support and response for any incident.
    • Deploy, configure, and manage IBM Cloud Pak for Business Automation (CP4BA) components (containers & traditional) including Business Automation Workflow, Business Automation Studio, FileNet Content Platform Engine, Operational Decision Manager, and Automation Document Processing.
    • Deploy, configure, and manage IBM Cloud Pak for Integration (CP4I) components (CP4I) components (containers & traditional).
    • Deploy, configure, and manage Confluent.
    • Deploy, configure, and manage Elasticsearch.
    • Perform day-to-day administration, monitoring, tuning, and patching of all Cloud Pak deployments.
    • Implement monitoring, logging, and alerting for all Cloud Pak services using Prometheus, Grafana, ELK stack, and IBM Cloud Pak System Health.
    • Reduce manual intervention and operational overhead.
    • Build, manage, and optimize containerized applications.
    • Ensure secure and efficient container runtime environments.
    • Troubleshoot issues using logs, metrics, and traces.
    • Manage secrets, access control, and secure configurations.
    • Support vulnerability scanning, patching, and compliance requirements.
    • Work closely with development, QA, and operations teams.
    • Promote DevOps culture, best practices, and continuous improvement.
    • Maintain technical documentation, runbooks, and deployment guides.
    • Support audits and operational reviews.

Requirements

    • B.S. or higher in Computer Science, Engineering, or a related technical field is preferred.
    • 8+ years of experience in platform engineering, system administration, or a related field.
    • Strong Experience with Linux/Windows server administration.
    • Strong Experience with CI/CD pipelines, automation tools (Jenkins, GitLab CI, GitHub Actions), and scripting (Bash, PowerShell, Python).
    • Strong Experience with containers and orchestration (Docker, Kubernetes and OpenShift).
    • Strong Experience with monitoring & observability tools.
    • Strong Experience with IBM Cloud Pak for Integration (CP4I) components is a plus.
    • Strong Experience with IBM Cloud Pak for Business Automation (CP4BA) components is a plus.
    • Strong Experience with Confluent is a plus.
    • Strong Experience with Elasticsearch.
    • Strong Experience with networking fundamentals (TCP/IP, DNS, VPN, load balancers).
    • Strong Experience with security best practices, IAM, secrets management, and compliance.
    • Managing production environments with high availability, scalability, and performance tuning.
    • Incident management, troubleshooting, and root-cause analysis.
    • Managing backups, DR plans, and systems resilience.
    • Strong Experience with SLA/SLO/SI and service reliability practices.
    • Incident management, troubleshooting, and root-cause analysis.

Similar jobs

No similar jobs found

© 2026 Qureos. All rights reserved.