Job Responsibilities
Manage and maintain Azure Kubernetes Services. Establish, deploy, and maintain CI/CD pipelines to automate the build, test, and deployment processes. Investigate and resolve issues related to the application infrastructure, continuous integration, and deployment pipelines. Desired Skills and Experience Roles and Responsibilities
- Design and develop application health dashboards, alerting and notification delivery systems to help with observability of application stack in Azure cloud.
- Respond to incidents, perform root cause analysis, troubleshoot issues, and implement solutions to prevent recurrence.
- Act as gatekeeper for production deployments, participate in the application release cycles and perform production releases.
- Manage, and maintain environments hosting Credit, Swaps & FX FO IT microservices and data lake platform.
- Manage and maintain the lifecycle of core application suite that provide common capabilities such as continuous deployment, observability, and kafka streaming.
- Establish, deploy, and maintain CI/CD pipelines to automate the build, test, and deployment processes adhering to firms audit and compliance policies.
- Migration of on-prem build and deployment projects to adopt existing GitOps, cloud deployment pipeline pattern and branching policies.
- Assist the development teams in containerising, building, and migration of on-prem applications to Azure cloud.
- Setup, manage and maintain central observability solution for on-prem and cloud.
- Identify areas that benefit from automation and build automated processes wherever possible.
- Collaborate with infra teams to provision and manage infra resources required by FO IT development teams in Azure cloud.
- Implement backup and disaster recovery strategies and participate in annual DR tests and assist with executing the DR test plan.
- Create and maintain documentation related to common issues, fixes, deployment/release processes, transfer knowledge among DevOps and support team members to remove any key man dependencies.
Essential Criteria
- 2 to 7 years of experience in a SRE/DevOps role preferably in Investment Banking with solid understanding of both.
- Strong knowledge of DevOps practices, tools, and technologies.
- Experience in working with, managing, and maintaining enterprise scale production application microservice environments, observability tools.
- Strong knowledge of containerization and orchestration of microservices.
- Experience with Docker/Podman, Helm, ArgoCD GitOps tool, Terraform.
- Experience with Azure Kubernetes Service, Azure Storage, and other Azure cloud related technologies.
- Experience with Prometheus, Grafana, Loki, Tempo, Grafana Agent, Azure Monitor logging and observability tools.
- Bamboo CI/CD tools, Bitbucket, GIT.
- Automation scripting (Bash, Powershell, Python).
- Be able to demonstrate a high level of professionalism, organisation, self-motivation, and a desire for self-improvement.
- Ability to plan, schedule and manage a demanding workload. Desirable Criteria
- Good knowledge of Azure Landing zone, Azure networking concepts such as private links.
- Good knowledge or experience in deploying and maintaining Azure Databricks infra.
- Good Java, NodeJs skills.
- Good understanding of Kafka streaming and MongoDB.
- Knowledge of DevSecOps practices.