Job Purpose:
The Digital Operations Engineer manages the Bank’s Oracle Cloud Infrastructure and Kubernetes environments to ensure stability, security, and scalability of digital banking services. He/She maintains 99.9%+ service uptime, optimizes infrastructure performance and cost, and enforces CI/CD best practices across digital platforms. The Digital Operations Engineer implements monitoring and observability tools, builds infrastructure as code, and responds to incidents with precision to minimize disruption. He/She enables development teams with robust cloud-native solutions that support the Bank’s innovation and digital transformation goals.
Key Accountabilities:
-
Manages and optimizes the Oracle Cloud Infrastructure (OCI) environment to ensure stability, scalability, and cost-efficiency.
-
Deploys and maintains Oracle Kubernetes Engine (OKE) clusters for containerized workloads.
-
Implements infrastructure as code using Terraform to enable consistent and repeatable infrastructure provisioning.
-
Builds and maintains CI/CD pipelines for React Native, Flutter, React, and Node.js applications, supporting automated deployment workflows.
-
Monitors production systems using Prometheus, Grafana, and OCI Monitoring to ensure proactive performance tracking and incident detection.
-
Responds to production incidents, performs root cause analysis, and supports on-call incident handling rotation.
-
Implements and tracks Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets to uphold service reliability standards.
-
Manages Oracle Autonomous Database instances, including performance tuning and query optimization.
-
Configures OCI networking components such as Virtual Cloud Networks (VCNs), load balancers, and security groups to ensure secure and efficient connectivity.
-
Implements backup, restore, and disaster recovery procedures to support business continuity and data protection.
-
Optimizes cloud resource utilization and cost efficiency through proactive rightsizing and monitoring.
-
Configures auto-scaling policies to ensure high availability of applications and infrastructure.
-
Manages secrets and encryption keys using OCI Vault to enforce secure credential management.
-
Implements centralized logging using the ELK stack for efficient log aggregation and analysis.
-
Supports mobile application packaging and deployment to the Apple App Store and Google Play Store.
-
Collaborates with development and testing teams to execute blue-green and canary deployment strategies.
-
Troubleshoots infrastructure and application performance issues to minimize downtime and improve user experience.
-
Creates and maintains detailed operational documentation and runbooks for repeatable processes and incident resolution.
-
Utilizes AI-powered tools for infrastructure design validation, performance insights, and troubleshooting.
-
Ensures cross-platform CI/CD pipelines are maintained across all major digital touchpoints, including web, mobile, and backend services.
Qualifications and Experience:
-
Bachelor's degree in Computer Science, IT, or a related field.
-
Professional certification in Oracle Cloud Infrastructure and Kubernetes Administration.
-
Minimum of 5 years of experience in DevOps, SRE, or cloud infrastructure experience
-
Hands on experience in Infrastructure as Code (Terraform), CI/CD Pipeline (Web and/ or mobile applications), Oracle Cloud Infrastructure and Kubernetes Administration.
-
A proven track record of executing similar technical mandates in Banking, fintech, or a regulated industry.
-
Ability to work under pressure during production incidents with on-call and rotation-based availability.
-
Familiarity with mobile app deployment processes (iOS/Android) is desired.
Key Skills & Competencies:
-
Strong Linux/Unix system administration skills
-
Excellent troubleshooting and problem-solving skills
-
Strong communication and documentation abilities
-
Experienced across multiple language ecosystems (Node.js npm, Dart pub)
-
Advanced Oracle Cloud Infrastructure (OCI Architect, Operations) skills
-
Excellent Kubernetes Administrator (CKA, CKAD) skills.
-
Skilled in Oracle Autonomous Database administration components.
-
Experienced level of creating and managing Helm charts for Kubernetes
-
Good abilities in using service mesh technologies (Istio, Linkerd)
-
Hands-on experience with GitOps tools (ArgoCD, Flux)
-
Strong capacity planning and forecasting skills
-
Good expertise in cloud cost optimization strategies
-
Experienced in disaster recovery planning
-
Advanced skills in banking or financial services infrastructure
-
Proficient with APM tools (New Relic, Datadog, Dynatrace)
-
Good Operational knowledge of message queues (Kafka, RabbitMQ)
-
Skilled in load testing tools (k6, Gatling)
-
Experienced skills in deploying mobile apps via App Store Connect and Google Play Console.
-
Good skills in build systems (npm for React Native/Node.js, pub for Flutter)
Applicants who are meeting the job requirements will be contacted.
** Applications will be accepted until 2-Nov-2025 at 2:00 P.M
Submissions received after this date and time will not be considered **