Senior DevOps Cloud Engineer (AWS Production Launch – Contract)
Availability: Must align with US business hours
Mandatory: Full availability during launch phase
About the Role
We are seeking a Senior DevOps Engineer to oversee our first production deployment on AWS and ensure infrastructure reliability through beta launch.
This is not a build-from-scratch role.
Our internal team has already developed:
- Terraform infrastructure code
- CI/CD pipelines
- Monitoring configuration
We need a production-experienced professional who has successfully taken systems live and understands what to monitor, validate, and safeguard during irreversible production operations.
Your role is to review, validate, supervise, stress-test, and support launch readiness.
Key Responsibilities
Production Deployment Oversight
- Review Terraform production plans and verify staging vs production workspace isolation
- Oversee the first terraform apply in the production AWS account
- Execute and document RDS Multi-AZ failover drill (target: <60-second recovery)
- Verify all 13 ECS Fargate services start healthy with Service Connect mesh connectivity
- Configure PagerDuty integration for production CloudWatch alarms
Architecture Decision – WebSocket at Scale
- Load test 500 concurrent WebSocket connections through ALB
- Evaluate ALB (sticky sessions) vs NLB (TCP passthrough) for real-time GPS updates
- Deliver a written recommendation supported by performance data
Kinesis GPS Pipeline Optimization
- Tune 8-shard Kinesis Data Stream consumer for sustained throughput of 1000+ GPS records/second
- Validate burst scenario handling (500 devices reconnecting with ~300M buffered records)
- Evaluate Amazon Managed Service for Apache Flink (formerly Kinesis Data Analytics) for GPS anomaly detection
Performance Baseline & Optimization
- Analyze RDS Performance Insights data from load tests
- Identify and recommend fixes for top 5 slow queries
- Right-size ECS task CPU/memory allocations based on utilization
Production Readiness & Launch
- Execute 5 incident simulation drills:
  - RDS failover
  - ECS task failure (kill 50%)
  - Redis failover
  - WAF rate limiting scenario
  - Full rolling restart
- Participate in launch-day war room for pilot customer onboarding (150–300 vehicles)
- Deliver post-launch performance report
What This Role Does NOT Include
- Writing Terraform modules
- Building CI/CD pipelines
- Configuring CloudWatch dashboards, WAF rules, or GuardDuty
- Writing application code
This is a production oversight and launch-readiness leadership role.
Required Qualifications
- 5+ years managing production AWS infrastructure
- Hands-on ECS Fargate experience with 10+ microservices
- Experience executing RDS Multi-AZ failover drills in production environments
- Kinesis Data Streams experience at scale (1000+ records/second)
- Strong Terraform proficiency (workspace management, state operations, plan review)
- WebSocket infrastructure experience (Socket.io, ALB/NLB, sticky sessions)
- Incident response and production readiness validation experience
Strongly Preferred
- AWS Solutions Architect Professional or DevOps Engineer Professional certification
- Experience with IoT data pipelines or vehicle telematics
- Amazon Managed Service for Apache Flink (formerly Kinesis Data Analytics) experience
- Blue/Green deployment using CodeDeploy
- PagerDuty setup and on-call configuration
- Multi-tenant SaaS infrastructure experience
Job Type: Full-time
Pay: ₹150,000.00 - ₹200,000.00 per month
Work Location: In person