Must Have Technical/Functional Skills
Golang - Working experience (E2 and above)
Delivering quality code
Unit tests
Kubernetes (E2)
Deployments
Statefulsets
Load Balancers
AWS Cloud (E2)
CI/CD
Communication & Team Management
Grafana
Prometheus
Alertmanager
Loki for logs
Ability to push independently
Roles & Responsibilities
Lead Site Reliability Engineer
Key Activities
1. Automation Tools – technology involves – GO , Terraform Cookbook, Chef.
2. Containerization and Microservices – Strong knowledge in Kubernetes & Cloud API
3. Incident Management – Managing and troubleshoot Incidents , JIRA , PagerDuty, ServiceNow
4. Set up Monitoring & Alerts (Observability) – Knowledge in Prometheus, Grafana , ELK
5. Create IAC code using Tools like Gitlab, Ansible , Chef , Nagios, Argo CD
6. Security & Compliance – Managing Security & Compliance policies of Production Environment access
7. Managing AWS/GCP cloud Infra
8. Troubleshooting & build of GO lang code for automation
Generic Managerial Skills:
Communication & Team Management
Ability to push independently
Salary Range: $90,000-$110,000 a year
#LI-DM1