About us:
100ms is building a Platform-as-a-Service for developers, integrating video-conferencing experiences into their apps. Our SDKs enable developers to add gold-standard audio-video quality conferencing with much faster shipping times. 100ms allows shipping live conferencing application time in days instead of months, allowing companies to focus on their core business.
We are a team uniquely placed to work on this problem. We have built a world-record scale live video infrastructure powering billions of live video minutes in a day.
We are a hybrid team with engineers who've built video infrastructure at Facebook and Hotstar.
We’re looking for an experienced DevOps Engineer to join our high-performing infrastructure team. You’ll help manage and scale our mission-critical systems deployed on Google Cloud Platform, ensuring high availability, performance, and security for thousands of concurrent users.
-
Own and operate production infrastructure, multiple GKE clusters with HA, autoscaling, and observability.
-
Manage GitOps workflows using Argo CD for automated, version-controlled deployments.
-
Maintain and optimise monitoring & alerting stacks using Prometheus, Grafana, and Loki.
-
Implement infrastructure as code using Terraform for GCP resources and Kubernetes manifests.
-
Lead or support incident response, cluster upgrades, and disaster recovery procedures.
-
Computer Science/Engineering or equivalent practical experience
-
Minimum 3 years of hands-on experience with Kubernetes in a production environment.
-
Strong knowledge of CI/CD pipelines and GitOps workflows using Argo CD or similar.
-
Proficient in infrastructure automation using Terraform and Helm.
-
Experience managing monitoring/logging stacks (Prometheus, Loki, Grafana, Alertmanager).
-
Comfortable with Linux systems, shell scripting, and basic networking.
-
Prior experience with handling large infrastructure
-
Knowledge of secrets management tools (e.g., HashiCorp Vault, Sealed Secrets).
-
Prior experience with handling GCP and GKE
-
Experience with open source contribution
-
Ability to speak and write in English fluently and idiomatically
-
Strong inclination to keep up-to-date with latest trends, learn new concepts, or contribute to open-source projects and would be eager to talk about ideas in internal or external forums
-
You will be part of a small team at a fast-growing engineering-first startup
-
You will work with engineers across the globe with experience in video at places like Facebook and Hotstar
-
You can grow as an individual contributor or as a team leader - freedom to set your own goals
-
You will work on problems at the cutting-edge of real-time video communication technology at a massive scale
-
At 100ms, we place a strong emphasis on in-office presence to promote collaboration and strengthen company culture.
-
Under the current policy, employees are expected to work from the office at least three days a week—Tuesday, Wednesday, and Friday—as an essential part of their role.