DevOps Manager
Department Overview: The Solution Engineering department plays a critical role in supporting the company's software development, infrastructure, security, and DevOps functions. The team ensures that technical solutions align with SILAC's long-term enterprise architecture strategy while maintaining high standards for reliability, security, and scalability. Through close collaboration with engineering teams and business stakeholders, the department enables incremental development and operational excellence to support SILAC's rapid growth and innovation objectives.
Job Overview: The DevOps Manager is responsible for leading and scaling SILAC's DevOps function, ensuring operational reliability (Run), advancing automation and CI/CD maturity (Grow), and developing a high-performing technical team (Team). This working manager role provides hands-on leadership across cloud and on-prem infrastructure, release governance, monitoring, and incident response. The position drives system stability, scalability, automation, and AI-enabled efficiencies to support SILAC's rapid business growth and enterprise architecture goals.
Job Details
What you'll do:
Operational Excellence & Reliability
-
Ensure system stability, uptime, and performance across Azure cloud and on-prem environments.
-
Establish and maintain robust monitoring, alerting, and observability frameworks.
-
Enforce incident response procedures, including on-call rotation policies.
-
Ensure immediate reporting, escalation, and resolution of production incidents.
Release Governance & Stability
-
Oversee production release execution ensuring minimal disruption and high reliability.
-
Enforce structured release management processes and proper code release gating.
-
Ensure thorough Root Cause Analysis (RCA) documentation for incidents.
-
Implement preventative measures to reduce recurring production issues.
CI/CD & Infrastructure Evolution
-
Evolve TeamCity and the full CI/CD ecosystem toward zero-touch deployments.
-
Expand Docker, Docker Swarm, and Kubernetes containerization capabilities.
-
Strengthen build governance, automated testing integration (Cypress), and release quality controls.
-
Enhance infrastructure scalability to support rapid system growth.
AI Enablement & Automation Expansion
-
Identify and implement AI-driven efficiencies in monitoring, governance, and operational workflows.
-
Leverage automation to improve deployment speed, system reliability, and engineering productivity.
-
Establish governance frameworks for secure and responsible AI adoption.
Leadership & Team Development
-
Act as a working manager leading a small, high-performing DevOps team.
-
Mentor engineers and build scalable processes, documentation standards, and governance practices.
-
Collaborate closely with Software Engineering, QA, Security, and Infrastructure teams.
-
Support strategic scalability initiatives aligned with enterprise growth objectives.
Job Requirements
Key Competencies:
-
Problem-solving and analytical thinking
-
Strong communication skills
-
Attention to detail with a focus on data driven decision making
-
Ability to adapt to changing technical requirements
Required
-
Bachelor's degree in Computer Science, Information Technology, Engineering, or related field (or equivalent experience).
-
7+ years of experience in DevOps, Infrastructure Engineering, or Cloud Engineering.
-
3+ years of leadership or team lead experience.
-
Strong hands-on expertise with Azure Cloud and on-prem VMware.
-
Experience with Docker, Docker Swarm, Kubernetes.
-
Experience designing and governing full CI/CD pipelines (TeamCity preferred).
-
Experience with C#, Python, SQL Server environments.
-
Strong background in monitoring, observability, and incident response governance.
-
Proven experience building documentation standards and release governance processes.
-
Excellent communication and cross-functional collaboration skills.
Desired
-
Experience in the Annuity, Insurance, or Healthcare industry.
-
Experience scaling infrastructure in high-growth environments.
-
Experience integrating AI capabilities into operational workflows.
-
Certifications in Azure, Kubernetes, or DevOps disciplines.