Find The RightJob.
SRE Senior Specialist Job Description
Location Richmond VA
Summary
The SRE Senior Specilaist will set the direction and operating model for a Site Reliability Engineering function supporting a hybrid estate Azure and onprem with a primary focus on Azurehosted applications over the next 12 months The role partners closely with application owners and platform teams to define reliability targets SLAsNFRs translate them into measurable SLIsSLOs and drive proactive improvements that reduce toil and improve customer experience
This position owns SRE strategy governance and outcomes error budget policy reliability reviews incident learning observability standards Datadog and infrastructure automation Terraform with GitHub Actions Azure DevOps for delivery workflows The lead will coach engineers coordinate crossteam initiatives and report reliability and cost posture to stakeholders
Roles Responsibilities
Define and own the SRE roadmap for Azurehosted and hybrid services aligning reliability goals with business priorities
Engage application owners to elicit SLAs NFRs and risk assumptions facilitate SLISLO definition and operational readiness reviews
Establish error budget policies and reliability governance SLO compliance burn rate s release criteria
Drive observability standards using Datadog golden signals distributed tracing logmetric conventions and actionable ing
Design and publish executive dashboards for availability latency error rate saturation error budgets and cost tracking
Lead incident learning ensure consistent RCA practice postincident reviews correctivepreventive action tracking and knowledge sharing
Partner with IncidentProblem and Change teams to improve MTTR reduce repeat incidents and lower change failure rate
Champion automation to reduce toil selfhealing runbooks and BAU automation PowerShell where appropriate
Oversee IaC standards and reviews Terraform modules GitHub Actions pipelines ensure securebydesign and policy compliance
Define SRE engagement model intake prioritization oncall expectations and mentor SRE engineers
Perform capacity and performance planning for key Azure services guide resilience patterns multizoneregion where needed
Contribute to DRBCP strategy and testing cadence for critical services
Measure and communicate reliability and cost outcomes provide continuous improvement recommendations
MustHave Technical Skills
12 years in SREDevOpsOperations engineering with leadershipownership of reliability outcomes
Deep handson Azure experience IaaSPaaS including networking identity compute storage and monitoring concepts
Good understanding of Azure services like App Insights Log Analytics Azure Monitor Azure Apps Services Azure Functions Azure Front Door Cosmos DB Azure SQL Key vault API Management Azure DNS VnetSubNet NSG AKS
Strong SRE fundamentals SLAsNFRs SLIsSLOs error budgets toil reduction incident command and postmortems
Datadog expertise dashboards monitors APM logs traces with ing best practices
Infrastructure as Code using Terraform module design state management environment promotion
CICD automation using GitHub Actions and familiarity integrating with Azure services
Working knowledge of Azure DevOps boardspipelinesreleases and enterprise delivery governance
Proficiency in PowerShell for automation and operational tooling scripting best practices
Experience supporting NET applications IISKestrel hosting Windows services basic performance troubleshooting
Strong troubleshooting skills across app OS network and cloud dependencies
Ability to communicate with technical and nontechnical stakeholders drive crossteam alignment
GoodtoHave Technical Skills
Experience designing DRHA architectures multiregion patterns failover testing
Knowledge of securitycompliance controls in cloud RBAC Key Vault policy as code
Experience implementing or improving ITIL processes in partnership with ServiceNow teams
Exposure to chaos engineering load testing and performance engineering methodologies
Experience creating reusable Terraform module registries and policy
Experience with FinOps practices Azure cost management chargebackshowback cost optimization patterns
Mandatory Skills : System Reliability Strategy Design
Benefits/perks listed below may vary depending on the nature of your employment with LTIMindtree (“LTIM”):
Benefits and Perks:
The range displayed on each job posting reflects the minimum and maximum salary target for the position across all US locations. Within the range, individual pay is determined by work location and job level and additional factors including job-related skills, experience, and relevant education or training. Depending on the position offered, other forms of compensation may be provided as part of overall compensation like an annual performance-based bonus, sales incentive pay and other forms of bonus or variable compensation.
Disclaimer: The compensation and benefits information provided herein is accurate as of the date of this posting. LTIMindtree is an equal opportunity employer that is committed to diversity in the workplace. Our employment decisions are made without regard to race, color, creed, religion, sex (including pregnancy, childbirth or related medical conditions), gender identity or expression, national origin, ancestry, age, family-care status, veteran status, marital status, civil union status, domestic partnership status, military service, handicap or disability or history of handicap or disability, genetic information, atypical hereditary cellular or blood trait, union affiliation, affectional or sexual orientation or preference, or any other characteristic protected by applicable federal, state, or local law, except where such considerations are bona fide occupational qualifications permitted by law.Similar jobs
Plaid
New York, United States
5 days ago
Ethan Allen
Garden City, United States
5 days ago
CHLOETA
Oklahoma City, United States
5 days ago
WM
Waikoloa Village, United States
5 days ago
WM
Hillsboro, United States
5 days ago
Amazon.com
Jessup, United States
5 days ago
Steampunk
McLean, United States
5 days ago
© 2026 Qureos. All rights reserved.