Qureos

Find The RightJob.

Senior Specialist - Architecture

Role description


SRE Senior Specialist Job Description

Location Richmond VA

Summary

The SRE Senior Specilaist will set the direction and operating model for a Site Reliability Engineering function supporting a hybrid estate Azure and onprem with a primary focus on Azurehosted applications over the next 12 months The role partners closely with application owners and platform teams to define reliability targets SLAsNFRs translate them into measurable SLIsSLOs and drive proactive improvements that reduce toil and improve customer experience

This position owns SRE strategy governance and outcomes error budget policy reliability reviews incident learning observability standards Datadog and infrastructure automation Terraform with GitHub Actions Azure DevOps for delivery workflows The lead will coach engineers coordinate crossteam initiatives and report reliability and cost posture to stakeholders

Roles Responsibilities

Define and own the SRE roadmap for Azurehosted and hybrid services aligning reliability goals with business priorities

Engage application owners to elicit SLAs NFRs and risk assumptions facilitate SLISLO definition and operational readiness reviews

Establish error budget policies and reliability governance SLO compliance burn rate s release criteria

Drive observability standards using Datadog golden signals distributed tracing logmetric conventions and actionable ing

Design and publish executive dashboards for availability latency error rate saturation error budgets and cost tracking

Lead incident learning ensure consistent RCA practice postincident reviews correctivepreventive action tracking and knowledge sharing

Partner with IncidentProblem and Change teams to improve MTTR reduce repeat incidents and lower change failure rate

Champion automation to reduce toil selfhealing runbooks and BAU automation PowerShell where appropriate

Oversee IaC standards and reviews Terraform modules GitHub Actions pipelines ensure securebydesign and policy compliance

Define SRE engagement model intake prioritization oncall expectations and mentor SRE engineers

Perform capacity and performance planning for key Azure services guide resilience patterns multizoneregion where needed

Contribute to DRBCP strategy and testing cadence for critical services

Measure and communicate reliability and cost outcomes provide continuous improvement recommendations

MustHave Technical Skills

12 years in SREDevOpsOperations engineering with leadershipownership of reliability outcomes

Deep handson Azure experience IaaSPaaS including networking identity compute storage and monitoring concepts

Good understanding of Azure services like App Insights Log Analytics Azure Monitor Azure Apps Services Azure Functions Azure Front Door Cosmos DB Azure SQL Key vault API Management Azure DNS VnetSubNet NSG AKS

Strong SRE fundamentals SLAsNFRs SLIsSLOs error budgets toil reduction incident command and postmortems

Datadog expertise dashboards monitors APM logs traces with ing best practices

Infrastructure as Code using Terraform module design state management environment promotion

CICD automation using GitHub Actions and familiarity integrating with Azure services

Working knowledge of Azure DevOps boardspipelinesreleases and enterprise delivery governance

Proficiency in PowerShell for automation and operational tooling scripting best practices

Experience supporting NET applications IISKestrel hosting Windows services basic performance troubleshooting

Strong troubleshooting skills across app OS network and cloud dependencies

Ability to communicate with technical and nontechnical stakeholders drive crossteam alignment

GoodtoHave Technical Skills

Experience designing DRHA architectures multiregion patterns failover testing

Knowledge of securitycompliance controls in cloud RBAC Key Vault policy as code

Experience implementing or improving ITIL processes in partnership with ServiceNow teams

Exposure to chaos engineering load testing and performance engineering methodologies

Experience creating reusable Terraform module registries and policy

Experience with FinOps practices Azure cost management chargebackshowback cost optimization patterns


Skills


Mandatory Skills :
System Reliability Strategy Design


Other details


Benefits/perks listed below may vary depending on the nature of your employment with LTIMindtree (“LTIM”):

Benefits and Perks:

  • Comprehensive Medical Plan Covering Medical, Dental, Vision
  • Short Term and Long-Term Disability Coverage
  • 401(k) Plan with Company match
  • Life Insurance
  • Vacation Time, Sick Leave, Paid Holidays
  • Paid Paternity and Maternity Leave

The range displayed on each job posting reflects the minimum and maximum salary target for the position across all US locations. Within the range, individual pay is determined by work location and job level and additional factors including job-related skills, experience, and relevant education or training. Depending on the position offered, other forms of compensation may be provided as part of overall compensation like an annual performance-based bonus, sales incentive pay and other forms of bonus or variable compensation.

Disclaimer: The compensation and benefits information provided herein is accurate as of the date of this posting.

LTIMindtree is an equal opportunity employer that is committed to diversity in the workplace. Our employment decisions are made without regard to race, color, creed, religion, sex (including pregnancy, childbirth or related medical conditions), gender identity or expression, national origin, ancestry, age, family-care status, veteran status, marital status, civil union status, domestic partnership status, military service, handicap or disability or history of handicap or disability, genetic information, atypical hereditary cellular or blood trait, union affiliation, affectional or sexual orientation or preference, or any other characteristic protected by applicable federal, state, or local law, except where such considerations are bona fide occupational qualifications permitted by law.


Benefits

Compensation range: $57,409.00 to $114,602.00 per year
About LTM
LTM is an AI-centric global technology services company and the Business Creativity partner to the world’s largest and most disruptive enterprises. We bring human insights and intelligent systems together to help clients create greater value at the intersection of technology and domain expertise. Our capabilities span integrated operations, transformation, and business AI — enabling new ways of working, new productivity paradigms, and new roads to value. Together with over 87,000 employees across 40 countries and our global network of partners, LTM — a Larsen & Toubro company — owns business outcomes for our clients, helping them not just outperform the market, but to Outcreate it. Please also note that neither LTM nor any of its authorized recruitment agencies/partners charge any candidate registration fee or any other fees from talent (candidates) towards appearing for an interview or securing employment/internship. Candidates shall be solely responsible for verifying the credentials of any agency/consultant that claims to be working with LTM for recruitment. Please note that anyone who relies on the representations made by fraudulent employment agencies does so at their own risk, and LTM disclaims any liability in case of loss or damage suffered as a consequence of the same. Recruitment Fraud Alert - https://www.ltimindtree.com/recruitment-fraud-alert/

© 2026 Qureos. All rights reserved.