Role Summary
As a Level 2 Technical Support Engineer, you will be the technical backbone of our customer success operations. You will provide advanced troubleshooting and end-to-end incident ownership across a diverse portfolio of customer accounts. This role sits at the intersection of Support, DevOps, and Systems Administration, requiring a candidate who can navigate complex environments—from legacy Windows Servers to modern Kubernetes clusters—while maintaining strict adherence to SLAs and high customer satisfaction.
Key Responsibilities
-
Advanced Incident Management:
Own and resolve complex L2 technical incidents within agreed SLAs. Perform deep-dive root cause analysis across application, OS, network, and infrastructure layers.
-
System Reliability & Monitoring:
Proactively monitor system health, capacity, and performance across VM and Kubernetes environments. Maintain and improve Grafana/Prometheus dashboards and ELK/EFK logging stacks.
-
Database Operations:
Execute sophisticated SQL queries for data investigations and reporting. Safely run database migration scripts and perform data corrections in production environments.
-
Infrastructure & Maintenance:
Manage lifecycle operations including OS/Application patching, upgrades, and change control validation. Support CI/CD deployments and post-deployment health checks.
-
Service Request Fulfillment:
Handle critical administrative tasks such as user provisioning, complex account unlocks, and data-level fixes that require elevated permissions.
-
Knowledge Leadership:
Maintain high-quality technical documentation, runbooks, and internal knowledge base articles to shift-left common issues to L1 teams.
-
Stakeholder Communication:
Act as a technical liaison between customers, Account Managers, and Engineering teams, providing clear diagnostics and status updates for escalated issues.
Required Technical Skills
-
Experience:
4+ years in a high-pressure Technical Support, SRE, or Systems Administration role (L2+).
-
Operating Systems:
Expert-level Windows Server administration (AD, DNS, IIS, Group Policy) and strong proficiency in Linux environments.
-
Cloud & Orchestration:
Hands-on experience with
Google Cloud Platform (GCP)
and
Kubernetes
(managing pods, services, ingress, and troubleshooting via kubectl).
-
Networking:
Deep understanding of TCP/IP, HTTP/S, TLS/SSL certificates, and VPNs. Proficiency with tcpdump, Wireshark, and network diagnostic tools.
-
Databases:
Strong SQL skills with the ability to write complex joins and scripts (MS SQL Server preferred).
-
Middleware:
Experience managing and troubleshooting Message Queue systems, specifically
RabbitMQ
.
-
Automation:
Proficiency in
PowerShell
or
Bash
for automating repetitive tasks (Python is a significant advantage).
-
Web Technologies:
Solid understanding of REST APIs, HTTP status codes, and browser developer tools.
Professional Attributes
-
Problem Solver:
Ability to remain calm under pressure and methodically deconstruct complex technical failures.
-
Communicator:
Ability to translate "deep tech" issues into business-friendly updates for clients.
-
Adaptable:
Quick to learn proprietary domain logic and custom application architectures.
-
Availability:
Willingness to participate in an on-call rotation for after-hours maintenance or critical emergencies.
Preferred Qualifications (Nice to Have)
-
Developer Background:
Previous experience in software development is a major plus, as it aids in code-level debugging.
-
DevOps Tooling:
Familiarity with CI/CD pipelines (GitLab, Jenkins) and Infrastructure as Code.
-
Security:
Exposure to WAF, IDS/IPS, and security patching best practices.
-
Education:
Bachelor’s degree in Computer Science, Information Systems, or equivalent practical experience.