Role Summary
We are seeking a proactive and technically skilled
Python Application Support Engineer
to join our Technical Operations team. This role is crucial for ensuring the stability and reliability of our mission-critical, Python-based applications. You will be responsible for timely incident resolution, deep-dive troubleshooting, implementing permanent fixes, and driving operational efficiency through automation.
🔑 Key Responsibilities
Technical Troubleshooting & Incident Management
-
Incident Resolution: Serve as the primary point of contact for complex Level 2 and Level 3 production incidents, diagnosing root causes and resolving issues across our Python application stack.
-
Deep-Dive Analysis: Utilize log analysis tools (e.g., Splunk, ELK Stack) and monitoring platforms (e.g., Prometheus, Grafana) to quickly identify and address anomalies in application behavior.
-
Code Debugging: Analyze, debug, and fix application issues directly within the Python codebase, including Flask/Django services, worker queues, and custom scripts.
-
Database Health: Troubleshoot performance issues and conduct basic SQL/NoSQL query tuning and health checks (e.g., for PostgreSQL, MongoDB, or Redis).
Operational Excellence & Automation
-
Monitoring & Alerting: Continuously refine and optimize application monitoring, alerting, and logging configurations to improve mean time to detect (MTTD) and mean time to resolve (MTTR).
-
Python Automation: Develop, maintain, and enhance automated scripts (primarily in Python) to streamline routine operational tasks, reporting, health checks, and system recovery processes.
-
Documentation: Create and maintain comprehensive documentation, runbooks, and knowledge base articles for application support procedures and recurring issues.
Collaboration & Prevention
-
Cross-Functional Fixes: Collaborate closely with the Development and DevOps teams to provide clear technical feedback on recurring issues and implement permanent, scalable solutions.
-
Proactive Maintenance: Identify potential system bottlenecks, performance degradation points, and areas prone to failure, recommending and implementing preventative measures.
⚙️ Required Qualifications
-
Experience: 3 to 5 years of professional experience in Application Support, Production Support, Site Reliability Engineering (SRE), or a similar technical role.
-
Python Expertise (Mandatory): Strong hands-on experience with Python scripting and programming, including the ability to read, debug, and modify application code.
-
Operating Systems: Proficient working knowledge of Linux/Unix environments and shell scripting.
-
Databases: Solid experience with relational (e.g., PostgreSQL, MySQL) and/or NoSQL (e.g., MongoDB, Redis) databases, focusing on query analysis and performance.
Skills:- Python, Customer Success and Client Servicing