Monitoring & Tools L3 Administrator / Engineer (Dynatrace, Splunk, SolarWinds, Nagios)
Role Overview
The Tools L3 Administrator is responsible for advanced management, troubleshooting, and optimization of enterprise monitoring and observability platforms. This includes application performance monitoring (APM), infrastructure monitoring, log analytics, and alerting systems. The role requires deep expertise in tools such as Dynatrace, Splunk, SolarWinds, Nagios, and the ability to resolve complex issues independently while ensuring proactive monitoring and service reliability.
Key Responsibilities
- Provide L3 support for escalated incidents related to monitoring and observability tools.
- Manage and maintain Dynatrace, Splunk, SolarWinds, Nagios platforms for enterprise environments.
- Configure and optimize dashboards, alerts, and reports to ensure proactive monitoring.
- Perform root cause analysis using log analytics and APM tools.
- Implement performance tuning, capacity planning, and monitoring automation.
- Integrate monitoring tools with ITSM platforms (ServiceNow, Remedy) for incident workflows.
- Automate repetitive tasks using PowerShell, Python, or Ansible.
- Collaborate with infrastructure, application, and security teams to ensure end‑to‑end visibility.
- Lead critical incident investigations and document resolutions.
- Maintain knowledge base and documentation for configurations, processes, and troubleshooting guides.
Required Skills & Experience
- 7–12 years of experience in enterprise monitoring and observability with strong L3 expertise.
- Hands‑on experience with Dynatrace (APM), Splunk (log analytics), SolarWinds (network monitoring), Nagios (infrastructure monitoring).
- Strong knowledge of monitoring architecture, integrations, and scaling strategies.
- Expertise in alerting, dashboards, and reporting for proactive incident detection.
- Experience with cloud monitoring (Azure Monitor, AWS CloudWatch, GCP Operations Suite).
- Familiarity with DevOps pipelines and CI/CD monitoring integrations.
- Proficiency in scripting and automation (Python, PowerShell, Ansible).
- Solid understanding of networking, servers, and application performance metrics.
- Ability to lead critical incident resolution and mentor junior administrators.
Preferred Qualifications
- Certifications: Splunk Certified Admin/Architect, Dynatrace Professional, SolarWinds Certified Professional, Nagios Certified Expert.
- Experience with observability stacks (ELK/EFK, Prometheus, Grafana).
- Exposure to AIOps platforms for predictive monitoring.
- Knowledge of ITIL processes for incident, problem, and change management.
Your future duties and responsibilities
Required qualifications to be successful in this role
Together, as owners, let’s turn meaningful insights into action.
Life at CGI is rooted in ownership, teamwork, respect and belonging. Here, you’ll reach your full potential because…
You are invited to be an owner from day 1 as we work together to bring our Dream to life. That’s why we call ourselves CGI Partners rather than employees. We benefit from our collective success and actively shape our company’s strategy and direction.
Your work creates value. You’ll develop innovative solutions and build relationships with teammates and clients while accessing global capabilities to scale your ideas, embrace new opportunities, and benefit from expansive industry and technology expertise.
You’ll shape your career by joining a company built to grow and last. You’ll be supported by leaders who care about your health and well-being and provide you with opportunities to deepen your skills and broaden your horizons.
Come join our team—one of the largest IT and business consulting services firms in the world.