Qureos

FIND_THE_RIGHTJOB.

Senior DDI Infrastructure Engineer

JOB_REQUIREMENTS

Hires in

Not specified

Employment Type

Not specified

Company Location

Not specified

Salary

Not specified

Senior DDI Infrastructure Engineer

Senior DDI Infrastructure Engineer


This role within the Core Services Infrastructure team focuses on managing and evolving a global DNS, DHCP, and IP Address Management
(DDI) platform. The team operates across multiple regions, supporting a global infrastructure that demands high availability, scalability, and reliability. Our
mission is to deliver best-in-class infrastructure services while minimizing operational overhead through extensive automation and self-service capabilities.
We are seeking an experienced engineer to lead the team, drive automation efforts, and enhance the scalability, efficiency, and resiliency of our services.


Key Responsibilities:

  • Design, implement, maintain, and troubleshoot global DNS, DHCP, and IP Address Management (DDI) platforms in a distributed, global

  • environment.

  • Lead automation initiatives to eliminate manual operational tasks, focusing on self-service tooling, CI/CD pipelines, and configuration-as-code

  • solutions.

  • Mentor junior team members and collaborate with global stakeholders on engineering, automation, and remediation projects.

  • Develop Python 3-based automation tools, REST APIs, and monitoring/health-check frameworks to optimize service delivery.

  • Create and maintain technical documentation, including processes, procedures, project requirements, and architectural designs.

  • Ensure scalability and high availability of DDI services to meet the demands of a global infrastructure.

  • Participate in the team’s on-call support rotation to ensure 24/7 service reliability.

Technical Expertise:

  • Deep knowledge of DNS (BIND), DHCP (ISC), and IP Address Management, including protocol internals, RFC standards, and DNSSEC.

  • 10+ years of experience with Linux systems administration, web hosting technologies, and domain registration.

  • Advanced scripting and automation skills (Python 3, Terraform, Ansible, Groovy, REST APIs, FastAPI, Django, JSON, YAML).

  • Proven experience in automating infrastructure operations, including self-service tooling, CI/CD pipelines, and configuration-as-code.

  • Experience with Python module development, unit testing, and managing virtual environments (e.g., venv, pipenv, poetry).

  • Proficiency in managing cloud-based services (e.g., GCP DNS/Logging, AWS Route53).

  • Expertise in containerization and orchestration technologies (Kubernetes, Docker, Podman).

  • Proficiency in source control and CI/CD pipelines (Git, GitHub/GitLab, Jenkins, GitFlow).

  • Familiarity with observability and monitoring frameworks, including OpenTelemetry, Fluentd, and Fluent Bit.

  • Experience with ServiceNow self-service forms and automation workflows.

  • Strong understanding of scaling infrastructure for high availability and performance in global environments.

  • Proficiency in leveraging AI/LLMs (e.g., ChatGPT, Claude, Copilot) to enhance productivity.

Soft Skills:

  • Strong interpersonal and communication skills with a collaborative mindset to work effectively in a global team.

  • Self-motivated and accountable, with a proactive approach to problem-solving.

  • Commitment to continuous learning and mentoring team members.

  • Preferred Skills (Nice-to-Have)

  • Experience with BlueCat Networks DDI platforms in global environments.

  • Knowledge of Anycast-based DNS services.

  • Networking expertise (OSI model, VLANs, routing, packet analysis).

  • Advanced Linux skills (Shell scripting, troubleshooting, Podman containers).

  • Familiarity with network automation frameworks (e.g., netmiko, nornir, nautobot).

  • Experience with monitoring tools (SNMP, Prometheus, Grafana).

  • Familiarity with MCP servers (FastMCP)

  • Linux configuration management tools (e.g., Chef, SaltStack).

What We Offer:

  • The opportunity to work with a highly skilled, globally distributed team on cutting-edge infrastructure technologies.

  • A culture that prioritizes automation, innovation, and continuous improvement.

  • A collaborative environment where your contributions directly impact the scalability and reliability of critical global services.

Key Highlights:

  • Global Team: Collaborate with team members and stakeholders across multiple regions to support a truly global infrastructure.

  • Automation-First Approach: Drive initiatives to automate manual processes, reduce operational overhead, and deliver self-service capabilities.

  • Scalability and Resiliency: Design and maintain systems that scale to meet the demands of a global enterprise while ensuring high availability.

© 2025 Qureos. All rights reserved.