Overview:
As an Infrastructure Engineer, you will empower teams by implementing and maintaining a robust, scalable infrastructure. You will play a pivotal role in ensuring the reliability and efficiency of our production and development environments, enabling smooth deployments, efficient workflows, and global connectivity.
Key Responsibilities
1. Infrastructure Management
-
Implement and maintain a global infrastructure.
-
Monitor performance and optimize resource utilization.
-
Troubleshoot issues and implement solutions to ensure high availability and stability.
2. Automation
-
Design, build, and optimize automation tools for infrastructure deployment.
-
Collaborate with development teams to address connectivity needs and pain points.
3. Developer Support
-
Provide technical guidance on infrastructure and connectivity components.
-
Develop self-service tools and documentation for developers to manage deployments.
-
Foster a culture of collaboration and continuous improvement within development and operations teams.
4. Production Support
-
Participate in incident response and resolution to minimize downtime.
-
Continuously analyze and improve performance, reliability, and security of production environments.
Qualifications
-
Infrastructure Expertise: Strong knowledge of networking, architecture, Linux, automation, and security; hands-on experience with large-scale infrastructure.
-
Infrastructure as Code (IaC): Experience with tools like Terraform or Ansible.
-
Cloud Knowledge: Familiarity with AWS, Azure, GCP, and cloud-native technologies.
-
Scripting & Automation: Proficiency in Bash, Python, or similar languages.
-
Troubleshooting: Ability to diagnose complex problems and implement effective solutions.
-
Collaboration & Communication: Strong interpersonal skills for cross-functional teamwork.
Bonus Skills
-
Firewall infrastructure knowledge (e.g., FortiGate).
-
Monitoring & observability tools (Prometheus, Cribl, Splunk).
-
Security best practices for containerized environments.
-
CI/CD pipeline design and implementation (GitLab or similar).
-
Proactive mindset and continuous improvement approach.
Job Description - Grade Specific
Responsible for the operations and maintenance of On Premise, Capgemini or client dedicated, computing platforms and servers, including:- system engineering and physical datacentre: Provides maintenance and support for all system in scope, monitors and executes corrective actions, installs, configures, and tests operating systems, troubleshoots and conducts incident resolution, liaise with other IT teams and 3d party vendors Develops and executes plans for patching, maintains security, backup, and redundancy strategies, Develops capabilities on emerging technologies, defines processes, conducts compliance and quality checks, and identifies opportunities for improvements and efficiencies- storage and backup: run storage and backup environment as per blueprint and specific accounts setup. drive continual service improvement actions