Job Brief:
Would be responsible for designing, implementing, maintaining, and optimizing the organization’s core IT infrastructure systems. The role ensures high availability, security, scalability, and performance of infrastructure services in support of business operations. This role involves managing teams, developing infrastructure strategies, and collaborating with various stakeholders to deliver efficient and effective IT services.
Job Responsibilities:
-
Designing, implementing, and maintaining infrastructure solutions, including hardware, networking, virtualization, cloud environments, and security systems.
-
Leading and mentoring a team of infrastructure engineers, providing technical guidance, and fostering a collaborative environment.
-
Monitor system performance, identify issues, and implement solutions to improve reliability and performance. Set up alerts and notifications for proactive response.
-
Monitoring and optimizing infrastructure performance, identifying bottlenecks, and implementing solutions to enhance efficiency and reliability.
-
Participates in incident response and post-mortem analysis to ensure continuous system improvement and resilience.
-
Execute system upgrades, patching, and lifecycle management in alignment with change management practices.
-
Conduct root cause analysis of incidents and implement preventive measures.
-
Leading incident response efforts, troubleshooting complex issues, and ensuring timely resolution of infrastructure-related problems.
-
Supports disaster recovery and business continuity plans, minimizing downtime and maintaining operations.
-
Supports Automation Tools & Practices: Applies tools like Ansible, Terraform, to automate infrastructure provisioning, configuration, and management.
-
Cloud & System Implementation: Supports cloud-based and on-premises infrastructure, ensuring optimized performance, scalability, and security.
-
Support Configuration & Patching Management: Uses tools like Ansible, Terraform, to automate infrastructure provisioning, configuration management, and patching.
-
Collaborating with other IT teams, such as development, operations, and security, to ensure seamless integration and support of infrastructure services.
-
Collaborates with development and operations teams to integrate best practices in reliability into the infrastructure operations.
-
Participate in on-call rotations to provide 24/7 support for critical systems
-
Maintaining comprehensive documentation of infrastructure configurations, procedures, and best practices, and sharing knowledge with the team.
Qualification and Experience:
-
Bachelor’s degree in computer science, Information Technology, or a related field.
-
Minimum of 6-8 years of Management experience.
-
Excellent verbal and written communication skills.
-
Proven effective management skills.
-
Proficient with Microsoft Office Suite or related software.
-
Strong presentation skills.