As a pragmatic problem solver across a wide range of Data Center environments and systems, you understand which issues to escalate to the appropriate resolver groups. You will proactively monitor the customer environment by checking system error logs, monitoring ticket queues, and consulting with other groups involved in maintaining the environments. You will make recommendations to resolve system errors and performance issues, and take corrective action. You create and maintain documentation on the technologies you support. You understand all aspects of the equipment you support. You know how to innovate and make decisions on your own, but also know how to take direction when it is given, paying attention to all details involved. You are expected to improve current processes and introduce automation with the aim of simplification.
You are able to execute small projects on your own and work with your manager in planning and executing larger local projects. Ideally we are seeking someone from a reputable Cloud provider, and we are also open to receiving applications from Tier 1 Data Center colocation providers. This would suit an individual who is able to add significant value based on existing experience whilst evolving their career upstream into Cloud Services.
**An Active TS/SCI Clearance is required for this position**
Ideal candidates would be required to:
- Possess an active TS/SCI security clearance
- Understand the design and functionality of the Data Centers within your assigned Region
- Provide audits for power and mechanical capacity or upgrades.
- Work with internal teams to troubleshoot problems and conduct Root Cause Analysis (RCA) and Corrective Action (CA) for design-related problems.
- Work with local colocation companies to understand and coordinate site utility requirements
- Provide after-hours support as needed
- Work with project teams/colocation partners to properly test and validate installation, operation, and performance of electrical/mechanical systems.
- Support Operations, including failure mode and root cause analysis, maintenance and troubleshooting support, best practices, maintenance initiatives, and operating procedure review
- Maintain all technical documentation regarding corporate data centers, including operating procedures.
- Work with Regional leaders and other business leaders to manage projects, optimize performance, and improve the reliability and efficiency of the electrical and mechanical infrastructure systems in colocation, leased, and owned data centers.
- Participate in operational reviews to collect and analyze technical data to identify and resolve existing reliability and availability concerns.
- Serve as a Subject Matter Expert resource to identify and resolve resiliency, reliability, and availability risks globally.
- Oversee the issue intake, evaluation, and resolution process for reviewing build issues at colocation, leased, and owned data centers, with a focus on providing quality improvement recommendations.
- Interface with internal data center design teams, server hardware teams, and environmental health and safety teams to promote standards that maintain consistency and reliability in services delivered
- Be recognized as the technical expert within the group as well as within other teams.
- Be positive and always offer creative, out-of-the-box solutions.