Lead the installation, maintenance, upgrading, and configuration of IT infrastructure and other IT-related projects as determined by business need across a multi-site environment
Identify and solve complex systemic issues spanning multiple systems and teams. This may include designing systems or services from ground up
Partner with internal service owners and teams to evaluate technical data, create recommendations, obtain consensus, plan and execute service upgrades and changes
Maintain appropriate documentation, including drawings, configurations, settings, and recovery plans
Work directly with the leadership to define a long-term infrastructure support strategy focused on cost optimization and end-user needs
Provide Windows, Linux operating system administration, including logging solutions, OS building/configuration and script writing.
Monitoring and maintaining network servers such as file servers, VPN gateways and intrusion detection systems
Work with internal and external stakeholders to troubleshoot and resolve application issues across complex enterprise and local environments
Influence and enforce IT related security policies and controls by following defined procedures and standards
Research and evaluate current and emerging technologies and stay informed of new technologies and solutions that increase productivity, innovation, and business capabilities
Provide administration of backup/recovery processes and procedures
Participating in a cross-platform Site Reliability team to build and maintain tools, solutions and microservices associated with deployment and our operations platform, ensuring that all meet our customer service standards and reduce errors
Test our system integrity, implemented designs, application developments and other processes related to software defined infrastructure, making improvements as needed
Deploy product updates and patches as required while implementing integrations when they arise
Experience with scripting languages (Javascript, Python, Powershell)
Experience with version control (Git, SVN)
Knowledge of Infrastructure as Code tools (Terraform, Ansible)
Ability to talk to both customers and other IT professionals and adapt to their technical knowledge.
Availability to work occasional nights, evenings, and weekends as assigned.
Availability and ability to provide 24/7 on call support, as scheduled.
Demonstrated ability to mentor junior engineers and work with remote teams
Demonstrated ability to design and lead development of best practices and creation of standard operating procedures.
Deep understanding of Virtualization technologies on VMWare hypervisor
Knowledge and experience supporting NAS/SAN Storage arrays
Well versed in Business Continuity & Disaster Recovery methodologies and implementations