Qureos

Director, Cloud Site Operations

San Francisco, United States

Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About This Role:

Crusoe is building a clean cloud for AI and high-performance computing. As we expand our global footprint of GPU-optimized data centers, we are seeking a Director of Cloud Site Operations to lead day-to-day operations across our domestic and international sites.

This leader will ensure that Crusoe’s cloud platform—powered by 100% clean energy—operates at the highest standards of availability, efficiency, and sustainability. The Director will oversee distributed teams, enforce operational discipline, and drive innovation in how we run and scale next-generation AI data centers.

What You’ll Be Working On:

Operational Leadership

  • Lead 24/7 operations across Crusoe’s global fleet of GPU-focused cloud data centers.

  • Ensure world-class uptime, performance, and resiliency while maintaining sustainability goals.

  • Standardize operational playbooks and enforce best practices for safety, security, and compliance.

  • Drive continuous improvement in efficiency (MTTR, PUE, MW utilization, NRC/OpEx per MW).

Site Management & Readiness

  • Manage hardware uptime and operational readiness for large-scale GPU clusters (H200, B200, GB200, MI300X, MI355X, GB300, etc.).

  • Ensure observability into performance and readiness across diverse geographies (U.S., Europe, Asia).

Team Leadership & Development

  • Lead and develop a distributed global team of site operations managers, engineers, and technicians.

  • Build a safety-first culture focused on reliability, execution, and accountability.

  • Implement scalable staffing and shift models to support rapid growth and international operations.

Vendor & Partner Management

  • Manage strategic relationships with colocation partners, OEMs, and service providers.

  • Ensure SLAs are exceeded while balancing cost, quality, and sustainability.

  • Partner closely with engineering, capacity planning, and product teams to align operational readiness with business growth.

Risk, Compliance & Security

  • Ensure global adherence to compliance frameworks (ISO, SOC, Uptime Institute, ASHRAE, etc.).

  • Oversee physical and operational security, incident response, and root cause analysis.

  • Maintain operational excellence in high-density, liquid-cooled GPU environments.

Executive Reporting & Strategy

  • Provide leadership updates on global site performance, capacity growth, and incident management.

  • Contribute to long-term site strategy, expansion roadmaps, and scaling models to support 300k+ GPU growth.

  • Serve as a thought leader for sustainable AI infrastructure, ensuring Crusoe remains at the forefront of clean compute.

What You’ll Bring to the Team:

  • 10+ years of experience in data center or cloud infrastructure operations, including 5+ years in senior leadership.

  • Proven success managing global, multi-site operations for cloud or hyperscale environments.

  • Deep knowledge of critical power and cooling systems, including liquid cooling for high-density GPU clusters.

  • Experience building and scaling global teams in high-growth, mission-critical environments.

  • Strong executive communication and cross-functional leadership skills.

  • Willingness to travel internationally (25–40%).

Preferred Experience:

  • Background in cloud service providers, hyperscalers, or large-scale colocation environments.

  • Experience with GPU/AI workloads and HPC-optimized facilities.

  • Familiarity with clean energy integration (geothermal, hydro, solar + storage) in data center operations.

  • Expertise in incident management, root cause analysis, and building resilient systems at scale.

Benefits:

  • Industry-competitive pay

  • Restricted Stock Units in a fast-growing, well-funded technology company

  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents

  • Employer contributions to HSA accounts

  • Paid Parental Leave

  • Paid life insurance, short-term and long-term disability

  • Teladoc

  • 401(k) with a 100% match up to 4% of salary

  • Generous paid time off and holiday schedule

  • Cell phone reimbursement

  • Tuition reimbursement

  • Subscription to the Calm app

  • MetLife Legal

  • Company-paid commuter benefit: $300 per month

Compensation Range

Compensation will be paid in the range of $206,000 – $258,000. Restricted Stock Units are included in all offers. Compensation is determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.

© 2025 Qureos. All rights reserved.