HPC/AI Data Center Technician
- Department: Data Center Operations
- Reports to: Data Center Operations Manager / Site Lead
- Location: On-site (site-specific)
- Job Type: Full-time
- Schedule: 12-hour shifts, including nights, weekends, holidays, and on-call rotation
- Pay: $40.00 – $45.00 per hour, depending on experience and skills
- Overtime: Paid at 1.5× regular rate for hours worked beyond 40/week, in accordance with FLSA
- Travel: 25–50% for multi-site support, as required
- About the Role
- We are hiring an experienced, hands-on Data Center Technician to support the day-to-day deployment and operation of HPC / AI / GPU infrastructure inside our facilities.
- This is a floor-level operations role, not a design or engineering position. You will spend most of your shift on the data center floor — racking equipment, running and terminating cables, troubleshooting hardware, swapping components, and supporting liquid-cooling and power infrastructure. We need someone with real, verifiable, multi-year hands-on experience who can show up on day one and work independently with minimal supervision.
- You will report to the site lead and serve as the on-site Tier 2 escalation point for network, cabling, and hardware issues. You will work closely with remote NetOps and customer engineering teams via ticketing systems.
- Key Responsibilities
- Hardware Deployment & Rack and Stack
- Unbox, rack, stack, cable, and power on GPU servers, storage arrays, network switches, routers, firewalls, CDUs, and PDUs per rack elevations, cable schedules, and BOMs
- Validate physical placement, serial numbers, and asset tags
- Configure basic OS settings and out-of-band management (BMC / iLO / iDRAC)
- Perform firmware updates and component replacements (CPU, memory, GPUs, NICs, HBAs, drives, power supplies)
- Structured Cabling & Fiber Infrastructure
- Install, terminate, label, and dress copper (Cat6 / 6A) and fiber (SM / MM) cabling
- Work with 100G / 400G / 800G transceivers, AOC / DAC, and MPO / MTP assemblies
- Use VFL, optical power meter, OTDR, and fiber scope for fault location, cleaning, and repair
- Verify LLDP neighbors, link status, and optical power levels
- Follow TIA-942 / BICSI standards and maintain cable management discipline
- Network Hardware Support
- Deploy and maintain Arista, Juniper, Cisco, and SONiC switches and routers
- Provide Tier 2 physical-layer troubleshooting and post-repair validation
- Assist Network Operations and automation teams with on-site diagnostic tasks
- Support Tier 1 technicians on network triage
- Power & Liquid Cooling
- Install, commission, and maintain Direct Liquid Cooling (DLC) systems, CDUs, heat exchangers, manifolds, piping, pumps, and valves
- Perform pressure testing, leak detection, water-quality monitoring, flow validation, and sensor calibration
- Support chilled-water / HVAC systems and PDU / UPS / RPP power infrastructure
- Read and interpret P&ID drawings and mechanical schematics
- Strictly follow OSHA, LOTO, and hot-work safety procedures
- GPU / AI Cluster Operations
- Support deployment, burn-in, and ongoing operations of high-density GPU clusters (H100 / H200 / B200 / GB200 and similar platforms)
- Troubleshoot GPU, NIC (InfiniBand / RoCE), NVLink, and other interconnect hardware issues
- Execute node isolation and full-node replacement per customer SLA
- Break/Fix & Preventive Maintenance
- Respond to hardware alerts, diagnose root cause, replace components, process RMAs, and perform full-system swaps
- Execute scheduled preventive maintenance: cleaning, filter changes, health checks
- Participate in 24×7 on-call rotation, including nights, weekends, and emergency response
- Smart Hands & Remote Collaboration
- Act as on-site eyes and hands for remote engineering and customer teams
- Carry out instructions received through ServiceNow, Jira, or Zendesk tickets precisely as written
- Provide clear technical updates and escalation summaries
- Logistics, Inventory & Asset Management
- Manage receiving, inbound inspection, put-away, outbound fulfillment, cycle counts, and RMA returns
- Maintain accurate asset records, 5S warehouse standards, and ESD protection
- Documentation, Compliance & Safety
- Document all work in DCIM systems, runbooks, SOPs, and internal wikis
- Ensure full traceability through change-management processes
- Follow OSHA, ESD, PPE, and site security policies at all times
- Required Qualifications
- High School Diploma or GED
- 2+ years of hands-on data center operations experience
- 2+ years of server, storage, and network hardware installation experience
- 2+ years of structured and data-center cabling experience (copper + fiber)
- Working knowledge of Layer 2 / 3 networking concepts (OSI, TCP/IP, VLAN)
- Proficiency with Linux command-line basics
- Proficiency with ticketing systems (ServiceNow / Jira / Zendesk)
- Ability to read and interpret rack elevations, cable schedules, and mechanical drawings
- Clear English written and verbal communication
- Valid driver's license
- Legal authorization to work in the United States
- Preferred Qualifications
- Certifications
- CompTIA Network+ / Server+
- CCNA (or equivalent JNCIA / Arista ACE)
- BICSI Installer 1 / 2 or RCDD
- OSHA 10 / 30
- Vendor certifications (Dell, Lenovo, HPE, NVIDIA, Supermicro)
- Technical Experience
- HPC / GPU cluster deployment in hyperscale or AI environments
- 100G / 400G / 800G optics and OTDR-based fiber troubleshooting
- Direct Liquid Cooling (DLC), CDU, and chilled-water system installation
- DCIM platforms (Nlyte, Sunbird, Device42)
- Arista / Juniper / Cisco / SONiC CLI
- InfiniBand / RoCE / NVLink diagnostics
- Other
- Mandarin Chinese language skills are a plus for sites supporting Chinese-speaking customers or teams
- Physical & Work Environment Requirements
- Ability to lift 50–70 lbs (23–32 kg) independently and repeatedly
- Comfortable working on ladders, in confined spaces, and for extended periods standing, bending, and kneeling
- Comfortable working in loud, cold, high-airflow data center environments
- Willing to work 12-hour shifts, nights, weekends, holidays, and on-call rotations
- Compensation & Benefits
- Hourly Rate: $40.00 – $45.00 / hour
- Overtime: 1.5× regular rate beyond 40 hrs/week (FLSA)
- Shift Differential: Available for nights, weekends, and holidays
- Health: Medical, dental, vision, FSA / HSA
- Retirement: 401(k) with company match
- Time Off: Paid vacation, holidays, and sick leave
- Other: Long-term disability, EAP, training and certification reimbursement
- How to Apply
- Submit your resume directly through Indeed. Please make sure your resume clearly lists:
- Years of hands-on data center experience
- Specific equipment, vendors, and cable plant types you have worked with
- Any GPU cluster, liquid cooling, or high-speed optics experience
- Certifications held
Benefits:
- 401(k)
- Dental insurance
- Flexible schedule
- Health insurance
- Life insurance
- Paid time off
- Relocation assistance
- Retirement plan
- Vision insurance
Work Location: In person