Job Overview
We are seeking an experienced Senior Linux HPC Engineer to support high-performance computing (HPC) clusters running advanced simulation applications in a secure, mission-critical environment. The role requires a highly skilled professional who can manage, optimize, and maintain Linux-based HPC infrastructure while ensuring system reliability, performance, and security.
Key Responsibilities
- Install, configure, and maintain Linux-based HPC clusters
- Support and optimize simulation and compute-intensive applications
- Monitor system performance and troubleshoot hardware/software issues
- Manage job schedulers and workload management systems (e.g., Slurm, PBS, or similar)
- Perform system tuning, performance optimization, and capacity planning
- Ensure system availability, stability, and security compliance
- Work closely with engineering and technical teams to support operational requirements
- Maintain documentation for system configurations, procedures, and troubleshooting
- Provide technical support and root-cause analysis for system incidents
Required Qualifications
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience)
- Strong hands-on experience with Linux system administration (Red Hat/CentOS/Ubuntu)
- Proven experience working with High Performance Computing (HPC) environments
- Experience supporting simulation or compute-intensive applications
- Knowledge of networking, storage systems, and distributed computing
- Experience with scripting (Bash, Python, or similar)
- Familiarity with cluster management and job scheduling tools (e.g., Slurm, PBS, LSF)
- Strong troubleshooting and performance tuning skills
Preferred Qualifications
- Experience working in secure or regulated environments (e.g., defense, aerospace, research, or government projects)
- Knowledge of parallel computing (MPI, OpenMP)
- Experience with GPU computing environments
- Linux certifications (RHCSA, RHCE, or equivalent)
Job Types: Full-time, Contract
Application Question(s):
- What is your expected salary?
- What is your nationality?
- Do you have hands-on experience administering Linux servers in a production environment?
- How many years of experience do you have working with High Performance Computing (HPC) clusters?
- Which Linux distributions have you worked with? (Select all that apply)
  - Red Hat Enterprise Linux
  - CentOS / Rocky Linux / AlmaLinux
  - Ubuntu
  - SUSE
  - Other
- Do you currently hold any Linux certifications (RHCSA, RHCE, or equivalent)?
- What is your notice period in days?
- Do you have experience with system performance tuning and troubleshooting in Linux environments?
Location: