FIND_THE_RIGHTJOB.
JOB_REQUIREMENTS
Hires in
Not specified
Employment Type
Not specified
Company Location
Not specified
Salary
Not specified
PS Global Competency Center
Hewlett Packard Enterprise
Job Title Lead Solutions Architect AI Infrastructure Private Cloud
Job Description
We are seeking an experienced Lead Solutions Architect with deep expertise in AIML infrastructure High Performance Computing HPC and container platforms to join our dynamic team focused on delivering HPE Private Cloud AI and Enterprise AI Factory Solutions This role is instrumental in architecting deploying and optimizing private cloud environments that leverage HPEs codeveloped solutions with NVIDIA as well as validated HPE reference architectures to support enterprisegrade AI workloads at scale
The ideal candidate will bring strong technical expertise in AI infrastructure container orchestration platforms and hybrid cloud environments and will play a key role in delivering scalable secure and highperformance AI platform solutions powered by HPE GreenLake and NVIDIA AI Enterprise technologies
Key Responsibilities
1Leadership and Strategy
Provide delivery assurance and serve as the lead design authority to ensure seamless execution of Enterprise grade container platform including Red Hat OpenShift and SUSE Rancher HPE Private Cloud AI and HPCAI solutions fully aligned with customer AIML strategies and business objectives
Align solution architecture with NVIDIA Enterprise AI Factory design principles including modular scalability GPU optimization and hybrid cloud orchestration
Oversee planning risk management and stakeholder alignment throughout the project lifecycle to ensure successful outcomes
2Solution Planning and Design
Architect and optimize endtoend solutions across container orchestration and HPC workload management domains leveraging platforms such as Red Hat OpenShift SUSE Rancher andor workload schedulers like Slurm and Altair PBS Pro
Ensure seamless integration of container and AI platforms with the broader software ecosystem including NVIDIA AI Enterprise as well as opensource DevOps AIML tools and frameworks
3Opportunity assessment
Lead technical responses to RFPs RFIs and customer inquiries ensuring alignment with business and technical requirements
Conduct proofofconcept PoC engagements to validate solution feasibility performance and integration within customer environments
Assess customer infrastructure and workloads to recommend optimal configurations using validated reference architectures from HPE and strategic partners such as Red Hat NVIDIA SUSE along with components from the opensource ecosystem
4Innovation and Research
Stay current with emerging technologies industry trends and best practices across HPC Kubernetes container platforms hybrid cloud and security to inform solution design and innovation
5Customercentric mindset
Act as a trusted advisor to enterprise customers ensuring alignment of AI solutions with business goals
Translate complex technical concepts into value propositions for stakeholders
6Team Collaboration
Collaborate with crossfunctional teams including subject matter experts in infrastructure componentssuch as HPE servers storage networkingand data science teams to ensure cohesive and integrated solution delivery
Mentor technical consultants and contribute to internal knowledge sharing through tech talks and innovation forums
Required Skills
1 HPC AI Infrastructure
Extensive knowledge of HPC technologies and workload scheduler such as Slurm andor Altair PBS Pro
Proficient in HPC cluster management tools including HPE Cluster Management HPCM andor NVIDIA Base Command Manager
Experience with HPC cluster managers like HPE Cluster Management HPCM andor NVIDIA Base Command Manager
Good understanding with highspeed networking stacks InfiniBand Mellanox and performance tuning of HPC components
Solid grasp of highspeed networking technologies such as InfiniBand and Ethernet
2 Containerization Orchestration
Extensive handson experience with containerization technologies such as Docker Podman and Singularity
Proficiency with at least two container orchestration platforms CNCF Kubernetes Red Hat OpenShift SUSE Rancher RKEK3S Canonical Charmed Kubernetes
Strong understanding of GPU technologies including the NVIDIA GPU Operator for Kubernetesbased environments and DCGM Data Center GPU Manager for GPU health and performance monitoring
3Operating Systems Virtualization
Extensive experience in Linux system administration including package management boot process troubleshooting performance tuning and network configuration
Proficient with multiple Linux distributions with handson expertise in at least two of the following RHEL SLES and Ubuntu
Experience with virtualization technologies including KVM and OpenShift Virtualization for deploying and managing virtualized workloads in hybrid cloud environments
4 Cloud DevOps MLOps
Mandatory Skills : Architecture Patterns and Styles,Angular,Ansible,Java,JavaScript,Jenkins,Kubernetes,Application Architecture,Application Rearchitecting,Microservices,Node.js,Architectural diagrams,Asp.net,PostgreSQL,PowerShell,.Net Core,SpringBoot,Azure DevOps,Azure Functions,Terraform,Azure Logic Apps,Azure Monitor,Azure Service Bus,Azure SQL,Gitlab,C#,.Net Framework,Azure Cloud Architecture,Azure Frontdoor,Entity Framework (EF/EF Core),Azure App Service,Architectural Patterns
Good to Have Skills : Architectural diagrams
Similar jobs
No similar jobs found
© 2026 Qureos. All rights reserved.