On behalf of our client from the USA, Mobilunity is looking for a Senior Infrastructure/Platform Engineer.
Our client builds a self-hostable inference engine for pre-trained models. It packs a wide catalog of embedding, reranking, extraction, and OCR models onto shared GPUs at high utilization, letting customers run them in their own cloud at a fraction of the cost of managed APIs. Search and document processing are the initial focus areas, though the engine is general-purpose. The project is open source under a permissive license, holds a recognized security compliance certification, and has an actively growing developer community. The company is based in the US and has raised a sizable round from well-known venture investors.
You’ll help build the engine and own how it gets deployed and operated. That spans the deployment tooling shipped to customers (infrastructure-as-code modules, Kubernetes packaging, deployment guides, air-gapped install paths) and the internal platform behind it (build/release pipelines, multi-architecture GPU images, model caching, evaluation infrastructure). Deployments currently target two major public clouds, with additional providers, including specialized GPU cloud providers, to follow.
As the company moves toward offering managed clusters, you’ll help define the operational bar that makes that credible: service-level objectives, on-call practices, scale-from-zero behavior, observability, and GPU cost efficiency.
The team works in short, multi-week milestones with high autonomy and a workflow built heavily around AI coding agents (substantial token usage weekly). Senior engineers own requirements end-to-end across multiple projects.
Requirements:
In return we offer:
Come on board, and let’s grow together!