Qureos

Find The RightJob.

Artificial Intelligence Researcher

Are you passionate about advancing the frontier of AI? Do you thrive at the intersection of cutting-edge research and real-world impact? A fast-growing AI startup is looking for exceptional AI researchers and engineers with expertise in multimodal models to join their world-class team in Seattle, WA .

About Us

We are a well-funded startup backed by top-tier investors, with a mission to bring real-time AI avatars to life. Our team includes experts with advanced degrees from leading institutions and extensive experience at top tech companies. With significant funding and a clear vision, we are scaling rapidly and looking for top talent to help operationalize cutting-edge research into transformative products.

What You’ll Do

As an AI Research Engineer , you’ll play a pivotal role in bridging the gap between research and production. Your responsibilities will include:

  • Operationalizing Research : Collaborate with researchers to transition models from experimental checkpoints to production-ready systems. Establish scalable patterns for large-scale training, rapid experimentation, and deployment of new architectures.
  • Optimizing Model Performance : Profile and enhance model inference for latency and throughput using techniques like quantization, pruning, distillation, and architectural refinements to ensure cost-effective scalability.
  • Model Acceleration : Apply optimization techniques (e.g., TensorRT, ONNX, vLLM) to accelerate multimodal models, including video diffusion, large language models (LLMs), and speech models.
  • Designing Data Pipelines : Build efficient pipelines for video data ingestion, preprocessing, and training at petabyte scale using tools like Dagster and Ray.
  • Evaluating and Iterating : Develop evaluation frameworks to measure model quality, establish benchmarks, and guide continuous improvement of model capabilities.

What We’re Looking For

We’re seeking candidates with a strong foundation in AI research and production engineering. The ideal candidate will have:

  • Production ML Experience : Proven track record of deploying ML models to production, with a deep understanding of common failure modes (e.g., resource contention, OOMs, batch optimization) and how to address them.
  • Deep Learning Expertise : Strong knowledge of PyTorch and modern ML architectures, with experience training and optimizing large models (e.g., transformers, diffusion models).
  • Systems Proficiency : Comfort working with GPUs, debugging CUDA issues, and profiling model workloads to identify compute or memory bottlenecks.
  • Data Engineering Skills : Experience building scalable data pipelines for high-bandwidth media processing and training workflows.

Preferred Qualifications

  • Experience with video or audio models in research or production settings.
  • Familiarity with low-level optimization (e.g., CUDA kernels, Triton, custom operators).
  • Knowledge of real-time ML systems and latency-critical inference.
  • Prior work with model compression techniques (quantization, distillation, pruning).

Why Join Us?

  • Impactful Work : Your contributions will directly shape the future of real-time AI avatars, advancing cutting-edge research into products that touch millions of lives.
  • World-Class Team : Collaborate with some of the brightest minds in AI, including researchers and engineers from top institutions and companies.
  • Growth Opportunities : Be part of a fast-growing startup with significant funding and a clear vision for the future.
  • In-Person Collaboration : Work alongside your teammates in our vibrant Seattle HQ , fostering creativity and innovation through daily collaboration.

How to Apply

If you’re excited about the opportunity to work on multimodal AI models and help bring groundbreaking research to life, we’d love to hear from you! Apply now to join a team shaping the future of AI.

© 2026 Qureos. All rights reserved.