Are you passionate about advancing the frontier of AI? Do you thrive at the intersection of cutting-edge research and real-world impact? A fast-growing AI startup is looking for exceptional AI researchers and engineers with expertise in multimodal models to join their world-class team in Seattle, WA.
About Us
We are a well-funded startup backed by top-tier investors, with a mission to bring real-time AI avatars to life. Our team includes experts with advanced degrees from leading institutions and extensive experience at top tech companies. With significant funding and a clear vision, we are scaling rapidly and looking for top talent to help operationalize cutting-edge research into transformative products.
What You’ll Do
As an AI Research Engineer, you’ll play a pivotal role in bridging the gap between research and production. Your responsibilities will include:
- Operationalizing Research: Collaborate with researchers to transition models from experimental checkpoints to production-ready systems. Establish scalable patterns for large-scale training, rapid experimentation, and deployment of new architectures.
- Optimizing Model Performance: Profile and enhance model inference for latency and throughput using techniques like quantization, pruning, distillation, and architectural refinements to ensure cost-effective scalability.
- Model Acceleration: Apply optimization tools (e.g., TensorRT, ONNX, vLLM) to accelerate multimodal models, including video diffusion models, large language models (LLMs), and speech models.
- Designing Data Pipelines: Build efficient pipelines for video data ingestion, preprocessing, and training at petabyte scale using tools like Dagster and Ray.
- Evaluating and Iterating: Develop evaluation frameworks to measure model quality, establish benchmarks, and guide continuous improvement of model capabilities.
What We’re Looking For
We’re seeking candidates with a strong foundation in AI research and production engineering. The ideal candidate will have:
- Production ML Experience: Proven track record of deploying ML models to production, with a deep understanding of common failure modes (e.g., resource contention, OOMs, batch optimization) and how to address them.
- Deep Learning Expertise: Strong knowledge of PyTorch and modern ML architectures, with experience training and optimizing large models (e.g., transformers, diffusion models).
- Systems Proficiency: Comfort working with GPUs, debugging CUDA issues, and profiling model workloads to identify compute or memory bottlenecks.
- Data Engineering Skills: Experience building scalable data pipelines for high-bandwidth media processing and training workflows.
Preferred Qualifications
- Experience with video or audio models in research or production settings.
- Familiarity with low-level optimization (e.g., CUDA kernels, Triton, custom operators).
- Knowledge of real-time ML systems and latency-critical inference.
- Prior work with model compression techniques (quantization, distillation, pruning).
Why Join Us?
- Impactful Work: Your contributions will directly shape the future of real-time AI avatars, advancing cutting-edge research into products that touch millions of lives.
- World-Class Team: Collaborate with some of the brightest minds in AI, including researchers and engineers from top institutions and companies.
- Growth Opportunities: Be part of a fast-growing startup with significant funding and a clear vision for the future.
- In-Person Collaboration: Work alongside your teammates in our vibrant Seattle HQ, fostering creativity and innovation through daily collaboration.
How to Apply
If you’re excited about the opportunity to work on multimodal AI models and help bring groundbreaking research to life, we’d love to hear from you! Apply now to join a team shaping the future of AI.