Data Scientist (Speech) | BLR

JOB_REQUIREMENTS

Hires in

Not specified

Employment Type

Not specified

Company Location

Not specified

Salary

Not specified

What You’ll Do

Dive into model architectures (ASR / TTS / SLMs) and optimize them for specific GPUs and hardware profiles
Build, debug, and tune kernels using CUDA / Tinygrad / AMD toolchains
Convert, optimize, and benchmark models using TensorRT, ONNX, and other inference engines
Work hands-on with PyTorch to train, fine-tune, and evaluate real-time speech models
Run large-scale experiments, manage datasets, and analyze model performance at scale
Productionize models for ultra-low latency speech workloads
Collaborate with research, infra, and product teams to push models into production

Requirements

Strong experience with CUDA, Tinygrad, AMD GPU toolkit, or similar low-level GPU programming stacks
Hands-on proficiency with PyTorch and Python
Deep understanding of neural networks, training dynamics, and optimization
Experience handling and processing large datasets
Familiarity with production inference pipelines
Strong problem-solving skills with ability to go deep into performance bottlenecks

Great to Have

Similar jobs

No similar jobs found