Qureos

FIND_THE_RIGHTJOB.

Senior Data Scientist - Hybrid

India

Drive frontier research in speech-to-speech and multimodal AI systems to build natural, self-learning, voice-capable AI Workers.

Company details
GoCommotion is a fast-growing startup revolutionizing Customer Experience (CX) with AI Workers — persistent, multimodal agents capable of real-time, contextual conversations across channels. We blend LLMs, voice modeling, and agentic reinforcement to build truly autonomous AI systems.
Website: https://www.gocommotion.com/

Requirements
  • 3+ years of applied or academic experience in speech, multimodal, or LLM research
  • Bachelor’s or Master’s in Computer Science, AI, or Electrical Engineering
  • Strong in Python and scientific computing, including JupyterHub environments
  • Deep understanding of LLMs, transformer architectures, and multimodal embeddings
  • Experience in speech modeling pipelines: ASR, TTS, speech-to-speech, or audio-language models
  • Knowledge of turn-taking systems, VAD, prosody modeling, and real-time voice synthesis
  • Familiarity with self-supervised learning, contrastive learning, and agentic reinforcement (ART)
  • Skilled in dataset curation, experimental design, and model evaluation
  • Comfortable with tools like Agno, Pipecat, HuggingFace, and PyTorch
  • Exposure to LangChain, vector databases, and memory systems for agentic research
  • Strong written communication and clarity in presenting research insights
  • High research curiosity, independent ownership, and mission-driven mindset
  • Currently employed at a product-based organisation
Responsibilities
  • Research and develop direct speech-to-speech modeling using LLMs and audio encoders/decoders
  • Model and evaluate conversational turn-taking, latency, and VAD for real-time AI
  • Explore Agentic Reinforcement Training (ART) and self-learning mechanisms
  • Design memory-augmented multimodal architectures for context-aware interactions
  • Create expressive speech generation systems with emotion conditioning and speaker preservation
  • Contribute to SOTA research in multimodal learning, audio-language alignment, and agentic reasoning
  • Define long-term AI research roadmap with the Research Director
  • Collaborate with MLEs on model training and evaluation, while leading dataset and experimentation design

Job Details
Location: Hybrid — Mumbai, Bengaluru, Chennai, India

Interview process
  • Screening / HR round
  • Technical round(s) — coding, system design, ML case studies
  • ML / research deep dive
  • Final / leadership round

Important Note
ClanX is a recruitment partner, helping GoCommotion hire the Senior Machine Learning Engineer.

© 2025 Qureos. All rights reserved.