Qureos

FIND_THE_RIGHTJOB.

JOB_REQUIREMENTS

Hires in

Not specified

Employment Type

Not specified

Company Location

Not specified

Salary

Not specified

We’re Hiring: AI Tech Lead (Real-Time AI & GenAI Systems)

We are looking to hire a strong AI Tech Lead who can architect and guide the implementation of real-time AI systems across voice automation, LLM pipelines, and knowledge-enhanced applications.

This role demands deep architectural judgment, hands-on expertise with LLMs and GenAI, and the ability to define best practices, performance benchmarks, and scalable system designs for low-latency, production-grade AI products.

Role Overview

As an AI Tech Lead, you will lead the design and development of distributed, provider-agnostic AI architectures involving multi-provider LLMs, speech pipelines (STT/TTS), and AI microservices. You will be responsible for ensuring low latency, high reliability, scalability, and cost efficiency across real-time AI systems.

You will also set technical standards, guide engineers, and make high-impact architectural decisions for enterprise and customer-facing AI products.

Key Responsibilities

AI Architecture & Systems Design

  • Architect and lead development of real-time AI systems including:
  • Multi-provider LLM pipelines
  • Speech-to-text (STT) and text-to-speech (TTS) chains
  • Distributed AI microservices
  • Ensure low speech-to-speech latency, resilient failover, load balancing, and horizontal scaling
  • Design provider-agnostic architectures with intelligent routing and fallback strategies
  • Optimize inference paths for performance and cost efficiency

GenAI Best Practices & Governance

  • Define architectural “dos and don’ts” for GenAI systems:
  • Prompt design and safety patterns
  • Context shaping and window optimization
  • Caching and fallback logic
  • Real-time agent orchestration
  • Lead model governance and evaluation frameworks:
  • Benchmarking multiple LLM providers
  • Routing logic based on latency, quality, and cost
  • Token efficiency and context management
  • Establish KPIs for:
  • LLM quality and grounding accuracy
  • Latency and reliability
  • Cost control and system stability

Knowledge & RAG Architecture

  • Own end-to-end RAG and knowledge integration architecture
  • Design and optimize:
  • Vector databases and retrieval strategies
  • Chunking and embedding policies
  • Context enhancement and hybrid grounding approaches
  • Function calling, tool use, and emerging knowledge-graph integrations
  • Ensure retrieval quality supports accurate, safe, and domain-grounded responses

Security & Data Handling

  • Ensure secure handling of voice and transcript data
  • Apply basic PII-safe practices and data isolation
  • Support retention, deletion, and multi-tenant data controls

Technical Leadership

  • Provide technical guidance and mentorship to AI engineers
  • Define performance benchmarks, design standards, and review practices
  • Stay ahead of new LLM capabilities, voice AI trends, and GenAI tooling

Required Skills & Experience

  • Strong experience designing and building production-grade AI systems
  • Deep hands-on expertise with LLMs, RAG, and GenAI architectures
  • Experience with real-time or low-latency systems (voice AI preferred)
  • Strong understanding of distributed systems, microservices, and scaling patterns
  • Experience working with multiple LLM providers and routing strategies
  • Ability to balance quality, latency, reliability, and cost
  • Strong architectural thinking and decision-making skills

Nice to Have

  • Experience with voice automation platforms and conversational AI
  • Exposure to function calling, tool-using agents, and knowledge graphs
  • Prior experience defining AI governance or platform standards
  • Experience in multi-tenant SaaS AI platforms

What You’ll Get

  • Ownership of core AI architecture decisions
  • Opportunity to build and scale real-time AI systems used in production
  • Work on cutting-edge voice AI, LLM, and knowledge-grounded applications
  • Influence long-term AI platform strategy and technical direction
  • Collaborate with senior engineers, product leaders, and founders

Role: AI Tech Lead
Location: Pune / Hybrid (as applicable)
Experience Level: Senior / Lead

Job Type: Contractual / Temporary
Contract length: 6 months

Pay: ₹213,343.82 - ₹303,019.68 per month

Work Location: In person

© 2025 Qureos. All rights reserved.