We are seeking an Associate AI Engineer to join our development team. You will work directly on building and optimizing sophisticated AI applications using Generative AI and Computer Vision. Your primary focus will be on NLP tasks, fine-tuning Large Language Models (LLMs), and building robust RAG pipelines.
Key Responsibilities:
- LLM Application Development: Build agents and workflows using Agentic AI Frameworks to automate complex business logic.
- RAG Systems: Design and maintain Retrieval-Augmented Generation pipelines, managing vector databases (Pinecone/Milvus) and optimizing retrieval accuracy.
- Model Fine-Tuning: Assist in fine-tuning open-source models (Llama-3, Mistral) on custom datasets using efficient techniques.
- Computer Vision: Integrate vision capabilities (such as YOLO or multimodal analysis) into agent workflows.
- Deployment: Dockerize AI applications and deploy models on GPU-accelerated environments for inference.
- Data Preparation: Clean and curate datasets for training and vector ingestion.
Required Technical Skills:
- Python Proficiency: Strong coding skills in Python, specifically with AsyncIO and API frameworks like FastAPI.
- Generative AI Stack: Hands-on experience with Voice Models & Speech AI, OpenAI/Anthropic APIs, and HuggingFace transformers.
- NLP Fundamentals: Understanding of embeddings, tokenization, and context window management.
- Vector Databases: Experience implementing RAG using tools like Pinecone, ChromaDB, or Qdrant.
- Deep Learning Basics: Familiarity with PyTorch and the concept of training loops.
- Computer Vision: Basic familiarity with object detection (YOLO) or image processing (OpenCV).
Nice to Have:
- Experience with LangGraph for multi-agent orchestration.
- Knowledge of model evaluation frameworks (RAGAS, etc.).
- Experience serving local models using vLLM or Ollama
Job Type: Full-time
Work Location: In person