Job Title: Data Scientist (4+ Years Experience)
Location: Pune
Type: Full-time / On-site
About the Role
We are looking for an experienced Data Scientist with hands-on experience building AI/ML solutions, especially with LLMs, retrieval-augmented generation (RAG), and modern vector database architectures. The ideal candidate has strong problem-solving skills, advanced Python expertise, and experience deploying ML models in production environments.
Responsibilities
- Design, develop, and deploy end-to-end ML and NLP solutions.
- Build RAG-based pipelines using vector stores and LLMs for real-world use cases.
- Develop embedding pipelines using OpenAI, HuggingFace, SentenceTransformers, etc.
- Perform data cleaning, transformation, feature engineering, and EDA.
- Fine-tune foundation models and apply few-shot / in-context prompting techniques.
- Build reusable components leveraging LangChain, LlamaIndex, and other LLM toolkits.
- Work closely with Engineering and Product teams to convert business problems into AI solutions.
- Contribute to MLOps workflows, including model deployment, testing, and CI/CD.
- Prepare analytics dashboards, visualizations, and insights for stakeholders.
Required Skills & Experience
- Bachelor’s or Master’s degree in Computer Science, Data Science, AI, or a related field.
- 4+ years of hands-on experience in Machine Learning / NLP / GenAI.
- Practical experience with RAG architectures & vector databases (FAISS / Pinecone / Weaviate / ChromaDB).
- Proficiency in Python, LangChain, LlamaIndex, and LLM frameworks (OpenAI, HuggingFace Transformers).
- Strong experience with Pandas, NumPy, and Matplotlib for data manipulation & visualization.
- Experience in LLM fine-tuning, prompt engineering, and in-context learning strategies.
- Cloud AI platform experience (AWS SageMaker / Azure AI / Google Vertex AI).
- Experience integrating models into APIs and production systems (MLOps, CI/CD).
- Strong analytical mindset with the ability to communicate insights to both technical and business stakeholders.
Nice to Have
- Experience with Kafka, Spark, or real-time data processing frameworks.
- Exposure to vector search optimization and prompt evaluation metrics.
- Knowledge of multi-agent AI frameworks / orchestration engines.
Job Types: Full-time, Permanent
Pay: ₹90,000.00 - ₹100,000.00 per month
Work Location: In person