Job Summary:
We are seeking an experienced AI Engineer with deep expertise in fine-tuning and optimizing Large Language Models (LLMs) such as GPT, DeepSeek, Llama, BERT, Mistral, and other models available through Hugging Face. The ideal candidate will have a strong background in Retrieval-Augmented Generation (RAG), agentic AI frameworks (e.g., LangGraph), and building scalable, secure AI solutions for autonomous agents and enterprise applications.
Key Responsibilities:
- Fine-tune and optimize LLMs (e.g., GPT, Llama, Mistral, BERT) for specific use cases and enterprise deployment.
- Design and implement advanced RAG pipelines for knowledge-intensive tasks.
- Build and deploy autonomous AI agents using agentic frameworks such as LangGraph.
- Develop and integrate RESTful APIs for scalable AI services.
- Collaborate with cross-functional teams to deliver high-performance, secure AI applications.
- Continuously evaluate emerging AI technologies and best practices, and adopt them where they improve our systems.
- Ensure model efficiency, scalability, and compliance with security standards.
Required Skills:
- Proven experience with LLMs (GPT, BERT, Llama, etc.) and model fine-tuning.
- Hands-on expertise in RAG techniques and LangGraph or similar agentic AI tools.
- Proficiency in Python, deep learning frameworks (e.g., PyTorch, TensorFlow), and the Hugging Face ecosystem.
- Experience with AI model deployment, inference optimization, and API development.
- Strong understanding of data handling, prompt engineering, and performance tuning.
- Excellent communication and collaboration skills.