Job Summary:
We are seeking a skilled and innovative AI Engineer to design, develop, and implement AI-driven solutions to support our products and operations. The ideal candidate will have strong experience in machine learning, deep learning, data processing, and software engineering.
Responsibilities:
LLM Development & Fine-Tuning
- Fine-tune and adapt state-of-the-art LLMs (e.g., LLaMA, GPT, Mistral, Gemma) for domain-specific tasks.
- Implement advanced training strategies including LoRA, QLoRA, PEFT, and parameter-efficient adaptation methods.
- Manage dataset preprocessing, curation, augmentation, and experiment tracking.
- Optimize models for quality, latency, and memory efficiency across training and inference.
GPU-Based Training & Optimization
- Run and optimize training pipelines across multi-GPU environments (NVIDIA A100, H100, RTX, etc.).
- Implement mixed-precision training, quantization, and distributed training (e.g., DeepSpeed, FSDP, DDP).
- Monitor GPU performance, memory usage, and throughput to ensure optimal resource utilization.
Model Serving & Infrastructure
- Deploy LLMs using modern serving frameworks (vLLM, TensorRT-LLM, HuggingFace TGI, Triton).
- Develop scalable inference architectures using Docker, Kubernetes, and cloud GPU infrastructure.
- Build end-to-end pipelines for continuous training, evaluation, and automated deployment.
Research, Experimentation & Innovation
- Stay updated with cutting-edge research in transformers, embeddings, RAG, PEFT, and efficient fine-tuning.
- Run experiments, benchmark models, and propose architectural or training improvements.
- Prototype new capabilities and develop internal best practices for LLM workflows.
Collaboration & Leadership
- Work closely with product, engineering, and data teams to translate requirements into model behaviors.
- Mentor junior engineers and contribute to technical design reviews and architecture decisions.
- Communicate model performance, limitations, and trade-offs clearly to stakeholders
Requirements:
- Bachelor’s or Master’s degree in Computer Science, AI, Data Science, Engineering, or a related field.
- Strong coding skills in Python (TensorFlow, PyTorch, Scikit-learn, OpenCV, etc.).
- Solid understanding of machine learning algorithms, deep learning, NLP, or computer vision.
- Experience with cloud platforms (AWS, Azure, GCP) is a plus.
- Familiarity with MLOps tools (Docker, Kubernetes, MLflow, etc.) preferred.
- Strong problem-solving skills and ability to work on complex, data-driven problems.
- Ability to work collaboratively in a team-oriented environment.
Preferred Qualifications:
- Experience deploying AI models in production environments.
- Knowledge of LLMs (Large Language Models) and prompt engineering.
- Background in data engineering or backend development.
- Research experience or publications in AI/ML fields.
You can share your cv on whtsapp as well. 03355707047
Job Type: Full-time
Pay: Rs200,000.00 - Rs400,000.00 per month
Work Location: In person