About Devfum
At Devfum, we leverage cutting-edge technologies such as web development, artificial intelligence (AI), and extended reality (XR) to automate business processes, enhance customer experiences, and drive sustainable growth worldwide.We foster a collaborative environment where you’ll work on cutting-edge technology, contribute to impactful projects, and grow through continuous learning and career development.
Role Overview:Devfum is hiring a Mid-Level AI Engineer to design, build, deploy, and operate production-grade AI systems.This is not a research only role and not a pure backend role. It is a backend heavy AI engineering role focused on building real AI products end to end.
You will work on:
- AI system architecture
- LLM-powered automation and agent systems
- Custom model training and fine-tuning
- GPU-based deployment and inference optimization
- Production AI infrastructure and monitoring
1. Key Responsibilities
- Design AI solutions from problem definition to production
- Architect LLM-based systems (agents, tool-calling, workflows, RAG, hybrid approaches)
- Integrate third-party LLM APIs and custom models into backend services
- Train or fine-tune generative models when APIs are insufficient; replicate research into deployable pipelines
- Build and maintain AI inference + workflow APIs using FastAPI/Flask (async inference, background jobs, retries, rate limiting, error handling)
- Deploy and operate AI systems on GPU infrastructure (CUDA/NVIDIA), across single- and multi-GPU environments
- Optimize inference for cost, latency, throughput, memory usage, and reliability (batching, caching, CPU vs GPU tradeoffs)
- Apply model optimization techniques (FP16/INT8 quantization, ONNX, TensorRT, pruning, distillation) when needed
- Debug production issues (model failures, drift, quality degradation, latency regressions) and drive long-term stability
- Implement monitoring/observability for performance, drift, and output quality; trigger tuning/retraining when required
- Build CI/CD for AI services (model builds, deployments, versioning, rollbacks) and automate ops with Linux + Bash
2. Requirements
- Bachelor’s degree in Computer Science, Software Engineering, Data Science, AI/ML (or equivalent practical experience)
- 2–3 years of real AI/ML engineering experience
- Proven experience deploying AI/ML or LLM systems to production
- Hands-on experience with GPUs (CUDA/NVIDIA), and operating inference workloads on GPU servers
- Strong knowledge and hands-on experience building AI automation systems using LangChain and LangGraph.
- Strong Python backend engineering skills; experience building production APIs (FastAPI/Flask)
- Experience with Docker and CI/CD pipelines for deploying services/models
- Experience debugging live AI systems in production (failures, drift, latency, reliability issues)
- Familiarity with LLM systems: agents, tool calling, workflows, RAG, and evaluation approaches
- Familiarity with inference optimization concepts (batching, quantization FP16/INT8, ONNX/TensorRT)
- Ownership mindset: takes responsibility end-to-end and drives tasks to completion
- Calm under failure; handles incidents and ambiguity without panic
- Clear communicator (written + verbal), can explain tradeoffs and decisions
- Prioritizes reliability and measurable outcomes over hype and experimentation for its own sake
- Comfortable with responsibility and operating production systems on Linux environments
Why Join Devfum?
We offer an environment that fosters growth, learning, and work-life balance, ensuring that our employees feel valued and motivated.
- Bi-Annual Salary Increments: Recognizing your contributions regularly.
- Clear Career Growth Roadmap: Software Engineer → Senior Software Engineer → Engineering Manager.
- Continuous Learning: Access to paid courses to enhance your skills.
- Daily Meals: Enjoy complimentary lunch and tea/coffee.
- Recreational Activities: Monthly sports activities and annual company tours.
Work Schedule
- Monday – Friday, 12 PM – 9 PM
Why Join Devfum?
We foster an environment that encourages growth, learning, and work-life balance to ensure our employees feel valued and motivated.
- Clear Career Growth Roadmap – A structured path for your professional development
- Continuous Learning Opportunities – Access to paid courses to sharpen your skills
- Daily Lunch & Tea – Complimentary meals and refreshments
- Monthly Sports Activities – Fun activities to stay active
Submit your application today!
Job Type: Full-time
Pay: Rs150,000.00 - Rs250,000.00 per month
Application Question(s):
- Kindly share your GitHub profile link
- How many years of real AI/ML engineering experience
Education:
Work Location: In person