Qureos

Mid-Level AI Engineer (Backend-Heavy)

About Devfum

At Devfum, we use web development, artificial intelligence (AI), and extended reality (XR) to automate business processes, enhance customer experiences, and drive sustainable growth worldwide. We foster a collaborative environment where you’ll work on cutting-edge technology, contribute to impactful projects, and grow through continuous learning and career development.

Role Overview

Devfum is hiring a Mid-Level AI Engineer to design, build, deploy, and operate production-grade AI systems. This is neither a research-only role nor a pure backend role. It is a backend-heavy AI engineering role focused on building real AI products end to end.

You will work on:

  • AI system architecture
  • LLM-powered automation and agent systems
  • Custom model training and fine-tuning
  • GPU-based deployment and inference optimization
  • Production AI infrastructure and monitoring

1. Key Responsibilities

  • Design AI solutions from problem definition to production
  • Architect LLM-based systems (agents, tool-calling, workflows, RAG, hybrid approaches)
  • Integrate third-party LLM APIs and custom models into backend services
  • Train or fine-tune generative models when APIs are insufficient; replicate research into deployable pipelines
  • Build and maintain AI inference + workflow APIs using FastAPI/Flask (async inference, background jobs, retries, rate limiting, error handling)
  • Deploy and operate AI systems on GPU infrastructure (CUDA/NVIDIA), across single- and multi-GPU environments
  • Optimize inference for cost, latency, throughput, memory usage, and reliability (batching, caching, CPU vs GPU tradeoffs)
  • Apply model optimization techniques (FP16/INT8 quantization, ONNX, TensorRT, pruning, distillation) when needed
  • Debug production issues (model failures, drift, quality degradation, latency regressions) and drive long-term stability
  • Implement monitoring/observability for performance, drift, and output quality; trigger tuning/retraining when required
  • Build CI/CD for AI services (model builds, deployments, versioning, rollbacks) and automate ops with Linux + Bash

2. Requirements

  • Bachelor’s degree in Computer Science, Software Engineering, Data Science, or AI/ML (or equivalent practical experience)
  • 2–3 years of real AI/ML engineering experience
  • Proven experience deploying AI/ML or LLM systems to production
  • Hands-on experience with GPUs (CUDA/NVIDIA), and operating inference workloads on GPU servers
  • Strong knowledge of LangChain and LangGraph, with hands-on experience building AI automation systems with them
  • Strong Python backend engineering skills; experience building production APIs (FastAPI/Flask)
  • Experience with Docker and CI/CD pipelines for deploying services/models
  • Experience debugging live AI systems in production (failures, drift, latency, reliability issues)
  • Familiarity with LLM systems: agents, tool calling, workflows, RAG, and evaluation approaches
  • Familiarity with inference optimization concepts (batching, quantization FP16/INT8, ONNX/TensorRT)
  • Ownership mindset: takes responsibility end-to-end and drives tasks to completion
  • Calm under failure; handles incidents and ambiguity without panic
  • Clear communicator (written + verbal), can explain tradeoffs and decisions
  • Prioritizes reliability and measurable outcomes over hype and experimentation for its own sake
  • Comfortable taking responsibility for, and operating, production systems in Linux environments

Why Join Devfum?

We offer an environment that fosters growth, learning, and work-life balance, ensuring that our employees feel valued and motivated.

  • Bi-Annual Salary Increments: Recognizing your contributions regularly.
  • Clear Career Growth Roadmap: Software Engineer → Senior Software Engineer → Engineering Manager.
  • Continuous Learning: Access to paid courses to enhance your skills.
  • Daily Meals: Enjoy complimentary lunch and tea/coffee.
  • Recreational Activities: Monthly sports activities and annual company tours.

Work Schedule

  • Monday – Friday, 12 PM – 9 PM

Submit your application today!

Job Type: Full-time

Pay: Rs150,000.00 - Rs250,000.00 per month

Application Question(s):

  • Kindly share your GitHub profile link
  • How many years of real AI/ML engineering experience do you have?

Education:

  • Bachelor's (Required)

Work Location: In person
