Location: Lahore (On-site)
Office Timing: 12:30 PM – 9:00 PM
Salary: Competitive, based on experience
Contract Type: Full-Time, Permanent
Experience: 3+ Years of Experience
About Us:
We are a leading provider of innovative software solutions in UK, focused on delivering cutting-edge technologies that drive efficiency, automation, and smart decision-making. Our platform is already in use across various industries, and we’re looking for an AI Expert to enhance and evolve our software with advanced AI capabilities. Join us and help shape the future of smart software solutions!
Responsibilities:
- Design and deploy LLM-powered tools, chat assistants, and automation modules
- Implement AI/ML pipelines for inference deployment at scale
- Collaborate with Backend developers to integrate AI APIs and background processing
- Optimize model performance and latency, especially on low-resource GPU environments
- Explore and fine-tune open-source models (e.g., LLaMA, Mistral, SQL coders, etc.)
- Monitor performance, usage, and resource allocation across multiple user sessions
- Contribute to AI roadmap, architecture decisions, and deployment best practices
Required Skills:
- Hands-on experience with LLMs, NLP, and custom model training/fine-tuning
- Proficiency in Python and frameworks like PyTorch or TensorFlow
- Familiarity with Hugging Face, LangChain, LLM orchestration, and prompt engineering
- Experience deploying AI on AWS or Linux-based environments with Docker
- Experience integrating high-performance LLM serving frameworks (e.g., vLLM, TGI) into backend APIs with support for parallel and batched inference
- Comfortable working with MySQL/MariaDB, Redis, and job queues
- Ability to build and manage GPU-efficient inference workloads
Nice to Have:
- Knowledge of multi-tenant inference services or load balancing for AI APIs
- Experience with healthcare AI or analytics dashboards
Job Type: Full-time
Pay: Rs150,000.00 - Rs350,000.00 per month
Work Location: In person