Company: Botmer International
Type: Full-time | Remote
Working Hours: 9:00 AM – 5:00 PM PKT
Days: Monday to Friday
Role Overview
We are hiring a Senior AI / ML Engineer to work on a production-grade AI infrastructure and evaluation platform. The role focuses on model evaluation, benchmarking, reproducibility, and continuous testing rather than on model training alone.
This position suits engineers who understand how AI systems behave in real-world environments and can build reliable, repeatable evaluation pipelines.
Key Responsibilities
- Design and implement AI/ML evaluation and benchmarking pipelines
- Build systems for reproducible experiments and deterministic model runs
- Integrate and validate models across different frameworks and formats
- Implement performance, safety, efficiency, and reward-based metrics
- Develop automated workflows for model ingestion, validation, and execution
- Maintain continuous testing infrastructure for AI models
- Collaborate in short, fast-paced development sprints (1–2 weeks)
- Use AI-assisted development tools to accelerate delivery
Required Skills & Experience
- 5+ years of hands-on AI / Machine Learning experience
- Strong Python skills and a track record of writing production-quality code
- Solid experience with PyTorch
- Experience working on model evaluation, validation, or benchmarking systems
- Understanding of experiment tracking, reproducibility, and ML metrics
- Experience deploying ML workloads in cloud or containerized environments
- Ability to work autonomously and take technical ownership
Preferred (Not Mandatory)
- Experience with robotics, embodied AI, or reinforcement learning concepts
- Familiarity with ONNX, TensorFlow, or HuggingFace models
- Exposure to MLOps, CI/CD pipelines for ML, or continuous evaluation systems
What We Offer
- Fully remote role with consistent, fixed working hours
- Work on technically deep, non-trivial AI systems
- High autonomy and long-term engagement
- Competitive compensation based on experience
Job Type: Full-time
Application Question(s):
- Have you worked on ML model evaluation, benchmarking, or reproducible experimentation (not only training)?
- Total years of hands-on AI / ML experience:
- Do you have production experience with PyTorch?
- Have you worked with continuous testing, CI/CD, or automated evaluation for ML systems?
- Current monthly salary (PKR):
- Expected monthly salary (PKR):
- Can you join immediately? If not, how long is your notice period?
Work Location: Remote