Arbisoft is looking for an experienced ML Engineer to design and deploy cutting-edge AI solutions, including LLMs, RAG pipelines, and agentic workflows. The ideal candidate brings deep expertise in Python, transformers, and scalable cloud-based ML systems.
Key Responsibilities:
-
Design, implement, and evaluate ML/DL models using PyTorch, TensorFlow, or similar frameworks
-
Build and optimize LLM-based systems, including prompt-tuning, fine-tuning, and adapter-based training (e.g., LoRA, QLoRA)
-
Develop robust and scalable RAG pipelines. Integrate vector databases like FAISS, Pinecone, Weaviate, etc.
-
Construct and maintain Agentic AI workflows involving multi-step reasoning, tool calling, memory components, and planning logic.
-
Work with Proprietary APIs, as well as open-source libraries and models.
-
Develop modular and clean Python code, adhering to software engineering best practices (OOP, reusable components, testing).
-
Implement scalable solutions in cloud environments (like AWS), leveraging GPU/TPU resources effectively.
-
Design inference pipelines that are robust and optimized for latency and throughput.
-
Collaborate with research and product teams to translate ideas into production-grade ML features.
Required Skills:
-
3+ years of experience in machine learning and deep learning, including building models from scratch.
-
Has a track record of shipping ML solutions that scale in production.
-
Strong proficiency in Python and deep understanding of software design principles.
-
Proven experience with transformer-based architectures, LLMs, and embedding models.
-
Hands-on experience with RAG systems, deep understanding of agent-based systems.
-
Familiarity with LangChain, LlamaIndex, or similar frameworks.
-
Experience with cloud platforms (AWS/GCP/Azure) and understanding of scalability, resource optimization, and model deployment.
-
Familiarity with performance profiling, efficient model serving, and hardware-aware design (e.g., GPU utilization, quantization).
-
Ability to read, debug, and contribute to complex ML/DL codebases.
Good to have:
-
Experience with MLOps, orchestration tools (e.g., Airflow, AWS Step Functions), containerization (Docker, Kubernetes.
-
Exposure to optimization toolkits (ONNX, TensorRT) and serving frameworks (Triton, TorchServe).
-
Experience with experiment tracking (e.g., Weights & Biases, Comet)
-
Understanding of alignment techniques like RLHF or curriculum learning