Job Title : GenAI / ML Engineer Location: Quincy, MA (Hybrid 3 days a week onsite) Duration : Long term contract Contract: W2 / C2C Job Description : Role Overview: Responsible for building, optimizing, and operating high-scale AI/ML systems with strong performance, accuracy, and reliability guarantees. Key Responsibilities: Develop and deploy ML and GenAI models for enterprise use cases Build RAG pipelines capable of processing millions of documents Optimize inference latency and model throughput Implement monitoring for accuracy, drift, and performance Support MLOps pipelines and CI/CD for models Performance, Load & Accuracy Focus: Experience tuning models for latency and cost efficiency Define and monitor precision, recall, and accuracy metrics Support load testing and stress testing of AI services Experience with distributed inference architectures Technology Stack: Python PyTorch, TensorFlow, HuggingFace AWS AI services Vector databases and embeddings MLOps tools, CI/CD pipelines Experience & Qualifications: 4 8+ years of AI/ML engineering experience with exposure to enterprise-scale systems. For applications and inquiries, contact:
hirings@openkyber.com