Position Overview
Mid-Level AI Engineer with strong hands-on experience in end-to-end AI system development, specializing in on-premises and private cloud AI deployments. This role focuses on building production-grade AI solutions using primarily local models, optimizing GPU-based training and fine-tuning pipelines, and developing scalable RAG, agentic, multimodal, and speech-enabled systems.
Key Responsibilities
-
Analyze requirements and contribute to technical AI solution design.
-
Strong programming skills in Python (mandatory).
-
Experience in backend development (APIs, microservices, model serving).
-
Hands-on experience with local LLM deployment and optimization.
-
Experience in RAG system design and vector databases.
-
Practical experience in model fine-tuning (LoRA / PEFT).
-
Solid understanding of: Transformer architecture, quantization techniques, and embeddings with retrieval.
-
Experience working in Linux environments.
-
Participate in peer code reviews and team planning activities.
-
Provide task estimations and deliver within approved timelines.
-
Apply secure coding practices and participate in bug resolution with quality metrics in mind.
Qualifications
-
Bachelor’s degree in computer science, Artificial Intelligence, Data Science, or related discipline.
-
2-5 years of experience in AI Engineering.
-
Proficiency in C#, .NET Core and Python.
-
Experience with Structured and Unstructured databases.
-
Experience with Angular or other frontend technologies is a strong plus.
-
Working knowledge of relational databases (i.e., SQL Server) and non-relational databases (i.e., Qdrant)
-
Familiarity with Git, RESTful services, and Agile development methodologies