Cisco Systems
, a leading US technology company, is seeking an
AI Engineer – Golang & Microservices
to design, build, and deploy production-grade AI systems leveraging RAG, agent orchestration, and model optimization techniques, integrated into high-performance Golang microservices. This fully remote role from Egypt involves close collaboration with US-based teams while owning the scalability, reliability, governance, and observability of AI platforms in production.
In this role, you will
architect and implement
distributed AI systems that merge intelligent model pipelines with resilient
Golang microservices
, ensuring performance, governance, and operational excellence across cloud environments.
Job Description
-
Architect, build, and operate production-grade AI platforms integrating LLM systems with scalable Golang microservices.
-
Design and implement Retrieval-Augmented Generation (RAG) and hybrid search pipelines with vector databases and knowledge graphs for grounded AI responses.
-
Develop and orchestrate agent-based AI workflows with structured tool calling, multi-step reasoning, and state management.
-
Build high-performance REST and gRPC services in Golang to serve AI inference, embeddings, and evaluation pipelines at scale.
-
Implement model optimization strategies including LoRA-based fine-tuning, distillation, quantization, and pruning for efficient deployment.
-
Establish automated evaluation, guardrails, and observability frameworks to monitor hallucination rates, drift detection, and model performance.
-
Integrate AI services with cloud-native infrastructure (AWS/GCP/Azure) using Docker and Kubernetes for scalable and reliable deployment.
-
Design event-driven AI pipelines leveraging Kafka or similar messaging systems for asynchronous processing and workflow orchestration.
-
Enforce AI governance, privacy, and security standards including adversarial testing, red-teaming practices, and data protection controls.
-
Collaborate with cross-functional teams to improve reliability, scalability, and performance while continuously refining AI system architecture.
Qualifications
-
2+ years of experience in AI Systems Engineering or Platform roles, with strong production exposure.
-
Strong proficiency in Golang, building scalable REST and gRPC microservices in distributed environments.
-
Hands-on experience designing and deploying LLM-powered systems, including RAG pipelines, agent workflows, or tool orchestration.
-
Practical experience with model optimization techniques such as LoRA, distillation, quantization, or other PEFT methods.
-
Experience integrating AI services with vector databases, hybrid search, or knowledge graph systems for grounded generation.
-
Solid understanding of cloud-native architecture (AWS/GCP/Azure), containerization (Docker), and Kubernetes-based deployments.
-
Experience implementing observability, evaluation frameworks, and reliability practices for AI systems (monitoring, guardrails, drift detection), and working in an agile environment.
-
Familiarity with asynchronous architectures using Kafka or similar messaging systems for event-driven AI workflows.
-
Strong understanding of system performance, scalability, security, and AI governance principles in production environments.