GPU Computer Vision Engineer

We're seeking passionate, driven, and self-motivated individuals to join our team. If you're eager to grow in a dynamic environment and be part of a company that’s at the forefront of AI innovation, this is your chance to hit the ground running.

VIDIZMO is a USA-based technology company headquartered in Tysons, Virginia, and a Microsoft Solutions Partner in Data & AI, Infrastructure, and Digital & App Innovation. Through our AI-powered Intelligence Hub, we empower Fortune 500 companies, large enterprises, governments, and the public sector to securely manage, analyze, and govern their data with complete control and compliance.

Our Multimodal AI Data Intelligence Platform leverages Large Language Models (LLMs) and RAG (Retrieval-Augmented Generation) to deliver powerful capabilities such as auto-tagging, redaction, content summarization, OCR, translation, subtitle creation, object detection and tracking, content search, sentiment and emotion analysis, topic extraction, document classification, and facial attribute detection.

AI Output Verification & Accountability (Required Competency)

The candidate must be capable of effectively using AI tools (e.g., Claude, ChatGPT, Copilot, or similar) to support work outputs such as content, analysis, documentation, or code. However, the candidate must also demonstrate strong judgment and ownership of the final deliverables. AI-generated output must never be treated as final without review. The candidate is expected to validate accuracy, completeness, logic, compliance, and alignment with business requirements, and refine AI-assisted work to meet the expected quality standards. The individual will be held accountable for the final output, regardless of whether AI tools were used during execution.

About the Role

We are seeking an experienced Computer Vision & Video Analytics Engineer with deep expertise in TensorRT optimization and GPU-accelerated inference. In this role, you will build and optimize real-time intelligent video analytics (IVA) systems by bridging cutting-edge ML models with high-performance production pipelines.

You will architect low-latency, GPU-accelerated video solutions using TensorRT, CUDA, NVIDIA DeepStream, and GStreamer, ensuring scalable, production-grade deployment across edge and cloud environments.

Responsibilities

Architect and optimize real-time video analytics pipelines using NVIDIA DeepStream and GStreamer.
Review and validate AI-generated outputs.
Take ownership of final deliverables, regardless of AI assistance.
Critically evaluate AI results and ensure they meet defined requirements.
Remain accountable for accuracy, quality, and compliance.
Lead TensorRT-based model optimization, including precision tuning (FP16/INT8), engine building, and performance benchmarking.
Maximize GPU throughput and minimize latency using CUDA kernels, memory optimization, and parallelization techniques.
Integrate computer vision models (detection, segmentation, tracking, classification) into high-performance production systems.
Develop clean and efficient C/C++ and Python code in Linux-based environments.
Troubleshoot, debug, and optimize GPU-accelerated inference pipelines.
Collaborate with ML, hardware, and backend engineering teams to refine and deploy end-to-end AI systems.
Participate in system design, code reviews, performance tuning, and optimization cycles.

Requirements

Professional experience in computer vision, GPU acceleration, or high-performance computing.
Strong proficiency in C/C++ and Python.
Hands-on experience with:
- TensorRT (model conversion, calibration, engine building, profiling)
- CUDA programming
- NVIDIA DeepStream SDK
- GStreamer pipelines
Strong background in CV frameworks and tools: PyTorch, TensorFlow, ONNX, OpenCV.
Understanding of core CV tasks: detection, segmentation, classification, tracking, and model optimization.
Familiarity with Kafka, MQTT, or microservice-based architectures.
Excellent debugging and performance engineering skills in GPU-accelerated environments.
Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field.
The ideal candidate demonstrates strong business acumen, translates objectives into impactful solutions, and effectively leverages AI tools for efficiency. Proficiency in Claude (mandatory) and familiarity with tools such as ChatGPT are required. An ownership mindset and the ability to deliver results independently are essential. Successful completion of a Claude-based assessment is required as part of the hiring process.

Preferred Qualifications

Experience with Docker, containerization, and Kubernetes.
Familiarity with NVIDIA Jetson, Orin, or edge AI devices.
Experience building custom GStreamer plugins or low-level CUDA kernels.
Knowledge of distributed inference or multi-GPU optimization.

Essential Skills

The ideal candidate demonstrates strong business acumen, can translate objectives into impactful solutions, and is proficient in leveraging AI tools, including mandatory use of platforms like ChatGPT, to enhance efficiency. An ownership mindset and the ability to drive outcomes with minimal task-level direction are essential.

Benefits: Health Insurance (OPD/IPD), Separate Maternity Cover, Leave Encashment, Car Support Program, Referral Bonus, EOBI, Bi-Annual Increment, Provident Fund, Career Growth, Bonus (benefits vary based on location)

Multiple Locations: Pakistan, India, UAE, Australia, Canada & USA
