Job Title: AI Algorithm Engineer
Location: Hybrid/Remote Mostly
Job Type: Full-time
Salary: Negotiable
Responsibilities
- Core AI Module Development
Deeply participate in the architecture design, feature development, unit testing, and continuous maintenance of C++ core components within AI training and inference systems.
Ensure high performance and reliability in large-scale engineering environments.
- Performance Analysis & System Optimization
Collaborate with the team to identify performance bottlenecks in AI training/inference workflows.
Optimize key paths such as operators, memory, and concurrency scheduling, solving performance challenges in real business scenarios.
- Hardware Acceleration & Low-Level Optimization
Research and apply GPU acceleration technologies (CUDA / ROCm) or CPU SIMD/vector instruction sets.
Integrate advanced computing optimization strategies into large-scale, computation-intensive tasks to improve training and inference performance.
- System Integration & Engineering Delivery
Integrate optimized low-level modules into existing AI frameworks (e.g., search/recommendation systems, semantic model training frameworks).
Participate in strict functional validation, stress testing, and performance evaluation.
- Cutting-Edge Technology Exploration
Continuously track developments in system software, parallel computing, high-performance computing (HPC), and AI infrastructure.
Work with the team to explore their feasibility and practical application in business scenarios.
Education Background
Bachelor’s degree or above in Computer Science, Software Engineering, Artificial Intelligence, Electrical Engineering, or related disciplines.
Programming & Engineering Skills
Solid C/C++ fundamentals; familiar with modern C++ features (C++11/14/17/20) and good engineering practices / design patterns.
Proficient in Linux/Unix development environments; capable of writing Shell scripts.
Practical Python development skills for model experiments, tooling, data processing, and automated testing.
Computer Science Fundamentals
Strong knowledge of data structures, algorithms, compilation, and performance tuning.
Understanding of computer architecture, CPU/GPU parallel architectures, and OS scheduling principles.
Learning & Problem Solving
Strong passion and self-motivation in low-level technologies, AI infrastructure, and performance optimization.
Excellent analytical, abstraction, and complex problem-solving abilities.
Preferred Qualifications (Plus)
Practical experience in C++ performance optimization: memory management, multithreading, lock optimization, cache-friendly programming, etc.
Familiarity with GPU architecture; experience with CUDA / ROCm programming.
Knowledge or experience in parallel computing (OpenMP, MPI, SIMD, distributed training, etc.).
Experience with at least one deep learning framework (PyTorch / TensorFlow), including operator development, model optimization, or inference engines.
- Experience in search, recommendation systems, embeddings, vector retrieval, or semantic models.
Bilingual ability in Chinese and English is required;Why Apply:- Competitive salary and performance-based bonuses.
- Paid holidays and PTO
- Snacks, drinks, and perks
- Health, dental, and vision insurance.
- Annual exclusive perks
- Opportunities for professional development and career advancement.
- Engage with cutting-edge gaming projects and contribute to the success of globally recognized titles.
Flexible work from home options available.