About Abaka AI
Abaka AI is built on one mission: to be the world's most trusted data partner for AI companies. More than 1,000 industry leaders across Generative AI, Embodied AI, and Automotive AI rely on us to power their data pipelines. With our headquarters in Silicon Valley—and teams in Paris, Singapore, and Tokyo—we support global partners with fast, reliable, and scalable data solutions.
Our offerings include a diverse catalog of off-the-shelf datasets (image, video, multimodal, reasoning, 3D, and beyond) as well as comprehensive data collection and annotation services. Whether teams need raw data, curated datasets, or full-cycle data engineering, Abaka AI provides the foundation for building high-performance AI systems.
About the Role
As a Data Acquisition Engineer at Abaka AI, you will own and scale our raw data supply ecosystem by combining technical systems building with hands-on supplier sourcing and management. This is a 01 builder role focused on creating scalable, AI-native infrastructure for discovering, evaluating, onboarding, and managing data suppliers globally.
You will design and implement automation, internal tools, and AI-driven workflows to increase sourcing leverage—while also directly identifying, engaging, and managing external data partners. You will work closely with leadership to develop commercial instincts and supplier negotiation skills as you take full ownership of the data supply pipeline.
This is a high-impact role at the intersection of engineering, growth, and operations.
Responsibilities
Build automated pipelines and AI-driven workflows to discover and evaluate new raw data sources
Design and implement internal tooling for supplier tracking, scoring, and performance management
Experiment with scraping, APIs, enrichment tools, and automation platforms to increase sourcing efficiency
Aggressively identify and outreach to new data suppliers across global markets
Evaluate supplier quality, reliability, and scalability in partnership with internal teams
Manage ongoing vendor relationships, ensuring quality, cost, and delivery standards are met
Track supplier performance using quantitative metrics and continuously improve processes
Collaborate cross-functionally with Data Engineering, Research, Product, GTM, Legal, and Finance to align supply with business needs
Support commercial discussions and contract processes with guidance from leadership
Build scalable systems that increase data throughput without increasing headcount
Qualifications
Strong technical foundation (engineering, data, scripting, automation, or systems building)
Experience building projects, tools, or pipelines from 01
Comfortable using AI-native tools (e.g., LLM agents, Cursor, automation platforms, workflow builders)
High ownership mindset with the ability to operate independently in ambiguous environments
Strong written and verbal communication skills
Interest in AI, machine learning, and data infrastructure
Growth-oriented mindset with bias toward experimentation and rapid iteration
Experience in startup or high-growth environments preferred
Exposure to data pipelines, scraping, APIs, or automation workflows is a strong plus
Prior vendor management experience is not required
Compensation & Benefits
The base salary range for this position is $110,000 - $160,000 USD annually.
Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work at Abaka AI. This role is eligible for equity, as well as a comprehensive benefits package (health, dental, vision, PTO, flexible work schedule).