Full-Time · Hybrid · Based in US or Europe
ABOUT PROXY FOODS
Proxy Foods is an advanced AI-powered product development and recipe formulation platform trusted by leading R&D teams to create, reformulate, and optimize food & beverage products at scale and in record time.
We help food and beverage companies move from idea to viable formulation faster by unifying fragmented R&D data: ingredients, specs, cost, nutrition, processing constraints, and compliance. As a result, teams launch new products with fewer failed trials, reformulate faster based on consumer feedback and margin pressure, and stay ahead on trends and regulatory requirements.
THE ROLE
We are looking for a curious, motivated, and detail-oriented Junior Data Engineer to join our team.
This role is ideal for someone with strong foundational SQL and Python skills who wants to grow quickly by working on real-world data challenges in a startup environment. You will support the development of data pipelines, integrations, and data models that power both internal analytics and product features, while learning from a cross-functional team of engineers, product builders, and domain experts.
We are especially interested in someone who is eager to keep learning and stay current with modern data tooling and AI/LLM-enabled workflows, while building strong data engineering fundamentals.
In this role, you will:
- Support the development and maintenance of ETL/ELT pipelines across APIs, databases, files, and external data sources
- Clean, normalize, validate, and structure datasets related to ingredients, specifications, nutrition, costs, and compliance
- Write SQL and Python code to transform data and improve data quality
- Help maintain data models and storage layers used for analytics, reporting, and product features
- Work closely with product, engineering, and food science teams to understand data needs and business context
- Monitor pipeline performance, investigate issues, and help improve reliability and observability
- Learn and apply AI/LLM-assisted workflows where appropriate for data extraction, normalization, enrichment, and structuring tasks under clear guidance and validation practices
- Document data flows, transformations, and operational processes clearly and consistently
WHAT WE’RE LOOKING FOR
- 1–3 years of experience in data engineering, analytics engineering, software engineering, or a related data-focused role
- Good foundation in SQL and Python
- Familiarity with data transformation, ETL/ELT concepts, and structured datasets
- Exposure to relational databases, APIs, JSON/CSV files, and basic data modeling concepts
- Strong attention to detail and a mindset for data quality and correctness
- Eagerness to learn new tools, technologies, and ways of working in a fast-moving startup environment
- Curiosity about where the data industry is heading, including modern cloud platforms and AI/LLM-enabled workflows
- Strong communication skills and willingness to collaborate across technical and non-technical teams
Nice to have:
- Exposure to cloud data platforms such as Azure
- Familiarity with Git and basic software development workflows
- Experience with pandas, SQLAlchemy, or similar Python tools for data work
- Basic understanding of BI/reporting tools or dashboarding
- Interest in AI, LLMs, or data products beyond traditional reporting
WHY JOIN PROXY
- Work on real product and data problems at the intersection of AI and the food industry
- Learn fast in a startup environment with meaningful ownership from day one
- Build strong data engineering fundamentals while gaining exposure to modern AI-enabled workflows
- Collaborate with a small, ambitious, cross-functional team
- Grow your role as the company and platform scale