DATA ENGINEER — AI & MODEL TRAINING
Job Bank — AI Business Engine Development
Location: Remote (Pakistan)
Language: Must speak fluent English
Salary: Competitive, based on experience
---
About Job Bank
Job Bank is a UK‑focused platform helping people start recruitment and training businesses through two simple annual plans (£100 & £200). We are now building Job Bank AI — a specialised AI system designed to help users create and scale their own recruitment or training business.
Our AI ecosystem includes:
- Recruitment AI
- Training & Course AI
- Business Builder AI
- Marketing/Outreach AI
- AI Assistants (Recruiter / Trainer / Business Coach / Operations)
- Business Engine (AI Tool Suite)
We are building a vertical AI platform, with deep industry‑specific intelligence — NOT a generic chatbot.
This is your opportunity to become a foundational part of a fast‑growing AI product used across the UK.
---
Role Overview
We are looking for a Data Engineer who can prepare, structure, clean and optimise datasets for AI fine‑tuning and retrieval systems.
Your work will directly power:
- fine‑tuned recruitment models
- training/course intelligence
- business‑building AI
- outreach and marketing AI
- autonomous AI assistants
- AI-driven games and simulations
This role is crucial to the accuracy, reliability and performance of Job Bank AI.
---
Key Responsibilities
Data Preparation & Engineering
- Collect datasets related to recruitment, training, business planning, marketing, and operations
- Clean and standardise raw data (text, CSV, JSON, documents)
- Label and structure data into instruction formats for LLM training
- Build datasets for fine‑tuning small/medium models
- Develop evaluation datasets for model scoring
Data Architecture & Pipelines
- Build and maintain ETL pipelines for AI model ingestion
- Create scalable data workflows to support training cycles
- Manage datasets using tools like Pandas, PySpark or equivalent
- Optimise dataset quality, consistency, and token efficiency
Vector Databases & Retrieval
- Work with embeddings and vector databases (FAISS, Chroma, Pinecone, Weaviate)
- Build and maintain retrieval datasets for RAG systems
- Ensure high‑quality indexing for search, scoring and assistant use
AI Model Support
- Support AI/ML engineers during fine‑tuning cycles
- Prepare domain‑specific datasets for niche models
- Validate and test model outputs against evaluation sets
- Assist with dataset revisions and incremental model improvements
Collaboration
- Work with AI Engineers to support model training
- Work with Full‑Stack Developers to integrate datasets into the platform
- Work directly with the founder to align on roadmap and priorities
---
Technical Skills (Must Have)
Data Engineering
- Data cleaning, validation, normalisation
- ETL pipeline development
- JSON, CSV, XML processing
- Strong Python (Pandas, NumPy)
- Experience with large text datasets
- Dataset versioning and quality control
AI/Data Skills
- Experience preparing datasets for LLM fine‑tuning
- Understanding of instruction tuning formats
- Embeddings & vector stores (FAISS, Pinecone, Chroma)
- RAG data preparation
- Working knowledge of tokenisation and dataset optimisation
General
- Strong attention to detail
- Ability to work quickly and independently
- Fluent English communication skills
- Experience supporting AI development teams
- Ability to think like a product-focused engineer
---
Bonus Skills (Nice to Have)
- Experience with recruitment or training datasets
- Knowledge of course design or HR workflows
- Familiarity with LangChain, LlamaIndex, or similar
- Exposure to machine learning pipelines
- Experience working with international/remote teams
---
What We Offer
- Competitive salary (based on expertise)
- Long‑term role with growth potential
- Work on a high‑impact AI product
- A stable, ambitious product roadmap
- Flexible remote working
- Supportive, fast‑paced, visionary team
- Opportunity to build category‑defining AI
---
Why This Role Matters
You will be the engine behind our AI.
Your datasets will shape:
- how accurately the AI writes job ads
- how well it scores CVs
- how intelligent the business plans are
- how powerful the training and courses become
- how effectively the assistants behave
Job Type: Full-time
Pay: Rs80,000.00 - Rs100,000.00 per month
Work Location: Remote