We are building AI-powered products that rely on high-quality, well-governed data. We are looking for an AI Data Engineer to design and operate the data pipelines, datasets, and evaluation workflows that make our AI systems reliable, reproducible, and measurable in production.
Requirements
- Build and maintain reliable data pipelines (batch and/or streaming) that power AI features, evaluation, and analytics use cases.
- Develop curated datasets and feature tables for training and evaluation; implement validation checks, lineage, and clear ownership.
- Support knowledge ingestion for RAG: document processing, chunking, metadata enrichment, indexing/backfills, and freshness monitoring.
- Implement and operate evaluation data workflows: golden sets, labeling support, drift checks, regression reporting, and dataset versioning.
- Collaborate with AI/ML and product engineers to translate requirements into scalable data models and pipelines.
- Improve pipeline performance and cost efficiency through incremental processing, partitioning, and resource tuning on GCP.
- Contribute to operational excellence: monitoring, alerting, troubleshooting, and documentation.
Benefits
- Hybrid Work Framework
- Saving Plan Canal +
- Paternity leave or Coparental leave extended
- Living Employee Culture (Events/Trainings/Parties/All hands,...)
- Career development support (training/internal mobility/compensation cycle/360 feedback review...)
- High-end Health Insurance and Personal Services Vouchers (CESU)
- Paid Time off – RTT and Saving time plan (CET)
- Meal Vouchers – Public Transport and Bike refund
- European Economic and Social Committee (sport membership/cinemas vouchers/gift vouchers/discount)
- Fitness Subscription thanks to our partnership with Gymlib