We are looking for a Senior Backend Engineer to lead the unification of large, rich, heterogeneous datasets sourced from a wide range of external providers. This role centers on high-impact bulk ingestion and advanced data linkage.
Requirements
- Experience working with large, heterogeneous datasets from multiple providers or domains.
- Strong background in entity resolution, deduplication, data unification, or related large-scale data integration techniques.
- Proficiency in Python, with an emphasis on efficient, scalable data processing.
- Experience with BigQuery, Google Dataflow/Apache Beam, or similar batch-processing frameworks.
- Familiarity with data validation, normalization, reconciliation, and building consistent views across diverse data sources.
- Ability to craft well-structured matching and decision strategies that balance accuracy, completeness, and computational efficiency.
- Comfort iterating quickly on pragmatic solutions, balancing correctness with time-to-delivery.
- Clear communication skills and the ability to collaborate closely with ML and research teams.
Benefits
- Highly competitive salary and equity
- Quarterly productivity budget
- Flexible time off
- Fantastic office location in Manhattan
- Productivity package, including ChatGPT Plus, Claude Code, and Copilot
- Top-notch private health, dental, and vision insurance for you and your dependents
- 401(k) plan options with employer matching
- Concierge medical/primary care through One Medical and Rightway
- Mental health support from Spring Health
- Personalized life insurance, travel assistance, and many other perks