At H1, we believe access to the best healthcare information is a basic human right. Our mission is to provide a platform that can optimally inform every doctor interaction globally. This promotes health equity and builds needed trust in healthcare systems. To accomplish this, our teams harness the power of data and AI technology to unlock groundbreaking medical insights and convert those insights into actions that result in optimal patient outcomes and accelerate an equitable and inclusive drug development lifecycle.
Requirements
- 8+ years of experience in data engineering, software engineering, or related fields with significant experience building and scaling distributed data platforms
- Demonstrated technical leadership experience with interest in or experience mentoring and leading engineers
- Strong proficiency in Python (PySpark), Java, Scala, or similar programming languages
- Advanced SQL expertise, including performance tuning and optimization across large datasets
- Deep experience with Apache Spark and cloud-native big data platforms, preferably within AWS environments (EMR, Glue, S3, Athena, Redshift, or similar)
- Experience designing and scaling modern cloud-native data lake architectures and large-scale ingestion frameworks
- Experience with orchestration and workflow management tools such as Argo, Airflow, or similar technologies
- Strong understanding of distributed storage systems, partitioning strategies, and file formats such as Parquet, Avro, and ORC
- Experience with Docker, Kubernetes, and modern containerization technologies
- Experience implementing monitoring, observability, and data quality frameworks within production environments
- Experience with large-scale data cleaning, parsing, normalization, and validation workflows preferred
- Experience working with healthcare, life sciences, publication, or large-scale entity-resolution datasets preferred
- Exposure to ML/AI-driven data enrichment, parsing, or validation workflows is a plus
- Experience using AI-assisted coding tools (e.g., GitHub Copilot, Claude Code) to accelerate development while maintaining quality is encouraged
Benefits
- Full suite of health insurance options
- Generous paid time off
- Pre-planned company-wide wellness holidays
- Retirement options
- Health & charitable donation stipends
- Impactful Business Resource Groups
- Flexible work hours & the opportunity to work from anywhere