The Data Engineer will play a crucial role in developing and fine-tuning data specifically for our LLMs and machine learning models, designing, building, and maintaining scalable data pipelines, and ensuring the integrity, accuracy, and consistency of all training datasets.
Requirements
- Bachelor's degree in Computer Science, Engineering, or a related field
- Proven experience as a Data Engineer, with a focus on big data technologies
- Strong proficiency in programming languages such as Python, Scala, or Java
- Extensive experience with data warehousing, ETL processes, and data modeling
- Experience with major cloud providers (e.g., AWS, GCP, Azure) and their data storage and processing services
- Hands-on experience with big data frameworks like Apache Spark for distributed processing
Benefits
- Competitive salary and benefits package
- Flexible working arrangements (remote or hybrid options available)
- The opportunity to work on life-changing AI technology that directly impacts patient outcomes
- Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity
- Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare