The Data Engineer will play a crucial role in developing and fine-tuning data specifically for our LLMs and machine learning models, responsible for the entire data lifecycle, including gathering, cleaning, structuring, and optimizing large, diverse healthcare datasets.
Requirements
- Bachelor's degree in Computer Science, Engineering, or a related field
- Proven experience as a Data Engineer, with a focus on big data technologies
- Strong proficiency in programming languages such as Python, Scala, or Java
- Extensive experience with data warehousing, ETL processes, and data modeling
- Experience with major cloud providers (e.g., AWS, GCP, Azure) and their data storage and processing services
- Hands-on experience with big data frameworks like Apache Spark for distributed processing
- Excellent problem-solving skills and the ability to work independently and as part of a team
- Strong communication and interpersonal skills
Benefits
- Competitive salary and benefits package
- Flexible working arrangements (remote or hybrid options available)
- The opportunity to work on life-changing AI technology that directly impacts patient outcomes
- Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity
- Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare