Baylor Genetics is seeking an experienced and visionary Principal Bioinformatics Data Scientist to join our Bioinformatics R&D and Data Science team. This individual will play a pivotal role in advancing our genomic analysis capabilities through the development of innovative computational methods, algorithms, and ML/AI-driven models to support our mission of delivering accurate, fast, and clinically actionable genomic insights.
Requirements
- Master’s degree or higher (PhD preferred) in Bioinformatics, Computer Science, Data Science, Computational Biology, or a related field
- 6+ years of professional experience in genomic data science, bioinformatics, computational genomics, or a similar field, including at least 3 years in a senior or lead role
- Proven track record of statistical, machine learning, and AI model development using genomic and clinical data
- Strong experience with secondary and tertiary genomic analysis (alignment, variant calling, annotation, and interpretation)
- Experience with data lakehouse platforms (Databricks, Snowflake) and precision health platforms (DNAnexus, Velsera)
- Experience with big data, ETL, data visualization, workflow orchestration and logging, and databases (SQL, NoSQL, and graph-based)
- Demonstrated experience working in a clinical or diagnostic genetics environment is highly desirable
- Proficient in Python, R, C/C++, Java, or similar programming languages
- Expertise in machine learning frameworks (e.g., TensorFlow, PyTorch, Scikit-learn, XGBoost)
- Advanced understanding of statistical modeling, including Bayesian inference, GLMs, mixed models, and resampling methods
- Experience applying deep learning architectures (transformers, CNNs, GNNs) to genomic and biomedical data
- Deep understanding of statistical analysis, data modeling, and computational methods used in genomics
- Experience with NGS data formats and genome databases
- Familiarity with cloud computing environments (Azure, AWS, GCP) and distributed computing frameworks (e.g., Spark, Dask)
- Deep knowledge of statistical modeling, dimensionality reduction, and data visualization
- Familiarity with CI/CD, containerization (Docker/Kubernetes), and version control (Git)
- Exceptional analytical, problem-solving, and critical thinking skills
- Ability to translate complex data-driven analyses into actionable biological and clinical insights
- Excellent written and verbal communication skills, with the ability to communicate effectively across disciplines
- Deep understanding of both computational methods and biological context
- Demonstrated leadership in cross-functional team environments
- Passion for innovation in precision medicine and clinical genomics
Benefits
- Medical, dental, and vision insurance
- 401(k) plan with company match
- Generous paid time off
- Relocation assistance
- Professional development opportunities