As a Lead Data Scientist, you will be the technical authority responsible for the architectural vision and hands-on execution of our most complex AI/ML/GenAI initiatives. You will design the entire ecosystem—from automated data ingestion to CI/CD-driven deployment—ensuring our solutions are scalable, resilient, and integrated into the S&P Global fabric.
Requirements
- Machine Learning: Supervised/Unsupervised machine learning, Deep Learning (PyTorch/TensorFlow), LLMs (Bedrock, OpenAI, Gemini), RAG, and Agentic AI.
- AWS Stack: SageMaker, Lambda, ECS Fargate, Batch, Step Functions, API Gateway, S3, DynamoDB, Bedrock, and OpenSearch.
- Engineering: Expert Python, FastAPI for microservices, Docker, and SQL/NoSQL optimization.
- Build & Ops: Git/GitHub, GitHub Actions (CI/CD), Terraform (IaC), and MLOps (MLflow/DVC).
- Data Tooling: Snowflake, Apache Iceberg, Trino, and Vector Databases (Elasticsearch/OpenSearch).
- 8+ years in Data Science/ML Engineering; 3+ years in a Lead capacity.
Benefits
- Health & Wellness
- Flexible Downtime
- Continuous Learning
- Invest in Your Future
- Family Friendly Perks
- Beyond the Basics