The Artificial Intelligence (AI) Data Engineer II designs, develops, and manages scalable data pipelines and feature stores that enable AI and Machine Learning (ML) model training and deployment across the enterprise.
Requirements
- Design and implement scalable data pipelines for AI/ML workloads.
- Develop and deploy AI/ML solutions using Python, Snowpark, or cloud-native ML services.
- Build and manage feature stores to support model training and inference.
- Integrate structured and unstructured data sources from internal and external systems.
- Collaborate with data scientists to understand data requirements and optimize pipelines.
- Implement data quality checks, metadata tagging, and lineage tracking.
- Ensure compliance with Health Insurance Portability and Accountability Act (HIPAA), Centers for Medicare and Medicaid Services (CMS), and enterprise data governance standards.
- Automate data ingestion and transformation using tools such as AWS Glue, Snowflake, and Informatica Intelligent Data Management Cloud (IDMC).
- Implement DevOps/MLOps practices and Continuous Integration/Continuous Delivery (CI/CD) pipelines using GitHub Actions or similar tools.
- Monitor pipeline performance and troubleshoot issues in production environments.
- Contribute to backlog grooming and sprint planning for AI data initiatives.
- Perform other duties as assigned.
Benefits
- Paid Time Off (PTO)
- Tuition Reimbursement
- Retirement Plans
- Medical, Dental and Vision
- Wellness Program
- Volunteer Time Off (VTO)