We are seeking an experienced Data Engineer to join our data team, responsible for designing, building, and maintaining scalable data pipelines, data integration processes, and data infrastructure on AWS cloud.
Requirements
- Design, develop, and deploy end-to-end data pipelines on AWS cloud infrastructure
- Implement data processing and transformation workflows using Databricks, Apache Spark, and SQL
- Build and maintain orchestration workflows using Apache Airflow
- Support data preparation and ingestion for AI/ML and Generative AI workloads
- Enable data pipelines that support LLM-based applications
- Lead the migration of legacy data systems to modern cloud-based data architectures
- Develop and maintain CI/CD pipelines for data workflows and platform automation
- Collaborate with data scientists, ML engineers, and AI teams to ensure data availability for model training, inference, and GenAI applications
- Optimize data pipelines for performance, reliability, scalability, and cost-effectiveness using AWS best practices
Benefits
- Significant career development opportunities
- Equal employment opportunities
- High degree of individual responsibility
- Unique opportunity to be part of a small, challenging, and entrepreneurial environment