Tiger Analytics is a fast-growing advanced analytics consulting firm looking for a Lead Data Engineer to design, build, and maintain scalable data pipelines on AWS cloud infrastructure.
Requirements
- 10+ years of experience building and deploying large-scale data processing pipelines in a production environment
- Hands-on experience in designing and building data pipelines on AWS cloud infrastructure
- Strong proficiency in AWS services such as Amazon S3, AWS Glue, AWS Lambda, Amazon Redshift, etc.
- Lead the design, development, and optimization of large-scale data pipelines and data lakehouse architectures using Databricks
- Architect and implement batch and real-time streaming solutions leveraging Apache Spark on Databricks
- Hands-on experience with Apache Airflow for orchestrating and scheduling data pipelines
- Solid understanding of data modeling, database design principles, and SQL and Spark SQL
- Experience with version control systems (e.g., Git) and CI/CD pipelines
- Excellent communication skills and the ability to collaborate effectively with cross-functional teams
- Strong problem-solving skills and attention to detail
Benefits
- Excellent opportunity for significant career development in a fast-growing and challenging entrepreneurial environment
- High degree of individual responsibility
- Equal employment opportunities to applicants and employees without regard to protected characteristics