Sr. PySpark Developer Data Engineer with 7+ years of experience in data-driven roles, data pipelines, and ML pipelines.
Requirements
- Degree in computer science, engineering, mathematics, or equivalent experience
- 7+ years of previous commercial experience as a leader in a data-driven role
- 7+ years of hands-on experience building data pipelines in production
- 7+ years of experience in ML pipeline for streaming/batch workflow
- Ability to write clean, maintainable, and robust code in Python
- Understanding and expertise of software engineering concepts and best practices
- Experience with analytics, feature engineer, algorithms, anomaly detection, and data quality assessment
- Hands-on experience of technologies like Python, Spark/Pyspark, Hadoop/MapReduce/HIVE, Pandas
- Familiarity with query languages, database technologies, CI/CD, and testing and validation of data and software
Benefits
- Flexible workplace arrangements
- Mentoring
- Internal mobility
- Learning and development programs