Ready to shape the future of work? Genpact is a technology services and solutions company that delivers lasting value for leading enterprises globally. In this role, the Data Engineer will use cloud technologies to manage and analyze large datasets, drawing on expertise in Databricks, Azure Data Factory (ADF), Python, PySpark, and Unity Catalog.
Requirements
- Architect, build, and optimize data ingestion and transformation pipelines using Azure Data Factory (ADF) and Databricks.
- Implement data integration and transformation solutions using Azure Databricks.
- Develop and deploy data models and solutions using Azure services.
- Pull, ingest, transform, stitch, and wrangle data from various sources for advanced analytics.
- Design, implement, and deploy data loaders to load data into the engineering sandbox.
- Monitor and optimize data pipelines for performance and reliability.
- Collaborate with cross-functional teams to gather requirements and understand data needs.
- Provide input to machine learning (ML) engineers and cloud engineers for designing and implementing data management or architecture solutions.
- Assist ML engineers in pulling, filtering, tagging, joining, parsing, and normalizing datasets.
- Implement data quality checks, validation rules, and governance policies to ensure accuracy, reliability, and security of data assets.
- Troubleshoot and resolve data-related issues promptly.
- Implement data security and privacy measures to protect sensitive information.
- Manage data governance and security in Unity Catalog to ensure compliance.
- Develop scalable data ingestion or ETL pipelines from Workday and other HR systems.
Benefits
- Competitive salary
- Benefits package
- Opportunities for career growth and advancement
- Collaborative and dynamic work environment
- Comprehensive training and development programs
- Diverse and inclusive workplace