Tiger Analytics is a fast-growing advanced analytics consulting firm seeking an experienced Data Engineer with expertise in Dataiku to join its data team. The role involves designing, building, and maintaining data pipelines, data integration processes, and data infrastructure in close collaboration with data scientists and stakeholders.
Requirements
- Design and implement robust data pipelines that ingest, process, and store unstructured data at scale in Snowflake and GCP.
- Leverage Snowflake's unstructured data capabilities to make 'dark data' queryable and actionable.
- Build and maintain cloud-native ETL/ELT processes using BigQuery, Cloud Storage, and Dataflow, ensuring seamless integration between GCP and Snowflake.
- Integrate AI tools (OCR, NLP entity extraction, Document AI) into the engineering workflow to transform unstructured blobs into structured insights.
- Tune complex SQL queries and Python-based processing jobs to handle petabyte-scale environments efficiently.
Benefits
- Significant career development opportunities exist as the company grows.
- A unique opportunity to be part of a small, challenging, and entrepreneurial environment, with a high degree of individual responsibility.