The Data Engineer intern will help acquire and manipulate massive datasets, build and optimize data pipelines, and support software developers and machine learning engineers on product and research initiatives.
Requirements
- Proficiency in at least one object-oriented or functional programming language, such as Python, Java, C++, or Scala
- Working knowledge of SQL, including query authoring, and experience with relational databases (e.g. Postgres)
- Experience building and optimizing ‘big data’ data pipelines, architectures and data sets
- Solid understanding of information retrieval, statistics and machine learning. Experience with Computer Vision and NLP is a plus
- Prefer 1+ years of experience with big data and related technologies (e.g. DFS), plus experience with high-performance, scalable distributed systems
- Prefer experience with AWS cloud services: EC2, EMR, RDS, Redshift
- Detail-oriented, well organized, and self-motivated, with a continuous drive to learn, explore, and challenge; a good communicator and team player
- Experience supporting and working with cross-functional teams in a dynamic environment
- MS or BA/BS degree in computer science, statistics, or a related field
Benefits
- Competitive compensation
- Excellent Medical, Dental, and Vision coverage
- 401k
- Paid vacation and holidays