DeepInfra is looking for early-career Software Engineers to join our team. We'll design, build, and scale infrastructure for serving top open-source AI models in production.
Requirements
- Design, develop, and test inference solutions for state-of-the-art AI models
- Implement, optimize, and evaluate AI models using Python, C++, CUDA, and NCCL
- Own and operate production model-serving systems, including monitoring and debugging
- Build new features, improve system performance, and contribute to overall system design
- Participate in code reviews and technical discussions to maintain high engineering standards
- Explore and apply new AI/ML techniques to improve model performance and efficiency
- Take ideas from concept to production
Benefits
- Annual base salary range $150,000 - $195,000
- Opportunity to learn from engineers building high-performance inference at scale
- Fast-paced environment with ownership, autonomy, and end-to-end responsibility
- Small team, huge impact: your work ships directly to customers