Phota Labs is a startup building visual GenAI that helps people capture, express, and relive their memories. They are seeking an ML Engineer specializing in inference and optimization to bridge the gap between cutting-edge research models and production systems.
Requirements
- Deploy and integrate researcher-trained model checkpoints into cloud infrastructure and production pipelines
- Conduct thorough performance profiling and benchmarking to identify and eliminate computational bottlenecks
- Implement neural network optimization techniques including quantization, pruning, and architectural refinements while preserving model accuracy
- Develop efficient training and fine-tuning strategies with optimal precision trade-offs and parallelism
- Build and maintain scalable multi-GPU inference solutions with sophisticated model parallelism and serving architectures
- Collaborate with the research team to ensure optimization integrate smoothly with model development workflows
Benefits
- Generous health, dental, and vision coverage
- Unlimited PTO
- Paid parental leave
- Relocation support as needed