Join Tether and shape the future of digital finance by driving innovation in model serving and inference architectures for advanced AI systems.
Requirements
- Degree in Computer Science or related field
- PhD in NLP, Machine Learning, or related field (ideal)
- Knowledge of Metal Shading Language (MSL)
- Proven experience in low-level kernel optimizations and inference optimization on mobile devices
- Strong expertise in writing GPU kernels for mobile devices
- Deep understanding of modern model serving architectures and inference optimization techniques
- Distributed Inference Systems
- Deep understanding of the math and structure behind Diffusion Models and Vision Transformers
- Understanding of Pruning, Quantization, Flash attention, KV Cache, Speculative Decoding (Eagle) etc.