
Job description
We are building the next-generation AI infrastructure for open source and enterprise. Our work is deeply research-oriented, and we are passionate about developing ground-breaking innovations that take state-of-the-art AI applications to the next level. In this role, you will push the boundaries of performance by developing custom kernels and low-level optimizations for next-generation AI workloads.
Design and implement custom GPU/accelerator kernels to maximize performance. Profile, benchmark, and optimize critical ML workloads. Collaborate with researchers to translate algorithmic advances into efficient, production-ready code.
You are detail-oriented, performance-obsessed, and excited by the challenge of squeezing out every ounce of compute efficiency. You enjoy working at the intersection of algorithms and hardware, and you thrive in a collaborative environment where bold ideas are encouraged.
Company

Tech, Software & IT Services
Mindbeam AI specializes in delivering cutting-edge AI infrastructure solutions for businesses leveraging generative AI and large language models (LLMs). We provide a comprehensive suite of services including pre-training, fine-tuning, inference, and LLM optimization, all powered by advanced GPU optimization techniques. We empower organizations to efficiently deploy and scale AI applications with a focus on accelerated computing and cloud solutions (AWS, NVIDIA). Mindbeam AI offers both direct service delivery and expert consulting, helping clients maximize performance, reduce costs, and implement sustainable, energy-efficient AI strategies.