
Job description
We are building the next-generation AI infrastructure for both open-source and enterprise applications. Our work is deeply research-oriented and passionate about developing ground-breaking innovations to take state-of-the-art AI applications to the next level. Advance AI performance and efficiency by engineering systems for fine-tuning, evaluation, and deployment at scale.
Develop pipelines for post-training tasks such as fine-tuning, evaluation, and model compression. Implement scalable systems for model deployment, monitoring, and optimization. Collaborate with researchers to validate experimental results in production contexts. Build tools to automate benchmarking and regression testing. Identify opportunities to improve efficiency in resource utilization and inference speed.
You combine deep technical expertise with a pragmatic mindset. You thrive on bridging research and production, and you’re motivated by the challenge of making cutting-edge models usable and efficient at scale.
Company
Keep exploring
Sign in to see similar jobs
Create a free account to discover roles related to this posting.

Tech, Software & IT Services
Mindbeam AI specializes in delivering cutting-edge AI infrastructure solutions for businesses leveraging generative AI and large language models (LLMs). We provide a comprehensive suite of services including pre-training, fine-tuning, inference, and LLM optimization, all powered by advanced GPU optimization techniques. We empower organizations to efficiently deploy and scale AI applications with a focus on accelerated computing and cloud solutions (AWS, NVIDIA). Mindbeam AI offers both direct service delivery and expert consulting, helping clients maximize performance, reduce costs, and implement sustainable, energy-efficient AI strategies.