
Job description
We’re looking for a Staff Engineer to take technical ownership of latency, throughput, and reliability across Runware’s AI inference platform. This is a senior technical leadership role for someone who obsesses over performance at scale.
In this role, you will own end-to-end inference performance across the platform, lead the architecture and design of core inference systems, drive the platform toward sub-one-second inference, and make high-impact architectural decisions.
Ideal for someone who enjoys operating at the intersection of systems design, performance engineering, and real-world scale, and who wants clear ownership over outcomes that matter directly to customers.
Company

Tech, Software & IT Services
Runware is a rapidly growing AI-as-a-Service provider focused on delivering generative AI capabilities to developers and businesses. The company offers a scalable platform and API access for GenAI, enabling users to create and deploy AI-powered applications at significantly lower cost and higher speed than competing solutions. Having powered over 4 billion creations for 100,000+ developers and 250 million end-users, Runware is backed by leading investors including Insight Partners and a16z. The company presents a compelling opportunity for individuals seeking to contribute to a cutting-edge AI infrastructure company experiencing substantial growth.