Role Overview

We’re looking for a Staff Engineer to take technical ownership of latency, throughput, and reliability across Runware’s AI inference platform. This is a senior technical leadership role for someone who obsesses over performance at scale.

What You Will Do

Own end-to-end inference performance across the platform, lead the architecture and design of core inference systems, drive the platform toward sub-1-second inference, and make high-impact architectural decisions.

Why It Might Be a Fit

This role is ideal for someone who enjoys operating at the intersection of systems design, performance engineering, and real-world scale, and who wants clear ownership over outcomes that matter directly to customers.

Requirements

Extensive experience in software engineering, with a strong focus on backend and systems development
Proven experience building and operating high-performance, low-latency distributed systems in production
Deep understanding of asynchronous processing, queues, concurrency models, and back pressure
Strong intuition for performance trade-offs across CPU, GPU, networking, storage, and application layers
Experience making and defending critical architectural decisions in complex systems
Hands-on experience troubleshooting real production issues under load
Familiarity with modern cloud infrastructure, CI/CD, and observability stacks
Ability to communicate clearly and influence across teams in a remote-first environment
Strong mentorship mindset and a desire to raise the technical bar across the organisation

Benefits

Generous paid time off – vacation, sick days, public holidays
Meaningful stock options – share in the upside you create
Remote-first setup – work from home anywhere we can employ you
Flexible hours – own your schedule outside core collaboration blocks
Family leave – paid maternity, paternity, and caregiver time
Company retreats – twice-yearly gatherings in inspiring locations

Staff Software Engineer - Inference & Performance