Role Overview

We are seeking an exceptional ML Performance Engineer to optimise large-scale workloads across our GPU and CPU infrastructure. You will design and implement techniques that improve performance and capabilities of research workloads on cutting-edge compute infrastructure.

What You Will Do

Profiling, benchmarking and tuning large-scale training and inference workloads for performance on distributed CPU, GPU and memory-intensive jobs. Developing reference implementations, libraries and tools to improve job efficiency and reliability.

Why It Might Be a Fit

Ideal candidate will have a proven track record of profiling, benchmarking and optimising distributed workloads. Strong knowledge of Python, C++, and CUDA. Strong understanding of one or more deep learning frameworks, such as PyTorch.

Requirements

Bachelors, Masters or PhD degree in computer science, or equivalent experience
Proven track record of profiling, benchmarking and optimising distributed workloads
Strong knowledge of Python, C++, and CUDA
Strong understanding of one or more deep learning frameworks, such as PyTorch
Strong background in data structures, algorithms, and parallel programming on heterogeneous systems
Deep understanding of Linux OS fundamentals
Experience with HPC schedulers and Kubernetes-based workload orchestration
Familiarity with profiling and monitoring tools

Benefits

Highly competitive compensation
Annual discretionary bonus
Lunch provided
Dedicated barista bar
35 days’ annual leave
9% company pension contributions
Informal dress code
Excellent work/life balance
Comprehensive healthcare
Life assurance
Cycle-to-work scheme
Monthly company events

Role Overview

What You Will Do

Why It Might Be a Fit

Requirements

Bachelors, Masters or PhD degree in computer science, or equivalent experience
Proven track record of profiling, benchmarking and optimising distributed workloads
Strong knowledge of Python, C++, and CUDA
Strong understanding of one or more deep learning frameworks, such as PyTorch
Strong background in data structures, algorithms, and parallel programming on heterogeneous systems
Deep understanding of Linux OS fundamentals
Experience with HPC schedulers and Kubernetes-based workload orchestration
Familiarity with profiling and monitoring tools

Benefits

Highly competitive compensation
Annual discretionary bonus
Lunch provided
Dedicated barista bar
35 days’ annual leave
9% company pension contributions
Informal dress code
Excellent work/life balance
Comprehensive healthcare
Life assurance
Cycle-to-work scheme
Monthly company events

Machine Learning Performance Engineer

About the role

Role Overview

What You Will Do

Why It Might Be a Fit

Requirements

Benefits

Similar jobs

Products

Use Cases

Insights

Resources

Browse Jobs

Company

Machine Learning Performance Engineer

About the role

Role Overview

What You Will Do

Why It Might Be a Fit

Requirements

Benefits

Similar jobs

About G-Research

G-Research