At Fluidstack, we're building the infrastructure for abundant intelligence. We partner with top AI labs, governments, and enterprises to unlock compute at the speed of light. We're working with urgency to make AGI a reality.
Requirements
- Manage, operate, and optimize hyperscale GPU compute infrastructure
- Ensure high availability, performance, and reliability of GPU server fleet
- Perform hands-on troubleshooting and root cause analysis of complex hardware, firmware, OS, and application issues
- Develop and maintain automation scripts for provisioning, configuration management, monitoring, and remediation at scale
- Build and improve tooling for GPU health checks, performance diagnostics, driver validation, and automated recovery
- Execute server provisioning, configuration, firmware updates, and OS installation using automation frameworks
- Participate in 24x7 on-call rotation; respond to production incidents and coordinate resolution with cross-functional teams
Benefits
- Competitive total compensation package (salary + equity)
- Retirement or pension plan
- Health, dental, and vision insurance
- Generous PTO policy