As an Infrastructure Engineer, you will own and evolve the platform that everything at Menlo runs on, from inference serving to training rigs to the agentic coding infrastructure that powers day-to-day engineering. You will work deep in the stack, across OpenStack, Kubernetes, and bare metal, and set the technical direction for how Menlo Cloud scales.
Requirements
- Minimum 5+ years of hands-on infrastructure engineering experience in production environments
- Extensive experience with OpenStack in production: Nova, Neutron, Cinder, Trove, Horizon, and CLI administration
- Strong Kubernetes experience without managed control planes: Cluster API, kubeadm, self-managed clusters
- Deep Linux proficiency: RHEL, Ubuntu, or equivalent, including kernel-level debugging and performance tuning
- Experience with infrastructure-as-code and automation: Ansible, Terraform, or equivalent
- Familiarity with GPU infrastructure: inference serving, vLLM, model orchestration, and cluster management
- Solid understanding of GitOps workflows and tools like ArgoCD
- Experience with observability: Prometheus, Grafana, distributed tracing, log aggregation
- Strong networking fundamentals: VPCs, firewalls, load balancers, private cluster architecture
Benefits
- Generous Paid Time Off
- 401k Matching
- Retirement Plan
- Four Day Work Week