Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. One person, one GPU. We are seeking a Data Center Operations System Engineer to join our team in Kansas City, MO.
Requirements
- Ensure new server, storage and network infrastructure is properly racked, labeled, cabled, and configured.
- Troubleshoot hardware and software issues in some of the world’s most advanced GPU and Networking systems.
- Document and update data center layout and network topology in DCIM software
- Work with supply chain & manufacturing teams to ensure timely deployment of systems and project plans for large-scale deployments
- Manage a parts depot inventory and track equipment through the delivery-store-stage-deploy-handoff process in each of our data centers
- Partner with HW Support teams to ensure data center hardware incidents with higher level troubleshooting challenges are resolved, reported on and solutions are disseminated to the large operations organization.
- Work with RMA team to ensure faulty parts are returned and replacements are ordered
- Follow installation standards and documentation for placement, labeling, and cabling to drive consistency and discoverability across all data centers
Benefits
- Generous cash & equity compensation
- Health, dental, and vision coverage for you and your dependents
- Wellness and commuter stipends for select roles
- 401k Plan with 2% company match (USA employees)
- Flexible paid time off plan