
Job description
As the Lead Site Reliability Engineer for our ComputeBridge Engagement, you'll be responsible for the reliability, scalability, and performance of one of the largest hardware and AI infrastructure efforts in the U.S. defense sector. You will lead the deployment, management, and automation of a high-performance computing mesh across multiple secure environments, ensuring operational excellence and mission continuity for a nine-figure government program.
Lead infrastructure design, deployment, and operations for ComputeBridge hardware clusters across secure and distributed environments. Install and configure physical systems, including high-density GPU servers, networking gear, and storage arrays.
This is a hands-on engineering leadership role that bridges physical infrastructure and modern DevOps automation, ideal for someone who thrives at the intersection of hardware systems, distributed computing, and AI/ML workflows.
Company

Manufacturing • Tech, Software & IT Services • Public Safety
Bridge Defense is a rapidly growing defense company specializing in the development of cutting-edge technologies for national security applications. We focus on delivering scalable and sustainable systems that integrate software, infrastructure, and deployable platforms to enable advanced operations and artificial intelligence in challenging environments. We partner with demanding national security customers to translate complex mission requirements into practical, mission-ready capabilities. Bridge Defense offers a dynamic environment for professionals seeking to contribute to critical national security initiatives and advance the state of the art in defense technology.