Our client is redefining how modern defense technology is delivered. They provide full-spectrum national security solutions that combine secure infrastructure, cleared talent, and mission-ready software to meet evolving defense challenges.
Requirements
- 3+ years of experience in site reliability, systems engineering, or hardware operations roles
- Deep expertise with physical infrastructure: server racking, cabling, diagnostics, and troubleshooting
- Strong Linux systems administration experience, including imaging and automated deployment
- Hands-on experience managing large-scale clusters or distributed systems in OpenShift or Kubernetes
- Familiarity with DevOps automation (Ansible, Terraform, CI/CD pipelines)
- Experience configuring and managing networking and mesh architectures
- Direct experience with NVIDIA GPUs, CUDA, and AI/ML frameworks
- Proficiency with out-of-band management tools (IPMI/iDRAC)
- Certifications: Linux+ and Security+ (required or in progress)
- Excellent communication, documentation, and problem-solving skills
- Clearance: Active TS/SCI required
Benefits
- Competitive compensation
- Robust benefits
- Professional development and certification opportunities
- Clear paths for growth