Team CATHEXIS is seeking a dynamic Site Reliability Engineer with a Top Secret clearance to join their team. The successful candidate will manage, monitor, and optimize Kubernetes clusters, and work closely with development, services, and operations teams to ensure seamless integration between application development, deployment, and infrastructure.
Requirements
- Monitor and Manage Kubernetes Clusters: Ensure the stability, health, and scalability of Kubernetes clusters, and deploy applications and services on them
- Kubernetes Management: Deploy, monitor, and scale applications on Kubernetes clusters. Maintain Helm charts, manage services, and ensure resource allocation for optimal cluster performance
- Containerization & Deployment: Design and maintain Docker-based microservices architecture, ensuring consistent and reproducible deployments across staging, QA, and production environments
- Cloud Infrastructure Management: Work with leading cloud platforms (AWS, Azure, and/or GCP) to set up, configure, and manage infrastructure resources using Infrastructure as Code (Terraform, CloudFormation, etc.)
- Monitoring & Incident Response: Set up monitoring solutions, define alerts, and manage the incident response process for any issues related to Jenkins or Kubernetes clusters
- Automate Infrastructure Processes: Build automation tools for scaling, monitoring, and maintaining infrastructure using modern tools like Terraform, Ansible, Linux, or equivalent
- Collaborate Across Teams: Work closely with development, services, and operations teams to ensure seamless integration between application development, deployment, and infrastructure
- Security & Compliance: Ensure all systems follow security best practices and comply with relevant regulations. This includes role-based access, encryption, and automated vulnerability scanning
Benefits
- Performance Bonuses
- Medical Insurance
- Dental Insurance
- Vision Insurance
- 401(k) Plan (Traditional and Roth)
- Life Insurance (Basic, Voluntary & AD&D)
- Paid Time Off
- 11 Federal Holidays
- Parental Leave
- Commute Benefits
- Short Term & Long Term Disability
- Training & Development
- Wellness Program
- Community Outreach Initiatives