Join Crusoe's Site Reliability Engineering team to drive meaningful innovation and make a tangible impact on sustainable technology. As a Site Reliability Engineer, you'll play a pivotal role in ensuring the reliability and performance of Crusoe's infrastructure, collaborating with software engineers to build resilient code and reviewing changes before deployment.
Requirements
- 1-3 years of professional SRE experience
- Server Hardware and Provisioning: Exposure to server-class hardware and provisioning
- Distributed Systems Architecture: Understanding of distributed system architecture; exposure to common design patterns, reliability, and scaling
- Infrastructure Design: Basic understanding of infrastructure design, including familiarity with the operational trade-offs of network, storage, and RPC serving designs
- Programming Proficiency: Proficiency with at least one programming language (Python, Go, or similar)
- Infrastructure Tooling: Familiarity with infrastructure tools such as Docker, Kubernetes, Ansible, CloudFormation, and Terraform
- CI/CD Practices: Appreciation of CI/CD practices and familiarity with tools such as Jenkins, GitLab workflows, CircleCI, and GitHub Actions
- Observability Tooling: Exposure to observability tooling and philosophy, including logging, monitoring, and alerting tools
- Operating Systems: Experience with Unix/Linux environments
- Networking Fundamentals: Understanding of network fundamentals, including the basics of TCP/IP and network programming
- Information Security Awareness: Awareness of basic information security best practices
- Education: Bachelor's degree in Computer Science or a related field, or equivalent self-taught knowledge of computer science fundamentals
Benefits
- Competitive benefits package
- Pension contributions
- Private health and dental insurance
- Income protection
- Life assurance