We are seeking a Senior Site Reliability Engineer to help ensure the reliability, security, and performance of our customer-facing systems and infrastructure. This role will bridge development and operations, applying engineering principles to improve system resilience and scalability.
Requirements
- Security & Compliance: Safeguard systems against breaches and ensure compliance with PCI-DSS, HIPAA, SOC 2, and other standards.
- Cloud Infrastructure: Build and maintain scalable, secure services in Azure, AWS, or GCP using cloud-native tools.
- Automation & System Administration: Automate routine tasks, manage backups, and configure servers for high availability and disaster recovery.
- Monitoring & Performance: Implement observability tools, optimize system performance, and proactively address issues.
- Incident Response: Participate in on-call rotations, troubleshoot service incidents, and lead postmortem reviews.
- Collaboration: Work closely with developers, QA, and support teams in agile environments.
- Customer Provisioning: Handle complex customer account setups in coordination with Sales and Professional Services.
- Infrastructure as Code: Champion IaC practices using tools like Terraform, Ansible, Chef, or Puppet.
Benefits
- Health insurance covering a wide range of medical services
- Life and disability insurance
- Competitive salary
- Bonus or commission (depending on your position)
- Best-in-class Employee Stock Purchase Program (ESPP) with a 27-month lookback
- Paid day off for your birthday
- Company holidays
- Additional paid time off