We are seeking a DevOps Engineer III to own infrastructure end-to-end, design systems from scratch, and enable highly resilient AI workloads at scale. This is a senior, hands-on role that requires deep expertise in cloud-native DevOps, infrastructure automation, observability, and security.
Requirements
- 5–8 years of core DevOps experience
- Deep expertise in Docker, Kubernetes, Helm, and container orchestration
- Hands-on with Terraform, Crossplane, and declarative infra management
- Strong experience in CI/CD pipelines (ArgoCD, Jenkins, GitOps workflows) and building custom automation
- Proven ability to deploy AI/LLMs & agent workflows reliably in production
- Mandatory expertise with vector databases – tuning, scaling, and optimizing retrieval performance
- Proficiency in monitoring & logging tools (Prometheus, Grafana, OpenTelemetry, ELK/OpenSearch)
- Familiarity with service mesh (Istio/Linkerd), networking, and multi-cluster workloads
- Proficiency in scripting/programming (Python, Bash, Go preferred)
- Knowledge of security best practices in cloud environments (IAM, secrets, secure networking)
Benefits
- Competitive salary
- Stock options
- Generous Paid Time Off
- 401k Matching
- Health Insurance
- Retirement Plan
- Flexible work hours
- Collaborative and dynamic work environment