We are seeking a Principal Kubernetes Architect to serve as a senior technical authority for the design, evolution, and operation of InterSystems Kubernetes-based cloud platforms.
Requirements
- Act as a senior technical authority for Kubernetes architecture across managed services and SaaS platforms
- Define and maintain reference architectures and design patterns for Kubernetes platforms across on-premises, private cloud, and public cloud environments
- Lead architectural reviews and provide guidance on cluster design, networking, security, scaling, and multi-region strategies
- Design, build, and evolve Kubernetes clusters using platforms such as EKS, AKS, GKE, Rancher, or equivalent enterprise distributions
- Establish best practices for cluster lifecycle management, upgrades, multi-tenancy, and workload isolation
- Architect and implement advanced Kubernetes networking models including CNI plugins, ingress controllers, and network policies
- Design secure RBAC models, secrets management approaches, and workload isolation aligned with enterprise security requirements
- Ensure platform changes are engineered, tested, versioned, and rolled out with the same rigor as application software
- Define and enforce Infrastructure as Code standards using Terraform, Helm, and GitOps-based workflows
- Design reusable and composable infrastructure modules to enable consistent and repeatable platform deployments
- Drive automation-first approaches to cluster provisioning, configuration, and lifecycle management
- Ensure platform changes are versioned, tested, and delivered through controlled CI/CD pipelines
- Architect observability solutions for Kubernetes platforms using Prometheus, Grafana, Loki, Fluentd or Fluent Bit, and related tooling
- Define strategies for monitoring, alerting, capacity planning, and performance optimization
- Lead troubleshooting of complex platform incidents involving cluster degradation, networking issues, or systemic failures
- Partner with Site Reliability Engineering teams to establish service level objectives, error budgets, and reliability engineering practices
- Design Kubernetes solutions that operate consistently across public cloud and on-premises environments
- Architect backup, restore, and disaster recovery strategies using tools such as Velero, Kasten, or Stash
- Address cloud-specific constraints related to identity, networking, storage, and cost optimization
- Mentor senior engineers and architects across platform, DevOps, and Site Reliability Engineering teams
- Act as a trusted advisor to application teams on cloud-native and containerization strategies
- Contribute to technical standards, documentation, and internal enablement
- Represent platform architecture in cross-functional design reviews and technical forums