Zyte is seeking an experienced Team Lead to manage our Core & MLOps Squad, responsible for designing and maintaining the scalable foundation that enables all Zyte teams to build and run their services with confidence.
Requirements
- Design and evolve the core platform (Kubernetes, Mesos, GPU scheduling/autoscaling, distributed compute).
- Own the model platform: registry, experiment tracking, training orchestration, evaluation, serving, and monitoring.
- Build the Golden Path: reference repos, a scaffold CLI, opinionated CI/CD pipelines, runtime contracts (health/metrics/tracing/SLOs), high-performance clients, circuit breakers and other productionâready defaults.
- Operate a secure, multiâtenant model registry and training platform with standardized experiment/evaluation harnesses.
- Provide turnkey serving patterns (online + batch), drift/quality monitoring, and rollback playbooks.
- Integrate public/openâsource AI capabilities as managed platform services with cost and dataâgovernance guardrails.
- Run the squad: roadmap/prioritization, delivery, mentoring, and high engineering standards.
- Partner with product engineering (Zyte API, Scrapy Cloud), Prod Ops, and Security on adoption and rollout plans.
- Mentor the team and foster a platform-thinking mindset.
- Ownership Areas: container orchestration (Kubernetes/Knative), GPU provisioning & autoscaling, environment & secret management, operators, sidecars, and internal SDKs/libraries, model platform, observability, billing pipeline, golden path, reliability enablement, cost governance, supplyâchain security.
Benefits
- We love fostering and nourishing new ideas and bringing them to market
- Become part of a self-motivated, progressive, multi-cultural team.
- Have the freedom and flexibility to work from where you do your best work, as we are a completely remote company.
- Get the chance to work with cutting-edge open-source technologies and tools.