Join Cato Networks, a leading company in cloud networking and security, as an AI Platform Engineer and help build a cutting-edge enterprise network and secure cloud platform.
Requirements
- 3+ years of hands-on experience in AI inference, production ML infrastructure, model serving, or MLOps
- Experience with production inference technologies such as Triton, vLLM, CUDA, Kubernetes, Docker, PyTorch, ONNX, TensorRT, or similar
- Strong understanding of low-latency, high-throughput production systems
- Experience with model lifecycle concepts: model registry, versioning, deployment, rollout, rollback, monitoring, and observability
- 3+ years of experience with Go, or strong experience with a similar high-performance backend language such as C++, Rust, or Java