We are seeking a Lead Cloud Engineer, AI Platforms, to design, build, and operate the secure, scalable cloud infrastructure that powers next-generation AI and agentic systems across the enterprise.
Requirements
- Design and implement cloud infrastructure supporting LLM platforms, vector databases, and model inference pipelines
- Build and operate scalable environments supporting agentic AI systems, predictive models, and enterprise AI applications
- Implement and maintain MLOps pipelines supporting model training, deployment, monitoring, and lifecycle management
- Define environments as Infrastructure as Code, using tools such as Terraform, to enable scalable and repeatable deployments
- Optimize cloud performance, scalability, and reliability for AI and data workloads
- Implement monitoring, logging, and observability platforms to ensure operational visibility and system performance
- Collaborate with security and compliance teams to ensure data protection, platform security, and regulatory compliance
- Enforce best practices for cloud governance, cost optimization, and operational resilience
- Design and support infrastructure for edge computing solutions that bring AI capabilities to field operations
- Partner with engineering, data science, and platform teams to ensure seamless integration between AI infrastructure and enterprise systems