We are looking for a Generative AI Platform Engineer to design, build, and scale enterprise-grade AI platforms and APIs that enable the adoption of large language models and generative AI capabilities across the organization.
Requirements
- Design and build scalable Generative AI platforms, services, and APIs for internal and external consumers
- Develop and maintain high-performance backend services using Python and one or more of C++, C#, or Java
- Integrate and operationalize LLM and foundation model APIs, including Azure OpenAI, Google Vertex AI, and AWS Bedrock
- Build abstraction layers and orchestration logic to support multiple model providers and deployments
- Design RESTful and/or gRPC APIs with a strong focus on reliability, security, and performance
- Implement platform capabilities such as prompt management and versioning; model routing and fallback strategies; observability, logging, and monitoring; and cost and usage tracking
- Deploy and operate services on Google Cloud Platform (GCP), leveraging managed services where appropriate
- Support CI/CD, infrastructure-as-code, and production operations
Benefits
- Opportunity to work with a leading company in the field of AI
- Chance to design and build scalable AI platforms and APIs
- Collaboration with software engineers, data scientists, and product teams