Machine Learning Engineer to design, build and operationalize generative and agentic AI systems that are safe, auditable and production ready.
Requirements
- Design, implement and deploy LLM solutions and agent frameworks for production workflows, focusing on reliability, interpretability and safety.
- Build and run evaluation pipelines: automated metrics, unit and integration test suites, adversarial/red-team tests, and human-in-the-loop evaluation.
- Create and operationalize guardrails: prompt patterns, policy enforcement, input sanitization, output validation, data filtering and explainability tools.
- Prototype and productionize agentic behaviours: RAGs, multi-step planners, tool-use interfaces, state management and safe action execution.
- Implement MLOps best practices: model versioning, CI/CD for models, scalable serving, observability and cost controls.