At Toku, we create bespoke cloud communications and customer engagement solutions to reimagine customer experiences for enterprises. We provide an end-to-end approach to help businesses overcome the complexity of digital transformation and deliver mission-critical CX through cloud communication solutions.
Requirements
- Train, fine-tune, evaluate, and improve NLP, speech-to-text, and LLM-based models used in production environments
- Work hands-on with chatbots, summarisation, and language understanding features, including retrieval-augmented generation (RAG) and vector-based retrieval systems
- Design and run model evaluations, benchmarking existing approaches and validating improvements before deployment
- Read, assess, and experiment with relevant AI/ML research and emerging techniques, translating promising ideas into practical, production-ready solutions
- Contribute to prompt design, model optimisation, and iterative experimentation to improve accuracy, latency, and reliability of deployed models
- Integrate models into existing backend services using Python-based APIs, collaborating closely with backend engineers
- Ensure models are production-ready, maintainable, and resilient when deployed in live customer-facing systems
- Support investigation and resolution of AI-related production issues in collaboration with engineering and platform teams
- Work closely with engineering teams to align AI capabilities with product requirements and platform constraints
- Communicate progress, trade-offs, and technical decisions clearly in planning and delivery discussions
Benefits
- Training and Development
- Discretionary Yearly Bonus & Salary Review
- Healthcare Coverage based on location
- 20 days Paid Annual Leave (15 days for Malaysia based roles), plus other leave allowances