At Toku, we create bespoke cloud communications and customer engagement solutions to reimagine customer experiences for enterprises. We provide an end-to-end approach to help businesses overcome the complexity of digital transformation and deliver mission-critical CX through cloud communication solutions.

Requirements

Train, fine-tune, evaluate, and improve NLP, speech-to-text, and LLM-based models used in production environments
Work hands-on with chatbots, summarisation, and language understanding features, including retrieval-augmented generation (RAG) and vector-based retrieval systems
Design and run model evaluations, benchmarking existing approaches and validating improvements before deployment
Read, assess, and experiment with relevant AI/ML research and emerging techniques, translating promising ideas into practical, production-ready solutions
Contribute to prompt design, model optimisation, and iterative experimentation to improve accuracy, latency, and reliability of deployed models
Integrate models into existing backend services using Python-based APIs, collaborating closely with backend engineers
Ensure models are production-ready, maintainable, and resilient when deployed in live customer-facing systems
Support investigation and resolution of AI-related production issues in collaboration with engineering and platform teams
Work closely with engineering teams to align AI capabilities with product requirements and platform constraints
Communicate progress, trade-offs, and technical decisions clearly in planning and delivery discussions

Benefits

Training and Development
Discretionary Yearly Bonus & Salary Review
Healthcare Coverage based on location
20 days Paid Annual Leave (15 days for Malaysia based roles), plus other leave allowances

Requirements

Train, fine-tune, evaluate, and improve NLP, speech-to-text, and LLM-based models used in production environments

Work hands-on with chatbots, summarisation, and language understanding features, including retrieval-augmented generation (RAG) and vector-based retrieval systems

Design and run model evaluations, benchmarking existing approaches and validating improvements before deployment

Read, assess, and experiment with relevant AI/ML research and emerging techniques, translating promising ideas into practical, production-ready solutions

Contribute to prompt design, model optimisation, and iterative experimentation to improve accuracy, latency, and reliability of deployed models

Integrate models into existing backend services using Python-based APIs, collaborating closely with backend engineers

Ensure models are production-ready, maintainable, and resilient when deployed in live customer-facing systems

Support investigation and resolution of AI-related production issues in collaboration with engineering and platform teams

Work closely with engineering teams to align AI capabilities with product requirements and platform constraints

Communicate progress, trade-offs, and technical decisions clearly in planning and delivery discussions

Applied AI Engineer - LLM & NLP

About the Company

Job Description

Requirements

Benefits

Similar Jobs

Applied AI Engineer - LLM & NLP

Founding AI Engineer - APAC Speech Recognition

Backend Engineer – Cloud and Microservices

Applied AI Engineer - LLM & NLP

About the Company

Job Description

Requirements

Benefits

Similar Jobs

Applied AI Engineer - LLM & NLP

Founding AI Engineer - APAC Speech Recognition

Backend Engineer – Cloud and Microservices

Job Details

About Toku