Innodata is seeking a Sr Language Data Scientist to lead projects and own processes for optimizing search and retrieval systems by creating, validating and annotating search-specific data for LLM/ML applications. The role involves working with search-specific datasets, collaborating with cross-functional partners, and leveraging expertise in query understanding, semantic matching, and ranking systems to drive innovation in search relevance and user experience.
Requirements
- MA in (computational) linguistics, data science, computer science (AI / ML / NLU), quantitative social sciences or a related scientific / quantitative field, PhD strongly preferred
- Ability to collaborate directly with technical stakeholders including senior project managers, data engineers, and research scientists.
- Collaborating with cross-functional teams to define AI project requirements and objectives, ensuring alignment with overall business goals
- Design efficient data strategies for complex long-term projects, potentially involving multiple teams and workflows.
- Knowledge of how components of GenAI products or services combine to work
- Developing clear and concise documentation, including technical specifications, user guides, and presentations, to communicate complex AI concepts to both technical and nontechnical stakeholders
- Familiarity with GenAI technologies that enables you to improve existing processes to handle future challenges.
Benefits
- Generous Paid Time Off
- 401k Matching
- Retirement Plan
- Visa Sponsorship