Job Description
The candidate needs to research and develop cutting-edge multimodal (speech-text) foundation model, define, develop novel deep learning techniques to achieve and advance SOTA accuracy with real-time performance. The candidate will have to collaborate with other research scientists and engineers to apply the foundation model for identified use cases. Strong problem solving, coding, effective communication, and collaborative teamwork is required.
Job Requirement
- PhD in ML, NLP, speech and spoken language processing and multi-modal AI
- At least 5 years of experience
- Proficient programming skills, familiarity with Linux is a must
- Familiar with different deep learning and LLM training frameworks
- Strong analytical and critical thinking skills, good team player with good communication and interpersonal skills
- Ability to replicate and reproduce state-of-the-art models and results, and then innovate on top of these benchmarks
The above eligibility criteria are not exhaustive. A*STAR may include additional selection criteria based on its prevailing recruitment policies. These policies may be amended from time to time without notice. We regret that only shortlisted candidates will be notified.