About the Role
Artificial intelligence (AI) is dramatically transforming people’s work and life now. Speech technology is at the center of this transformation. Speech recognition and text to speech (TTS) are essential pieces to enable intelligent agents in the AI era. Microsoft’s mission is to build world-class speech technologies to break language barriers and empower every person and organization on the earth to achieve more. Our group brings together talents in the areas of signal processing, speech recognition, statistical modelling, and Deep Learning to develop and deliver robust, natural, and scalable speech recognition and text to speech across a rich set of scenarios and languages.
Our team is looking for an experienced data scientist with Speech processing and Applied Machine/Deep Learning in all aspects of spoken language processing. The candidate will work with a multidisciplinary team of engineers, data scientists and product managers to advance the state-of-the-art in the Speech world.
We are looking for a motivated, self-driven ML engineer/scientist to join our mission to change the world with Speech technologies.
Essential attributes and competencies include: Excellence in scientific thinking and execution, ability to drive efficient experiment definition and investigations, solid skill in developing state-of-the-art machine learning and ASR/TTS algorithms, broad scope in solving SR/TTS related engineering problems, and passion for new UI paradigms incorporating speech technologies.
Responsibilities
- Developing novel machine learning and data mining algorithms.
- Designing and executing offline/online experiments.
- Advancing the state of the art of Speech (ASR/TTS) technologies for real world scenarios.
- Investigating and solving speech accuracy and robustness issues across all processing chains, including model development, test and quality control, deployment, and user feedback stages.
- Contribute to the speech technology roadmap for Microsoft.
Required Qualifications
- Master’s/PhD degree in CS/EE or related fields with knowledge in speech, Machine Learning and Data Mining.
- Familiarity with programming languages such as C/C++, C#, Python, or Perl.
- Familiarity with PyTorch or Tensorflow.
- Experience in speech with deep understanding of problems such as speech recognition, speaker recognition, acoustic modeling, language modeling and text to Speech.
- 4-6 years of professional experience applying Deep learning, Machine Learning concepts in real world applications.
- Excited to work as part of diverse team and collaborate across geographies.
- Outstanding communication and collaboration skills.
Preferred Qualifications
- Strong research and/or development experience with machine learning, speech recognition and data mining.
- Experience in architecting, developing, and delivering advanced speech ML projects is a strong plus.
- PhD degree in CS, EE or equivalent.
- Proven track record on shipping products/services with high quality.