He joined Carnegie Mellon University in 1989 and worked with Raj Reddy and Kai-Fu Lee on speech recognition. At CMU, Huang directed Sphinx-II speech system research and achieved the best performance in every category of DARPA's 1992 benchmarking. He received the Allen Newell research excellence medal for his leadership in speech recognition in 1992, and IEEE Speech Processing Best Paper Award in 1993. He was elected to be a Fellow of IEEE in 2000, and a Fellow of ACM in 2017. Huang has co-authored over 100 papers and two books: Hidden Markov Models for Speech Recognition, and Spoken Language Processing, Prentice Hall]. In 2014 he coauthored a historical speech recognition review with Raj Reddy and James K. Baker for Communications of the ACM that reflected several generations of speech research. In 2016, he led his team reaching a historical human parity milestone in transcribing conversational speech on the Switchboard task. In 2018, he led his teams achieving more historical human parity milestones in Chinese to English Machine Translation on the WMT-2017 task. He was promoted to CTO to lead Azure AI Cognitive Services in 2020. He is best known for founding and leading Microsoft's speech and language initiatives as well as his pioneering work on Microsoft's multimodal interactive MiPad prototype as Bill Gates demonstrated at the Consumer Electronics Show in 2001. Huang was instrumental in introducing Microsoft's Speech Application Programming Interface in 1995 and numerous speech and language services since then. From 2000 to 2004, Huang served as the general manager of Microsoft's Speech Platforms Group and shipped Microsoft Speech Server and other voice technologies used in Microsoft Windows, Microsoft Office, Windows Mobile and Microsoft Exchange Server. Microsoft Response Point received 2009's Technology of the Year Awards as the best VOIP phone system from the InfoWorld Magazine. From 2009 to 2014, he served as the Chief Architect for Bing. He is currently a CVP leading Microsoft's world-wide Speech and Language responsible for Microsoft Cortana, Microsoft Translator, Office 365, Microsoft Windows, Microsoft Azure, and many 3rd parties' speech and translation services.
TV and books
Robert MacNeil, William Cran, Robert McCrum. Do You Speak American? page 191–197, Harcourt Trade
*
Xuedong Huang, Alex Acero, Hsiao-Wuen Hon. Spoken Language Processing: a guide to theory, algorithm, and system development, page 1-980. Prentice Hall
Xuedong D Huang, Yasuo Ariki, Mervyn A Jack. Hidden Markov Models for Speech Recognition, Edinburgh University Press