Hainan Xu | Speech Recognition Researcher

Biography

I currently work on NVIDIA's NeMo team, supervised by Boris Ginsburg. Before joining NVIDIA, I worked on Google's speech team under Bhuvana Ramabhadran from September 2019 to October 2021. I received my Ph.D. in Computer Science from Johns Hopkins University, where I worked in the Center for Language and Speech Processing (CLSP) under Prof. Daniel Povey (formerly of JHU) and Prof. Sanjeev Khudanpur.

I received my B.S. in Software Engineering in 2012 from the School of Software Engineering at Shanghai Jiaotong University in Shanghai, China. From 2012 to 2013, I worked with Professor Kai Yu in the SJTU Speech Lab.

September 2013 - September 2019

Ph.D. in Computer Science, Johns Hopkins University

September 2008 - June 2012

B.S. in Software Engineering, Shanghai Jiaotong University

November 2021 - Present

NVIDIA

Research in the NeMo team, supervised by Boris Ginsburg.

September 2019 - October 2021

Google Inc., New York City, NY

Research in the speech team to improve Google's speech recognition models, supervised by Bhuvana Ramabhadran.

May 2017 - August 2017

Spoken Communications, Seattle, WA

Incorporated support for TensorFlow-based language model rescoring in Kaldi.

May 2015 - August 2015

Google Inc., New York City, NY

Worked with Cyril Allauzen and Michael Riley on improving contextual language modeling for Google speech recognition using finite state methods.

September 2013 - September 2019

Center for Language and Speech Processing (CLSP), Johns Hopkins University

Worked with Dan Povey and Sanjeev Khudanpur on speech recognition and contributed to the Kaldi project.

July 2012 - July 2013

SJTU Speech Lab, Shanghai Jiaotong University, Shanghai, China

Worked with Kai Yu on speech recognition, speech synthesis, and human-computer interaction.

See my Google Scholar profile for a complete list of publications.