Hainan Xu

Hainan Xu (许海南)

Senior Applied Scientist at NVIDIA

Email: see my resume

Biography

I currently work in NVIDIA's NeMo team, supervised by Boris Ginsburg. Before joining NVIDIA, I worked in Google's speech team under Bhuvana Ramabhadran from September 2019 to October 2021. I received my Ph.D. in Computer Science from Johns Hopkins University, where I worked in the Center for Language and Speech Processing (CLSP) under former JHU Prof. Daniel Povey and Prof. Sanjeev Khudanpur.

I received my B.S. in Software Engineering in 2012 from the School of Software Engineering at Shanghai Jiao Tong University (SJTU) in Shanghai, China. From 2012 to 2013, I worked with Professor Kai Yu in the SJTU Speech Lab.

Education

Ph.D. in Computer Science

September 2013 - September 2019

View Dissertation

B.S. in Software Engineering

September 2008 - June 2012

Work Experience

Staff Research Scientist

November 2021 - Present
NVIDIA

Research in the NeMo team, supervised by Boris Ginsburg.

Software Engineer

September 2019 - October 2021
Google Inc., New York City, NY

Research in the speech team to improve Google's speech recognition models, supervised by Bhuvana Ramabhadran.

Research Intern

May 2017 - August 2017
Spoken Communications, Seattle, WA

Incorporated support for TensorFlow-based language model rescoring in Kaldi.

Research Intern

May 2015 - August 2015
Google Inc., New York City, NY

Worked with Cyril Allauzen and Michael Riley on improving contextual language modeling for Google speech recognition using finite-state methods.

Research Assistant

September 2013 - September 2019

Worked with Dan Povey and Sanjeev Khudanpur on speech recognition and contributed to the Kaldi project.

Research Assistant

July 2012 - July 2013

Worked with Kai Yu on speech recognition, speech synthesis, and human-computer interaction.

Publications

See my Google Scholar profile for a complete list of publications.