Professor Thomas Hain
School of Computer Science
Professor of Speech and Audio Technology
Director of CDT in Speech and Language Technologies
Director of Liveperson Centre
Member of the Speech and Hearing (SpandH) research group


Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Profile
Thomas Hain obtained the degree 'Dipl.-Ing' in Electrical/Communication Engineering in 1994 from the University of Technology, Vienna. He then joined the Speech Technology Group at Philips Speech Processing, which he left in a senior position.
In 1997 he joined the Speech, Vision and Robotics (SVR) Group at the Cambridge University Engineering Department as a Research Associate and PhD student. He took up a Lectureship in the SVR Group in 2001.
In 2004 he joined the Speech and Hearing Group as a Lecturer in Computer Science. He was promoted to Senior Lecturer in 2008 and to Reader in 2011.
- Research interests
Thomas' research interests cover many areas in natural language processing, speech, audio and multimedia technology, machine learning, and complex system optimisation and design.
His interests include: large vocabulary continuous speech recognition, non-linear methods in speech processing, low bit-rate speech coding, machine learning, multi-modal systems, image classification, microphone arrays, system and resource optimisation.
- Grants
Current grants
- , EPSRC, 04/2019 - 09/2027, £5,508,850, as PI
- VoiceBase Centre, VoiceBase Inc./Liveperson, 04/2018 - 03/2026, £2,488,691, as PI
- WFST-based integration of ASR and MT in Spoken Language Translation, Industrial, 03/2014 - 12/2026, £63,588, as PI
Previous grants
- Automatic voice conversion for transforming professional adult voice actors to artificial child voice actors, Innovate UK, 01/2021 - 01/2023, £173,605, as PI
- MAUDIE: Multimedia Analysis for Unsupervised Dubbing In Entertainment, Innovate UK, 05/2018 - 07/2021, £393,115, as PI
- TUTO II: Reading skills tutoring system, ITSLANGUAGE BV, 08/2017 - 12/2019, £121,439, as PI
- Sound Source Separation Based on Deep Learning, Industrial, 05/2019 - 04/2020, £48,000, as PI
- Acoustic correlates of emotions for automatic recognition, Industrial, 10/2018 - 09/2019, £48,900, as PI
- Bridge Project, VoiceBase Inc., 09/2017 - 03/2018, £61,200, as PI
- STATUS IV: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 01/2017 - 10/2017, £60,000, as PI
- TUTO: Reading skills tutoring system, ITSLANGUAGE BV, 09/2016 - 08/2017, £61,983, as PI
- STATUS III: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 01/2015 - 07/2016, £78,684, as PI
- STATUS II: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 11/2013 - 05/2014, £98,982, as PI
- ItsLanguage, ITSLANGUAGE BV, 11/2012 - 03/2015, £68,333, as PI
- German System Adaptation, ITSLANGUAGE BV, 11/2012 - 03/2015, £42,373, as PI
- DocuMeet: , EC FP7, 11/2012 - 10/2014, £368,433, as PI
- STATUS: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 10/2012 - 08/2013, £73,726, as PI
- A Joint Model of Spoken Language Translation, Google, 09/2011 - 12/2016, £43,014, as PI
- , EPSRC, 05/2011 - 07/2016, £1,798,665, as PI
- Unsupervised Domain Adaptation, CISCO, 11/2010 - 04/2012, £121,745, as PI
- , EC FP6, 10/2006 - 12/2009, £467,074, as PI
- , EC FP6, 10/2006 - 12/2009, £345,350, as PI
- Professional activities and memberships
- Head of the research group
- Editorial Board member,
- Associate Editor,
- Organising committee member, ASRU 2013
- Area Chair, Interspeech 2014, Speech Recognition - Signal Processing, Acoustic Modelling, Robustness and Adaptation
- Area Chair, ICPR 2014, Track 3 Image, Speech, Signal and Video Processing
- Programme Committee,