Dr Stefan Goetze
School of Computer Science
Senior Lecturer
Member of the Speech and Hearing (SpandH) research group
Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
ºù«Ӱҵ
S1 4DP
- Profile
-
Stefan Goetze is Senior Lecturer in the Department of Computer Science. He obtained the degree 'Dipl.-Ing' in 2004 and 'Dr.-Ing.' in 2013 in Electrical/Communication Engineering from the University of Bremen, Germany.
From 2008 to 2020 he was with the Fraunhofer-Institute for Digital Media Technology IDMT in Oldenburg, Germany where he was first Head of "Audio System Technology for Audiology and Assistive Systems" (2010-2017) and later Head of "Automatic Speech Recognition" as well as Dept. Head of the Department "Hearing, Speech and Audio Technology" (2017-2020).
- Research interests
-
His research interests include machine learning, signal analysis, enhancement and classification as well for large scale applications as for resource-limited IoT (Internet of Things) and assistive devices.
- Publications
-
Journal articles
- . Frontiers in Signal Processing, 2.
- . IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(7), 1151-1163.
- . IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27, 1151-1163.
- . IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(2), 255-267.
- . Pflegezeitschrift, 72(1-2), 17-19.
- . IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26(10), 1809-1820.
- . Computer Speech & Language, 46, 558-573.
- . IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(6), 1304-1314.
- . Journal of the Audio Engineering Society, 65(1/2), 117-129.
- Special Issue on Dereverberation and Reverberation of Audio, Music, and Speech. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 65(1-2), 6-7.
- . The Journal of the Acoustical Society of America, 139(4), 2224-2225.
- . EURASIP Journal on Advances in Signal Processing, 2015(1).
- . IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(12), 2198-2208.
- . EURASIP Journal on Advances in Signal Processing, 2015(1).
- . IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(10), 1680-1691.
- . EURASIP Journal on Audio, Speech, and Music Processing, 2014(1).
- . Informatics for Health and Social Care, 39(3-4), 166-187.
- . IEEE Transactions on Audio, Speech, and Language Processing, 21(9), 1879-1890.
- Notrufsysteme mit automatischer akustischer Gefahrendetektion. Science^2 - Safety and Security, 1, 12-18.
- . Journal of Computing Science and Engineering, 6(1), 40-50.
- Acoustic User Interfaces for Ambient Assisted Living Technologies. Informatics for Health and Social Care, SI Ageing & Technology, 35, 161-179.
- The Lower Saxony Research Network Design of Environments for Ageing (GAL) - Towards Interdisciplinary Research on ICT in Ageing Societies. Informatics for Health and Social Care, SI Ageing & Technology, 35, 92-103.
- . Informatics for Health and Social Care, 35(3-4), 125-143.
- . Informatics for Health and Social Care, 35(3-4), 92-103.
- . The Journal of the Acoustical Society of America, 120(5), 3258-3258.
- . Journal of Medical Internet Research.
- . Journal of the Audio Engineering Society, 62(6), 386-399.
Chapters
- , Ambient Assisted Living (pp. 163-172). Springer International Publishing
- Innovative Hörunterstützung in Kommunikationssystemen In Schick A, Meis M & Nocke C (Ed.), Beiträge zur psychologischen Akustik, Akustik in Büro und Objekt (pp. in press-in press). Oldenburg: Isensee Verlag.
- Acoustic Applications and Technologies for Ambient Assisted Living Scenarios, Ambient Assisted Living (AAL) Forum (pp. 337-342). Lecce, Italy.
- , Ambient Assisted Living (pp. 63-74). Springer Berlin Heidelberg
- Considering Hearing Deficiencies in Human-Computer Interaction In Ziefle M & Röcker C (Ed.), Human-Centered Design of E-Health Technologies: Concepts, Methods and Applications (pp. 180-207). IGI Global
- Detection and Classification of Acoustic Events for In-Home Care (Best-Paper Award) In Wichert R & Eberhardt B (Ed.), Ambient Assisted Living - Advanced Technologies and Societal Change, Springer Lecture Notes in Computer Science (LNCS) (pp. 181-196). Springer Science
- , Ambient Assisted Living (pp. 181-195). Springer Berlin Heidelberg
- Intelligente Konferenzsysteme für natürliche Freisprechkommunikation In Schick A, Meis M & Nocke C (Ed.), Beiträge zur psychologischen Akustik, Akustik in Büro und Objekt (pp. 249-266). Oldenburg: Isensee Verlag.
- , Lecture Notes in Computer Science (pp. 568-575). Springer Berlin Heidelberg
Conference proceedings papers
- . ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 14 May 2024 - 19 May 2024.
- Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users Using Intermediate ASR Features and Human Memory Models.. ICASSP (pp 306-310)
- Hallucination in Perceptual Metric-Driven Speech Enhancement Networks. European Signal Processing Conference (pp 21-25)
- Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement. European Signal Processing Conference (pp 421-425)
- . 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 22 October 2023 - 25 October 2023.
- Pre-Trained Intermediate ASR Features and Human Memory Simulation for Non-Intrusive Speech Intelligibility Prediction in the Clarity Prediction Challenge 2. he 4th Clarity Workshop on Machine Learning Challenges for Hearing Aids (Clarity-2023). https://claritychallenge.org/clarity2023-workshop/results.html, 19 August 2023 - 19 August 2023.
- . Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (pp 1-5). New York, NY, USA, 22 October 2023 - 25 October 2023.
- Bridging the Communication Rate Gap: Enhancing Text Input for Augmentative and Alternative Communication (AAC). HCII 2023 Conference Proceedings, Vol. 10, 23 July 2023 - 23 July 2023.
- . ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 June 2023 - 10 June 2023.
- . ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 June 2023 - 10 June 2023.
- Non-intrusive Speech Intelligibility Estimated By Metric Prediction for Hearing Impaired Individuals for the Clarity Prediction Challenge 1. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 18 September 2022 - 22 September 2022.
- MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data. Proc. 30th European Signal Processing Conference, EUSIPCO 2022. Belgrade, Serbia, 29 August 2022 - 2 September 2022.
- . Proceedings of Forum Acusticum (pp 2441-2445). Lyon, France, 7 December 2020 - 11 December 2020.
- Single-ended Prediction of Listening Effort for English Speech. DAGA 2020 - 46. Jahrestagung für Akustik (pp 775-777). Hannover, Germany
- 2D audio-visual localization in home environments using a particle filter. Sprachkommunikation - 10. ITG-Fachtagung (pp 75-78)
- Context and user requirement analyses of a new digital speech therapy system (THERESIAH). Conf. on Implantable Auditory Prosthesis (CIAP). Lake Tahoe, CA, USA
- Hearing support to reduce listening effort at work: an EEG study. DAGA 2019 – Proc. 45th Annual Meeting of the Deutsche Gesellschaft für Akustik e.V.. Rostock, Germany
- Erfassung der Höranstrengung fertiger TV-Mischungen. DAGA 2019 – Proc. 45th Annual Meeting of the Deutsche Gesellschaft für Akustik e.V.. Rostock, Germany
- Automatische Überwachung der Sprachverständlichkeit im Rundfunkmaterial. 30th Tonmeistertagung – VDT International Convention. Düsseldorf, Germany
- . Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 381-385). New Orleans, LA, USA, 5 March 2017 - 9 March 2017.
- . Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 5250-5254). New Orleans, LA, USA, 5 March 2017 - 9 March 2017.
- . Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 4870-4874). New Orleans, LA, USA, 5 March 2017 - 9 March 2017.
- . 2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 1 March 2017 - 3 March 2017.
- . 2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), 13 September 2016 - 16 September 2016.
- Acoustic Scene Classification using Time-Delay Neural Networks and Amplitude Modulation Filter Bank Features. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016) (pp 70-74). Budapest, Hungary
- Performance comparison of GMM, HMM and DNN based approaches for acoustic event detection within Task 3 of the DCASE 2016 challenge. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016) (pp 80-84). Budapest, Hungary
- Messung der Höranstrengung älterer Mitarbeiter eines Callcenters mittels neuroergonomischer Messmethoden / Neuroergonomic assessment of listening effort in older call center employees. Proc. Zukunft Lebensräume Kongress 2016 (pp 327-332). Frankfurt, Germany
- . 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016.
- . 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016.
- Concept for automated usability evaluation of graphical user interfaces. Proc. Kognitive Systeme: Mensch, Teams, Systeme und Automaten. Bochum, Germany
- Spectrally and spatially informed noise suppression using beamforming and convolutive NMF. Proc. AES 60th Conference on Dereverberation and Reverberation of Audio, Music, and Speech. Leuven, Belgium
- Predicting the quality of processed speech by combining modulation-based features and model trees. Speech Communication - 12. ITG-Fachtagung Sprachkommunikation (pp 180-184)
- . 2015 23rd European Signal Processing Conference (EUSIPCO), 31 August 2015 - 4 September 2015.
- . 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015.
- Joint estimation of reverberation time and direct-to-reverberation ratio from speech using auditory inspired features. Proc. ACE Challenge Workshop, a satellite event of WASPAA. New Paltz, NY, USA
- . 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015.
- Concept of a Nutrition Consultant Application with Context Based Speech Recognition. 4. Interdisziplinärer Workshop Kognitive Systeme 2015, Mensch, Teams, Systeme und Automaten. Bielefeld, Germany
- CooCo, what can i cook today? Surprise me. CEUR Workshop Proceedings, Vol. 1520 (pp 233-240)
- . 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC), 8 September 2014 - 11 September 2014.
- . 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC), 8 September 2014 - 11 September 2014.
- . 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 May 2014 - 9 May 2014.
- Joint Dereverberation and Noise Reduction Using Beamforming and a Single-Channel Speech Enhancement Scheme. Proc. REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge. Florence, Italy
- Estimating room acoustic parameters for speech recognizer adaptation and combination in reverberant environments. Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (pp 5559-5563). Florence, Italy
- Robust ASR in reverberant environments using temporal cepstrum smoothing for speech enhancement and an amplitude modulation filterbank for feature extraction. Proc. REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge. Florence, Italy
- Improving acoustic event detection by localization algorithms. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 523-524). Oldenburg, Germany
- Nutzbarkeit von modellierten Phonemfolgen zur Erkennung von unbekannten Wörtern in phonembasierten Spracherkennern. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 538-539). Oldenburg, Germany
- Influence of a spherical microphone array on a sound source number estimator based upon independent component analysis. Proc. 40th German Annual Conference on Acoustics (DAGA 14). Oldenburg, Germany
- A 2-Stage Approach for Joint Noise Reduction and Dereverberation by means of Multi-Channel Equalization and a Noise Processor. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 186-187). Oldenburg, Germany
- Room Transfer Function Estimation using Cepstral Smoothing. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 493-494). Oldenburg, Germany
- PTP Synchronized Isosynchronous Multi-Channel Audio-Streaming over Gigabit-Ethernet based on FPGAs. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 182-183). Oldenburg, Germany
- Networked embedded acoustic processing system for smart building applications. Conference on Design and Architectures for Signal and Image Processing, DASIP (pp 349-350)
- Acoustic Event Detection Using Signal Enhancement and Spectro-temporal Feature Extraction. IEEE AASP Challenge: Detection and Classification of Acoustic Scenes and Events. New Paltz, NY, USA
- . 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 20 October 2013 - 23 October 2013.
- MOBECS - User Requirements for a Mobile Emergency Call System. AAL Forum 2013. Norrköping, Sweden
- . 2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing, 20 August 2013 - 23 August 2013.
- Noise Robust Distant Automatic Speech Recognition Utilizing NMF based Source Separation and Auditory Feature Extraction. Proc. 2nd International Workshop on Machine Listening in Multisource Environments (CHiME 2013) (pp 1-6). Vancouver, Canada
- . 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26 May 2013 - 31 May 2013.
- . 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26 May 2013 - 31 May 2013.
- . 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26 May 2013 - 31 May 2013.
- Anwendungen akustischer Ereigniserkennung im Automobil. Proc. AmE 2013 - Automotive meets Electronics. Dortmund, Germany
- MOBECS - Mobility by Safety: Konzept und Nutzeranforderungen. AAL Kongress 2013 (pp 504-507). Berlin, Germany
- . 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel, 14 November 2012 - 17 November 2012.
- The Ambient Adaptable Living Assistant is Meeting its Users. In Proc. AAL Forum 2012 (pp 629-636). Eindhoven, The Netherlands
- Computational Efficient Noise Reduction for Dialogue Systems in Car Environments based on Binary Time-Frequency Masking and Autoregressive Interpolation. Workshop on Dialog systems that think along - Do they really understand me. Saarbrücken, Germany
- Reduction of Non-stationary Noise for a Robotic Living Assistant using Sparse Non-negative Matrix Factorization. Proc. Speech and Multimodal Interaction in Assistive Environments (SMIAE 2012). Jeju Island, Republic of Korea
- Multimodal Human-Machine Interaction for Service Robots in Home-Care Environments. Proc. Speech and Multimodal Interaction in Assistive Environments (SMIAE 2012). Jeju Island, Republic of Korea
- Objective Methods to Asses Speech Signals Processed by Short-Term Spectral Attenuation. Proc. 38th Annual Convention for Acoustics (DAGA). Darmstadt, Germany
- . 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 25 March 2012 - 30 March 2012.
- 2D audio-visual localization in home environments using a particle filter. Proceedings of 10th ITG Symposium on Speech Communication
- Increasing the robustness of acoustic multichannel equalization by means of regularization. International Workshop on Acoustic Signal Enhancement, IWAENC 2012
- A new approach for reduction of supergaussian noise using autoregressive interpolation and time-frequency masking. International Workshop on Acoustic Signal Enhancement, IWAENC 2012
- System identification of equalized room impulse responses by an acoustic echo canceller using proportionate LMS algorithms. 130th Audio Engineering Society Convention 2011, Vol. 2 (pp 1150-1162)
- Room impulse response reshaping by joint optimization of multiple p-norm based criteria. European Signal Processing Conference (pp 1658-1662)
- Speech quality assessment for listening-room compensation. Proceedings of the AES International Conference (pp 11-20)
- Evaluation of joint position-pitch estimation algorithm for localising multiple speakers in adverse acoustical environments. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
- Room Impulse Response Reshaping by p-Norm Optimization based on Estimates of Room Impulse Responses. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
- Speech / Non-Speech Discrimination for Acoustic Monitoring Considering Privacy Issues. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
- Real-time Room Reverberation Estimation for Online Speech Intelligibility Monitoring. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
- Speech Activity Detection for Activity Monitoring using an Embedded Platform. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
- Hearing-Loss Compensation in a Telephone System. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
- Ambiente Sprachsteuerung für einen Pers"’onlichen Aktivitäts- und Haushaltsassistenten. 4. Deutscher AAL-Kongress. Berlin, Germany
- Erkennung und Klassifikation von akustischen Ereignissen zur häuslichen Pflege. 4. Deutscher AAL-Kongress. Berlin, Germany
- . 2010 3rd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2010), 7 November 2010 - 10 November 2010.
- . The 12th IEEE International Conference on e-Health Networking, Applications and Services, 1 July 2010 - 3 July 2010.
- Objective Quality Measures for Dereverberation Methods based on Room Impulse Response Equalization. Proc. German Annual Conference on Acoustics (DAGA). Berlin, Germany
- . 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 14 March 2010 - 19 March 2010.
- The Lower Saxony Research Network Design of Environments for Ageing (GAL) - Towards Interdisciplinary Research on ICT in Aging Societies. Medizininformatik-Weltkongress Medinfo 2010
- How can audio technology improve working conditions?. Change 2009 –Ambient Assisted Working Accessible and assistive ICT in Enterprise Environments, Emden, Germany
- Estimation of the Optimum System Delay for Speech Dereverberation by Inverse Filtering. International Conference on Acoustics (NAG/DAGA 2009). Rotterdam, The Netherlands
- Direction of Arrival Estimation based on the Dual Delay Line Approach for Binaural Hearing Aid Microphone Arrays. Int. Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) (pp 185-188). Xiamen, China
- . 2008 42nd Asilomar Conference on Signals, Systems and Computers, 26 October 2008 - 29 October 2008.
- Combined Source Tracking and Noise Reduction for Application in Hearing Aids. 8. ITG-Fachtagung Sprachkommunikation. Aachen, Germany
- A Decoupled Filtered-X LMS Algorithm for Listening-Room Compensation. Proc. Int. Workshop on Acoustic Echo and Noise Control (IWAENC). Seattle, USA
- . 2008 Hands-Free Speech Communication and Microphone Arrays, 6 May 2008 - 8 May 2008.
- System Identification for Multi-Channel Listening-Room Compensation using an Acoustic Echo Canceller. Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA) (pp 224-227). Trento, Italy
- Room Impulse Response Shaping based on Estimates of Room Impulse Responses. German Annual Conference on Acoustics (DAGA) (pp 829-830). Dresden, Germany
- . 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 31 March 2008 - 4 April 2008.
- System identification for multi-channel listening-room compensation using an acoustic echo canceller. 2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (pp 225-+)
- . 2007 IEEE International Symposium on Circuits and Systems, 27 May 2007 - 30 May 2007.
- Least Squares Equalizer Design under Consideration of Tail Effects. Proc. German Annual Conference on Acoustics (DAGA) (pp 599-600). Stuttgart, Germany
- . 2007 International Symposium on Intelligent Signal Processing and Communication Systems, 28 November 2007 - 1 December 2007.
- Direction of arrival estimation based on the dual delay line approach for binaural hearing aid microphone arrays. 2007 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, VOLS 1 AND 2 (pp 112-+)
- A psychoacoustic noise reduction approach for stereo hands-free systems. Audio Engineering Society - 120th Convention Spring Preprints 2006, Vol. 4 (pp 1980-1989)
- Multichannel-noise reduction-systems for speaker identification in an automotive environment. Audio Engineering Society - 120th Convention Spring Preprints 2006, Vol. 4 (pp 1941-1952)
- . 2006 Fortieth Asilomar Conference on Signals, Systems and Computers, 29 October 2006 - 1 November 2006.
- MetricGAN+KAN: Kolmogorov-Arnold Networks in Metric-Driven Speech Enhancement Systems. Proc. 2025 International Conference on Acoustics, Speech, and Signal Processing, 6 April 2025 - 6 April 2025.
- Transcription-free fine-tuning of speech separation models for noisy and reverberant multi-speaker automatic speech recognition. Proceedings of Interspeech 2024. Kos Island, Greece, 1 September 2024 - 1 September 2024.
- Training data augmentation for dysarthric automatic speech recognition by text-to-dysarthric-speech synthesis. Proceedings of Interspeech 2024. Kos island, Greece, 1 September 2024 - 1 September 2024.
- Combining Conformer and Dual-Path-Transformer Networks for Single Channel Noisy Reverberant Speech Separation. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Active Learning for Sound Event Classification using Bayesian Neural Networks with Gaussian Variational Posterior. Proc. 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP24). Seoul, South Korea, 14 April 2024 - 19 April 2024.
- Non-intrusive speech intelligibility prediction for hearing-impaired users using intermediate ASR features and human memory models. 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2024). Seoul, Korea, 14 April 2024 - 14 April 2024.
- Multi-CMGAN+/+: leveraging multi-objective speech quality metric prediction for speech enhancement. 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024). Seoul, Korea, 14 April 2024 - 14 April 2024.
- Improving audiovisual active speaker detection in egocentric recordings with the data-efficient image transformer. Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2023). Taipei, Taiwan, 16 December 2023 - 16 December 2023.
- On time domain conformer models for monaural speech separation in noisy reverberant acoustic environments. Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding. Beitou, Taipei, 16 December 2023 - 16 December 2023.
- . 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023)
- On data sampling strategies for training neural network speech separation models. 2023 31st European Signal Processing Conference (EUSIPCO). Helsinki, Finland, 4 September 2023 - 4 September 2023.
- Message Recommendation Strategies for Tailoring Health Information to Promote Physical Activities. Communications in Computer and Information Science (CCIS). Copenhagen, Denmark, 23 July 2023 - 23 July 2023.
- PAMGAN+/-: Improving Phase-Aware Speech Enhancement Performance via Expanded Discriminator Training. AES Convention Europe 2023
- Moving Towards Non-Binary Gender Identification Via Analysis of System Errors in Binary Gender Classification. 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023), 4 June 2023 - 10 June 2023.
- Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation. Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation, 1 September 2022.
- Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation. IEEE 30th European Signal Processing Conference
- Residual Echo Power Spectral Density Estimation Based on an Optimal Smoothed Misalignment For Acoustic Echo Cancelation. Proc. Int. Workshop on Acoustic Echo and Noise Control (IWAENC-2005) , Eindhoven, The Netherlands (pp 209-212)
- Comparison of Speech Enhancement Systems for Noise Fields in a Car Environment. German 32. Deutsche Jahrestagung für Akustik (DAGA’06) (pp 45-46). Braunschweig, Germany
- Performance of Text-Independent Speaker Identification considering In-Car Acoustics. German 32. Deutsche Jahrestagung für Akustik (DAGA’06) (pp 223-224). Braunschweig, Germany
- Multi-Channel Speech Enhancement using a Psychoacoustic Approach for a Post-Filter. German ITG-Symposium on Speech Communication. Kiel, Germany
- Active Learning for Sound Event Classification using Monte-Carlo Dropout and PANN Embeddings. Proc. DCASE Workshop. Online, 15 November 2021 - 19 November 2021.
Reports
- Clarity Prediction Challenge 1 Entry: Non-intrusive Speech Intelligibility Metric Prediction - Technical Report
Preprints
- Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement, arXiv.
- Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition, arXiv.
- Training Data Augmentation for Dysarthric Automatic Speech Recognition by Text-to-Dysarthric-Speech Synthesis, arXiv.
- Hallucination in Perceptual Metric-Driven Speech Enhancement Networks, arXiv.
- , arXiv.
- Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement, arXiv.
- , arXiv.
- , arXiv.
- , arXiv.
- , arXiv.
- , JMIR Publications Inc..
- , arXiv.
- Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation, arXiv.
- Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation, arXiv.
- MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data.
- , arXiv.
- Grants
-
Research Grants
- Participatory co-design of a platform for collecting atypical speech data, Research England, 03/2022 - 07/2022, £19,692, as PI