Abstract
In this paper, we present an unsupervised single-channel method for joint blind dereverberation and room impulse response estimation, based on posterior sampling with diffusion models. We parameterize the reverberation operator using a filter with exponential decay for each frequency subband, and iteratively estimate the corresponding parameters as the speech utterance gets refined along the reverse diffusion trajectory. A measurement consistency criterion enforces the fidelity of the generated speech with the reverberant measurement, while an unconditional diffusion model implements a strong prior for clean speech generation. Without any knowledge of the room impulse response nor any coupled reverberant-anechoic data, we can successfully perform dereverberation in various acoustic scenarios. Our method significantly outperforms previous blind unsupervised baselines, and we demonstrate its increased robustness to unseen acoustic conditions in comparison to blind supervised methods.
Original language | English |
---|---|
Title of host publication | 2024 18th International Workshop on Acoustic Signal Enhancement, IWAENC 2024 - Proceedings |
Publisher | IEEE |
Pages | 120-124 |
Number of pages | 5 |
ISBN (Electronic) | 979-8-3503-6185-8 |
DOIs | |
Publication status | Published - 2024 |
MoE publication type | A4 Conference publication |
Event | International Workshop on Acoustic Signal Enhancement - Aalborg, Denmark Duration: 9 Sept 2024 → 12 Sept 2024 Conference number: 18 |
Publication series
Name | 2024 18th International Workshop on Acoustic Signal Enhancement, IWAENC 2024 - Proceedings |
---|
Workshop
Workshop | International Workshop on Acoustic Signal Enhancement |
---|---|
Abbreviated title | IWAENC |
Country/Territory | Denmark |
City | Aalborg |
Period | 09/09/2024 → 12/09/2024 |
Keywords
- Acoustics
- deep learning
- speech enhancement