BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models

Eloi Moliner Juanpere, Jean Marie Lemercier, Simon Welker, Timo Gerkmann, Vesa Valimaki

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

2 Citations (Scopus)
17 Downloads (Pure)

Abstract

In this paper, we present an unsupervised single-channel method for joint blind dereverberation and room impulse response estimation, based on posterior sampling with diffusion models. We parameterize the reverberation operator using a filter with exponential decay for each frequency subband, and iteratively estimate the corresponding parameters as the speech utterance gets refined along the reverse diffusion trajectory. A measurement consistency criterion enforces the fidelity of the generated speech with the reverberant measurement, while an unconditional diffusion model implements a strong prior for clean speech generation. Without any knowledge of the room impulse response nor any coupled reverberant-anechoic data, we can successfully perform dereverberation in various acoustic scenarios. Our method significantly outperforms previous blind unsupervised baselines, and we demonstrate its increased robustness to unseen acoustic conditions in comparison to blind supervised methods.

Original languageEnglish
Title of host publication2024 18th International Workshop on Acoustic Signal Enhancement, IWAENC 2024 - Proceedings
PublisherIEEE
Pages120-124
Number of pages5
ISBN (Electronic)979-8-3503-6185-8
DOIs
Publication statusPublished - 2024
MoE publication typeA4 Conference publication
EventInternational Workshop on Acoustic Signal Enhancement - Aalborg, Denmark
Duration: 9 Sept 202412 Sept 2024
Conference number: 18

Publication series

Name2024 18th International Workshop on Acoustic Signal Enhancement, IWAENC 2024 - Proceedings

Workshop

WorkshopInternational Workshop on Acoustic Signal Enhancement
Abbreviated titleIWAENC
Country/TerritoryDenmark
CityAalborg
Period09/09/202412/09/2024

Keywords

  • Acoustics
  • deep learning
  • speech enhancement

Fingerprint

Dive into the research topics of 'BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models'. Together they form a unique fingerprint.

Cite this