Gaussian Flow Bridges for Audio Domain Transfer with Unpaired Data

Eloi Moliner Juanpere*, Sebastian Braun, Hannes Gamper

*Corresponding author for this work

Research output: Article in book/conference proceedings, Conference article in proceedings, Scientific, peer-reviewed

Abstract

Audio domain transfer is the process of modifying audio signals to match characteristics of a different domain, while retaining the original content. Examples include transferring room acoustics or altering audio effects such as distortion. This paper investigates the potential of Gaussian Flow Bridges, an emerging approach in generative modeling, for these problems. The presented framework addresses the transport problem across different distributions of audio signals through the implementation of a series of two deterministic probability flows. The proposed framework facilitates manipulation of the target distribution properties through a continuous control variable, which defines a certain aspect of the target domain. Notably, this approach does not rely on paired examples for training. To address identified challenges in keeping the speech content consistent, we recommend a training strategy that incorporates chunk-based minibatch Optimal Transport couplings of data samples and noise. Comparing our unsupervised method with established baselines, we find competitive performance in tasks of reverberation and distortion manipulation. Despite encountering limitations, the intriguing results obtained in this study underscore potential for further exploration.

Original language: English
Title of host publication: 2024 18th International Workshop on Acoustic Signal Enhancement, IWAENC 2024 - Proceedings
Publisher: IEEE
Pages: 374-378
Number of pages: 5
ISBN (electronic): 979-8-3503-6185-8
Status: Published - 2024
MoE publication type: A4 Article in conference proceedings
Event: International Workshop on Acoustic Signal Enhancement - Aalborg, Denmark
Duration: 9 Sept 2024 to 12 Sept 2024
Conference number: 18

Publication series

Name: 2024 18th International Workshop on Acoustic Signal Enhancement, IWAENC 2024 - Proceedings

Workshop

Workshop: International Workshop on Acoustic Signal Enhancement
Abbreviated title: IWAENC
Country/Territory: Denmark
City: Aalborg
Period: 09/09/2024 to 12/09/2024
