Privacy and Quality Improvements in Open Offices Using Multi-Device Speech Enhancement

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference article in proceedingsProfessional

28 Lataukset (Pure)

Abstrakti

Teleconferencing has increased in popularity and often takes place around other people such as open offices. A particular problem of such environments is that multiple users can have independent conversations simultaneously, which leak into each others’ devices. This poses problems of both privacy and quality. In this work, we introduce a multi-device, targeted speech separation network. We call this network IsoNet, as it isolates the dominant speech in a mixture of multiple speakers by generating a mask from interfering speakers. This mask is used to remove speech from other simultaneous conversations in the enhanced speech signal. The privacy improvement is measured by mutual information and the enhancement quality is evaluated with a MUSHRA test, PESQ, and SI-SNR. Our experiments show a statistically significant improvement with IsoNet from 27 to 75 in MUSHRA score and a decrease of mutual information of 60%. IsoNet improves privacy as sensitive speech content is effectively attenuated.
AlkuperäiskieliEnglanti
Otsikko3rd Symposium on Security and Privacy in Speech Communication
KustantajaInternational Speech Communication Association (ISCA)
Sivumäärä5
DOI - pysyväislinkit
TilaJulkaistu - 19 elok. 2023
OKM-julkaisutyyppiD3 Artikkeli ammatillisessa konferenssijulkaisussa
TapahtumaISCA Symposium on Security and Privacy in Speech Communication - Dublin, Irlanti
Kesto: 19 elok. 202319 elok. 2023

Conference

ConferenceISCA Symposium on Security and Privacy in Speech Communication
Maa/AlueIrlanti
KaupunkiDublin
Ajanjakso19/08/202319/08/2023

Sormenjälki

Sukella tutkimusaiheisiin 'Privacy and Quality Improvements in Open Offices Using Multi-Device Speech Enhancement'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä