Privacy and Quality Improvements in Open Offices Using Multi-Device Speech Enhancement

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsProfessional

26 Downloads (Pure)

Abstract

Teleconferencing has increased in popularity and often takes place around other people such as open offices. A particular problem of such environments is that multiple users can have independent conversations simultaneously, which leak into each others’ devices. This poses problems of both privacy and quality. In this work, we introduce a multi-device, targeted speech separation network. We call this network IsoNet, as it isolates the dominant speech in a mixture of multiple speakers by generating a mask from interfering speakers. This mask is used to remove speech from other simultaneous conversations in the enhanced speech signal. The privacy improvement is measured by mutual information and the enhancement quality is evaluated with a MUSHRA test, PESQ, and SI-SNR. Our experiments show a statistically significant improvement with IsoNet from 27 to 75 in MUSHRA score and a decrease of mutual information of 60%. IsoNet improves privacy as sensitive speech content is effectively attenuated.
Original languageEnglish
Title of host publication3rd Symposium on Security and Privacy in Speech Communication
PublisherInternational Speech Communication Association (ISCA)
Number of pages5
DOIs
Publication statusPublished - 19 Aug 2023
MoE publication typeD3 Professional conference proceedings
EventISCA Symposium on Security and Privacy in Speech Communication - Dublin, Ireland
Duration: 19 Aug 202319 Aug 2023

Conference

ConferenceISCA Symposium on Security and Privacy in Speech Communication
Country/TerritoryIreland
CityDublin
Period19/08/202319/08/2023

Keywords

  • privacy-aware
  • voice isolation
  • targeted speech separation
  • multi-device

Fingerprint

Dive into the research topics of 'Privacy and Quality Improvements in Open Offices Using Multi-Device Speech Enhancement'. Together they form a unique fingerprint.

Cite this