An evaluation of stereo speech enhancement methods for different audio-visual scenarios

Alexandra Craciun, Christian Uhle, Tom Backstrom

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > Scientific > peer-review

5 Citations (Scopus)

Abstract

Following speech on TV or radio in the presence of interferers is sometimes challenging, in particular for the elderly and the hearing-impaired. To evaluate the performance of speech enhancement methods for such scenarios, we consider a stereo mixture composed of a speech signal and interfering sources. We apply different approaches to separate the mixture into two components, where the first component contains mainly speech (the desired signal) and the second component contains the rest of the mixture. An improved stereo signal is constructed by recombining these components such that speech gets emphasized with respect to the rest of the mixture and at the same time the amount of artifacts is kept to a minimum. Listening tests and objective measures show that the center extraction approach is in general the most effective, although it is sensitive to speaker positioning.
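The recombination step described in the abstract can be illustrated with a minimal sketch. It assumes some separation method has already produced a stereo speech estimate, and it treats "the rest of the mixture" as the mixture minus that estimate. The function name `remix`, the `residual_gain_db` parameter, and its -6 dB default are hypothetical illustrations and are not taken from the paper.

```python
def remix(mixture, speech_estimate, residual_gain_db=-6.0):
    """Recombine a speech estimate with an attenuated remainder of the mixture.

    mixture, speech_estimate: equal-length lists of (left, right) sample pairs.
    residual_gain_db: hypothetical gain (in dB) applied to the non-speech part.
    """
    if len(mixture) != len(speech_estimate):
        raise ValueError("mixture and speech_estimate must have the same length")

    # Convert the dB value to a linear amplitude factor.
    gain = 10.0 ** (residual_gain_db / 20.0)

    output = []
    for (x_left, x_right), (s_left, s_right) in zip(mixture, speech_estimate):
        # Assumption: the second component is whatever remains after
        # subtracting the speech estimate from the mixture.
        r_left = x_left - s_left
        r_right = x_right - s_right
        # Speech is kept at unit gain; the remainder is attenuated.
        output.append((s_left + gain * r_left, s_right + gain * r_right))
    return output
```

With a gain of 0 dB the output equals the input mixture, and lowering the gain raises the speech level relative to the remainder. Keeping part of the remainder rather than removing it entirely is one simple way to limit audible artifacts, in line with the trade-off the abstract mentions.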

Original language: English
Title of host publication: 2015 23rd European Signal Processing Conference, EUSIPCO 2015
Publisher: IEEE
Pages: 2048-2052
Number of pages: 5
ISBN (Electronic): 9780992862633
Publication status: Published - 22 Dec 2015
MoE publication type: A4 Article in a conference publication
Event: European Signal Processing Conference - Nice, France
Duration: 31 Aug 2015 to 4 Sep 2015
Conference number: 23

Conference

Conference: European Signal Processing Conference
Abbreviated title: EUSIPCO
Country: France
City: Nice
Period: 31/08/2015 to 04/09/2015

Keywords

  • center extraction
  • direct-ambient decomposition
  • noise suppression
  • speech enhancement
