Abstract
Following speech on TV or radio in the presence of interferers is sometimes challenging, in particular for the elderly and the hearing-impaired. To evaluate the performance of speech enhancement methods for such scenarios, we consider a stereo mixture composed of a speech signal and interfering sources. We apply different approaches to separate the mixture into two components, where the first component contains mainly speech (the desired signal) and the second component contains the rest of the mixture. An improved stereo signal is constructed by recombining these components such that speech gets emphasized with respect to the rest of the mixture and at the same time the amount of artifacts is kept to a minimum. Listening tests and objective measures show that the center extraction approach is in general the most effective, although it is sensitive to speaker positioning.
Original language | English |
---|---|
Title of host publication | 2015 23rd European Signal Processing Conference, EUSIPCO 2015 |
Publisher | IEEE |
Pages | 2048-2052 |
Number of pages | 5 |
ISBN (Electronic) | 9780992862633 |
DOIs | |
Publication status | Published - 22 Dec 2015 |
MoE publication type | A4 Article in a conference publication |
Event | European Signal Processing Conference - Nice, France Duration: 31 Aug 2015 → 4 Sep 2015 Conference number: 23 |
Conference
Conference | European Signal Processing Conference |
---|---|
Abbreviated title | EUSIPCO |
Country | France |
City | Nice |
Period | 31/08/2015 → 04/09/2015 |
Keywords
- center extraction
- direct-ambient decomposition
- noise suppression
- speech enhancement