Projects per year
Abstract
Processing of speech and audio signals with time-frequency representations require windowing methods which allow perfect reconstruction of the original signal and where processing artifacts have a predictable behavior. The most common approach for this purpose is overlap-add windowing, where signal segments are windowed before and after processing. Commonly used windows include the half-sine and a Kaiser-Bessel derived window. The latter is an approximation of the discrete prolate spherical sequence, and thus a maximum energy concentration window, adapted for overlap-add. We demonstrate that performance can be improved by including the overlap-add structure as a constraint in optimization of the maximum energy concentration criteria. The same approach can be used to find further special cases such as optimal low-overlap windows. Our experiments demonstrate that the proposed windows provide notable improvements in terms of reduction in side-lobe magnitude.
Original language | English |
---|---|
Title of host publication | 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019; Brighton; United Kingdom; 12-17 May 2019 : Proceedings |
Publisher | IEEE |
Pages | 491-495 |
Number of pages | 5 |
ISBN (Electronic) | 978-1-4799-8131-1 |
ISBN (Print) | 978-1-4799-8132-8 |
DOIs | |
Publication status | Published - 1 May 2019 |
MoE publication type | A4 Conference publication |
Event | IEEE International Conference on Acoustics, Speech, and Signal Processing - Brighton, United Kingdom Duration: 12 May 2019 → 17 May 2019 Conference number: 44 |
Publication series
Name | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing |
---|---|
ISSN (Print) | 1520-6149 |
ISSN (Electronic) | 2379-190X |
Conference
Conference | IEEE International Conference on Acoustics, Speech, and Signal Processing |
---|---|
Abbreviated title | ICASSP |
Country/Territory | United Kingdom |
City | Brighton |
Period | 12/05/2019 → 17/05/2019 |
Keywords
- time-frequency processing
- windowing
- discrete prolate spherical sequences
Fingerprint
Dive into the research topics of 'Overlap-add Windows with Maximum Energy Concentration for Speech and Audio Processing'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Interdisciplinary research on statistical parametric speech synthesis
Alku, P. (Principal investigator)
01/01/2018 → 31/12/2019
Project: Academy of Finland: Other research funding