Abstract
Processing of speech and audio signals with time-frequency representations require windowing methods which allow perfect reconstruction of the original signal and where processing artifacts have a predictable behavior. The most common approach for this purpose is overlap-add windowing, where signal segments are windowed before and after processing. Commonly used windows include the half-sine and a Kaiser-Bessel derived window. The latter is an approximation of the discrete prolate spherical sequence, and thus a maximum energy concentration window, adapted for overlap-add. We demonstrate that performance can be improved by including the overlap-add structure as a constraint in optimization of the maximum energy concentration criteria. The same approach can be used to find further special cases such as optimal low-overlap windows. Our experiments demonstrate that the proposed windows provide notable improvements in terms of reduction in side-lobe magnitude.
| Original language | English |
|---|---|
| Title of host publication | 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019; Brighton; United Kingdom; 12-17 May 2019 : Proceedings |
| Publisher | IEEE |
| Pages | 491-495 |
| Number of pages | 5 |
| ISBN (Electronic) | 978-1-4799-8131-1 |
| ISBN (Print) | 978-1-4799-8132-8 |
| DOIs | |
| Publication status | Published - 1 May 2019 |
| MoE publication type | A4 Conference publication |
| Event | IEEE International Conference on Acoustics, Speech, and Signal Processing - Brighton, United Kingdom Duration: 12 May 2019 → 17 May 2019 Conference number: 44 |
Publication series
| Name | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing |
|---|---|
| ISSN (Print) | 1520-6149 |
| ISSN (Electronic) | 2379-190X |
Conference
| Conference | IEEE International Conference on Acoustics, Speech, and Signal Processing |
|---|---|
| Abbreviated title | ICASSP |
| Country/Territory | United Kingdom |
| City | Brighton |
| Period | 12/05/2019 → 17/05/2019 |
Keywords
- time-frequency processing
- windowing
- discrete prolate spherical sequences
Fingerprint
Dive into the research topics of 'Overlap-add Windows with Maximum Energy Concentration for Speech and Audio Processing'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Interdisciplinary research on statistical parametric speech synthesis
Alku, P. (Principal investigator), Bäckström, T. (Project Member), Nonavinakere Prabhakera, N. (Project Member), Bollepalli, B. (Project Member), Murtola, T. (Project Member), Airaksinen, M. (Project Member) & Juvela, L. (Project Member)
01/01/2018 → 31/12/2019
Project: Academy of Finland: Other research funding
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver