Overlap-add Windows with Maximum Energy Concentration for Speech and Audio Processing

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceedings › Scientific › peer-review

Abstract

Processing of speech and audio signals with time-frequency representations requires windowing methods that allow perfect reconstruction of the original signal and in which processing artifacts behave predictably. The most common approach for this purpose is overlap-add windowing, where signal segments are windowed before and after processing. Commonly used windows include the half-sine and the Kaiser-Bessel derived window. The latter is an approximation of the discrete prolate spherical sequence, and thus a maximum energy concentration window, adapted for overlap-add. We demonstrate that performance can be improved by including the overlap-add structure as a constraint in the optimization of the maximum energy concentration criterion. The same approach can be used to find further special cases such as optimal low-overlap windows. Our experiments demonstrate that the proposed windows provide notable improvements in terms of reduction in side-lobe magnitude.
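
The overlap-add scheme described in the abstract can be illustrated with a minimal sketch. It is not taken from the paper and does not implement the proposed optimized windows; it only shows the classic half-sine window with 50% overlap, applied once before and once after a (here empty) processing step. The function names, the frame length of 64, and the random test signal are assumptions chosen for illustration.

```python
import numpy as np

def half_sine_window(length):
    """Half-sine window: w[n] = sin(pi * (n + 0.5) / length)."""
    n = np.arange(length)
    return np.sin(np.pi * (n + 0.5) / length)

def overlap_add_roundtrip(signal, window):
    """Window each 50%-overlapping frame twice and overlap-add the result.

    No processing is applied between the analysis and synthesis windows,
    so any reconstruction error is caused by the window alone.
    """
    length = len(window)
    hop = length // 2
    output = np.zeros_like(signal)
    for start in range(0, len(signal) - length + 1, hop):
        frame = signal[start:start + length] * window   # analysis window
        output[start:start + length] += frame * window  # synthesis window
    return output

rng = np.random.default_rng(0)
signal = rng.standard_normal(1024)   # illustrative test signal (assumption)
window = half_sine_window(64)        # illustrative frame length (assumption)
reconstructed = overlap_add_roundtrip(signal, window)

# The first and last half-frames are covered by a single frame only,
# so the comparison is restricted to the interior of the signal.
hop = len(window) // 2
max_error = np.max(np.abs(reconstructed[hop:-hop] - signal[hop:-hop]))

# Perfect-reconstruction condition for 50% overlap:
# w[n]^2 + w[n + hop]^2 should equal 1 for every n in the first half.
pr_condition = window[:hop] ** 2 + window[hop:] ** 2
```

In this sketch, `max_error` stays at the level of floating-point rounding because the squared half-sine window and its half-frame shift sum to one. Any other window meeting the same condition would reconstruct equally well, which is why the remaining design freedom can be spent on properties such as side-lobe magnitude.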
Original language: English
Title of host publication: 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019; Brighton; United Kingdom; 12-17 May 2019 : Proceedings
Publisher: IEEE
Pages: 491-495
Number of pages: 5
ISBN (Electronic): 978-1-4799-8131-1
ISBN (Print): 978-1-4799-8132-8
Publication status: Published - 1 May 2019
MoE publication type: A4 Conference publication
Event: IEEE International Conference on Acoustics, Speech, and Signal Processing - Brighton, United Kingdom
Duration: 12 May 2019 → 17 May 2019
Conference number: 44

Publication series

Name: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
ISSN (Print): 1520-6149
ISSN (Electronic): 2379-190X

Conference

Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing
Abbreviated title: ICASSP
Country/Territory: United Kingdom
City: Brighton
Period: 12/05/2019 → 17/05/2019

Keywords

  • time-frequency processing
  • windowing
  • discrete prolate spherical sequences

Projects

  • Interdisciplinary research on statistical parametric speech synthesis

    Alku, P. (Principal investigator), Bäckström, T. (Project Member), Nonavinakere Prabhakera, N. (Project Member), Bollepalli, B. (Project Member), Murtola, T. (Project Member), Airaksinen, M. (Project Member) & Juvela, L. (Project Member)

    01/01/2018 → 31/12/2019

    Project: Academy of Finland: Other research funding
