Overlap-add Windows with Maximum Energy Concentration for Speech and Audio Processing

Research output: Article in book/conference proceedings, peer-reviewed




Processing of speech and audio signals with time-frequency representations requires windowing methods that allow perfect reconstruction of the original signal and whose processing artifacts behave predictably. The most common approach for this purpose is overlap-add windowing, where signal segments are windowed before and after processing. Commonly used windows include the half-sine window and the Kaiser-Bessel-derived window. The latter is an approximation of the discrete prolate spheroidal sequence, and thus a maximum energy concentration window, adapted for overlap-add. We demonstrate that performance can be improved by including the overlap-add structure as a constraint in the optimization of the maximum energy concentration criterion. The same approach can be used to find further special cases, such as optimal low-overlap windows. Our experiments demonstrate that the proposed windows provide notable reductions in side-lobe magnitude.
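
The perfect-reconstruction requirement mentioned in the abstract can be illustrated with a small sketch. It is not the optimization proposed in the paper; it only checks the two commonly used windows named above, under the assumptions of 50% overlap and the same window applied before and after processing, so that the squared window and its half-length shift must sum to one. The function names, the window length of 64, and the Kaiser parameter `alpha=4.0` are illustrative choices, not values from the source.

```python
import math


def half_sine(length):
    """Half-sine window of even length `length`."""
    return [math.sin(math.pi * (n + 0.5) / length) for n in range(length)]


def bessel_i0(x, terms=60):
    """Zeroth-order modified Bessel function, evaluated by its power series."""
    total, term = 1.0, 1.0
    for k in range(1, terms):
        term *= (x / (2.0 * k)) ** 2
        total += term
    return total


def kaiser_bessel_derived(length, alpha=4.0):
    """Kaiser-Bessel-derived window of even length `length` (alpha is assumed)."""
    half = length // 2
    beta = math.pi * alpha
    # Unnormalized Kaiser window of length half + 1; the scale cancels below.
    kaiser = [
        bessel_i0(beta * math.sqrt(max(0.0, 1.0 - (2.0 * n / half - 1.0) ** 2)))
        for n in range(half + 1)
    ]
    total = sum(kaiser)
    rising, running = [], 0.0
    for n in range(half):
        running += kaiser[n]
        rising.append(math.sqrt(running / total))
    # The second half mirrors the first.
    return rising + rising[::-1]


def reconstruction_error(window):
    """Largest deviation of w[n]^2 + w[n + N/2]^2 from 1."""
    half = len(window) // 2
    return max(abs(window[n] ** 2 + window[n + half] ** 2 - 1.0) for n in range(half))


def overlap_add_roundtrip(signal, window):
    """Window each frame twice (before and after a no-op processing step), then overlap-add."""
    length, hop = len(window), len(window) // 2
    out = [0.0] * len(signal)
    for start in range(0, len(signal) - length + 1, hop):
        for n in range(length):
            out[start + n] += signal[start + n] * window[n] * window[n]
    return out


if __name__ == "__main__":
    for name, win in (("half-sine", half_sine(64)), ("KBD", kaiser_bessel_derived(64))):
        print(name, "reconstruction error:", reconstruction_error(win))
```

With no processing between the two windowing steps, `overlap_add_roundtrip` returns the input unchanged wherever two frames overlap; only the first and last half-frame, which are covered by a single frame, remain attenuated.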


Title: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Status: Published - 1 May 2019
MoE publication type: A4 Article in conference proceedings
Event: IEEE International Conference on Acoustics, Speech, and Signal Processing - Brighton, United Kingdom
Duration: 12 May 2019 to 17 May 2019
Conference number: 44


Name: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
ISSN (print): 1520-6149
ISSN (electronic): 2379-190X


Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing


ID: 33980868