TY - GEN
T1 - Sparse time-frequency representation of speech by the Vandermonde transform
AU - Pedersen, Christian Fischer
AU - Bäckström, Tom
PY - 2014/1/1
Y1 - 2014/1/1
N2 - Efficient speech signal representations are prerequisite for efficient speech processing algorithms. The Vandermonde transform is a recently introduced time-frequency transform which provides a sparse and uncorrelated speech signal representation. In contrast, the Fourier transform only decorrelates the signal approximately. To achieve complete decorrelation, the Vandermonde transform is signal adaptive like the Karhunen-Loève transform. Unlike the Karhunen-Loève, however, the Vandermonde transform is a time-frequency transform where the transform domain components correspond to frequency components of the analysis window. In this paper we analyze the performance of sparse speech signal representation by the Vandermonde transform. This is done by applying matching pursuit and comparing with sparse representations based on dictionaries with Fourier, Cosine, Gabor and Karhunen-Loève atoms. Our results show that Karhunen-Loève yields the best sparse signal recovery; however, this is not strictly a time-frequency transform. Of the true time-frequency transforms, Vandermonde is the most efficient for sparse speech signal representation.
AB - Efficient speech signal representations are prerequisite for efficient speech processing algorithms. The Vandermonde transform is a recently introduced time-frequency transform which provides a sparse and uncorrelated speech signal representation. In contrast, the Fourier transform only decorrelates the signal approximately. To achieve complete decorrelation, the Vandermonde transform is signal adaptive like the Karhunen-Loève transform. Unlike the Karhunen-Loève, however, the Vandermonde transform is a time-frequency transform where the transform domain components correspond to frequency components of the analysis window. In this paper we analyze the performance of sparse speech signal representation by the Vandermonde transform. This is done by applying matching pursuit and comparing with sparse representations based on dictionaries with Fourier, Cosine, Gabor and Karhunen-Loève atoms. Our results show that Karhunen-Loève yields the best sparse signal recovery; however, this is not strictly a time-frequency transform. Of the true time-frequency transforms, Vandermonde is the most efficient for sparse speech signal representation.
KW - Orthogonal matching pursuit
KW - Sparse speech representation
KW - Vandermonde transform
UR - http://www.scopus.com/inward/record.url?scp=84910058264&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84910058264
T3 - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
SP - 2248
EP - 2252
BT - Proceedings of the Annual Conference of the International Speech Communication Association
PB - International Speech Communication Association
T2 - Interspeech
Y2 - 14 September 2014 through 18 September 2014
ER -