Abstract
Main-stream speech codecs are based on modelling the speech source by a linear predictor. An efficient domain for quantization and coding of this linear predictor is the line spectral frequency representation, where the predictor is encoded into an ordered set of frequencies that correspond to the roots of the corresponding line spectral polynomials. While this representation is robust in terms of quantization, methods available for finding the line spectral frequencies are computationally complex. In this work, we present a method for finding these frequencies using the FFT, including methods for limiting numerical range in fixed-point implementations. Our experiments show that, in comparison to a zero-crossing search in the Chebyshev domain, the proposed method reduces complexity and improves robustness, while retaining accuracy.
Original language | English |
---|---|
Title of host publication | 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Proceedings |
Publisher | IEEE |
Pages | 5122-5126 |
Number of pages | 5 |
Volume | 2015-August |
ISBN (Electronic) | 9781467369978 |
DOIs | |
Publication status | Published - 1 Jan 2015 |
MoE publication type | A4 Article in a conference publication |
Event | IEEE International Conference on Acoustics, Speech, and Signal Processing - Brisbane, Australia Duration: 19 Apr 2015 → 24 Apr 2015 Conference number: 40 |
Conference
Conference | IEEE International Conference on Acoustics, Speech, and Signal Processing |
---|---|
Abbreviated title | ICASSP |
Country | Australia |
City | Brisbane |
Period | 19/04/2015 → 24/04/2015 |
Keywords
- line spectral frequencies
- linear prediction
- root finding
- speech coding