Analytic Filter Bank for Speech Analysis, Feature Extraction and Perceptual Studies

    Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference contributionScientificvertaisarvioitu

    207 Lataukset (Pure)

    Abstrakti

    Speech signal consists of events in time and frequency, and therefore its analysis with high-resolution time-frequency tools is often of importance. Analytic filter bank provides a simple, fast, and flexible method to construct time-frequency representations of signals. Its parameters can be easily adapted to different situations from uniform to any auditory frequency scale, or even to a focused resolution. Since the Hilbert magnitude values of the channels are obtained at every sample, it provides a practical tool for a high-resolution time-frequency analysis.

    The present study describes the basic theory of analytic filters and tests their main properties. Applications of analytic filter bank to different speech analysis tasks including pitch period estimation and pitch synchronous analysis of formant frequencies and bandwidths are demonstrated. In addition, a new feature vector called group delay vector is introduced. It is shown that this representation provides comparable, or even better results, than those obtained by spectral magnitude feature vectors in the analysis and classification of vowels. The implications of this observation are discussed also from the speech perception point of view.
    AlkuperäiskieliEnglanti
    OtsikkoProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
    KustantajaInternational Speech Communication Association
    Sivut449-453
    Sivumäärä5
    Vuosikerta2017-August
    ISBN (painettu)978-1-5108-4876-4
    DOI - pysyväislinkit
    TilaJulkaistu - elok. 2017
    OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisuussa
    TapahtumaInterspeech - Stockholm, Ruotsi
    Kesto: 20 elok. 201724 elok. 2017
    Konferenssinumero: 18
    http://www.interspeech2017.org/

    Julkaisusarja

    NimiInterspeech: Annual Conference of the International Speech Communication Association
    ISSN (elektroninen)1990-9772

    Conference

    ConferenceInterspeech
    Maa/AlueRuotsi
    KaupunkiStockholm
    Ajanjakso20/08/201724/08/2017
    www-osoite

    Sormenjälki

    Sukella tutkimusaiheisiin 'Analytic Filter Bank for Speech Analysis, Feature Extraction and Perceptual Studies'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

    Siteeraa tätä