Abstract
This paper focuses on the problem of estimating fundamental frequency from singing voice. Estimation of fundamental frequency is a well studied topic in the speech research community. From the recent studies on fundamental frequency estimation from singing voice with state-of-art methods proposed for speech, there exists a significant gap in accuracy for singing voice. This is mainly because of the wider and rapid variations in pitch in singing voice compared to that in speech. To overcome this, in this paper we propose a method to derive the fundamental frequency from singing voice by exploiting the harmonics of impulse-like excitation in sequence of glottal cycles. The proposed method is compared with the eight state-of-art methods such as YIN, SWIPE, YAAPT, RAPT, SRH, SFF CEP, PEFAC and SHRP on the LYRICS singing database. From the experimental results, it is observed that the accuracy of fundamental frequency by the proposed method is better than many state-of-art methods in various singing categories and laryngeal mechanisms.
Original language | English |
---|---|
Title of host publication | Interspeech |
Publisher | International Speech Communication Association (ISCA) |
Pages | 2319-2323 |
Number of pages | 5 |
Volume | 2018-September |
DOIs | |
Publication status | Published - 2018 |
MoE publication type | A4 Conference publication |
Event | Interspeech - Hyderabad International Convention Centre, Hyderabad, India Duration: 2 Sept 2018 → 6 Sept 2018 http://interspeech2018.org/ |
Publication series
Name | Proceedings of the Annual Conference of the International Speech Communication Association |
---|---|
Publisher | International Speech Communication Association |
ISSN (Print) | 2308-457X |
Conference
Conference | Interspeech |
---|---|
Country/Territory | India |
City | Hyderabad |
Period | 02/09/2018 → 06/09/2018 |
Internet address |
Keywords
- Excitation source
- Fundamental frequency
- Glottal closure instants
- Singing voice