Projects per year
Abstract
The need for high-quality time-scale modification of audio is increasing, as media streaming services are providing new related functionalities to their users. The main
goal of a time-stretching method is to preserve the pitch and the subjective quality of the different components of the audio signal, namely transients, noise, and tonal components. Many solutions have been proposed throughout the years, with various results depending on the kind of processed audio input. This paper introduces an evaluation method for audio time-scaling algorithms based on a recent fuzzy time-frequency decomposition, which reveals the energy of the tonal, transient, and noise components in the original and stretched sounds. From the energy curves, typical impairments, such as transient smearing and the loss of tonality, can be observed. This analysis approach is compared with the subjective preferences of different techniques. This leads to suggestions for possible improvements of future algorithms. The ultimate goal is having an objective evaluation method which matches the subjective quality assessment.
goal of a time-stretching method is to preserve the pitch and the subjective quality of the different components of the audio signal, namely transients, noise, and tonal components. Many solutions have been proposed throughout the years, with various results depending on the kind of processed audio input. This paper introduces an evaluation method for audio time-scaling algorithms based on a recent fuzzy time-frequency decomposition, which reveals the energy of the tonal, transient, and noise components in the original and stretched sounds. From the energy curves, typical impairments, such as transient smearing and the loss of tonality, can be observed. This analysis approach is compared with the subjective preferences of different techniques. This leads to suggestions for possible improvements of future algorithms. The ultimate goal is having an objective evaluation method which matches the subjective quality assessment.
Original language | English |
---|---|
Title of host publication | Proceedings of the 17th Sound and Music Computing Conference |
Publisher | Axea sas/SMC Network |
Pages | 457-462 |
Number of pages | 6 |
ISBN (Electronic) | 978-88-945415-0-2 |
Publication status | Published - Jun 2020 |
MoE publication type | A4 Article in a conference publication |
Event | Sound and Music Computing Conference - Online, Torino, Italy Duration: 24 Jun 2020 → 26 Jun 2020 Conference number: 17 https://smc2020torino.it/uk/ |
Publication series
Name | Proceedings of the 17th Sound and Music Computing Conference |
---|---|
ISSN (Electronic) | 2518-3672 |
Conference
Conference | Sound and Music Computing Conference |
---|---|
Abbreviated title | SMC |
Country/Territory | Italy |
City | Torino |
Period | 24/06/2020 → 26/06/2020 |
Internet address |
Keywords
- Time-Scale Modification
- Objective Evaluation
- Time stretching
- audio signal processing
Fingerprint
Dive into the research topics of 'Towards Objective Evaluation of Audio Time-Scale Modification Methods'. Together they form a unique fingerprint.Projects
- 2 Active
-
NordicSMC Aalto
Välimäki, V., Alary, B., Fierro, L., Prawda, K., Moliner Juanpere, E., Pulkki, V., Wirler, S., Fagerström, J., Liski, J. & Wright, A.
01/01/2018 → 31/12/2023
Project: Other external funding: Other foreign funding
-
NordicSMC: Nordic Sound and Music Computing Network
Välimäki, V., Louise, B., Mikkonen, O., Prawda, K., Roberts, J. & Fagerström, J.
01/01/2018 → 31/12/2023
Project: Other external funding: Other foreign funding
Equipment
-
Aalto Acoustics Lab
Ville Pulkki (Manager)
School of Electrical EngineeringFacility/equipment: Facility