Towards Objective Evaluation of Audio Time-Scale Modification Methods

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review



The need for high-quality time-scale modification of audio is increasing, as media streaming services are providing new related functionalities to their users. The main goal of a time-stretching method is to preserve the pitch and the subjective quality of the different components of the audio signal, namely transients, noise, and tonal components. Many solutions have been proposed over the years, with results that vary depending on the kind of audio input being processed. This paper introduces an evaluation method for audio time-scaling algorithms based on a recent fuzzy time-frequency decomposition, which reveals the energy of the tonal, transient, and noise components in the original and stretched sounds. From the energy curves, typical impairments, such as transient smearing and the loss of tonality, can be observed. This analysis approach is compared with subjective preferences among different techniques, which leads to suggestions for possible improvements of future algorithms. The ultimate goal is an objective evaluation method that matches subjective quality assessment.
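As a rough illustration of the kind of energy curves the abstract refers to, the sketch below computes per-frame tonal, transient, and noise energies from a magnitude spectrogram. It is not the paper's implementation: it assumes a median-filter-based fuzzy classification (tonalness from a median along time, transientness as its complement, noisiness highest where the two are equal) and assumes that each bin's energy is weighted by its membership values. The function names, the filter length, and the toy spectrogram are hypothetical.

```python
from statistics import median


def median_filter(values, length):
    """Median-filter a sequence, shrinking the window at the edges."""
    half = length // 2
    return [
        median(values[max(0, i - half): i + half + 1])
        for i in range(len(values))
    ]


def component_energy_curves(mag, filt_len=5):
    """Per-frame tonal, transient, and noise energy for mag[frame][bin]."""
    n_frames, n_bins = len(mag), len(mag[0])
    # Median along time (one bin at a time) keeps steady, tonal ridges.
    along_time = [
        median_filter([mag[m][k] for m in range(n_frames)], filt_len)
        for k in range(n_bins)
    ]
    # Median along frequency (one frame at a time) keeps broadband transients.
    along_freq = [median_filter(frame, filt_len) for frame in mag]

    curves = {"tonal": [], "transient": [], "noise": []}
    for m in range(n_frames):
        tonal = transient = noise = 0.0
        for k in range(n_bins):
            xs, xt = along_time[k][m], along_freq[m][k]
            # Fuzzy memberships in [0, 1]; 0.5 when neither filter responds.
            r_tonal = xs / (xs + xt) if xs + xt > 0 else 0.5
            r_transient = 1.0 - r_tonal
            r_noise = 1.0 - abs(r_tonal - r_transient)
            # Assumption: weight each bin's energy by its membership values.
            energy = mag[m][k] ** 2
            tonal += r_tonal * energy
            transient += r_transient * energy
            noise += r_noise * energy
        curves["tonal"].append(tonal)
        curves["transient"].append(transient)
        curves["noise"].append(noise)
    return curves


# Toy spectrogram: a steady partial in bin 4 plus a click at frame 4.
mag = [[1.0 if (m == 4 or k == 4) else 0.0 for k in range(9)] for m in range(9)]
curves = component_energy_curves(mag)
```

Running the same function on the spectrograms of an original and a time-stretched signal and plotting the curves side by side would be one way to look for the impairments mentioned above: under these assumptions, transient smearing would appear as a lower, wider peak in the transient curve, and loss of tonality as a drop in the tonal curve.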
Original language: English
Title of host publication: Proceedings of the 17th Sound and Music Computing Conference
Publisher: Axea sas/SMC Network
Number of pages: 6
ISBN (Electronic): 978-88-945415-0-2
Publication status: Published - Jun 2020
MoE publication type: A4 Article in a conference publication
Event: Sound and Music Computing Conference - Online, Torino, Italy
Duration: 24 Jun 2020 → 26 Jun 2020
Conference number: 17

Publication series

Name: Proceedings of the 17th Sound and Music Computing Conference
ISSN (Electronic): 2518-3672


Conference: Sound and Music Computing Conference
Abbreviated title: SMC


  • Time-Scale Modification
  • Objective Evaluation
  • Time stretching
  • audio signal processing

