Perceptual Loss Function for Neural Modelling of Audio Systems

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

Abstract

This work investigates alternative pre-emphasis filters used as part of the loss function during neural network training for nonlinear audio processing. In our previous work, the error-to-signal ratio loss function was used during network training, with a first-order highpass pre-emphasis filter applied to both the target signal and the neural network output. This work considers more perceptually relevant pre-emphasis filters, which include lowpass filtering at high frequencies. We conducted listening tests to determine whether they improve the quality of a neural network model of a guitar tube amplifier. Listening test results indicate that an A-weighting pre-emphasis filter offers the best improvement among the tested filters. The proposed perceptual loss function improves the sound quality of neural network models in audio processing without affecting the computational cost.
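
As a rough illustration of the loss described in the abstract, the sketch below computes an error-to-signal ratio after applying the same first-order highpass pre-emphasis filter to the target signal and the model output. The function names, the filter coefficient of 0.95, and the small stabilizing constant are assumptions made for illustration and are not taken from this record. The A-weighting filter that the listening tests favoured is not reproduced here.

```python
import numpy as np

def pre_emphasis(x, coeff=0.95):
    """First-order highpass FIR filter: y[n] = x[n] - coeff * x[n-1].

    The coefficient 0.95 is an assumed, illustrative value.
    """
    x = np.asarray(x, dtype=float)
    y = x.copy()
    y[1:] -= coeff * x[:-1]  # the first sample has no predecessor and is passed through
    return y

def esr_loss(target, output, coeff=0.95, eps=1e-12):
    """Error-to-signal ratio computed on pre-emphasised signals.

    Both signals go through the same filter, so the filter only
    reweights which frequencies contribute to the error.
    eps is an assumed guard against division by zero for silent targets.
    """
    t = pre_emphasis(target, coeff)
    o = pre_emphasis(output, coeff)
    return np.sum((t - o) ** 2) / (np.sum(t ** 2) + eps)
```

Because the filter is applied only inside the loss, it shapes training without adding any processing to the trained model at inference time, which is consistent with the abstract's remark about computational cost.
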
Original language: English
Title of host publication: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Publisher: IEEE
Pages: 251-255
Number of pages: 5
ISBN (Electronic): 978-1-5090-6631-5
ISBN (Print): 978-1-5090-6632-2
DOIs: https://doi.org/10.1109/ICASSP40776.2020.9052944
Publication status: Published - 4 May 2020
MoE publication type: A4 Article in a conference publication
Event: IEEE International Conference on Acoustics, Speech and Signal Processing - Barcelona, Spain
Duration: 4 May 2020 – 8 May 2020
Conference number: 45

Publication series

Name: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
ISSN (Print): 1520-6149
ISSN (Electronic): 2379-190X

Conference

Conference: IEEE International Conference on Acoustics, Speech and Signal Processing
Abbreviated title: ICASSP
Country: Spain
City: Barcelona
Period: 04/05/2020 – 08/05/2020

Keywords

  • Acoustic signal processing
  • Music technology
  • Psychoacoustics

  • Projects

    NordicSMC Aalto

    Liski, J., Välimäki, V., Pulkki, V., Wright, A., Fierro, L., Wirler, S. & Alary, B.

    01/01/2018 – 31/12/2023

    Project: Other external funding: Other foreign funding

    NordicSMC: Nordic Sound and Music Computing Network

    Prawda, K., Välimäki, V. & McCrea, M.

    01/01/2018 – 31/12/2023

    Project: Other external funding: Other foreign funding

  • Cite this

    Wright, A., & Välimäki, V. (2020). Perceptual Loss Function for Neural Modelling of Audio Systems. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 251-255). [9052944] (Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing). IEEE. https://doi.org/10.1109/ICASSP40776.2020.9052944