Sampling the user controls in neural modeling of audio devices

Otto Mikkonen*, Alec Wright, Vesa Välimäki

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

6 Downloads (Pure)

Abstract

This work studies neural modeling of nonlinear parametric audio circuits, focusing on how the diversity of settings of the target device user controls seen during training affects network generalization. To study the problem, a large corpus of training datasets is synthetically generated using SPICE simulations of two distinct devices, an analog equalizer and an analog distortion pedal. A proven recurrent neural network architecture is trained using each dataset. The difference in the datasets is in the sampling resolution of the device user controls and in their overall size. Based on objective and subjective evaluation of the trained models, a sampling resolution of five for the device parameters is found to be sufficient to capture the behavior of the target systems for the types of devices considered during the study. This result is desirable, since a dense sampling grid can be impractical to realize in the general case when no automated way of setting the device parameters is available, while collecting large amounts of data using a sparse grid only incurs small additional costs. Thus, the result provides guidance for efficient collection of training data for neural modeling of other similar audio devices.

Original languageEnglish
Article number26
Number of pages13
JournalEurasip Journal on Audio, Speech, and Music Processing
Volume2024
Issue number1
DOIs
Publication statusPublished - Dec 2024
MoE publication typeA1 Journal article-refereed

Keywords

  • Audio systems
  • Deep learning
  • Emulation

Fingerprint

Dive into the research topics of 'Sampling the user controls in neural modeling of audio devices'. Together they form a unique fingerprint.

Cite this