Sampling the user controls in neural modeling of audio devices

Otto Mikkonen*, Alec Wright, Vesa Välimäki

*Corresponding author for this work

Research output: Journal article > Article > Scientific > peer-reviewed


Abstract

This work studies neural modeling of nonlinear parametric audio circuits, focusing on how the diversity of settings of the target device user controls seen during training affects network generalization. To study the problem, a large corpus of training datasets is synthetically generated using SPICE simulations of two distinct devices, an analog equalizer and an analog distortion pedal. A proven recurrent neural network architecture is trained using each dataset. The difference in the datasets is in the sampling resolution of the device user controls and in their overall size. Based on objective and subjective evaluation of the trained models, a sampling resolution of five for the device parameters is found to be sufficient to capture the behavior of the target systems for the types of devices considered during the study. This result is desirable, since a dense sampling grid can be impractical to realize in the general case when no automated way of setting the device parameters is available, while collecting large amounts of data using a sparse grid only incurs small additional costs. Thus, the result provides guidance for efficient collection of training data for neural modeling of other similar audio devices.
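As a rough illustration of what a sampling resolution of five means for data collection, the sketch below enumerates every combination of control settings on a grid. It is not taken from the paper: the function name `control_grid`, the normalized 0 to 1 control range, the uniform spacing, and the choice of two controls are all assumptions made for the example.

```python
from itertools import product


def control_grid(num_controls, resolution, low=0.0, high=1.0):
    """Return every combination of control settings on a uniform grid.

    Hypothetical helper: uniform spacing and a normalized [low, high]
    range are assumptions, not details confirmed by the abstract.
    """
    if resolution < 2:
        raise ValueError("resolution must be at least 2")
    step = (high - low) / (resolution - 1)
    # Evenly spaced levels for a single control, endpoints included.
    levels = [low + i * step for i in range(resolution)]
    # Cartesian product: one tuple per distinct device configuration.
    return list(product(levels, repeat=num_controls))


# Assumed example: a device with two knobs sampled at five positions each.
grid = control_grid(num_controls=2, resolution=5)
print(len(grid))          # 25 configurations to record
print(grid[0], grid[-1])  # (0.0, 0.0) (1.0, 1.0)
```

The number of configurations grows as resolution raised to the power of the number of controls, which is why a dense grid quickly becomes impractical when each setting has to be dialed in by hand.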

Original language: English
Article number: 26
Number of pages: 13
Journal: EURASIP Journal on Audio, Speech, and Music Processing
Volume: 2024
Issue number: 1
Publication status: Published - Dec 2024
OKM publication type: A1 Original article in a scientific journal
