Abstract
In recent years, neural network-based black-box modeling of nonlinear audio effects has improved considerably. Present convolutional and recurrent models can model audio effects with long-term dynamics, but the models require many parameters, thus increasing the processing time. In this paper, we propose KLANN, a Koopman-Linearised Audio Neural Network structure that lifts a one-dimensional signal (mono audio) into a high-dimensional approximately linear state-space representation with nonlinear mapping, and then uses differentiable biquad filters to predict linearly within the lifted state-space. Results show that the proposed models match the high performance of the state-of-the-art neural models while having a more compact architecture, reducing the number of parameters by tenfold, and having interpretable components.
Original language | English |
---|---|
Pages (from-to) | 1169-1173 |
Number of pages | 5 |
Journal | IEEE Signal Processing Letters |
Volume | 31 |
DOIs | |
Publication status | Published - 16 Apr 2024 |
MoE publication type | A1 Journal article-refereed |
Keywords
- Closed box
- Discrete Fourier transforms
- Filters
- Frequency-domain analysis
- Logic gates
- Time-domain analysis
- Training
Fingerprint
Dive into the research topics of 'KLANN: Linearising Long-Term Dynamics in Nonlinear Audio Effects Using Koopman Networks'. Together they form a unique fingerprint.Equipment
-
Aalto Acoustics Lab
Ville Pulkki (Manager)
School of Electrical EngineeringFacility/equipment: Facility