DDSP-based Neural Waveform Synthesis of Polyphinic Guitar Performance From String-Wise Midi Input

Nicolas Jonason, Xin Wang, Erica Cooper, Lauri Juvela, Bob L.T. Sturm, Junichi Yamagishi

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

10 Downloads (Pure)

Abstract

We explore the use of neural synthesis for acoustic guitar from string-wise MIDI input. We propose four different systems and compare them with both objective metrics and subjective evaluation against natural audio and a sample-based baseline. We iteratively develop these four systems by making various considerations on the architecture and intermediate tasks, such as predicting pitch and loudness control features. We find that formulating the control feature prediction task as a classification task rather than a regression task yields better results. Furthermore, we find that our simplest proposed system, which directly predicts synthesis parameters from MIDI input performs the best out of the four proposed systems. Audio examples and code are available.

Original languageEnglish
Title of host publicationProceedings of the 27th International Conference on Digital Audio Effects (DAFx24)
PublisherUniversity of Surrey
Pages208-215
Number of pages8
Publication statusPublished - 2024
MoE publication typeA4 Conference publication
EventInternational Conference on Digital Audio Effects - University of Surrey, Guildford, United Kingdom
Duration: 3 Sept 20247 Sept 2024
Conference number: 27
https://dafx24.surrey.ac.uk/

Publication series

NameProceedings of the International Conference on Digital Audio Effects, DAFx
ISSN (Print)2413-6700
ISSN (Electronic)2413-6689

Conference

ConferenceInternational Conference on Digital Audio Effects
Abbreviated titleDAFX
Country/TerritoryUnited Kingdom
CityGuildford
Period03/09/202407/09/2024
Internet address

Fingerprint

Dive into the research topics of 'DDSP-based Neural Waveform Synthesis of Polyphinic Guitar Performance From String-Wise Midi Input'. Together they form a unique fingerprint.

Cite this