Spatial Audio Feature Discovery with Convolutional Neural Networks

Etienne Thuillier, Hannes Gamper, Ivan J. Tashev

    Research output: Article in book/conference proceedings, Conference article in proceedings, Scientific, Peer-reviewed

    26 Citations (Scopus)
    377 Downloads (Pure)

    Abstract

    The advent of mixed reality consumer products brings about a pressing need to develop and improve spatial sound rendering techniques for a broad user base. Despite a large body of prior work, the precise nature and importance of various sound localization cues and how they should be personalized for an individual user to improve localization performance is still an open research problem. Here we propose training a convolutional neural network (CNN) to classify the elevation angle of spatially rendered sounds and employing Layer-wise Relevance Propagation (LRP) on the trained CNN model. LRP provides saliency maps that can be used to identify spectral features used by the network for classification. These maps, in addition to the convolution filters learned by the CNN, are discussed in the context of listening tests reported in the literature. The proposed approach could potentially provide an avenue for future studies on modeling and personalization of head-related transfer functions (HRTFs).

    Original language: English
    Title of host publication: 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018 - Proceedings
    Publisher: IEEE
    Pages: 6797-6801
    Number of pages: 5
    Volume: 2018-April
    ISBN (electronic): 978-1-5386-4658-8
    ISBN (print): 978-1-5386-4659-5
    DOI (permanent links)
    Status: Published, 10 Sep 2018
    MoE publication type: A4 Article in conference proceedings
    Event: IEEE International Conference on Acoustics, Speech, and Signal Processing - Calgary, Canada
    Duration: 15 Apr 2018 to 20 Apr 2018
    https://2018.ieeeicassp.org/

    Publication series

    Name: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
    ISSN (electronic): 2379-190X

    Conference

    Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing
    Abbreviated title: ICASSP
    Country/Territory: Canada
    City: Calgary
    Period: 15/04/2018 to 20/04/2018
    Internet address
