Categorization of Natural Dynamic Audiovisual Scenes

Olli Rummukainen, Jenni Radun, Toni Virtanen, Ville Pulkki

Research output: Contribution to journalArticleScientificpeer-review

14 Citations (Scopus)
311 Downloads (Pure)

Abstract

This work analyzed the perceptual attributes of natural dynamic audiovisual scenes. We presented thirty participants with 19 natural scenes in a similarity categorization task, followed by a semi-structured interview. The scenes were reproduced with an immersive audiovisual display. Natural scene perception has been studied mainly with unimodal settings, which have identified motion as one of the most salient attributes related to visual scenes, and sound intensity along with pitch trajectories related to auditory scenes. However, controlled laboratory experiments with natural multimodal stimuli are still scarce. Our results show that humans pay attention to similar perceptual attributes in natural scenes, and a two-dimensional perceptual map of the stimulus scenes and perceptual attributes was obtained in this work. The exploratory results show the amount of movement, perceived noisiness, and eventfulness of the scene to be the most important perceptual attributes in naturalistically reproduced real-world urban environments. We found the scene gist properties openness and expansion to remain as important factors in scenes with no salient auditory or visual events. We propose that the study of scene perception should move forward to understand better the processes behind multimodal scene processing in real-world environments. We publish our stimulus scenes as spherical video recordings and sound field recordings in a publicly available database.
Original languageEnglish
Article numbere95848
Pages (from-to)1-14
JournalPloS one
Volume9
Issue number5
DOIs
Publication statusPublished - 2014
MoE publication typeA1 Journal article-refereed

Fingerprint Dive into the research topics of 'Categorization of Natural Dynamic Audiovisual Scenes'. Together they form a unique fingerprint.

Cite this