Blind Identification of Binaural Room Impulse Responses from Smart Glasses

Thomas Deppisch, Nils Meyer-Kahlen, Sebastia V.Amengual Gari

Research output: Contribution to journalArticleScientificpeer-review

1 Citation (Scopus)

Abstract

Smart glasses are increasingly recognized as a key medium for augmented reality, offering a hands-free platform with integrated microphones and non-ear-occluding loudspeakers to seamlessly mix virtual sound sources into the real-world acoustic scene. To convincingly integrate virtual sound sources, the room acoustic rendering of the virtual sources must match the real-world acoustics. Information about a user's acoustic environment however is typically not available. This work uses a microphone array in a pair of smart glasses to blindly identify binaural room impulse responses (BRIRs) from a few seconds of speech in the real-world environment. The proposed method uses dereverberation and beamforming to generate a pseudo reference signal that is used by a multichannel Wiener filter to estimate room impulse responses which are then converted to BRIRs. The multichannel room impulse responses can be used to estimate room acoustic parameters which is shown to outperform baseline algorithms in the estimation of reverberation time and direct-To-reverberant energy ratio. Results from a listening experiment further indicate that the estimated BRIRs often reproduce the real-world room acoustics perceptually more convincingly than measured BRIRs from other rooms of similar size.

Original languageEnglish
Pages (from-to)4052-4065
Number of pages14
JournalIEEE/ACM Transactions on Audio Speech and Language Processing
Volume32
Early online date5 Sept 2024
DOIs
Publication statusPublished - 2024
MoE publication typeA1 Journal article-refereed

Keywords

  • Augmented Reality
  • Binaural Room Impulse Response
  • Blind System Identification
  • Microphone Array
  • Smart Glasses

Fingerprint

Dive into the research topics of 'Blind Identification of Binaural Room Impulse Responses from Smart Glasses'. Together they form a unique fingerprint.

Cite this