Blind Room Volume Estimation from Single-channel Noisy Speech

A. F. Genovese, H. Gamper, V. Pulkki, N. Raghuvanshi, I. J. Tashev

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

5 Citations (Scopus)
194 Downloads (Pure)

Abstract

Recent work on acoustic parameter estimation indicates that geometric room volume can be useful for modeling the character of an acoustic environment. However, estimating volume from audio signals remains a challenging problem. Here we propose using a convolutional neural network model to estimate the room volume blindly from reverberant single-channel speech signals in the presence of noise. The model is shown to produce estimates within approximately a factor of two to the true value, for rooms ranging in size from small offices to large concert halls.
Original languageEnglish
Title of host publication44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019; Brighton; United Kingdom; 12-17 May 2019 : Proceedings
PublisherIEEE
Pages231-235
Number of pages5
ISBN (Electronic)978-1-4799-8131-1
ISBN (Print)978-1-4799-8132-8
DOIs
Publication statusPublished - 1 May 2019
MoE publication typeA4 Article in a conference publication
EventIEEE International Conference on Acoustics, Speech, and Signal Processing - Brighton, United Kingdom
Duration: 12 May 201917 May 2019
Conference number: 44

Publication series

NameProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
ISSN (Print)1520-6149
ISSN (Electronic)2379-190X

Conference

ConferenceIEEE International Conference on Acoustics, Speech, and Signal Processing
Abbreviated titleICASSP
CountryUnited Kingdom
CityBrighton
Period12/05/201917/05/2019

Keywords

  • Acoustics
  • Volume measurement
  • Training
  • Solid modeling
  • Noise measurement
  • Data models
  • Acoustic measurements
  • Room acoustics
  • room size
  • non-intrusive parameter estimation
  • signal processing
  • convolutional neural network

Fingerprint Dive into the research topics of 'Blind Room Volume Estimation from Single-channel Noisy Speech'. Together they form a unique fingerprint.

Cite this