Skip to main navigation Skip to search Skip to main content

Time Delay Estimation from Mixed Multispeaker Speech Signals Using Single Frequency Filtering

  • B. H.V.S. Narayana Murthy
  • , B. Yegnanarayana
  • , Sudarsana Reddy Kadiri*
  • *Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

8 Citations (Scopus)
158 Downloads (Pure)

Abstract

A method is proposed for time delay estimation (TDE) from mixed source (speaker) signals collected at two spatially separated microphones. The key idea in this proposal is that the crosscorrelation between corresponding segments of the mixed source signals is computed using the outputs of single frequency filtering (SFF) obtained at several frequencies, rather than using the collected waveforms directly. The advantage of the SFF output is that it will have high signal-to-noise ratio regions in both time and frequency domains. Also it gives multiple evidences, one from each of the SFF outputs. These multiple evidences are combined to obtain robustness in the TDE. The estimated time delays can be used to determine the number of speakers present in the mixed signals. The TDE is shown to be robust against different types and levels of degradations. The results are shown for actual mixed signals collected at two spatially separated microphones in a live laboratory environment, where the mixed signals contain speech from several spatially distributed speakers.

Original languageEnglish
JournalCircuits, Systems, and Signal Processing
DOIs
Publication statusPublished - 1 Jan 2019
MoE publication typeA1 Journal article-refereed

Funding

Open access funding provided by Aalto University. The second author would like to thank the Indian National Science Academy (INSA) for their support. The third author would like to thank the Academy of Finland (Project 312490) for supporting his stay in Finland as a Postdoctoral Researcher.

Keywords

  • Crosscorrelation
  • Multispeaker speech
  • Number of speakers
  • Single frequency filtering
  • Speech analysis
  • Time delay estimation

Fingerprint

Dive into the research topics of 'Time Delay Estimation from Mixed Multispeaker Speech Signals Using Single Frequency Filtering'. Together they form a unique fingerprint.
  • Interdisciplinary research on statistical parametric speech synthesis

    Alku, P. (Principal investigator), Bäckström, T. (Project Member), Nonavinakere Prabhakera, N. (Project Member), Bollepalli, B. (Project Member), Murtola, T. (Project Member), Airaksinen, M. (Project Member) & Juvela, L. (Project Member)

    01/01/201831/12/2019

    Project: Academy of Finland: Other research funding

Cite this