Advances in phase-aware signal processing in speech communication

Research output: Contribution to journal › Article › Scientific › peer-review

Researchers

  • Pejman Mowlaee
  • Rahim Saeidi
  • Yannis Stylianou

Research units

  • Graz University of Technology
  • University of Crete

Abstract

During the past three decades, the issue of processing spectral phase has been largely neglected in speech applications. There is no doubt that the interest of the speech processing community in the use of phase information across a wide spectrum of speech technologies, from automatic speech and speaker recognition to speech synthesis, from speech enhancement and source separation to speech coding, is constantly increasing. In this paper, we elaborate on why phase was believed to be unimportant in each application. We provide an overview of advancements in phase-aware signal processing with applications to speech, showing that phase-aware speech processing can be beneficial in many cases and can complement the solutions that magnitude-only methods suggest. Our goal is to show that phase-aware signal processing is an important emerging field with high potential in current speech communication applications. The paper provides an extended and up-to-date bibliography on the topic of phase-aware speech processing, aiming to give interested readers the background necessary to follow recent advancements in the area. Our review extends the step initiated by the special session we organized and exemplifies the usefulness of spectral phase information in a wide range of speech processing applications. Finally, the overview provides some directions for future work.

Details

Original language: English
Pages (from-to): 1-29
Journal: Speech Communication
Volume: 81
Publication status: Published - Jul 2016
MoE publication type: A1 Journal article-refereed

Research areas

  • Automatic speech recognition, Phase-aware speech processing, Phase-based features, Signal enhancement, Speaker recognition, Speech analysis, Speech coding, Speech synthesis

ID: 4462817