This book constitutes of the major results of the EU COST (European Cooperation in the field of Scientific and Technical Research) Action 277: NSP - Nonlinear Speech Processing - running from April 2001 to June 2005. The results were presented at the last meeting of the management committee of COST Action 277, held in Heraklion, Crete, Greece on September 20-23, 2005 during the Workshop on Nonlinear Speech Processing, WNSP 2005.The 13 revised full papers in this state-of-the-art survey were carefully reviewed and selected for inclusion in the book and are preceded with an introductory leading-in by the editors. The articles present overviews of the four years research combining linear and non linear approaches for processing the speech signal. The aim of this book is to provide an additional and/or an alternative way to the traditional approach of linear speech processing and be mainly used by the researcher working in the domain. The papers cover areas such as speech analysis for speech synthesis, speech recognition, speech-non speech discrimination and voice quality assessment, speaker recognition/verification from a natural or modified speech signal, speech recognition, speech enhancement, and emotional state detection.
Author(s): Jacqueline Walker, Peter Murphy (auth.), Yannis Stylianou, Marcos Faundez-Zanuy, Anna Esposito (eds.)
Series: Lecture Notes in Computer Science 4391
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2007
Language: English
Pages: 276
Tags: User Interfaces and Human Computer Interaction; Artificial Intelligence (incl. Robotics); Language Translation and Linguistics; Pattern Recognition; Image Processing and Computer Vision
Front Matter....Pages -
A Review of Glottal Waveform Analysis....Pages 1-21
Rahmonic Analysis of Signal Regularity in Synthesized and Human Voice....Pages 22-40
Spectral Analysis of Speech Signals Using Chirp Group Delay....Pages 41-57
Towards Neurocomputational Speech and Sound Processing....Pages 58-77
Extraction of Speech-Relevant Information from Modulation Spectrograms....Pages 78-88
On the Detection of Discontinuities in Concatenative Speech Synthesis....Pages 89-100
Voice Disguise and Automatic Detection: Review and Perspectives....Pages 101-117
Audio-visual Identity Verification: An Introductory Overview....Pages 118-134
Text-Independent Speaker Verification: State of the Art and Challenges....Pages 135-169
Nonlinear Predictive Models: Overview and Possibilities in Speaker Recognition....Pages 170-189
SVMs for Automatic Speech Recognition: A Survey....Pages 190-216
Nonlinear Speech Enhancement: An Overview....Pages 217-248
The Amount of Information on Emotional States Conveyed by the Verbal and Nonverbal Channels: Some Perceptual Data....Pages 249-268
Back Matter....Pages -