Speech and Audio Processing for Coding, Enhancement and Recognition

This document was uploaded by one of our users. The uploader already confirmed that they had the permission to publish it. If you are author/publisher or own the copyright of this documents, please report to us by using this DMCA report form.

Simply click on the Download Book button.

Yes, Book downloads on Ebookily are 100% Free.

Sometimes the book is free on Amazon As well, so go ahead and hit "Search on Amazon"

This book describes the basic principles underlying the generation, coding, transmission and enhancement of speech and audio signals, including advanced statistical and machine learning techniques for speech and speaker recognition with an overview of the key innovations in these areas. Key research undertaken in speech coding, speech enhancement, speech recognition, emotion recognition and speaker diarization are also presented, along with recent advances and new paradigms in these areas.

Author(s): Tokunbo Ogunfunmi, Roberto Togneri, Madihally (Sim) Narasimha (eds.)
Edition: 1
Publisher: Springer-Verlag New York
Year: 2015

Language: English
Pages: 345
Tags: Signal, Image and Speech Processing; User Interfaces and Human Computer Interaction; Multimedia Information Systems

Front Matter....Pages i-x
Front Matter....Pages 1-1
From “Harmonic Telegraph” to Cellular Phones....Pages 3-17
Challenges in Speech Coding Research....Pages 19-39
Scalable and Multi-Rate Speech Coding for Voice-over-Internet Protocol (VoIP) Networks....Pages 41-74
Recent Speech Coding Technologies and Standards....Pages 75-109
Front Matter....Pages 111-111
Ensemble Learning Approaches in Speech Recognition....Pages 113-152
Deep Dynamic Models for Learning Hidden Representations of Speech Features....Pages 153-195
Speech Based Emotion Recognition....Pages 197-228
Speaker Diarization: An Emerging Research....Pages 229-277
Front Matter....Pages 279-279
Maximum A Posteriori Spectral Estimation with Source Log-Spectral Priors for Multichannel Speech Enhancement....Pages 281-317
Modulation Processing for Speech Enhancement....Pages 319-345