Incorporating Knowledge Sources into Statistical Speech Recognition

This document was uploaded by one of our users. The uploader already confirmed that they had the permission to publish it. If you are author/publisher or own the copyright of this documents, please report to us by using this DMCA report form.

Simply click on the Download Book button.

Yes, Book downloads on Ebookily are 100% Free.

Sometimes the book is free on Amazon As well, so go ahead and hit "Search on Amazon"

Incorporating Knowledge Sources into Statistical Speech Recognition offers solutions for enhancing the robustness of a statistical automatic speech recognition (ASR) system by incorporating various additional knowledge sources while keeping the training and recognition effort feasible.

The authors provide an efficient general framework for incorporating knowledge sources into state-of-the-art statistical ASR systems. This framework, which is called GFIKS (graphical framework to incorporate additional knowledge sources), was designed by utilizing the concept of the Bayesian network (BN) framework. This framework allows probabilistic relationships among different information sources to be learned, various kinds of knowledge sources to be incorporated, and a probabilistic function of the model to be formulated.

Incorporating Knowledge Sources into Statistical Speech Recognition demonstrates how the statistical speech recognition system may incorporate additional information sources by utilizing GFIKS at different levels of ASR. The incorporation of various knowledge sources, including background noises, accent, gender and wide phonetic knowledge information, in modeling is discussed theoretically and analyzed experimentally.

Author(s): Wolfgang Minker, Satoshi Nakamura, Konstantin Markov, Sakriani Sakti (auth.)
Series: Lecture Notes in Electrical Engineering 42
Edition: 1
Publisher: Springer US
Year: 2009

Language: English
Pages: 196
Tags: Electrical Engineering;Computer Communication Networks;Communications Engineering, Networks;Acoustics;Signal, Image and Speech Processing

Front Matter....Pages 1-20
Introduction and Book Overview....Pages 1-17
Statistical Speech Recognition....Pages 1-35
Graphical Framework to Incorporate Knowledge Sources....Pages 1-23
Speech Recognition Using GFIKS....Pages 1-59
Conclusions and Future Directions....Pages 1-5
Back Matter....Pages 1-47