In this book, hierarchical structures based on neural networks are investigated for automatic speech recognition. These structures are mainly evaluated within the phoneme recognition task under the Hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) paradigm. The baseline hierarchical scheme consists of two levels each which is based on a Multilayered Perceptron (MLP). Additionally, the output of the first level is used as an input for the second level. This system can be substantially speeded up by removing the redundant information contained at the output of the first level.
The subject of this study is the role of hierarchical structures, based on neural networks, in identifying phonemes in automated speech recognition systems. It shows how the artificial neural network paradigm can simplify the analysis of spoken language.
Background in Speech Recognition.- Phoneme Recognition Task.- Hierarchical Approach and Downsampling Schemes.- Extending the Hierarchical Scheme: Inter and Intra Phonetic Information.- Theoretical framework for phoneme recognition analysis.
From the reviews:
This brief book comes packed with useful information about some novel techniques for the recognition of speech building blocks known as phonemes. & it is brimming with useful and well-presented information. I recommend it for graduate students in the field, as well as for practicing professionals. (Vladimir Botchev, Computing Reviews, May, 2013)
In this book, hierarchical structures based on neural networks are investigated for automatic speech recognition. These structures are evaluated on the phoneme recognition task where a ?Hybrid Hidden Markov Model/Artificial Neural Network paradigm is used. The baseline hierarchical scheme consists of two levels each which is based on a Multilayered Perceptron. Additionally, the output of the first level serves as a second level input. The computational speed of the phoneme recognizer can be sublSA