Abstract
A simple and successful phoneme recognizer in a hierarchical ANN framework is proposed in [Pinto 08b]. In Section 3.4 we could observed that this method compares favorably to hitherto approaches. In this scheme, phoneme posteriors are estimated by a two-level hierarchical structure. In the first level, a MLP estimates intermediate phoneme posteriors based on a temporal window of cepstral features. In the second level, another MLP estimates final phoneme posteriors based on a temporal window of intermediate posterior features. The final phoneme posteriors are then input to a Viterbi decoder.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2013 Springer Berlin Heidelberg
About this chapter
Cite this chapter
Vasquez, D., Gruhn, R., Minker, W. (2013). Hierarchical Approach and Downsampling Schemes. In: Hierarchical Neural Network Structures for Phoneme Recognition. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34425-1_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-34425-1_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34424-4
Online ISBN: 978-3-642-34425-1
eBook Packages: EngineeringEngineering (R0)