Год выпуска: 2012 Автор: Mohammad Nasiruddin Издательство: LAP Lambert Academic Publishing Страниц: 84 ISBN: 9783847347316
Описание
This monograph discusses the dominance of Local Features (LFs), as input to the Multilayer Neural Network (MLN), extracted from a Bangla input speech over Mel Frequency Cepstral Coefficients (MFCCs). Here, LF-based method comprises three stages- (i) LF extraction from input speech, (ii) Phoneme probabilities extraction using MLN from LF and (iii) The Hidden Markov Model (HMM) based classifier to obtain more accurate phoneme strings. In the experiments on Bangla speech corpus prepared by us, it is observed that the LF-based Automatic Speech Recognition system provides higher phoneme correct rate than the MFCC-based system. Moreover, the proposed system requires fewer mixture components in the HMMs. Moreover, this paper reviews some of the key advances in several areas of automatic speech recognition. We also illustrate, by examples, how these key advances can be used for continuous speech recognition of Bangla. Finally we elaborate the requirements in designing successful real-world...