Hidden Markov Model Based Automatic Speech Recognition Using Mel Frequency Cepstral Coefficients in Nepalese

List of Tables	第7-8页
List of Figures	第8-9页
Abstract	第9页
Acknowledgements	第10-11页
Chapter 1 Introduction to ASR	第11-19页
1.1 Introduction	第11页
1.2 Are Speech Recognition and Voice Recognition Synonymous?	第11-12页
1.3 Why Speech Recognition? Uses and Applications	第12-13页
1.4 Types of Speech Recognition	第13页
1.5 An overview of ASR	第13-14页
1.6 Project Outline	第14-15页
1.7 Understanding Digital Audio	第15-17页
1.8 The Hidden Markov Model Toolkit (HTK)	第17-19页
Chapter 2 Understanding the features of Nepali Language	第19-28页
2.1 Devnagari Script	第19-20页
2.2 Romanization of Nepali words	第20-26页
2.3 Representation of Pronunciation of Nepali Words	第26-28页
Chapter 3 Classification of Nepalese phonemes	第28-37页
3.1 What is a phoneme?	第28-29页
3.2 Why classification?	第29页
3.3 Consonants and Vowels	第29-37页
3.3.1 Consonants	第30-35页
3.3.2 Vowels	第35-37页
Chapter 4 Hidden Markov Model	第37-52页
4.1 Definition	第37-39页
4.2 Assumptions in the theory of HMMs	第39页
4.3 Three main questions on HMMs	第39-52页
4.3.1 The Evaluation Problem and the Forward Algorithm	第40-42页
4.3.2 The Decoding Problem and the Viterbi Algorithm	第42-43页
4.3.3 The Learning Problem	第43-52页
Chapter 5 Understanding Hidden Markov Model Tool Kit (HTK)	第52-57页
5.1 HTK Software Architecture	第52-53页
5.2 The Toolkit	第53-57页
5.2.1 Data Preparation Tools	第54-55页
5.2.2 Training Tools	第55页
5.2.3 Recognition Tools	第55-56页
5.2.4 Analysis Tool	第56-57页
Chapter 6 Implementation Details	第57-78页
6.1 Preparing Target Nepali Vocabulary	第57-61页
6.1.1 Initialization	第59-60页
6.1.2 Transcription of Devnagari files	第60页
6.1.3 Making new entries in the dictionary database	第60-61页
6.2 Language Model	第61-63页
6.3 Recording the speech data	第63页
6.4 Creating the Phoneme-level Transcriptions	第63-66页
6.5 Parameterization of speech data	第66-67页
6.6 Creating monophone HMMs	第67-70页
6.7 Creating tied state tri-phone HMMs	第70-74页
6.8 Evaluation	第74-76页
6.9 Problems Encountered	第76-78页
References	第78-79页