Deep Learning Based Urdu Optical Character Recognition

摘要	第5-7页
ABSTRACT	第7-8页
Biographical Sketch	第9-21页
CHAPTER 1 Optical Character Recognition	第21-44页
1.1 Background of Optical Character Recognition (OCR)	第21-25页
1.2 Urdu Language	第25-27页
1.2.1 Urdu Character Set	第25页
1.2.2 Diacritics	第25-26页
1.2.3 Writing Styles	第26-27页
1.3 Complexities of Nastaleeq Urdu Text	第27-33页
1.4 Segmentation of Urdu Nastaleeq text	第33-34页
1.5 OCRs for Urdu Language	第34-40页
1.5.1 Segmentation based Approach	第35-39页
1.5.2 Segmentation Free OCR	第39-40页
1.6 Motivation	第40-41页
1.7 Contribution of Thesis	第41-42页
1.8 Thesis Structure	第42-44页
CHAPTER 2 Preliminaries	第44-64页
2.1 Projection Profile Method	第44-45页
2.2 Salt-and-Pepper Noise (SP)	第45页
2.3 X-Y Cut	第45-47页
2.4 Smearing Algorithm	第47页
2.5 Rectified Linear Unit (ReLU)	第47-48页
2.6 Softmax Function	第48页
2.7 Artificial Neural Network (ANN)	第48-54页
2.7.1 Common Activation Functions	第49-50页
2.7.2 Network Topologies	第50-51页
2.7.3 Learning Paradigms	第51-52页
2.7.4 Perceptron	第52-53页
2.7.5 Multilayer Perceptron	第53-54页
2.8 Autoencoder	第54-56页
2.8.1 Denoising Autoencoder	第55页
2.8.2 Stacked Denoising Autoencoder	第55-56页
2.9 Hidden Morkov Model	第56页
2.10 Support Vector Machines	第56-57页
2.11 Recurrent Neural Network	第57-58页
2.12 Long Short Term Memory	第58-60页
2.12.1 Bidirectional Long Short Term Memory	第59-60页
2.13 Connectionist Temporal Classification	第60页
2.14 Datasets	第60-61页
2.15 Project	第61-62页
2.16 Accuracy Measurement	第62页
2.17 Summary	第62-64页
CHAPTER 3 Segmentation for Printed Urdu Text	第64-96页
3.1 Introduction to Segmentation and Challenges for Robust Urdu OCRs	第64-66页
3.2 Related work	第66-70页
3.2.1 Line segmentation	第66-68页
3.2.2 Ligature Segmentation	第68-69页
3.2.3 Motivation	第69-70页
3.2.4 Contribution	第70页
3.3 Line Segmentation	第70-81页
3.3.1 Curved Line Split Algorithm	第70-73页
3.3.2 Line Segmentation Algorithm	第73-81页
3.4 Ligature segmentation	第81-84页
3.4.1 Extract Information	第82页
3.4.2 Decide Primary and Secondary Components	第82-83页
3.4.3 Allocate Secondary to Primary Components	第83-84页
3.5 Results and analysis	第84-95页
3.5.1 Results of the Line Segmentation Algorithm	第84-86页
3.5.2 Comparison of Line Segmentation Algorithms	第86-91页
3.5.3 Results of the proposed ligature segmentation algorithm	第91-95页
3.6 Conclusion	第95页
3.7 Summary	第95-96页
CHAPTER 4 Stacked Denoising Autoencoder based Urdu Nastaleeq LigatureRecognition	第96-114页
4.1 Introduction	第96-99页
4.1.1 Prevalent Works	第96-98页
4.1.2 Motivation	第98页
4.1.3 Contribution	第98-99页
4.2 Learning model	第99-105页
4.2.1 Autoencoder	第99-100页
4.2.2 Urdu Ligature Recognition Stacked Denoising Autoencoder (ULR-SDA)	第100-105页
4.3 Experiments	第105-108页
4.3.1 Dataset and Feature Extraction	第105-106页
4.3.2 Experiment Setup	第106-107页
4.3.3 Training ULR-SDA	第107-108页
4.4 Results	第108-112页
4.4.1 Comparison of ULR-SDA and SVM	第108-109页
4.4.2 Structure of ULR-SDA	第109-110页
4.4.3 Dimensions	第110-112页
4.5 Conclusion	第112页
4.6 Summary	第112-114页
CHAPTER 5 Ligature based Urdu Nastaleeq Sentence Recognition usingGated Bidirectional Long Short Term Memory	第114-135页
5.1 Introduction	第114-119页
5.1.1 Motivation	第115-119页
5.1.2 Contribution	第119页
5.2 Overview of Recurrent Neural Network and Related Work	第119-122页
5.3 Proposed Model	第122-126页
5.4 Dataset Description and Preprocessing	第126-127页
5.5 Network Parameters	第127-128页
5.6 Experiments and Results	第128-134页
5.6.1 Comparison of GBLSTM with Prevalent LSTM based Urdu Recognition Systems	第129-131页
5.6.2 Comparison of GBLSTM with Prevalent Ligature Based Urdu RecognitionSystems	第131页
5.6.3 Number of Ligature Classes Incorrectly Predicted by GBLSTM	第131-133页
5.6.4 Top 10 Misclassified Classes	第133-134页
5.6.5 Performance Analysis on the Basis of Full Predicted Sentences	第134页
5.7 Conclusion	第134页
5.8 Summary	第134-135页
Chapter 6 Conclusions and Potential Future Directions	第135-138页
6.1 Research Insights	第135-137页
6.2 Potential Future Recommendations	第137-138页
Bibliography	第138-151页
Acknowledgement	第151-152页
List of Publications	第152页