Arabic Handwritten Text Recognition and Writer Identification
Supervisor: Asst. Prof. Dr. Alia K. Abdul Hassan
Prepared by: Mustafa Salam Kadhm
2017
Ministry of Higher Education & Scientific Research
University of Technology
Department of Computer Science
Contents
1. Problem Statement
2. Aim of Thesis
3. Proposed System
4. Experiments and Results
5. Conclusions
Problem Statement
• Most governments and organizations hold handwritten documents that need to be made editable and searchable.
• Arabic handwritten text recognition is more complex than recognition of other handwritten languages because Arabic script is cursive in nature.
• Existing recognition systems that depend on character segmentation achieve poor accuracy.
• The recognition results of the existing systems are unauthenticated.
• Arabic handwritten databases are not readily available.
Aim of Thesis
• Develop an accurate Arabic handwritten text recognition system based on multi-scale feature extraction methods and an SVM classifier.
• Employ the proposed system in a security application by identifying the writer of the input handwritten text.
• Develop an Arabic handwritten database of color and gray handwritten images that supports character, word, and text recognition systems and can be used in security applications.
The Proposed System Architecture
[Figure: architecture of the proposed system, from input to output]
Image Acquisition
[Figure: sample acquired images in gray and color]
Segmentation
[Figure: segmentation steps. Panel labels: input image; applying Sobel filter; applying dilation and filling; drawing rectangles around the objects; drawing the obtained rectangles on the original image; handwritten sub-images]
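The rectangle-drawing part of the segmentation stage can be illustrated with a minimal sketch. It assumes a binary input image and covers only dilation, connected-object bounding boxes, and cropping; the Sobel filtering and hole filling named on the slide are omitted, and the function names (`dilate`, `bounding_boxes`, `crop`) are hypothetical, not taken from the thesis.

```python
from collections import deque

def dilate(img):
    """3x3 binary dilation: a pixel becomes 1 if it or any neighbour is 1."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if img[y][x]:
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if 0 <= ny < h and 0 <= nx < w:
                            out[ny][nx] = 1
    return out

def bounding_boxes(img):
    """Return (top, left, bottom, right) for each 8-connected foreground object."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if img[y][x] and not seen[y][x]:
                top, left, bottom, right = y, x, y, x
                queue = deque([(y, x)])
                seen[y][x] = True
                while queue:  # breadth-first flood fill of one object
                    cy, cx = queue.popleft()
                    top, bottom = min(top, cy), max(bottom, cy)
                    left, right = min(left, cx), max(right, cx)
                    for dy in (-1, 0, 1):
                        for dx in (-1, 0, 1):
                            ny, nx = cy + dy, cx + dx
                            if (0 <= ny < h and 0 <= nx < w
                                    and img[ny][nx] and not seen[ny][nx]):
                                seen[ny][nx] = True
                                queue.append((ny, nx))
                boxes.append((top, left, bottom, right))
    return boxes

def crop(img, box):
    """Cut the rectangle described by box out of the original image."""
    top, left, bottom, right = box
    return [row[left:right + 1] for row in img[top:bottom + 1]]

# Toy binary image: two nearby strokes on the left, one isolated stroke on the right.
image = [
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 0, 1, 0, 0, 0, 1, 0],
    [0, 1, 0, 1, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
]
boxes = bounding_boxes(dilate(image))
sub_images = [crop(image, box) for box in boxes]
print(boxes)  # [(0, 0, 3, 4), (0, 6, 3, 8)]
```

In this toy input, dilation merges the two nearby strokes into one object, so they end up in the same sub-image, while the isolated stroke gets its own rectangle.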
Preprocessing (Image Thresholding)
[Figure: gray input images and binary output images for samples from the AHDB, IESK-arDB, and proposed databases]
Preprocessing (Noise Removal)
[Figure: two sample images after noise removal]
Preprocessing
[Figure: preprocessing outputs. Labels: Image 1, Image 2, Edge Detection, Image 3]
Preprocessing (Image Normalization)
[Figure: Image 1, Image 2, and Image 3 normalized to 128 x 128]
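Size normalization to 128 x 128 can be sketched as below. The slides do not state which interpolation method is used, so nearest-neighbour sampling is an assumption here, and `resize_nearest` is a hypothetical helper name.

```python
def resize_nearest(img, out_h=128, out_w=128):
    """Resize a 2D image (list of rows) by nearest-neighbour sampling."""
    in_h, in_w = len(img), len(img[0])
    return [
        [img[y * in_h // out_h][x * in_w // out_w] for x in range(out_w)]
        for y in range(out_h)
    ]

# Toy 2 x 2 image scaled up to the 128 x 128 size named on the slide.
small = [[0, 1],
         [1, 0]]
normalized = resize_nearest(small)
print(len(normalized), len(normalized[0]))  # 128 128
```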
Features Base1 Construction
Structural Features
[Figure: structural features extracted from Image 1]
Statistical Features
[Figure: Image 2 divided into four blocks (Block 1 to Block 4) for statistical features]
Discrete Cosine Transform (DCT) Features
Input: Image 2. Take the first 10 coefficients in zigzag order.
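Taking the first 10 DCT coefficients in zigzag order can be sketched as follows. This is a direct, unoptimized orthonormal 2D DCT-II on a toy 4 x 4 block; the block size, the normalization, and the function names are assumptions for illustration, not details from the thesis.

```python
import math

def dct2(block):
    """Orthonormal 2D DCT-II of a square block (direct O(n^4) evaluation)."""
    n = len(block)

    def alpha(k):
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    out = [[0.0] * n for _ in range(n)]
    for u in range(n):
        for v in range(n):
            total = 0.0
            for y in range(n):
                for x in range(n):
                    total += (block[y][x]
                              * math.cos((2 * y + 1) * u * math.pi / (2 * n))
                              * math.cos((2 * x + 1) * v * math.pi / (2 * n)))
            out[u][v] = alpha(u) * alpha(v) * total
    return out

def zigzag_indices(n):
    """(row, col) pairs of an n x n matrix in JPEG-style zigzag order."""
    order = []
    for s in range(2 * n - 1):  # s = row + col selects one anti-diagonal
        diag = [(r, s - r) for r in range(n) if 0 <= s - r < n]
        # Even diagonals run bottom-left to top-right, odd ones the other way.
        order.extend(reversed(diag) if s % 2 == 0 else diag)
    return order

def dct_features(block, count=10):
    """First `count` DCT coefficients of the block, read in zigzag order."""
    coeffs = dct2(block)
    return [coeffs[r][c] for r, c in zigzag_indices(len(block))[:count]]

# A constant block has all of its energy in the first (DC) coefficient.
flat = [[1.0] * 4 for _ in range(4)]
features = dct_features(flat)
```

Zigzag ordering visits low-frequency coefficients first, which is why truncating to the first few entries keeps most of the coarse shape information.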
Modified Histogram of Oriented Gradient (MHOG1) Features
[Figure: gradients of Image 3 along the X-axis and Y-axis, with the resulting magnitude and orientation]
[Figure: interpolation votes of gradient orientation]
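The gradient, magnitude, orientation, and vote-interpolation steps can be sketched for a single cell as below. This follows the standard HOG recipe with central-difference gradients and 9 unsigned orientation bins; it is not the thesis's modified variant (MHOG1 uses a proposed edge detection filter whose definition is not given in the slides), and `cell_histogram` is a hypothetical name.

```python
import math

def cell_histogram(img, bins=9):
    """Orientation histogram of one cell, with votes interpolated between bins."""
    h, w = len(img), len(img[0])
    width = 180.0 / bins  # bin width in degrees (unsigned orientation)
    hist = [0.0] * bins
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = img[y][x + 1] - img[y][x - 1]  # X-axis gradient
            gy = img[y + 1][x] - img[y - 1][x]  # Y-axis gradient
            mag = math.hypot(gx, gy)
            if mag == 0:
                continue
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            # Position relative to the bin centres; the vote (weighted by
            # magnitude) is split linearly between the two nearest bins.
            pos = angle / width - 0.5
            lo = math.floor(pos)
            frac = pos - lo
            hist[lo % bins] += mag * (1.0 - frac)
            hist[(lo + 1) % bins] += mag * frac
    return hist

# Toy cell containing a vertical edge: every gradient points along the X-axis.
cell = [[0, 0, 1, 1] for _ in range(4)]
histogram = cell_histogram(cell)
```

An orientation of 0 degrees lies exactly between the centres of the last and first bins, so in this toy cell each pixel's vote is shared equally between them.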
Feature Vector
Classification
Post-processing
Proposed Arabic lexicon

| Arabic Text | Unicode |
|---|---|
| الله | [1575, 1604, 1604, 1607] |
| عبد | [1593, 1576, 1583] |
| العام | [1575, 1604, 1593, 1575, 1605] |
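The lexicon stores each word as a list of decimal Unicode code points, so mapping an entry back to editable text is a direct lookup. A minimal sketch, with `decode` and `encode` as hypothetical helper names:

```python
def decode(code_points):
    """Turn a list of decimal Unicode code points into a string."""
    return "".join(chr(cp) for cp in code_points)

def encode(word):
    """Turn a string into its list of decimal Unicode code points."""
    return [ord(ch) for ch in word]

# The three lexicon entries shown on the slide.
lexicon = [
    [1575, 1604, 1604, 1607],
    [1593, 1576, 1583],
    [1575, 1604, 1593, 1575, 1605],
]
words = [decode(entry) for entry in lexicon]
print(words)
```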
Features Base2 Construction (module2)
Modified Histogram of Oriented Gradient (MHOG2) Features
Shape Features
[Figure: shape features extracted from Image 4]
Classification (module2)
Post-processing (module2)
Proposed writers' lexicon

| Class Label | Writer Name |
|---|---|
| [1] | Writer1 |
| [2] | Writer2 |
| ... | ... |
| [n] | Writer(n) |
Proposed Arabic Handwritten Database
Experiments and Results
• Segmentation

| Database | Correct Segm. | Under Segm. | Over Segm. | Misplaced Segm. |
|---|---|---|---|---|
| AHDB | 89% | 3% | 6% | 2% |
| Proposed | 92% | 4% | 2% | 2% |

Testing Set = 50
True Positive = 46
Accuracy = (46 / 50) x 100
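The accuracy figure is the share of correctly handled test samples, expressed as a percentage. A small sketch of that arithmetic, using the segmentation counts from this slide (the function name `accuracy` is illustrative):

```python
def accuracy(true_positive, testing_set):
    """Percentage of test samples that were handled correctly."""
    if testing_set <= 0:
        raise ValueError("testing set must not be empty")
    return 100 * true_positive / testing_set

segmentation_accuracy = accuracy(46, 50)
print(segmentation_accuracy)  # 92.0
```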
Evaluation of The AHTRS System (module1)
[Figure: preprocessing (image thresholding)]
[Figure: preprocessing (noise removal)]
• Preprocessing (black space elimination)

| System | Accuracy |
|---|---|
| AHTRS system without BSE algorithm | 93% |
| AHTRS system + BSE algorithm | 96.317% |

• Preprocessing (image normalization)

| Image Size | Accuracy |
|---|---|
| 32 x 32 | 94% |
| 64 x 64 | 94.8% |
| 64 x 128 | 95.22% |
| 128 x 64 | 95% |
| 128 x 128 | 96.317% |
• Features Extraction (MHOG1)

| Edge Detection Filter | Accuracy |
|---|---|
| HOG filter | 89.2% |
| Sobel | 89% |
| Canny | 87% |
| Roberts | 90.1% |
| Proposed | 92.70% |

| Approach | Accuracy |
|---|---|
| Un-overlapped blocks | 88.5% |
| Overlapped blocks | 92.70% |
• Features Extraction (DCT)

| Blocks | Accuracy |
|---|---|
| 1 block | 67.92% |
| 4x4 blocks | 60% |
| 6x6 blocks | 61.22% |
| 8x8 blocks | 64.7% |

| Ordering technique | Accuracy |
|---|---|
| Sequential | 66.7% |
| Zig-zag | 67.92% |

| Method | Extraction Time |
|---|---|
| DCT | 1.6 |
| FCT | 0.8 |
• Features Extraction

| Features | Accuracy |
|---|---|
| DCT | 67.92% |
| MHOG1 | 92.70% |
| Statistical + Structural | 70.88% |
| All features | 96.317% |

• Features Normalization

| Features | Classification Time |
|---|---|
| Without FN | 4.5 |
| With FN | 0.9 |
• Classification

| Database | Kernel | Accuracy |
|---|---|---|
| AHDB | linear | 92% |
| AHDB | polynomial | 96.317% |
| AHDB | RBF | 93.1% |
| IESK-arDB | linear | 76% |
| IESK-arDB | polynomial | 82% |
| IESK-arDB | RBF | 78.66% |
| Proposed | linear | 96.2% |
| Proposed | polynomial | 98% |
| Proposed | RBF | 97% |
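For reference, the three SVM kernels compared here have the standard forms sketched below. The hyperparameter values (`degree`, `gamma`, `coef0`) are placeholders chosen for illustration; the slides do not report the settings used in the experiments.

```python
import math

def linear_kernel(x, z):
    """K(x, z) = x . z"""
    return sum(a * b for a, b in zip(x, z))

def polynomial_kernel(x, z, degree=3, gamma=1.0, coef0=1.0):
    """K(x, z) = (gamma * x . z + coef0) ** degree"""
    return (gamma * linear_kernel(x, z) + coef0) ** degree

def rbf_kernel(x, z, gamma=1.0):
    """K(x, z) = exp(-gamma * ||x - z||^2)"""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, z))
    return math.exp(-gamma * sq_dist)

# Two toy feature vectors.
x = [1.0, 2.0]
z = [3.0, 4.0]
print(linear_kernel(x, z))      # 11.0
print(polynomial_kernel(x, z))  # 1728.0
```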
Testing Set = 1365
True Positive = 1314
Accuracy = (1314 / 1365) x 100
Evaluation of The AHTRS System (module2)
• Features Extraction (module2)

| Features | Accuracy |
|---|---|
| MHOG2 | 95.9% |
| Shape | 93% |
| MHOG2 + Shape | 100% |
• Classification (module2)

| Approach | Kernel | Accuracy |
|---|---|---|
| Sub-word level | linear | 80% |
| Sub-word level | polynomial | 85% |
| Sub-word level | RBF | 81.9% |
| Text level | linear | 98% |
| Text level | polynomial | 100% |
| Text level | RBF | 98.6% |
Evaluation of The AHTRS System
• Classification (module 1 & 2)

| Module | Classifier | Accuracy |
|---|---|---|
| 1 | KNN | 93% |
| 1 | SVM | 98% |
| 1 | ANN | 94% |
| 2 | KNN | 95% |
| 2 | SVM | 100% |
| 2 | ANN | 98% |
Conclusions
1. The proposed system relies on segmenting the text into handwritten sub-images, an approach that is simple, practical, and efficient, and that yields higher accuracy than systems that depend on character segmentation.
2. The steps of the proposed preprocessing stage produce binary, thinned, and cropped images without noise, which increases the system accuracy. In addition, choosing an appropriate edge detector and image normalization size improves the system's results.
3. The use of MHOG1 and MHOG2 in the proposed system is the main contribution of this thesis and leads to better recognition and identification accuracy. Furthermore, the results show the advantage of the proposed edge detection filter for MHOG1 over the other filters.
4. The other proposed features (DCT, statistical, and shape features) make the system more accurate.
5. The features normalization (FN) algorithm reduces the training and classification time, and thereby the system's processing time.
6. Using the one-vs-all approach with a polynomial kernel in the Support Vector Machine (SVM) classification algorithm yields more robust recognition and identification performance than the other approaches, kernels, and classifiers.
7. The proposed system achieved better accuracy on three different Arabic handwritten databases than all the previous works.
8. The proposed handwritten text database gives better accuracy than the other handwritten databases, and it can be used for writer identification. The database can also be used for character and word recognition.
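The one-vs-all approach named in conclusion 6 trains one binary classifier per class and picks the class whose classifier is most confident. The sketch below shows only that decision rule; the linear scorers are hypothetical stand-ins for trained binary SVMs, and the weights are made up for illustration.

```python
def one_vs_all_predict(scorers, features):
    """Return the label whose 'this class vs. the rest' scorer gives the highest score."""
    return max(scorers, key=lambda label: scorers[label](features))

def make_linear_scorer(weights, bias):
    """Hypothetical stand-in for the decision function of a trained binary SVM."""
    return lambda x: sum(w * v for w, v in zip(weights, x)) + bias

# One scorer per writer; weights and biases are illustrative values only.
scorers = {
    "Writer1": make_linear_scorer([1.0, 0.0], -0.5),
    "Writer2": make_linear_scorer([0.0, 1.0], -0.5),
    "Writer3": make_linear_scorer([-1.0, -1.0], 0.5),
}
predicted = one_vs_all_predict(scorers, [0.9, 0.1])
print(predicted)  # Writer1
```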
List of Publications
Journals:
1. Mustafa S., Alia K., “ACRS: Arabic Character Recognition System Based on Multi Features Extraction Methods”, International Journal of Scientific and Engineering Research, vol. 6, issue 10, pp. 656-661, 2015.
2. Alia K., Mustafa S., “Handwriting Word Recognition Based on SVM Classifier”, International Journal of Advanced Computer Science & Applications, vol. 1, issue 6, pp. 64-68, 2015.
3. Mustafa S., Alia K., “Handwriting Word Recognition Based on Neural Networks”, International Journal of Applied Engineering Research, vol. 10, issue 22, pp. 43120-43124, 2015.
4. Alia K., Mustafa S., “An Efficient Image Thresholding Method for Arabic Handwriting Recognition System”, Engineering and Technology Journal, vol. 34, issue 1, pp. 26-34, 2016.
5. Alia K., Mustafa S., “An Efficient Preprocessing Framework for Arabic Handwriting Recognition System”, Diyala Journal For Pure Sciences, vol. 12, issue 3, pp. 147-163, 2016.
6. Alia K., Mustafa S., “Arabic Handwriting Text Recognition Based on Efficient Segmentation, DCT and HOG Features”, International Journal of Multimedia and Ubiquitous Engineering, vol. 11, issue 10, pp. 83-92, 2016.
Conferences:
1. Alia K., Mustafa S., “AHCR: Arabic Handwriting Character Recognition System Using Multi-scale Features, SVM And KNN Classifiers”, 2nd Global Conference on Contemporary Issues in Education, pp. 46, 2015.
2. Alia K., Mustafa S., “Arabic Handwriting Text Recognition Based on EOD and HOG Features”, SAI Intelligent Systems Conference (IntelliSys), 2016. (Accepted)
Thank you

Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 

Arabic Handwritten Text Recognition and Writer Identification

  • 1. Arabic Handwritten Text Recognition and Writer Identification. Supervisor: Asst. Prof. Dr. Alia K. Abdul Hassan. Prepared by: Mustafa Salam Kadhm. 2017. Ministry of Higher Education & Scientific Research, University of Technology, Department of Computer Science.
  • 2. Contents: (1) Problem Statement, (2) Aim of Thesis, (3) Proposed System, (4) Experiments and Results, (5) Conclusions.
  • 3. Problem Statement
      • Most governments and organizations hold handwritten documents that need to be made editable and searchable.
      • Arabic handwritten text recognition is more complex than recognition of other handwritten languages because Arabic script is cursive in nature.
      • Existing recognition systems, which depend on character segmentation, achieve poor accuracy.
      • Existing systems produce unauthenticated recognition results.
      • Arabic handwritten databases are not readily available.
  • 4. Aim of Thesis
      • Develop an accurate Arabic handwritten text recognition system based on multi-scale feature extraction methods and an SVM classifier.
      • Employ the proposed system in a security application by identifying the writer of the input handwritten text.
      • Develop an Arabic handwritten database of color and gray handwritten images that supports character, word and text recognition systems and can be used for security applications.
  • 5. The Proposed System Architecture (figure; labels: Input, Output)
  • 6. The Proposed System Architecture (figure; labels: Input, Output)
  • 8. Segmentation (figure panels): input image; applying Sobel filter; applying dilation and filling; drawing rectangles around the objects; drawing the obtained rectangles on the original image; handwritten sub-images.
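The segmentation steps named on this slide can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the thesis implementation: the image is a list of lists of gray values, the edge threshold, the 3x3 dilation window and 8-connectivity are all assumed, and the hole-filling step is omitted for brevity. The function names (`sobel_edges`, `dilate`, `bounding_boxes`, `segment`) are hypothetical.

```python
from collections import deque

def sobel_edges(img, thresh=1):
    """Binary edge map: 1 where the Sobel gradient magnitude is >= thresh."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = (img[y - 1][x + 1] + 2 * img[y][x + 1] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y][x - 1] - img[y + 1][x - 1])
            gy = (img[y + 1][x - 1] + 2 * img[y + 1][x] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y - 1][x] - img[y - 1][x + 1])
            if (gx * gx + gy * gy) ** 0.5 >= thresh:
                out[y][x] = 1
    return out

def dilate(mask):
    """3x3 binary dilation: a pixel is set if it or any 8-neighbour is set."""
    h, w = len(mask), len(mask[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    ny, nx = y + dy, x + dx
                    if 0 <= ny < h and 0 <= nx < w and mask[ny][nx]:
                        out[y][x] = 1
    return out

def bounding_boxes(mask):
    """(top, left, bottom, right) of each 8-connected component, in scan order."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if not mask[y][x] or seen[y][x]:
                continue
            seen[y][x] = True
            queue = deque([(y, x)])
            top, left, bottom, right = y, x, y, x
            while queue:  # breadth-first flood fill of one component
                cy, cx = queue.popleft()
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and mask[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes

def segment(img):
    """Edge detection, dilation, then one rectangle per connected object."""
    return bounding_boxes(dilate(sobel_edges(img)))
```

Each returned rectangle can then be used to crop a handwritten sub-image from the original image. Dilation merges nearby strokes so that one rectangle covers a whole sub-word rather than individual edge fragments.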
  • 9. Preprocessing (Image Thresholding) (figure: gray input images and binary output images for the AHDB, IESK-arDB and proposed databases)
  • 10. Preprocessing (Noise Removal) (figure: two examples labeled Noise Removed)
  • 12. Preprocessing (Image Normalization) (figure: Image 1, Image 2 and Image 3 normalized to 128 x 128)
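Size normalization to 128 x 128 can be illustrated with a nearest-neighbour resize. The slides do not state which interpolation method the thesis uses, so nearest-neighbour is an assumption here, and `resize_nearest` is a hypothetical helper name.

```python
def resize_nearest(img, out_h=128, out_w=128):
    """Resize a 2-D list of pixels to out_h x out_w by nearest-neighbour sampling.

    Assumption: the thesis does not name its interpolation method; this is
    the simplest choice and keeps binary images binary.
    """
    in_h, in_w = len(img), len(img[0])
    return [
        [img[y * in_h // out_h][x * in_w // out_w] for x in range(out_w)]
        for y in range(out_h)
    ]
```

A fixed output size gives every sub-image the same feature-vector length, which the classifier requires.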
  • 13. Features Base1 Construction: Structural Features (figure: Image 1)
  • 14. Features Base1 Construction: Statistical Features (figure: Image 2 divided into Block 1, Block 2, Block 3 and Block 4)
  • 15. Features Base1 Construction: Discrete Cosine Transform (DCT) Features. The DCT is applied to Image 2, and the first 10 coefficients in zigzag order are taken as the DCT features.
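The "first 10 coefficients in zigzag order" step can be sketched as follows. This is an illustrative version, assuming an orthonormal 2-D DCT-II on a square block and the JPEG-style zigzag scan; the slides do not specify either detail. The direct quadruple loop is only practical for small blocks, and the function names are hypothetical.

```python
import math

def dct2(block):
    """Orthonormal 2-D DCT-II of a square block given as a list of lists."""
    n = len(block)

    def alpha(k):
        return math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n)

    out = [[0.0] * n for _ in range(n)]
    for u in range(n):
        for v in range(n):
            s = 0.0
            for y in range(n):
                for x in range(n):
                    s += (block[y][x]
                          * math.cos((2 * y + 1) * u * math.pi / (2 * n))
                          * math.cos((2 * x + 1) * v * math.pi / (2 * n)))
            out[u][v] = alpha(u) * alpha(v) * s
    return out

def zigzag_indices(n):
    """(row, col) pairs of an n x n matrix in JPEG-style zigzag order."""
    order = []
    for s in range(2 * n - 1):
        diag = [(r, s - r) for r in range(n) if 0 <= s - r < n]
        # Even anti-diagonals run from bottom-left to top-right, odd ones the reverse.
        order.extend(reversed(diag) if s % 2 == 0 else diag)
    return order

def dct_features(block, k=10):
    """First k DCT coefficients of the block, read in zigzag order."""
    coeffs = dct2(block)
    return [coeffs[r][c] for r, c in zigzag_indices(len(block))[:k]]
```

Zigzag ordering visits low-frequency coefficients first, so truncating to the first 10 keeps the coarse shape information and discards fine detail.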
  • 16. Features Base1 Construction: Modified Histogram of Oriented Gradient (MHOG1) Features (figure: Image 3 with X-axis and Y-axis gradients, magnitude and orientation)
  • 17. Features Base1 Construction: Modified Histogram of Oriented Gradient (MHOG1) Features (figure)
  • 18. Features Base1 Construction: Modified Histogram of Oriented Gradient (MHOG1) Features. Interpolation of the gradient orientation votes.
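Gradient magnitude, orientation and interpolated voting can be illustrated with a standard HOG cell histogram. This sketch is the conventional HOG scheme, not the thesis's MHOG1 modification, whose details are not given on the slides. The 9 unsigned bins over 0 to 180 degrees and the central-difference gradients are assumptions, and `cell_histogram` is a hypothetical name.

```python
import math

def cell_histogram(cell, bins=9):
    """Orientation histogram of one cell with linearly interpolated votes.

    Each interior pixel votes with its gradient magnitude, split between
    the two bins whose centres are nearest to its orientation.
    """
    h, w = len(cell), len(cell[0])
    width = 180.0 / bins          # bin centres lie at (i + 0.5) * width degrees
    hist = [0.0] * bins
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = cell[y][x + 1] - cell[y][x - 1]   # X-axis gradient
            gy = cell[y + 1][x] - cell[y - 1][x]   # Y-axis gradient
            mag = math.hypot(gx, gy)
            if mag == 0:
                continue
            angle = math.degrees(math.atan2(gy, gx)) % 180.0  # unsigned orientation
            pos = angle / width - 0.5    # position measured in bin centres
            lo = math.floor(pos)
            frac = pos - lo
            hist[lo % bins] += mag * (1 - frac)
            hist[(lo + 1) % bins] += mag * frac
    return hist
```

Splitting each vote between neighbouring bins avoids abrupt changes in the descriptor when an orientation sits near a bin boundary.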
  • 21. Post-processing: proposed Arabic lexicon
      Arabic Text | Unicode
      الله | [1575, 1604, 1604, 1607]
      عبد | [1593, 1576, 1583]
      العام | [1575, 1604, 1593, 1575, 1605]
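The lexicon entries are decimal Unicode code points, so converting between an entry and editable text is a direct mapping. The sketch below uses the three entries from the slide. How the thesis consults the lexicon is not stated, so the lookup by class index in `recognized_text` is an assumption, and all names are hypothetical.

```python
# Lexicon entries copied from the slide: decimal Unicode code points per word.
LEXICON = [
    [1575, 1604, 1604, 1607],
    [1593, 1576, 1583],
    [1575, 1604, 1593, 1575, 1605],
]

def decode(code_points):
    """Turn a list of code points into an editable Unicode string."""
    return "".join(chr(cp) for cp in code_points)

def encode(word):
    """Turn a Unicode string back into its list of code points."""
    return [ord(ch) for ch in word]

def recognized_text(class_index):
    """Assumed usage: the classifier's class index selects a lexicon entry."""
    return decode(LEXICON[class_index])
```

Storing code points rather than glyph images is what makes the recognized output editable and searchable.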
  • 22. Features Base2 Construction (module 2): Modified Histogram of Oriented Gradient (MHOG2) Features
  • 25. Post-processing (module 2): proposed writers' lexicon
      Class Label | Writer Name
      [1] | Writer1
      [2] | Writer2
      ... | ...
      [n] | Writer(n)
  • 30. Experiments and Results: Segmentation
      Database | Correct Segm. | Under Segm. | Over Segm. | Misplaced Segm.
      AHDB | 89% | 3% | 6% | 2%
      Proposed | 92% | 4% | 2% | 2%
      Testing Set = 50, True Positive = 46, Accuracy = (46 / 50) x 100 = 92%
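The accuracy figure on this slide is the share of correctly handled test samples, expressed as a percentage. A small sketch of that arithmetic, with `accuracy_percent` as a hypothetical helper name:

```python
def accuracy_percent(true_positive, testing_set_size):
    """Accuracy = (true positives / testing set size) x 100."""
    if testing_set_size <= 0:
        raise ValueError("testing set must not be empty")
    return 100 * true_positive / testing_set_size
```

With the slide's values, `accuracy_percent(46, 50)` gives 92.0, matching the Correct Segm. entry for the proposed database.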
  • 31. Evaluation of the AHTRS System (module 1): Preprocessing (image thresholding) (figure)
  • 32. Evaluation of the AHTRS System (module 1): Preprocessing (noise removal) (figure)
  • 33. Evaluation of the AHTRS System (module 1): Preprocessing (noise removal) (figure)
  • 34. Evaluation of the AHTRS System (module 1)
      Preprocessing (black space elimination):
      System | Accuracy
      AHTRS system without BSE algorithm | 93%
      AHTRS system + BSE algorithm | 96.317%
      Preprocessing (image normalization):
      Image Size | Accuracy
      32 x 32 | 94%
      64 x 64 | 94.8%
      64 x 128 | 95.22%
      128 x 64 | 95%
      128 x 128 | 96.317%
  • 35. Evaluation of the AHTRS System (module 1): Features Extraction (MHOG1)
      Edge Detection Filter | Accuracy
      HOG filter | 89.2%
      Sobel | 89%
      Canny | 87%
      Roberts | 90.1%
      Proposed | 92.70%
      Approach | Accuracy
      Non-overlapping blocks | 88.5%
      Overlapping blocks | 92.70%
  • 36. Evaluation of the AHTRS System (module 1): Features Extraction (DCT)
      Blocks | Accuracy
      1 block | 67.92%
      4x4 blocks | 60%
      6x6 blocks | 61.22%
      8x8 blocks | 64.7%
      Ordering Technique | Accuracy
      Sequential | 66.7%
      Zig-zag | 67.92%
      Method | Extraction Time
      DCT | 1.6
      FCT | 0.8
  • 37. Evaluation of the AHTRS System (module 1)
      Features Extraction:
      Features | Accuracy
      DCT | 67.92%
      MHOG1 | 92.70%
      Statistical + Structural | 70.88%
      All features | 96.317%
      Features Normalization:
      Features | Classification Time
      Without FN | 4.5
      With FN | 0.9
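The slides do not describe the features normalization (FN) algorithm itself. As a generic illustration of the idea, the sketch below rescales every feature column to the range [0, 1] with min-max scaling. This is a stand-in chosen for illustration, not the thesis's FN algorithm, and `min_max_normalize` is a hypothetical name.

```python
def min_max_normalize(rows):
    """Rescale each feature column of a list of feature vectors to [0, 1].

    Assumption: min-max scaling stands in for the unspecified FN algorithm.
    Constant columns are mapped to 0.0 to avoid division by zero.
    """
    columns = list(zip(*rows))
    lows = [min(col) for col in columns]
    highs = [max(col) for col in columns]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]
```

Bringing features of very different magnitudes, such as DCT coefficients and histogram votes, onto a common scale typically helps kernel classifiers converge faster.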
  • 38. Evaluation of the AHTRS System (module 1): Classification (figure)
  • 39. Evaluation of the AHTRS System (module 1): Classification
      Database | Kernel | Accuracy
      AHDB | linear | 92%
      AHDB | polynomial | 96.317%
      AHDB | RBF | 93.1%
      IESK-arDB | linear | 76%
      IESK-arDB | polynomial | 82%
      IESK-arDB | RBF | 78.66%
      Proposed | linear | 96.2%
      Proposed | polynomial | 98%
      Proposed | RBF | 97%
      Testing Set = 1365, True Positive = 1314, Accuracy = (1314 / 1365) x 100
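The polynomial kernel and the one-vs-all decision rule compared in this table can be sketched on the prediction side only. Training is not shown; the support vectors, dual coefficients and bias in each per-class model are assumed to come from an already trained SVM. The kernel parameters (`degree`, `gamma`, `coef0`) and all function names are illustrative assumptions, not values from the thesis.

```python
def poly_kernel(a, b, degree=2, gamma=1.0, coef0=1.0):
    """Polynomial kernel K(a, b) = (gamma * <a, b> + coef0) ** degree."""
    return (gamma * sum(x * y for x, y in zip(a, b)) + coef0) ** degree

def decision_value(model, x):
    """f(x) = sum_i coef_i * K(sv_i, x) + bias, with coef_i = alpha_i * y_i."""
    return sum(
        coef * poly_kernel(sv, x)
        for sv, coef in zip(model["support_vectors"], model["dual_coefs"])
    ) + model["bias"]

def predict_one_vs_all(models, x):
    """One binary model per class; the class with the largest f(x) wins."""
    return max(models, key=lambda label: decision_value(models[label], x))
```

In a one-vs-all setup with n classes, n binary classifiers are evaluated per sample, each separating one class from all the others.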
  • 40. Evaluation of the AHTRS System (module 2): Features Extraction
      Features | Accuracy
      MHOG2 | 95.9%
      Shape | 93%
      MHOG2 + Shape | 100%
  • 41. Evaluation of the AHTRS System (module 2): Classification
      Approach | Kernel | Accuracy
      Sub-word level | linear | 80%
      Sub-word level | polynomial | 85%
      Sub-word level | RBF | 81.9%
      Text level | linear | 98%
      Text level | polynomial | 100%
      Text level | RBF | 98.6%
  • 42. Evaluation of the AHTRS System: Classification (modules 1 and 2)
      Module | Classifier | Accuracy
      1 | KNN | 93%
      1 | SVM | 98%
      1 | ANN | 94%
      2 | KNN | 95%
      2 | SVM | 100%
      2 | ANN | 98%
  • 43. Conclusions
      1. The proposed system relies on segmenting handwritten sub-images, an approach that is simple, practical and efficient, and that yields higher accuracy than systems that depend on character segmentation.
      2. The steps of the proposed preprocessing stage produce binary, thinned and cropped images without noise, which increases the system accuracy. In addition, choosing an appropriate edge detector and image normalization size improves the system's results.
  • 44. Conclusions (cont.)
      3. The use of MHOG1 and MHOG2 is the most successful part of this thesis and leads to better recognition and identification accuracy. Furthermore, the results show that the proposed edge detection filter serves MHOG1 better than the other filters.
      4. The other proposed features (DCT, statistical and shape features) also make the system more accurate.
      5. The features normalization (FN) algorithm reduces the training and classification time, and therefore the system's processing time.
  • 45. Conclusions (cont.)
      6. Using the one-vs-all approach with the polynomial kernel of the Support Vector Machine (SVM) classification algorithm yields more robust recognition and identification performance than the other approaches, kernels and classifiers.
      7. The proposed system achieved better accuracy on three different Arabic handwritten databases than all the previous works.
      8. The proposed handwritten text database gives better accuracy than the other handwritten databases and can be used for writer identification. The database can also be used for character and word recognition.
  • 46. List of Publications
      Journals:
      1. Mustafa S., Alia K., "ACRS: Arabic Character Recognition System Based on Multi Features Extraction Methods", International Journal of Scientific and Engineering Research, vol. 6, issue 10, pp. 656-661, 2015.
      2. Alia K., Mustafa S., "Handwriting Word Recognition Based on SVM Classifier", International Journal of Advanced Computer Science & Applications, vol. 1, issue 6, pp. 64-68, 2015.
      3. Mustafa S., Alia K., "Handwriting Word Recognition Based on Neural Networks", International Journal of Applied Engineering Research, vol. 10, issue 22, pp. 43120-43124, 2015.
      4. Alia K., Mustafa S., "An Efficient Image Thresholding Method for Arabic Handwriting Recognition System", Engineering and Technology Journal, vol. 34, issue 1, pp. 26-34, 2016.
      5. Alia K., Mustafa S., "An Efficient Preprocessing Framework for Arabic Handwriting Recognition System", Diyala Journal For Pure Sciences, vol. 12, issue 3, pp. 147-163, 2016.
      6. Alia K., Mustafa S., "Arabic Handwriting Text Recognition Based on Efficient Segmentation, DCT and HOG Features", International Journal of Multimedia and Ubiquitous Engineering, vol. 11, issue 10, pp. 83-92, 2016.
      Conferences:
      1. Alia K., Mustafa S., "AHCR: Arabic Handwriting Character Recognition System Using Multi-scale Features, SVM And KNN Classifiers", 2nd Global Conference on Contemporary Issues in Education, pp. 46, 2015.
      2. Alia K., Mustafa S., "Arabic Handwriting Text Recognition Based on EOD and HOG Features", SAI Intelligent Systems Conference (IntelliSys), 2016. (Accepted)