International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 4, Issue 5, September - October (2013), © IAEME

INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET)

ISSN 0976 – 6367(Print)
ISSN 0976 – 6375(Online)
Volume 4, Issue 5, September – October (2013), pp. 224-231
© IAEME: www.iaeme.com/ijcet.asp
Journal Impact Factor (2013): 6.1302 (Calculated by GISI)
www.jifactor.com


IDENTIFICATION OF DEVANAGARI SCRIPT FROM IMAGE DOCUMENT
SATISH R. DAMADE (1), K. P. ADHIYA (2) and RANJANA S. ZINJORE (3)

(1) Computer Engineering, KBS’s College of Engineering & Technology, North Maharashtra Knowledge City, Jalgaon.
(2) Computer Engineering, SSBT’s College of Engineering & Technology, Bambhori, Jalgaon.
(3) Computer Application, KCES’s Institute of Management and Research, Jalgaon.

ABSTRACT
Text that appears in an image often carries useful and important information. Optical Character
Recognition (OCR) technology is largely restricted to text printed against clean backgrounds and cannot
handle text printed against shaded or textured backgrounds or embedded in images. Extracting text
from images is therefore necessary; for example, when a voice synthesizer is attached to the system,
it can help blind and visually impaired people. In this paper, we present a methodology for
extracting text from printed image documents and then identifying Devanagari script (Hindi language)
in the extracted text. First, we use a morphological approach to extract the text from image
documents. The resulting text image is passed to Optical Character Recognition for identification.
Projection profiles are used for segmentation, followed by a visual discriminating approach for
feature extraction. Finally, a heuristic search is used for classification. The results of the proposed
text extraction method are compared with those of an edge-based approach and a connected-component
approach with projection profiles. The comparison, based on precision and recall rates, shows that
the proposed algorithm works well.
Keywords: Area, Bounding Box, Canny edge detector, Heuristic Search, Projection Profile,
Visual Discriminating feature.
I. INTRODUCTION
In recent years, the growing volume of physical documents has driven progress towards the
creation of electronic documents, which make communication and storage easier. Nowadays,
information is increasingly enriched by multimedia components such as images
and video in addition to textual information. The extraction of text from an image is a classical
problem in the computer vision research area. Text extraction from images and video finds many
applications in document processing, vehicle license plate detection, mobile robot navigation,
object identification, text in WWW images, content-based image retrieval from image databases and
video content analysis [1]. There are basically three kinds of images: document images, scene text
images and caption text images. Two approaches are mainly used to extract text from these images:
the region-based approach and the texture-based approach [2]. After the text has been extracted from an
image document, script identification plays a vital role in the design of an Optical Character Recognition system.
Script identification is a key step in document image analysis, especially when the
environment is multi-script, and language identification is required to distinguish the different languages
that share the same script. In India, script identification facilitates many important applications
such as sorting images, selecting the appropriate script-specific text understanding system and
searching online archives of document images for those containing a particular script [3]. In this paper we use the
Hindi language for identification because Hindi is the third most spoken language in the
world after Chinese and English, with approximately 500 million people worldwide
who speak and write it. In addition, many forms and applications are available in a
combination of the state official language and English. We use printed images
containing Hindi and English text and identify the Hindi text in such image documents.
Hindi is written in the Devanagari script, which consists of 12 vowels and 34 consonants, along with a
horizontal line at the upper part of the characters called the Shirorekha. The English alphabet is a
Latin-based alphabet consisting of 26 letters, each with upper- and lower-case forms. The structure of the
English alphabet contains more vertical and slanted strokes.
II. CHALLENGES AND RELATED WORK
Text extraction from complex images is one of the most useful and difficult applications of
pattern recognition and computer vision. Identifying the script from the extracted text image is also a
difficult task because characters of different scripts can have similar shapes. Leon et al. [4] presented a technique
for detecting caption text for indexing purposes. Caption text objects are detected by combining texture
and geometric features, and textured areas are detected using wavelet analysis. Zhong et al. [5]
located text in complex images such as compact disc, book cover or traffic scene images. To find text
locations, the authors used the higher spatial variance of the image intensity along horizontal text lines.
Wu et al. [6] proposed a four-step system that automatically detects and extracts text in images:
texture segmentation, in which the image is filtered using a bank of linear filters, followed by
stroke extraction, drawing a rectangular box around the text and finally detecting the text. Samarabandu and Liu [7]
used an edge-based approach for extracting text, based on generating a feature map from three
important properties of edges: edge strength, density and variance of orientation. Neha Gupta [8]
proposed a method of image segmentation for text extraction based on the 2-D Discrete Wavelet
Transform, which decomposes the image into four sub-components. The edges of three sub-bands
are then fused to create candidate text regions; a projection profile approach follows, and text is
extracted based on a threshold. C. V. Jawahar et al. [9] proposed a technique to distinguish between
Hindi and Telugu script. For Hindi, segmentation involves the removal of the shirorekha. For Telugu,
component extraction implies the separation of connected components. S. Basavaraj Patil [10]
presented an approach for identifying Hindi, English and Kannada scripts. For feature
extraction, the input image is dilated using 3x3 masks in the horizontal, vertical, left-diagonal and right-diagonal
directions, followed by computing the average pixel distribution of the resulting images; a neural network is used for
classification. Pal and Chaudhuri [11] proposed an automatic technique for separating the text
lines of 12 Indian scripts (English, Devanagari, Bangla, Gujarati, Tamil, Kashmiri, Malayalam,
Oriya, Punjabi, Telugu and Urdu) using ten triplets formed by grouping English and Devanagari with
any one of the other scripts. Santanu Choudhury et al. [12] proposed a method for identifying
Indian languages by combining a Gabor filter based technique and a direction distance histogram
classifier, considering Hindi, English, Malayalam, Bengali, Telugu and Urdu.
III. PROPOSED ARCHITECTURE AND METHODOLOGY
3.1 Proposed Architecture: The proposed architecture for identification of Hindi Script from image
document is shown in Fig.1.
3.2. Methodology
3.2.1) Preprocessing:
i) In this step we convert the RGB color image (Fig. 2) into a gray scale image.
The gray scale image is then converted into a binary image using Otsu’s thresholding.
Figure 1: Proposed Architecture for Identification of Hindi script from Image Document
[Flowchart. Input: color image with complex background -> Preprocessing -> Detection of text region -> Display the text region and remove the non-text region (resultant image) -> Improve the quality of text by subtracting the resultant image with the input image -> Output: final image containing text -> Pass the extracted text for script identification -> Segmentation of text into lines and words -> Feature Extraction -> Heuristic Search -> Identification of Hindi script -> Identified Hindi script]
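ii) Edge detection and Morphological Dilation: The Canny method is used for edge
extraction. The edge image is then dilated using a square structuring element. In its standard form,
the morphological dilation of A by B is denoted by:
$$A \oplus B = \{\, z \mid (\hat{B})_z \cap A \neq \emptyset \,\} \qquad (1.1)$$
where $\hat{B}$ is the reflection of B about its origin and $(\hat{B})_z$ is its translation by z.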
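iii) Hole filling: Hole filling is performed by selecting marker and mask images. In the standard
formulation, the marker image F is defined from the image I as:
$$F(x, y) = \begin{cases} 1 - I(x, y) & \text{if } (x, y) \text{ is on the border of } I \\ 0 & \text{otherwise} \end{cases} \qquad (1.2)$$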
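The thresholding and dilation steps above can be illustrated with a minimal pure-Python sketch. It is not the authors' Matlab code: the function names `otsu_threshold`, `binarize` and `dilate` are hypothetical, images are plain nested lists, the Canny step is omitted, and the 3x3 structuring element is an assumption because the size used in the paper is not recoverable here.

```python
def otsu_threshold(gray):
    """Return the Otsu threshold for a 2-D list of gray levels in 0..255."""
    hist = [0] * 256
    for row in gray:
        for value in row:
            hist[value] += 1
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels with level <= t
    sum0 = 0  # sum of gray levels of those pixels
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        between = w0 * w1 * (mean0 - mean1) ** 2  # between-class variance (unnormalized)
        if between > best_var:
            best_var, best_t = between, t
    return best_t


def binarize(gray, threshold):
    """Map pixels above the threshold to 1 and all others to 0."""
    return [[1 if value > threshold else 0 for value in row] for row in gray]


def dilate(binary, size=3):
    """Binary dilation with a size x size square structuring element (size is an assumption)."""
    height, width = len(binary), len(binary[0])
    radius = size // 2
    out = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            if binary[y][x]:
                # Stamp the square neighbourhood of every foreground pixel.
                for yy in range(max(0, y - radius), min(height, y + radius + 1)):
                    for xx in range(max(0, x - radius), min(width, x + radius + 1)):
                        out[yy][xx] = 1
    return out


gray = [[10, 10, 200, 200], [10, 12, 198, 200]]
t = otsu_threshold(gray)
edges = dilate(binarize(gray, t))
```

For a symmetric square structuring element, stamping the neighbourhood of each foreground pixel gives the same result as the set definition of dilation in equation (1.1).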
3.2.2) Detection of Text Region
i) The dilated image is labeled using bwlabel in Matlab with 8-way connectivity. To
obtain the measurements (‘Bounding Box’ and ‘Area’) of each image region we use the regionprops
function in Matlab (Fig. 3).
ii) To extract the text regions, we compute a new-value by multiplying the height
and width of each bounding box and then divide this new-value by the Area of the region. By
experimentation we found that if the ratio (new-value/Area) is less than 1.78 and the height is
greater than 9, then the region is a text region (the specific condition).
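As an illustration of the text-region test, the following sketch labels 8-connected components with a breadth-first search and applies the bounding-box ratio rule. It is a pure-Python stand-in for the Matlab functions named above, not the authors' implementation; `connected_components` and `is_text_region` are hypothetical names, and foreground pixels are assumed to be stored as 1.

```python
from collections import deque


def connected_components(binary):
    """Return a list of components, each a list of (row, col) pixels, using 8-connectivity."""
    height, width = len(binary), len(binary[0])
    seen = [[False] * width for _ in range(height)]
    components = []
    for y in range(height):
        for x in range(width):
            if binary[y][x] and not seen[y][x]:
                seen[y][x] = True
                queue, pixels = deque([(y, x)]), []
                while queue:
                    cy, cx = queue.popleft()
                    pixels.append((cy, cx))
                    for dy in (-1, 0, 1):
                        for dx in (-1, 0, 1):
                            ny, nx = cy + dy, cx + dx
                            if (0 <= ny < height and 0 <= nx < width
                                    and binary[ny][nx] and not seen[ny][nx]):
                                seen[ny][nx] = True
                                queue.append((ny, nx))
                components.append(pixels)
    return components


def is_text_region(pixels, max_ratio=1.78, min_height=9):
    """Apply the rule (height * width) / area < 1.78 and height > 9 to one component."""
    rows = [row for row, _ in pixels]
    cols = [col for _, col in pixels]
    box_height = max(rows) - min(rows) + 1
    box_width = max(cols) - min(cols) + 1
    area = len(pixels)  # number of foreground pixels in the component
    return (box_height * box_width) / area < max_ratio and box_height > min_height


# A solid 10 x 4 block (ratio 1.0) and a 12-pixel diagonal stroke (ratio 12.0).
image = [[0] * 20 for _ in range(14)]
for r in range(1, 11):
    for c in range(1, 5):
        image[r][c] = 1
for i in range(12):
    image[1 + i][7 + i] = 1
flags = [is_text_region(component) for component in connected_components(image)]
```

In this toy image the dense block satisfies the condition while the sparse diagonal does not, which is the intuition behind the ratio: regions that fill most of their bounding box are kept.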

Figure 2: Input color Image

Figure 3: Bounding box

3.2.3) Display of Text Region and Removal of Non-text Region
i) We find the connected components (CC) of the binary image using the
bwconncomp function in Matlab.
ii) We obtain the size of the dilated image and create an image of that size with all values set to false, which gives a blank background.
iii) For each connected component that satisfies the above specific condition, we set its
pixels to true using PixelIdxList.
iv) The resulting image is visually poor. To improve the visualization we perform a subtraction
between the resultant image and the input image (Fig. 4).
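Steps ii) and iii) amount to painting the accepted components onto a blank image. A minimal sketch follows, assuming each component is given as a list of (row, col) pixels; `build_text_mask` and the `keep` predicate are hypothetical names, and the final subtraction step is deliberately left out because its exact form is not specified.

```python
def build_text_mask(height, width, components, keep):
    """Return a height x width boolean image that is True only on accepted components."""
    mask = [[False] * width for _ in range(height)]  # blank (all-false) background
    for pixels in components:
        if keep(pixels):
            for row, col in pixels:
                mask[row][col] = True
    return mask


# Two toy components; the predicate here simply keeps components with at least 2 pixels.
components = [[(0, 0), (0, 1)], [(2, 2)]]
mask = build_text_mask(3, 3, components, keep=lambda pixels: len(pixels) >= 2)
```

In practice the predicate would be the text-region condition on the bounding-box ratio and height described in Section 3.2.2.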

Figure 4: Final Result

Figure 5: Segmentation of text into lines
3.2.4) Segmentation of Text into lines and words
i) The document image is segmented into text lines using the horizontal projection profile,
computed as the row-wise sum of black pixels. We then find the valleys (minimum points) between
the peaks (maximum points) of the histogram and draw a line (cut point) at each minimum
across the width of the document, as shown in Fig. 5.
ii) After line segmentation we use the vertical projection profile, with a threshold
value that preserves the inter-character gap, to segment bilingual (Devanagari – English) text
into words. The words obtained are thinned for feature extraction, and each word is given
a bounding box by locating its first pixel from the left, right, top and bottom. The
extracted words are inverted for feature extraction, as shown in Fig. 6.
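The two projection-profile steps can be sketched as follows. This is an illustrative pure-Python version under stated assumptions: text pixels are stored as 1, a cut is placed wherever the profile drops to zero, and `gap_threshold` is a hypothetical parameter standing in for the threshold that keeps inter-character gaps from splitting a word.

```python
def split_runs(profile, min_gap=1):
    """Return inclusive (start, end) index pairs of non-zero runs in a profile.

    Runs separated by fewer than min_gap zero entries are merged into one run.
    """
    runs = []
    start = None
    gap = 0
    for index, value in enumerate(profile):
        if value > 0:
            if start is None:
                start = index
            end = index
            gap = 0
        elif start is not None:
            gap += 1
            if gap >= min_gap:
                runs.append((start, end))
                start = None
    if start is not None:
        runs.append((start, end))
    return runs


def segment_lines(binary):
    """Split a page into text lines using the horizontal projection profile (row sums)."""
    return split_runs([sum(row) for row in binary], min_gap=1)


def segment_words(line_rows, gap_threshold):
    """Split one text line into words using the vertical projection profile (column sums)."""
    profile = [sum(column) for column in zip(*line_rows)]
    return split_runs(profile, min_gap=gap_threshold)


page = [
    [0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 0, 0, 0, 1],
    [1, 1, 0, 1, 0, 0, 0, 1],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [1, 0, 0, 0, 0, 0, 0, 0],
]
lines = segment_lines(page)
first_top, first_bottom = lines[0]
words = segment_words(page[first_top:first_bottom + 1], gap_threshold=2)
```

With `gap_threshold=2`, the single empty column inside the first word is treated as an inter-character gap, while the three-column gap separates two words.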

Figure 6: Word segmented Image
3.2.5) Script Identification
The distinct features used for script identification are:
i) Feature 1: Top_profile and Bottom_profile: The top_profile (bottom_profile) of a text line
is the set of black pixels obtained by scanning each column of the text line from the top (bottom)
until the first black pixel is reached. Thus, a component of width N yields N such pixels.
ii) Feature 2: Top-max-row: The row number of the top_profile that contains the maximum
number of black pixels (black pixels, with value 0, correspond to the object and white pixels,
with value 1, correspond to the background).
iii) Feature 3: Bottom-max-row: The row number of the bottom_profile that contains the
maximum number of black pixels (black pixels, with value 0, correspond to the object and
white pixels, with value 1, correspond to the background).
iv) Feature 4: Top-horizontal-line: (i) Obtain the top-max-row from the top_profile. (ii) Find the
components whose number of black pixels is greater than threshold1 (threshold1 = half of the height
of the bounding box) and store the number of such components in the attribute horizontal-lines (hlines). (iii)
Compute the feature top-horizontal-line using equation (1.3) below:
$$\text{Top-horizontal-line} = \frac{\text{hlines} \times 100}{\text{tc}} \qquad (1.3)$$

where hlines represents the number of horizontal lines and tc represents the total number of components of
the top-max-row.
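The four features can be sketched in a few lines of Python. Several points here are assumptions rather than details confirmed by the text: foreground pixels are stored as 1 (the paper uses 0 for black object pixels), a "component" of the top-max-row is taken to be a maximal run of adjacent columns whose top-profile pixel lies in that row, the bounding-box height is taken as the number of rows of the word image, and the word is assumed to contain at least one foreground pixel. All function names are hypothetical.

```python
from collections import Counter


def top_profile(word):
    """Row index of the first foreground pixel in each column, scanning from the top."""
    rows = range(len(word))
    return [next((y for y in rows if word[y][x]), None) for x in range(len(word[0]))]


def bottom_profile(word):
    """Row index of the first foreground pixel in each column, scanning from the bottom."""
    rows = range(len(word) - 1, -1, -1)
    return [next((y for y in rows if word[y][x]), None) for x in range(len(word[0]))]


def max_row(profile):
    """Row that holds the largest number of profile pixels (empty columns are ignored)."""
    counts = Counter(row for row in profile if row is not None)
    return counts.most_common(1)[0][0]


def top_horizontal_line(word):
    """Percentage of long runs among all runs in the top-max-row, as in equation (1.3)."""
    profile = top_profile(word)
    row = max_row(profile)
    threshold1 = len(word) / 2  # half of the bounding-box height
    runs, length = [], 0
    for value in profile:
        if value == row:
            length += 1
        elif length:
            runs.append(length)
            length = 0
    if length:
        runs.append(length)
    hlines = sum(1 for n in runs if n > threshold1)
    return hlines * 100 / len(runs)


# Toy 6 x 8 word with a headline across the top row and two vertical strokes below it.
hindi_like = [[1] * 8] + [[0, 0, 1, 0, 0, 1, 0, 0] for _ in range(5)]
# Toy 6 x 8 word with three separate vertical strokes and no headline.
english_like = [[0, 1, 0, 0, 0, 0, 0, 0] for _ in range(2)] + \
               [[0, 1, 0, 0, 1, 0, 1, 0] for _ in range(4)]
features = (top_horizontal_line(hindi_like), top_horizontal_line(english_like))
```

The headline (shirorekha) produces one long run covering the whole word, so the feature is high for the first toy word and zero for the second.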
3.2.6) Heuristic script identification algorithm:
(Result is shown in Fig. 7)
Input: Pre-processed text words of Devanagari and English scripts
Output: Range of feature values
1. Compute top-profile
2. Compute bottom-profile
3. Compute features F3 & F4
4. Identify the script type as follows:
   If Top_max_row = Bottom_max_row OR
   Top_horizontal_line >= 60 then Script = "Hindi"
   else Script = "Others"
5. Return Script
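Step 4 of the algorithm translates directly into a small decision function. The sketch below assumes the three feature values have already been computed for a word; `identify_script` and the parameter name `line_threshold` are hypothetical, while the value 60 is the one given in the algorithm.

```python
def identify_script(top_max_row, bottom_max_row, top_horizontal_line, line_threshold=60):
    """Label a word as "Hindi" or "Others" using the heuristic rule of step 4."""
    if top_max_row == bottom_max_row or top_horizontal_line >= line_threshold:
        return "Hindi"
    return "Others"


labels = [
    identify_script(0, 5, 100.0),  # strong headline in the top-max-row
    identify_script(2, 5, 0.0),    # no headline, profiles peak in different rows
    identify_script(3, 3, 0.0),    # top and bottom profiles peak in the same row
]
```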

Figure 7: Hindi Identified words
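IV. RESULTS AND DISCUSSION
We compared the proposed algorithm with the edge-based algorithm and the connected-component
algorithm using projection profiles. For the comparison we used the precision and recall rates:

$$\text{Precision rate} = \frac{\text{correctly detected words}}{\text{correctly detected words} + \text{false positives}} \times 100 \qquad (1.4)$$

$$\text{Recall rate} = \frac{\text{correctly detected words}}{\text{correctly detected words} + \text{false negatives}} \times 100 \qquad (1.5)$$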
Precision rate takes into consideration the false positives, which are the non-text regions in
the image that have been detected by the algorithm as text regions. Recall rate takes into
consideration the false negatives, which are text words in the image that have not been detected by
the algorithm. Thus, precision and recall rates are useful measures of the accuracy of
each algorithm in locating correct text regions and eliminating non-text regions, as shown in
Table 1.1.
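As a worked example of the two measures, the sketch below computes them from counts of correctly detected words, false positives and false negatives, expressed as percentages. The function name and the counts in the usage line are hypothetical and are not taken from the experiments.

```python
def precision_recall(correct, false_positives, false_negatives):
    """Return (precision rate, recall rate) in percent."""
    precision = 100.0 * correct / (correct + false_positives)
    recall = 100.0 * correct / (correct + false_negatives)
    return precision, recall


# Hypothetical counts: 8 words found correctly, 2 non-text regions reported, 2 words missed.
precision, recall = precision_recall(correct=8, false_positives=2, false_negatives=2)
```

A detector that reports many non-text regions lowers only the precision rate, while one that misses words lowers only the recall rate, which is why both are reported per image.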
Table 1.1: Comparison of Results of Three Algorithms

Input Image   Edge based Algorithm      Connected component       Proposed Algorithm
                                        Based Algorithm
              Precision   Recall        Precision   Recall        Precision   Recall
              Rate (%)    Rate (%)      Rate (%)    Rate (%)      Rate (%)    Rate (%)
Image1        76.19       80.00         83.33       99.00         94.11       80.00
Image2        68.42       76.47         77.27       99.00         86.66       76.47
Image3        61.53       100.00        66.66       100.00        72.72       100.00
Image4        0.00        0.00          53.84       100.00        63.63       100.00
Image5        83.33       99.90         66.66       80.00         90.90       98.00

Table 1.2: Results of Identified Devanagari (Hindi) script from Images

Sr. No   Dataset Name   Hindi Words
                        Correct Classification   Misclassification   Rejection
1        Image1         100%                     0%                  0%
2        Image2         80.00%                   20.00%              0%
3        Image3         66.66%                   00.01%              33.33%
4        Image4         100%                     0%                  0%
5        Image5         100%                     0%                  0%
V. CONCLUSION

In this paper, we have presented an efficient and simple algorithm for extracting text
from image documents based on connected components. A morphological approach is applied,
after which the height and width of each bounding box are multiplied and the product is divided by the area.
By experimentation we fixed a threshold on this ratio to remove non-text regions from the image. The
proposed algorithm was tested on five images with the same font size and obtained an average
precision rate of 81.60% and recall rate of 90.89%. The text extracted from the image is passed on for script
identification. Using the heuristic search classifier we obtained a correct classification accuracy of
89.33%. In future work, we will test the algorithm on images with variable font sizes.
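The averages quoted above can be checked against the per-image values listed for the proposed algorithm in Table 1.1 and for correct classification in Table 1.2. The short sketch below does the arithmetic; the variable names are illustrative only.

```python
def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


# Proposed algorithm, Image1..Image5 (Table 1.1), values in percent.
proposed_precision = [94.11, 86.66, 72.72, 63.63, 90.90]
proposed_recall = [80.00, 76.47, 100.00, 100.00, 98.00]
# Correct classification of Hindi words, Image1..Image5 (Table 1.2), values in percent.
correct_classification = [100.00, 80.00, 66.66, 100.00, 100.00]

avg_precision = mean(proposed_precision)     # 408.02 / 5
avg_recall = mean(proposed_recall)           # 454.47 / 5
avg_correct = mean(correct_classification)   # 446.66 / 5
```

The three means come to about 81.60, 90.89 and 89.33 percent, in agreement with the figures reported in the conclusion.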
REFERENCES
1. Keechul Jung, Kwang In Kim and Anil K. Jain, “Text information extraction in images and
video: a survey”, The journal of the Pattern Recognition society, Vol. 37, Issue 5, pp. 977-997,
May 2004.
2. Chitrakala Gopalan and D. Manjula, “Contourlet Based Approach for Text Identification and
Extraction from Heterogeneous Textual Images”, International Journal of Electrical and
Electronics Engineering 2(8), pp. 491-500, 2008.
3. M. C. Padma and P. A. Vijaya, “Script Identification from Trilingual Documents using Profile
Based Features”, International Journal of Computer Science and Applications, Vol. 7 No. 4,
pp. 16 - 33, 2010.
4. Leon, M., Vilaplana, V., Gasull, A. and Marques, F., "Caption text extraction for indexing purposes
using a hierarchical region-based image model," 16th IEEE International Conference on Image
Processing (ICIP), Nov. 2009.
5. Yu. Zhong, K. Karu, A. K. Jain, “Locating text in complex color images,” 3rd International
Conference on Document Analysis and Recognition, vol. 1, pp. 146-149,1995.
6. Victor. Wu, R. Manmatha, E. M. Riseman, “Text Finder: an automatic system to detect and
recognize text in images”, IEEE Transactions on PAMI, vol. 21, pp. 1224-1228, 1999.
7. Jagath Samarabandu and Xiaoqing Liu, “An Edge-based Text Region Extraction Algorithm for
Indoor Mobile Robot Navigation”, World Academy of Science, Engineering and Technology , pp
382-389, 2007
8. Neha Gupta, V .K. Banga, “Image Segmentation for Text Extraction”, 2nd International Conference
on Electrical, Electronics and Civil Engineering (ICEECE'2012), Singapore, April 28-29, 2012
9. C. V. Jawahar, Pavan Kumar, S.S.Ravi Kiran, “A Bilingual OCR for Hindi-Telugu Documents and
its applications”, Proceedings of 7th International Conference on Document Analysis and
Recognition (ICDAR)- Aug 2003, Vol 1, pp 408-412,2003.
10. S. Basavaraj Patil and N V Subbareddy, “Neural network based system for script identification in
Indian documents”, Sadhana, Academy Proceedings in Engineering Sciences, Vol. 27, Part 1, pp.
83–97, February 2002.
11. K. Roy, U. Pal, and B. B. Chaudhuri, “Neural Network based Word wise Handwritten Script
Identification System for Indian Postal Automation”, Proceedings of ICISIP, International
Conference on IEEE, pp 240-245,2005.
12. Santanu Choudhury, Gaurav Harit, Shekar Madnani, R.B. Shet, “Identification of Scripts of Indian
Languages by Combining Trainable Classifiers”, ICVGIP, Dec.20-22, Bangalore, India, (2000).
13. M Swamy Das, D Sandhya Rani, C R K Reddy and A Govardhan, “Script identification from
Multilingual Telugu, Hindi and English Text Documents”, International Journal of Wisdom Based
Computing, Vol. 1 (3), December 2011.
14. M. M. Kodabagi and S. R. Karjol, “Script Identification from Printed Document Images using
Statistical Features”, International Journal of Computer Engineering & Technology (IJCET),
Volume 4, Issue 2, 2013, pp. 607 - 622, ISSN Print: 0976 – 6367, ISSN Online: 0976 – 6375.
15. R. Edbert Rajan and Dr.K.Prasadh, “Spatial and Hierarchical Feature Extraction Based on Sift for
Medical Images”, International Journal of Computer Engineering & Technology (IJCET),
Volume 3, Issue 2, 2012, pp. 308 - 322, ISSN Print: 0976 – 6367, ISSN Online: 0976 – 6375.
16. M. M. Kodabagi, S. A. Angadi and Chetana. R. Shivanagi, “Character Recognition of Kannada
Text in Scene Images using Neural Network”, International Journal of Graphics and Multimedia
(IJGM), Volume 4, Issue 1, 2013, pp. 9 - 19, ISSN Print: 0976 – 6448, ISSN Online: 0976 –6456.
17. Patange V.V and Prof. Deshmukh B.T, “Visual Acknowledgement [O.C.R.] – A Method to Identify
the Printed Characters”, International Journal of Computer Engineering & Technology (IJCET),
Volume 3, Issue 2, 2012, pp. 108 - 114, ISSN Print: 0976 – 6367, ISSN Online: 0976 – 6375.
18. M. M. Kodabagi, S. A. Angadi and Anuradha. R. Pujari, “Text Region Extraction from Low
Resolution Display Board Images using Wavelet Features”, International Journal of Information
Technology and Management Information Systems (IJITMIS), Volume 4, Issue 1, 2013,
pp. 38 - 49, ISSN Print: 0976 – 6405, ISSN Online: 0976 – 6413.

231

Mais conteúdo relacionado

Mais procurados

Fuzzy rule based classification and recognition of handwritten hindi
Fuzzy rule based classification and recognition of handwritten hindiFuzzy rule based classification and recognition of handwritten hindi
Fuzzy rule based classification and recognition of handwritten hindi
IAEME Publication
 
Handwritten Text Recognition and Digital Text Conversion
Handwritten Text Recognition and Digital Text ConversionHandwritten Text Recognition and Digital Text Conversion
Handwritten Text Recognition and Digital Text Conversion
ijtsrd
 
Fusion technology for robust human skin detection 2
Fusion technology for robust human skin detection 2Fusion technology for robust human skin detection 2
Fusion technology for robust human skin detection 2
IAEME Publication
 
Skin Detection Based on Color Model and Low Level Features Combined with Expl...
Skin Detection Based on Color Model and Low Level Features Combined with Expl...Skin Detection Based on Color Model and Low Level Features Combined with Expl...
Skin Detection Based on Color Model and Low Level Features Combined with Expl...
IJERA Editor
 

Mais procurados (20)

RECOGNITION AND CONVERSION OF HANDWRITTEN MODI CHARACTERS
RECOGNITION AND CONVERSION OF HANDWRITTEN MODI CHARACTERSRECOGNITION AND CONVERSION OF HANDWRITTEN MODI CHARACTERS
RECOGNITION AND CONVERSION OF HANDWRITTEN MODI CHARACTERS
 
An Efficient Segmentation Technique for Machine Printed Devanagiri Script: Bo...
An Efficient Segmentation Technique for Machine Printed Devanagiri Script: Bo...An Efficient Segmentation Technique for Machine Printed Devanagiri Script: Bo...
An Efficient Segmentation Technique for Machine Printed Devanagiri Script: Bo...
 
IRJET- Image based Information Retrieval
IRJET- Image based Information RetrievalIRJET- Image based Information Retrieval
IRJET- Image based Information Retrieval
 
H018124360
H018124360H018124360
H018124360
 
Low level features for image retrieval based
Low level features for image retrieval basedLow level features for image retrieval based
Low level features for image retrieval based
 
Text extraction from images
Text extraction from imagesText extraction from images
Text extraction from images
 
BrailleOCR: An Open Source Document to Braille Converter Application
BrailleOCR: An Open Source Document to Braille Converter ApplicationBrailleOCR: An Open Source Document to Braille Converter Application
BrailleOCR: An Open Source Document to Braille Converter Application
 
Fuzzy rule based classification and recognition of handwritten hindi
Fuzzy rule based classification and recognition of handwritten hindiFuzzy rule based classification and recognition of handwritten hindi
Fuzzy rule based classification and recognition of handwritten hindi
 
Ocr 1
Ocr 1Ocr 1
Ocr 1
 
Text Extraction from Image using Python
Text Extraction from Image using PythonText Extraction from Image using Python
Text Extraction from Image using Python
 
A bidirectional text transcription of braille for odia, hindi, telugu and eng...
A bidirectional text transcription of braille for odia, hindi, telugu and eng...A bidirectional text transcription of braille for odia, hindi, telugu and eng...
A bidirectional text transcription of braille for odia, hindi, telugu and eng...
 
Bi4103375380
Bi4103375380Bi4103375380
Bi4103375380
 
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
A Texture Based Methodology for Text Region Extraction from Low Resolution Na...
 
Anatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern DetectionAnatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern Detection
 
SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT
SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXTSEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT
SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT
 
Ijetcas14 399
Ijetcas14 399Ijetcas14 399
Ijetcas14 399
 
Handwritten Text Recognition and Digital Text Conversion
Handwritten Text Recognition and Digital Text ConversionHandwritten Text Recognition and Digital Text Conversion
Handwritten Text Recognition and Digital Text Conversion
 
Script Identification of Text Words from a Tri-Lingual Document Using Voting ...
Script Identification of Text Words from a Tri-Lingual Document Using Voting ...Script Identification of Text Words from a Tri-Lingual Document Using Voting ...
Script Identification of Text Words from a Tri-Lingual Document Using Voting ...
 
Fusion technology for robust human skin detection 2
Fusion technology for robust human skin detection 2Fusion technology for robust human skin detection 2
Fusion technology for robust human skin detection 2
 
Skin Detection Based on Color Model and Low Level Features Combined with Expl...
Skin Detection Based on Color Model and Low Level Features Combined with Expl...Skin Detection Based on Color Model and Low Level Features Combined with Expl...
Skin Detection Based on Color Model and Low Level Features Combined with Expl...
 

Destaque (20)

50120130405027 2-3
50120130405027 2-350120130405027 2-3
50120130405027 2-3
 
20120130406026
2012013040602620120130406026
20120130406026
 
The dela rosa family
The dela rosa familyThe dela rosa family
The dela rosa family
 
50120130405016 2
50120130405016 250120130405016 2
50120130405016 2
 
10120130405012
1012013040501210120130405012
10120130405012
 
Doba revolucija
Doba revolucijaDoba revolucija
Doba revolucija
 
50120130405009
5012013040500950120130405009
50120130405009
 
20120140503001
2012014050300120120140503001
20120140503001
 
20120140502014 2
20120140502014 220120140502014 2
20120140502014 2
 
50120140502017
5012014050201750120140502017
50120140502017
 
10120140501012
1012014050101210120140501012
10120140501012
 
30120140502004 2
30120140502004 230120140502004 2
30120140502004 2
 
40120140502010
4012014050201040120140502010
40120140502010
 
30120140502014
3012014050201430120140502014
30120140502014
 
30120140502013
3012014050201330120140502013
30120140502013
 
Não Espere
Não EspereNão Espere
Não Espere
 
Real.UI バスや電車の利用を簡単にするスマホアプリ
Real.UI バスや電車の利用を簡単にするスマホアプリReal.UI バスや電車の利用を簡単にするスマホアプリ
Real.UI バスや電車の利用を簡単にするスマホアプリ
 
Pasos para Slideshare.net / bam-s.xxi
Pasos para Slideshare.net / bam-s.xxiPasos para Slideshare.net / bam-s.xxi
Pasos para Slideshare.net / bam-s.xxi
 
Auditoria
AuditoriaAuditoria
Auditoria
 
El filtro de amigos
El filtro de amigosEl filtro de amigos
El filtro de amigos
 

Semelhante a 50120130405026

Dimension Reduction for Script Classification - Printed Indian Documents
Dimension Reduction for Script Classification - Printed Indian DocumentsDimension Reduction for Script Classification - Printed Indian Documents
Dimension Reduction for Script Classification - Printed Indian Documents
ijait
 
Feature extraction techniques on cbir a review
Feature extraction techniques on cbir a reviewFeature extraction techniques on cbir a review
Feature extraction techniques on cbir a review
IAEME Publication
 
Text region extraction from low resolution display board ima
Text region extraction from low resolution display board imaText region extraction from low resolution display board ima
Text region extraction from low resolution display board ima
IAEME Publication
 
Fuzzy rule based classification and recognition of handwritten hindi
Fuzzy rule based classification and recognition of handwritten hindiFuzzy rule based classification and recognition of handwritten hindi
Fuzzy rule based classification and recognition of handwritten hindi
IAEME Publication
 
Script identification using dct coefficients 2
Script identification using dct coefficients 2Script identification using dct coefficients 2
Script identification using dct coefficients 2
IAEME Publication
 
Script identification from printed document images using statistical
Script identification from printed document images using statisticalScript identification from printed document images using statistical
Script identification from printed document images using statistical
IAEME Publication
 
08 8879 10060-1-sm (ijict sj) edit iqbal
08 8879 10060-1-sm (ijict sj) edit iqbal08 8879 10060-1-sm (ijict sj) edit iqbal
08 8879 10060-1-sm (ijict sj) edit iqbal
IAESIJEECS
 
Recognition of basic kannada characters in scene images using euclidean dis
Recognition of basic kannada characters in scene images using euclidean disRecognition of basic kannada characters in scene images using euclidean dis
Recognition of basic kannada characters in scene images using euclidean dis
IAEME Publication
 

Semelhante a 50120130405026 (20)

Dimension Reduction for Script Classification - Printed Indian Documents
Dimension Reduction for Script Classification - Printed Indian DocumentsDimension Reduction for Script Classification - Printed Indian Documents
Dimension Reduction for Script Classification - Printed Indian Documents
 
DIMENSION REDUCTION FOR SCRIPT CLASSIFICATION- PRINTED INDIAN DOCUMENTS
DIMENSION REDUCTION FOR SCRIPT CLASSIFICATION- PRINTED INDIAN DOCUMENTSDIMENSION REDUCTION FOR SCRIPT CLASSIFICATION- PRINTED INDIAN DOCUMENTS
DIMENSION REDUCTION FOR SCRIPT CLASSIFICATION- PRINTED INDIAN DOCUMENTS
 
Feature extraction techniques on cbir a review
Feature extraction techniques on cbir a reviewFeature extraction techniques on cbir a review
Feature extraction techniques on cbir a review
 
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367 (Print), ISSN 0976-6375 (Online), Volume 4, Issue 5, September - October (2013), pp. 224-231, © IAEME: www.iaeme.com/ijcet.asp. Journal Impact Factor (2013): 6.1302 (Calculated by GISI), www.jifactor.com

IDENTIFICATION OF DEVANAGARI SCRIPT FROM IMAGE DOCUMENT

SATISH R. DAMADE1, K. P. ADHIYA2 and RANJANA S. ZINJORE3
1 Computer Engineering, KBS's College of Engineering & Technology, North Maharashtra Knowledge city, Jalgaon.
2 Computer Engineering, SSBT's College of Engineering & Technology, Bambhori, Jalgaon.
3 Computer Application, KCES's Institute of Management and Research, Jalgaon.

ABSTRACT
Text that appears in an image carries useful and important information. Optical Character Recognition technology is restricted to finding text printed against clean backgrounds and cannot handle text printed against shaded or textured backgrounds or embedded in images. Extracting the text from such images is therefore necessary, and it helps blind and visually impaired people when a voice synthesizer is attached to the system. In this paper, we present a methodology for extracting text from a printed image document and then identifying Devanagari script (Hindi language) in the extracted text. First, we use a morphological approach to extract the text from image documents. The resulting text image is passed to Optical Character Recognition for identification. A projection profile is used for segmentation, followed by a visual discriminating approach for feature extraction. Finally, heuristic search is used for classification. The result of the proposed text extraction method is compared with the edge based approach and the connected component with projection profile approach.
After comparison using precision and recall rates, it is observed that the proposed algorithm works well.

Keywords: Area, Bounding Box, Canny edge detector, Heuristic Search, Projection Profile, Visual Discriminating feature.

I. INTRODUCTION
In recent years, the escalating use of physical documents has driven progress towards the creation of electronic documents, which facilitate easy communication and storage. Nowadays, information is increasingly enriched by multimedia components containing images and video in addition to textual information. The extraction of text in an image is a classical problem in the computer vision research area. Text extraction from images and video finds many applications in document processing, vehicle license plate detection, mobile robot navigation, object identification, text in WWW images, content based image retrieval from image databases and video content analysis [1]. There are basically three kinds of images: document images, scene text images and caption text images. Two approaches are mainly used for extracting text from these images: the region based approach and the texture based approach [2].

After the text has been extracted from the image document, script identification plays a vital role in the design of Optical Character Recognition. Script identification is a key step in document image analysis, especially when the environment is multi-script, and language identification is required to distinguish the different languages that share the same script. In India, script identification facilitates many important applications such as sorting images, selecting the appropriate script-specific text understanding system and searching online archives of document images containing a particular script [3].

In this paper we use the Hindi language for identification because Hindi is the third most spoken language in the world after Chinese and English, with approximately 500 million people all over the world who speak and write it. In addition, many forms and applications are available in a combination of a state official language and English. We use printed images consisting of Hindi and English text and identify the Hindi text in such image documents.
Hindi is written in the Devanagari script, which consists of 12 vowels and 34 consonants; a horizontal line called the Shirorekha runs along the upper part of the characters. The English alphabet is a Latin-based alphabet consisting of 26 letters, each with upper and lower case forms. The structure of the English alphabet contains more vertical and slanted strokes.

II. CHALLENGES AND RELATED WORK
Text extraction from complex images is one of the most useful and difficult applications of pattern recognition and computer vision. Identifying the script from the extracted image text is also a very difficult task because of the similar shapes of the characters of a script. Leon et al. [4] presented a technique for detecting caption text for indexing purposes. Caption text objects are detected by combining texture and geometric features, and textured areas are detected using wavelet analysis. Zhong et al. [5] located text in complex images such as compact disc covers, book covers and traffic scenes. To find the text location, the authors used the higher spatial variance of the image intensity along horizontal text lines. Wu et al. [6] proposed a four-step system which automatically detects and extracts text in images: texture segmentation, in which the image is filtered using a bank of linear filters, followed by stroke extraction, drawing a rectangular box around the text and finally detecting the text. Samarabandu and Liu [7] used an edge based approach for extracting text, generating a feature map from three important properties of edges: edge strength, density and variance of orientation. Neha Gupta and V. K. Banga [8] proposed a method of image segmentation for text extraction based on the 2D discrete wavelet transform, which decomposes the image into four sub-bands. The edges of three sub-bands are then fused to create candidate text regions, a projection profile approach is applied, and the text is extracted based on a threshold. C. V. Jawahar et al. [9] proposed a technique to distinguish between Hindi and Telugu script.
For Hindi, segmentation involves the removal of the shirorekha. For Telugu, component extraction implies the separation of connected components. S. Basavaraj Patil [10] presents an approach for identifying Hindi, English and Kannada scripts. For feature extraction, the input image is dilated using 3x3 masks in the horizontal, vertical, left diagonal and right diagonal directions, the average pixel distribution of the resulting images is computed, and a neural network is used for classification. Pal and Chaudhuri [11] proposed an automatic technique for separating text lines from 12 Indian scripts (English, Devanagari, Bangla, Gujarati, Tamil, Kashmiri, Malayalam, Oriya, Punjabi, Telugu and Urdu) using ten triplets formed by grouping English and Devanagari with any one of the other scripts. Santanu Choudhuri et al. [12] proposed a method for identification of Indian languages that combines a Gabor filter based technique with a direction distance histogram classifier, considering Hindi, English, Malayalam, Bengali, Telugu and Urdu.

III. PROPOSED ARCHITECTURE AND METHODOLOGY
3.1 Proposed Architecture: The proposed architecture for identification of Hindi script from an image document is shown in Fig. 1.

Figure 1: Proposed Architecture for Identification of Hindi script from Image Document. Flowchart stages: Input (color image with complex background); Preprocessing; Detection of text region; Display the text region and remove the non-text region, which gives a resultant image; Improve the quality of the text (subtract the resultant image with the input image); Output (final image containing text); Pass the extracted text for script identification; Segmentation of text into lines and words; Feature extraction; Heuristic search; Identification of Hindi script (identified Hindi script).

3.2 Methodology
3.2.1) Preprocessing:
i) In this step we convert the image from RGB color space (Fig. 2) into a gray scale image. The gray scale image is converted into a binary image using Otsu's thresholding.
ii) Edge detection and morphological dilation: The Canny method is used for edge extraction. The edge image is dilated using a square structuring element. The dilation of A by B is denoted by:
$$A \oplus B = \{\, z \mid (\hat{B})_z \cap A \neq \emptyset \,\} \qquad (1.1)$$
where $\hat{B}$ is the reflection of the structuring element B about its origin and $(\hat{B})_z$ is its translation by z.
iii) Hole filling: Hole filling is determined by the selection of marker and mask images. In the standard formulation by morphological reconstruction, the complement of the binary image I serves as the mask and the marker image F is:
$$F(x,y) = \begin{cases} 1 - I(x,y) & \text{if } (x,y) \text{ is on the border of } I \\ 0 & \text{otherwise} \end{cases} \qquad (1.2)$$
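The binarization and dilation steps of Section 3.2.1 can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the authors' MATLAB implementation: images are assumed to be lists of rows of integers, the function names `otsu_threshold`, `binarize` and `dilate` are hypothetical, and the structuring-element side `k=3` is an assumed value because the text does not give the size. Canny edge detection is not shown.

```python
def otsu_threshold(gray):
    """Return the threshold t (0..255) that maximises between-class variance.

    gray: list of rows of ints in 0..255. Pixels <= t form one class,
    pixels > t the other.
    """
    hist = [0] * 256
    for row in gray:
        for value in row:
            hist[value] += 1
    total = sum(hist)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels with value <= t
    sum0 = 0  # intensity sum of those pixels
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        var_between = w0 * w1 * (mean0 - mean1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(gray, t):
    """Map pixels above the threshold to 1 and all others to 0."""
    return [[1 if value > t else 0 for value in row] for row in gray]


def dilate(binary, k=3):
    """Dilate a 0/1 image with a k x k square structuring element (k odd).

    k=3 is an assumed size; the paper does not state the one it used.
    """
    h, w = len(binary), len(binary[0])
    r = k // 2
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if binary[y][x]:
                # A foreground pixel switches on its whole k x k neighbourhood.
                for yy in range(max(0, y - r), min(h, y + r + 1)):
                    for xx in range(max(0, x - r), min(w, x + r + 1)):
                        out[yy][xx] = 1
    return out
```

For a symmetric square structuring element, switching on the neighbourhood of every foreground pixel gives the same result as the set definition of dilation in equation (1.1).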
3.2.2) Detection of Text Region
i) The dilated image is labeled using bwlabel in MATLAB with 8-way connectivity. To obtain the measurements ('BoundingBox' and 'Area') of each image region we used regionprops in MATLAB (Fig. 3).
ii) To extract the text regions, we compute a new value by multiplying the height and width of the bounding box and then divide this new value by the Area. By experimentation it was found that if the ratio (new-value/area) is less than 1.78 and the height is greater than 9, the region is a text region (the specific condition).

Figure 2: Input color Image
Figure 3: Bounding box

3.2.3) Displaying the Text Region and Removing the Non-text Region
i) We find the connected components (CC) of the binary image using the bwconncomp function in MATLAB.
ii) We obtain the size of the dilated image and set all of its values to false to make a blank background.
iii) Each connected component that satisfies the specific condition above is set to true using PixelIdxList.
iv) The visual quality of the resultant image is very poor. To improve it, we subtract the resultant image with the input image (Fig. 4).

Figure 4: Final Result
Figure 5: Segmentation of text into line
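The region test of Sections 3.2.2 and 3.2.3 can be illustrated without MATLAB. The sketch below is an assumption-laden stand-in for bwlabel, regionprops and bwconncomp: it labels 8-connected components with a breadth-first search, measures each bounding box and area, and keeps only the components that satisfy the paper's condition (ratio below 1.78 and height above 9). The function names and the list-of-rows image representation are hypothetical.

```python
from collections import deque


def connected_components(binary):
    """Return the 8-connected foreground components as lists of (row, col)."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    components = []
    for y in range(h):
        for x in range(w):
            if not binary[y][x] or seen[y][x]:
                continue
            seen[y][x] = True
            queue = deque([(y, x)])
            pixels = []
            while queue:
                cy, cx = queue.popleft()
                pixels.append((cy, cx))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and binary[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            components.append(pixels)
    return components


def is_text_region(pixels, max_ratio=1.78, min_height=9):
    """Apply the paper's condition: (height * width) / area < 1.78 and height > 9."""
    rows = [p[0] for p in pixels]
    cols = [p[1] for p in pixels]
    height = max(rows) - min(rows) + 1
    width = max(cols) - min(cols) + 1
    area = len(pixels)  # number of foreground pixels in the component
    return (height * width) / area < max_ratio and height > min_height


def keep_text_regions(binary):
    """Start from a blank image and switch on only the text-like components."""
    h, w = len(binary), len(binary[0])
    out = [[0] * w for _ in range(h)]
    for pixels in connected_components(binary):
        if is_text_region(pixels):
            for y, x in pixels:
                out[y][x] = 1
    return out
```

A ratio close to 1 means the component fills most of its bounding box, which is what a dilated block of text tends to do; thin diagonal strokes or sparse noise give a much larger ratio and are discarded.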
3.2.4) Segmentation of Text into Lines and Words
i) The document image is segmented into text lines using the horizontal projection profile, computed as a row-wise sum of black pixels. We then find the valleys between the minimum and maximum points of the histogram and draw a line (cut point) at each minimum point across the full size of the document, as shown in Fig. 5.
ii) After line segmentation we use the vertical projection profile with a threshold value (which preserves the inter-character gap) for word segmentation of the bilingual (Devanagari - English) text. The words obtained are thinned for feature extraction, and a bounding box is fitted to each word by locating its first pixel on the left, right, top and bottom. The extracted words are inverted for feature extraction, as shown in Fig. 6.

Figure 6: Word segmented Image

3.2.5) Script Identification
The distinct features used for script identification are:
i) Feature 1: Top_profile and Bottom_profile: The top_profile (bottom_profile) of a text line is the set of black pixels obtained by scanning each column of the text line from the top (bottom) until the first black pixel is reached. Thus, a component of width N yields N such pixels.
ii) Feature 2: Top-max-row: The row number of the top_profile at which the maximum number of black pixels lies (black pixels with value 0 correspond to the object and white pixels with value 1 correspond to the background).
iii) Feature 3: Bottom-max-row: The row number of the bottom_profile at which the maximum number of black pixels lies (black pixels with value 0 correspond to the object and white pixels with value 1 correspond to the background).
iv) Feature 4: Top-horizontal-line:
(i) Obtain the top-max-row from the top-profile.
(ii) Find the components whose number of black pixels is greater than threshold1 (threshold1 = half of the height of the bounding box) and store the number of such components in the attribute horizontal-lines.
(iii) Compute the feature top-horizontal-line using equation (1.3):
$$\text{Top-horizontal-line} = \frac{\text{hlines} \times 100}{\text{tc}} \qquad (1.3)$$
where hlines is the number of horizontal lines and tc is the total number of components of the top-max-row.
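The four features above can be made concrete with a small sketch. It rests on several assumptions that are not in the paper: a word image is a list of rows cropped to its bounding box, ink pixels are stored as 1 (the paper stores them as 0), a "component" of the top-max-row is read as a run of consecutive columns whose top-profile pixel lies on that row, and all function names are hypothetical. The last function applies the decision rule that the heuristic algorithm of Section 3.2.6 states.

```python
from collections import Counter


def top_profile(word):
    """Row index of the first ink pixel in each column, scanning from the top."""
    h, w = len(word), len(word[0])
    return [next((y for y in range(h) if word[y][x]), None) for x in range(w)]


def bottom_profile(word):
    """Row index of the first ink pixel in each column, scanning from the bottom."""
    h, w = len(word), len(word[0])
    return [next((y for y in range(h - 1, -1, -1) if word[y][x]), None)
            for x in range(w)]


def max_row(profile):
    """Row that holds the largest number of profile pixels (None if empty)."""
    counts = Counter(y for y in profile if y is not None)
    return counts.most_common(1)[0][0] if counts else None


def top_horizontal_line(word):
    """Equation (1.3): percentage of long runs among all runs on the top-max-row."""
    profile = top_profile(word)
    row = max_row(profile)
    if row is None:
        return 0.0
    runs, length = [], 0
    for y in profile:
        if y == row:
            length += 1
        elif length:
            runs.append(length)
            length = 0
    if length:
        runs.append(length)
    threshold1 = len(word) / 2  # half of the bounding-box height
    hlines = sum(1 for run in runs if run > threshold1)
    return hlines * 100 / len(runs)


def identify_script(word):
    """Apply the rule of Section 3.2.6 to one word image."""
    top = max_row(top_profile(word))
    bottom = max_row(bottom_profile(word))
    if top is None:
        return "Others"
    if top == bottom or top_horizontal_line(word) >= 60:
        return "Hindi"
    return "Others"
```

In this reading, a word with an unbroken headline produces one long run on the top-max-row, so the feature reaches 100, while isolated Latin letter tops produce short runs and a value near 0.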
3.2.6) Heuristic script identification algorithm (the result is shown in Fig. 7)
Input: Pre-processed text words of Devanagari and English scripts
Output: Script type, decided from the range of feature values
1. Compute top-profile
2. Compute bottom-profile
3. Compute features F2, F3 and F4
4. Identify the script type as follows:
   If Top_max_row = Bottom_max_row OR Top_horizontal_line >= 60 then Script = "Hindi" else Script = "Others"
5. Return Script

Figure 7: Hindi Identified words

IV. RESULTS AND DISCUSSION
We compared the proposed algorithm with the edge based algorithm and the connected component algorithm using projection profile. For the comparison we used the precision and recall rates:
$$\text{Precision rate} = \frac{\text{correctly detected words}}{\text{correctly detected words} + \text{false positives}} \times 100 \qquad (1.4)$$
$$\text{Recall rate} = \frac{\text{correctly detected words}}{\text{correctly detected words} + \text{false negatives}} \times 100 \qquad (1.5)$$
The precision rate takes into consideration the false positives, which are the non-text regions in the image that have been detected by the algorithm as text regions. The recall rate takes into consideration the false negatives, which are text words in the image that have not been detected by the algorithm. Thus, precision and recall rates are useful measures of the accuracy of each algorithm in locating correct text regions and eliminating non-text regions, as shown in Table 1.1.
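As a small worked illustration of the two measures, the sketch below computes precision and recall from word counts and then averages the per-image rates that Table 1.1 reports for the proposed algorithm. The function names and the example counts (16 correct, 1 false positive, 4 missed) are hypothetical and are not taken from the paper; only the two lists of rates come from the table.

```python
def precision_rate(correct, false_positives):
    """Percentage of detected regions that really are text."""
    detected = correct + false_positives
    return 100.0 * correct / detected if detected else 0.0


def recall_rate(correct, false_negatives):
    """Percentage of the text words in the image that were detected."""
    actual = correct + false_negatives
    return 100.0 * correct / actual if actual else 0.0


def mean(values):
    return sum(values) / len(values)


# Hypothetical counts for a single image (not from the paper).
example_precision = precision_rate(correct=16, false_positives=1)
example_recall = recall_rate(correct=16, false_negatives=4)

# Per-image rates of the proposed algorithm, as listed in Table 1.1.
proposed_precision = [94.11, 86.66, 72.72, 63.63, 90.90]
proposed_recall = [80.00, 76.47, 100.00, 100.00, 98.00]

average_precision = mean(proposed_precision)
average_recall = mean(proposed_recall)
```

The averages of the tabulated values come to about 81.60 and 90.89, which agrees with the figures quoted in the conclusion. The guard against a zero denominator covers cases where an algorithm detects nothing at all.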
Table 1.1: Comparison of Results of Three Algorithms (precision and recall rates in %)

               Edge based Algorithm     Connected component Based Algorithm     Proposed Algorithm
Input Image    Precision    Recall      Precision    Recall                     Precision    Recall
Image1         76.19        80.00       83.33        99.00                      94.11        80.00
Image2         68.42        76.47       77.27        99.00                      86.66        76.47
Image3         61.53        100.00      66.66        100.00                     72.72        100.00
Image4         0.00         0.00        53.84        100.00                     63.63        100.00
Image5         83.33        99.90       66.66        80.00                      90.90        98.00

Table 1.2: Results of Identified Devanagari (Hindi) script from Images (Hindi words)

Sr. No    Dataset Name    Correct Classification    Misclassification    Rejection
1         Image1          100%                      0%                   0%
2         Image2          80.00%                    20.00%               0%
3         Image3          66.66%                    0.01%                33.33%
4         Image4          100%                      0%                   0%
5         Image5          100%                      0%                   0%

V. CONCLUSION
In this paper, we have presented an efficient and simple algorithm, based on connected components, for extracting text from an image document. A morphological approach is applied, after which the height and width of each bounding box are multiplied and the result is divided by the area. By experimentation we fixed a threshold on this value to remove non-text regions from the image. The proposed algorithm was tested on five images with the same font size and obtained an average precision rate of 81.60% and an average recall rate of 90.89%. The text extracted from the image is passed on for script identification. Using the heuristic search classifier we obtained a correct classification accuracy of 89.33%. In future, we will test the algorithm on images with variable font sizes.

REFERENCES
1. Keechul Jung, Kwang In Kim and Anil K. Jain, "Text information extraction in images and video: a survey", Pattern Recognition (The Journal of the Pattern Recognition Society), Vol. 37, Issue 5, pp. 977-997, May 2004.
2. Chitrakala Gopalan and D. Manjula, "Contourlet Based Approach for Text Identification and Extraction from Heterogeneous Textual Images", International Journal of Electrical and Electronics Engineering 2(8), pp. 491-500, 2008.
  • 8. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 4, Issue 5, September - October (2013), © IAEME 3. M. C. Padma and P.A. Vijaya, “Script Identification form Trilingual Documents using profile Based Features”, International Journal of Computer Science and Applications, Vol. 7 No. 4, pp. 16 - 33 , 2010. 4. Leon, M., Vilaplana, V., Gasull, A. and Marques, F., "Caption text extraction for indexing purposes using a hierarchical region-based image model," 16th IEEE International Conference on Image Processing (ICIP), Nov. 2009. 5. Yu. Zhong, K. Karu, A. K. Jain, “Locating text in complex color images,” 3rd International Conference on Document Analysis and Recognition, vol. 1, pp. 146-149,1995. 6. Victor. Wu, R. Manmatha, E. M. Riseman, “Text Finder: an automatic system to detect and recognize text in images”, IEEE Transactions on PAMI, vol. 21, pp. 1224-1228, 1999. 7. Jagath Samarabandu and Xiaoqing Liu, “An Edge-based Text Region Extraction Algorithm for Indoor Mobile Robot Navigation”, World Academy of Science, Engineering and Technology , pp 382-389, 2007 8. Neha Gupta, V .K. Banga, “Image Segmentation for Text Extraction”, 2nd International Conference on Electrical, Electronics and Civil Engineering (ICEECE'2012), Singapore, April 28-29, 2012 9. C. V. Jawahar, Pavan Kumar, S.S.Ravi Kiran, “A Bilingual OCR for Hindi-Telugu Documents and its applications”, Proceedings of 7th International Conference on Document Analysis and Recognition (ICDAR)- Aug 2003, Vol 1, pp 408-412,2003. 10. S. Basavaraj Patil and N V Subbareddy, “Neural network based system for script identification in Indian documents”, Sadhana, Academy Proceedings in Engineering Sciences, Vol. 27, Part 1, pp. 83–97, , February 2002 11. K. Roy, U. Pal, and B. B. 
Chaudhuri, “Neural Network based Word wise Handwritten Script Identification System for Indian Postal Automation”, Proceedings of ICISIP, International Conference on IEEE, pp 240-245,2005. 12. Santanu Choudhury, Gaurav Harit, Shekar Madnani, R.B. Shet, “Identification of Scripts of Indian Languages by Combining Trainable Classifiers”, ICVGIP, Dec.20-22, Bangalore, India, (2000). 13. M Swamy Das, D Sandhya Rani, C R K Reddy and A Govardhan, “Script identification from Multilingual Telugu, Hindi and English Text Documents”, International Journal of Wisdom Based Computing, Vol. 1 (3), December 2011. 14. M. M. Kodabagi and S. R. Karjol, “Script Identification from Printed Document Images using Statistical Features”, International Journal of Computer Engineering & Technology (IJCET), Volume 4, Issue 2, 2013, pp. 607 - 622, ISSN Print: 0976 – 6367, ISSN Online: 0976 – 6375. 15. R. Edbert Rajan and Dr.K.Prasadh, “Spatial and Hierarchical Feature Extraction Based on Sift for Medical Images”, International Journal of Computer Engineering & Technology (IJCET), Volume 3, Issue 2, 2012, pp. 308 - 322, ISSN Print: 0976 – 6367, ISSN Online: 0976 – 6375. 16. M. M. Kodabagi, S. A. Angadi and Chetana. R. Shivanagi, “Character Recognition of Kannada Text in Scene Images using Neural Network”, International Journal of Graphics and Multimedia (IJGM), Volume 4, Issue 1, 2013, pp. 9 - 19, ISSN Print: 0976 – 6448, ISSN Online: 0976 –6456. 17. Patange V.V and Prof. Deshmukh B.T, “Visual Acknowledgement [O.C.R.] – A Method to Identify the Printed Characters”, International Journal of Computer Engineering & Technology (IJCET), Volume 3, Issue 2, 2012, pp. 108 - 114, ISSN Print: 0976 – 6367, ISSN Online: 0976 – 6375. 18. M. M. Kodabagi, S. A. Angadi and Anuradha. R. Pujari, “Text Region Extraction from Low Resolution Display Board Images using Wavelet Features”, International Journal of Information Technology and Management Information Systems (IJITMIS), Volume 4, Issue 1, 2013, pp. 
38 - 49, ISSN Print: 0976 – 6405, ISSN Online: 0976 – 6413. 231