Introduction:
• The project converts scanned text into equivalent speech.
• The simulation is done in MATLAB.
• The input image is in .jpeg format.
• The output speech is in .wav format.
• The language of the text is English.
Introduction to MATLAB:
• MATLAB stands for Matrix Laboratory.
• It is a high-performance language for technical computing.
• Its basic data element is a matrix.
• Typical uses:
1). Math and computation
2). Algorithm development
3). Modeling, Simulation
4). Data analysis, exploration, and visualization
5). Scientific and engineering graphics
6). Application development, including building graphical user
interfaces.
What is Image Processing?
• Image processing is the study of any algorithm that takes an image as input
and returns an image as output.
• An image may be defined as a two-dimensional function, f(x, y).
• x and y are spatial coordinates, and the amplitude of f at any pair of
coordinates (x, y) is called the intensity of the image at that point.
• When the values of x, y and f are all finite, discrete quantities, the image is
called a digital image.
• A digital image is composed of a finite number of elements, each of which has a
particular location and value. These elements are referred to as picture elements,
image elements or pixels.
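As a small illustration of the definition above, a digital image can be held as a two-dimensional array in which f(x, y) is the value stored at row x, column y. The sketch below uses plain Python lists purely for illustration (the project itself works in MATLAB, where indexing is 1-based); the pixel values and the names `image` and `f` are made up for this example.

```python
# A tiny 3x4 grayscale "image": each entry is one pixel intensity (0-255).
# The values are invented for illustration only.
image = [
    [0, 50, 100, 150],
    [10, 60, 110, 160],
    [20, 70, 120, 170],
]


def f(x, y):
    """Intensity of the image at row x, column y (0-based indices)."""
    return image[x][y]


rows, cols = len(image), len(image[0])
print("number of pixels:", rows * cols)   # finite number of elements
print("intensity at (1, 2):", f(1, 2))    # value at one location
```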
Brief Description:
The project can be divided into three parts:
(1.) Scanning the text to be converted
(2.) Optical character recognition
(3.) Conversion of the recognized words to speech
Step 1:
• The text to be converted is scanned by a scanner and saved in
.jpeg format.
• All further processing is then done in MATLAB.
Step 2: (Optical Character Recognition)
• Optical Character Recognition, usually abbreviated to OCR, is the
electronic translation of scanned images of handwritten, typewritten or
printed text into machine-encoded text.
• OCR is widely used to convert books and documents into electronic files, to
computerize a record-keeping system in an office, or to publish the text on a
website.
Block diagram of OCR
TEXT AREA DETECTION
TEXT AREA PROCESSING
BINARY OUTPUT
Algorithm used in OCR
Convert the RGB image to a grayscale image and display it
Convert the grayscale image into a binary image after defining its threshold
Open a text file to write the text
Load the templates
Crop lines one by one from the image
Match each letter with the templates using correlation
Place the best match in the text file
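The first two steps of the algorithm (grayscale conversion and thresholding) can be sketched as follows. This is a minimal Python illustration, not the project's MATLAB code: the luma weights 0.299/0.587/0.114 are the standard ITU-R BT.601 values, while the threshold of 128 and the function names `rgb_to_gray` and `to_binary` are assumptions chosen for the example.

```python
def rgb_to_gray(rgb_image):
    """Convert rows of (R, G, B) tuples to rows of gray levels (0-255)."""
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
        for row in rgb_image
    ]


def to_binary(gray_image, threshold=128):
    """Pixels at or above the threshold become 1 (white), the rest 0 (black).

    The default threshold of 128 is an assumed value for illustration.
    """
    return [[1 if p >= threshold else 0 for p in row] for row in gray_image]


# Invented 2x2 test image: white, black, dark red, bright yellow.
rgb = [
    [(255, 255, 255), (0, 0, 0)],
    [(200, 30, 30), (250, 250, 10)],
]
gray = rgb_to_gray(rgb)
binary = to_binary(gray)
print(gray)
print(binary)
```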
How is it decided whether a row contains text or not?
• Black-and-white images are stored in MATLAB as a two-dimensional array
containing 1s and 0s, where 0 represents black and 1 represents white.
• The values in a single row are all added together. For this test to work, the
text pixels must be the non-zero ones, i.e. the binary image is taken to be
inverted so that the text is white (1) on a black (0) background.
• A row with all zeroes contains only blank space and no text; a row with
some non-zero elements contains some portion of the text.
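The row-sum test described above can be sketched as below. This is an illustrative Python version (the project itself uses MATLAB); the function name `find_text_lines` and the sample `page` are hypothetical, and the image is assumed to have text pixels equal to 1 on a background of 0.

```python
def find_text_lines(binary_image):
    """Return (start, end) row ranges, end exclusive, of consecutive
    rows whose sum is non-zero (i.e. rows that contain text)."""
    lines = []
    start = None
    for r, row in enumerate(binary_image):
        has_text = sum(row) != 0
        if has_text and start is None:
            start = r                      # first row of a new text line
        elif not has_text and start is not None:
            lines.append((start, r))       # blank row closes the line
            start = None
    if start is not None:                  # text runs to the last row
        lines.append((start, len(binary_image)))
    return lines


# Invented page: two text lines separated by a blank row.
page = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 0],
]
print(find_text_lines(page))
```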
Separation of characters from line
• The operations that were performed on the rows while extracting
lines are now performed on the columns of each line. The values in a single
column are all added together.
• If the sum is not equal to zero, there is at least one non-zero
value in the column. If the sum is equal to zero, the column contains
only zero elements.
• A column with all zeroes contains only blank space and no
text; a column with some non-zero elements contains some
portion of the text.
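The column-sum test can be sketched in the same way. Again this is only an illustrative Python version of the idea, with a hypothetical function name `split_characters` and an invented sample line; text pixels are assumed to be 1 and background pixels 0.

```python
def split_characters(line_image):
    """Split one cropped text line into character images, cutting
    wherever a column sums to zero (blank space between characters)."""
    n_cols = len(line_image[0])
    col_sums = [sum(row[c] for row in line_image) for c in range(n_cols)]
    chars = []
    start = None
    # The extra trailing 0 closes a character that touches the right edge.
    for c, total in enumerate(col_sums + [0]):
        if total != 0 and start is None:
            start = c
        elif total == 0 and start is not None:
            chars.append([row[start:c] for row in line_image])
            start = None
    return chars


# Invented line containing two characters separated by one blank column.
line = [
    [1, 1, 0, 0, 1],
    [1, 0, 0, 0, 1],
    [1, 1, 0, 1, 1],
]
for char in split_characters(line):
    print(char)
```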
Identifying individual characters
• Once the characters have been extracted from the image, they are passed to
the read-letter block for identification.
• A two-dimensional correlation function is then used to find the best match
between the extracted character and the characters stored in the templates
file. The template character that gives the highest correlation is
taken as the recognized character and is written into a text file.
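A minimal sketch of template matching by correlation is given below. It assumes the correlation measure is the two-dimensional correlation coefficient (the quantity that MATLAB's `corr2` computes) and that each extracted character has already been resized to the template size. The tiny 3x3 templates and the names `corr2` and `best_match` are made up for this Python illustration.

```python
import math


def corr2(a, b):
    """2-D correlation coefficient between two equally sized images."""
    flat_a = [v for row in a for v in row]
    flat_b = [v for row in b for v in row]
    mean_a = sum(flat_a) / len(flat_a)
    mean_b = sum(flat_b) / len(flat_b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(flat_a, flat_b))
    den = math.sqrt(
        sum((x - mean_a) ** 2 for x in flat_a)
        * sum((y - mean_b) ** 2 for y in flat_b)
    )
    return num / den if den else 0.0   # constant images: treat as no match


def best_match(char_image, templates):
    """Return the template label with the highest correlation."""
    return max(templates, key=lambda name: corr2(char_image, templates[name]))


# Hypothetical 3x3 templates (real templates would be much larger).
templates = {
    "I": [[0, 1, 0], [0, 1, 0], [0, 1, 0]],
    "L": [[1, 0, 0], [1, 0, 0], [1, 1, 1]],
}
extracted = [[1, 0, 0], [1, 0, 0], [1, 1, 0]]   # a slightly damaged "L"
print(best_match(extracted, templates))
```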
Step 3: (Text To Speech Conversion)
• This is done by programming in MATLAB.
• The MATLAB program outputs the pre-recorded voice sample that is
associated with each recognized character.
• Thus the conversion is done.
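The idea of playing one pre-recorded sample per recognized character can be sketched as below. This is a hypothetical Python illustration, not the project's MATLAB program: short sine tones stand in for the real voice recordings, and the sample rate, clip length and the names `tone`, `clips` and `text_to_wav_bytes` are all assumptions. The clips are joined into a single .wav stream using Python's standard `wave` module.

```python
import io
import math
import struct
import wave

RATE = 8000      # samples per second (assumed)
CLIP_MS = 100    # length of each stand-in clip in milliseconds (assumed)


def tone(freq_hz):
    """Stand-in for a pre-recorded clip: 16-bit mono sine-wave samples."""
    n = RATE * CLIP_MS // 1000
    return b"".join(
        struct.pack("<h", int(12000 * math.sin(2 * math.pi * freq_hz * i / RATE)))
        for i in range(n)
    )


# Hypothetical clip bank: one clip per letter, silence for a space.
clips = {ch: tone(300 + 20 * k) for k, ch in enumerate("ABCDEFGHIJKLMNOPQRSTUVWXYZ")}
clips[" "] = b"\x00\x00" * (RATE * CLIP_MS // 1000)


def text_to_wav_bytes(text):
    """Concatenate the clip for each known character into one WAV stream."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)      # mono
        wav.setsampwidth(2)      # 16-bit samples
        wav.setframerate(RATE)
        for ch in text.upper():
            if ch in clips:      # characters without a clip are skipped
                wav.writeframes(clips[ch])
    return buf.getvalue()


data = text_to_wav_bytes("Hi")
print(len(data), "bytes of WAV data")
```

Writing `data` to a file with a `.wav` extension would give a playable file. A real system would load recorded voice clips in place of the generated tones.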
APPLICATION:
• Blind people use Braille script to read or write text. This scheme uses a
pattern of raised dots or indentations to represent each letter. Reading is
done by feeling these marks with the fingers, which makes reading a very slow
and laborious task.
• Using this system, a blind person can learn the content of any letter.