Anúncio
Anúncio

### Text to speech conversation in gujarati

1. TEXT TO SPEECH CONVERSATION Prepared By : Nakum Ashvin P.
2. Introduction: • It converts scanned text into equivalent speech • For it simulation is done in MATLAB • The image is in .jpeg format. • Speech is in .wav format • The language used for it is English
3. Introduction to MATLAB: • MATLAB stands for Matrix Laboratory. • It is high performance language for technical computing. • It’s basic data element is a matrix. • Typical uses 1). Math and computation 2). Algorithm development 3). Modeling, Simulation 4). Data analysis, exploration, and visualization 5). Scientific and engineering graphics 6). Application development, including building graphical user interfaces.
4. What is Image Processing? • Image processing is the study of any algorithm that takes an image as input and returns an image as output. • An image may be defined as a two dimensional function, f(x , y). • x and y are spatial coordinates, and the amplitude of f at any pair of coordinates (x , y) is called intensity of image at that point. • When the value of x, y & f are all finite, discrete quantities it’s called digital image • Digital image is composed of a finite no. of elements, each of which has a particular location and value these elements are referred as picture element, image element and pixels.
5. Brief Description: The Project can be divided in three parts (1.) Scan the text which to be converted (2.) Optical character recognization (3.) Conversation of recognized word to speech Step 1: •The text which to be converted scanned by scanner in jpeg format • Then all process will done in MATLAB
6. Step 2: ( Optical Character Recognition) •Optical Character Recognition, usually abbreviated to OCR, is the electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text. •Widely used to convert books and documents into electronic files, to computerize a record-keeping system in an office, or to publish the text on a website. Block diagram Of OCR TEXT AREA DECTECTION TEXT AREA PROCESSING BINARY OUTPUT
7. Algorithm used in OCR Convert RGB image to Gray Scale Image and display it Convert image into binary image after defining its threshold Place the ideal match in the text file Open text file to write the text Load Templates Crop lines one by one from the image file Match each letter with the templates using correlation
8. Working First RGB image is converted in GRAYSCALE image RGB image GRAYSCALE image BINARY image
9. All these are stored in matrix form in MATLAB
10. How is it decided that weather it is character or not? •Black and white images are stored in MATLAB as a two dimensional array containing 1s and 0s where 0 represents black and 1 represents white. The values in a single row are all added together. •A row with all zeroes is a row with only blank space and without any text and a row with some non-zero elements is a row with some portion of the text.
11. How is it decided that weather it is character or not? •Black and white images are stored in MATLAB as a two dimensional array containing 1s and 0s where 0 represents black and 1 represents white. The values in a single row are all added together. •A row with all zeroes is a row with only blank space and without any text and a row with some non-zero elements is a row with some portion of the text. Separation of characters from line •All the operations which were being performed on the rows while extracting lines are now performed on the columns. The values in a single column are all added together. •If the sum is not equal to zero, then it means that there is at least one non- zero value in the column. However, if the sum is equal to zero, then it indicates that the column contains only zero elements. • A column with all zeroes is a column with only blank space and without any text and a column with some non-zero elements is a column with some portion of the text.
12. Identifying individual characters •Once the characters have been extracted from the image, they are passed to the read letter block for identification. •Two-dimensional correlation function is then used to find the best match between the extracted character and the characters stored in the templates file. The character from the templates file that gives the highest correlation is the extracted character and it is written into a text file.
13. Step 3: (Text To Speech Conversation) •This done by programming in MATLAB •In it the programming of MATLAB gives pre-recorded voice as output which is related with that particular character. • Thus conversion is done.
14. APPLICATION: •The blind use Braille script to read or write any text. This scheme uses a pattern of pockmarks or indentations to represent each alphabet. Reading is done by feeling these notches with the fingers. This makes reading a very slow and laborious task for them. •By using this a blind people know content of any letter
Anúncio