SlideShare uma empresa Scribd logo
1 de 16
Speech Recognition



          BY
          GIRISH JOSHI
Introduction
• What is Speech Recognition?
  also known as automatic speech recognition or
   computer speech recognition which means
   understanding voice by the computer and
   performing any required task.
• Where can it be used?

   - Dictation

   - System control/navigation

   - Commercial/Industrial applications

   - Voice dialing
Recognition

Voice Input     Analog to Digital      Acoustic Model



                                       Language Model




     Feedback      Display          Speech Engine
• Acoustic Model
   An acoustic model is created by taking audio recordings of
    speech, and their text transcriptions, and using software to
    create statistical representations of the sounds that make up
    each word. It is used by a speech recognition engine to
    recognize speech.

• Language Model
   Language modeling is used in many natural language
    processing applications such as speech recognition tries to
    capture the properties of a language, and to predict the next
    word in a speech sequence.
TYPES OF VOICE RECOGNITION

• There are two types of speech recognition. One is called
  speaker-dependent and the other isspeaker-independent
  . Speaker-dependent software is commonly used for
  dictation software, while speaker-independent software
  is more commonly found in telephone applications.
• Speaker-dependent software works by learning the
  unique characteristics of a single person’s voice, in a
  way similar to voice recognition. New users must first
  “train” the software by speaking to it, so the computer
  can analyze how the person talks. This often means
  users have to read a few pages of text to the computer
  before they can use the speech recognition software.
TYPES OF VOICE RECOGNITION
• Speaker-independent software is designed to recognize
  anyone’s voice, so no training is involved. This means it
  is the only real option for applications such as interactive
  voice response systems — where businesses can’t ask
  callers to read pages of text before using the system.
  The downside is that speaker-independent software is
  generally less accurate than speaker-dependent
  software.
• Speech recognition engines that are speaker
  independent generally deal with this fact by limiting the
  grammars they use. By using a smaller list of recognized
  words, the speech engine is more likely to correctly
  recognize what a speaker said.
How might computers do it?


Acoustic waveform   Acoustic signal




  • Digitization
  • Acoustic analysis of the          Speech recognition
    speech signal
  • Linguistic interpretation
DIFFERENT PROCESSES
               INVOLVED
• Digitization
    – Converting analogue signal into digital representation
• Signal processing
    – Separating speech from background noise
• Phonetics
    – Variability in human speech
• Phonology
    – Recognizing individual sound distinctions (similar phonemes)
    –    is the systematic use of sound to encode meaning in any
        spoken human language
• Lexicology and syntax
    – Lexicology is that part of linguistics which studies words, their nature and
      meaning, words' elements, relations between words,words groups and the whole
        lexicon.   .
DIFFERENT PROCESSES
         INVOLVED(CONTD.)
• Syntax and pragmatics
   – Semantics tells about the meaning
   – Pragmatics is concerned with bridging the explanatory gap
     between sentence meaning and speaker's meaning
Digitization
• Analogue to digital conversion
• Sampling and quantizing
   Sampling is converting a continuous signal into a discrete signal

   Quantizing is the process of approximating a continuous range of values

• Use filters to measure energy levels for various
  points on the frequency spectrum
• Knowing the relative importance of different
  frequency bands (for speech) makes this
  process more efficient
• E.g. high frequency sounds are less informative,
  so can be sampled using a broader bandwidth
  (log scale)
Separating speech from
        background noise
• Noise cancelling microphones
  – Two mics, one facing speaker, the other facing away
  – Ambient noise is roughly same for both mics
• Knowing which bits of the signal relate to speech
Speaker Recognition
• Definition
  – It is the method of recognizing a person based on
    his voice
  – It is one of the forms of biometric identification
• Depends of speaker specific
  characteristics.
ADVANTAGES
• Advantages
  –   People with disabilities
  –   Organizations - Increases productivity, reduces costs and errors.
  –   Lower operational Costs
  –   Advances in technology will allow consumers and businesses to
      implement speech recognition systems at a relatively low cost.
       • Cell-phone users can dial pre-programmed numbers by voice
         command.
       • Users can trade stocks through a voice-activated trading system.
       • Speech recognition technology can also replace touch-tone dialing
         resulting in the ability to target customers that speak different
         languages
DISADVANTAGES
• Difficult to build a perfect system.
• Conversations
  – Involves more than just words (non-verbal
    communication; stutters etc.
  – Every human being has differences such as
    their voice, mouth, and speaking style.
• Filtering background noise is a task that
  can even be difficult for humans to
  accomplish.
Future of Speech Recognition
• Accuracy will become better and better.
• Dictation speech recognition will gradually become
  accepted.
• Small hand-held writing tablets for computer speech
  recognition dictation and data entry will be developed, as
  faster processors and more memory become available.
• Greater use will be made of "intelligent systems" which
  will attempt to guess what the speaker intended to say,
  rather than what was actually said, as people often
  misspeak and make unintentional mistakes.
• Microphone and sound systems will be designed to
  adapt more quickly to changing background noise levels,
  different environments, with better recognition of
  extraneous material to be discarded.

Mais conteúdo relacionado

Mais procurados

Ece speech-recognition-report
Ece speech-recognition-reportEce speech-recognition-report
Ece speech-recognition-reportAnakali Mahesh
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech RecognitionAhmed Moawad
 
Group 2 -innovation in smartphones-
Group 2 -innovation in smartphones-Group 2 -innovation in smartphones-
Group 2 -innovation in smartphones-Fuyi Pan
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognitionfathitarek
 
Noise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech RecognitionNoise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech Recognitionأحلام انصارى
 
Speech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSpeech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSubmissionResearchpa
 
Voice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesVoice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesHarshad Karmarkar
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech RecognitionYogesh Vijay
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversionankit_saluja
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminarDiptimaya Sarangi
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by IqbalIqbal
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technologySrijanKumar18
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologySrijanKumar18
 
Abstract of speech recognition
Abstract of speech recognitionAbstract of speech recognition
Abstract of speech recognitionVinay Jaisriram
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech RecognitionHugo Moreno
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologyAamir-sheriff
 
Speech synthesis technology
Speech synthesis technologySpeech synthesis technology
Speech synthesis technologyKalluri Madhuri
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speechBilgin Aksoy
 

Mais procurados (20)

Ece speech-recognition-report
Ece speech-recognition-reportEce speech-recognition-report
Ece speech-recognition-report
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Group 2 -innovation in smartphones-
Group 2 -innovation in smartphones-Group 2 -innovation in smartphones-
Group 2 -innovation in smartphones-
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Noise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech RecognitionNoise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech Recognition
 
Speech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSpeech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speech
 
Voice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesVoice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devices
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
Voice recognition
Voice recognitionVoice recognition
Voice recognition
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by Iqbal
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Abstract of speech recognition
Abstract of speech recognitionAbstract of speech recognition
Abstract of speech recognition
 
Automatic Speech Recognion
Automatic Speech RecognionAutomatic Speech Recognion
Automatic Speech Recognion
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Speech synthesis technology
Speech synthesis technologySpeech synthesis technology
Speech synthesis technology
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speech
 

Destaque

Policy communities presentation
Policy communities presentationPolicy communities presentation
Policy communities presentationRahmanTalha
 
Speech coding techniques
Speech coding techniquesSpeech coding techniques
Speech coding techniqueskailash karki
 
Speech recognition an overview
Speech recognition   an overviewSpeech recognition   an overview
Speech recognition an overviewVarun Jain
 
Basics of speech coding
Basics of speech codingBasics of speech coding
Basics of speech codingsakshij91
 
Li-Fi Technology PPT
Li-Fi Technology PPT Li-Fi Technology PPT
Li-Fi Technology PPT Seminar Links
 
Final Project Report on Image processing based intelligent traffic control sy...
Final Project Report on Image processing based intelligent traffic control sy...Final Project Report on Image processing based intelligent traffic control sy...
Final Project Report on Image processing based intelligent traffic control sy...Louise Antonio
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologySeminar Links
 
Introduction to Digital Image Processing Using MATLAB
Introduction to Digital Image Processing Using MATLABIntroduction to Digital Image Processing Using MATLAB
Introduction to Digital Image Processing Using MATLABRay Phan
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By MatlabAnkit Gujrati
 

Destaque (12)

Rohatash Resume1
Rohatash Resume1Rohatash Resume1
Rohatash Resume1
 
Policy communities presentation
Policy communities presentationPolicy communities presentation
Policy communities presentation
 
Speech coding techniques
Speech coding techniquesSpeech coding techniques
Speech coding techniques
 
Meteoroloji 2014
Meteoroloji 2014Meteoroloji 2014
Meteoroloji 2014
 
Speech recognition an overview
Speech recognition   an overviewSpeech recognition   an overview
Speech recognition an overview
 
Basics of speech coding
Basics of speech codingBasics of speech coding
Basics of speech coding
 
Li-Fi Technology PPT
Li-Fi Technology PPT Li-Fi Technology PPT
Li-Fi Technology PPT
 
Final Project Report on Image processing based intelligent traffic control sy...
Final Project Report on Image processing based intelligent traffic control sy...Final Project Report on Image processing based intelligent traffic control sy...
Final Project Report on Image processing based intelligent traffic control sy...
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Bio-metrics Technology
Bio-metrics TechnologyBio-metrics Technology
Bio-metrics Technology
 
Introduction to Digital Image Processing Using MATLAB
Introduction to Digital Image Processing Using MATLABIntroduction to Digital Image Processing Using MATLAB
Introduction to Digital Image Processing Using MATLAB
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By Matlab
 

Semelhante a Speechrecognition 100423091251-phpapp01

Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceIlhaan Marwat
 
Speech recognizers & generators
Speech recognizers & generatorsSpeech recognizers & generators
Speech recognizers & generatorsPaul Kahoro
 
Digital speech processing lecture1
Digital speech processing lecture1Digital speech processing lecture1
Digital speech processing lecture1Samiul Parag
 
Voice recognition system
Voice recognition systemVoice recognition system
Voice recognition systemavinash raibole
 
Artificial Intelligence- An Introduction
Artificial Intelligence- An IntroductionArtificial Intelligence- An Introduction
Artificial Intelligence- An Introductionacemindia
 
Artificial Intelligence - An Introduction
Artificial Intelligence - An Introduction Artificial Intelligence - An Introduction
Artificial Intelligence - An Introduction acemindia
 
Artificial intelligence - research areas
Artificial intelligence - research areasArtificial intelligence - research areas
Artificial intelligence - research areasLearnbay Datascience
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversionankit_saluja
 
How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization worksMuhammad Taqi
 
Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition Goa App
 
Sequence to sequence model speech recognition
Sequence to sequence model speech recognitionSequence to sequence model speech recognition
Sequence to sequence model speech recognitionAditya Kumar Khare
 
AI for voice recognition.pptx
AI for voice recognition.pptxAI for voice recognition.pptx
AI for voice recognition.pptxJhalakDashora
 

Semelhante a Speechrecognition 100423091251-phpapp01 (20)

Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
 
Speech recognizers & generators
Speech recognizers & generatorsSpeech recognizers & generators
Speech recognizers & generators
 
Digital speech processing lecture1
Digital speech processing lecture1Digital speech processing lecture1
Digital speech processing lecture1
 
Voice recognition system
Voice recognition systemVoice recognition system
Voice recognition system
 
Assign
AssignAssign
Assign
 
Artificial Intelligence- An Introduction
Artificial Intelligence- An IntroductionArtificial Intelligence- An Introduction
Artificial Intelligence- An Introduction
 
Artificial Intelligence - An Introduction
Artificial Intelligence - An Introduction Artificial Intelligence - An Introduction
Artificial Intelligence - An Introduction
 
Artificial intelligence - research areas
Artificial intelligence - research areasArtificial intelligence - research areas
Artificial intelligence - research areas
 
Seminar
SeminarSeminar
Seminar
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
30
3030
30
 
Amadou
AmadouAmadou
Amadou
 
How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization works
 
Dy36749754
Dy36749754Dy36749754
Dy36749754
 
Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition
 
Sequence to sequence model speech recognition
Sequence to sequence model speech recognitionSequence to sequence model speech recognition
Sequence to sequence model speech recognition
 
Web AI.pptx
Web AI.pptxWeb AI.pptx
Web AI.pptx
 
AI for voice recognition.pptx
AI for voice recognition.pptxAI for voice recognition.pptx
AI for voice recognition.pptx
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
FINAL report
FINAL reportFINAL report
FINAL report
 

Último

Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxPoojaSen20
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxMaryGraceBautista27
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)cama23
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Último (20)

YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptx
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 

Speechrecognition 100423091251-phpapp01

  • 1. Speech Recognition BY GIRISH JOSHI
  • 2. Introduction • What is Speech Recognition? also known as automatic speech recognition or computer speech recognition which means understanding voice by the computer and performing any required task.
  • 3. • Where can it be used? - Dictation - System control/navigation - Commercial/Industrial applications - Voice dialing
  • 4. Recognition Voice Input Analog to Digital Acoustic Model Language Model Feedback Display Speech Engine
  • 5. • Acoustic Model  An acoustic model is created by taking audio recordings of speech, and their text transcriptions, and using software to create statistical representations of the sounds that make up each word. It is used by a speech recognition engine to recognize speech. • Language Model  Language modeling is used in many natural language processing applications such as speech recognition tries to capture the properties of a language, and to predict the next word in a speech sequence.
  • 6. TYPES OF VOICE RECOGNITION • There are two types of speech recognition. One is called speaker-dependent and the other isspeaker-independent . Speaker-dependent software is commonly used for dictation software, while speaker-independent software is more commonly found in telephone applications. • Speaker-dependent software works by learning the unique characteristics of a single person’s voice, in a way similar to voice recognition. New users must first “train” the software by speaking to it, so the computer can analyze how the person talks. This often means users have to read a few pages of text to the computer before they can use the speech recognition software.
  • 7. TYPES OF VOICE RECOGNITION • Speaker-independent software is designed to recognize anyone’s voice, so no training is involved. This means it is the only real option for applications such as interactive voice response systems — where businesses can’t ask callers to read pages of text before using the system. The downside is that speaker-independent software is generally less accurate than speaker-dependent software. • Speech recognition engines that are speaker independent generally deal with this fact by limiting the grammars they use. By using a smaller list of recognized words, the speech engine is more likely to correctly recognize what a speaker said.
  • 8. How might computers do it? Acoustic waveform Acoustic signal • Digitization • Acoustic analysis of the Speech recognition speech signal • Linguistic interpretation
  • 9. DIFFERENT PROCESSES INVOLVED • Digitization – Converting analogue signal into digital representation • Signal processing – Separating speech from background noise • Phonetics – Variability in human speech • Phonology – Recognizing individual sound distinctions (similar phonemes) – is the systematic use of sound to encode meaning in any spoken human language • Lexicology and syntax – Lexicology is that part of linguistics which studies words, their nature and meaning, words' elements, relations between words,words groups and the whole lexicon. .
  • 10. DIFFERENT PROCESSES INVOLVED(CONTD.) • Syntax and pragmatics – Semantics tells about the meaning – Pragmatics is concerned with bridging the explanatory gap between sentence meaning and speaker's meaning
  • 11. Digitization • Analogue to digital conversion • Sampling and quantizing  Sampling is converting a continuous signal into a discrete signal  Quantizing is the process of approximating a continuous range of values • Use filters to measure energy levels for various points on the frequency spectrum • Knowing the relative importance of different frequency bands (for speech) makes this process more efficient • E.g. high frequency sounds are less informative, so can be sampled using a broader bandwidth (log scale)
  • 12. Separating speech from background noise • Noise cancelling microphones – Two mics, one facing speaker, the other facing away – Ambient noise is roughly same for both mics • Knowing which bits of the signal relate to speech
  • 13. Speaker Recognition • Definition – It is the method of recognizing a person based on his voice – It is one of the forms of biometric identification • Depends of speaker specific characteristics.
  • 14. ADVANTAGES • Advantages – People with disabilities – Organizations - Increases productivity, reduces costs and errors. – Lower operational Costs – Advances in technology will allow consumers and businesses to implement speech recognition systems at a relatively low cost. • Cell-phone users can dial pre-programmed numbers by voice command. • Users can trade stocks through a voice-activated trading system. • Speech recognition technology can also replace touch-tone dialing resulting in the ability to target customers that speak different languages
  • 15. DISADVANTAGES • Difficult to build a perfect system. • Conversations – Involves more than just words (non-verbal communication; stutters etc. – Every human being has differences such as their voice, mouth, and speaking style. • Filtering background noise is a task that can even be difficult for humans to accomplish.
  • 16. Future of Speech Recognition • Accuracy will become better and better. • Dictation speech recognition will gradually become accepted. • Small hand-held writing tablets for computer speech recognition dictation and data entry will be developed, as faster processors and more memory become available. • Greater use will be made of "intelligent systems" which will attempt to guess what the speaker intended to say, rather than what was actually said, as people often misspeak and make unintentional mistakes. • Microphone and sound systems will be designed to adapt more quickly to changing background noise levels, different environments, with better recognition of extraneous material to be discarded.