Speech Recognition




                     1
Introduction
•   What is Speech Recognition?
    - Voice Recognition?
•   Where can it be used?
    - Dictation
    - System control/navigation
    - Commercial/Industrial applications
    - Handheld digital recorders
                                   2
Contents:
•   Continuous/Discrete
•   How does it work?
•   Recent improvements
•   Current software options
•   Future of SR



                          3
Continuous or Discrete?
    • Continuous speech
       - dictation
    • Discrete speech
       - system controls




                           4
How does SR work?
  •   Recognition
  •   Training
  •   Correction
  •   Command/Control




                        5
Recognition (1)
[Diagram: Voice Input, Analog to Digital, Acoustic Model, Language Model, Speech Engine, Display, Feedback]
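The "Analog to Digital" stage can be illustrated with a minimal sketch. It assumes a pure sine tone as a stand-in for the microphone signal, plus a 16 kHz sample rate and 16-bit samples; none of these values come from the slides, and the function name is hypothetical.

```python
import math

def sample_and_quantize(freq_hz, sample_rate_hz, num_samples, bits=16):
    """Sample a sine tone and quantize each sample to a signed integer."""
    max_level = 2 ** (bits - 1) - 1              # 32767 for 16-bit audio
    samples = []
    for n in range(num_samples):
        t = n / sample_rate_hz                   # time of the n-th sample in seconds
        amplitude = math.sin(2 * math.pi * freq_hz * t)   # analog value in [-1.0, 1.0]
        samples.append(round(amplitude * max_level))      # nearest integer level
    return samples

# Assumed values: a 440 Hz tone sampled at 16 kHz for 160 samples (10 ms).
pcm = sample_and_quantize(freq_hz=440, sample_rate_hz=16000, num_samples=160)
print(len(pcm), min(pcm), max(pcm))
```

The resulting list of integers is the kind of digital signal that later stages would turn into acoustic features.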



                                           6
Recognition (2)
Acoustic Modeling
• Spoken words: “I think there are…”
• Phonemes: ‘ay th-in-nk-kd dh-eh-r aa-r’
• HMMs (hidden Markov models): 5-state representation
• Speech Engine
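One common way to decode a hidden Markov model is the Viterbi algorithm, sketched below for a 5-state left-to-right model. The slides do not say which decoder the speech engine uses, and every probability here is an invented toy value: each state stays or advances with probability 0.5, and state s emits symbol s with probability 0.8.

```python
import math

def viterbi(observations, start_p, trans_p, emit_p):
    """Return the most likely state sequence for a discrete-observation HMM."""
    n_states = len(start_p)

    def safe_log(p):
        return math.log(p) if p > 0 else float("-inf")

    # best[t][s]: log-probability of the best path ending in state s at time t
    best = [[safe_log(start_p[s]) + safe_log(emit_p[s][observations[0]])
             for s in range(n_states)]]
    back = []                                    # back[t-1][s]: best predecessor of s at time t
    for obs in observations[1:]:
        prev = best[-1]
        scores, pointers = [], []
        for s in range(n_states):
            cand = [prev[r] + safe_log(trans_p[r][s]) for r in range(n_states)]
            r_best = max(range(n_states), key=lambda r: cand[r])
            scores.append(cand[r_best] + safe_log(emit_p[s][obs]))
            pointers.append(r_best)
        best.append(scores)
        back.append(pointers)

    # Trace back from the best final state.
    state = max(range(n_states), key=lambda s: best[-1][s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path

# Toy 5-state left-to-right model (all numbers are illustrative assumptions).
N = 5
start_p = [1.0, 0.0, 0.0, 0.0, 0.0]
trans_p = [[0.0] * N for _ in range(N)]
for s in range(N - 1):
    trans_p[s][s] = 0.5          # stay in the same state
    trans_p[s][s + 1] = 0.5      # advance to the next state
trans_p[N - 1][N - 1] = 1.0      # final state loops on itself
emit_p = [[0.8 if sym == s else 0.05 for sym in range(N)] for s in range(N)]

frames = [0, 0, 1, 2, 2, 3, 4]   # quantized acoustic frames (made up)
path = viterbi(frames, start_p, trans_p, emit_p)
print(path)                      # [0, 0, 1, 2, 2, 3, 4]
```

Working in log-probabilities avoids numeric underflow when the frame sequence gets long.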


                               7
Recognition (3)
Language Modeling
• Word context
• Word frequency
• Transition possibilities
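Word frequency and transition possibilities can be sketched as a bigram model: count how often each word occurs and how often one word follows another. The three-sentence corpus and the function names below are made up for illustration; a real language model is trained on far more text.

```python
from collections import Counter, defaultdict

def train_bigram_model(sentences):
    """Count single words (frequency) and adjacent word pairs (transitions)."""
    unigram = Counter()
    bigram = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        unigram.update(words)
        for prev, nxt in zip(words, words[1:]):
            bigram[prev][nxt] += 1
    return unigram, bigram

def transition_probability(bigram, prev, nxt):
    """Estimate P(nxt | prev) from the pair counts; 0.0 if prev was never seen."""
    total = sum(bigram[prev].values())
    return bigram[prev][nxt] / total if total else 0.0

corpus = [
    "i think there are two",
    "i think there is one",
    "they lost their keys",
]
unigram, bigram = train_bigram_model(corpus)

# "there" and "their" sound alike; word context helps pick one after "think".
print(transition_probability(bigram, "think", "there"))   # 1.0
print(transition_probability(bigram, "think", "their"))   # 0.0
```

This is how context can separate words that the acoustic model alone cannot tell apart.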




                         8
Voice Training (1)
Can be done by:
• Predetermined text segments
• Individual words
The system compares the new acoustic data with the old and combines the two
• More training = better recognition
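The slides do not say how new and old acoustic data are combined. One simple possibility, shown purely as an assumption, is a running mean over feature vectors, so that each new training sample shifts the stored profile a little. The function name and the two-element feature vectors are hypothetical.

```python
def adapt_profile(old_mean, old_count, new_features):
    """Fold one new feature vector into a running mean of all samples so far."""
    new_count = old_count + 1
    new_mean = [m + (x - m) / new_count for m, x in zip(old_mean, new_features)]
    return new_mean, new_count

# Stored profile built from one sample, then updated with a second sample.
mean, count = adapt_profile([1.0, 2.0], 1, [3.0, 4.0])
print(mean, count)   # [2.0, 3.0] 2
```

With this scheme each additional sample has a smaller effect, so the profile stabilizes as training continues.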



                                9
Voice Training (2)
User-specific voice file
• Voice qualities
• Pronunciation
• Patterns of word use
• Preferred vocabulary



                           10
Making Corrections
•   Move cursor by voice command
•   Memorize edit commands
•   List of possible alternatives
•   Make correction manually




                            11
Command/Control
•   Desktop grid
•   Program or Link name/number
•   URL name
•   Memorized commands
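A desktop grid can be sketched as a numbered overlay: the user says a cell number and the pointer moves to that cell. The 3 x 3 layout, the row-by-row numbering from 1, and the function name are assumptions made for this sketch, not details given in the slides.

```python
def grid_cell_center(cell, left, top, width, height, rows=3, cols=3):
    """Return the (x, y) center of a numbered grid cell inside a screen region.

    Cells are numbered 1..rows*cols, left to right, top to bottom.
    """
    if not 1 <= cell <= rows * cols:
        raise ValueError(f"cell must be between 1 and {rows * cols}")
    row, col = divmod(cell - 1, cols)
    cell_w, cell_h = width / cols, height / rows
    return (left + (col + 0.5) * cell_w, top + (row + 0.5) * cell_h)

# Saying "five" on an assumed 1920 x 1080 desktop targets the middle of the screen.
print(grid_cell_center(5, 0, 0, 1920, 1080))   # (960.0, 540.0)
```

Calling the function again with the chosen cell as the new region would narrow the target step by step.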




                          12
Recent Improvements in SR
  •   Faster training ~10 min.
  •   Better recognition ~95%
  •   More compatible software
  •   Better system control/command
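The slides do not define how the ~95% recognition figure is measured. A common measure, assumed here, is word accuracy: one minus the word-level edit distance between what was said and what was recognized, divided by the number of words said. The sentences below are invented.

```python
def word_errors(reference, hypothesis):
    """Minimum number of word substitutions, insertions and deletions."""
    ref, hyp = reference.split(), hypothesis.split()
    prev = list(range(len(hyp) + 1))             # distances from an empty reference
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]

def word_accuracy(reference, hypothesis):
    """Word accuracy = 1 - errors / number of reference words."""
    return 1.0 - word_errors(reference, hypothesis) / len(reference.split())

said = ("the quick brown fox jumps over the lazy dog and then runs "
        "back home before the sun sets again tonight")
heard = said.replace("sun", "son")               # one misrecognized word out of 20

print(word_errors(said, heard))                  # 1
print(round(word_accuracy(said, heard), 2))      # 0.95
```

At this rate, a dictated page of 300 words would still contain about 15 errors to correct.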




                              13
Current Software Options for PC
•   Dragon Systems – NaturallySpeaking
•   Philips – FreeSpeech
•   IBM – ViaVoice
•   Lernout & Hauspie – Voice Xpress




                                  14
How well do they work?
           Training    Dictation    App.           Command/
                       Correct.     Integration    Control
Dragon     Excellent   Excellent    Good           Good
Philips    Fair        Fair         Good           Good
IBM        Excellent   Good         Good           Excellent
L&H        Good        Good         Good           Good

                                       15
Future of SR
• SUI – Speech-based User Interface
• Improvements needed:
  - Greater accuracy
  - Greater system control/command
  - More compatible software



                                 16
Conclusion
•   SR Uses
•   How does it work?
•   Current Software
•   Problems of SR
•   More SR coming soon…



                        17
References
• 1. Alwang, Greg. “Speech Recognition,” PC Magazine, December 1, 1999.
• 2. Hauptmann, Alexander G., and Photina Jaeyun Jang. Carnegie Mellon
  University. “Learning to Recognize Speech by Watching Television,”
  IEEE Intelligent Systems, September/October 1999.
• 3. Miastkowski, Stan. “Latest Speech Software Gets You Up and
  Running Faster,” PC World, November 1999.




                                                     18
