SlideShare uma empresa Scribd logo
1 de 19
Voice Browser
By
Praveen Kumar Mutcharla
&
Ramanuja SVL
Contents in Presentation
 Introducing Voice Browser
 Differences between Graphical and Voice Browser
 Why a voice browser?
 W3C Standards
 W3C Speech Interface Framework
 Speech Recognition Grammar Specification
 Semantic Interpretation
 Pronunciation Lexicon Analysis
 VoiceXML
 Functionality
 Applications
 Conclusion
Introducing Voice Browser
 As the name suggests Voice Browser uses
Voice to navigate a web application.
 This works similar to graphical web
browser, as a result any telephone can be
used to access appropriately designed
Web-based services.
 In essence, a voice browser is a software
application that presents an interactive
voice user interface
Differences Between
Graphical and Voice Browser
 Graphical
browsing is
passive.
 Keyboard
commands are
required for
Graphical
browser.
 Voice browsing
is an active
process.
 Voice is medium
to communicate
with an
application.
WHY A VOICE BROWSER?
 Use of the hands during browsing might
prove inconvenient or impossible. Voice
input is a natural solution for such ands-
busy situations.
 Even in standard browser applications,
using voice input is simply more fun than
the alternatives.
 Browser replaces the mouse in most
instances to enable hands-free browsing.
Use Cases:
 Easy to use - for people with no knowledge or fear
of computers.
 Many companies offer services over the phone via
IVR . With the advent of Voice Browsers ,they will
become next generation Voice Web portals to the
company's services and related websites, whether
accessed via the telephone network or via the
Internet.
W3C STANDARDS
 The World Wide Web Consortium (W3C)
develops interoperable technologies
(specifications, guidelines, software, and
tools) to lead the Web to its full potential
as a forum for information, commerce,
communication, and collective
understanding.
W3C Speech Interface
Framework
Speech
Recognition
Grammar
Specification
(SRGS)
Semantic
Interpretation for
speech
recognition
(SISR)
Pronunciation
Lexicon
Specification(PLS)
VoiceXML
Speech Recognition Grammar
Specification (SRGS)
 A document language that can be used by
developers to specify the words and
patterns of words to be listened for by a
speech recognizer or other grammar
processor.
SEMANTIC INTERPRETATION
FOR SPEECH RECOGNITION
 The recognition process matches an
utterance to a speech grammar, building a
parse tree as a byproduct.
 There are two approaches to harvesting
semantic results from the parse tree:
 Annotating grammar rules with semantic
interpretation tags.
 Representing the result in XML.
PRONUNCIATION LEXICON
SPECIFICATION
 A representation of phonetic information
for use in speech recognition and
synthesis.
 Application developers sometimes need to
ability to tune speech engines, whether
for synthesis or recognition.
VOICEXML
 VoiceXML is a dialog markup language
designed for telephony applications, where
users are restricted to voice and DTMF
(touch tone) input.
 There are other languages: VoXML.
A Sample VXML Code
<vxml version=“2.0”
xmlns=http://www.w3.org/2001/vxml>
<form>
<block>
<prompt>
Hello world!
</prompt>
</block>
</form>
</vxml>
FUNCTIONALITY
 Communication from the user to the system is made by
issuing voice commands.
 A grammar set is defined to recognise these commands.
 There are few rules that are used to navigate the
webpage.
 Below are a few administrative controls:
Speak
Where am I?
What is my home page?
APPLICATIONS
 Google’s-Google now can be considered as
one of the best delivered application as
of yet.
 Though it is not as functional as a
graphical browser, it is somewhat similar.
 Some other applications are:
 IOS’s: Siri
 Windows: Cortana
Processing of Google Now
CONCLUSION
 In order to make technology more familiar to the
user its access should be made much easier.
 It is know that visual internet access experiences
various limitations such as people who are
physically handicapped (especially blind users)
cannot use keypads or touch screens for giving
instructions.
 Above all these limitations today’s generation
demands to use internet independent of PC’s and
also hands free access to it.
 For this VOICE BROWSING is an intelligent idea.
Voice Browser
Voice Browser

Mais conteúdo relacionado

Mais procurados

bluejacking.ppt
bluejacking.pptbluejacking.ppt
bluejacking.ppt
Aeman Khan
 
Smart note taker
Smart note takerSmart note taker
Smart note taker
crisane93
 
Blue Eyes Technology Abstract
Blue Eyes Technology AbstractBlue Eyes Technology Abstract
Blue Eyes Technology Abstract
Colloquium
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
Hugo Moreno
 

Mais procurados (20)

Voicexml ppt
Voicexml pptVoicexml ppt
Voicexml ppt
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
 
Speech recognition an overview
Speech recognition   an overviewSpeech recognition   an overview
Speech recognition an overview
 
SMART NOTE TAKER REPORT
SMART NOTE TAKER REPORTSMART NOTE TAKER REPORT
SMART NOTE TAKER REPORT
 
VoiceXML
VoiceXMLVoiceXML
VoiceXML
 
bluejacking.ppt
bluejacking.pptbluejacking.ppt
bluejacking.ppt
 
Information technology seminar topics
Information technology  seminar topicsInformation technology  seminar topics
Information technology seminar topics
 
Smart note taker
Smart note takerSmart note taker
Smart note taker
 
Smart note taker
Smart note takerSmart note taker
Smart note taker
 
Blue Eyes Technology Abstract
Blue Eyes Technology AbstractBlue Eyes Technology Abstract
Blue Eyes Technology Abstract
 
Z force touch screen technology
Z force touch screen technologyZ force touch screen technology
Z force touch screen technology
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Fog Screen technology
Fog Screen technologyFog Screen technology
Fog Screen technology
 
Apple talk ppt
Apple talk pptApple talk ppt
Apple talk ppt
 
Progressive Web Apps are here!
Progressive Web Apps are here!Progressive Web Apps are here!
Progressive Web Apps are here!
 
Smart note taker ppt
Smart note taker pptSmart note taker ppt
Smart note taker ppt
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Ppt presentation
Ppt presentationPpt presentation
Ppt presentation
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 

Destaque

Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
ankit_saluja
 
Speech recognition project report
Speech recognition project reportSpeech recognition project report
Speech recognition project report
Sarang Afle
 
VOICE BASED SECURITY SYSTEM
VOICE BASED SECURITY SYSTEMVOICE BASED SECURITY SYSTEM
VOICE BASED SECURITY SYSTEM
Nikhil Ravi
 
Face recognition ppt
Face recognition pptFace recognition ppt
Face recognition ppt
Santosh Kumar
 
Best topics for seminar
Best topics for seminarBest topics for seminar
Best topics for seminar
shilpi nagpal
 
Speech recognition final
Speech recognition finalSpeech recognition final
Speech recognition final
Arpit Kumar
 
Frosty The Snowman
Frosty The SnowmanFrosty The Snowman
Frosty The Snowman
pps 33
 
Cd 315 Adaptive Tech revised
Cd 315 Adaptive Tech revisedCd 315 Adaptive Tech revised
Cd 315 Adaptive Tech revised
fry43
 

Destaque (19)

Voice based email for blinds
Voice based email for blindsVoice based email for blinds
Voice based email for blinds
 
voice browser
voice browservoice browser
voice browser
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
Speech recognition project report
Speech recognition project reportSpeech recognition project report
Speech recognition project report
 
Hak voice-browser
Hak voice-browserHak voice-browser
Hak voice-browser
 
VOICE BASED SECURITY SYSTEM
VOICE BASED SECURITY SYSTEMVOICE BASED SECURITY SYSTEM
VOICE BASED SECURITY SYSTEM
 
Face recognition ppt
Face recognition pptFace recognition ppt
Face recognition ppt
 
Best topics for seminar
Best topics for seminarBest topics for seminar
Best topics for seminar
 
Web Search - Lecture 10 - Web Information Systems (4011474FNR)
Web Search - Lecture 10 - Web Information Systems (4011474FNR)Web Search - Lecture 10 - Web Information Systems (4011474FNR)
Web Search - Lecture 10 - Web Information Systems (4011474FNR)
 
Toward a New Algorithm for Hands Free Browsing
Toward a New Algorithm for Hands Free BrowsingToward a New Algorithm for Hands Free Browsing
Toward a New Algorithm for Hands Free Browsing
 
Voice browser1
Voice browser1Voice browser1
Voice browser1
 
Speech recognition final
Speech recognition finalSpeech recognition final
Speech recognition final
 
Silver Light By Nyros Developer
Silver Light By Nyros DeveloperSilver Light By Nyros Developer
Silver Light By Nyros Developer
 
Frosty The Snowman
Frosty The SnowmanFrosty The Snowman
Frosty The Snowman
 
Cd 315 Adaptive Tech revised
Cd 315 Adaptive Tech revisedCd 315 Adaptive Tech revised
Cd 315 Adaptive Tech revised
 
Meteor South Bay Meetup - Kubernetes & Google Container Engine
Meteor South Bay Meetup - Kubernetes & Google Container EngineMeteor South Bay Meetup - Kubernetes & Google Container Engine
Meteor South Bay Meetup - Kubernetes & Google Container Engine
 
Rain technology
Rain technologyRain technology
Rain technology
 
Java ring
Java ringJava ring
Java ring
 
Bittorrent
BittorrentBittorrent
Bittorrent
 

Semelhante a Voice Browser

Voicexml543
Voicexml543Voicexml543
Voicexml543
pavisony
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
ankit_saluja
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
Thejus Joby
 
Abstract of speech recognition
Abstract of speech recognitionAbstract of speech recognition
Abstract of speech recognition
Vinay Jaisriram
 
Ken Rehor's presentation at eComm 2008
Ken Rehor's presentation at eComm 2008Ken Rehor's presentation at eComm 2008
Ken Rehor's presentation at eComm 2008
eComm2008
 

Semelhante a Voice Browser (20)

final doc
final docfinal doc
final doc
 
Voicexml543
Voicexml543Voicexml543
Voicexml543
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
 
Voice based web browser
Voice based web browserVoice based web browser
Voice based web browser
 
Google Voice-to-text
Google Voice-to-textGoogle Voice-to-text
Google Voice-to-text
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
SMOWSER (A VOICE BASED BROWSER)
SMOWSER (A VOICE BASED BROWSER)SMOWSER (A VOICE BASED BROWSER)
SMOWSER (A VOICE BASED BROWSER)
 
Instant speech translation 10BM60080 - VGSOM
Instant speech translation   10BM60080 - VGSOMInstant speech translation   10BM60080 - VGSOM
Instant speech translation 10BM60080 - VGSOM
 
Accessing Scholarly Content through FOSS based Assistive Technology
Accessing Scholarly Content through FOSS based Assistive TechnologyAccessing Scholarly Content through FOSS based Assistive Technology
Accessing Scholarly Content through FOSS based Assistive Technology
 
02 state of the art speech technology using java speech api@egsp 25.08.2011
02 state of the art speech technology using java speech api@egsp 25.08.201102 state of the art speech technology using java speech api@egsp 25.08.2011
02 state of the art speech technology using java speech api@egsp 25.08.2011
 
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
 
BTP paper
BTP paperBTP paper
BTP paper
 
10.1.1.510.6198
10.1.1.510.619810.1.1.510.6198
10.1.1.510.6198
 
visH (fin).pptx
visH (fin).pptxvisH (fin).pptx
visH (fin).pptx
 
Seminar
SeminarSeminar
Seminar
 
Abstract of speech recognition
Abstract of speech recognitionAbstract of speech recognition
Abstract of speech recognition
 
Voice based autometedtransport enquiry system in #c by Rohit malav
Voice based autometedtransport enquiry system in #c by Rohit malavVoice based autometedtransport enquiry system in #c by Rohit malav
Voice based autometedtransport enquiry system in #c by Rohit malav
 
Ken Rehor's presentation at eComm 2008
Ken Rehor's presentation at eComm 2008Ken Rehor's presentation at eComm 2008
Ken Rehor's presentation at eComm 2008
 

Último

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 

Voice Browser

  • 1. Voice Browser By Praveen Kumar Mutcharla & Ramanuja SVL
  • 2. Contents in Presentation  Introducing Voice Browser  Differences between Graphical and Voice Browser  Why a voice browser?  W3C Standards  W3C Speech Interface Framework  Speech Recognition Grammar Specification  Semantic Interpretation  Pronunciation Lexicon Analysis  VoiceXML  Functionality  Applications  Conclusion
  • 3. Introducing Voice Browser  As the name suggests Voice Browser uses Voice to navigate a web application.  This works similar to graphical web browser, as a result any telephone can be used to access appropriately designed Web-based services.  In essence, a voice browser is a software application that presents an interactive voice user interface
  • 4. Differences Between Graphical and Voice Browser  Graphical browsing is passive.  Keyboard commands are required for Graphical browser.  Voice browsing is an active process.  Voice is medium to communicate with an application.
  • 5. WHY A VOICE BROWSER?  Use of the hands during browsing might prove inconvenient or impossible. Voice input is a natural solution for such ands- busy situations.  Even in standard browser applications, using voice input is simply more fun than the alternatives.  Browser replaces the mouse in most instances to enable hands-free browsing.
  • 6. Use Cases:  Easy to use - for people with no knowledge or fear of computers.  Many companies offer services over the phone via IVR . With the advent of Voice Browsers ,they will become next generation Voice Web portals to the company's services and related websites, whether accessed via the telephone network or via the Internet.
  • 7. W3C STANDARDS  The World Wide Web Consortium (W3C) develops interoperable technologies (specifications, guidelines, software, and tools) to lead the Web to its full potential as a forum for information, commerce, communication, and collective understanding.
  • 8. W3C Speech Interface Framework Speech Recognition Grammar Specification (SRGS) Semantic Interpretation for speech recognition (SISR) Pronunciation Lexicon Specification(PLS) VoiceXML
  • 9. Speech Recognition Grammar Specification (SRGS)  A document language that can be used by developers to specify the words and patterns of words to be listened for by a speech recognizer or other grammar processor.
  • 10. SEMANTIC INTERPRETATION FOR SPEECH RECOGNITION  The recognition process matches an utterance to a speech grammar, building a parse tree as a byproduct.  There are two approaches to harvesting semantic results from the parse tree:  Annotating grammar rules with semantic interpretation tags.  Representing the result in XML.
  • 11. PRONUNCIATION LEXICON SPECIFICATION  A representation of phonetic information for use in speech recognition and synthesis.  Application developers sometimes need to ability to tune speech engines, whether for synthesis or recognition.
  • 12. VOICEXML  VoiceXML is a dialog markup language designed for telephony applications, where users are restricted to voice and DTMF (touch tone) input.  There are other languages: VoXML.
  • 13. A Sample VXML Code <vxml version=“2.0” xmlns=http://www.w3.org/2001/vxml> <form> <block> <prompt> Hello world! </prompt> </block> </form> </vxml>
  • 14. FUNCTIONALITY  Communication from the user to the system is made by issuing voice commands.  A grammar set is defined to recognise these commands.  There are few rules that are used to navigate the webpage.  Below are a few administrative controls: Speak Where am I? What is my home page?
  • 15. APPLICATIONS  Google’s-Google now can be considered as one of the best delivered application as of yet.  Though it is not as functional as a graphical browser, it is somewhat similar.  Some other applications are:  IOS’s: Siri  Windows: Cortana
  • 17. CONCLUSION  In order to make technology more familiar to the user its access should be made much easier.  It is know that visual internet access experiences various limitations such as people who are physically handicapped (especially blind users) cannot use keypads or touch screens for giving instructions.  Above all these limitations today’s generation demands to use internet independent of PC’s and also hands free access to it.  For this VOICE BROWSING is an intelligent idea.