Introduction
Physiological Characteristics
Behavioral Characteristics
• Biometrics are automated methods of
recognizing a person based on a physiological
or behavioral characteristic.
• Physiological characteristics relate to
the shape of the body.
• Behavioral characteristics relate to the
behavior of a person, including but not limited to
the voice.

IQBAL
Reg # 9952
MBA(M) – Section A
• Simply put, speech recognition is the process of
converting spoken input to text.
• It is also known as speech-to-text and, loosely, as voice
recognition.
• Technically, speech recognition is the process of
converting an acoustic signal, captured by a
microphone or a telephone, into a sequence of words.
• Dragon NaturallySpeaking, developed by
Dragon Systems and later acquired by Nuance
Communications.
• Microsoft Speech Recognition by Microsoft.
• ViaVoice by IBM.
• NUANCE COMMUNICATIONS
• Nuance Communications is a
multinational computer software technology
corporation, headquartered in Burlington,
Massachusetts, USA, that provides speech and
imaging applications.
Current business products focus on server & embedded
speech recognition, telephone call steering systems,
automated telephone directory services, medical
transcription software & systems, optical character
recognition software, and desktop imaging software.
ScanSoft and Nuance merged in October 2005;
before the merger, the two companies competed in
the commercial large-scale speech application
business.
• Nuance was founded in 1994 as a spinoff
of SRI International's Speech Technology
and Research (STAR) Laboratory to
commercialise the speaker-independent
speech recognition technology developed for
the US government at SRI.
• Based in Menlo Park, California, Nuance
deployed its first commercial large-scale
speech application in 1996.
1994 – Nuance spun off from SRI's STAR Lab.
1996 – Nuance deployed its first commercial speech application.
April 13, 2000 – Nuance filed its initial public offering on the Nasdaq under the symbol NUAN.
• Dragon NaturallySpeaking is speech
recognition software.
• The software has three primary areas of
functionality:
  ▪ Dictation
  ▪ Text-To-Speech
  ▪ Command Input
• Dictation
  ▪ As the user dictates, the software converts the
  spoken words into text and displays it.
• Text-To-Speech
  ▪ Text that is present or selected can be
  converted to speech.
• Command Input
  ▪ The user can control operations by voice,
  without using the keyboard, simply by giving
  commands.
• TRANSLATION
  ▪ It cannot translate from one language to
  another.
• UNTRAINED
  ▪ It cannot work without training; the user must
  train it first, and there is no dynamic acceptance
  of an untrained voice.
• PLATFORM DEPENDENT
  ▪ It cannot run on platforms other than
  Windows, such as Mac OS or Ubuntu.
• To develop a translation feature in the near
future, making the product available
to all types of users.
• To make the system platform
independent.
• Home Automation
There is a lot of interest in the use of speech
recognition (SR) in domestic appliances such as ovens,
refrigerators, dishwashers and washing
machines.
• Wearable Computers
The most futuristic application is in the
use and functionality of wearable
computers. These would allow people
to go about their everyday lives, but
still store information (thoughts, notes, to-do lists)
verbally, or communicate via email, phone or videophone,
through wearable devices. Crucially, this would be done
without having to interact with the device, or even
remember that it is there; the user would just speak, the
device would know what to do with the speech, and would
carry out the appropriate task.
• People with Disabilities
Speech recognition technology helps people with
disabilities interact with computers more easily.
People with motor limitations, who cannot use a
standard keyboard and mouse, can use their voices
to navigate the computer and create documents.
• Dyslexic People
Speech Recognition Technology is helpful for people
with learning disabilities, who experience difficulty
with spelling and writing.
• Speech to text module
• Command Input module
  Diagram labels: input command, predefined commands, execute command, define command.
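One way to picture the command input module is as a lookup against a table of predefined commands. The following is a minimal Python sketch under that assumption. The names `define_command` and `execute_command`, and the example phrases, are hypothetical and do not come from Dragon NaturallySpeaking or any real product API. The recognized text is assumed to arrive as a plain string from the speech to text module.

```python
from typing import Callable, Dict, Optional

# Hypothetical registry: maps a spoken phrase to the action it triggers.
CommandTable = Dict[str, Callable[[], str]]


def normalize(phrase: str) -> str:
    """Lower-case and collapse whitespace so 'Open  Mail' matches 'open mail'."""
    return " ".join(phrase.lower().split())


def define_command(table: CommandTable, phrase: str, action: Callable[[], str]) -> None:
    """Add a phrase to the set of predefined commands."""
    table[normalize(phrase)] = action


def execute_command(table: CommandTable, recognized_text: str) -> Optional[str]:
    """Run the action for the recognized text, or return None if it is not a command."""
    action = table.get(normalize(recognized_text))
    return action() if action is not None else None


# Illustrative commands only; a real system would call into the operating system.
commands: CommandTable = {}
define_command(commands, "open mail", lambda: "mail opened")
define_command(commands, "save document", lambda: "document saved")

result = execute_command(commands, "Open  Mail")
unknown = execute_command(commands, "make coffee")
```

Returning `None` for unrecognized text leaves the caller free to treat the utterance as dictation instead of a command, which is one plausible way to combine the two modules.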
• Sound Cards
A sound card with the cleanest A/D (analog-to-digital)
conversion is recommended.
• Microphone
The best choice of microphone is the
headset style.
• Computers / Processors
The faster the processor, the better speech
recognition works. For good speech
recognition, you should have at least a 1 GHz
processor and 1 GB of RAM.
• Windows operating system (NT, XP, 7, 8).
• Audio driver software
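The requirements above can be written down as a simple checklist. This is only an illustrative Python sketch: it treats the 1 GHz and 1 GB figures as minimums, which is an assumption, and the `Machine` class and `unmet_requirements` function are hypothetical names. The machine's details are passed in by hand rather than detected.

```python
from dataclasses import dataclass
from typing import List

# Thresholds taken from the slide; treating them as minimums is an assumption.
MIN_CPU_GHZ = 1.0
MIN_RAM_GB = 1.0
SUPPORTED_WINDOWS = {"NT", "XP", "7", "8"}


@dataclass
class Machine:
    cpu_ghz: float
    ram_gb: float
    windows_version: str
    has_audio_driver: bool


def unmet_requirements(machine: Machine) -> List[str]:
    """Return the requirements this machine fails, in checklist order."""
    problems: List[str] = []
    if machine.cpu_ghz < MIN_CPU_GHZ:
        problems.append("processor below 1 GHz")
    if machine.ram_gb < MIN_RAM_GB:
        problems.append("less than 1 GB of RAM")
    if machine.windows_version not in SUPPORTED_WINDOWS:
        problems.append("unsupported operating system")
    if not machine.has_audio_driver:
        problems.append("audio driver missing")
    return problems


ok_machine = Machine(cpu_ghz=2.4, ram_gb=4.0, windows_version="7", has_audio_driver=True)
old_machine = Machine(cpu_ghz=0.8, ram_gb=0.5, windows_version="XP", has_audio_driver=True)
```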
• In a business such as online shopping,
organisations like Amazon have a separate
department for replying to customers. Used there
in place of typing e-mail replies, speech
recognition can save time.
• The cost of developing the product is
high.
• The time required to develop the product is
moderate.
• Speech recognition will revolutionize the way
people conduct business over the Web and will,
ultimately, differentiate world-class
e-businesses. VoiceXML ties speech recognition
and telephony together and provides the
technology with which businesses can develop
and deploy voice-enabled Web solutions
today.
• These solutions can greatly expand the
accessibility of Web-based self-service
transactions to customers who would otherwise
not have access, and, at the same time,
leverage a business’ existing Web investments.
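To make the VoiceXML idea concrete, here is a small Python sketch that generates a one-field VoiceXML 2.0 dialog with the standard library. The order-status scenario, the field name `order_number` and the prompt wording are assumptions made for illustration. A real deployment would serve such a document to a voice browser on a telephony platform.

```python
import xml.etree.ElementTree as ET

VXML_NS = "http://www.w3.org/2001/vxml"  # VoiceXML 2.0 namespace


def tag(name: str) -> str:
    """Qualify an element name with the VoiceXML namespace."""
    return f"{{{VXML_NS}}}{name}"


def build_order_status_dialog() -> str:
    """Build a one-field VoiceXML form (hypothetical order-status dialog)."""
    ET.register_namespace("", VXML_NS)
    vxml = ET.Element(tag("vxml"), {"version": "2.0"})
    form = ET.SubElement(vxml, tag("form"), {"id": "order_status"})

    # The built-in "digits" type lets the caller speak a string of digits.
    field = ET.SubElement(form, tag("field"), {"name": "order_number", "type": "digits"})
    prompt = ET.SubElement(field, tag("prompt"))
    prompt.text = "Please say your order number."

    # <filled> runs once the field has been recognized.
    filled = ET.SubElement(field, tag("filled"))
    confirm = ET.SubElement(filled, tag("prompt"))
    confirm.text = "Looking up order "
    ET.SubElement(confirm, tag("value"), {"expr": "order_number"})

    return ET.tostring(vxml, encoding="unicode")


document = build_order_status_dialog()
print(document)
```

Because the dialog is ordinary XML, it can be produced by the same Web back end that already serves HTML pages, which is the sense in which a business reuses its existing Web investment.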
• Speech recognition and VoiceXML clearly
represent the next wave of the Web. In the near
future, people will operate their home and
business computers by speech rather than by keyboard
or mouse, and home automation will be based
entirely on speech recognition systems.
Abstract of speech recognition
