SlideShare uma empresa Scribd logo
1 de 29
SUBMITTED TO-MR. 
ABHISHEK SRIVASTAVA 
SUBMITTED BY-VIKALP 
MAHENDRA 
(EC-11)
 Introduction 
 Block Diagram 
 Linguistic Levels Of Analysis 
 Phonetics 
 Organs Of Speech And Articulation 
 Acoustic Model 
 Circuit Diagram 
 Components Used 
 Features Of HM2007 
 Working 
 Extracting Phonemes In Frequency Domain 
 Markov Model 
 Advantages 
 Applications 
 Conclusion
 Analyses sound and converts spoken word into text 
Uses knowledge of spoken English 
 Programs are available for voice recognition. 
 Systems work best on Windows XP & Windows Vista
Computers 
Databases Algorithms 
Robotics Natural Language Processing Search 
Information 
Retrieval 
Machine 
Translation 
Language 
Analysis 
Semantics
 Speech 
 Written language 
 Phonology: sounds / letters / pronunciation 
 Morphology: the structure of words 
 Syntax: how these sequences are structured 
 Semantics: meaning of the strings
The Study of the way Humans make, Transmit, and 
receive sounds 
Phonology - the study of sound systems of languages 
 A typical word such as moon broken down into 
three phonemes: m, ue , n. 
Phoneme represents all vowels and consonants of 
spoken speech
 Most vowel sounds are modified by the shape of 
the lips (rounded / spread / neutral) 
 Sounds are made by vibrating the vocal cords 
(voicing) 
 Vowels can be :- 
 Single sounds – Monophthongs or pure vowels 
 Double sounds - Diphthongs 
 Triple sounds - Triphthongs 
 Pure vowels usually come in pairs consisting of 
long and short sounds
This is found in the word tea. The lips are spread and the sound is long. 
This is found in the word hip. The lips are slightly spread and the sound is short. 
The tongue tip is raised slightly at the front towards the alveolar. In the longer sound the 
tongue is raised higher.
 This sound is made by relaxing the mouth and 
keeping your lips in a neutral position and making 
a short sound. It is found in words like paper, 
over, about, and common in weak verbs in spoken 
English.
The long sound – you, too & blue 
The short sound –Good, would & 
wool 
The lips are rounded and the centre 
and back of the tongue is raised towards 
the soft plate. For the longer sound the 
tongue is raised higher and the lips are 
more rounded. 
This sound is made with the mouth 
spread wide open. It is found in – cat, 
man, apple & ran
 Here we have three sounds: The sounds from - 
1) for 2) tour 3) go 
 Triphthongs are combinations of three sounds- 
English has 1 triphthong (a diphthong + a 
schwa sound) 
 Diphthongs are combinations of two sounds.
Diphthongs are combinations of pure vowels. 
•a:+ I = ‘aI’ - tie, buy, height & night 
•e + I = ‘eI’ -way, paid & gate 
•o: + I = ‘oI’ – boy, coin & coy 
•e + = e - where, hair & care 
• I + = I - here, hear & beer 
e e 
e e
 The audio recording of speech to create a 
statistical representation of sound. 
 To create a speech recognition engine, a large 
database of models is created to match each 
phoneme 
 These database models have stored 
phonemes 
 The language model has the grammar of the 
sentence to decode our spoken word to text.
 HM 2007 IC 
 SRAM 8K*8 
 LATCH 74LS373 
 INPUT BUFFER 7448 
 XTAL 3.57MHz 
 PCB 
 KEYPAD 
 PC MOUNTED SWITCHES 
 7 SEGMENT DISPLAY 
 MICROPHONE 
 22K RESISTOR 
 100K RESISTOR 
 .0047F CAPACITOR
 A single chip voice recognition system 
having 48 pin . 
 Manufactured by Hualon 
 Maximum 40 word and word length 1.92 sec 
 Microphone support 
 5V power supply
 How a computer convert spoken speech into data ?? 
 When we speak, a microphone converts the analog signal of our voice into 
digital chunks of data that the computer analyzes. 
 It is from this data that the computer extracts enough information that 
confidently guess the word being spoken
 To extract phonemes 
 Phonemes are linguistic units 
 The sounds that group together form words 
 Phoneme converts into sound & depends on many factors
 aa - father 
 ae - cat 
 ah - cut 
 ao - dog 
 aw - foul 
 ng - sing 
 t - talk 
 th - thin 
 uh - book 
waveform shows 
phonemes freq 
characteristics
 Phonemes are extracted by running waveform through Fourier 
transform 
 Easily visible in frequency domain 
 This can be make out by seeing spectrograph 
 Spectrograph is a 3-D plot of waveform freq and amplitude 
versus time and amplitude is shown in grey colour
 Computer generates list of phoneme 
 These phoneme have to be converted into words and to 
sentence so Markov model is used 
 It compares the observed phoneme with the stored phoneme
 In this, word tomato is written both in English and American 
English format 
 This idea is used upto the level of sentences and improved 
recognition
 It is used to translate different form of language 
 It Is used in telephones 
 The std land line telephone has a bandwidth of 64kb/s. 
 Sampling rate of 8khz 
 In Std desktop P.C ,the limiting factor is sound card.It can 
record sampling rate between 16 kHz to 48 kHz
 MILITARY 
 HELICOPTERS 
 IN MOBILE SMARTPHONES` 
 SPEECH CONTROLLED 
APPLIANCES 
 VOICE RECOGNITION SECURITY
 Speech recognition system is one of the latest technology . 
 Ir reduces costs like that of training 
 Steps : 
 Fourier transform of signal 
 Extraction of Phonemes 
 Formation of word on the basis of Markov Models 
 Charm of Simplicity 
 With the advent of this technology, we will hopefully see a 
new era of human computer interaction .
 From: Chapter 1 of An Introduction to Natural Language 
Processing, Computational Linguistics, and Speech 
Recognition, by Daniel Jurafsky and James H. Martin 
 http://en.wikipedia.org/wiki/acoustic model 
 http://en.wikipedia.org/wiki/speech recognition 
 www.wikpedia.org 
 www.slideshare.net 
 Natural Language Processing by Rada Mihalcea 
 www.youtube.com
Speech and Language Processing

Mais conteúdo relacionado

Mais procurados

Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by IqbalIqbal
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversionankit_saluja
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition systemAlok Tiwari
 
Unit 1 speech processing
Unit 1 speech processingUnit 1 speech processing
Unit 1 speech processingazhagujaisudhan
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceIlhaan Marwat
 
Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition Goa App
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speechBilgin Aksoy
 
Lecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language TechnologyLecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language TechnologyMarina Santini
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech RecognitionAhmed Moawad
 
NLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological ParsingNLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological ParsingHemantha Kulathilake
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologySrijanKumar18
 
Deep Learning For Speech Recognition
Deep Learning For Speech RecognitionDeep Learning For Speech Recognition
Deep Learning For Speech Recognitionananth
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniquessonukumar142
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognitionCharu Joshi
 
Introduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-SpeechIntroduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-SpeechNgwe Tun
 
speech processing basics
speech processing basicsspeech processing basics
speech processing basicssivakumar m
 
Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Mustafa Jarrar
 
Vectorization In NLP.pptx
Vectorization In NLP.pptxVectorization In NLP.pptx
Vectorization In NLP.pptxChode Amarnath
 
Summary of Multilingual Natural Language Processing Applications: From Theory...
Summary of Multilingual Natural Language Processing Applications: From Theory...Summary of Multilingual Natural Language Processing Applications: From Theory...
Summary of Multilingual Natural Language Processing Applications: From Theory...iwan_rg
 

Mais procurados (20)

Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by Iqbal
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
Unit 1 speech processing
Unit 1 speech processingUnit 1 speech processing
Unit 1 speech processing
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
 
Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speech
 
Lecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language TechnologyLecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language Technology
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
NLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological ParsingNLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological Parsing
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Deep Learning For Speech Recognition
Deep Learning For Speech RecognitionDeep Learning For Speech Recognition
Deep Learning For Speech Recognition
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniques
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
Introduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-SpeechIntroduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-Speech
 
speech processing basics
speech processing basicsspeech processing basics
speech processing basics
 
Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing
 
Vectorization In NLP.pptx
Vectorization In NLP.pptxVectorization In NLP.pptx
Vectorization In NLP.pptx
 
Summary of Multilingual Natural Language Processing Applications: From Theory...
Summary of Multilingual Natural Language Processing Applications: From Theory...Summary of Multilingual Natural Language Processing Applications: From Theory...
Summary of Multilingual Natural Language Processing Applications: From Theory...
 

Destaque

Guo.Rundi.WritingSample
Guo.Rundi.WritingSampleGuo.Rundi.WritingSample
Guo.Rundi.WritingSampleWendy Guo
 
Noise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech RecognitionNoise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech Recognitionأحلام انصارى
 
Visual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to MeaningVisual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to Meaningfawzia
 
Reading Models and Schema Theory
Reading Models and Schema TheoryReading Models and Schema Theory
Reading Models and Schema TheoryAbbie Laudato
 
Three Reading Models
Three Reading ModelsThree Reading Models
Three Reading ModelsAntonette A.
 

Destaque (8)

Wonderstruck
WonderstruckWonderstruck
Wonderstruck
 
Idioms
IdiomsIdioms
Idioms
 
Guo.Rundi.WritingSample
Guo.Rundi.WritingSampleGuo.Rundi.WritingSample
Guo.Rundi.WritingSample
 
Noise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech RecognitionNoise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech Recognition
 
Visual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to MeaningVisual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to Meaning
 
FINAL report
FINAL reportFINAL report
FINAL report
 
Reading Models and Schema Theory
Reading Models and Schema TheoryReading Models and Schema Theory
Reading Models and Schema Theory
 
Three Reading Models
Three Reading ModelsThree Reading Models
Three Reading Models
 

Semelhante a Speech and Language Processing

How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization worksMuhammad Taqi
 
Silent sound interface
Silent sound interfaceSilent sound interface
Silent sound interfaceJeevitha Reddy
 
SAP (SPEECH AND AUDIO PROCESSING)
SAP (SPEECH AND AUDIO PROCESSING)SAP (SPEECH AND AUDIO PROCESSING)
SAP (SPEECH AND AUDIO PROCESSING)dineshkatta4
 
Coms30123 Synthesis 3 Projector
Coms30123 Synthesis 3 ProjectorComs30123 Synthesis 3 Projector
Coms30123 Synthesis 3 ProjectorDr. Cupid Lucid
 
American Standard Sign Language Representation Using Speech Recognition
American Standard Sign Language Representation Using Speech RecognitionAmerican Standard Sign Language Representation Using Speech Recognition
American Standard Sign Language Representation Using Speech Recognitionpaperpublications3
 
speech recognition and removal of disfluencies
speech recognition and removal of disfluenciesspeech recognition and removal of disfluencies
speech recognition and removal of disfluenciesAnkit Sharma
 
Principal characteristics of speech
Principal characteristics of speechPrincipal characteristics of speech
Principal characteristics of speechNikolay Karpov
 
Voice Recognition System using Template Matching
Voice Recognition System using Template MatchingVoice Recognition System using Template Matching
Voice Recognition System using Template MatchingIJORCS
 
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS cscpconf
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderIJTET Journal
 
B047006011
B047006011B047006011
B047006011inventy
 
B047006011
B047006011B047006011
B047006011inventy
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentationhimanshubhatti
 
SMATalk: Standard Malay Text to Speech Talk System
SMATalk: Standard Malay Text to Speech Talk SystemSMATalk: Standard Malay Text to Speech Talk System
SMATalk: Standard Malay Text to Speech Talk SystemCSCJournals
 

Semelhante a Speech and Language Processing (20)

An Introduction To Speech Recognition
An Introduction To Speech RecognitionAn Introduction To Speech Recognition
An Introduction To Speech Recognition
 
Assign
AssignAssign
Assign
 
How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization works
 
Silent sound interface
Silent sound interfaceSilent sound interface
Silent sound interface
 
SAP (SPEECH AND AUDIO PROCESSING)
SAP (SPEECH AND AUDIO PROCESSING)SAP (SPEECH AND AUDIO PROCESSING)
SAP (SPEECH AND AUDIO PROCESSING)
 
Isolated English Word Recognition System: Appropriate for Bengali-accented En...
Isolated English Word Recognition System: Appropriate for Bengali-accented En...Isolated English Word Recognition System: Appropriate for Bengali-accented En...
Isolated English Word Recognition System: Appropriate for Bengali-accented En...
 
Speech Synthesis.pptx
Speech Synthesis.pptxSpeech Synthesis.pptx
Speech Synthesis.pptx
 
Coms30123 Synthesis 3 Projector
Coms30123 Synthesis 3 ProjectorComs30123 Synthesis 3 Projector
Coms30123 Synthesis 3 Projector
 
Automatic Speech Recognion
Automatic Speech RecognionAutomatic Speech Recognion
Automatic Speech Recognion
 
American Standard Sign Language Representation Using Speech Recognition
American Standard Sign Language Representation Using Speech RecognitionAmerican Standard Sign Language Representation Using Speech Recognition
American Standard Sign Language Representation Using Speech Recognition
 
speech recognition and removal of disfluencies
speech recognition and removal of disfluenciesspeech recognition and removal of disfluencies
speech recognition and removal of disfluencies
 
Principal characteristics of speech
Principal characteristics of speechPrincipal characteristics of speech
Principal characteristics of speech
 
Voice Recognition System using Template Matching
Voice Recognition System using Template MatchingVoice Recognition System using Template Matching
Voice Recognition System using Template Matching
 
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
B047006011
B047006011B047006011
B047006011
 
B047006011
B047006011B047006011
B047006011
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentation
 
A Case Study on DSP (Speech Processing)
A Case Study on DSP (Speech Processing)A Case Study on DSP (Speech Processing)
A Case Study on DSP (Speech Processing)
 
SMATalk: Standard Malay Text to Speech Talk System
SMATalk: Standard Malay Text to Speech Talk SystemSMATalk: Standard Malay Text to Speech Talk System
SMATalk: Standard Malay Text to Speech Talk System
 

Último

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 

Último (20)

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 

Speech and Language Processing

  • 1. SUBMITTED TO-MR. ABHISHEK SRIVASTAVA SUBMITTED BY-VIKALP MAHENDRA (EC-11)
  • 2.  Introduction  Block Diagram  Linguistic Levels Of Analysis  Phonetics  Organs Of Speech And Articulation  Acoustic Model  Circuit Diagram  Components Used  Features Of HM2007  Working  Extracting Phonemes In Frequency Domain  Markov Model  Advantages  Applications  Conclusion
  • 3.  Analyses sound and converts spoken word into text Uses knowledge of spoken English  Programs are available for voice recognition.  Systems work best on Windows XP & Windows Vista
  • 4. Computers Databases Algorithms Robotics Natural Language Processing Search Information Retrieval Machine Translation Language Analysis Semantics
  • 5.  Speech  Written language  Phonology: sounds / letters / pronunciation  Morphology: the structure of words  Syntax: how these sequences are structured  Semantics: meaning of the strings
  • 6. The Study of the way Humans make, Transmit, and receive sounds Phonology - the study of sound systems of languages  A typical word such as moon broken down into three phonemes: m, ue , n. Phoneme represents all vowels and consonants of spoken speech
  • 7.
  • 8.  Most vowel sounds are modified by the shape of the lips (rounded / spread / neutral)  Sounds are made by vibrating the vocal cords (voicing)  Vowels can be :-  Single sounds – Monophthongs or pure vowels  Double sounds - Diphthongs  Triple sounds - Triphthongs  Pure vowels usually come in pairs consisting of long and short sounds
  • 9. This is found in the word tea. The lips are spread and the sound is long. This is found in the word hip. The lips are slightly spread and the sound is short. The tongue tip is raised slightly at the front towards the alveolar. In the longer sound the tongue is raised higher.
  • 10.  This sound is made by relaxing the mouth and keeping your lips in a neutral position and making a short sound. It is found in words like paper, over, about, and common in weak verbs in spoken English.
  • 11. The long sound – you, too & blue The short sound –Good, would & wool The lips are rounded and the centre and back of the tongue is raised towards the soft plate. For the longer sound the tongue is raised higher and the lips are more rounded. This sound is made with the mouth spread wide open. It is found in – cat, man, apple & ran
  • 12.  Here we have three sounds: The sounds from - 1) for 2) tour 3) go  Triphthongs are combinations of three sounds- English has 1 triphthong (a diphthong + a schwa sound)  Diphthongs are combinations of two sounds.
  • 13. Diphthongs are combinations of pure vowels. •a:+ I = ‘aI’ - tie, buy, height & night •e + I = ‘eI’ -way, paid & gate •o: + I = ‘oI’ – boy, coin & coy •e + = e - where, hair & care • I + = I - here, hear & beer e e e e
  • 14.  The audio recording of speech to create a statistical representation of sound.  To create a speech recognition engine, a large database of models is created to match each phoneme  These database models have stored phonemes  The language model has the grammar of the sentence to decode our spoken word to text.
  • 15.
  • 16.  HM 2007 IC  SRAM 8K*8  LATCH 74LS373  INPUT BUFFER 7448  XTAL 3.57MHz  PCB  KEYPAD  PC MOUNTED SWITCHES  7 SEGMENT DISPLAY  MICROPHONE  22K RESISTOR  100K RESISTOR  .0047F CAPACITOR
  • 17.  A single chip voice recognition system having 48 pin .  Manufactured by Hualon  Maximum 40 word and word length 1.92 sec  Microphone support  5V power supply
  • 18.
  • 19.  How a computer convert spoken speech into data ??  When we speak, a microphone converts the analog signal of our voice into digital chunks of data that the computer analyzes.  It is from this data that the computer extracts enough information that confidently guess the word being spoken
  • 20.  To extract phonemes  Phonemes are linguistic units  The sounds that group together form words  Phoneme converts into sound & depends on many factors
  • 21.  aa - father  ae - cat  ah - cut  ao - dog  aw - foul  ng - sing  t - talk  th - thin  uh - book waveform shows phonemes freq characteristics
  • 22.  Phonemes are extracted by running waveform through Fourier transform  Easily visible in frequency domain  This can be make out by seeing spectrograph  Spectrograph is a 3-D plot of waveform freq and amplitude versus time and amplitude is shown in grey colour
  • 23.  Computer generates list of phoneme  These phoneme have to be converted into words and to sentence so Markov model is used  It compares the observed phoneme with the stored phoneme
  • 24.  In this, word tomato is written both in English and American English format  This idea is used upto the level of sentences and improved recognition
  • 25.  It is used to translate different form of language  It Is used in telephones  The std land line telephone has a bandwidth of 64kb/s.  Sampling rate of 8khz  In Std desktop P.C ,the limiting factor is sound card.It can record sampling rate between 16 kHz to 48 kHz
  • 26.  MILITARY  HELICOPTERS  IN MOBILE SMARTPHONES`  SPEECH CONTROLLED APPLIANCES  VOICE RECOGNITION SECURITY
  • 27.  Speech recognition system is one of the latest technology .  Ir reduces costs like that of training  Steps :  Fourier transform of signal  Extraction of Phonemes  Formation of word on the basis of Markov Models  Charm of Simplicity  With the advent of this technology, we will hopefully see a new era of human computer interaction .
  • 28.  From: Chapter 1 of An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, by Daniel Jurafsky and James H. Martin  http://en.wikipedia.org/wiki/acoustic model  http://en.wikipedia.org/wiki/speech recognition  www.wikpedia.org  www.slideshare.net  Natural Language Processing by Rada Mihalcea  www.youtube.com