SlideShare uma empresa Scribd logo
1 de 24
Baixar para ler offline
Gestures and Lip Shape Integration
                                     for
               Cued Speech Recognition

Seminar By:             Seminar Coordinator:
Mohammed Musfir         Mr. Rino P. C.
ECE-B, 08104131         Assistant Professor, ECE

                        Seminar Guide:
                        Mr. Edet Bijoy K.
                        Assistant Professor, ECE
02/12/2011   2
02/12/2011   3
02/12/2011   4
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION


                                                                Overview of Presentation

                                                                            Objective
                                                                            Introduction
                                                                            ASR Techniques
                                                                            Lip Reading – AVSR
                                                                            Cued Speech
                                                                            Integrated Recognition
                                                                            Conclusion


                                                                02/12/2011                            5
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION


                                                                Objective

                                                                       Developments in ASR technique
                                                                       AVSR Accessibility solution
                                                                              Lip Detection
                                                                              Cued Speech detection
                                                                              Integration of both




                                                                02/12/2011                              6
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION




02/12/2011
                                  INTRODUCTION



7
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION


                                                                Briefing ASR

                                                                       First successful system in 1970
                                                                       Consist of two systems
                                                                              ASR – Transcribe
                                                                              SU- Understand transcription
                                                                       Knowledge Intensive




                                                                02/12/2011                                    8
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION




02/12/2011
                                  ASR TECHNIQUES



9
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION


                                                                ASR Industry

                                                                       Industry pioneers – NUANCE, NTT Labs, AT
                                                                        & T labs
                                                                       MIT and GPL – Vox Forge, Gvoice
                                                                       Desktop Dictation -1990
                                                                       Types of ASR
                                                                              DVI – Word or phrase spotting
                                                                              LVCSR- Several thousands words



                                                                02/12/2011                                      10
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION


                                                                Techniques




                                                                       Sequence of sounds
                                                                       ASR involves
                                                                              Acquisition - Recording
                                                                              Feature Extraction – Spectral analysis
                                                                              Pattern matching and decoding


                                                                02/12/2011                                              11
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION




02/12/2011
                                                         Techniques




12
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION


                                                                Approaches

                                                                       Template Based
                                                                       Knowledge Based
                                                                       Statistical
                                                                       Learning based
                                                                       Artificial Intelligence




                                                                02/12/2011                        13
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION




02/12/2011
                                  LIP READING



14
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION




                                                




02/12/2011
                                              Front end Lips detection
                                                                         Lip Reading - AVSR




15
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION


                                                                Localisation and Tracking

                                                                            ROI determination – Sobel Edge Filtering
                                                                                Kalman Filter – Tracking
                                                                            Principal Component Analysis – Feature
                                                                             Coefficients
                                                                            Audio feature - MFCC




                                                                02/12/2011                                              16
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION




02/12/2011
                                CUED SPEECH



17
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION




02/12/2011
                                                         Overview of Cued Speech




18
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION




02/12/2011
                                INTEGRATION



19
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION


                                                                Steps

                                                                       Lip feature extraction
                                                                       Audio Synchronization with the Image
                                                                       Multistream HMM Fusion – State Synchronous
                                                                        Decision
                                                                       Automatic Image Processing to record the CUEs
                                                                       Lip Width, Aperture, Area, Upper pinch and
                                                                        Lower Pinch
                                                                       Modeling - 8 lip parameters and 10 hand
                                                                        parameters
                                                                02/12/2011                                         20
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION


                                                                Fusion

                                                                       Feature Fusion – Concatenation

                                                                                          ������ ������    ������ ������  ������ ������ ������
                                                                                       ������������ = [������������ , ������������ ] ∈               ������������
                                                                                 ������������ ������ - Lip hand feature vector
                                                                                   ������

                                                                                        ������ ������
                                                                                 ������������           - Lip shape feature vector
                                                                                        ������ ������
                                                                                 ������������ - Hand feature vector
                                                                             D - Dimensionality

                                                                02/12/2011                                                          21
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION


                                                                Conclusion

                                                                       Cued Speech Recognition – 80% accuracy
                                                                       Outstands ASR in normal environment
                                                                       Visual mode – Education of the hearing impaired
                                                                       Phoneme recognition successful
                                                                       Another product over SIRI




                                                                02/12/2011                                           22
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION


                                                                Reference
                                                                 1.     Baum L.E., Petrie T., “Statistical Inference for Probabilistic functions of Finite-State Markov
                                                                        Chains”, Annotated Mathematical Statistics, Volume 37, Number 6, pp.1554-1563, 1966
                                                                 2.     XiaoZheng Zhang, Charles C. Broun, Russell M. Mersereau, Mark A. Clements, “Automatic
                                                                        speech reading with applications to human computer interfaces”, Eurasip Journal on Applied
                                                                        Signal Processing, Volume 2002, Issue 11, pp. 1228-1247.
                                                                 3.     Jian-Ming Zhang, Liang-Min Wang, De-Jiao Niu,Yong-Zhao Zhan, “Research and
                                                                        implementation of a real time approach to lip detection in video sequence”, International
                                                                        Conference on Machine Learning and Cybernetics, IEEE, 2003.
                                                                 4.     Md. Rashidul Hasan, Mustafa Jamil, Md. Golam Rabbani Md Saifur Rahman, “Speaker
                                                                        identification using Mel frequency cepstral coefficients”, 3rd International Conference on
                                                                        Electrical And Computer Engineering, ICECE 2004.
                                                                 5.     P. Dreuw, D. Rybach, T. Deselaers, M. Zahedi, and H. Ney, “Speech recognition techniques
                                                                        for a sign language recognition system,” In Proceedings of Interspeech, pp. 2513–2516, 2007.
                                                                 6.     A. A. Montgomery and P. L. Jackson, “Physical characteristics of the lips underlying vowel lip
                                                                        reading performance,” Journal of the Acoustical Society of America, Volume 73, Number 6,
                                                                        pp. 2134–2144, 1983.
                                                                 7.     J. Leybaert, “Phonology acquired through the eyes and spelling in deaf children,” Journal of
                                                                        Experimental Child Psychology, Volume 75, pp. 291–318, 2000.



                                                                02/12/2011                                                                                           23
GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION




02/12/2011
                                THANK YOU



24

Mais conteúdo relacionado

Destaque

Electronic Toll Tax collection system in india
Electronic Toll Tax collection system in india Electronic Toll Tax collection system in india
Electronic Toll Tax collection system in india Deepak Chouhan
 
Electronic Toll Collection System
Electronic Toll Collection SystemElectronic Toll Collection System
Electronic Toll Collection SystemArshad Shareef
 
Smart card technology
Smart card technologySmart card technology
Smart card technologyLav Pratap
 
Embedded system in automobile
Embedded system in automobileEmbedded system in automobile
Embedded system in automobileAali Aalim
 
The Top Skills That Can Get You Hired in 2017
The Top Skills That Can Get You Hired in 2017The Top Skills That Can Get You Hired in 2017
The Top Skills That Can Get You Hired in 2017LinkedIn
 

Destaque (7)

Electronic Toll Tax collection system in india
Electronic Toll Tax collection system in india Electronic Toll Tax collection system in india
Electronic Toll Tax collection system in india
 
Smart quill
Smart quillSmart quill
Smart quill
 
Electronic Toll Collection System
Electronic Toll Collection SystemElectronic Toll Collection System
Electronic Toll Collection System
 
Smart card technology
Smart card technologySmart card technology
Smart card technology
 
Embedded system in automobile
Embedded system in automobileEmbedded system in automobile
Embedded system in automobile
 
Toll plaza ppt
Toll plaza pptToll plaza ppt
Toll plaza ppt
 
The Top Skills That Can Get You Hired in 2017
The Top Skills That Can Get You Hired in 2017The Top Skills That Can Get You Hired in 2017
The Top Skills That Can Get You Hired in 2017
 

Último

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 

Último (20)

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 

Gestures and Lip Shape Integration for Cued Speech Recognition

  • 1. Gestures and Lip Shape Integration for Cued Speech Recognition Seminar By: Seminar Coordinator: Mohammed Musfir Mr. Rino P. C. ECE-B, 08104131 Assistant Professor, ECE Seminar Guide: Mr. Edet Bijoy K. Assistant Professor, ECE
  • 5. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION Overview of Presentation  Objective  Introduction  ASR Techniques  Lip Reading – AVSR  Cued Speech  Integrated Recognition  Conclusion 02/12/2011 5
  • 6. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION Objective  Developments in ASR technique  AVSR Accessibility solution  Lip Detection  Cued Speech detection  Integration of both 02/12/2011 6
  • 7. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION 02/12/2011 INTRODUCTION 7
  • 8. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION Briefing ASR  First successful system in 1970  Consist of two systems  ASR – Transcribe  SU- Understand transcription  Knowledge Intensive 02/12/2011 8
  • 9. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION 02/12/2011 ASR TECHNIQUES 9
  • 10. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION ASR Industry  Industry pioneers – NUANCE, NTT Labs, AT & T labs  MIT and GPL – Vox Forge, Gvoice  Desktop Dictation -1990  Types of ASR  DVI – Word or phrase spotting  LVCSR- Several thousands words 02/12/2011 10
  • 11. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION Techniques  Sequence of sounds  ASR involves  Acquisition - Recording  Feature Extraction – Spectral analysis  Pattern matching and decoding 02/12/2011 11
  • 12. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION 02/12/2011 Techniques 12
  • 13. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION Approaches  Template Based  Knowledge Based  Statistical  Learning based  Artificial Intelligence 02/12/2011 13
  • 14. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION 02/12/2011 LIP READING 14
  • 15. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION  02/12/2011 Front end Lips detection Lip Reading - AVSR 15
  • 16. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION Localisation and Tracking  ROI determination – Sobel Edge Filtering  Kalman Filter – Tracking  Principal Component Analysis – Feature Coefficients  Audio feature - MFCC 02/12/2011 16
  • 17. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION 02/12/2011 CUED SPEECH 17
  • 18. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION 02/12/2011 Overview of Cued Speech 18
  • 19. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION 02/12/2011 INTEGRATION 19
  • 20. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION Steps  Lip feature extraction  Audio Synchronization with the Image  Multistream HMM Fusion – State Synchronous Decision  Automatic Image Processing to record the CUEs  Lip Width, Aperture, Area, Upper pinch and Lower Pinch  Modeling - 8 lip parameters and 10 hand parameters 02/12/2011 20
  • 21. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION Fusion  Feature Fusion – Concatenation ������ ������ ������ ������ ������ ������ ������ ������������ = [������������ , ������������ ] ∈ ������������ ������������ ������ - Lip hand feature vector ������ ������ ������ ������������ - Lip shape feature vector ������ ������ ������������ - Hand feature vector D - Dimensionality 02/12/2011 21
  • 22. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION Conclusion  Cued Speech Recognition – 80% accuracy  Outstands ASR in normal environment  Visual mode – Education of the hearing impaired  Phoneme recognition successful  Another product over SIRI 02/12/2011 22
  • 23. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION Reference 1. Baum L.E., Petrie T., “Statistical Inference for Probabilistic functions of Finite-State Markov Chains”, Annotated Mathematical Statistics, Volume 37, Number 6, pp.1554-1563, 1966 2. XiaoZheng Zhang, Charles C. Broun, Russell M. Mersereau, Mark A. Clements, “Automatic speech reading with applications to human computer interfaces”, Eurasip Journal on Applied Signal Processing, Volume 2002, Issue 11, pp. 1228-1247. 3. Jian-Ming Zhang, Liang-Min Wang, De-Jiao Niu,Yong-Zhao Zhan, “Research and implementation of a real time approach to lip detection in video sequence”, International Conference on Machine Learning and Cybernetics, IEEE, 2003. 4. Md. Rashidul Hasan, Mustafa Jamil, Md. Golam Rabbani Md Saifur Rahman, “Speaker identification using Mel frequency cepstral coefficients”, 3rd International Conference on Electrical And Computer Engineering, ICECE 2004. 5. P. Dreuw, D. Rybach, T. Deselaers, M. Zahedi, and H. Ney, “Speech recognition techniques for a sign language recognition system,” In Proceedings of Interspeech, pp. 2513–2516, 2007. 6. A. A. Montgomery and P. L. Jackson, “Physical characteristics of the lips underlying vowel lip reading performance,” Journal of the Acoustical Society of America, Volume 73, Number 6, pp. 2134–2144, 1983. 7. J. Leybaert, “Phonology acquired through the eyes and spelling in deaf children,” Journal of Experimental Child Psychology, Volume 75, pp. 291–318, 2000. 02/12/2011 23
  • 24. GESTURE AND LIP SHAPE INTEGRATION FOR CUED SPEECH RECOGNITION 02/12/2011 THANK YOU 24