SlideShare a Scribd company logo
1 of 15
Open Text:Speech recognition in Opencast MatterhornStephen MarquardCentre for Educational TechnologyUniversity of Cape TownJune 2011
Project goals Integrate CMU Sphinx speech recognition engine into Opencast Matterhorn Provide easy mechanism for speaker training Generate automatic transcripts of recorded lectures Allow users to correct and improve the transcripts Use feedback to improve recognition accuracy (of the same, similar or subsequent recordings)
Why is it important? Video and audio is more useful if you can: Navigate it easily Locate relevant recordings from a large set Use by students: Catch up on missed lectures (continuous play or read the transcript) Revision: jump to a particular point or find the lectures which cover topic X On the public web: Discoverability (search indexing) Similar advantages to OCR recognition of slides (but harder)
Why is it difficult? Audio quality can dramatically affect speech recognition accuracy Echo and reverberation Background noise Microphone location Speaker-independent large-vocabulary continuous speech recognition is the hardest type of ASR Best case: good acoustics, single speaker (limited dialogue), accent match with the acoustic model, limited vocabulary.
Prior work in ASR for lectures MIT Lecture Browser (SUMMIT recognizer) U. Toronto / ePresence PhD prototype by CosminMunteanu(SONIC recognizer) ETH Zurich Integration of CMU Sphinx with REPLAY by SamirAtitallah
Speech recognition software ecosystem Licensing and patents Closed Proprietary FOSS Open
Accounting for context:Language model adaptation Adapt a language model to more closely resemble the target speech Using related text for Topic modelling (vocabulary, concepts) Style-of-speech modelling 	“ok and um it's quite useful to have a very good diagnostic test of of acute hepatitis um you know to prevent kind of unnecessary um surgery um so hepatitis is really one um example of a cause of acute abdominal pain that doesn't need surgery”
Using Wikipedia for LM adaptation Goal is to adapt a “standard” LM to be specific to the topic of the audio Start somewhere: title, keywords, text from slides Select a set of documents, adapt the LM Using wikipedia, select by similarity: identify the set of documents most closely related to the starting point or keywords
Baseline performance with Sphinx4 (HUB4 acoustic and language models) Lecture audio and transcripts from Open Yale Courses http://oyc.yale.edu/ Used under CC-BY-NC-SA license.
Best-case comparison (30% WER)Transcript, HUB4 LM, Wikipedia Similarity LM Before launching into Pynchon today, I thought I would just take a few moments to look back over the books that we've read and talk about the visions of language that they have offered us, and also just to reflect for a moment on the relationship imagined between those visions of language and what is happening outside of fiction in what we might call the real world. We started this course talking about Black Boy and the way that a whole world of pressure -- political pressure, racial tension -- pushed on the borders of that work and actually changed its very material form. before launching into not pynchontoday route just take a few moments to look back cover the books that we've brad and talk about the visions of language that they have offered up and also just to reflect for mounted on the relationship imagine between those visions of language and what is happening outside of fiction in in what we might call the real world we started this course talking about black boy and a weighing bat a whole world of pressure political pressure racial tension pushed on the borders and that work and actually changed its very nature eel for before launching into not mentioned today really does take a few moments to look back over the books that we've read and talk about the visions of language that they have offered up and also just to reflect for movement on the relationship imagine between those visions of language and what is happening outside of fiction in in what we might call the reel well we started this course talking about black boy and a weighing of that a whole world of pressure political pressure of racial tension pushed on the borders of bad work and actually changed its very nature eel for
Worst-case comparison (61% WER)Transcript, HUB4 LM, Wikipedia Similarity LM i'd talk with the french revolution this party do in all the myself will forty-five minutes after throughout beginning i'm in seoul on on i wanted it to do two things unless the revolution through the eyes of maps that ulmus piano member of a treaty of public safety arguably without fascists i'd solicit were not member ah is jacobo out into an away he incarnated death jacobinchapel back he imparted the french revolution i've talked with the french are loose in this part to do in all the myself low forty five minutes after score of beginning i'm in seoul on bob and i wanted to do two things i want the revolution through the eyes of maps that elvis piano a member of the treaty of public safety are giveaway with that fascists i thought it were not member ah gee i go back into a a way he imparted that chappel been the chapel back he imparted the first revolution I'm going to talk about the French Revolution. It's hard to do. I'll leave myself about forty-five minutes after I screw around at the beginning. I want to do two things. I want to see the Revolution through the eyes of Maximilien de Robespierre, a member of the Committee of Public Safety --arguably, with Saint-Just, its most important member. In a way, Jacobin -- he incarnated the French Revolution.
Work in progress Identify requirements for recording recognition-quality audio (equipment, acoustics) Implement dynamic language model adaptation Integrate into Opencast Matterhorn workflow Show transcript to users in UI, enable search Allow users to edit / improve transcript Use edits to improve recognition
Other integration possibilities External transcription services (automate the workflow, choice between manual or automatic transcript) External speech recognition services (e.g. nexiwave.com)
Find out more Email me:stephen.marquard@uct.ac.za 	Follow me on Twitter: http://twitter.com/stephenmarquard 	Read my blog on open source language modelling and speech recognition: http://trulymadlywordly.blogspot.com 	CMU Sphinxhttp://cmusphinx.sourceforge.net/

More Related Content

Similar to Open Text: Speech recognition in Opencast Matterhorn

High School Goes High Tech May 2007
High School Goes High Tech May 2007High School Goes High Tech May 2007
High School Goes High Tech May 2007tchiasson
 
Autobiographical Essay Sample For University Entrance
Autobiographical Essay Sample For University EntranceAutobiographical Essay Sample For University Entrance
Autobiographical Essay Sample For University EntranceErica Turner
 
Example Of A Narrative Essay About Yourself.pdf
Example Of A Narrative Essay About Yourself.pdfExample Of A Narrative Essay About Yourself.pdf
Example Of A Narrative Essay About Yourself.pdfLory Holets
 
Sample Of A Term Paper How To Write A Research P
Sample Of A Term Paper How To Write A Research PSample Of A Term Paper How To Write A Research P
Sample Of A Term Paper How To Write A Research PSabrina Baloi
 
Error correction 3 mon+wed c1.2
Error correction 3 mon+wed c1.2Error correction 3 mon+wed c1.2
Error correction 3 mon+wed c1.2jeanphilippeguy
 
Masterclass on digital anthropology and our virtual lives
Masterclass on digital anthropology and our virtual livesMasterclass on digital anthropology and our virtual lives
Masterclass on digital anthropology and our virtual livesDoug Thompson
 
Can being part machine make us more human
Can being part machine make us more humanCan being part machine make us more human
Can being part machine make us more humanDoug Thompson
 
Model of the text generator
Model of the text generatorModel of the text generator
Model of the text generatoreyetech
 
033009 Vw Methods Research Panel Metanomics Transcript
033009 Vw Methods Research Panel Metanomics Transcript033009 Vw Methods Research Panel Metanomics Transcript
033009 Vw Methods Research Panel Metanomics TranscriptRemedy Communications
 
Save Our Environment Essay
Save Our Environment EssaySave Our Environment Essay
Save Our Environment EssayDeborah Reyes
 
Narrative Essay Topics For High School.pdf
Narrative Essay Topics For High School.pdfNarrative Essay Topics For High School.pdf
Narrative Essay Topics For High School.pdfHeidi Prado
 
Essay On Environment Day 2014. Online assignment writing service.
Essay On Environment Day 2014. Online assignment writing service.Essay On Environment Day 2014. Online assignment writing service.
Essay On Environment Day 2014. Online assignment writing service.Lisa Davis
 
speech production in psycholinguistics
speech production in psycholinguistics speech production in psycholinguistics
speech production in psycholinguistics Aseel K. Mahmood
 
How Do You Reference A Web Page In An Essay
How Do You Reference A Web Page In An EssayHow Do You Reference A Web Page In An Essay
How Do You Reference A Web Page In An EssayMelanie Mendoza
 
Simon Prentis and the Origin of the Brain
Simon Prentis and the Origin of the BrainSimon Prentis and the Origin of the Brain
Simon Prentis and the Origin of the BrainEditions La Dondaine
 
An Article About Ielts And 9 Sample Papers For The Engli
An Article About Ielts And 9 Sample Papers For The EngliAn Article About Ielts And 9 Sample Papers For The Engli
An Article About Ielts And 9 Sample Papers For The EngliJulie Smith
 
Exploring rhetoric in the Electronic Enlightenment
Exploring rhetoric in the Electronic EnlightenmentExploring rhetoric in the Electronic Enlightenment
Exploring rhetoric in the Electronic EnlightenmentMartin Wynne
 

Similar to Open Text: Speech recognition in Opencast Matterhorn (20)

High School Goes High Tech May 2007
High School Goes High Tech May 2007High School Goes High Tech May 2007
High School Goes High Tech May 2007
 
Autobiographical Essay Sample For University Entrance
Autobiographical Essay Sample For University EntranceAutobiographical Essay Sample For University Entrance
Autobiographical Essay Sample For University Entrance
 
Example Of A Narrative Essay About Yourself.pdf
Example Of A Narrative Essay About Yourself.pdfExample Of A Narrative Essay About Yourself.pdf
Example Of A Narrative Essay About Yourself.pdf
 
Sample Of A Term Paper How To Write A Research P
Sample Of A Term Paper How To Write A Research PSample Of A Term Paper How To Write A Research P
Sample Of A Term Paper How To Write A Research P
 
Lec12
Lec12Lec12
Lec12
 
Error correction 3 mon+wed c1.2
Error correction 3 mon+wed c1.2Error correction 3 mon+wed c1.2
Error correction 3 mon+wed c1.2
 
Masterclass on digital anthropology and our virtual lives
Masterclass on digital anthropology and our virtual livesMasterclass on digital anthropology and our virtual lives
Masterclass on digital anthropology and our virtual lives
 
Can being part machine make us more human
Can being part machine make us more humanCan being part machine make us more human
Can being part machine make us more human
 
Lec13
Lec13Lec13
Lec13
 
Model of the text generator
Model of the text generatorModel of the text generator
Model of the text generator
 
033009 Vw Methods Research Panel Metanomics Transcript
033009 Vw Methods Research Panel Metanomics Transcript033009 Vw Methods Research Panel Metanomics Transcript
033009 Vw Methods Research Panel Metanomics Transcript
 
Save Our Environment Essay
Save Our Environment EssaySave Our Environment Essay
Save Our Environment Essay
 
Narrative Essay Topics For High School.pdf
Narrative Essay Topics For High School.pdfNarrative Essay Topics For High School.pdf
Narrative Essay Topics For High School.pdf
 
Essay On Environment Day 2014. Online assignment writing service.
Essay On Environment Day 2014. Online assignment writing service.Essay On Environment Day 2014. Online assignment writing service.
Essay On Environment Day 2014. Online assignment writing service.
 
Amity NLP Notes
Amity NLP NotesAmity NLP Notes
Amity NLP Notes
 
speech production in psycholinguistics
speech production in psycholinguistics speech production in psycholinguistics
speech production in psycholinguistics
 
How Do You Reference A Web Page In An Essay
How Do You Reference A Web Page In An EssayHow Do You Reference A Web Page In An Essay
How Do You Reference A Web Page In An Essay
 
Simon Prentis and the Origin of the Brain
Simon Prentis and the Origin of the BrainSimon Prentis and the Origin of the Brain
Simon Prentis and the Origin of the Brain
 
An Article About Ielts And 9 Sample Papers For The Engli
An Article About Ielts And 9 Sample Papers For The EngliAn Article About Ielts And 9 Sample Papers For The Engli
An Article About Ielts And 9 Sample Papers For The Engli
 
Exploring rhetoric in the Electronic Enlightenment
Exploring rhetoric in the Electronic EnlightenmentExploring rhetoric in the Electronic Enlightenment
Exploring rhetoric in the Electronic Enlightenment
 

More from Stephen Marquard

The implementation of an Opt-Out Lecture Recording Policy at the University o...
The implementation of an Opt-Out Lecture Recording Policy at the University o...The implementation of an Opt-Out Lecture Recording Policy at the University o...
The implementation of an Opt-Out Lecture Recording Policy at the University o...Stephen Marquard
 
Orchestrating Self-Service Video Workflows with Opencast
Orchestrating Self-Service Video Workflows with OpencastOrchestrating Self-Service Video Workflows with Opencast
Orchestrating Self-Service Video Workflows with OpencastStephen Marquard
 
Smart workflows for Opencast
Smart workflows for OpencastSmart workflows for Opencast
Smart workflows for OpencastStephen Marquard
 
LectureSight is awesome and getting better! 
LectureSight is awesome and getting better! LectureSight is awesome and getting better! 
LectureSight is awesome and getting better! Stephen Marquard
 
Track4K in production at the University of Cape Town
Track4K in production at the University of Cape TownTrack4K in production at the University of Cape Town
Track4K in production at the University of Cape TownStephen Marquard
 
Opencast Valencia 2017: Users, groups, roles, ACLs and providers
Opencast Valencia 2017: Users, groups, roles, ACLs and providersOpencast Valencia 2017: Users, groups, roles, ACLs and providers
Opencast Valencia 2017: Users, groups, roles, ACLs and providersStephen Marquard
 
Opencast and Sakai at UCT, LectureSight and Track4K
Opencast and Sakai at UCT, LectureSight and Track4KOpencast and Sakai at UCT, LectureSight and Track4K
Opencast and Sakai at UCT, LectureSight and Track4KStephen Marquard
 
LectureSight in Action (Opencast Community Summit 2016)
LectureSight in Action (Opencast Community Summit 2016)LectureSight in Action (Opencast Community Summit 2016)
LectureSight in Action (Opencast Community Summit 2016)Stephen Marquard
 
Opencast Project Update at Open Apereo 2015
Opencast Project Update at Open Apereo 2015Opencast Project Update at Open Apereo 2015
Opencast Project Update at Open Apereo 2015Stephen Marquard
 
Why do students use lecture recordings?
Why do students use lecture recordings?Why do students use lecture recordings?
Why do students use lecture recordings?Stephen Marquard
 
Introduction to Opencast Matterhorn: Apereo 2014
Introduction to Opencast Matterhorn: Apereo 2014Introduction to Opencast Matterhorn: Apereo 2014
Introduction to Opencast Matterhorn: Apereo 2014Stephen Marquard
 
Introduction to Opencast Matterhorn, Apereo Mexico Conference, May 2014
Introduction to Opencast Matterhorn, Apereo Mexico Conference, May 2014Introduction to Opencast Matterhorn, Apereo Mexico Conference, May 2014
Introduction to Opencast Matterhorn, Apereo Mexico Conference, May 2014Stephen Marquard
 
Matterhorn 2014 Unconference: Ideas for automated post-recording video handling
Matterhorn 2014 Unconference: Ideas for automated post-recording video handlingMatterhorn 2014 Unconference: Ideas for automated post-recording video handling
Matterhorn 2014 Unconference: Ideas for automated post-recording video handlingStephen Marquard
 
Opencast Matterhorn at UCT
Opencast Matterhorn at UCTOpencast Matterhorn at UCT
Opencast Matterhorn at UCTStephen Marquard
 
Wreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognitionWreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognitionStephen Marquard
 
Advancing Online Assessment in Medical Education
Advancing Online Assessment in Medical EducationAdvancing Online Assessment in Medical Education
Advancing Online Assessment in Medical EducationStephen Marquard
 
SMS, Q&A and Course Evaluations in Sakai
SMS, Q&A and Course Evaluations in SakaiSMS, Q&A and Course Evaluations in Sakai
SMS, Q&A and Course Evaluations in SakaiStephen Marquard
 
SMS, Q&A, Course Evaluation tools in Sakai
SMS, Q&A, Course Evaluation tools in SakaiSMS, Q&A, Course Evaluation tools in Sakai
SMS, Q&A, Course Evaluation tools in SakaiStephen Marquard
 
Sakai E Learning Update Sep09
Sakai E Learning Update Sep09Sakai E Learning Update Sep09
Sakai E Learning Update Sep09Stephen Marquard
 

More from Stephen Marquard (20)

The implementation of an Opt-Out Lecture Recording Policy at the University o...
The implementation of an Opt-Out Lecture Recording Policy at the University o...The implementation of an Opt-Out Lecture Recording Policy at the University o...
The implementation of an Opt-Out Lecture Recording Policy at the University o...
 
Orchestrating Self-Service Video Workflows with Opencast
Orchestrating Self-Service Video Workflows with OpencastOrchestrating Self-Service Video Workflows with Opencast
Orchestrating Self-Service Video Workflows with Opencast
 
Smart workflows for Opencast
Smart workflows for OpencastSmart workflows for Opencast
Smart workflows for Opencast
 
LectureSight is awesome and getting better! 
LectureSight is awesome and getting better! LectureSight is awesome and getting better! 
LectureSight is awesome and getting better! 
 
Track4K in production at the University of Cape Town
Track4K in production at the University of Cape TownTrack4K in production at the University of Cape Town
Track4K in production at the University of Cape Town
 
Opencast Valencia 2017: Users, groups, roles, ACLs and providers
Opencast Valencia 2017: Users, groups, roles, ACLs and providersOpencast Valencia 2017: Users, groups, roles, ACLs and providers
Opencast Valencia 2017: Users, groups, roles, ACLs and providers
 
Opencast and Sakai at UCT, LectureSight and Track4K
Opencast and Sakai at UCT, LectureSight and Track4KOpencast and Sakai at UCT, LectureSight and Track4K
Opencast and Sakai at UCT, LectureSight and Track4K
 
LectureSight in Action (Opencast Community Summit 2016)
LectureSight in Action (Opencast Community Summit 2016)LectureSight in Action (Opencast Community Summit 2016)
LectureSight in Action (Opencast Community Summit 2016)
 
Opencast Project Update at Open Apereo 2015
Opencast Project Update at Open Apereo 2015Opencast Project Update at Open Apereo 2015
Opencast Project Update at Open Apereo 2015
 
Why do students use lecture recordings?
Why do students use lecture recordings?Why do students use lecture recordings?
Why do students use lecture recordings?
 
Introduction to Opencast Matterhorn: Apereo 2014
Introduction to Opencast Matterhorn: Apereo 2014Introduction to Opencast Matterhorn: Apereo 2014
Introduction to Opencast Matterhorn: Apereo 2014
 
Introduction to Opencast Matterhorn, Apereo Mexico Conference, May 2014
Introduction to Opencast Matterhorn, Apereo Mexico Conference, May 2014Introduction to Opencast Matterhorn, Apereo Mexico Conference, May 2014
Introduction to Opencast Matterhorn, Apereo Mexico Conference, May 2014
 
Matterhorn 2014 Unconference: Ideas for automated post-recording video handling
Matterhorn 2014 Unconference: Ideas for automated post-recording video handlingMatterhorn 2014 Unconference: Ideas for automated post-recording video handling
Matterhorn 2014 Unconference: Ideas for automated post-recording video handling
 
Opencast Matterhorn at UCT
Opencast Matterhorn at UCTOpencast Matterhorn at UCT
Opencast Matterhorn at UCT
 
Wreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognitionWreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognition
 
Advancing Online Assessment in Medical Education
Advancing Online Assessment in Medical EducationAdvancing Online Assessment in Medical Education
Advancing Online Assessment in Medical Education
 
SMS, Q&A and Course Evaluations in Sakai
SMS, Q&A and Course Evaluations in SakaiSMS, Q&A and Course Evaluations in Sakai
SMS, Q&A and Course Evaluations in Sakai
 
SMS, Q&A, Course Evaluation tools in Sakai
SMS, Q&A, Course Evaluation tools in SakaiSMS, Q&A, Course Evaluation tools in Sakai
SMS, Q&A, Course Evaluation tools in Sakai
 
Sakai E Learning Update Sep09
Sakai E Learning Update Sep09Sakai E Learning Update Sep09
Sakai E Learning Update Sep09
 
Vula is my survival kit
Vula is my survival kitVula is my survival kit
Vula is my survival kit
 

Recently uploaded

Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxShobhayan Kirtania
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...anjaliyadav012327
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 

Recently uploaded (20)

Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptx
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 

Open Text: Speech recognition in Opencast Matterhorn

  • 1. Open Text:Speech recognition in Opencast MatterhornStephen MarquardCentre for Educational TechnologyUniversity of Cape TownJune 2011
  • 2. Project goals Integrate CMU Sphinx speech recognition engine into Opencast Matterhorn Provide easy mechanism for speaker training Generate automatic transcripts of recorded lectures Allow users to correct and improve the transcripts Use feedback to improve recognition accuracy (of the same, similar or subsequent recordings)
  • 3. Why is it important? Video and audio is more useful if you can: Navigate it easily Locate relevant recordings from a large set Use by students: Catch up on missed lectures (continuous play or read the transcript) Revision: jump to a particular point or find the lectures which cover topic X On the public web: Discoverability (search indexing) Similar advantages to OCR recognition of slides (but harder)
  • 4. Why is it difficult? Audio quality can dramatically affect speech recognition accuracy Echo and reverberation Background noise Microphone location Speaker-independent large-vocabulary continuous speech recognition is the hardest type of ASR Best case: good acoustics, single speaker (limited dialogue), accent match with the acoustic model, limited vocabulary.
  • 5. Prior work in ASR for lectures MIT Lecture Browser (SUMMIT recognizer) U. Toronto / ePresence PhD prototype by CosminMunteanu(SONIC recognizer) ETH Zurich Integration of CMU Sphinx with REPLAY by SamirAtitallah
  • 6. Speech recognition software ecosystem Licensing and patents Closed Proprietary FOSS Open
  • 7.
  • 8. Accounting for context:Language model adaptation Adapt a language model to more closely resemble the target speech Using related text for Topic modelling (vocabulary, concepts) Style-of-speech modelling “ok and um it's quite useful to have a very good diagnostic test of of acute hepatitis um you know to prevent kind of unnecessary um surgery um so hepatitis is really one um example of a cause of acute abdominal pain that doesn't need surgery”
  • 9. Using Wikipedia for LM adaptation Goal is to adapt a “standard” LM to be specific to the topic of the audio Start somewhere: title, keywords, text from slides Select a set of documents, adapt the LM Using wikipedia, select by similarity: identify the set of documents most closely related to the starting point or keywords
  • 10. Baseline performance with Sphinx4 (HUB4 acoustic and language models) Lecture audio and transcripts from Open Yale Courses http://oyc.yale.edu/ Used under CC-BY-NC-SA license.
  • 11. Best-case comparison (30% WER)Transcript, HUB4 LM, Wikipedia Similarity LM Before launching into Pynchon today, I thought I would just take a few moments to look back over the books that we've read and talk about the visions of language that they have offered us, and also just to reflect for a moment on the relationship imagined between those visions of language and what is happening outside of fiction in what we might call the real world. We started this course talking about Black Boy and the way that a whole world of pressure -- political pressure, racial tension -- pushed on the borders of that work and actually changed its very material form. before launching into not pynchontoday route just take a few moments to look back cover the books that we've brad and talk about the visions of language that they have offered up and also just to reflect for mounted on the relationship imagine between those visions of language and what is happening outside of fiction in in what we might call the real world we started this course talking about black boy and a weighing bat a whole world of pressure political pressure racial tension pushed on the borders and that work and actually changed its very nature eel for before launching into not mentioned today really does take a few moments to look back over the books that we've read and talk about the visions of language that they have offered up and also just to reflect for movement on the relationship imagine between those visions of language and what is happening outside of fiction in in what we might call the reel well we started this course talking about black boy and a weighing of that a whole world of pressure political pressure of racial tension pushed on the borders of bad work and actually changed its very nature eel for
  • 12. Worst-case comparison (61% WER)Transcript, HUB4 LM, Wikipedia Similarity LM i'd talk with the french revolution this party do in all the myself will forty-five minutes after throughout beginning i'm in seoul on on i wanted it to do two things unless the revolution through the eyes of maps that ulmus piano member of a treaty of public safety arguably without fascists i'd solicit were not member ah is jacobo out into an away he incarnated death jacobinchapel back he imparted the french revolution i've talked with the french are loose in this part to do in all the myself low forty five minutes after score of beginning i'm in seoul on bob and i wanted to do two things i want the revolution through the eyes of maps that elvis piano a member of the treaty of public safety are giveaway with that fascists i thought it were not member ah gee i go back into a a way he imparted that chappel been the chapel back he imparted the first revolution I'm going to talk about the French Revolution. It's hard to do. I'll leave myself about forty-five minutes after I screw around at the beginning. I want to do two things. I want to see the Revolution through the eyes of Maximilien de Robespierre, a member of the Committee of Public Safety --arguably, with Saint-Just, its most important member. In a way, Jacobin -- he incarnated the French Revolution.
  • 13. Work in progress Identify requirements for recording recognition-quality audio (equipment, acoustics) Implement dynamic language model adaptation Integrate into Opencast Matterhorn workflow Show transcript to users in UI, enable search Allow users to edit / improve transcript Use edits to improve recognition
  • 14. Other integration possibilities External transcription services (automate the workflow, choice between manual or automatic transcript) External speech recognition services (e.g. nexiwave.com)
  • 15. Find out more Email me:stephen.marquard@uct.ac.za Follow me on Twitter: http://twitter.com/stephenmarquard Read my blog on open source language modelling and speech recognition: http://trulymadlywordly.blogspot.com CMU Sphinxhttp://cmusphinx.sourceforge.net/