SpokenMedia automatically transcribes IIHS video and enables a process to edit and translate the transcripts. Presented by Brandon Muramatsu at the IIHS Curriculum Conference, Bangalore, India, January 5, 2010.
1. Enabling the IIHS Vision, Part 1. Brandon Muramatsu, Andrew McKinney, and Peter Wilkins (our colleague at MIT). 5 January 2010. Unless otherwise specified this work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License ( creativecommons.org/licenses/by-nc-sa/3.0/us/ )
2.
3. “The IIHS Website is our commitment to a different way of looking at things.” – Aromar Revi. 5 January 2010
4. “The Institution will fail or scale based on language.” – Aromar Revi. 5 January 2010
5.
6. What did we do? Auto Transcribe Edit Translate Present SpokenMedia
13. How did we do it? Auto Transcribe Edit Translate Present SpokenMedia
14. Edit & Translate: Accuracy. Side-by-side comparison of one passage:
- Automatic transcription (low accuracy): "I think once and so challenge the of challenger planning nice is legitimacy of of government as"
- Hand transcription (time-adjusted): "I think one central challenge of planning is the legitimacy of government as"
- Translated Hindi: "मेरे खयाल से नयोजन की एक मुख्य चुनौती है सरकार की एक ऐसी मुख्य संस्थान के रूप में वैधता" ("I think one central challenge of planning is the legitimacy of government as a key institution")
15.
16. How did we do it? Auto Transcribe Edit Translate Present SpokenMedia
17.
18.
19. Thank You! Brandon Muramatsu, [email_address]; Andrew McKinney, [email_address]. Unless otherwise specified this work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License ( creativecommons.org/licenses/by-nc-sa/3.0/us/ )
Editor's Notes
A simple four-step process: first we used SpokenMedia to do automatic transcription. Next we did hand editing and translation steps. Then we created a player for the presentation of the video and transcripts…
Demo
First we used SpokenMedia to do automatic transcription.
Lecture Transcription
- Jim Glass and his group have years of research experience with spoken language.
- Lectures are a different type of spoken language.
- Much of the speech recognition research has focused on real-time transcription of news broadcasts, or on interactive voice response (telephone) systems.
- Broadcast news has something like 300 unique words in an hour-long broadcast.
- Broadcast news is well structured: prepared copy (in studio via teleprompters), clear transitions between speakers, etc.
- Lectures are conversational and spontaneous, and can use highly specialized vocabularies: engineering, physical sciences, mathematics.
Spoken Lecture Project
- Supported by iCampus.
- Includes the browser (which was just demo'd), the processor (back-end lecture transcription), and a hand workflow to do the processing.
- Approximately 400 hours of video indexed.
How does it work?
- Audio: the system only needs the audio waveform, which it extracts from the video.
- Domain model (base is a generic domain model): the system needs to know what words it can expect to find in the audio. Sources include the syllabus, lecture notes, the index from the textbook, and research papers. We build a library of domains; a separate sub-process turns the text into a domain model.
- Speaker model (base is a generic speaker model): if there are multiple lectures by the same speaker, it is best to create a speaker model. A separate sub-process builds the speaker model.
- Process: run the recognizer with the audio, domain, and speaker models.
- Output: a time-coded transcript (standard formats) that links the media and the transcript.
- Applications: search/retrieval, player.
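The flow above can be sketched in a few lines. This is a hypothetical illustration, not the actual Spoken Lecture processor API: the function name, model parameters, and the stubbed recognizer output are all assumptions made for the sake of the sketch.

```python
from dataclasses import dataclass

@dataclass
class Segment:
    start: float  # seconds from the start of the audio
    end: float
    text: str

def transcribe(audio_path, domain_model="generic", speaker_model="generic"):
    """Hypothetical wrapper: hand the extracted audio plus domain and
    speaker models to a recognizer, get back time-coded segments.
    The recognizer call is stubbed with sample output here."""
    return [
        Segment(0.0, 2.1, "I think one central challenge of planning"),
        Segment(2.1, 4.0, "is the legitimacy of government"),
    ]

segments = transcribe("lecture01.wav")
for s in segments:
    print(f"[{s.start:06.2f}-{s.end:06.2f}] {s.text}")
```

The time-coded segments are what make the later steps possible: editing adjusts `text`, translation replaces it, and the player uses `start`/`end` to link the transcript back to the media.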
We only used part of the process due to time constraints: audio separation, speech processing, a time-coded transcript, and then presentation through a SpokenMedia player.
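As a sketch of the "time-coded transcript in standard formats" step, here is a minimal conversion of segments into the SubRip (SRT) subtitle format. The segment data is hypothetical, and SRT is only one of the standard formats such a system might emit.

```python
def to_srt_time(t):
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    h, rem = divmod(int(t), 3600)
    m, s = divmod(rem, 60)
    ms = int(round((t - int(t)) * 1000))
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"

def segments_to_srt(segments):
    """segments: list of (start_sec, end_sec, text) tuples."""
    blocks = []
    for i, (start, end, text) in enumerate(segments, 1):
        blocks.append(f"{i}\n{to_srt_time(start)} --> {to_srt_time(end)}\n{text}\n")
    return "\n".join(blocks)

srt = segments_to_srt([(0.0, 2.1, "I think one central challenge of planning"),
                       (2.1, 4.0, "is the legitimacy of government")])
print(srt)
```

Because the format carries timestamps per cue, an editor can correct the words without disturbing the time alignment, and a translator can swap the text line for Hindi while the player keeps the same timing.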
Next we did hand edit and translation steps.
For this demo, we did computer-based automatic transcription, then sent a file to IIHS for "editing." That editing consisted of a hand transcription (due to the format we sent, and the low accuracy of the automatic transcription in this case) and a time alignment (though I actually feel the alignment is "off," or slow), followed by a hand translation by IIHS.
Recognizer Accuracy
- Base accuracy is approximately 50% (generic domain and speaker models).
- Accuracy increases to 80-85% with a speaker model and a specific domain model.
- This approach is good for courses with multiple lectures by the same speaker.
- Domain models get more useful as more relevant text documents are indexed (keyword/noun-phrase extraction).
- Initial results indicate that producing one 99%-accurate (hand/manual) transcript can help immensely for additional lectures by the same speaker: a better use of limited resources.
- Search accuracy is closer to 90%; searches tend to be for unique words, which the processor is better at recognizing.
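Accuracy figures like the 50% and 80-85% above are conventionally derived from the word error rate (word-level edit distance against a reference transcript); whether the project used exactly this metric is an assumption here. A minimal sketch, with made-up example strings:

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edits to turn the first i ref words into the first j hyp words
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i
    for j in range(len(hyp) + 1):
        dp[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,          # deletion
                           dp[i][j - 1] + 1,          # insertion
                           dp[i - 1][j - 1] + cost)   # substitution
    return dp[len(ref)][len(hyp)] / len(ref)

wer = word_error_rate("one central challenge of planning",
                      "once and challenge of planning")
print(f"accuracy = {1 - wer:.0%}")  # two of five words wrong -> 60%
```

This also explains the higher search accuracy noted above: searches target distinctive content words, which the recognizer gets right more often than the short function words that dominate the overall error rate.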
Then we created a player for the presentation of the video and transcripts…