SlideShare uma empresa Scribd logo
1 de 3
Baixar para ler offline
eTranscriber Transcriptions | Online Transcriber




                            Google Voice Transcription vs. Humans




In the first week of May, 2010 Google announced the worldwide release of its YouTube video transcription
services. Although released in mid 2009, the beta version of YouTube video transcription was available to a
select few Universities, News Broadcasters and Government agencies.

The history of speech recognition technology dates back to the late 1930’s, when AT&T Bell Laboratories
developed a primitive device that could recognize speech. Researchers knew that the widespread use of
speech recognition would depend on the ability to accurately and consistently perceive subtle and complex
verbal input. But because the computing technology was not good enough, the development of speech
recognition was snail paced.

50 years down the line, the capabilities of many digital electronic devices had surpassed even the best and the
costliest technologies of the 1930’s. This was made possible due to the breakthroughs made in chip and
semiconductor fabrication. The largest barriers to the speed and accuracy of speech recognition - computer
speed and power - were no longer an issue.

With more computing power (measured in units of FLOPS) than our 1930’s computer scientists could imagine,
programmers could now develop algorithms to code and decode a multitude of voice patterns. Practically they
could now build a database of thousands of different voice patterns, convert them into digital sine waves and
analyze words based on the mathematics of voice pattern signals. Over a period of time, as the speech to text
technologies became usable; many companies started offering voice recognition to its consumers – Dragon
Dictation, Microsoft (XP, Vista), Google Voice and other niche companies.



So now the question arises – How reliable are these technologies, particularly Google YouTube
transcription and will they ever compete if not surpass human transcription accuracy?

Those who like to view YouTube videos with captions turned on, you may see that the accuracy of the captions
has increased several folds over the past few months. The accuracy is going up day by day and is only going
to improve as more people use the service. As Eric Schmidt, CEO of Google Inc. says –‘ Our Google voice will
improve over a period of time as more and more users use it, it’s a self learning technology “



But there are still a few major flaws that could be foreseen despite it being a self learning technology -


     eTranscriber Transcription Services | www.etranscriber.net | Academic Transcriber Services
eTranscriber Transcriptions | Online Transcriber


   1. Accurate captioning is possible only in the case when the speaker is speaking very clearly and
      distinctly.
   2. The environment has to be free from any sort of disturbance
   3. Errors creep in because of similar sounding words such – sky and high –when spoken quickly, the
      system is not able to differentiate between the two.
   4. Interjections – People often pause or make some thinking sounds during speeches – these include
      uh’s, Hmmms, ahh etc. The recognition software makes an effort to transcribe these as well, at times
      giving hilarious results. (Search YouTube for Hilarious Google voice transcription)

And finally comes the major downside of them all



   5. Psychological Satisfaction – After the captioning has been done by the Google robots, can
      uploader be sure of the accuracy? It is quite obvious that the transcribed captions would need to be
      thoroughly checked for errors and proofread several times. This means going through the whole video
      several times, manually correcting the words, correcting the grammar portion including commas,
      hyphens, quotes etc and them uploading them. A very time consuming process.



So what is the ultimate solution to transcribing files if not voice to text recognition technology?

The answer is simple, the way digital and analog files have been transcribed for the past 50 years - Humans.



Can speech recognition technologies ever surpass human transcribing abilities?




      eTranscriber Transcription Services | www.etranscriber.net | Academic Transcriber Services
eTranscriber Transcriptions | Online Transcriber


Links




  •    Online Transcriber
  •    Interview Transcriber
  •    Academic Transcriber
  •    Verbatim Transcriber
  •    Audio Transcriber
  •    Video Transcriber
  •    Media Transcriber
  •    Verbatim Transcriber
  •    DSS Olympus Transcriber
  •    Podcast Transcriber
  •    Mp3 to Text Transcriber




      eTranscriber Transcription Services | www.etranscriber.net | Academic Transcriber Services

Mais conteúdo relacionado

Último

call Now 9811711561 Cash Payment乂 Call Girls in Dwarka Mor
call Now 9811711561 Cash Payment乂 Call Girls in Dwarka Morcall Now 9811711561 Cash Payment乂 Call Girls in Dwarka Mor
call Now 9811711561 Cash Payment乂 Call Girls in Dwarka Mor
vikas rana
 

Último (15)

$ Love Spells^ 💎 (310) 882-6330 in West Virginia, WV | Psychic Reading Best B...
$ Love Spells^ 💎 (310) 882-6330 in West Virginia, WV | Psychic Reading Best B...$ Love Spells^ 💎 (310) 882-6330 in West Virginia, WV | Psychic Reading Best B...
$ Love Spells^ 💎 (310) 882-6330 in West Virginia, WV | Psychic Reading Best B...
 
(Aarini) Russian Call Girls Surat Call Now 8250077686 Surat Escorts 24x7
(Aarini) Russian Call Girls Surat Call Now 8250077686 Surat Escorts 24x7(Aarini) Russian Call Girls Surat Call Now 8250077686 Surat Escorts 24x7
(Aarini) Russian Call Girls Surat Call Now 8250077686 Surat Escorts 24x7
 
2k Shots ≽ 9205541914 ≼ Call Girls In Palam (Delhi)
2k Shots ≽ 9205541914 ≼ Call Girls In Palam (Delhi)2k Shots ≽ 9205541914 ≼ Call Girls In Palam (Delhi)
2k Shots ≽ 9205541914 ≼ Call Girls In Palam (Delhi)
 
8377087607 Full Enjoy @24/7-CLEAN-Call Girls In Chhatarpur,
8377087607 Full Enjoy @24/7-CLEAN-Call Girls In Chhatarpur,8377087607 Full Enjoy @24/7-CLEAN-Call Girls In Chhatarpur,
8377087607 Full Enjoy @24/7-CLEAN-Call Girls In Chhatarpur,
 
WOMEN EMPOWERMENT women empowerment.pptx
WOMEN EMPOWERMENT women empowerment.pptxWOMEN EMPOWERMENT women empowerment.pptx
WOMEN EMPOWERMENT women empowerment.pptx
 
LC_YouSaidYes_NewBelieverBookletDone.pdf
LC_YouSaidYes_NewBelieverBookletDone.pdfLC_YouSaidYes_NewBelieverBookletDone.pdf
LC_YouSaidYes_NewBelieverBookletDone.pdf
 
(Anamika) VIP Call Girls Navi Mumbai Call Now 8250077686 Navi Mumbai Escorts ...
(Anamika) VIP Call Girls Navi Mumbai Call Now 8250077686 Navi Mumbai Escorts ...(Anamika) VIP Call Girls Navi Mumbai Call Now 8250077686 Navi Mumbai Escorts ...
(Anamika) VIP Call Girls Navi Mumbai Call Now 8250077686 Navi Mumbai Escorts ...
 
2k Shots ≽ 9205541914 ≼ Call Girls In Jasola (Delhi)
2k Shots ≽ 9205541914 ≼ Call Girls In Jasola (Delhi)2k Shots ≽ 9205541914 ≼ Call Girls In Jasola (Delhi)
2k Shots ≽ 9205541914 ≼ Call Girls In Jasola (Delhi)
 
9892124323, Call Girls in mumbai, Vashi Call Girls , Kurla Call girls
9892124323, Call Girls in mumbai, Vashi Call Girls , Kurla Call girls9892124323, Call Girls in mumbai, Vashi Call Girls , Kurla Call girls
9892124323, Call Girls in mumbai, Vashi Call Girls , Kurla Call girls
 
call Now 9811711561 Cash Payment乂 Call Girls in Dwarka Mor
call Now 9811711561 Cash Payment乂 Call Girls in Dwarka Morcall Now 9811711561 Cash Payment乂 Call Girls in Dwarka Mor
call Now 9811711561 Cash Payment乂 Call Girls in Dwarka Mor
 
2k Shots ≽ 9205541914 ≼ Call Girls In Dashrath Puri (Delhi)
2k Shots ≽ 9205541914 ≼ Call Girls In Dashrath Puri (Delhi)2k Shots ≽ 9205541914 ≼ Call Girls In Dashrath Puri (Delhi)
2k Shots ≽ 9205541914 ≼ Call Girls In Dashrath Puri (Delhi)
 
The Selfspace Journal Preview by Mindbrush
The Selfspace Journal Preview by MindbrushThe Selfspace Journal Preview by Mindbrush
The Selfspace Journal Preview by Mindbrush
 
2k Shots ≽ 9205541914 ≼ Call Girls In Mukherjee Nagar (Delhi)
2k Shots ≽ 9205541914 ≼ Call Girls In Mukherjee Nagar (Delhi)2k Shots ≽ 9205541914 ≼ Call Girls In Mukherjee Nagar (Delhi)
2k Shots ≽ 9205541914 ≼ Call Girls In Mukherjee Nagar (Delhi)
 
Pokemon Go... Unraveling the Conspiracy Theory
Pokemon Go... Unraveling the Conspiracy TheoryPokemon Go... Unraveling the Conspiracy Theory
Pokemon Go... Unraveling the Conspiracy Theory
 
Top Rated Pune Call Girls Tingre Nagar ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Tingre Nagar ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Tingre Nagar ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Tingre Nagar ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 

Destaque

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Destaque (20)

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 

Google voice vs human transcribing

  • 1. eTranscriber Transcriptions | Online Transcriber Google Voice Transcription vs. Humans In the first week of May, 2010 Google announced the worldwide release of its YouTube video transcription services. Although released in mid 2009, the beta version of YouTube video transcription was available to a select few Universities, News Broadcasters and Government agencies. The history of speech recognition technology dates back to the late 1930’s, when AT&T Bell Laboratories developed a primitive device that could recognize speech. Researchers knew that the widespread use of speech recognition would depend on the ability to accurately and consistently perceive subtle and complex verbal input. But because the computing technology was not good enough, the development of speech recognition was snail paced. 50 years down the line, the capabilities of many digital electronic devices had surpassed even the best and the costliest technologies of the 1930’s. This was made possible due to the breakthroughs made in chip and semiconductor fabrication. The largest barriers to the speed and accuracy of speech recognition - computer speed and power - were no longer an issue. With more computing power (measured in units of FLOPS) than our 1930’s computer scientists could imagine, programmers could now develop algorithms to code and decode a multitude of voice patterns. Practically they could now build a database of thousands of different voice patterns, convert them into digital sine waves and analyze words based on the mathematics of voice pattern signals. Over a period of time, as the speech to text technologies became usable; many companies started offering voice recognition to its consumers – Dragon Dictation, Microsoft (XP, Vista), Google Voice and other niche companies. So now the question arises – How reliable are these technologies, particularly Google YouTube transcription and will they ever compete if not surpass human transcription accuracy? Those who like to view YouTube videos with captions turned on, you may see that the accuracy of the captions has increased several folds over the past few months. The accuracy is going up day by day and is only going to improve as more people use the service. As Eric Schmidt, CEO of Google Inc. says –‘ Our Google voice will improve over a period of time as more and more users use it, it’s a self learning technology “ But there are still a few major flaws that could be foreseen despite it being a self learning technology - eTranscriber Transcription Services | www.etranscriber.net | Academic Transcriber Services
  • 2. eTranscriber Transcriptions | Online Transcriber 1. Accurate captioning is possible only in the case when the speaker is speaking very clearly and distinctly. 2. The environment has to be free from any sort of disturbance 3. Errors creep in because of similar sounding words such – sky and high –when spoken quickly, the system is not able to differentiate between the two. 4. Interjections – People often pause or make some thinking sounds during speeches – these include uh’s, Hmmms, ahh etc. The recognition software makes an effort to transcribe these as well, at times giving hilarious results. (Search YouTube for Hilarious Google voice transcription) And finally comes the major downside of them all 5. Psychological Satisfaction – After the captioning has been done by the Google robots, can uploader be sure of the accuracy? It is quite obvious that the transcribed captions would need to be thoroughly checked for errors and proofread several times. This means going through the whole video several times, manually correcting the words, correcting the grammar portion including commas, hyphens, quotes etc and them uploading them. A very time consuming process. So what is the ultimate solution to transcribing files if not voice to text recognition technology? The answer is simple, the way digital and analog files have been transcribed for the past 50 years - Humans. Can speech recognition technologies ever surpass human transcribing abilities? eTranscriber Transcription Services | www.etranscriber.net | Academic Transcriber Services
  • 3. eTranscriber Transcriptions | Online Transcriber Links • Online Transcriber • Interview Transcriber • Academic Transcriber • Verbatim Transcriber • Audio Transcriber • Video Transcriber • Media Transcriber • Verbatim Transcriber • DSS Olympus Transcriber • Podcast Transcriber • Mp3 to Text Transcriber eTranscriber Transcription Services | www.etranscriber.net | Academic Transcriber Services