SlideShare uma empresa Scribd logo
1 de 18
Research & Development
Analysing media in the
cloud
An experiment and a marketplace
Tristan Ferne
Executive Producer
BBC Research & Development
Research & Development
A experiment in using the cloud to process
a radio archive
A prototype for the World Service archive
A marketplace for analysing media in the
cloud
Research & Development
ABC-IP
Automatic Broadcast Content Interlinking Project
Unlocking media archives by making better use of
metadata
TSB competition for “Metadata: increasing the value of
digital content”
BBC R&D and Metabroadcast
May 2011 - May 2013
Research & Development
The BBC World Service archive
A 3-year digitisation project
50,000 radio programmes from the past 45 years
3 years of continuous audio
500TB of high quality audio
Research & Development
The missing metadata
Missing fields
Incorrect data
Spelling mistakes
Research & Development
Listening machines
Research & Development
Noisy transcripts
to be raised in a crisp and easy gait collar tradition and mystique
and net bottle westphal mia ballroom with a fifth will one of your
very well that p. c. set a caustic wet plate is sprint says it twice to
purposes again who's addicted across stick is a podium which
stopped at a slow start to the masses of setting up a world and
on top was a big nineteen ninety three after a renewed spirit of
the big dig ,comma off trillo .period when you are unable to
compose and see what it's stole to working for a while at the
guys when i started the eighth that we teach eighteen hamper
and a timeless dave they'd each code for my list tinged yellow
and io i had no east p. n. c. and i was a big epic tina afoot
o'mara i. q. from kodiak and there was so they become kosher
shopko misfit and i was a david to compose his team's end and
at haas tied to districts in the indian head of i. a. moved to beijing
Research & Development
Extracting topics
Extract keywords from noisy
transcripts
Match to Linked Data topics from
DBpedia
Disambiguate using distance within
the “semantic” space
Research & Development
Processing in the cloud
26,280 hours of audio processed
36,729 compute hours on “small” cloud machines
Processed whole archive in 2 weeks at a cost of ~$3,000
Built an API for managing the process
Research & Development
Machines + People
Archive Machines People
Archive
+
Metadata
Experiences
Web TV
+
Radio Mobile
IMPROVES
PROVIDESPEOPLE
Research & Development
http://worldservice.prototyping.bbc.co.uk
Research & Development
http://worldservice.prototyping.bbc.co.uk
Research & Development
comma – Cloud marketplace for media analysis
TSB competition for “Innovating in the Cloud”
BBC R&D, Somethin’Else and Kite
May 2013 - May 2015
Research & Development
Media analysis
Topic generation from text
Summarising text
Sentiment analysis
Speaker identification and diarisation
Music identification
Mood classification of audio and video
Face recognition
Segmentation of audio and video
Object and place recognition
Scene detection in video
Subtitle creation
Research & Development
Problems with media analysis
Computationally intensive
Hard to integrate with other systems
Hard to evaluate and compare
Hard to know what's possible and what’s available
Research & Development
Making media analysis easy
Algorithm providers upload algorithms
Media owners upload content and choose what they want
to analyse
The platform manages:
Computation and scaling
Storing the data
Monitoring
Billing
Research & Development
The comma marketplace
Algorithm developers; e.g. research departments at
universities and SMEs
Media owners; e.g. broadcasters, museums, archives, even
individuals
Research & Development
Analysing media in the cloud
Tristan Ferne, BBC R&D
tristan.ferne@bbc.co.uk
@tristanf
http://www.bbc.co.uk/rd
http://worldservice.prototyping.bbc.co.uk

Mais conteúdo relacionado

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Destaque

Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Destaque (20)

Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 

Analysing media in the cloud

  • 1. Research & Development Analysing media in the cloud An experiment and a marketplace Tristan Ferne Executive Producer BBC Research & Development
  • 2. Research & Development A experiment in using the cloud to process a radio archive A prototype for the World Service archive A marketplace for analysing media in the cloud
  • 3. Research & Development ABC-IP Automatic Broadcast Content Interlinking Project Unlocking media archives by making better use of metadata TSB competition for “Metadata: increasing the value of digital content” BBC R&D and Metabroadcast May 2011 - May 2013
  • 4. Research & Development The BBC World Service archive A 3-year digitisation project 50,000 radio programmes from the past 45 years 3 years of continuous audio 500TB of high quality audio
  • 5. Research & Development The missing metadata Missing fields Incorrect data Spelling mistakes
  • 7. Research & Development Noisy transcripts to be raised in a crisp and easy gait collar tradition and mystique and net bottle westphal mia ballroom with a fifth will one of your very well that p. c. set a caustic wet plate is sprint says it twice to purposes again who's addicted across stick is a podium which stopped at a slow start to the masses of setting up a world and on top was a big nineteen ninety three after a renewed spirit of the big dig ,comma off trillo .period when you are unable to compose and see what it's stole to working for a while at the guys when i started the eighth that we teach eighteen hamper and a timeless dave they'd each code for my list tinged yellow and io i had no east p. n. c. and i was a big epic tina afoot o'mara i. q. from kodiak and there was so they become kosher shopko misfit and i was a david to compose his team's end and at haas tied to districts in the indian head of i. a. moved to beijing
  • 8. Research & Development Extracting topics Extract keywords from noisy transcripts Match to Linked Data topics from DBpedia Disambiguate using distance within the “semantic” space
  • 9. Research & Development Processing in the cloud 26,280 hours of audio processed 36,729 compute hours on “small” cloud machines Processed whole archive in 2 weeks at a cost of ~$3,000 Built an API for managing the process
  • 10. Research & Development Machines + People Archive Machines People Archive + Metadata Experiences Web TV + Radio Mobile IMPROVES PROVIDESPEOPLE
  • 13. Research & Development comma – Cloud marketplace for media analysis TSB competition for “Innovating in the Cloud” BBC R&D, Somethin’Else and Kite May 2013 - May 2015
  • 14. Research & Development Media analysis Topic generation from text Summarising text Sentiment analysis Speaker identification and diarisation Music identification Mood classification of audio and video Face recognition Segmentation of audio and video Object and place recognition Scene detection in video Subtitle creation
  • 15. Research & Development Problems with media analysis Computationally intensive Hard to integrate with other systems Hard to evaluate and compare Hard to know what's possible and what’s available
  • 16. Research & Development Making media analysis easy Algorithm providers upload algorithms Media owners upload content and choose what they want to analyse The platform manages: Computation and scaling Storing the data Monitoring Billing
  • 17. Research & Development The comma marketplace Algorithm developers; e.g. research departments at universities and SMEs Media owners; e.g. broadcasters, museums, archives, even individuals
  • 18. Research & Development Analysing media in the cloud Tristan Ferne, BBC R&D tristan.ferne@bbc.co.uk @tristanf http://www.bbc.co.uk/rd http://worldservice.prototyping.bbc.co.uk