A talk about two BBC R&D projects that use cloud computing to process media. The first part is a case study showing how we used cloud computing to efficiently process a very large archive of media and generate metadata. The second part is about how this led us to think about abstracting a service out of it, leading to a general-purpose cloud service for analysing media.
Full talk notes at http://www.cookinrelaxin.com/2013/06/analysing-media-in-cloud.html
Analysing media in the cloud
1. Research & Development
Analysing media in the cloud
An experiment and a marketplace
Tristan Ferne
Executive Producer
BBC Research & Development
2. Research & Development
An experiment in using the cloud to process a radio archive
A prototype for the World Service archive
A marketplace for analysing media in the cloud
3. Research & Development
ABC-IP
Automatic Broadcast Content Interlinking Project
Unlocking media archives by making better use of metadata
TSB competition for “Metadata: increasing the value of digital content”
BBC R&D and Metabroadcast
May 2011 - May 2013
4. Research & Development
The BBC World Service archive
A 3-year digitisation project
50,000 radio programmes from the past 45 years
3 years of continuous audio
500TB of high quality audio
7. Research & Development
Noisy transcripts
to be raised in a crisp and easy gait collar tradition and mystique
and net bottle westphal mia ballroom with a fifth will one of your
very well that p. c. set a caustic wet plate is sprint says it twice to
purposes again who's addicted across stick is a podium which
stopped at a slow start to the masses of setting up a world and
on top was a big nineteen ninety three after a renewed spirit of
the big dig ,comma off trillo .period when you are unable to
compose and see what it's stole to working for a while at the
guys when i started the eighth that we teach eighteen hamper
and a timeless dave they'd each code for my list tinged yellow
and io i had no east p. n. c. and i was a big epic tina afoot
o'mara i. q. from kodiak and there was so they become kosher
shopko misfit and i was a david to compose his team's end and
at haas tied to districts in the indian head of i. a. moved to beijing
8. Research & Development
Extracting topics
Extract keywords from noisy transcripts
Match to Linked Data topics from DBpedia
Disambiguate using distance within the “semantic” space
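The disambiguation step can be illustrated with a minimal sketch. The slides do not say how keywords were extracted or how the semantic space was built, so everything here is an assumption for illustration: the candidate topics, the 2-D coordinates and the function names are hypothetical. The sketch picks, for each keyword, the candidate topic that keeps the chosen topics closest together overall.

```python
from itertools import combinations, product
from math import dist

# Hypothetical candidate DBpedia topics for each extracted keyword.
CANDIDATES = {
    "paris": ["Paris", "Paris_Hilton"],
    "france": ["France"],
    "seine": ["Seine"],
}

# Hypothetical 2-D coordinates standing in for positions in a semantic space.
VECTORS = {
    "Paris": (0.9, 0.1),
    "Paris_Hilton": (0.1, 0.9),
    "France": (1.0, 0.0),
    "Seine": (0.8, 0.2),
}


def total_distance(topics):
    """Sum of pairwise Euclidean distances between the chosen topics."""
    return sum(dist(VECTORS[a], VECTORS[b]) for a, b in combinations(topics, 2))


def disambiguate(keywords):
    """Choose one candidate per keyword so the chosen set is as compact as possible."""
    options = [CANDIDATES[k] for k in keywords]
    best = min(product(*options), key=total_distance)
    return dict(zip(keywords, best))


result = disambiguate(["paris", "france", "seine"])
print(result)
```

With this toy data the ambiguous keyword "paris" resolves to the city, because that candidate sits near the other topics. The exhaustive search grows exponentially with the number of ambiguous keywords, so it is only suitable as an illustration.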
9. Research & Development
Processing in the cloud
26,280 hours of audio processed
36,729 compute hours on “small” cloud machines
Processed whole archive in 2 weeks at a cost of ~$3,000
Built an API for managing the process
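The figures on this slide imply some useful ratios, worked through below. The inputs are the numbers quoted on the slides; the average machine count is an assumption, since it supposes the work was spread evenly over exactly 14 days, which the slides do not state.

```python
# Figures quoted on the slides.
audio_hours = 26_280        # hours of audio processed
compute_hours = 36_729      # hours on "small" cloud machines
total_cost_usd = 3_000      # approximate total cost
wall_clock_hours = 14 * 24  # two weeks of elapsed time

# 3 years of continuous audio, at 365 days per year, matches the hours processed.
assert audio_hours == 3 * 365 * 24

compute_per_audio_hour = compute_hours / audio_hours
cost_per_compute_hour = total_cost_usd / compute_hours
cost_per_audio_hour = total_cost_usd / audio_hours

# Assumption: load spread evenly across the full two weeks.
average_machines = compute_hours / wall_clock_hours

print(f"Compute hours per hour of audio: {compute_per_audio_hour:.2f}")
print(f"Cost per compute hour: ${cost_per_compute_hour:.3f}")
print(f"Cost per hour of audio: ${cost_per_audio_hour:.3f}")
print(f"Average machines running: {average_machines:.1f}")
```

Under these assumptions each hour of audio took about 1.4 compute hours and cost roughly 11 cents to process, with around 109 machines running on average.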
10. Research & Development
Machines + People
[Diagram. Labels: Archive, Machines, People; Archive + Metadata; Experiences: Web, TV, Radio, Mobile; arrows labelled "provides" and "improves"]
13. Research & Development
comma – Cloud marketplace for media analysis
TSB competition for “Innovating in the Cloud”
BBC R&D, Somethin’Else and Kite
May 2013 - May 2015
14. Research & Development
Media analysis
Topic generation from text
Summarising text
Sentiment analysis
Speaker identification and diarisation
Music identification
Mood classification of audio and video
Face recognition
Segmentation of audio and video
Object and place recognition
Scene detection in video
Subtitle creation
15. Research & Development
Problems with media analysis
Computationally intensive
Hard to integrate with other systems
Hard to evaluate and compare
Hard to know what's possible and what’s available
16. Research & Development
Making media analysis easy
Algorithm providers upload algorithms
Media owners upload content and choose what they want to analyse
The platform manages:
Computation and scaling
Storing the data
Monitoring
Billing
17. Research & Development
The comma marketplace
Algorithm developers, e.g. research departments at universities and SMEs
Media owners, e.g. broadcasters, museums, archives, even individuals
18. Research & Development
Analysing media in the cloud
Tristan Ferne, BBC R&D
tristan.ferne@bbc.co.uk
@tristanf
http://www.bbc.co.uk/rd
http://worldservice.prototyping.bbc.co.uk