SlideShare a Scribd company logo
1 of 33
Big Data at WB
Brian Kursar – VP Data Strategy and Architecture
Big Data Day LA 2016 – West Los Angeles College
2
COMPANY OVERVIEW
Warner Bros. Entertainment Inc., a Time Warner
Company, is a fully integrated, broad-based
entertainment company.
Warner Bros. is the global leader in the creation,
production, distribution, licensing and marketing of all
forms of entertainment and their related businesses.
We stand at the forefront of every aspect of the
entertainment industry, from feature films to television,
home entertainment, animation, comic books,
interactive entertainment and games, product and
brand licensing, and broadcasting.
3
THE TIME WARNER FAMILY OF COMPANIES
Warner Bros.
Pictures
Group
• Crossed the
billion-dollar
mark for a 15th
year running
• Over 7000
Feature Films
DC
Entertainment
• Largest
English-
language
publisher of
comics in the
world, with
more than
1,200 titles
each year
Warner Bros.
Consumer
Products
• More than
3,700 active
licensees
Worldwide
Warner Home
Entertainment
• Industry-
leading 21
percent market
share
Warner Bros.
Television
Group
• #1 the last 10
out of 11 years,
and 23 out of
26 years!
• Produced 79
series (69
primetime) for
broadcast, first-
run syndication
and cable in the
2014-15
season
Warner Bros.
Studio Tours
• Harry Potter UK
Leavesden
Tour
• Hollywood
Studio Tour
WARNER BROS.
Sales
Supply Chain
Production
Marketing
Physical
Pre-Production
Ratings
Consumer
Exhibitor
Social
Digital
Linear
Operations
Behavior
Inventory
Spend
Competitors
Content Protection
Over the Top
Theatrical
Television
Home Entertainment
Consumer Products
DC Entertainment
Theme Parks
Studio Tours
DATA
STORAGE DATA DISCOVERY AND
SELF SERVICE BI
PROCESS REAL-TIME VISUALIZATION
OUR STACK
CODE
Connecting the Dots Isn’t Easy
Harry Potter Franchise
• Theatrical Movies
• Home Entertainment (Blu-ray, Digital Downloads)
• Streaming (Netflix, Hulu, etc.)
• Television
• Video Games
• Consumer Products
• Theme Parks
“I want a 360 Franchise View”
Sales
Supply Chain
Production
Marketing
Physical
Pre-Production
Ratings
Consumer
Exhibitor
Social
Digital
Linear
Operations
Behavior
Inventory
Spend
Competitors
Content Protection
Over the Top
Theme Parks
Theatrical
Television
Home Entertainment
Consumer Products
DC Entertainment
Studio Tours
Connecting Social
Questions that need answers
What is the impact of a trailer drop or a social media comment on…
• Theatrical Box Office Sales
• Franchise Back Catalog Sales
• Consumer Products Sales
• Studio Tour Ticket Sales
Connecting Social
1 • FILTER
KEYWORDS
Define keyword
terms
associated with
Titles
CREED
#CREEDMOVIE
PAN
#PANMOVIE
…
2 • CLASSIFY
TITLES
Tag Filters
associated with
Titles
CREED OR
#CREEDMOVIE
=
CREED
3 • INDEX
DATA
Write Data to a
real-time
analysis engine
4 • ANALYZE
RESULTS
Aggregated data is
pushed to Tableau
to visualize
Results
Scott Stapp
Assassin’s Creed
Video Games
Music
Movies
Bread Pudding Recipes
Cooking
Bands
Michael B. Jordan
!= !=
!=
Unique Title
Very
Ambiguous Title
Ambiguous Title
Title Ambiguity Scoring
Video ID
Facebook Page, Amazon Product ID
Apollo
Rocky
Concept Based
Disambiguation
!=
Exclude terms like “band”, “assassin's”
“movie”,
“film”
Unique Social
IDs
“Boxing”
Characters
“Creed”
“Michael B. Jordan”
“Sylvester Stallone”
Top
Talent
Establishing
Context
of Consumer engagement on the Creed
Facebook page does
NOT contain the keyword “Creed”
Content Context
No keyword “Creed” in this post!
If Content contains
“Uv554B7YHk4” –
Classify as “Creed”
Social Pipeline
Social API Apply L1 Title Filter
L1 Base Filter
tag.movie.title “Creed”
{
fb.all.content CONTAINS_ANY “Creedmovie,Creedfilm,Creed”
}
Social Pipeline
Social API Apply L1 Title Filter Pull into Spark RDD
Apply L2 Concept
Title Filter Rules
Pre-Process
Title Ambiguity Score
and Rules
L2 Concept
Filter
tag.movie.title “Creed”
{
(
(fb.all.content CONTAINS_ANY “#Creedmovie,#Creedfilm”
OR
links.url CONTAINS_ANY “Uv554B7YHk4 ,-HPam119fhM”
OR
fb.topics.username == “creedmovie”)
Unique Values
Hash Tags, Video IDs, or
Anything on CreedMovie
FB Page
Title + Disambiguation concepts
OR
fb.all.content CONTAINS “Creed” NEAR_ANY
(Sylvester, Stallone, Michael B. Jordan, Michael Jordan,Tessa Thompson,
#SylvesterStallone, #MichaelBJordan, #TessaThompson,
Rocky, Apollo, Boxing
film, movie, theater, movies, warner bros, wb, warner brothers):10
)
Excluded Terms
AND NOT
fb.all.content CONTAINS_ANY
(assassin, assassins, band, piratebay, torrent, isohunt, megaupload)
)
}
Social Pipeline
Social API Apply L1 Title Filter Pull into Spark RDD
Apply L2 Concept
Title Filter Rules
Write to ElasticSearch
Write to Parquet
Visualizations and
Ad-Hoc Queries
Process in Spark for
Affinity Models
Pre-Process
Title Ambiguity Score
and Rules
Michael B. Jordan
Rocky Sylvester StalloneMovies
Boxing
Apollo Creed Carl Weathers
Creed
WB Data
Sales
Supply Chain
Production
Marketing
Physical
Pre-Production
Ratings
Consumer
Exhibitor
Social
Digital
Linear
Operations
Behavior
Inventory
Spend
Competitors
Content Protection
Over the Top
Theatrical
Home Entertainment
Consumer Products
DC Entertainment
Television
Theme Parks
Anti-Piracy Data Pipeline
Acquire P2P
Attempts
Pull into Spark RDD Apply Matching Logic Write to ElasticSearch Visualize
Demo
Thank you
Brian Kursar – VP Data Strategy and Architecture
WB Data and Analytics
@briankursar
For more information on exciting opportunities in Big Data
happening at Warner Bros. please come by the Warner Bros.
Career Booth here at Big Data Day LA 2016 or check out our
careers site at WarnerBrosCareers.com.

More Related Content

What's hot

Marketing and promoting superhero films
Marketing and promoting superhero filmsMarketing and promoting superhero films
Marketing and promoting superhero filmsHeworthMedia1
 
Media ownership skyfall ill manors
 Media ownership skyfall ill manors Media ownership skyfall ill manors
Media ownership skyfall ill manorssandraoddy2
 
Level 4 response section b - media ownership
Level 4 response   section b - media ownershipLevel 4 response   section b - media ownership
Level 4 response section b - media ownershipCoombeMedia1
 
Impact of Media Ownership on Products Available
Impact of Media Ownership on Products AvailableImpact of Media Ownership on Products Available
Impact of Media Ownership on Products AvailableHeworthMedia1
 
Similarities and differences skyfall ill manors
Similarities and differences  skyfall ill manorsSimilarities and differences  skyfall ill manors
Similarities and differences skyfall ill manorssandraoddy2
 
Film4 revision
Film4 revisionFilm4 revision
Film4 revisionksomel
 
The Inbetweeners
The InbetweenersThe Inbetweeners
The Inbetweenersjwright61
 
Warner Bros. Synergy and Convergence
Warner Bros. Synergy and ConvergenceWarner Bros. Synergy and Convergence
Warner Bros. Synergy and ConvergenceGDBrew
 
Marketing and promotion
Marketing and promotionMarketing and promotion
Marketing and promotionHeworthMedia1
 
What kind of media institution might distribute your media product and why?
What kind of media institution might distribute your media product and why?What kind of media institution might distribute your media product and why?
What kind of media institution might distribute your media product and why?Stephanie
 
Submarine: British Film Case Study
Submarine: British Film Case StudySubmarine: British Film Case Study
Submarine: British Film Case StudyShauna_97
 
Independent cinema research
Independent cinema researchIndependent cinema research
Independent cinema researchpaluh001
 
Media convergence and synergy
Media convergence and synergyMedia convergence and synergy
Media convergence and synergyhasnmedia
 
Small production companies
Small production companiesSmall production companies
Small production companiesToby1232
 

What's hot (20)

Marketing and promoting superhero films
Marketing and promoting superhero filmsMarketing and promoting superhero films
Marketing and promoting superhero films
 
Media ownership skyfall ill manors
 Media ownership skyfall ill manors Media ownership skyfall ill manors
Media ownership skyfall ill manors
 
Level 4 response section b - media ownership
Level 4 response   section b - media ownershipLevel 4 response   section b - media ownership
Level 4 response section b - media ownership
 
Impact of Media Ownership on Products Available
Impact of Media Ownership on Products AvailableImpact of Media Ownership on Products Available
Impact of Media Ownership on Products Available
 
Similarities and differences skyfall ill manors
Similarities and differences  skyfall ill manorsSimilarities and differences  skyfall ill manors
Similarities and differences skyfall ill manors
 
Vertigo films
Vertigo filmsVertigo films
Vertigo films
 
Film4 revision
Film4 revisionFilm4 revision
Film4 revision
 
Section b synergy
Section b   synergySection b   synergy
Section b synergy
 
Media insti
Media instiMedia insti
Media insti
 
Warner Bros
Warner BrosWarner Bros
Warner Bros
 
The Inbetweeners
The InbetweenersThe Inbetweeners
The Inbetweeners
 
Warner Bros. Synergy and Convergence
Warner Bros. Synergy and ConvergenceWarner Bros. Synergy and Convergence
Warner Bros. Synergy and Convergence
 
G322 case study working title Case Stu
G322 case study   working title Case StuG322 case study   working title Case Stu
G322 case study working title Case Stu
 
Film
FilmFilm
Film
 
Marketing and promotion
Marketing and promotionMarketing and promotion
Marketing and promotion
 
What kind of media institution might distribute your media product and why?
What kind of media institution might distribute your media product and why?What kind of media institution might distribute your media product and why?
What kind of media institution might distribute your media product and why?
 
Submarine: British Film Case Study
Submarine: British Film Case StudySubmarine: British Film Case Study
Submarine: British Film Case Study
 
Independent cinema research
Independent cinema researchIndependent cinema research
Independent cinema research
 
Media convergence and synergy
Media convergence and synergyMedia convergence and synergy
Media convergence and synergy
 
Small production companies
Small production companiesSmall production companies
Small production companies
 

Viewers also liked

Big Data Day LA 2016/ Data Science Track - Enabling Cross-Screen Advertising ...
Big Data Day LA 2016/ Data Science Track - Enabling Cross-Screen Advertising ...Big Data Day LA 2016/ Data Science Track - Enabling Cross-Screen Advertising ...
Big Data Day LA 2016/ Data Science Track - Enabling Cross-Screen Advertising ...Data Con LA
 
Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...
Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...
Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...Data Con LA
 
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Deep Learning at Scale - A...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Deep Learning at Scale - A...Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Deep Learning at Scale - A...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Deep Learning at Scale - A...Data Con LA
 
Big Data Day LA 2016/ Use Case Driven track - BI is broken, Dave Fryer, Produ...
Big Data Day LA 2016/ Use Case Driven track - BI is broken, Dave Fryer, Produ...Big Data Day LA 2016/ Use Case Driven track - BI is broken, Dave Fryer, Produ...
Big Data Day LA 2016/ Use Case Driven track - BI is broken, Dave Fryer, Produ...Data Con LA
 
Big Data Day LA 2016/ Data Science Track - Data Science + Hollywood, Todd Ho...
Big Data Day LA 2016/ Data Science Track -  Data Science + Hollywood, Todd Ho...Big Data Day LA 2016/ Data Science Track -  Data Science + Hollywood, Todd Ho...
Big Data Day LA 2016/ Data Science Track - Data Science + Hollywood, Todd Ho...Data Con LA
 
Big Data Day LA 2016/ Big Data Track - Rapid Analytics @ Netflix LA (Updated ...
Big Data Day LA 2016/ Big Data Track - Rapid Analytics @ Netflix LA (Updated ...Big Data Day LA 2016/ Big Data Track - Rapid Analytics @ Netflix LA (Updated ...
Big Data Day LA 2016/ Big Data Track - Rapid Analytics @ Netflix LA (Updated ...Data Con LA
 
Disney consumer products
Disney consumer productsDisney consumer products
Disney consumer productsARAVIND SAMALA
 
Tajolabigdatacamp2014 140618135810-phpapp01 hyunsik-choi
Tajolabigdatacamp2014 140618135810-phpapp01 hyunsik-choiTajolabigdatacamp2014 140618135810-phpapp01 hyunsik-choi
Tajolabigdatacamp2014 140618135810-phpapp01 hyunsik-choiData Con LA
 
Big Data Day LA 2015 - Data mining, forecasting, and BI at the RRCC by Benjam...
Big Data Day LA 2015 - Data mining, forecasting, and BI at the RRCC by Benjam...Big Data Day LA 2015 - Data mining, forecasting, and BI at the RRCC by Benjam...
Big Data Day LA 2015 - Data mining, forecasting, and BI at the RRCC by Benjam...Data Con LA
 
Big Data Day LA 2015 - Tips for Building Self Service Data Science Platform b...
Big Data Day LA 2015 - Tips for Building Self Service Data Science Platform b...Big Data Day LA 2015 - Tips for Building Self Service Data Science Platform b...
Big Data Day LA 2015 - Tips for Building Self Service Data Science Platform b...Data Con LA
 
Big Data Day LA 2015 - What's new and next in Apache Tez by Bikas Saha of Hor...
Big Data Day LA 2015 - What's new and next in Apache Tez by Bikas Saha of Hor...Big Data Day LA 2015 - What's new and next in Apache Tez by Bikas Saha of Hor...
Big Data Day LA 2015 - What's new and next in Apache Tez by Bikas Saha of Hor...Data Con LA
 
Getting started with Spark & Cassandra by Jon Haddad of Datastax
Getting started with Spark & Cassandra by Jon Haddad of DatastaxGetting started with Spark & Cassandra by Jon Haddad of Datastax
Getting started with Spark & Cassandra by Jon Haddad of DatastaxData Con LA
 
Big Data Day LA 2015 - Machine Learning on Largish Data by Szilard Pafka of E...
Big Data Day LA 2015 - Machine Learning on Largish Data by Szilard Pafka of E...Big Data Day LA 2015 - Machine Learning on Largish Data by Szilard Pafka of E...
Big Data Day LA 2015 - Machine Learning on Largish Data by Szilard Pafka of E...Data Con LA
 
Big Data Day LA 2015 - The AWS Big Data Platform by Michael Limcaco of Amazon
Big Data Day LA 2015 - The AWS Big Data Platform by Michael Limcaco of AmazonBig Data Day LA 2015 - The AWS Big Data Platform by Michael Limcaco of Amazon
Big Data Day LA 2015 - The AWS Big Data Platform by Michael Limcaco of AmazonData Con LA
 
A noETL Parallel Streaming Transformation Loader using Spark, Kafka­ & Ver­tica
A noETL Parallel Streaming Transformation Loader using Spark, Kafka­ & Ver­ticaA noETL Parallel Streaming Transformation Loader using Spark, Kafka­ & Ver­tica
A noETL Parallel Streaming Transformation Loader using Spark, Kafka­ & Ver­ticaData Con LA
 
Data science and good questions eric kostello
Data science and good questions eric kostelloData science and good questions eric kostello
Data science and good questions eric kostelloData Con LA
 
Big Data Day LA 2016 Keynote - Jeanne Holm/ City of LA
Big Data Day LA 2016 Keynote - Jeanne Holm/ City of LABig Data Day LA 2016 Keynote - Jeanne Holm/ City of LA
Big Data Day LA 2016 Keynote - Jeanne Holm/ City of LAData Con LA
 

Viewers also liked (20)

Big Data Day LA 2016/ Data Science Track - Enabling Cross-Screen Advertising ...
Big Data Day LA 2016/ Data Science Track - Enabling Cross-Screen Advertising ...Big Data Day LA 2016/ Data Science Track - Enabling Cross-Screen Advertising ...
Big Data Day LA 2016/ Data Science Track - Enabling Cross-Screen Advertising ...
 
Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...
Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...
Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...
 
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Deep Learning at Scale - A...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Deep Learning at Scale - A...Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Deep Learning at Scale - A...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Deep Learning at Scale - A...
 
Big Data Day LA 2016/ Use Case Driven track - BI is broken, Dave Fryer, Produ...
Big Data Day LA 2016/ Use Case Driven track - BI is broken, Dave Fryer, Produ...Big Data Day LA 2016/ Use Case Driven track - BI is broken, Dave Fryer, Produ...
Big Data Day LA 2016/ Use Case Driven track - BI is broken, Dave Fryer, Produ...
 
Warner Bros
Warner BrosWarner Bros
Warner Bros
 
Warner Bros
Warner BrosWarner Bros
Warner Bros
 
Warner bros presentation
Warner bros presentationWarner bros presentation
Warner bros presentation
 
Big Data Day LA 2016/ Data Science Track - Data Science + Hollywood, Todd Ho...
Big Data Day LA 2016/ Data Science Track -  Data Science + Hollywood, Todd Ho...Big Data Day LA 2016/ Data Science Track -  Data Science + Hollywood, Todd Ho...
Big Data Day LA 2016/ Data Science Track - Data Science + Hollywood, Todd Ho...
 
Big Data Day LA 2016/ Big Data Track - Rapid Analytics @ Netflix LA (Updated ...
Big Data Day LA 2016/ Big Data Track - Rapid Analytics @ Netflix LA (Updated ...Big Data Day LA 2016/ Big Data Track - Rapid Analytics @ Netflix LA (Updated ...
Big Data Day LA 2016/ Big Data Track - Rapid Analytics @ Netflix LA (Updated ...
 
Disney consumer products
Disney consumer productsDisney consumer products
Disney consumer products
 
Tajolabigdatacamp2014 140618135810-phpapp01 hyunsik-choi
Tajolabigdatacamp2014 140618135810-phpapp01 hyunsik-choiTajolabigdatacamp2014 140618135810-phpapp01 hyunsik-choi
Tajolabigdatacamp2014 140618135810-phpapp01 hyunsik-choi
 
Big Data Day LA 2015 - Data mining, forecasting, and BI at the RRCC by Benjam...
Big Data Day LA 2015 - Data mining, forecasting, and BI at the RRCC by Benjam...Big Data Day LA 2015 - Data mining, forecasting, and BI at the RRCC by Benjam...
Big Data Day LA 2015 - Data mining, forecasting, and BI at the RRCC by Benjam...
 
Big Data Day LA 2015 - Tips for Building Self Service Data Science Platform b...
Big Data Day LA 2015 - Tips for Building Self Service Data Science Platform b...Big Data Day LA 2015 - Tips for Building Self Service Data Science Platform b...
Big Data Day LA 2015 - Tips for Building Self Service Data Science Platform b...
 
Big Data Day LA 2015 - What's new and next in Apache Tez by Bikas Saha of Hor...
Big Data Day LA 2015 - What's new and next in Apache Tez by Bikas Saha of Hor...Big Data Day LA 2015 - What's new and next in Apache Tez by Bikas Saha of Hor...
Big Data Day LA 2015 - What's new and next in Apache Tez by Bikas Saha of Hor...
 
Getting started with Spark & Cassandra by Jon Haddad of Datastax
Getting started with Spark & Cassandra by Jon Haddad of DatastaxGetting started with Spark & Cassandra by Jon Haddad of Datastax
Getting started with Spark & Cassandra by Jon Haddad of Datastax
 
Big Data Day LA 2015 - Machine Learning on Largish Data by Szilard Pafka of E...
Big Data Day LA 2015 - Machine Learning on Largish Data by Szilard Pafka of E...Big Data Day LA 2015 - Machine Learning on Largish Data by Szilard Pafka of E...
Big Data Day LA 2015 - Machine Learning on Largish Data by Szilard Pafka of E...
 
Big Data Day LA 2015 - The AWS Big Data Platform by Michael Limcaco of Amazon
Big Data Day LA 2015 - The AWS Big Data Platform by Michael Limcaco of AmazonBig Data Day LA 2015 - The AWS Big Data Platform by Michael Limcaco of Amazon
Big Data Day LA 2015 - The AWS Big Data Platform by Michael Limcaco of Amazon
 
A noETL Parallel Streaming Transformation Loader using Spark, Kafka­ & Ver­tica
A noETL Parallel Streaming Transformation Loader using Spark, Kafka­ & Ver­ticaA noETL Parallel Streaming Transformation Loader using Spark, Kafka­ & Ver­tica
A noETL Parallel Streaming Transformation Loader using Spark, Kafka­ & Ver­tica
 
Data science and good questions eric kostello
Data science and good questions eric kostelloData science and good questions eric kostello
Data science and good questions eric kostello
 
Big Data Day LA 2016 Keynote - Jeanne Holm/ City of LA
Big Data Day LA 2016 Keynote - Jeanne Holm/ City of LABig Data Day LA 2016 Keynote - Jeanne Holm/ City of LA
Big Data Day LA 2016 Keynote - Jeanne Holm/ City of LA
 

Similar to Big Data Day LA 2016/ Big Data Track - Warner Bros. Digital Consumer Intelligence at Scale, Brian Kursar, VP Data Strategy & Architecture, Warner Bros

Warner brother institution research
Warner brother institution researchWarner brother institution research
Warner brother institution researchShahzeb Butt
 
Institution research asad mehdi
Institution research asad mehdiInstitution research asad mehdi
Institution research asad mehdiAsad Mehdi
 
Institution research-Warner brothers by shahrain shah
Institution research-Warner brothers by shahrain shahInstitution research-Warner brothers by shahrain shah
Institution research-Warner brothers by shahrain shahSyed Shah
 
Movie Release Deck - Captain America
Movie Release Deck - Captain AmericaMovie Release Deck - Captain America
Movie Release Deck - Captain AmericaAlyssa Pacheco
 
Successful media products... rewritten
Successful media products... rewrittenSuccessful media products... rewritten
Successful media products... rewrittenJosh Pamfilo
 
Amcfinal2bc D
Amcfinal2bc DAmcfinal2bc D
Amcfinal2bc Danimarts
 
What kind of media institution might distribute your media products and why?
What kind of media institution might distribute your media products and why?What kind of media institution might distribute your media products and why?
What kind of media institution might distribute your media products and why?Emily Erskine
 
Institution research a2
Institution research a2Institution research a2
Institution research a2efrahvistro
 
What kind of media institution might distribute your media product and why?
What kind of media institution might distribute your media product and why?What kind of media institution might distribute your media product and why?
What kind of media institution might distribute your media product and why?Emily Erskine
 
Drive your business success with dynamic public relations.
Drive your business success with dynamic public relations.Drive your business success with dynamic public relations.
Drive your business success with dynamic public relations.The Darnell Works Agency
 
Marty Stouffer Show Cove
Marty Stouffer Show CoveMarty Stouffer Show Cove
Marty Stouffer Show Covecmcdonne
 
Power Rangers - A Business Case Study
Power Rangers - A Business Case StudyPower Rangers - A Business Case Study
Power Rangers - A Business Case StudyRavi Teja Chittipotu
 
Evaluation question 3
Evaluation question 3Evaluation question 3
Evaluation question 3jameshmedia
 
Findability & Attracting Audiences
Findability & Attracting AudiencesFindability & Attracting Audiences
Findability & Attracting AudiencesChristy Dena
 
Successful media products
Successful media products Successful media products
Successful media products Josh Pamfilo
 
SplashCast | Pioneers of Social TV
SplashCast | Pioneers of Social TVSplashCast | Pioneers of Social TV
SplashCast | Pioneers of Social TVMike Berkley
 

Similar to Big Data Day LA 2016/ Big Data Track - Warner Bros. Digital Consumer Intelligence at Scale, Brian Kursar, VP Data Strategy & Architecture, Warner Bros (20)

Warner brother institution research
Warner brother institution researchWarner brother institution research
Warner brother institution research
 
Institution research asad mehdi
Institution research asad mehdiInstitution research asad mehdi
Institution research asad mehdi
 
Institution research-Warner brothers by shahrain shah
Institution research-Warner brothers by shahrain shahInstitution research-Warner brothers by shahrain shah
Institution research-Warner brothers by shahrain shah
 
Movie Release Deck - Captain America
Movie Release Deck - Captain AmericaMovie Release Deck - Captain America
Movie Release Deck - Captain America
 
Successful media products... rewritten
Successful media products... rewrittenSuccessful media products... rewritten
Successful media products... rewritten
 
Amcfinal2bc D
Amcfinal2bc DAmcfinal2bc D
Amcfinal2bc D
 
What kind of media institution might distribute your media products and why?
What kind of media institution might distribute your media products and why?What kind of media institution might distribute your media products and why?
What kind of media institution might distribute your media products and why?
 
Institution research a2
Institution research a2Institution research a2
Institution research a2
 
What kind of media institution might distribute your media product and why?
What kind of media institution might distribute your media product and why?What kind of media institution might distribute your media product and why?
What kind of media institution might distribute your media product and why?
 
Lo3
Lo3Lo3
Lo3
 
Drive your business success with dynamic public relations.
Drive your business success with dynamic public relations.Drive your business success with dynamic public relations.
Drive your business success with dynamic public relations.
 
Marty Stouffer Show Cove
Marty Stouffer Show CoveMarty Stouffer Show Cove
Marty Stouffer Show Cove
 
Evaluation Question 3
Evaluation Question 3Evaluation Question 3
Evaluation Question 3
 
Power Rangers - A Business Case Study
Power Rangers - A Business Case StudyPower Rangers - A Business Case Study
Power Rangers - A Business Case Study
 
Evaluation question 3
Evaluation question 3Evaluation question 3
Evaluation question 3
 
Warner brothers
Warner brothersWarner brothers
Warner brothers
 
Findability & Attracting Audiences
Findability & Attracting AudiencesFindability & Attracting Audiences
Findability & Attracting Audiences
 
Successful media products
Successful media products Successful media products
Successful media products
 
SplashCast | Pioneers of Social TV
SplashCast | Pioneers of Social TVSplashCast | Pioneers of Social TV
SplashCast | Pioneers of Social TV
 
Unit 2
Unit 2Unit 2
Unit 2
 

More from Data Con LA

Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA
 
Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA
 
Data Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup ShowcaseData Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup ShowcaseData Con LA
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA
 
Data Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendationsData Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendationsData Con LA
 
Data Con LA 2022 - AI Ethics
Data Con LA 2022 - AI EthicsData Con LA 2022 - AI Ethics
Data Con LA 2022 - AI EthicsData Con LA
 
Data Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learningData Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learningData Con LA
 
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and AtlasData Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and AtlasData Con LA
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA
 
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...Data Con LA
 
Data Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWSData Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWSData Con LA
 
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA
 
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA
 
Data Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA
 
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing EntertainmentData Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing EntertainmentData Con LA
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA
 
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA
 
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA
 
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA
 

More from Data Con LA (20)

Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 Keynotes
 
Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 Keynotes
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 Keynote
 
Data Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup ShowcaseData Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup Showcase
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 Keynote
 
Data Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendationsData Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendations
 
Data Con LA 2022 - AI Ethics
Data Con LA 2022 - AI EthicsData Con LA 2022 - AI Ethics
Data Con LA 2022 - AI Ethics
 
Data Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learningData Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learning
 
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and AtlasData Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentation
 
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
 
Data Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWSData Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWS
 
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
 
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
 
Data Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data Science
 
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing EntertainmentData Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
 
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
 
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
 
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with Kafka
 

Recently uploaded

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Big Data Day LA 2016/ Big Data Track - Warner Bros. Digital Consumer Intelligence at Scale, Brian Kursar, VP Data Strategy & Architecture, Warner Bros

Editor's Notes

  1. Warner Bros. Entertainment Inc., a Time Warner Company, is a fully integrated, broad-based entertainment company. Warner Bros. is the global leader in the creation, production, distribution, licensing and marketing of all forms of entertainment and their related businesses. We stand at the forefront of every aspect of the entertainment industry, from feature films to television, home video/DVD, animation, comic books, interactive entertainment and games, product and brand licensing, and broadcasting.
  2. Warner Bros. falls under the Time Warner Family of Companies.
  3. Today, the WB Data and Analytics Team support six of the largest groups in Warner Bros. Warner Bros Pictures, our Theatrical group. DC Entertainment, Consumer Products, Home Entertainment, Warner Bros Television. as well as our two Studio Tour facilities both in Leavesden and here in Hollywood. And across all of these Business units, we have data.
  4. And lots of it. Everything you can possibly imagine. Web Logs, Ratings, Social, Supply Chain, Point of Sale data from our retailers and exhibitors, sales data from our digital retailers data on Content Piracy, Production, and we even have our own Netflix-like OTT offerings which we are actively working to integrate into our platform. But the real trick lands on our Data Engineering Team that is tasked with the job of connecting this data to ensure that across our lines of Business that we are able to make some sense of it all. This is where the real magic happens.
  5. A year ago we started building our platform. For storage we utilize Amazon S3 and Parquet. We also store a lot of our data in Teradata and Amazon Redshift. Recently we have been introducing a number of new components such as Kafka and Spark as well as Elastic Search. We like Spark for its ability to crunch our massive log files while Elastic Search has proven a great tool for a number of use cases. It acts as our NoSQL layer an enables use to do massive refinements across Billions of Records. It is also great for Social Media Analytics due to its search capability. We use Python and Scala to wrangle our data. Larger jobs we tend to use Scala on Spark as it is much easier to debug in Scala versus Python. For Self Serve and Data Discovery we utilize a few tools. Tableau is used primarily for visualizations, Microstrategy is heavily used with our Home Entertainment group. While Kibana is a tool we have recently brought in for log analysis. Finally, we recently started using D3 and Angular on Node.js to create big screen visualizations.
  6. Even with all of these great tools, the fact is that connecting the dots isn’t easy. It requires a lot of domain expertise and some serious data engineering. For instance, if someone wanted to understand how to ask analytical questions across a Franchise, this is not an easy thing to do. Some Franchises have many different facets across many industries. Traditionally this data was residing in several disparate databases. There are multiple theatrical releases, Home Entertainment releases, Streaming & Television deals, Video Games, Consumer Products, Studio Tours and Theme parks.
  7. And what about Social. Social has a tremendous impact on Sales, but how can connect this so we can quantify questions like this:
  8. What is the impact of a trailer drop or a social media comment on… Theatrical Box Office Sales Franchise Back Catalog Sales Consumer Products Sales Studio Tour Ticket Sales
  9. About a year ago, we began working on connecting Social Media Activity on our Theatrical and Home Entertainment Titles.
  10. We took a stab at creating some filters using the title names and hashtags, wrote some general normalization tags, indexed the data to a search index and then pushed out the aggregate results to a tableau workbook for visualization.
  11. Focusing on Pan and Creed, we created a tag cloud doing some basic noun phrase extraction so we could visualize what topics were being discussed around these two WB Titles. And here were the results.
  12. Bread Pudding… Cooking…. Video Games… Not what exactly what we were expecting. You see
  13. Creed, the movie, is not the same as Creed the Band, or Assassin's Creed the Video Game
  14. Nor is Pan the movie in an way associated with Pan, the Spanish word for Bread. But once we understood the connection, it made sense why we were seeing Bread Pudding as a topic for our Pan. We quickly had to change our query and classification approach. This started with having to focus on how best to
  15. Disambiguate, or essentially remove all noise not related to the content we were trying to track from our queries. But we had another problem.
  16. We started grouping titles into three categories. Unique Titles such as Batman V Superman, Ambiguous Titles such as Creed and Very Ambiguous titles such as the film “Her”
  17. In a pre-process step, we used Wikipedia to help us score the titles. Granted we had a few other tricks up our sleeve, essentially, any title coming up only once has a high probability of being unique whereas a title coming up in a Wikipedia search more than once has a high probability of being ambiguous. Then
  18. We created a concept based disambiguation model. In this example, we started collecting additional terms to further disambiguate the title “Creed”. We start with names of Top talent associated with the title. Stallone, Michael B Jordan, then similarly characters… Rocky, Apollo, words associated with the type of content. Was it a movie, or a film? Did they see it in a theater? Additional related themes like Boxing. Then we added terms that if present with the term “Creed” were highly likely to be interactions that were not relevant. Terms like “band” or the word “assasin’s”. We also included other unique IDs from social content that we could tie back to Creed. We also added logic to pull in contextual data where the title would not be
  19. This post could be talking about any movie in the world as it fails to mention any particular movie.
  20. But if it is found on the Creed Movie Facebook page, then we can attribute the comment to the movie Creed. Quick question. What percentage of the comments on the Creed movie page do not contain the word “Creed’?
  21. 88% of Consumer engagement on the Creed FB page does NOT contain the keyword “Creed”
  22. Similarly, in this post, the YouTube video is associated to the Creed YouTube Trailer. > By extracting the unique Video ID, we can safely assume that every time this ID is featured in any post, we should classify it against the movie Creed.
  23. Our Social pipeline for this is fairly straight forward. We call a Social API, passing a general filter against the API.
  24. In a separate job we have generated the Disambiguation rules. The data is pulled into a Spark RDD and we apply the pre-processed Ambiguity Rules
  25. And this looked quite a bit better than our previous results. So here we were able to
  26. Here we pull these massive files from our P2P data provider and leverage spark to match across the title. Once we have made our matches we then write out to ElasticSearch and stream the data to our near real-time visualization.