Hulu is an online video service that offers a selection of hit
TV shows, clips, movies and more on the free, ad-supported
Hulu.com service, and the subscription service Hulu Plus. One
of the top video streaming sites in the U.S., today the service
has over four million subscribers and approximately 30 million
unique viewers per month.
CHALLENGE
Inability to Scale MySQL and Memcached
In 2012, Hulu’s subscriber base passed the two million mark,
and the back-end systems that tracked viewer history started
to break down. When a video is played, the system records
information from the player to keep track of both the video and
the viewing position or timeframe. When the video application
is closed, the stored information allows the user to resume
the video where they left off. The system also provides
recommendations for what videos to watch next based on
user history.
Originally designed as a Python application, Hulu’s viewing-history
tracking system relied on Memcached for reads on top of a
sharded MySQL database for writes. When the Hulu engineering
team started to see that MySQL couldn’t handle the volume
of writes, the only way to scale was to add more shards. Reads
were done in Memcached to preserve I/O on the database, but
Memcached could not be replicated. So, user history was served
out of one shard in one datacenter.
With the occurrence of peak time failures and an understanding
of root causes, the core Hulu engineering team began to design
a solution with four overarching requirements:
1.	 Faster reads and writes
2.	 The capacity to scale to 10,000 queries/second with low
latency
3.	 Replication of data across datacenters
4.	 High availability of cached data with no single point of failure
AT-A-GLANCE
Challenges
•	 MySQL overwhelmed by the volume of writes
•	 Memcached could not be replicated across
datacenters to distribute load
•	 Latency on queries with degrading performance
•	 No high-availability strategy
Solution
•	 Redis
Key Benefits
•	 Accelerated writing and retrieval of information
with 800% performance improvement for queries
•	 Replication across datacenters
•	 Capacity to handle at least 10,000 queries per second
with low latency
•	 Open management APIs allow for high availability
•	 Ability to use data structures for flexible and efficient
queries
CASE STUDY
Hulu
LEADING VIDEO COMPANY SCALES TO SERVE 4 BILLION VIDEOS
WITH 800% QUERY PERFORMANCE IMPROVEMENT
OVERVIEW
“We chose Redis because it was simple to
set up, had great documentation, offered
replication, and allowed us to use data structures.
Data structures are extremely powerful and
allow us to architect solutions to many use
cases very efficiently.”
—	Andres Rangel, Senior Software Engineer, Hulu
pivotal.io
Pivotal is a registered trademark or trademark of Pivotal Software, Inc. in the United States and other countries. All other trademarks used herein are the property of their respective owners. © Copyright 2014 Pivotal Software,
Inc. All rights reserved. Published in the USA. PVTL-CS-343-12/13
At Pivotal our mission is to enable customers to build a new class of applications, leveraging big and fast data, and do all of this with the power of cloud independence.
Uniting selected technology, people and programs from EMC and VMware, the following products and services are now part of Pivotal: Greenplum, Cloud Foundry, Spring,
GemFire and other products from the VMware vFabric Suite, Cetas and Pivotal Labs.
Pivotal 3495 Deer Creek Road Palo Alto, CA 94304 pivotal.io
SOLUTION
The Path to 3 Million Subscribers
After looking at a variety of NoSQL alternatives like MongoDB,
Riak, and LevelDB, Hulu selected Redis. Describing the process,
Andres Rangel, Senior Software Engineer, stated, “We chose
Redis for several key reasons – it was simple to set up, had great
documentation, offered replication, and allowed us to use data
structures. Data structures are extremely powerful and allow
us to architect solutions to many use cases very efficiently. For
example, depending on the operation, we have the need to query
either a specific video a user watched, or all of them. With Redis,
this was easy using hashes.”
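The hash-based access pattern Rangel describes can be sketched as follows. This is a minimal illustration, not Hulu's code: it assumes one hash per user, keyed by video ID, with the resume position as the value. The `FakeRedis` class, the `history:<user_id>` key layout, and the helper names are all hypothetical; the stand-in exists only so the sketch runs without a server. Its three methods mirror the real Redis commands HSET, HGET, and HGETALL.

```python
from collections import defaultdict


class FakeRedis:
    """In-memory stand-in that mimics three Redis hash commands."""

    def __init__(self):
        self._hashes = defaultdict(dict)

    def hset(self, key, field, value):
        # Like HSET: set one field of the hash stored at key.
        self._hashes[key][field] = value

    def hget(self, key, field):
        # Like HGET: return one field, or None if it is missing.
        return self._hashes.get(key, {}).get(field)

    def hgetall(self, key):
        # Like HGETALL: return every field/value pair of the hash.
        return dict(self._hashes.get(key, {}))


def history_key(user_id):
    # Hypothetical key layout: one hash per user.
    return f"history:{user_id}"


def record_position(store, user_id, video_id, position_seconds):
    # Note: a real Redis server stores and returns values as strings;
    # this stand-in keeps whatever Python value it is given.
    store.hset(history_key(user_id), video_id, position_seconds)


def position_for(store, user_id, video_id):
    # Query a specific video the user watched.
    return store.hget(history_key(user_id), video_id)


def full_history(store, user_id):
    # Query all videos the user watched.
    return store.hgetall(history_key(user_id))
```

With this layout, looking up a single video and listing the whole history are each one command against the same key, which is the flexibility the quote points to.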
To meet all requirements, there were some minor areas that
needed additional development. First, the Hulu team took a look
at how the data was sharded. They were able to easily shard on
user_id. “We scale Redis by sharding the data, and the intelligence
about shards is in the application logic,” noted Rangel. Second,
Redis didn’t have the Sentinel implementation of monitoring and
automatic failover at that time. Because Redis has an open API, the
Hulu team was able to create their own Sentinel mechanism to
support high availability.
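Application-side sharding on user_id might look like the sketch below. It is an illustration under stated assumptions, not Hulu's implementation: the CRC32 hash, the modulo mapping, and the host naming scheme are all hypothetical. Only the shard key (user_id) and the count of 64 instances come from the case study.

```python
import zlib

NUM_SHARDS = 64  # the case study mentions pre-sharding into 64 instances


def shard_for(user_id, num_shards=NUM_SHARDS):
    """Map a user ID to a shard index in a stable, process-independent way."""
    # CRC32 is an assumption; any stable hash would do. Python's built-in
    # hash() is avoided because it is randomized per process for strings.
    digest = zlib.crc32(str(user_id).encode("utf-8"))
    return digest % num_shards


def shard_address(user_id):
    # Hypothetical naming scheme for the Redis instances.
    return f"redis-shard-{shard_for(user_id):02d}.example.internal"
```

Because the mapping depends only on the user ID, every application server resolves a given user to the same shard without consulting a central directory.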
BUSINESS BENEFITS
Redis provided the following benefits to Hulu:
Open-Ended Scaling for Reads and Writes
“Since Redis supports replication, it became possible to
reorganize the data map so writes and reads could be easily
separated, load-balanced and scaled across datacenters,” said
Rangel. Reads are routinely balanced across Redis shards. Each
shard is replicated to a set of slaves in each datacenter. A user
only exists on a single shard, which ensures that newly-added
users distribute evenly across the shards. The architecture is
highly repeatable and provides Hulu with a linear scalability path.
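The separation of writes and reads described above can be sketched as a small routing helper. This is a hedged illustration only: the `Shard` type, the endpoint strings, the round-robin choice, and the fallback to the master are assumptions rather than details from the case study.

```python
from dataclasses import dataclass, field


@dataclass
class Shard:
    master: str                                   # endpoint that accepts writes
    replicas: dict = field(default_factory=dict)  # datacenter -> list of replica endpoints


def write_endpoint(shard):
    # All writes go to the shard's master.
    return shard.master


def read_endpoint(shard, datacenter, request_number=0):
    """Pick a replica in the caller's datacenter, rotating between replicas."""
    local = shard.replicas.get(datacenter)
    if not local:
        # No local replica known: fall back to the master (an assumption).
        return shard.master
    return local[request_number % len(local)]
```

Adding read capacity in this model means adding replica endpoints to a datacenter's list, which leaves the write path untouched.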
800% Performance Improvement for Queries
With queries running on dedicated, load-balanced slaves in
regional datacenters – instead of all out of the west coast –
speed and performance improvements were expected for
this new architecture. According to Rangel, “For performance
considerations, we decided to pre-shard the system into 64
instances. We replicate the master shard to a slave in the same
datacenter and to a slave in the second datacenter. This way,
applications in the other datacenter read locally from the Redis
slave and achieve greater performance. The result was that 75% of
the latency in reads from the east coast was reduced from 120 ms
to less than 15 ms, and 90% went from 300ms to around 25ms.”
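The quoted latencies can be turned into speedup factors with simple arithmetic. The snippet below is only a reading aid: the case study does not say how the 800% headline figure was computed, and treating "less than 15 ms" and "around 25ms" as exact values is an assumption.

```python
def speedup(before_ms, after_ms):
    """Return how many times lower the new latency is than the old one."""
    return before_ms / after_ms


# Figures quoted above for reads from the east coast.
print(speedup(120, 15))  # the "75%" figure: 120 ms down to under 15 ms
print(speedup(300, 25))  # the "90%" figure: 300 ms down to around 25 ms
```

Under those assumptions the two ratios come out to 8 and 12. The first is at least consistent with an 800% improvement, although the source does not confirm that this is where the headline number comes from.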
Greater Performance with Data Durability
To build data durability into their system, Hulu decided to use
Apache Cassandra as the persistent data store where all writes
are made. As data is ingested, it is written from Cassandra to
Redis. As Rangel describes the solution, “The first time a request
comes for a user, the system will create a job to load all the
videos for this user into Redis. Once this is done, the system will
update a flag. The next time a request comes in for this user, the
flag is set. Then, the system returns whatever it has from Redis
without hitting Cassandra. This way, access to the database is
greatly reduced, and we aren’t required to have every record in
Redis. When Redis queries are faster than Cassandra by a huge
margin, we achieve the low latency reads for active users by
having their data in Redis. This means we can leave Cassandra for
batch reports where the latency is not important.”
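The load-on-first-request flow Rangel describes can be sketched as follows. This is a simplified illustration, not Hulu's code: plain dictionaries stand in for Cassandra and Redis, the load runs inline instead of as a separate job, and the `HistoryService` class and its method names are hypothetical. The write path shown here (durable store first, then the cache if the user is already loaded) is also an assumption.

```python
class HistoryService:
    """Read path that fills a cache from a durable store on first access."""

    def __init__(self, durable_store):
        # Stand-in for Cassandra: user_id -> {video_id: position}.
        self.durable_store = durable_store
        self.cache = {}         # stand-in for Redis
        self.loaded = set()     # stand-in for the per-user "loaded" flag
        self.durable_reads = 0  # counts trips to the durable store

    def _load_user(self, user_id):
        # In the case study this is a job; here it runs inline for simplicity.
        self.durable_reads += 1
        self.cache[user_id] = dict(self.durable_store.get(user_id, {}))
        self.loaded.add(user_id)  # set the flag once loading is done

    def get_history(self, user_id):
        if user_id not in self.loaded:
            self._load_user(user_id)
        # Flag is set: answer from the cache without touching the durable store.
        return self.cache[user_id]

    def record(self, user_id, video_id, position):
        # Writes go to the durable store; the cache is updated only for
        # users that have already been loaded.
        self.durable_store.setdefault(user_id, {})[video_id] = position
        if user_id in self.loaded:
            self.cache[user_id][video_id] = position
```

Only the first request for a user reaches the durable store; later requests are served from the cache, so inactive users never occupy cache memory.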
CONCLUSION
As Hulu pursues a superior experience for users, content owners,
and advertisers in the future, they are confident in the long-term
scalability of their back-end systems. The features in Redis will
continue to provide a high-performance data tier as Hulu’s user
base grows.
LEARN MORE
To learn more about our products, services and solutions, visit us
at pivotal.io.
