SlideShare uma empresa Scribd logo
1 de 13
BloomReach and AWS Elastic
MapReduce
Prateek Gupta – Lead Engineer
10/24/2014
The BloomReach
Personalized
Discovery
Platform
http://bloomreach.com/what-we-
do/
About BloomReach’s applications
Organic
Search
Contentunderstanding
What it does
Content optimization,
management and measureme
nt
Benefit
Enhanced discoverability and
customer acquisition in organic searc
h
What it does
Personalized onsite search an
d
navigation across devices
Benefit
Relevant and consistent onsite
experiences for new and known users
What it does
Merchandising tool that understa
nds products and identifies oppo
rtunities
Benefit
Prioritize and optimize
online merchandising
SNAP
Compas
s
BloomReach Organic Search -
Merchant Integration
Merchant domain Bloomreach domain
(Amazon Web
Services)
Cloudfront
domain: brcdn.combr-trk.js
pix.gif Elastic
Compute Cloud
domain: brsrvr.com
REST API
request
domain: brsrvr.com Elastic
Compute Cloud
Javascript
API response
BloomReach Organic Search
Architecture
API response
REST API request
Domain Name
Server (DNS)
AWS Load
balancer
Instance Instance Instance Instance
Alternate Cloud
Provider
Multiple Availability Zones
Domain request
Domain response
Example Workflow -
Personalization
Compute
User
Features
Compute
Recommen
dations
Compute
User Profile
User/ Product Database
Pixel
Logs
(S3)
Extract
Related
Users
Extract
User
Session
Elastic MapReduce (EMR)
Usage
• We serve 150+ customer websites
 100+ million pages processed/ day
 Users we see per day > 400M
 Multiple hadoop steps (clusters)
Usage Metric BloomReach Volume
Clusters per day 1500-2000
Hadoop jobs per day 5000-6000
Instance hours per day 25,000 – 30,000
Elastic MapReduce Usage
Growth
0
100000
200000
300000
400000
500000
600000
700000
800000
Q4
2009
Q1
2010
Q2
2010
Q3
2010
Q4
2010
Q1
2011
Q2
2011
Q3
2011
Q4
2011
Q1
2012
Q2
2012
Q3
2012
Q4
2012
Q1
2013
Q2
2013
Q3
2013
Q4
2013
Q1
2014
Q2
2014
Q3
2014
Spot
Instance
SNAP
Mobile
SNAP
Desktop
Compass
Instance
hours/
month
Organic
Challenges
• Cost containment
 On demand vs spot usage
• Cost tracking
 EMR tags
• Cluster setup delay
 Sharing clusters
• Cluster lifecycle management
 Terminate long-running clusters
Resource Selection
• Dynamic resource (instance type)
selection based on CPU, memory
maxCpuPerUnitPrice = 0
optimalInstanceType = null
For each instance_type in (Availability Zone, Region)
{
cpuPerUnitPrice =
instance.cpuCores/instance.spotPrice
if (maxCpuPerUnitPrice < cpuPerUnitPrice) {
optimalInstanceType = instance_type;
}
}
Workflow Management
• Makefile
• A framework for flow control using python
meta programming
A
C B
D
Valid Flows:
A->B->C->D
A->B->D->C
EMR Best Practices
• Use spot instances for cost optimization
• Use EMR tags for cost tracking
• Share EMR clusters for small jobs
• Keep track of long-running clusters
• Use optimal resource type based on
resource usage (e.g. CPU, memory)
• Workflow management
Thank You!
Prateek Gupta, Lead Engineer
prateek@bloomreach.com
www.bloomreach.com

Mais conteúdo relacionado

Mais de BloomReach

Neiman Marcus Case Study
Neiman Marcus Case StudyNeiman Marcus Case Study
Neiman Marcus Case Study
BloomReach
 

Mais de BloomReach (19)

Learning from Complex Online Behavior with Andy Edmonds - Big Brains
Learning from Complex Online Behavior with Andy Edmonds - Big BrainsLearning from Complex Online Behavior with Andy Edmonds - Big Brains
Learning from Complex Online Behavior with Andy Edmonds - Big Brains
 
Solr Payloads for Ranking Data
Solr Payloads for Ranking Data Solr Payloads for Ranking Data
Solr Payloads for Ranking Data
 
Solr Schema in Multi Tenant Platform
Solr Schema in Multi Tenant PlatformSolr Schema in Multi Tenant Platform
Solr Schema in Multi Tenant Platform
 
Internet Retailer Mobile Webinar
Internet Retailer Mobile WebinarInternet Retailer Mobile Webinar
Internet Retailer Mobile Webinar
 
Making Mobile Exceed Expectations
Making Mobile Exceed ExpectationsMaking Mobile Exceed Expectations
Making Mobile Exceed Expectations
 
Marketing Trends: Search, Social, Shopping & Mobile
Marketing Trends: Search, Social, Shopping & MobileMarketing Trends: Search, Social, Shopping & Mobile
Marketing Trends: Search, Social, Shopping & Mobile
 
The Secret to Great Search Campaigns
The Secret to Great Search CampaignsThe Secret to Great Search Campaigns
The Secret to Great Search Campaigns
 
Maximize ROI by Unlocking the Full Value of Undiscovered Content
Maximize ROI by Unlocking the Full Value of Undiscovered ContentMaximize ROI by Unlocking the Full Value of Undiscovered Content
Maximize ROI by Unlocking the Full Value of Undiscovered Content
 
Creating Great Landing Pages
Creating Great Landing PagesCreating Great Landing Pages
Creating Great Landing Pages
 
Big Data Marketing
Big Data MarketingBig Data Marketing
Big Data Marketing
 
Guess Case Study
Guess Case StudyGuess Case Study
Guess Case Study
 
Neiman Marcus Case Study
Neiman Marcus Case StudyNeiman Marcus Case Study
Neiman Marcus Case Study
 
Anatomy of a Big Data Application (BDA)
Anatomy of a Big Data Application (BDA)Anatomy of a Big Data Application (BDA)
Anatomy of a Big Data Application (BDA)
 
Big Data Marketing - 5th Click Conference
Big Data Marketing - 5th Click ConferenceBig Data Marketing - 5th Click Conference
Big Data Marketing - 5th Click Conference
 
BloomReach Customer Success Stories
BloomReach Customer Success StoriesBloomReach Customer Success Stories
BloomReach Customer Success Stories
 
How to grab eCommerce by the Long Tail
How to grab eCommerce by the Long TailHow to grab eCommerce by the Long Tail
How to grab eCommerce by the Long Tail
 
Ampush Case Study
Ampush Case StudyAmpush Case Study
Ampush Case Study
 
Wayfair Case Study
Wayfair Case StudyWayfair Case Study
Wayfair Case Study
 
Getting found a zillion times
Getting found a zillion timesGetting found a zillion times
Getting found a zillion times
 

Último

FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
dollysharma2066
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
 

Último (20)

FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 

BloomReach and AWS Elastic MapReduce

Notas do Editor

  1. deep content understanding applied to organic search, site search and content marketing to match users with what they want, everywhere