3. The Project
(c) Dataiku 2013 - Confidential
Hal Alowne
BI Manager
Dim’s Private Showroom
Dim Sum
CEO & Founder
Dim’s Private Showroom
Medium-sized e-commerce
• $100M revenue
• 1 Data Analyst
Big Guys
$10B + revenue
100+ Data Scientists
Hey Hal! We need a big data
platform, like the big guys!
Let’s just do as they do!
4. Hal Wish #1
Global Customer Value Funnel
SEO
NewsLetter
Display
Retargeting
Display
AdWords Marketplace
Direct Sales
Delivery
View Basket
Support
Returns
$
$
$ $
Orders
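A minimal sketch of what a "customer value funnel" computation might look like: order revenue attributed to an acquisition channel, minus the downstream costs named on the slide (delivery, support, returns). The record layout, field names, and figures are all hypothetical, and single-channel attribution is an assumption made for illustration only.

```python
from collections import defaultdict

# Hypothetical order records: one acquisition channel per order (an assumption),
# plus the downstream costs shown on the funnel slide.
orders = [
    {"channel": "SEO", "revenue": 120.0, "delivery": 6.0, "support": 0.0, "returns": 0.0},
    {"channel": "AdWords", "revenue": 80.0, "delivery": 6.0, "support": 4.0, "returns": 80.0},
    {"channel": "NewsLetter", "revenue": 200.0, "delivery": 8.0, "support": 0.0, "returns": 0.0},
    {"channel": "SEO", "revenue": 60.0, "delivery": 6.0, "support": 2.0, "returns": 0.0},
]


def net_value_by_channel(orders):
    """Sum revenue minus delivery, support, and return costs per channel."""
    totals = defaultdict(float)
    for o in orders:
        totals[o["channel"]] += o["revenue"] - o["delivery"] - o["support"] - o["returns"]
    return dict(totals)


channel_value = net_value_by_channel(orders)
for channel, value in sorted(channel_value.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{channel}: {value:.2f}")
```

With this toy data a fully returned AdWords order ends up with negative net value, which is the kind of signal a revenue-only view would hide.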
5. Hal Wish #2
Why do people drop their basket?
9/30/13 5
Basket
Payment refused
Credit Refused
Cheaper elsewhere ?
Delivery costs ?
Waiting for Xmas?
ACTION
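One way to turn the question into an action list is to tag each abandoned basket with a suspected reason and look at the shares. The sketch below assumes such tags already exist; the event layout and the reason labels are hypothetical, loosely mirroring the slide.

```python
from collections import Counter

# Hypothetical abandoned-basket events, each already tagged with a suspected reason.
events = [
    {"basket_id": 1, "reason": "payment_refused"},
    {"basket_id": 2, "reason": "delivery_costs"},
    {"basket_id": 3, "reason": "payment_refused"},
    {"basket_id": 4, "reason": "cheaper_elsewhere"},
]


def drop_reason_shares(events):
    """Return each reason's share of abandoned baskets, most frequent first."""
    counts = Counter(e["reason"] for e in events)
    total = sum(counts.values())
    return {reason: n / total for reason, n in counts.most_common()}


shares = drop_reason_shares(events)
for reason, share in shares.items():
    print(f"{reason}: {share:.0%}")
```

The hard part in practice is the tagging itself (for example, inferring "cheaper elsewhere" from tracking data), which this sketch does not attempt.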
6. Hal Wish #3
Which product to put on top?
Original
Most Popular on top
Better
Machine Learning Score
(age/discount/margin…)
Advanced
Machine Learning Score
+ Personalization
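To make the "Better" option concrete, here is a small sketch that ranks products by a score built from age, discount, and margin. It is not a trained model: the hand-picked weights and the freshness formula are assumptions standing in for whatever a machine learning model would learn, and the `Product` fields are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Product:
    name: str
    age_days: int    # days since the product was listed
    discount: float  # fraction of list price, 0..1
    margin: float    # fraction of sale price, 0..1


# Hypothetical weights; a real system would learn these from click/order data.
WEIGHTS = {"freshness": 0.3, "discount": 0.3, "margin": 0.4}


def score(p: Product) -> float:
    """Weighted score: newer, more discounted, higher-margin products rank higher."""
    freshness = 1.0 / (1.0 + p.age_days / 30.0)  # 1.0 when new, 0.5 after 30 days
    return (
        WEIGHTS["freshness"] * freshness
        + WEIGHTS["discount"] * p.discount
        + WEIGHTS["margin"] * p.margin
    )


def rank(products):
    """Return products sorted by descending score."""
    return sorted(products, key=score, reverse=True)


catalog = [
    Product("B", age_days=30, discount=0.1, margin=0.2),
    Product("A", age_days=0, discount=0.5, margin=0.5),
    Product("C", age_days=60, discount=0.6, margin=0.6),
]
ranked_names = [p.name for p in rank(catalog)]
print(ranked_names)
```

The "Advanced" option would add per-visitor features to the score, so the same catalog could be ordered differently for each customer.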
13. Dataiku
Open Source Web Tracker
(WT1)
• Apache License
• JavaScript & IO
• Write directly to Google Cloud Storage
• Full Java, Easy To Deploy
Step 1
Get your own data
Silent at night
Autoscale during summer
and winter sales
14. Step 2
Mix All Your Data
4 VMs on GCE
Tracking Data
Internal Data
Partner Data
Data Science Studio
Pig
Hive
HADOOP
auto-sync
to BigQuery
15. Step 3
Mine your Data
Builtin Predictive Models
Advanced Adhoc Models
(R or Python)
Shared Web Based
Data Mining
Platform
16. Typical Project Calendar
• January
◦ Choose Partner / Set up the architecture
• February
◦ Initial Deployment: 4TB
◦ Replace BI
• May
◦ New Applications (SEO, …)
• September
◦ Scale Deployment to 15TB
◦ Integrate all channels
17. Some Successes for the Project
• Enhance Daily Report Availability
◦ Previous architecture
– Between H+17 and H+26 (!)
◦ Hadoop on GCE
– Between H+3 and H+7
• +21% Email Channel Optimization
• SEO plan optimization
• and a dozen BI-style “apps”
18. Thank you!
Follow us on Twitter
@dataiku
Ask any big data question
florian.douetteau@dataiku.com