SlideShare a Scribd company logo
1 of 50
Download to read offline
at
Dorota Kuleszo
Francesco Mucio MicroStrategy Symposium, London, 22 april 2016
What is ?
● Social Network for meeting people near you
● Over 300M users in 190 countries
● 200 employees based in London and Moscow
● Over 100M downloads on Android
● Also on iOS, Windows Phone, Web and Mobile Web
● Very Agile working environment
● We needed a proper BI tool - among 14 candidates
● Our data volumes - user level data
● Environment - Linux, Database, SSO
● Technical users with high expectations
Why ?
● Hard to set up in our environment
● No real dimensional model
● Data/ETL Team had to prepare data for us
● Time to onboard users and earn their trust
vs
What does with
now?
● Fancy Dashboards around the office
● Data Discovery tools
● Analysis delivered by email
● Self Service Reports
● Weekly releases
90+ Users in Finance, Billing, Marketing, Developers, User
Ops, Founders
‘s BI Architecture
‘s BI Architecture
Badoo’s Database
EXASOL is an Massive Parallel Processing (MPP) database.
It is an in memory columnar database.
● 8 Nodes (plus 1 spare) with 5.6 TB of RAM
● ~100 TB of Raw Data - ~30 TB of Data on Disk
● Each node has 8 TB of Disk, in RAID 2
and redundancy factor = 2
and
Query Generation Time: 0:00:00.13
Total Elapsed Time in Query Engine: 0:18:36.68
Sum of Query Execution Time: 0:16:08.46
Sum of Data Fetching and Processing Time: 0:01:03.73
Sum of Data Transfer from Datasource(s) Time: 0:00:57.93
Sum of Analytical Processing Time: 0:00:00.00
Sum of Other Processing Time: 0:01:24.49
Sum of Cube Publish Time 0:19:06.37
Number of Rows Returned: 5759450
Number of Columns Returned: 38
Number of Temp Tables: 0
Total Number of Passes: 15
Number of Datasource Query Passes: 15
Number of Analytical Query Passes: 0
Query Improvements
● Use real tables
● Use parallelization
● Use the Pre/Post Processing statements
and
and
Query Generation Time: 0:00:00.13
Total Elapsed Time in Query Engine*: 0:11:18.82
Sum of Query Execution Time: 0:18:30.86
Sum of Data Fetching and Processing Time: 0:01:04.82
Sum of Data Transfer from Datasource(s) Time: 0:00:59.37
Sum of Analytical Processing Time: 0:00:00.00
Sum of Other Processing Time: 0:01:55.12
* This report has some passes that have been executed in parallel.
Individual time components may not add up to Total Elapsed Time in Query
Engine.
Sum of Cube Publish Time 0:11:28.63
Number of Rows Returned: 5759450
Number of Columns Returned: 38
Number of Temp Tables: 17
Total Number of Passes: 49
Number of Datasource Query Passes: 49
and
Query Generation Time: 0:00:00.09
Total Elapsed Time in Query Engine: 0:02:43.29
Sum of Query Execution Time: 0:00:53.08
Sum of Data Fetching and Processing Time: 0:00:52.09
Sum of Data Transfer from Datasource(s) Time: 0:00:47.83
Sum of Analytical Processing Time: 0:00:00.00
Sum of Other Processing Time: 0:00:58.10
Sum of Template Calculate Time 0:00:00.00
Sum of AE Data Persisting Time 0:00:00.46
Sum of Cube Publish Time 0:02:56.36
Number of Rows Returned: 4702678
Number of Columns Returned: 38
Number of Temp Tables: 12
Enable Your Users
with
Visual Insight
● High Level Dashboards
● Analysis Dashboards
● OLAP Reports
Enable Your Users with Visual Insight
● High Level Dashboards
● Analysis Dashboards
● OLAP Reports
Enable Your Users with Visual Insight
This is still not enough for our users!
Let Your Users
Do the Legwork
with
Transaction Services
Enable Your Users with Transaction Services
● Agile environment
● New analysis have an assessment period
● People just like to play with data
This was just bad
Time consuming
Self esteem problems
We would end up hating our users
Enable Your Users with Transaction Services
“We have a Coefficient that we would like to use in our
calculation, this can be different for Campaign Media Source,
Country, and Platform...”
200+ Media Sources
254 Countries
12 Platforms
200+ x 254 x 12 = 609600!
Enable Your Users with Transaction Services
We had to convince them to have a go with
Transaction Services!
Enable Your Users with Transaction Services
Enable Your Users with Transaction Services
Enable Your Users with Transaction Services
Don’t Reinvent
the Wheel
Just Use MicroStrategy
Don’t reinvent the wheel: use MicroStrategy
Problem: Deliver a csv file to an external location.
Proposed Solution:
❏ Generate the data
❏ Put the data on a local drive
❏ Create a tool to copy it remotely
Don’t reinvent the wheel: Use MicroStrategy
Beat The Commute
Learn to use
Command Manager
Save Time with Command Manager
Few things we do with Command Manager
● Cube Refresh
● Start Schedules
● Manage Our Users
● Configure Database Connections
Save Time with Command Manager
My two cents about Command Manager:
● Get familiar with it
● Try to script repetitive tasks
● Integrate it with other tools
Save Time with Command Manager
MicroStrategy Web Deployment
Made Easy
MicroStrategy Web Deployment Made Easy
What we started with:
● MicroStrategy WAR File
● SDK Customizations
● Deployment scripts
● Settings changes
- Deployed Manually
- Deployed Manually
- Executed Manually
- Undocumented
MicroStrategy Web Deployment Made Easy
1. GIT
2. Maven
3. Jenkins
MicroStrategy Web Deployment Made Easy
that’s all folks… maybe
q & a

More Related Content

What's hot

Data mining concepts
Data mining conceptsData mining concepts
Data mining conceptsBasit Rafiq
 
Online retail a look at data consulting approach
Online retail   a look at data consulting approachOnline retail   a look at data consulting approach
Online retail a look at data consulting approachShesha R
 
BigData Analytics_1.7
BigData Analytics_1.7BigData Analytics_1.7
BigData Analytics_1.7Rohit Mittal
 
Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data FabricUsing Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data FabricCambridge Semantics
 
ML Infra @ Spotify: Lessons Learned - Romain Yon - NYC ML Meetup
ML Infra @ Spotify: Lessons Learned - Romain Yon -  NYC ML MeetupML Infra @ Spotify: Lessons Learned - Romain Yon -  NYC ML Meetup
ML Infra @ Spotify: Lessons Learned - Romain Yon - NYC ML MeetupRomain Yon
 
Stratebi_Emilio_Arias_PCM14
Stratebi_Emilio_Arias_PCM14Stratebi_Emilio_Arias_PCM14
Stratebi_Emilio_Arias_PCM14Stratebi
 
Online SAP BO 4.2 Training
Online SAP BO 4.2 TrainingOnline SAP BO 4.2 Training
Online SAP BO 4.2 Trainingashok training
 
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningRisk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningCambridge Semantics
 
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...Cambridge Semantics
 
USUGM 2014 - Dana Vanderwall (Bristol-Myers Squibb): Instant JChem
USUGM 2014 - Dana Vanderwall (Bristol-Myers Squibb): Instant JChem USUGM 2014 - Dana Vanderwall (Bristol-Myers Squibb): Instant JChem
USUGM 2014 - Dana Vanderwall (Bristol-Myers Squibb): Instant JChem ChemAxon
 
Real-time Big Data at FPT (for TechCamp University)
Real-time Big Data at FPT (for TechCamp University)Real-time Big Data at FPT (for TechCamp University)
Real-time Big Data at FPT (for TechCamp University)Trieu Nguyen
 
My recent resume
My recent resumeMy recent resume
My recent resumeGen Li
 
Modern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail BankingModern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail BankingCambridge Semantics
 
Building a Predictive Model
Building a Predictive ModelBuilding a Predictive Model
Building a Predictive ModelDKALab
 
Introduction to einstein analytics
Introduction to einstein analyticsIntroduction to einstein analytics
Introduction to einstein analyticsSteven Hugo
 

What's hot (20)

Data mining concepts
Data mining conceptsData mining concepts
Data mining concepts
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Online retail a look at data consulting approach
Online retail   a look at data consulting approachOnline retail   a look at data consulting approach
Online retail a look at data consulting approach
 
Data Analytics Life Cycle
Data Analytics Life CycleData Analytics Life Cycle
Data Analytics Life Cycle
 
BigData Analytics_1.7
BigData Analytics_1.7BigData Analytics_1.7
BigData Analytics_1.7
 
Case Study mypetstop
Case Study mypetstopCase Study mypetstop
Case Study mypetstop
 
Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data FabricUsing Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
ML Infra @ Spotify: Lessons Learned - Romain Yon - NYC ML Meetup
ML Infra @ Spotify: Lessons Learned - Romain Yon -  NYC ML MeetupML Infra @ Spotify: Lessons Learned - Romain Yon -  NYC ML Meetup
ML Infra @ Spotify: Lessons Learned - Romain Yon - NYC ML Meetup
 
Stratebi_Emilio_Arias_PCM14
Stratebi_Emilio_Arias_PCM14Stratebi_Emilio_Arias_PCM14
Stratebi_Emilio_Arias_PCM14
 
Online SAP BO 4.2 Training
Online SAP BO 4.2 TrainingOnline SAP BO 4.2 Training
Online SAP BO 4.2 Training
 
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningRisk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
 
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
Knowledge Graph Discussion: Foundational Capability for Data Fabric, Data Int...
 
USUGM 2014 - Dana Vanderwall (Bristol-Myers Squibb): Instant JChem
USUGM 2014 - Dana Vanderwall (Bristol-Myers Squibb): Instant JChem USUGM 2014 - Dana Vanderwall (Bristol-Myers Squibb): Instant JChem
USUGM 2014 - Dana Vanderwall (Bristol-Myers Squibb): Instant JChem
 
Real-time Big Data at FPT (for TechCamp University)
Real-time Big Data at FPT (for TechCamp University)Real-time Big Data at FPT (for TechCamp University)
Real-time Big Data at FPT (for TechCamp University)
 
My recent resume
My recent resumeMy recent resume
My recent resume
 
sheethal_kamath
sheethal_kamathsheethal_kamath
sheethal_kamath
 
Modern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail BankingModern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail Banking
 
Building a Predictive Model
Building a Predictive ModelBuilding a Predictive Model
Building a Predictive Model
 
Introduction to einstein analytics
Introduction to einstein analyticsIntroduction to einstein analytics
Introduction to einstein analytics
 

Viewers also liked

Microstrategy for Data Engineers
Microstrategy for Data EngineersMicrostrategy for Data Engineers
Microstrategy for Data EngineersFrancesco Mucio
 
La guitarra
La guitarraLa guitarra
La guitarradrako816
 
Kinder campus montessori
Kinder campus montessoriKinder campus montessori
Kinder campus montessoriCreativeworx
 
So Your Tenant Died
So Your Tenant DiedSo Your Tenant Died
So Your Tenant DiedBrian Cox
 
El boom de la industria creativa iberoamericana eje temático de ixel moda 2013
El boom de la industria creativa iberoamericana eje temático de ixel moda 2013El boom de la industria creativa iberoamericana eje temático de ixel moda 2013
El boom de la industria creativa iberoamericana eje temático de ixel moda 2013Ixel Moda
 
Etnografía de la Data: Conectando información con oportunidades de innovación...
Etnografía de la Data: Conectando información con oportunidades de innovación...Etnografía de la Data: Conectando información con oportunidades de innovación...
Etnografía de la Data: Conectando información con oportunidades de innovación...Club de Innovación
 
Proyecto de vida
Proyecto de vidaProyecto de vida
Proyecto de vidaKry Manguay
 
Jabón Protex
Jabón ProtexJabón Protex
Jabón ProtexPatito29
 
mission2beach Katalog 2011
mission2beach Katalog 2011mission2beach Katalog 2011
mission2beach Katalog 2011mission2beach
 
Published patent and design registration information january 13th, 2012
Published patent and design registration information   january 13th, 2012Published patent and design registration information   january 13th, 2012
Published patent and design registration information january 13th, 2012InvnTree IP Services Pvt. Ltd.
 
Sisdam S.A.
Sisdam S.A.Sisdam S.A.
Sisdam S.A.Loyal5
 
E portafolio OLGA LUCIA PENAGOS GRUPO 201512_97
E portafolio OLGA LUCIA PENAGOS GRUPO 201512_97E portafolio OLGA LUCIA PENAGOS GRUPO 201512_97
E portafolio OLGA LUCIA PENAGOS GRUPO 201512_97OLGALUCIAPENAGOS5
 
Quimiometria leccion-6-validacion-de-metodos-notas-de-clase
Quimiometria leccion-6-validacion-de-metodos-notas-de-claseQuimiometria leccion-6-validacion-de-metodos-notas-de-clase
Quimiometria leccion-6-validacion-de-metodos-notas-de-claseJanuusz Ruiz
 
secuencia didactica PEGUI el sistema solar desde la enseñanza tic
secuencia didactica PEGUI el sistema solar desde la enseñanza ticsecuencia didactica PEGUI el sistema solar desde la enseñanza tic
secuencia didactica PEGUI el sistema solar desde la enseñanza ticeldacastrosierra
 

Viewers also liked (20)

Microstrategy for Data Engineers
Microstrategy for Data EngineersMicrostrategy for Data Engineers
Microstrategy for Data Engineers
 
Microstrategy Overview
Microstrategy OverviewMicrostrategy Overview
Microstrategy Overview
 
Almeria ciudad de congresos
Almeria ciudad de congresosAlmeria ciudad de congresos
Almeria ciudad de congresos
 
La guitarra
La guitarraLa guitarra
La guitarra
 
Kinder campus montessori
Kinder campus montessoriKinder campus montessori
Kinder campus montessori
 
So Your Tenant Died
So Your Tenant DiedSo Your Tenant Died
So Your Tenant Died
 
Erfolg Ausgabe 07/08
Erfolg Ausgabe 07/08 Erfolg Ausgabe 07/08
Erfolg Ausgabe 07/08
 
URBACO - BOLLARDS
URBACO - BOLLARDS URBACO - BOLLARDS
URBACO - BOLLARDS
 
El boom de la industria creativa iberoamericana eje temático de ixel moda 2013
El boom de la industria creativa iberoamericana eje temático de ixel moda 2013El boom de la industria creativa iberoamericana eje temático de ixel moda 2013
El boom de la industria creativa iberoamericana eje temático de ixel moda 2013
 
Etnografía de la Data: Conectando información con oportunidades de innovación...
Etnografía de la Data: Conectando información con oportunidades de innovación...Etnografía de la Data: Conectando información con oportunidades de innovación...
Etnografía de la Data: Conectando información con oportunidades de innovación...
 
Proyecto de vida
Proyecto de vidaProyecto de vida
Proyecto de vida
 
Tot
TotTot
Tot
 
Jabón Protex
Jabón ProtexJabón Protex
Jabón Protex
 
mission2beach Katalog 2011
mission2beach Katalog 2011mission2beach Katalog 2011
mission2beach Katalog 2011
 
Published patent and design registration information january 13th, 2012
Published patent and design registration information   january 13th, 2012Published patent and design registration information   january 13th, 2012
Published patent and design registration information january 13th, 2012
 
Economía colaborativa
Economía colaborativaEconomía colaborativa
Economía colaborativa
 
Sisdam S.A.
Sisdam S.A.Sisdam S.A.
Sisdam S.A.
 
E portafolio OLGA LUCIA PENAGOS GRUPO 201512_97
E portafolio OLGA LUCIA PENAGOS GRUPO 201512_97E portafolio OLGA LUCIA PENAGOS GRUPO 201512_97
E portafolio OLGA LUCIA PENAGOS GRUPO 201512_97
 
Quimiometria leccion-6-validacion-de-metodos-notas-de-clase
Quimiometria leccion-6-validacion-de-metodos-notas-de-claseQuimiometria leccion-6-validacion-de-metodos-notas-de-clase
Quimiometria leccion-6-validacion-de-metodos-notas-de-clase
 
secuencia didactica PEGUI el sistema solar desde la enseñanza tic
secuencia didactica PEGUI el sistema solar desde la enseñanza ticsecuencia didactica PEGUI el sistema solar desde la enseñanza tic
secuencia didactica PEGUI el sistema solar desde la enseñanza tic
 

Similar to MicroStrategy at Badoo

The Lyft data platform: Now and in the future
The Lyft data platform: Now and in the futureThe Lyft data platform: Now and in the future
The Lyft data platform: Now and in the futuremarkgrover
 
Lyft data Platform - 2019 slides
Lyft data Platform - 2019 slidesLyft data Platform - 2019 slides
Lyft data Platform - 2019 slidesKarthik Murugesan
 
When Data Visualizations and Data Imports Just Don’t Work
When Data Visualizations and Data Imports Just Don’t WorkWhen Data Visualizations and Data Imports Just Don’t Work
When Data Visualizations and Data Imports Just Don’t WorkJim Kaplan CIA CFE
 
SFSCON23 - Martin Rabanser - Real-time aeroplane tracking and the Open Data Hub
SFSCON23 - Martin Rabanser - Real-time aeroplane tracking and the Open Data HubSFSCON23 - Martin Rabanser - Real-time aeroplane tracking and the Open Data Hub
SFSCON23 - Martin Rabanser - Real-time aeroplane tracking and the Open Data HubSouth Tyrol Free Software Conference
 
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014Jaroslav Gergic
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsDenodo
 
Thinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriThinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriDemi Ben-Ari
 
Analytics in Your Enterprise
Analytics in Your EnterpriseAnalytics in Your Enterprise
Analytics in Your EnterpriseWSO2
 
Data Architecture at Vente-Exclusive.com - TOTM Exellys
Data Architecture at Vente-Exclusive.com - TOTM ExellysData Architecture at Vente-Exclusive.com - TOTM Exellys
Data Architecture at Vente-Exclusive.com - TOTM ExellysWout Scheepers
 
Simply Business' Data Platform
Simply Business' Data PlatformSimply Business' Data Platform
Simply Business' Data PlatformDani Solà Lagares
 
Data Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixData Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixC4Media
 
Levelling up your data infrastructure
Levelling up your data infrastructureLevelling up your data infrastructure
Levelling up your data infrastructureSimon Belak
 
Using ClickHouse for Experimentation
Using ClickHouse for ExperimentationUsing ClickHouse for Experimentation
Using ClickHouse for ExperimentationGleb Kanterov
 
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
Webinar: Introducing the MongoDB Connector for BI 2.0 with TableauWebinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
Webinar: Introducing the MongoDB Connector for BI 2.0 with TableauMongoDB
 
Monitoring Big Data Systems - "The Simple Way"
Monitoring Big Data Systems - "The Simple Way"Monitoring Big Data Systems - "The Simple Way"
Monitoring Big Data Systems - "The Simple Way"Demi Ben-Ari
 
CCI2018 - Real-time dashboard whatif analysis
CCI2018 - Real-time dashboard whatif analysisCCI2018 - Real-time dashboard whatif analysis
CCI2018 - Real-time dashboard whatif analysiswalk2talk srl
 
Data_and_Analytics_Industry_IESE_v3.pdf
Data_and_Analytics_Industry_IESE_v3.pdfData_and_Analytics_Industry_IESE_v3.pdf
Data_and_Analytics_Industry_IESE_v3.pdfprevota
 

Similar to MicroStrategy at Badoo (20)

The Lyft data platform: Now and in the future
The Lyft data platform: Now and in the futureThe Lyft data platform: Now and in the future
The Lyft data platform: Now and in the future
 
Lyft data Platform - 2019 slides
Lyft data Platform - 2019 slidesLyft data Platform - 2019 slides
Lyft data Platform - 2019 slides
 
When Data Visualizations and Data Imports Just Don’t Work
When Data Visualizations and Data Imports Just Don’t WorkWhen Data Visualizations and Data Imports Just Don’t Work
When Data Visualizations and Data Imports Just Don’t Work
 
SFSCON23 - Martin Rabanser - Real-time aeroplane tracking and the Open Data Hub
SFSCON23 - Martin Rabanser - Real-time aeroplane tracking and the Open Data HubSFSCON23 - Martin Rabanser - Real-time aeroplane tracking and the Open Data Hub
SFSCON23 - Martin Rabanser - Real-time aeroplane tracking and the Open Data Hub
 
The Evolution of Big Data Pipelines at Intuit
The Evolution of Big Data Pipelines at Intuit The Evolution of Big Data Pipelines at Intuit
The Evolution of Big Data Pipelines at Intuit
 
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
 
Thinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriThinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-Ari
 
Analytics in Your Enterprise
Analytics in Your EnterpriseAnalytics in Your Enterprise
Analytics in Your Enterprise
 
Data Architecture at Vente-Exclusive.com - TOTM Exellys
Data Architecture at Vente-Exclusive.com - TOTM ExellysData Architecture at Vente-Exclusive.com - TOTM Exellys
Data Architecture at Vente-Exclusive.com - TOTM Exellys
 
Simply Business' Data Platform
Simply Business' Data PlatformSimply Business' Data Platform
Simply Business' Data Platform
 
Data Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixData Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFix
 
Levelling up your data infrastructure
Levelling up your data infrastructureLevelling up your data infrastructure
Levelling up your data infrastructure
 
Using ClickHouse for Experimentation
Using ClickHouse for ExperimentationUsing ClickHouse for Experimentation
Using ClickHouse for Experimentation
 
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
Webinar: Introducing the MongoDB Connector for BI 2.0 with TableauWebinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
 
Monitoring Big Data Systems - "The Simple Way"
Monitoring Big Data Systems - "The Simple Way"Monitoring Big Data Systems - "The Simple Way"
Monitoring Big Data Systems - "The Simple Way"
 
Shaik Niyas Ahamed M Resume
Shaik Niyas Ahamed M ResumeShaik Niyas Ahamed M Resume
Shaik Niyas Ahamed M Resume
 
CCI2018 - Real-time dashboard whatif analysis
CCI2018 - Real-time dashboard whatif analysisCCI2018 - Real-time dashboard whatif analysis
CCI2018 - Real-time dashboard whatif analysis
 
Data_and_Analytics_Industry_IESE_v3.pdf
Data_and_Analytics_Industry_IESE_v3.pdfData_and_Analytics_Industry_IESE_v3.pdf
Data_and_Analytics_Industry_IESE_v3.pdf
 
Advanced Analytics in Banking, CITI
Advanced Analytics in Banking, CITIAdvanced Analytics in Banking, CITI
Advanced Analytics in Banking, CITI
 

MicroStrategy at Badoo

  • 1. at Dorota Kuleszo Francesco Mucio MicroStrategy Symposium, London, 22 april 2016
  • 2. What is ? ● Social Network for meeting people near you ● Over 300M users in 190 countries ● 200 employees based in London and Moscow ● Over 100M downloads on Android ● Also on iOS, Windows Phone, Web and Mobile Web ● Very Agile working environment
  • 3. ● We needed a proper BI tool - among 14 candidates ● Our data volumes - user level data ● Environment - Linux, Database, SSO ● Technical users with high expectations Why ?
  • 4. ● Hard to set up in our environment ● No real dimensional model ● Data/ETL Team had to prepare data for us ● Time to onboard users and earn their trust vs
  • 5. What does with now? ● Fancy Dashboards around the office ● Data Discovery tools ● Analysis delivered by email ● Self Service Reports ● Weekly releases 90+ Users in Finance, Billing, Marketing, Developers, User Ops, Founders
  • 8. Badoo’s Database EXASOL is an Massive Parallel Processing (MPP) database. It is an in memory columnar database. ● 8 Nodes (plus 1 spare) with 5.6 TB of RAM ● ~100 TB of Raw Data - ~30 TB of Data on Disk ● Each node has 8 TB of Disk, in RAID 2 and redundancy factor = 2
  • 9. and Query Generation Time: 0:00:00.13 Total Elapsed Time in Query Engine: 0:18:36.68 Sum of Query Execution Time: 0:16:08.46 Sum of Data Fetching and Processing Time: 0:01:03.73 Sum of Data Transfer from Datasource(s) Time: 0:00:57.93 Sum of Analytical Processing Time: 0:00:00.00 Sum of Other Processing Time: 0:01:24.49 Sum of Cube Publish Time 0:19:06.37 Number of Rows Returned: 5759450 Number of Columns Returned: 38 Number of Temp Tables: 0 Total Number of Passes: 15 Number of Datasource Query Passes: 15 Number of Analytical Query Passes: 0
  • 10. Query Improvements ● Use real tables ● Use parallelization ● Use the Pre/Post Processing statements and
  • 11. and Query Generation Time: 0:00:00.13 Total Elapsed Time in Query Engine*: 0:11:18.82 Sum of Query Execution Time: 0:18:30.86 Sum of Data Fetching and Processing Time: 0:01:04.82 Sum of Data Transfer from Datasource(s) Time: 0:00:59.37 Sum of Analytical Processing Time: 0:00:00.00 Sum of Other Processing Time: 0:01:55.12 * This report has some passes that have been executed in parallel. Individual time components may not add up to Total Elapsed Time in Query Engine. Sum of Cube Publish Time 0:11:28.63 Number of Rows Returned: 5759450 Number of Columns Returned: 38 Number of Temp Tables: 17 Total Number of Passes: 49 Number of Datasource Query Passes: 49
  • 12. and Query Generation Time: 0:00:00.09 Total Elapsed Time in Query Engine: 0:02:43.29 Sum of Query Execution Time: 0:00:53.08 Sum of Data Fetching and Processing Time: 0:00:52.09 Sum of Data Transfer from Datasource(s) Time: 0:00:47.83 Sum of Analytical Processing Time: 0:00:00.00 Sum of Other Processing Time: 0:00:58.10 Sum of Template Calculate Time 0:00:00.00 Sum of AE Data Persisting Time 0:00:00.46 Sum of Cube Publish Time 0:02:56.36 Number of Rows Returned: 4702678 Number of Columns Returned: 38 Number of Temp Tables: 12
  • 14. ● High Level Dashboards ● Analysis Dashboards ● OLAP Reports Enable Your Users with Visual Insight
  • 15.
  • 16.
  • 17.
  • 18. ● High Level Dashboards ● Analysis Dashboards ● OLAP Reports Enable Your Users with Visual Insight This is still not enough for our users!
  • 19.
  • 20. Let Your Users Do the Legwork with Transaction Services
  • 21. Enable Your Users with Transaction Services ● Agile environment ● New analysis have an assessment period ● People just like to play with data This was just bad Time consuming Self esteem problems We would end up hating our users
  • 22. Enable Your Users with Transaction Services “We have a Coefficient that we would like to use in our calculation, this can be different for Campaign Media Source, Country, and Platform...” 200+ Media Sources 254 Countries 12 Platforms 200+ x 254 x 12 = 609600!
  • 23. Enable Your Users with Transaction Services We had to convince them to have a go with Transaction Services!
  • 24. Enable Your Users with Transaction Services
  • 25. Enable Your Users with Transaction Services
  • 26. Enable Your Users with Transaction Services
  • 27. Don’t Reinvent the Wheel Just Use MicroStrategy
  • 28. Don’t reinvent the wheel: use MicroStrategy Problem: Deliver a csv file to an external location. Proposed Solution: ❏ Generate the data ❏ Put the data on a local drive ❏ Create a tool to copy it remotely
  • 29. Don’t reinvent the wheel: Use MicroStrategy
  • 30. Beat The Commute Learn to use Command Manager
  • 31. Save Time with Command Manager
  • 32. Few things we do with Command Manager ● Cube Refresh ● Start Schedules ● Manage Our Users ● Configure Database Connections Save Time with Command Manager
  • 33.
  • 34. My two cents about Command Manager: ● Get familiar with it ● Try to script repetitive tasks ● Integrate it with other tools Save Time with Command Manager
  • 36. MicroStrategy Web Deployment Made Easy What we started with: ● MicroStrategy WAR File ● SDK Customizations ● Deployment scripts ● Settings changes - Deployed Manually - Deployed Manually - Executed Manually - Undocumented
  • 38. 1. GIT 2. Maven 3. Jenkins MicroStrategy Web Deployment Made Easy
  • 39.
  • 40.
  • 41.
  • 42.
  • 43.
  • 44.
  • 45.
  • 46.
  • 47.
  • 48.
  • 50. q & a