SlideShare a Scribd company logo
1 of 15
Download to read offline
Big Data Challenge 
Real example in industry 
Tom Martens 
Bussum, 25th November 2014 
Business 
Analytics
Vorname Nachname 
Content 
The Challenge to define your Big Data vision 
Growth of data volume & unstructured data sources 
Do I need to invest for Big Data & how can I use it? 
Do I have the right solution for it? 
Predictive Analytics an operational area for Big Data 
Main arguments for predictive Analytics 
Fields of operation 
Conclusion 
Real example based on Cubeware solution and EXASOL analytical DB
Vorname Nachname 
The data truth! 
But please keep in mind: „The combination of some data and an aching desire for an answer does not ensure that a reasonable answer can be extracted from a given 
body of data.“ 
Data is the new oil 
John Tukey
Vorname Nachname 
Growth of data volume & unstructured data sources 
Sensor Data 
Social Media 
Server Logs
Vorname Nachname 
The Challenge to define your Big Data vision 
Do I need to invest for Big Data? 
The answer is YES, if 
You have big volume of data from different source systems 
You need to analyze all these data in high speed mode 
You believe that you can make additional profit/reduce costs and increase efficiency based on analysis of these data for specific purposes 
How can I make use out of my historical data? 
Automatic transformation of unstructured* data to structured shape 
Making qualitative and quantitative analysis of the structured data 
The result of analysis extend the basics for decision support systems 
*: e.g. Machines sensor data, social media data, mobile communication data
Vorname Nachname 
The Enterprise Information Hub 
New Data sources 
Available Data sources 
Reporting 
Dash- boarding 
Analysis OLAP 
Data & Text Mining 
Business Analytics 
Operational Intelligence 
Data Marts OLAP Databases Virtual Cubes (e.g. EXASOL / SAP HANA) 
Business Applications ERP, CRM, … 
Cloud Data 
OLTP DMBS 
Hadoop, NoSQL, Log-Data Machine-Data 
Streaming Data Real time 
Statistical Data 
ETL 
Business Intelligence offering 
Structured and unstructured Data 
Data Warehouse(s) 
Complex Event Processing 
Analogue to: Bitkom 2012 - Combination of traditional BI landscape with Big Data solution
Vorname Nachname 
Predictive Analytics 
What can be targeted with Predictive Analytics (examples): 
Increase delivery capacity and adherence to delivery date 
More concrete planning of resources 
Improve product quality and increase productivity 
Efficient forecast and planning of product maintenance 
In accordance to this saying: You can not change your past, but you can change your future!
Vorname Nachname 
Predictive Analytics 
Predictive Analytics can provide positive result if it is implemented in right domains. The most recommended operational area are: 
Early detection of churn through analysis of customers behavior in specific situations or time frames 
Recognition of relationships and pattern to clarify insurance fraud 
Forecast about product sales for planning of capacity and resources 
Having reasonable mass of stock to keep capital tied as low as possible 
Optimized marketing campaign to address customers w. right offering 
Avoid machine outages by implementing in time repair & maintenance 
And more …
Vorname Nachname 
The Data (Source Data) 
Different, but similar sources 
Time series of Events (Occurrence) 
Treatment of Occurrences 
Categorization of Occurrences Critical Category: affecting net-income 
Additional Source Data (attributes)
Vorname Nachname 
The Approach 
Identify sequences 
Clustering of features of the occurrence in a sequence (Prediction Patterns) 
„Prediction“ of the next critical occurrence 
Algorithm (SPADE) = Sequential PAttern Discovery using Equivalence classes
Vorname Nachname 
The Approach – Overview 
Events 
A 
B 
D 
C 
B 
F 
B 
E 
G 
C 
Sequences 
B 
D 
C 
B 
F 
C 
B 
E 
G 
C 
Cluster of Sequences 
B 
D 
C 
B 
E 
G 
C 
A 
A 
M 
L 
C 
Prediction 
B 
D 
C
Vorname Nachname 
The Approach – Challenge 
Search Space (Number of frequent sequences) 
Objects (O) = Sources 
Attribute (A) = Occurrence, a source report 
Length of frequent sequences (k) ~ average number of events in sequence 
Theoretical „Search Space“ = O(A^k) 
10*(1000^5) = 1E+16 possible frequent sequences 
Sensor Data Source 1 … 10 
EXASolution CPU Memory 
Storage 
C8 Server 
C8 Cockpit 
Data Visualization 
Data Distribution
Vorname Nachname 
The Solution-Architecture 
Events 
A 
B 
D 
C 
B 
F 
B 
E 
G 
C 
A 
A 
M 
L 
C 
C8 Solutions Platform 
C8 Server 
C8 Cockpit 
EXASOLUTION 
Virtual Cube 
Compute Node CPU Memory 
Storage 
C8 Importer 
Cubeware Analyzer 
New Data sources 
Available Data sources 
Business Applications ERP, CRM, … 
Cloud Data 
Hadoop, NoSQL, Log-Data Machine-Data 
OLTP DMBS 
ETL Processing (C8 Importer)
Vorname Nachname 
Achievements 
Reduction of critical events by ~ 19%  improve the model 
Reduction of costs for maintenance ~ 10%  expected decline by an improved model 
Found unexpected relations 
Detection of a construction issue in a machine type
Thank you. 
Any questions? 
www.cubeware.com 
© 2014 by Cubeware GmbH

More Related Content

What's hot

Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...TigerGraph
 
Predictive maintenance withsensors_in_utilities_
Predictive maintenance withsensors_in_utilities_Predictive maintenance withsensors_in_utilities_
Predictive maintenance withsensors_in_utilities_Tina Zhang
 
Fraud prevention is better with TigerGraph inside
Fraud prevention is better with  TigerGraph insideFraud prevention is better with  TigerGraph inside
Fraud prevention is better with TigerGraph insideTigerGraph
 
How to program your way into data science?
How to program your way into data science?How to program your way into data science?
How to program your way into data science?DeZyre
 
How to add machine learning to your applications today
How to add machine learning to your applications todayHow to add machine learning to your applications today
How to add machine learning to your applications todayMichal Hodinka
 
eXplainable Predictive Decisioning: combine ML and Decision Management to pro...
eXplainable Predictive Decisioning: combine ML and Decision Management to pro...eXplainable Predictive Decisioning: combine ML and Decision Management to pro...
eXplainable Predictive Decisioning: combine ML and Decision Management to pro...All Things Open
 
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricUsing a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricCambridge Semantics
 
Resume_Analytics_Nidhi
Resume_Analytics_NidhiResume_Analytics_Nidhi
Resume_Analytics_NidhiNidhi Gupta
 
Software Engineering for Data Scientists
Software Engineering for Data ScientistsSoftware Engineering for Data Scientists
Software Engineering for Data ScientistsDomino Data Lab
 
Data Natives meets DataRobot | "Build and deploy an anti-money laundering mo...
Data Natives meets DataRobot |  "Build and deploy an anti-money laundering mo...Data Natives meets DataRobot |  "Build and deploy an anti-money laundering mo...
Data Natives meets DataRobot | "Build and deploy an anti-money laundering mo...Dataconomy Media
 
Applied Data Science Part 3: Getting dirty; data preparation and feature crea...
Applied Data Science Part 3: Getting dirty; data preparation and feature crea...Applied Data Science Part 3: Getting dirty; data preparation and feature crea...
Applied Data Science Part 3: Getting dirty; data preparation and feature crea...Dataiku
 
H2O World - Machine Learning at Comcast - Andrew Leamon & Chushi Ren
H2O World - Machine Learning at Comcast - Andrew Leamon & Chushi RenH2O World - Machine Learning at Comcast - Andrew Leamon & Chushi Ren
H2O World - Machine Learning at Comcast - Andrew Leamon & Chushi RenSri Ambati
 
Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)
Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)
Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)Denny Lee
 
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...TigerGraph
 
Things you need to know about big data
Things you need to know about big dataThings you need to know about big data
Things you need to know about big dataLantern Institute
 
Synthetic data generation for machine learning
Synthetic data generation for machine learningSynthetic data generation for machine learning
Synthetic data generation for machine learningQuantUniversity
 

What's hot (19)

Predictive analysis
Predictive analysisPredictive analysis
Predictive analysis
 
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
 
Predictive maintenance withsensors_in_utilities_
Predictive maintenance withsensors_in_utilities_Predictive maintenance withsensors_in_utilities_
Predictive maintenance withsensors_in_utilities_
 
Fraud prevention is better with TigerGraph inside
Fraud prevention is better with  TigerGraph insideFraud prevention is better with  TigerGraph inside
Fraud prevention is better with TigerGraph inside
 
How to program your way into data science?
How to program your way into data science?How to program your way into data science?
How to program your way into data science?
 
How to add machine learning to your applications today
How to add machine learning to your applications todayHow to add machine learning to your applications today
How to add machine learning to your applications today
 
eXplainable Predictive Decisioning: combine ML and Decision Management to pro...
eXplainable Predictive Decisioning: combine ML and Decision Management to pro...eXplainable Predictive Decisioning: combine ML and Decision Management to pro...
eXplainable Predictive Decisioning: combine ML and Decision Management to pro...
 
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricUsing a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
 
Resume_Analytics_Nidhi
Resume_Analytics_NidhiResume_Analytics_Nidhi
Resume_Analytics_Nidhi
 
Software Engineering for Data Scientists
Software Engineering for Data ScientistsSoftware Engineering for Data Scientists
Software Engineering for Data Scientists
 
Data Natives meets DataRobot | "Build and deploy an anti-money laundering mo...
Data Natives meets DataRobot |  "Build and deploy an anti-money laundering mo...Data Natives meets DataRobot |  "Build and deploy an anti-money laundering mo...
Data Natives meets DataRobot | "Build and deploy an anti-money laundering mo...
 
Applied Data Science Part 3: Getting dirty; data preparation and feature crea...
Applied Data Science Part 3: Getting dirty; data preparation and feature crea...Applied Data Science Part 3: Getting dirty; data preparation and feature crea...
Applied Data Science Part 3: Getting dirty; data preparation and feature crea...
 
H2O World - Machine Learning at Comcast - Andrew Leamon & Chushi Ren
H2O World - Machine Learning at Comcast - Andrew Leamon & Chushi RenH2O World - Machine Learning at Comcast - Andrew Leamon & Chushi Ren
H2O World - Machine Learning at Comcast - Andrew Leamon & Chushi Ren
 
Value Drivers for Your Data – Big, Fast, or Smart
Value Drivers for Your Data – Big, Fast, or SmartValue Drivers for Your Data – Big, Fast, or Smart
Value Drivers for Your Data – Big, Fast, or Smart
 
Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)
Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)
Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)
 
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
 
Things you need to know about big data
Things you need to know about big dataThings you need to know about big data
Things you need to know about big data
 
Synthetic data generation for machine learning
Synthetic data generation for machine learningSynthetic data generation for machine learning
Synthetic data generation for machine learning
 
Helping Business Leaders Get Over Their Learning Curve in Advanced Analytics
Helping Business Leaders Get Over Their Learning Curve in Advanced AnalyticsHelping Business Leaders Get Over Their Learning Curve in Advanced Analytics
Helping Business Leaders Get Over Their Learning Curve in Advanced Analytics
 

Similar to Tom Martens - Cube Ware - The big data challenge - bo

Data Mining with SQL Server 2008
Data Mining with SQL Server 2008Data Mining with SQL Server 2008
Data Mining with SQL Server 2008Peter Gfader
 
Data Mining with SQL Server 2005
Data Mining with SQL Server 2005Data Mining with SQL Server 2005
Data Mining with SQL Server 2005Dean Willson
 
Cssu dw dm
Cssu dw dmCssu dw dm
Cssu dw dmsumit621
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data SolutionJames Serra
 
Turn Data into Business Value – Starting with Data Analytics on Oracle Cloud ...
Turn Data into Business Value – Starting with Data Analytics on Oracle Cloud ...Turn Data into Business Value – Starting with Data Analytics on Oracle Cloud ...
Turn Data into Business Value – Starting with Data Analytics on Oracle Cloud ...Lucas Jellema
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefitsRicky Barron
 
Data science technology overview
Data science technology overviewData science technology overview
Data science technology overviewSoojung Hong
 
Splunk Business Analytics
Splunk Business AnalyticsSplunk Business Analytics
Splunk Business AnalyticsCleverDATA
 
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Sri Ambati
 
Introduction to HANA in-memory from SAP
Introduction to HANA in-memory from SAPIntroduction to HANA in-memory from SAP
Introduction to HANA in-memory from SAPugur candan
 
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph AnalysisBig Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph AnalysisYuanyuan Tian
 
V like Velocity, Predicting in Real-Time with Azure ML
V like Velocity, Predicting in Real-Time with Azure MLV like Velocity, Predicting in Real-Time with Azure ML
V like Velocity, Predicting in Real-Time with Azure MLBarbara Fusinska
 
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...Denodo
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptxamitparashar42
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptxamitparashar42
 

Similar to Tom Martens - Cube Ware - The big data challenge - bo (20)

big-data-anallytics.pptx
big-data-anallytics.pptxbig-data-anallytics.pptx
big-data-anallytics.pptx
 
Data Mining with SQL Server 2008
Data Mining with SQL Server 2008Data Mining with SQL Server 2008
Data Mining with SQL Server 2008
 
Data Mining with SQL Server 2005
Data Mining with SQL Server 2005Data Mining with SQL Server 2005
Data Mining with SQL Server 2005
 
Cssu dw dm
Cssu dw dmCssu dw dm
Cssu dw dm
 
ITReady DW Day2
ITReady DW Day2ITReady DW Day2
ITReady DW Day2
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data Solution
 
Spring 2017 Sage 300 (Accpac) Users Group
Spring 2017 Sage 300 (Accpac) Users GroupSpring 2017 Sage 300 (Accpac) Users Group
Spring 2017 Sage 300 (Accpac) Users Group
 
KNIME Meetup 2016-04-16
KNIME Meetup 2016-04-16KNIME Meetup 2016-04-16
KNIME Meetup 2016-04-16
 
Turn Data into Business Value – Starting with Data Analytics on Oracle Cloud ...
Turn Data into Business Value – Starting with Data Analytics on Oracle Cloud ...Turn Data into Business Value – Starting with Data Analytics on Oracle Cloud ...
Turn Data into Business Value – Starting with Data Analytics on Oracle Cloud ...
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefits
 
Data science technology overview
Data science technology overviewData science technology overview
Data science technology overview
 
Splunk Business Analytics
Splunk Business AnalyticsSplunk Business Analytics
Splunk Business Analytics
 
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
 
Introduction to HANA in-memory from SAP
Introduction to HANA in-memory from SAPIntroduction to HANA in-memory from SAP
Introduction to HANA in-memory from SAP
 
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph AnalysisBig Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
 
V like Velocity, Predicting in Real-Time with Azure ML
V like Velocity, Predicting in Real-Time with Azure MLV like Velocity, Predicting in Real-Time with Azure ML
V like Velocity, Predicting in Real-Time with Azure ML
 
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptx
 
Data Analytics Introduction.pptx
Data Analytics Introduction.pptxData Analytics Introduction.pptx
Data Analytics Introduction.pptx
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 

More from Sogeti Nederland B.V.

Sogeti User eXperience (UX) Lab - Usability Testing
Sogeti User eXperience (UX) Lab - Usability TestingSogeti User eXperience (UX) Lab - Usability Testing
Sogeti User eXperience (UX) Lab - Usability TestingSogeti Nederland B.V.
 
Usability Lab Live Roleplay - Charlotte Janus & Randy Semeleer
Usability Lab Live Roleplay - Charlotte Janus & Randy SemeleerUsability Lab Live Roleplay - Charlotte Janus & Randy Semeleer
Usability Lab Live Roleplay - Charlotte Janus & Randy SemeleerSogeti Nederland B.V.
 
OPS als enabler voor DevOps bij KvK - Robbert van der Houwen (KvK) & Conrad H...
OPS als enabler voor DevOps bij KvK - Robbert van der Houwen (KvK) & Conrad H...OPS als enabler voor DevOps bij KvK - Robbert van der Houwen (KvK) & Conrad H...
OPS als enabler voor DevOps bij KvK - Robbert van der Houwen (KvK) & Conrad H...Sogeti Nederland B.V.
 
Imagine Digital Safety Assured - Arno van de Velde (Micro Focus)
Imagine Digital Safety Assured - Arno van de Velde (Micro Focus)Imagine Digital Safety Assured - Arno van de Velde (Micro Focus)
Imagine Digital Safety Assured - Arno van de Velde (Micro Focus)Sogeti Nederland B.V.
 
Good practices in pentesting - Bas de Heer
Good practices in pentesting - Bas de HeerGood practices in pentesting - Bas de Heer
Good practices in pentesting - Bas de HeerSogeti Nederland B.V.
 
Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...
Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...
Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...Sogeti Nederland B.V.
 
Quality Assurance in fast paced DevOps projects using automation - Patrick Ve...
Quality Assurance in fast paced DevOps projects using automation - Patrick Ve...Quality Assurance in fast paced DevOps projects using automation - Patrick Ve...
Quality Assurance in fast paced DevOps projects using automation - Patrick Ve...Sogeti Nederland B.V.
 
Neste: How to Build Business Case, Ramp-up and Run SAP Test Automation Factory
Neste: How to Build Business Case, Ramp-up and Run SAP Test Automation FactoryNeste: How to Build Business Case, Ramp-up and Run SAP Test Automation Factory
Neste: How to Build Business Case, Ramp-up and Run SAP Test Automation FactorySogeti Nederland B.V.
 
HEAT KILLS: Drink plenty of water - Michael de Boer (DATPROF)
HEAT KILLS: Drink plenty of water - Michael de Boer (DATPROF)HEAT KILLS: Drink plenty of water - Michael de Boer (DATPROF)
HEAT KILLS: Drink plenty of water - Michael de Boer (DATPROF)Sogeti Nederland B.V.
 
Hands-off performance testing - Twan Koot & Addy Zwiebel
Hands-off performance testing - Twan Koot & Addy ZwiebelHands-off performance testing - Twan Koot & Addy Zwiebel
Hands-off performance testing - Twan Koot & Addy ZwiebelSogeti Nederland B.V.
 
Agile Quality Improvement 7: common challenges and good practices - Wouter Ru...
Agile Quality Improvement 7: common challenges and good practices - Wouter Ru...Agile Quality Improvement 7: common challenges and good practices - Wouter Ru...
Agile Quality Improvement 7: common challenges and good practices - Wouter Ru...Sogeti Nederland B.V.
 
Analyze your application portfolio to know where the quality and risk issues ...
Analyze your application portfolio to know where the quality and risk issues ...Analyze your application portfolio to know where the quality and risk issues ...
Analyze your application portfolio to know where the quality and risk issues ...Sogeti Nederland B.V.
 
Worldwide Testing & Cutover at Cargill - Daniele Hernandez-Aita & Leo Janssen
Worldwide Testing & Cutover at Cargill - Daniele Hernandez-Aita & Leo JanssenWorldwide Testing & Cutover at Cargill - Daniele Hernandez-Aita & Leo Janssen
Worldwide Testing & Cutover at Cargill - Daniele Hernandez-Aita & Leo JanssenSogeti Nederland B.V.
 
Workshop: Usability Lab Live Roleplay - Charlotte Janus & Randy Semeleer
Workshop: Usability Lab Live Roleplay - Charlotte Janus & Randy SemeleerWorkshop: Usability Lab Live Roleplay - Charlotte Janus & Randy Semeleer
Workshop: Usability Lab Live Roleplay - Charlotte Janus & Randy SemeleerSogeti Nederland B.V.
 
The state of Continuous Testing in 2019 - Antoine Aymer
The state of Continuous Testing in 2019 - Antoine AymerThe state of Continuous Testing in 2019 - Antoine Aymer
The state of Continuous Testing in 2019 - Antoine AymerSogeti Nederland B.V.
 
Reduceer je False Negatives - Peter Wanders van KLM
Reduceer je False Negatives - Peter Wanders van KLMReduceer je False Negatives - Peter Wanders van KLM
Reduceer je False Negatives - Peter Wanders van KLMSogeti Nederland B.V.
 
Effective Insights in an Hour - Daan Oosters
Effective Insights in an Hour - Daan OostersEffective Insights in an Hour - Daan Oosters
Effective Insights in an Hour - Daan OostersSogeti Nederland B.V.
 
Data Driven Ecosystemen - Marcel Schmidt & Dennis Van Gelder
Data Driven Ecosystemen - Marcel Schmidt & Dennis Van GelderData Driven Ecosystemen - Marcel Schmidt & Dennis Van Gelder
Data Driven Ecosystemen - Marcel Schmidt & Dennis Van GelderSogeti Nederland B.V.
 
(Niet) effectief exploratory testen - Rik Marselis
(Niet) effectief exploratory testen - Rik Marselis(Niet) effectief exploratory testen - Rik Marselis
(Niet) effectief exploratory testen - Rik MarselisSogeti Nederland B.V.
 
Charlotte van 't Wout - Slim ondernemen met Instagram
Charlotte van 't Wout - Slim ondernemen met InstagramCharlotte van 't Wout - Slim ondernemen met Instagram
Charlotte van 't Wout - Slim ondernemen met InstagramSogeti Nederland B.V.
 

More from Sogeti Nederland B.V. (20)

Sogeti User eXperience (UX) Lab - Usability Testing
Sogeti User eXperience (UX) Lab - Usability TestingSogeti User eXperience (UX) Lab - Usability Testing
Sogeti User eXperience (UX) Lab - Usability Testing
 
Usability Lab Live Roleplay - Charlotte Janus & Randy Semeleer
Usability Lab Live Roleplay - Charlotte Janus & Randy SemeleerUsability Lab Live Roleplay - Charlotte Janus & Randy Semeleer
Usability Lab Live Roleplay - Charlotte Janus & Randy Semeleer
 
OPS als enabler voor DevOps bij KvK - Robbert van der Houwen (KvK) & Conrad H...
OPS als enabler voor DevOps bij KvK - Robbert van der Houwen (KvK) & Conrad H...OPS als enabler voor DevOps bij KvK - Robbert van der Houwen (KvK) & Conrad H...
OPS als enabler voor DevOps bij KvK - Robbert van der Houwen (KvK) & Conrad H...
 
Imagine Digital Safety Assured - Arno van de Velde (Micro Focus)
Imagine Digital Safety Assured - Arno van de Velde (Micro Focus)Imagine Digital Safety Assured - Arno van de Velde (Micro Focus)
Imagine Digital Safety Assured - Arno van de Velde (Micro Focus)
 
Good practices in pentesting - Bas de Heer
Good practices in pentesting - Bas de HeerGood practices in pentesting - Bas de Heer
Good practices in pentesting - Bas de Heer
 
Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...
Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...
Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...
 
Quality Assurance in fast paced DevOps projects using automation - Patrick Ve...
Quality Assurance in fast paced DevOps projects using automation - Patrick Ve...Quality Assurance in fast paced DevOps projects using automation - Patrick Ve...
Quality Assurance in fast paced DevOps projects using automation - Patrick Ve...
 
Neste: How to Build Business Case, Ramp-up and Run SAP Test Automation Factory
Neste: How to Build Business Case, Ramp-up and Run SAP Test Automation FactoryNeste: How to Build Business Case, Ramp-up and Run SAP Test Automation Factory
Neste: How to Build Business Case, Ramp-up and Run SAP Test Automation Factory
 
HEAT KILLS: Drink plenty of water - Michael de Boer (DATPROF)
HEAT KILLS: Drink plenty of water - Michael de Boer (DATPROF)HEAT KILLS: Drink plenty of water - Michael de Boer (DATPROF)
HEAT KILLS: Drink plenty of water - Michael de Boer (DATPROF)
 
Hands-off performance testing - Twan Koot & Addy Zwiebel
Hands-off performance testing - Twan Koot & Addy ZwiebelHands-off performance testing - Twan Koot & Addy Zwiebel
Hands-off performance testing - Twan Koot & Addy Zwiebel
 
Agile Quality Improvement 7: common challenges and good practices - Wouter Ru...
Agile Quality Improvement 7: common challenges and good practices - Wouter Ru...Agile Quality Improvement 7: common challenges and good practices - Wouter Ru...
Agile Quality Improvement 7: common challenges and good practices - Wouter Ru...
 
Analyze your application portfolio to know where the quality and risk issues ...
Analyze your application portfolio to know where the quality and risk issues ...Analyze your application portfolio to know where the quality and risk issues ...
Analyze your application portfolio to know where the quality and risk issues ...
 
Worldwide Testing & Cutover at Cargill - Daniele Hernandez-Aita & Leo Janssen
Worldwide Testing & Cutover at Cargill - Daniele Hernandez-Aita & Leo JanssenWorldwide Testing & Cutover at Cargill - Daniele Hernandez-Aita & Leo Janssen
Worldwide Testing & Cutover at Cargill - Daniele Hernandez-Aita & Leo Janssen
 
Workshop: Usability Lab Live Roleplay - Charlotte Janus & Randy Semeleer
Workshop: Usability Lab Live Roleplay - Charlotte Janus & Randy SemeleerWorkshop: Usability Lab Live Roleplay - Charlotte Janus & Randy Semeleer
Workshop: Usability Lab Live Roleplay - Charlotte Janus & Randy Semeleer
 
The state of Continuous Testing in 2019 - Antoine Aymer
The state of Continuous Testing in 2019 - Antoine AymerThe state of Continuous Testing in 2019 - Antoine Aymer
The state of Continuous Testing in 2019 - Antoine Aymer
 
Reduceer je False Negatives - Peter Wanders van KLM
Reduceer je False Negatives - Peter Wanders van KLMReduceer je False Negatives - Peter Wanders van KLM
Reduceer je False Negatives - Peter Wanders van KLM
 
Effective Insights in an Hour - Daan Oosters
Effective Insights in an Hour - Daan OostersEffective Insights in an Hour - Daan Oosters
Effective Insights in an Hour - Daan Oosters
 
Data Driven Ecosystemen - Marcel Schmidt & Dennis Van Gelder
Data Driven Ecosystemen - Marcel Schmidt & Dennis Van GelderData Driven Ecosystemen - Marcel Schmidt & Dennis Van Gelder
Data Driven Ecosystemen - Marcel Schmidt & Dennis Van Gelder
 
(Niet) effectief exploratory testen - Rik Marselis
(Niet) effectief exploratory testen - Rik Marselis(Niet) effectief exploratory testen - Rik Marselis
(Niet) effectief exploratory testen - Rik Marselis
 
Charlotte van 't Wout - Slim ondernemen met Instagram
Charlotte van 't Wout - Slim ondernemen met InstagramCharlotte van 't Wout - Slim ondernemen met Instagram
Charlotte van 't Wout - Slim ondernemen met Instagram
 

Tom Martens - Cube Ware - The big data challenge - bo

  • 1. Big Data Challenge Real example in industry Tom Martens Bussum, 25th November 2014 Business Analytics
  • 2. Vorname Nachname Content The Challenge to define your Big Data vision Growth of data volume & unstructured data sources Do I need to invest for Big Data & how can I use it? Do I have the right solution for it? Predictive Analytics an operational area for Big Data Main arguments for predictive Analytics Fields of operation Conclusion Real example based on Cubeware solution and EXASOL analytical DB
  • 3. Vorname Nachname The data truth! But please keep in mind: „The combination of some data and an aching desire for an answer does not ensure that a reasonable answer can be extracted from a given body of data.“ Data is the new oil John Tukey
  • 4. Vorname Nachname Growth of data volume & unstructured data sources Sensor Data Social Media Server Logs
  • 5. Vorname Nachname The Challenge to define your Big Data vision Do I need to invest for Big Data? The answer is YES, if You have big volume of data from different source systems You need to analyze all these data in high speed mode You believe that you can make additional profit/reduce costs and increase efficiency based on analysis of these data for specific purposes How can I make use out of my historical data? Automatic transformation of unstructured* data to structured shape Making qualitative and quantitative analysis of the structured data The result of analysis extend the basics for decision support systems *: e.g. Machines sensor data, social media data, mobile communication data
  • 6. Vorname Nachname The Enterprise Information Hub New Data sources Available Data sources Reporting Dash- boarding Analysis OLAP Data & Text Mining Business Analytics Operational Intelligence Data Marts OLAP Databases Virtual Cubes (e.g. EXASOL / SAP HANA) Business Applications ERP, CRM, … Cloud Data OLTP DMBS Hadoop, NoSQL, Log-Data Machine-Data Streaming Data Real time Statistical Data ETL Business Intelligence offering Structured and unstructured Data Data Warehouse(s) Complex Event Processing Analogue to: Bitkom 2012 - Combination of traditional BI landscape with Big Data solution
  • 7. Vorname Nachname Predictive Analytics What can be targeted with Predictive Analytics (examples): Increase delivery capacity and adherence to delivery date More concrete planning of resources Improve product quality and increase productivity Efficient forecast and planning of product maintenance In accordance to this saying: You can not change your past, but you can change your future!
  • 8. Vorname Nachname Predictive Analytics Predictive Analytics can provide positive result if it is implemented in right domains. The most recommended operational area are: Early detection of churn through analysis of customers behavior in specific situations or time frames Recognition of relationships and pattern to clarify insurance fraud Forecast about product sales for planning of capacity and resources Having reasonable mass of stock to keep capital tied as low as possible Optimized marketing campaign to address customers w. right offering Avoid machine outages by implementing in time repair & maintenance And more …
  • 9. Vorname Nachname The Data (Source Data) Different, but similar sources Time series of Events (Occurrence) Treatment of Occurrences Categorization of Occurrences Critical Category: affecting net-income Additional Source Data (attributes)
  • 10. Vorname Nachname The Approach Identify sequences Clustering of features of the occurrence in a sequence (Prediction Patterns) „Prediction“ of the next critical occurrence Algorithm (SPADE) = Sequential PAttern Discovery using Equivalence classes
  • 11. Vorname Nachname The Approach – Overview Events A B D C B F B E G C Sequences B D C B F C B E G C Cluster of Sequences B D C B E G C A A M L C Prediction B D C
  • 12. Vorname Nachname The Approach – Challenge Search Space (Number of frequent sequences) Objects (O) = Sources Attribute (A) = Occurrence, a source report Length of frequent sequences (k) ~ average number of events in sequence Theoretical „Search Space“ = O(A^k) 10*(1000^5) = 1E+16 possible frequent sequences Sensor Data Source 1 … 10 EXASolution CPU Memory Storage C8 Server C8 Cockpit Data Visualization Data Distribution
  • 13. Vorname Nachname The Solution-Architecture Events A B D C B F B E G C A A M L C C8 Solutions Platform C8 Server C8 Cockpit EXASOLUTION Virtual Cube Compute Node CPU Memory Storage C8 Importer Cubeware Analyzer New Data sources Available Data sources Business Applications ERP, CRM, … Cloud Data Hadoop, NoSQL, Log-Data Machine-Data OLTP DMBS ETL Processing (C8 Importer)
  • 14. Vorname Nachname Achievements Reduction of critical events by ~ 19%  improve the model Reduction of costs for maintenance ~ 10%  expected decline by an improved model Found unexpected relations Detection of a construction issue in a machine type
  • 15. Thank you. Any questions? www.cubeware.com © 2014 by Cubeware GmbH