SlideShare uma empresa Scribd logo
1 de 25
How to Lie with Data and Statistics?
Iveta Lohovska
Principal Data Scientist @lohovska
Who am I ?
2
3
Home Cities TransportationIndustrial
“Moving from a good intuition to a Data Driven Decision”
Who am I ?
Principal Data Scientist
Open Data, Computer Vision,
IoT Data, HPC
Who are you ?
4
Mathematician
Who are you ?
5
Data
Engineer
Data Scientists
Computer
Scientist
Business Analyst
Machine vs. human generated data?
6
Machine vs. human generated data
CRISP-MD CRoss-Industry Standard Process for Data Mining
Plan
Understand the business
Acquire
Data Understanding
Transform
Data Preparation
Model
Build &Testing
Visualise
Communicate and evaluate
Score
Productionize the model
• What are your
business goals?
• What outcome are
you trying to change
• Are the answers you
are looking for:
• Descriptive
• Predictive
• Prescriptive
• Are there
constraints to the
use of your data
• What is the
meaning and
relevance of your
data
• What sampling
methods were used
• Cleanse the data
• Analyse/reduce
variables
• Plot the data
• Discover first
insights into the
data
• Select and build a
model
• Train the model
• Validate the model
• Does the model
teach us anything
• Communicate and
visualise the results
• What did we learn
• Do the results make
sense
• Can we deploy the
model
• Publish/deploy the
model
• Implement real-
time data
transformation for
real-time scoring
• Schedule data
transformation for
batch scoring
• Make informed
decisions
Why Machine Data(IoT)?
8
How do you collect and process this analog information, to
transform into useful business insights?
9
Information Insight
Internet of Things
How do you collect and process this analog information, to
transform into useful business insights?
10
Information Insight
Internet of Things
Daily shipments in tons
throughout the year
“Winter schedule”
“Summer schedule”
“Winter schedule”
JAN APR JUL OCT DEC
Daily shipments in tons
throughout the year
JAN APR JUL OCT DEC
“Transmetrics schedule”
Traditional network capacity planning: without prediction Predictive network capacity planning: with prediction models
Analog Data Digital Data
What has been holding people back?
11
Cities TransportationIndustrial
An Architecture for Machine Data(IoT)
12
Cities TransportationIndustrial
Data Understanding?
13
14
Collect data Clean data Identify patterns Make prediction
Data Understanding
hindsight insight foresight
The type of the information you have for a device:
15
Colect Data Clean Data Identify patterns Make prediction
Devices-
Gateway
Connectivity
Eventpipeline
AnalyticsApplications
The type of the information you have for a device:
16
Assets/Beacons Access Points Wi-Fi Router
Inventory Interface
Sensors sometimes lie
17
Sensors sometimes lie
18
Even when they aren’t lying, sensors don’t always tell the whole truth
19
This might be a problem… … or loose device connection.
Even when they aren’t lying, sensors don’t always tell the whole truth
20
What the sensor reads… … what the control unit stores and forwards
Extracting useful signal from time-series sensor data requires ‘multi-genre’
Predictive Analytics – and additional data
21
Analytics
Capture full-fidelity
data to enable use-
case specific event
detection
Interpolation of
missing values,
corrections,
recalibration, etc.
Identification of state
change; matching
Comparison and
correlation with other
systems(CRM,
Marketing, etc.)
Raw sensor data
Raw sensor data from
adjacent sensors;
Master data
Alert data; historical
data; environmental
data
-
Interpolation;
Neural networks;
Smoothing
Time-series;
Pattern recognition;
Event mapping
Graph; Clustering;
Predictions; Decisions
trees
Whole device
historical data
Raw IoT data Cleansed IoT data Event Detection Path to association
A
A
A
Comparison and
correlation with
human observations
Text and network
analytics
Maintenance and
operational data
Labelled IoT data
Data
Process
Sensors typically don’t measure the quantity of interest directly
22
By themselves, sensor data are of only limited value
23
Key Takeaways?
24
• Understand how data process affect
our decision making.
• Engage with the numbers all around
us.
Thank you!

Mais conteúdo relacionado

Mais procurados

Data Science
Data ScienceData Science
Data ScienceRabin BK
 
Career in Data Science
Career in Data ScienceCareer in Data Science
Career in Data ScienceActonRoy
 
Gianluigi Vigano, Senior Architect and Fouad Teban, Regional Presales Manager...
Gianluigi Vigano, Senior Architect and Fouad Teban, Regional Presales Manager...Gianluigi Vigano, Senior Architect and Fouad Teban, Regional Presales Manager...
Gianluigi Vigano, Senior Architect and Fouad Teban, Regional Presales Manager...Dataconomy Media
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientistVijayMohan Vasu
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data sciencebhavesh lande
 
1. Data Analytics-introduction
1. Data Analytics-introduction1. Data Analytics-introduction
1. Data Analytics-introductionkrishna singh
 
Leveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive IndustryLeveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive IndustryDomino Data Lab
 
Data science and visualization lab presentation
Data science and visualization lab presentationData science and visualization lab presentation
Data science and visualization lab presentationiHub Research
 
Privacy preserving machine learning
Privacy preserving machine learningPrivacy preserving machine learning
Privacy preserving machine learningMichał Kuźba
 
International Journal on Soft Computing, Artificial Intelligence and Applicat...
International Journal on Soft Computing, Artificial Intelligence and Applicat...International Journal on Soft Computing, Artificial Intelligence and Applicat...
International Journal on Soft Computing, Artificial Intelligence and Applicat...ijscai
 
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...Dataconomy Media
 
Credit Card Fraud Detection Using ML In Databricks
Credit Card Fraud Detection Using ML In DatabricksCredit Card Fraud Detection Using ML In Databricks
Credit Card Fraud Detection Using ML In DatabricksDatabricks
 
Big data business analytics | Introduction to Business Analytics
Big data business analytics | Introduction to Business AnalyticsBig data business analytics | Introduction to Business Analytics
Big data business analytics | Introduction to Business AnalyticsShilpaKrishna6
 
Data Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information CollateralData Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information CollateralFrank Kienle
 
From Science to Data: Following a principled path to Data Science
From Science to Data: Following a principled path to Data ScienceFrom Science to Data: Following a principled path to Data Science
From Science to Data: Following a principled path to Data ScienceInstitute of Contemporary Sciences
 

Mais procurados (20)

Data Science
Data ScienceData Science
Data Science
 
Career in Data Science
Career in Data ScienceCareer in Data Science
Career in Data Science
 
Gianluigi Vigano, Senior Architect and Fouad Teban, Regional Presales Manager...
Gianluigi Vigano, Senior Architect and Fouad Teban, Regional Presales Manager...Gianluigi Vigano, Senior Architect and Fouad Teban, Regional Presales Manager...
Gianluigi Vigano, Senior Architect and Fouad Teban, Regional Presales Manager...
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
 
Data science Big Data
Data science Big DataData science Big Data
Data science Big Data
 
Building up a Data Science Team from Scratch
Building up a Data Science Team from ScratchBuilding up a Data Science Team from Scratch
Building up a Data Science Team from Scratch
 
The 10th ACC Audience survey report
The 10th ACC Audience survey reportThe 10th ACC Audience survey report
The 10th ACC Audience survey report
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
1. Data Analytics-introduction
1. Data Analytics-introduction1. Data Analytics-introduction
1. Data Analytics-introduction
 
Leveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive IndustryLeveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive Industry
 
Data science and visualization lab presentation
Data science and visualization lab presentationData science and visualization lab presentation
Data science and visualization lab presentation
 
Privacy preserving machine learning
Privacy preserving machine learningPrivacy preserving machine learning
Privacy preserving machine learning
 
International Journal on Soft Computing, Artificial Intelligence and Applicat...
International Journal on Soft Computing, Artificial Intelligence and Applicat...International Journal on Soft Computing, Artificial Intelligence and Applicat...
International Journal on Soft Computing, Artificial Intelligence and Applicat...
 
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
 
Credit Card Fraud Detection Using ML In Databricks
Credit Card Fraud Detection Using ML In DatabricksCredit Card Fraud Detection Using ML In Databricks
Credit Card Fraud Detection Using ML In Databricks
 
Big data business analytics | Introduction to Business Analytics
Big data business analytics | Introduction to Business AnalyticsBig data business analytics | Introduction to Business Analytics
Big data business analytics | Introduction to Business Analytics
 
Big Data
Big DataBig Data
Big Data
 
Data Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information CollateralData Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information Collateral
 
From Science to Data: Following a principled path to Data Science
From Science to Data: Following a principled path to Data ScienceFrom Science to Data: Following a principled path to Data Science
From Science to Data: Following a principled path to Data Science
 
Chatbots: Automated Conversational Model using Machine Learning
Chatbots: Automated Conversational Model using Machine LearningChatbots: Automated Conversational Model using Machine Learning
Chatbots: Automated Conversational Model using Machine Learning
 

Semelhante a How to Lie with Data and Statistics? | Iveta Lohovska, Principal Data Scientist | DN18

Big Data : Risks and Opportunities
Big Data : Risks and OpportunitiesBig Data : Risks and Opportunities
Big Data : Risks and OpportunitiesKenny Huang Ph.D.
 
WWV2015: Jibes Paul van der Hulst big data
WWV2015: Jibes Paul van der Hulst big dataWWV2015: Jibes Paul van der Hulst big data
WWV2015: Jibes Paul van der Hulst big datawebwinkelvakdag
 
Startds9.19.17sd
Startds9.19.17sdStartds9.19.17sd
Startds9.19.17sdThinkful
 
Data sci sd-11.6.17
Data sci sd-11.6.17Data sci sd-11.6.17
Data sci sd-11.6.17Thinkful
 
D92-198gstindspdx
D92-198gstindspdxD92-198gstindspdx
D92-198gstindspdxThinkful
 
Gary Hope - Machine Learning: It's Not as Hard as you Think
Gary Hope - Machine Learning: It's Not as Hard as you ThinkGary Hope - Machine Learning: It's Not as Hard as you Think
Gary Hope - Machine Learning: It's Not as Hard as you ThinkSaratoga
 
Neo4j Graph Data Science - Webinar
Neo4j Graph Data Science - WebinarNeo4j Graph Data Science - Webinar
Neo4j Graph Data Science - WebinarNeo4j
 
Getstarteddssd12717sd
Getstarteddssd12717sdGetstarteddssd12717sd
Getstarteddssd12717sdThinkful
 
Analytics as-a-service-io t-asia-arpanpal
Analytics as-a-service-io t-asia-arpanpalAnalytics as-a-service-io t-asia-arpanpal
Analytics as-a-service-io t-asia-arpanpalArpan Pal
 
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptxarpit206900
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI dayMohammed Barakat
 
Don't build a data science team
Don't build a data science teamDon't build a data science team
Don't build a data science teamLars Albertsson
 
Infopulse AI, Data Science & RPA Managed Services
Infopulse AI, Data Science & RPA Managed ServicesInfopulse AI, Data Science & RPA Managed Services
Infopulse AI, Data Science & RPA Managed ServicesInfopulse
 
Introduction to DS, ML and IBM Tools
Introduction to DS, ML and IBM ToolsIntroduction to DS, ML and IBM Tools
Introduction to DS, ML and IBM ToolsQamar un Nisa
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Thinkful - Intro to Data Science - Washington DC
Thinkful - Intro to Data Science - Washington DCThinkful - Intro to Data Science - Washington DC
Thinkful - Intro to Data Science - Washington DCTJ Stalcup
 
Predictive Analytics World Chicago 2015
Predictive Analytics World Chicago 2015Predictive Analytics World Chicago 2015
Predictive Analytics World Chicago 2015Dan Potter
 
Advanced Analytics for Any Data at Real-Time Speed
Advanced Analytics for Any Data at Real-Time SpeedAdvanced Analytics for Any Data at Real-Time Speed
Advanced Analytics for Any Data at Real-Time Speeddanpotterdwch
 

Semelhante a How to Lie with Data and Statistics? | Iveta Lohovska, Principal Data Scientist | DN18 (20)

Big Data : Risks and Opportunities
Big Data : Risks and OpportunitiesBig Data : Risks and Opportunities
Big Data : Risks and Opportunities
 
WWV2015: Jibes Paul van der Hulst big data
WWV2015: Jibes Paul van der Hulst big dataWWV2015: Jibes Paul van der Hulst big data
WWV2015: Jibes Paul van der Hulst big data
 
Startds9.19.17sd
Startds9.19.17sdStartds9.19.17sd
Startds9.19.17sd
 
Data sci sd-11.6.17
Data sci sd-11.6.17Data sci sd-11.6.17
Data sci sd-11.6.17
 
D92-198gstindspdx
D92-198gstindspdxD92-198gstindspdx
D92-198gstindspdx
 
Gary Hope - Machine Learning: It's Not as Hard as you Think
Gary Hope - Machine Learning: It's Not as Hard as you ThinkGary Hope - Machine Learning: It's Not as Hard as you Think
Gary Hope - Machine Learning: It's Not as Hard as you Think
 
Neo4j Graph Data Science - Webinar
Neo4j Graph Data Science - WebinarNeo4j Graph Data Science - Webinar
Neo4j Graph Data Science - Webinar
 
Getstarteddssd12717sd
Getstarteddssd12717sdGetstarteddssd12717sd
Getstarteddssd12717sd
 
Data Analytics for IoT
Data Analytics for IoT Data Analytics for IoT
Data Analytics for IoT
 
Analytics as-a-service-io t-asia-arpanpal
Analytics as-a-service-io t-asia-arpanpalAnalytics as-a-service-io t-asia-arpanpal
Analytics as-a-service-io t-asia-arpanpal
 
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI day
 
Don't build a data science team
Don't build a data science teamDon't build a data science team
Don't build a data science team
 
Infopulse AI, Data Science & RPA Managed Services
Infopulse AI, Data Science & RPA Managed ServicesInfopulse AI, Data Science & RPA Managed Services
Infopulse AI, Data Science & RPA Managed Services
 
Introduction to DS, ML and IBM Tools
Introduction to DS, ML and IBM ToolsIntroduction to DS, ML and IBM Tools
Introduction to DS, ML and IBM Tools
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Thinkful - Intro to Data Science - Washington DC
Thinkful - Intro to Data Science - Washington DCThinkful - Intro to Data Science - Washington DC
Thinkful - Intro to Data Science - Washington DC
 
Just ask Watson Seminar
Just ask Watson SeminarJust ask Watson Seminar
Just ask Watson Seminar
 
Predictive Analytics World Chicago 2015
Predictive Analytics World Chicago 2015Predictive Analytics World Chicago 2015
Predictive Analytics World Chicago 2015
 
Advanced Analytics for Any Data at Real-Time Speed
Advanced Analytics for Any Data at Real-Time SpeedAdvanced Analytics for Any Data at Real-Time Speed
Advanced Analytics for Any Data at Real-Time Speed
 

Mais de DataconomyGmbH

The Data Janitor Returns | Daniel Molnar | DN18
The Data Janitor Returns | Daniel Molnar | DN18The Data Janitor Returns | Daniel Molnar | DN18
The Data Janitor Returns | Daniel Molnar | DN18DataconomyGmbH
 
Technical debt in ML | Jaroslaw Szymczak | DN18
Technical debt in ML | Jaroslaw Szymczak | DN18Technical debt in ML | Jaroslaw Szymczak | DN18
Technical debt in ML | Jaroslaw Szymczak | DN18DataconomyGmbH
 
Accessing Online Text-based conversation | Jay Krall | DN2018
Accessing Online Text-based conversation | Jay Krall | DN2018Accessing Online Text-based conversation | Jay Krall | DN2018
Accessing Online Text-based conversation | Jay Krall | DN2018DataconomyGmbH
 
Journey from Structured to Unstructured Data | Nischal HP | VP, Engineering a...
Journey from Structured to Unstructured Data | Nischal HP | VP, Engineering a...Journey from Structured to Unstructured Data | Nischal HP | VP, Engineering a...
Journey from Structured to Unstructured Data | Nischal HP | VP, Engineering a...DataconomyGmbH
 
Causal inference-for-profit | Dan McKinley | DN18
Causal inference-for-profit | Dan McKinley | DN18Causal inference-for-profit | Dan McKinley | DN18
Causal inference-for-profit | Dan McKinley | DN18DataconomyGmbH
 
Data Science in Clinical Care | Johannes Starlinger, Charité | DN18
Data Science in Clinical Care | Johannes Starlinger, Charité | DN18Data Science in Clinical Care | Johannes Starlinger, Charité | DN18
Data Science in Clinical Care | Johannes Starlinger, Charité | DN18DataconomyGmbH
 
Building a Data Science Consultancy | Bart Smeets | DN18
Building a Data Science Consultancy | Bart Smeets | DN18Building a Data Science Consultancy | Bart Smeets | DN18
Building a Data Science Consultancy | Bart Smeets | DN18DataconomyGmbH
 
Building Sustainable Machine Learning Products for Communities, by Communit...
Building Sustainable Machine Learning Products  for Communities,  by Communit...Building Sustainable Machine Learning Products  for Communities,  by Communit...
Building Sustainable Machine Learning Products for Communities, by Communit...DataconomyGmbH
 
BIG DATA is DEAD | Marc Weimer-Hablitzel, Etventure | DN18
BIG DATA  is DEAD | Marc Weimer-Hablitzel, Etventure | DN18BIG DATA  is DEAD | Marc Weimer-Hablitzel, Etventure | DN18
BIG DATA is DEAD | Marc Weimer-Hablitzel, Etventure | DN18DataconomyGmbH
 
Undermining democracy | Alisa Kolesnikova | DN18
Undermining  democracy | Alisa Kolesnikova | DN18Undermining  democracy | Alisa Kolesnikova | DN18
Undermining democracy | Alisa Kolesnikova | DN18DataconomyGmbH
 
Support automation with chatbots | Erik Pfannmöller | Founder, Solvemate | DN18
Support automation with chatbots | Erik Pfannmöller | Founder, Solvemate | DN18Support automation with chatbots | Erik Pfannmöller | Founder, Solvemate | DN18
Support automation with chatbots | Erik Pfannmöller | Founder, Solvemate | DN18DataconomyGmbH
 
Rent, Rain, and Regulations | Du Phan, Dataiku | DN18
Rent, Rain, and Regulations | Du Phan, Dataiku | DN18Rent, Rain, and Regulations | Du Phan, Dataiku | DN18
Rent, Rain, and Regulations | Du Phan, Dataiku | DN18DataconomyGmbH
 
Linked data in an era of data surrealism Hans Constandt | CEO & FOUNDER
Linked data in an era of data surrealism Hans Constandt | CEO & FOUNDERLinked data in an era of data surrealism Hans Constandt | CEO & FOUNDER
Linked data in an era of data surrealism Hans Constandt | CEO & FOUNDERDataconomyGmbH
 
Living in an Era of Data Surrealism | Hans Constandt | CEO & FOUNDER
Living in an Era of Data Surrealism | Hans Constandt | CEO & FOUNDERLiving in an Era of Data Surrealism | Hans Constandt | CEO & FOUNDER
Living in an Era of Data Surrealism | Hans Constandt | CEO & FOUNDERDataconomyGmbH
 
Are You Ready for the Quickening!
Are You Ready for the Quickening!Are You Ready for the Quickening!
Are You Ready for the Quickening!DataconomyGmbH
 

Mais de DataconomyGmbH (15)

The Data Janitor Returns | Daniel Molnar | DN18
The Data Janitor Returns | Daniel Molnar | DN18The Data Janitor Returns | Daniel Molnar | DN18
The Data Janitor Returns | Daniel Molnar | DN18
 
Technical debt in ML | Jaroslaw Szymczak | DN18
Technical debt in ML | Jaroslaw Szymczak | DN18Technical debt in ML | Jaroslaw Szymczak | DN18
Technical debt in ML | Jaroslaw Szymczak | DN18
 
Accessing Online Text-based conversation | Jay Krall | DN2018
Accessing Online Text-based conversation | Jay Krall | DN2018Accessing Online Text-based conversation | Jay Krall | DN2018
Accessing Online Text-based conversation | Jay Krall | DN2018
 
Journey from Structured to Unstructured Data | Nischal HP | VP, Engineering a...
Journey from Structured to Unstructured Data | Nischal HP | VP, Engineering a...Journey from Structured to Unstructured Data | Nischal HP | VP, Engineering a...
Journey from Structured to Unstructured Data | Nischal HP | VP, Engineering a...
 
Causal inference-for-profit | Dan McKinley | DN18
Causal inference-for-profit | Dan McKinley | DN18Causal inference-for-profit | Dan McKinley | DN18
Causal inference-for-profit | Dan McKinley | DN18
 
Data Science in Clinical Care | Johannes Starlinger, Charité | DN18
Data Science in Clinical Care | Johannes Starlinger, Charité | DN18Data Science in Clinical Care | Johannes Starlinger, Charité | DN18
Data Science in Clinical Care | Johannes Starlinger, Charité | DN18
 
Building a Data Science Consultancy | Bart Smeets | DN18
Building a Data Science Consultancy | Bart Smeets | DN18Building a Data Science Consultancy | Bart Smeets | DN18
Building a Data Science Consultancy | Bart Smeets | DN18
 
Building Sustainable Machine Learning Products for Communities, by Communit...
Building Sustainable Machine Learning Products  for Communities,  by Communit...Building Sustainable Machine Learning Products  for Communities,  by Communit...
Building Sustainable Machine Learning Products for Communities, by Communit...
 
BIG DATA is DEAD | Marc Weimer-Hablitzel, Etventure | DN18
BIG DATA  is DEAD | Marc Weimer-Hablitzel, Etventure | DN18BIG DATA  is DEAD | Marc Weimer-Hablitzel, Etventure | DN18
BIG DATA is DEAD | Marc Weimer-Hablitzel, Etventure | DN18
 
Undermining democracy | Alisa Kolesnikova | DN18
Undermining  democracy | Alisa Kolesnikova | DN18Undermining  democracy | Alisa Kolesnikova | DN18
Undermining democracy | Alisa Kolesnikova | DN18
 
Support automation with chatbots | Erik Pfannmöller | Founder, Solvemate | DN18
Support automation with chatbots | Erik Pfannmöller | Founder, Solvemate | DN18Support automation with chatbots | Erik Pfannmöller | Founder, Solvemate | DN18
Support automation with chatbots | Erik Pfannmöller | Founder, Solvemate | DN18
 
Rent, Rain, and Regulations | Du Phan, Dataiku | DN18
Rent, Rain, and Regulations | Du Phan, Dataiku | DN18Rent, Rain, and Regulations | Du Phan, Dataiku | DN18
Rent, Rain, and Regulations | Du Phan, Dataiku | DN18
 
Linked data in an era of data surrealism Hans Constandt | CEO & FOUNDER
Linked data in an era of data surrealism Hans Constandt | CEO & FOUNDERLinked data in an era of data surrealism Hans Constandt | CEO & FOUNDER
Linked data in an era of data surrealism Hans Constandt | CEO & FOUNDER
 
Living in an Era of Data Surrealism | Hans Constandt | CEO & FOUNDER
Living in an Era of Data Surrealism | Hans Constandt | CEO & FOUNDERLiving in an Era of Data Surrealism | Hans Constandt | CEO & FOUNDER
Living in an Era of Data Surrealism | Hans Constandt | CEO & FOUNDER
 
Are You Ready for the Quickening!
Are You Ready for the Quickening!Are You Ready for the Quickening!
Are You Ready for the Quickening!
 

Último

04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 

Último (20)

04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 

How to Lie with Data and Statistics? | Iveta Lohovska, Principal Data Scientist | DN18

  • 1. How to Lie with Data and Statistics? Iveta Lohovska Principal Data Scientist @lohovska
  • 2. Who am I ? 2
  • 3. 3 Home Cities TransportationIndustrial “Moving from a good intuition to a Data Driven Decision” Who am I ? Principal Data Scientist Open Data, Computer Vision, IoT Data, HPC
  • 5. Mathematician Who are you ? 5 Data Engineer Data Scientists Computer Scientist Business Analyst
  • 6. Machine vs. human generated data? 6
  • 7. Machine vs. human generated data CRISP-MD CRoss-Industry Standard Process for Data Mining Plan Understand the business Acquire Data Understanding Transform Data Preparation Model Build &Testing Visualise Communicate and evaluate Score Productionize the model • What are your business goals? • What outcome are you trying to change • Are the answers you are looking for: • Descriptive • Predictive • Prescriptive • Are there constraints to the use of your data • What is the meaning and relevance of your data • What sampling methods were used • Cleanse the data • Analyse/reduce variables • Plot the data • Discover first insights into the data • Select and build a model • Train the model • Validate the model • Does the model teach us anything • Communicate and visualise the results • What did we learn • Do the results make sense • Can we deploy the model • Publish/deploy the model • Implement real- time data transformation for real-time scoring • Schedule data transformation for batch scoring • Make informed decisions
  • 9. How do you collect and process this analog information, to transform into useful business insights? 9 Information Insight Internet of Things
  • 10. How do you collect and process this analog information, to transform into useful business insights? 10 Information Insight Internet of Things Daily shipments in tons throughout the year “Winter schedule” “Summer schedule” “Winter schedule” JAN APR JUL OCT DEC Daily shipments in tons throughout the year JAN APR JUL OCT DEC “Transmetrics schedule” Traditional network capacity planning: without prediction Predictive network capacity planning: with prediction models Analog Data Digital Data
  • 11. What has been holding people back? 11 Cities TransportationIndustrial
  • 12. An Architecture for Machine Data(IoT) 12 Cities TransportationIndustrial
  • 14. 14 Collect data Clean data Identify patterns Make prediction Data Understanding hindsight insight foresight
  • 15. The type of the information you have for a device: 15 Colect Data Clean Data Identify patterns Make prediction Devices- Gateway Connectivity Eventpipeline AnalyticsApplications
  • 16. The type of the information you have for a device: 16 Assets/Beacons Access Points Wi-Fi Router Inventory Interface
  • 19. Even when they aren’t lying, sensors don’t always tell the whole truth 19 This might be a problem… … or loose device connection.
  • 20. Even when they aren’t lying, sensors don’t always tell the whole truth 20 What the sensor reads… … what the control unit stores and forwards
  • 21. Extracting useful signal from time-series sensor data requires ‘multi-genre’ Predictive Analytics – and additional data 21 Analytics Capture full-fidelity data to enable use- case specific event detection Interpolation of missing values, corrections, recalibration, etc. Identification of state change; matching Comparison and correlation with other systems(CRM, Marketing, etc.) Raw sensor data Raw sensor data from adjacent sensors; Master data Alert data; historical data; environmental data - Interpolation; Neural networks; Smoothing Time-series; Pattern recognition; Event mapping Graph; Clustering; Predictions; Decisions trees Whole device historical data Raw IoT data Cleansed IoT data Event Detection Path to association A A A Comparison and correlation with human observations Text and network analytics Maintenance and operational data Labelled IoT data Data Process
  • 22. Sensors typically don’t measure the quantity of interest directly 22
  • 23. By themselves, sensor data are of only limited value 23
  • 24. Key Takeaways? 24 • Understand how data process affect our decision making. • Engage with the numbers all around us.