Big Data & Data Science W's
Emanuele Della Valle
@manudellavalle
Prof. @polimi & Founder @fluxedo_
W's
18/06/2018 @manudellavalle - http://emanueledellavalle.org 2
Why?
• In many organizations decisions are made by
"questionable" methodologies such as
– Highest Paid Person Opinion (HiPPO)
– Flipism (all decisions are made by flipping a coin)
• This could have been the right approach in the '70s …
– See the "Theory of Bounded Rationality" by Herbert Simon
[Quote image; source: http://www.azquotes.com/quote/139996]
• … but in the Big Data era one can dream of a data-driven organization
Why?
• Data-Driven Organization
Why?
Decisions no longer have to be made in the dark
or based on gut instinct; they can be based on
evidence, experiments and more accurate
forecasts.
-- McKinsey
Why?
• Data-driven organizations
– perform better
• The data shows where they can streamline their processes
– are operationally more predictable
• Data insights fuel current and future decision making
– are more profitable
• Constant improvements and better predictions help to
outsmart the competition and improve innovation.
Why?
• Moneyball: data + analysis to win games
[source: https://www.imdb.com/title/tt1210166/]
What's Big Data?
[source: IBM, 2012]
What's Big Data?
• Big Data is "crude oil" … that we have to
– Extract
– Transport in mega-tankers
– Ship through pipelines
– Store in massive silos
– …
What's Data Science?
• Data Science is "refining crude oil"
[source: http://allabtinstru.blogspot.com/2016/09/ProcessofRefiningCrudeOil.html]
What's Data Science?
• The Science [and Art] of…
– Discovering what we don’t know from data
– Obtaining predictive, actionable insight from data
– Creating Data Products that have business impact now
– Communicating relevant business stories from data
– Building confidence in decisions that drive business value
Who's a Data Scientist?
• Drew Conway, 2010
How?
• Statistics starts with data
• Two goals of analyzing data
– Descriptions: how nature associates responses to inputs
– Predictions: response for future input variables
[source: Statistical Modeling: The Two Cultures. Leo Breiman, 2001]
[Figure: x (independent variable) → nature → y (response variable)]
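The two goals can be illustrated with a minimal sketch. It assumes a hypothetical `nature` function (a noisy line with slope 2 and intercept 1, invented here for illustration) and uses an ordinary least-squares line as the model; neither is taken from Breiman's paper or from the slides.

```python
import random


def nature(x, rng):
    """Hypothetical black box linking input x to response y.

    In practice this association is unknown; the slope 2.0, the
    intercept 1.0 and the noise level are assumptions for this sketch.
    """
    return 2.0 * x + 1.0 + rng.gauss(0.0, 0.1)


def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Statistics starts with data: observe (x, y) pairs produced by nature.
rng = random.Random(42)
xs = [i / 10 for i in range(100)]
ys = [nature(x, rng) for x in xs]

# Goal 1, description: estimate how nature associates responses to inputs.
slope, intercept = fit_line(xs, ys)


# Goal 2, prediction: estimate the response for a future input.
def predict(x_new):
    return slope * x_new + intercept


print(f"description: y is roughly {slope:.2f} * x + {intercept:.2f}")
print(f"prediction for x = 20: {predict(20.0):.2f}")
```

The fitted coefficients serve the descriptive goal, while `predict` serves the predictive one; a different model family could be swapped in without changing that split.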
How?
[source: Marc Andrews, 2014]
Leverage more of the data being captured
How?
Reduce effort required to leverage data
[source: Marc Andrews, 2014]
How?
Data-driven exploration looking for correlation
[source: Marc Andrews, 2014]
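Data-driven exploration for correlation can be sketched as computing a correlation coefficient for every pair of columns and ranking the pairs. The sketch below uses the Pearson coefficient; the table, its column names and its values are hypothetical and invented purely for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Hypothetical table: one list of observations per column.
data = {
    "temperature": [12, 15, 18, 21, 24, 27, 30],
    "ice_cream_sales": [20, 26, 33, 41, 47, 55, 61],
    "umbrella_sales": [30, 28, 25, 19, 16, 12, 9],
    "newsletter_opens": [5, 9, 4, 8, 5, 9, 4],
}


def rank_correlations(table):
    """Return (column_a, column_b, r) for every pair, strongest |r| first."""
    pairs = [
        (a, b, pearson(table[a], table[b]))
        for a, b in combinations(table, 2)
    ]
    return sorted(pairs, key=lambda pair: abs(pair[2]), reverse=True)


ranked = rank_correlations(data)
for a, b, r in ranked:
    print(f"{a} vs {b}: r = {r:+.2f}")
```

A high-ranking pair is only a candidate for closer inspection: correlation alone does not establish that one variable causes the other.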
Your butcher …
… at scale!
How?
Leverage data as it is captured
[source: Marc Andrews, 2014]
How?
[source: https://docs.microsoft.com/en-us/azure/machine-learning/team-data-science-process/]
How?
Overall picture by Gartner
Where?
Improve public safety and reduce violent crime through data analytics
-41% murders | -27% crimes
[source: https://www.ted.com/talks/anne_milgram_why_smart_statistics_are_the_key_to_fighting_crime]
What about cybersec?
Credits
• Big Data [sorry] & Data Science: What Does a Data Scientist Do? Carlos Somohano, 2013
– https://www.slideshare.net/datasciencelondon/big-data-sorry-data-science-what-does-a-data-scientist-do-world
• Becoming a data-driven organization: The what, why and how. SAS, 2018
– https://www.sas.com/en_us/whitepapers/becoming-data-driven-organization-109150.html
• Never trust summary statistics alone; always visualize your data. Alberto Cairo, 2016
– http://www.thefunctionalart.com/2016/08/download-datasaurus-never-trust-summary.html
• 2017 Planning Guide for Data and Analytics. John Hagerty (Gartner), 2016
– https://www.gartner.com/binaries/content/assets/events/keywords/catalyst/catus8/2017_planning_guide_for_data_analytics.pdf
Thank you!
Any questions?
Emanuele Della Valle
@manudellavalle
Prof. @polimi & Founder @fluxedo_
