SlideShare uma empresa Scribd logo
1 de 38
BIG DATA:
WHAT’S IT REALLY ABOUT?
Rich Brueckner
President, InsideBIGDATA
© TCC 2014, Confidential and Proprietary
AGENDA

•
•
•
•
•
•
•

Proper Intro
It’s Not the Data. It’s What You Use it for.
How Big Data Got Me Here

Case Studies
HPC and Big Data
Trends- What’s Next?
Call to Action

© TCC 2014, Confidential and Proprietary
THE PROPER INTRODUCTION

insideBIGDATA.com

© TCC 2014, Confidential and Proprietary
WHAT BIG DATA IS NOT

•
•
•

Size matters not.
It’s not about the Data, it’s what you do with
it.
Deriving insight from Data for purposes for
© TCC 2014, Confidential and Proprietary
FOR TODAY’S DISCUSSION:
BIG DATA = HIGH PERFORMANCE DATA ANALYSIS

© TCC 2014, Confidential and Proprietary
BIG DATA IS ABOUT TWO THINGS

© TCC 2014, Confidential and Proprietary
BIG DATA IS ABOUT TWO THINGS

First it’s about Money, Lots of
Money
© TCC 2014, Confidential and Proprietary
© TCC 2014, Confidential and Proprietary
SO WHAT’S THE SECOND THING?

© TCC 2014, Confidential and Proprietary
SO WHAT’S THE SECOND THING?

© TCC 2014, Confidential and Proprietary
SO WHAT’S THE SECOND THING?

© TCC 2014, Confidential and Proprietary
SO WHAT’S THE SECOND THING?
“ Bayesian probability provides a rational method for updating beliefs.”

© TCC 2014, Confidential and Proprietary
Big Data is about Degrees of Belief.
…even about how we feel about data itself.

© TCC 2014, Confidential and Proprietary
Can I prove that Big Data is about Degrees of Belief?

These 12 words just cost Facebook $18 billion of value:
"We did see a decrease in daily users specifically among
younger teens."

© TCC 2014, Confidential and Proprietary
BIG DATA- WHAT’S CHANGED?
• The nature of the Data: From
Sampling to Full Datasets
• Rise of unstructured data
• Acceptance of messiness in the
data
• N=All

http://bit.ly/1gZ7ZbD

© TCC 2014, Confidential and Proprietary
THE BIG DATA FRONTIER

There is no Eminent Domain

© TCC 2014, Confidential and Proprietary
SO WHERE ARE HEADED IN THIS TALK?

© TCC 2014, Confidential and Proprietary
BEWARE THE BIG DATA NAYSAYERS
WHAT IS THEIR AGENDA?

© TCC 2014, Confidential and Proprietary
IF BIG DATA IS SO POWERFUL,
WHY CAN’T IT PREDICT THE ECONOMY?
The Stock Market is all about Degrees of Belief!

• The economic system is non-linear.
• Therefore, even a small stimulus can create a
completely unexpected result

• It’s a complex dynamical system where you don’t
have clear knowledge of the initial conditions or the
conditions of the stimulus.

• Big Data can can help you see averages, find the
needle in the haystack, and help identify the accurate
models for predicting what the market is really going
to be doing.

© TCC 2014, Confidential and Proprietary
HOW BIG DATA GOT ME HERE

© TCC 2014, Confidential and Proprietary
HOW BIG DATA GOT ME HERE

•
•
•

REACH – Measure of total audience size.
RESONANCE – How much activity someone
creates when he/she publishes.
RELEVANCE – How relevant someone is to a
© TCC 2014, Confidential and Proprietary
CASE STUDY: SUMO FRAUD

•
•
•
•

Match-Fixing Scandal in 2011
Discovered through Big Data analysis.
Proven by Text Messages.
$200 tickets and $1 Million Dollar Champions.

© TCC 2014, Confidential and Proprietary
CASE STUDY: PREDICTION MACHINE

•
•

Tool simulates each and every game 50,000
times before making a pick.
32 million Americans average yearly spend
is $467 or $15 billion in total playing.

© TCC 2014, Confidential and Proprietary
CASE STUDY: MORTGAGE FRAUD
LexisNexis Risk Analysis

• Developed HPCC technology - an HPC alternative to Hadoop
• Database has 270 Million Individuals in the US Alone
• They know you’re that John Smith
• Graph Analysis spots Relationships
• Ability to spot mortgage fraud rings that were previously
undetectable
Annual Fraud Estimates:
California, at $864 million
New York at $278 million
Florida at $273 million

•
•
•

© TCC 2014, Confidential and Proprietary
© TCC 2014, Confidential and Proprietary
CASE STUDY: BIG BROTHER WATCHES BIG BROTHER

© TCC 2014, Confidential and Proprietary
WORLD’S CONVERGING:
HPC AND BIG DATA

Venus
(HPC)

Mars
(Big Data)

© TCC 2014, Confidential and Proprietary
PAYPAL CASE STUDY: HPC IN THE ENTERPRISE
“Examples of large
organizations using HPC
include PayPal, which IDC
estimates has saved over
$700 million by adopting HPC
for real-time detection of
online consumer fraud.”
- Steve Conway, IDC

© TCC 2014, Confidential and Proprietary
PAYPAL CASE STUDY: HPC IN THE ENTERPRISE

© TCC 2014, Confidential and Proprietary
HPC - WHAT’S CHANGED?

1986
CRAY X-MP/4
4 Vector Processors
800 Mflops (106)

2013
Tianhe 2
3,120,000 cores
33.86 Petaflops (1015)
© TCC 2014, Confidential and Proprietary
THE HPC FRONTIER

© TCC 2014, Confidential and Proprietary
TRENDS
• Rise of Real-Time Analytic
through in-memory
technologies
• Enterprises adopt HPC
technologies into workflow
• Internet of Things Feeds Big
Data Phenomenon and
ends up swallowing Big
Data as a meme.

© TCC 2014, Confidential and Proprietary
© TCC 2014, Confidential and Proprietary
SUMMARY
Big Data is about two things:

• Money! Making More and
Keeping it

• Degrees of Belief

© TCC 2014, Confidential and Proprietary
CALL TO ACTION

• Check out insideBIGDATA.com
• Buy this book:
• Big Data: A Revolution That Will
Transform How We
Live, Work, and Think by Viktor
Mayer-Schonberger and Kenneth
Cukier

© TCC 2014, Confidential and Proprietary
CALL TO ACTION
Read my SCI-FI Original:
The Observer Effect

http://bit.ly/theobservereffe
ct

© TCC 2014, Confidential and Proprietary
POLL: SO HOW DID I DO TONIGHT?

© TCC 2014, Confidential and Proprietary
PLEASE LET ME KNOW HOW I DID!
“Big Data: What’s It Really About?”

1) WITH YOUR MOBILE DEVICE:
In the TCCLive mobile app,
“Agenda” section,
then tap “Surveys”
- OR 2) FILL OUT THE PAPER VERSION
given to you at registration

© TCC 2014, Confidential and Proprietary

Mais conteúdo relacionado

Mais de inside-BigData.com

Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networksinside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoringinside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecastsinside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Updateinside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuninginside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODinside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Accelerationinside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficientlyinside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Erainside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computinginside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Clusterinside-BigData.com
 
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...inside-BigData.com
 
Adaptive Linear Solvers and Eigensolvers
Adaptive Linear Solvers and EigensolversAdaptive Linear Solvers and Eigensolvers
Adaptive Linear Solvers and Eigensolversinside-BigData.com
 
Scientific Applications and Heterogeneous Architectures
Scientific Applications and Heterogeneous ArchitecturesScientific Applications and Heterogeneous Architectures
Scientific Applications and Heterogeneous Architecturesinside-BigData.com
 

Mais de inside-BigData.com (20)

Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
 
Data Parallel Deep Learning
Data Parallel Deep LearningData Parallel Deep Learning
Data Parallel Deep Learning
 
Making Supernovae with Jets
Making Supernovae with JetsMaking Supernovae with Jets
Making Supernovae with Jets
 
Adaptive Linear Solvers and Eigensolvers
Adaptive Linear Solvers and EigensolversAdaptive Linear Solvers and Eigensolvers
Adaptive Linear Solvers and Eigensolvers
 
Scientific Applications and Heterogeneous Architectures
Scientific Applications and Heterogeneous ArchitecturesScientific Applications and Heterogeneous Architectures
Scientific Applications and Heterogeneous Architectures
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 

Último (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 

Big Data - What is it Really About?

  • 1. BIG DATA: WHAT’S IT REALLY ABOUT? Rich Brueckner President, InsideBIGDATA © TCC 2014, Confidential and Proprietary
  • 2. AGENDA • • • • • • • Proper Intro It’s Not the Data. It’s What You Use it for. How Big Data Got Me Here Case Studies HPC and Big Data Trends- What’s Next? Call to Action © TCC 2014, Confidential and Proprietary
  • 3. THE PROPER INTRODUCTION insideBIGDATA.com © TCC 2014, Confidential and Proprietary
  • 4. WHAT BIG DATA IS NOT • • • Size matters not. It’s not about the Data, it’s what you do with it. Deriving insight from Data for purposes for © TCC 2014, Confidential and Proprietary
  • 5. FOR TODAY’S DISCUSSION: BIG DATA = HIGH PERFORMANCE DATA ANALYSIS © TCC 2014, Confidential and Proprietary
  • 6. BIG DATA IS ABOUT TWO THINGS © TCC 2014, Confidential and Proprietary
  • 7. BIG DATA IS ABOUT TWO THINGS First it’s about Money, Lots of Money © TCC 2014, Confidential and Proprietary
  • 8. © TCC 2014, Confidential and Proprietary
  • 9. SO WHAT’S THE SECOND THING? © TCC 2014, Confidential and Proprietary
  • 10. SO WHAT’S THE SECOND THING? © TCC 2014, Confidential and Proprietary
  • 11. SO WHAT’S THE SECOND THING? © TCC 2014, Confidential and Proprietary
  • 12. SO WHAT’S THE SECOND THING? “ Bayesian probability provides a rational method for updating beliefs.” © TCC 2014, Confidential and Proprietary
  • 13. Big Data is about Degrees of Belief. …even about how we feel about data itself. © TCC 2014, Confidential and Proprietary
  • 14. Can I prove that Big Data is about Degrees of Belief? These 12 words just cost Facebook $18 billion of value: "We did see a decrease in daily users specifically among younger teens." © TCC 2014, Confidential and Proprietary
  • 15. BIG DATA- WHAT’S CHANGED? • The nature of the Data: From Sampling to Full Datasets • Rise of unstructured data • Acceptance of messiness in the data • N=All http://bit.ly/1gZ7ZbD © TCC 2014, Confidential and Proprietary
  • 16. THE BIG DATA FRONTIER There is no Eminent Domain © TCC 2014, Confidential and Proprietary
  • 17. SO WHERE ARE HEADED IN THIS TALK? © TCC 2014, Confidential and Proprietary
  • 18. BEWARE THE BIG DATA NAYSAYERS WHAT IS THEIR AGENDA? © TCC 2014, Confidential and Proprietary
  • 19. IF BIG DATA IS SO POWERFUL, WHY CAN’T IT PREDICT THE ECONOMY? The Stock Market is all about Degrees of Belief! • The economic system is non-linear. • Therefore, even a small stimulus can create a completely unexpected result • It’s a complex dynamical system where you don’t have clear knowledge of the initial conditions or the conditions of the stimulus. • Big Data can can help you see averages, find the needle in the haystack, and help identify the accurate models for predicting what the market is really going to be doing. © TCC 2014, Confidential and Proprietary
  • 20. HOW BIG DATA GOT ME HERE © TCC 2014, Confidential and Proprietary
  • 21. HOW BIG DATA GOT ME HERE • • • REACH – Measure of total audience size. RESONANCE – How much activity someone creates when he/she publishes. RELEVANCE – How relevant someone is to a © TCC 2014, Confidential and Proprietary
  • 22. CASE STUDY: SUMO FRAUD • • • • Match-Fixing Scandal in 2011 Discovered through Big Data analysis. Proven by Text Messages. $200 tickets and $1 Million Dollar Champions. © TCC 2014, Confidential and Proprietary
  • 23. CASE STUDY: PREDICTION MACHINE • • Tool simulates each and every game 50,000 times before making a pick. 32 million Americans average yearly spend is $467 or $15 billion in total playing. © TCC 2014, Confidential and Proprietary
  • 24. CASE STUDY: MORTGAGE FRAUD LexisNexis Risk Analysis • Developed HPCC technology - an HPC alternative to Hadoop • Database has 270 Million Individuals in the US Alone • They know you’re that John Smith • Graph Analysis spots Relationships • Ability to spot mortgage fraud rings that were previously undetectable Annual Fraud Estimates: California, at $864 million New York at $278 million Florida at $273 million • • • © TCC 2014, Confidential and Proprietary
  • 25. © TCC 2014, Confidential and Proprietary
  • 26. CASE STUDY: BIG BROTHER WATCHES BIG BROTHER © TCC 2014, Confidential and Proprietary
  • 27. WORLD’S CONVERGING: HPC AND BIG DATA Venus (HPC) Mars (Big Data) © TCC 2014, Confidential and Proprietary
  • 28. PAYPAL CASE STUDY: HPC IN THE ENTERPRISE “Examples of large organizations using HPC include PayPal, which IDC estimates has saved over $700 million by adopting HPC for real-time detection of online consumer fraud.” - Steve Conway, IDC © TCC 2014, Confidential and Proprietary
  • 29. PAYPAL CASE STUDY: HPC IN THE ENTERPRISE © TCC 2014, Confidential and Proprietary
  • 30. HPC - WHAT’S CHANGED? 1986 CRAY X-MP/4 4 Vector Processors 800 Mflops (106) 2013 Tianhe 2 3,120,000 cores 33.86 Petaflops (1015) © TCC 2014, Confidential and Proprietary
  • 31. THE HPC FRONTIER © TCC 2014, Confidential and Proprietary
  • 32. TRENDS • Rise of Real-Time Analytic through in-memory technologies • Enterprises adopt HPC technologies into workflow • Internet of Things Feeds Big Data Phenomenon and ends up swallowing Big Data as a meme. © TCC 2014, Confidential and Proprietary
  • 33. © TCC 2014, Confidential and Proprietary
  • 34. SUMMARY Big Data is about two things: • Money! Making More and Keeping it • Degrees of Belief © TCC 2014, Confidential and Proprietary
  • 35. CALL TO ACTION • Check out insideBIGDATA.com • Buy this book: • Big Data: A Revolution That Will Transform How We Live, Work, and Think by Viktor Mayer-Schonberger and Kenneth Cukier © TCC 2014, Confidential and Proprietary
  • 36. CALL TO ACTION Read my SCI-FI Original: The Observer Effect http://bit.ly/theobservereffe ct © TCC 2014, Confidential and Proprietary
  • 37. POLL: SO HOW DID I DO TONIGHT? © TCC 2014, Confidential and Proprietary
  • 38. PLEASE LET ME KNOW HOW I DID! “Big Data: What’s It Really About?” 1) WITH YOUR MOBILE DEVICE: In the TCCLive mobile app, “Agenda” section, then tap “Surveys” - OR 2) FILL OUT THE PAPER VERSION given to you at registration © TCC 2014, Confidential and Proprietary