SlideShare uma empresa Scribd logo
1 de 44
Photo Credit: http://www.crosseyedlife.com/teaching-resources/
Menu
Who am I?
Early adopters of Hadoop
Next generation use cases
Changing big data architectures
Art of the possible
My request
Questions
Appetiz
er
Main
Dessert
Who am I?
Google, Software Engineer
Personalized Search
Personalized Recommendations
WibiData, CTO
Real-time Personalization Platform
Customer Use Cases
EARLY ADOPTERS OF HADOOP
Early AdopterEarly Majority
Collect Everything
Keep Everything
Ask Anything
Collect Everything
Collect Everything
Collect Everything
Collect Everything
Collect Everything
Maybe I
should, too?
Keep Everything
Blind Spots
1. New, high-value use cases
1. Architectural changes to
support broader use cases
1. The ultimate strategic
goals of early adopters
NEXT GENERATION USE CASES
Blind Spot Number 1
Recommendations
Recommendations
Search
Prediction and Prevention
Targeted Offers
Customer Experience Optimization
Clearly, early adopters have
moved beyond ETL.
Life After ETL
Understanding
360-degree customer views
Visualization
Graphs
Exploration
Trends
Customer segmentation
ROI
Prediction
Action
Recommendations
Prevention
Mobile
Offers
Recommendations
Localization
Search
Personalization
Evolution of Enterprise Data
Collect Organize Understand ActUnderstandUnderstand
CHANGING ARCHITECTURE
Blind Spot Number 2
Sometimes, supporting a new use case
requires a different architecture.
Evolution of Enterprise Data
Collect Organize Understand Act
Collect Organize Understand
Key Ingredients
Data
Consolidation
Organization
Experimentation
Try something!
Rapid iteration
Tuning
Deployment
Evaluation
Real time
Required to Understand
Required to Act
Web Web Web
HDFS
Logs
Txns
POS
Third
Party
Data
1. Collect
MapReduce
Web Web Web
HDFS
Logs
Txns
POS
Third
Party
Data
1. Collect
2. Organize
Data Warehouse
Web Web Web
HDFS
POS
Third
Party
1. Collect
2. Organize
3. Understand
MapReduce
Data Warehouse
Web Web Web
HDFS
POS
Third
Party
1. Collect
2. Organize
3. Understand
4. Act
MapReduceHBase
Data Warehouse
Key Ingredients
Data
Consolidation
Organization
Experimentation
Try something!
Rapid iteration
Tuning
Deployment
Evaluation
Real time
Required to Understand
Required to Act
Did we get any
of these?
Early Adopter Migration Strategies
Add serving capability
Key-value store
Indexing
Add stream processing
Storm
Samza
Lambda architecture
Add both
Web Web Web
HDFS
POS
Third
Party
1. Collect
2. Organize
3. Understand
4. Act
MapReduceHBase
Data Warehouse
HBaseStorm
Query
BatchServingSpeed
Key Ingredients
Data
Consolidation
Organization
Experimentation
Try something!
Rapid iteration
Tuning
Deployment
Evaluation
Real time
Required to Understand
Required to Act
Did we get any
of these?
Web Web Web
HDFS
POS
Third
Party
1. Collect
2. Organize
3. Understand
4. Act
MapReduce
Data Warehouse
HBaseStorm
Query
BatchServingSpeed
ART OF THE POSSIBLE
Blind Spot Number 3
Photo credit: http://mediahub.olive.co.uk/blog/the-art-of-the-possible
You can’t build a data platform to solve
a problem you haven’t identified yet.
What’s Next?
Collect Organize Understand Act ?
What’s Next?
Collect
OrganizeUnderstand
Act
Where is the Value?
Collect Organize Understand Act
0%
20%
40%
60%
80%
100%
Collect Organize Understand Act
“As the amount of data goes up,
the importance of human judgment
should go down”
- Andrew McAfee
HBR Blog
Question
Hypothesis
PredictionTesting
Analysis
Hire smarter people
Faster EDW
Hire smarter peopleFaster Deployment
Faster EDW
Testing
What does this all mean?
The real value is in next generation “action”
use cases
The architecture for “action” is different
Design for your problem, since you don’t know
the art of the possible.
Requirements first, then technology
My Request
Stop building faster data warehouses.
You already understand your data.
Turn your understanding into action.
Questions?
Garrett Wu
http://www.wibidata.com
gwu@wibidata.com

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Activating Governance for End Users in Office 365 and SharePoint - SPS Vanc...
Activating Governance for End Users in Office 365 and SharePoint -   SPS Vanc...Activating Governance for End Users in Office 365 and SharePoint -   SPS Vanc...
Activating Governance for End Users in Office 365 and SharePoint - SPS Vanc...
 
Google Survivor Tips at 2011 SMX Advanced
Google Survivor Tips at 2011 SMX AdvancedGoogle Survivor Tips at 2011 SMX Advanced
Google Survivor Tips at 2011 SMX Advanced
 
Belvilla
BelvillaBelvilla
Belvilla
 
SPS Boston 2016 - Drive on the FastTrack to SharePoint and Office 365 End Use...
SPS Boston 2016 - Drive on the FastTrack to SharePoint and Office 365 End Use...SPS Boston 2016 - Drive on the FastTrack to SharePoint and Office 365 End Use...
SPS Boston 2016 - Drive on the FastTrack to SharePoint and Office 365 End Use...
 
How to Build a Successful Data Team - Florian Douetteau @ PAPIs Connect
How to Build a Successful Data Team - Florian Douetteau @ PAPIs ConnectHow to Build a Successful Data Team - Florian Douetteau @ PAPIs Connect
How to Build a Successful Data Team - Florian Douetteau @ PAPIs Connect
 
The Road to Awesome SharePoint Adoption - SPTechCon June 2016
The Road to Awesome SharePoint Adoption - SPTechCon June 2016The Road to Awesome SharePoint Adoption - SPTechCon June 2016
The Road to Awesome SharePoint Adoption - SPTechCon June 2016
 
SPTechCon San Francsico 2016 When SharePoint Needs a Spark - Building an En...
SPTechCon San Francsico 2016   When SharePoint Needs a Spark - Building an En...SPTechCon San Francsico 2016   When SharePoint Needs a Spark - Building an En...
SPTechCon San Francsico 2016 When SharePoint Needs a Spark - Building an En...
 
Staffing your analytics team: 6 skill sets
Staffing your analytics team:  6 skill setsStaffing your analytics team:  6 skill sets
Staffing your analytics team: 6 skill sets
 
SPTechCon San Francsico 2016 When SharePoint Needs a Spark - Building an En...
SPTechCon San Francsico 2016   When SharePoint Needs a Spark - Building an En...SPTechCon San Francsico 2016   When SharePoint Needs a Spark - Building an En...
SPTechCon San Francsico 2016 When SharePoint Needs a Spark - Building an En...
 
Introduction to Mahout with HDInsight
Introduction to Mahout with HDInsightIntroduction to Mahout with HDInsight
Introduction to Mahout with HDInsight
 
Drive on the FastTrack to SharePoint End User Adoption in Your Organization -...
Drive on the FastTrack to SharePoint End User Adoption in Your Organization -...Drive on the FastTrack to SharePoint End User Adoption in Your Organization -...
Drive on the FastTrack to SharePoint End User Adoption in Your Organization -...
 
San Francisco SharePoint User Group - September 2016 - Drive on the FastTrack...
San Francisco SharePoint User Group - September 2016 - Drive on the FastTrack...San Francisco SharePoint User Group - September 2016 - Drive on the FastTrack...
San Francisco SharePoint User Group - September 2016 - Drive on the FastTrack...
 
The Limitations of Web Scraping Tools
The Limitations of Web Scraping ToolsThe Limitations of Web Scraping Tools
The Limitations of Web Scraping Tools
 
Drive On the Fast Track to SharePoint End User Adoption - SPS Toronto 2016
Drive On the Fast Track to SharePoint End User Adoption - SPS Toronto 2016Drive On the Fast Track to SharePoint End User Adoption - SPS Toronto 2016
Drive On the Fast Track to SharePoint End User Adoption - SPS Toronto 2016
 
BigData Meets the Federal Data Center
BigData Meets the Federal Data CenterBigData Meets the Federal Data Center
BigData Meets the Federal Data Center
 
Christoph Luetke Schelhowe - Data for Everyone
Christoph Luetke Schelhowe  - Data for EveryoneChristoph Luetke Schelhowe  - Data for Everyone
Christoph Luetke Schelhowe - Data for Everyone
 
O'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data ExhaustO'Reilly Strata: Distilling Data Exhaust
O'Reilly Strata: Distilling Data Exhaust
 
RightScale Webinar: Introducing Cloud Analytics
RightScale Webinar: Introducing Cloud AnalyticsRightScale Webinar: Introducing Cloud Analytics
RightScale Webinar: Introducing Cloud Analytics
 
Data Mashups -Data Science Summit
Data Mashups -Data Science SummitData Mashups -Data Science Summit
Data Mashups -Data Science Summit
 
Publishers presentation ucl
Publishers presentation uclPublishers presentation ucl
Publishers presentation ucl
 

Destaque

Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...
Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...
Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...
Innovative Management Services
 
Тематическое планирование 8 класс
Тематическое планирование 8 классТематическое планирование 8 класс
Тематическое планирование 8 класс
koneqq
 
Doctor Faustus by Micaela & Rocío
Doctor Faustus by Micaela & RocíoDoctor Faustus by Micaela & Rocío
Doctor Faustus by Micaela & Rocío
Andrea Izzo
 
Aggietarium slideshow final
Aggietarium slideshow finalAggietarium slideshow final
Aggietarium slideshow final
Ana Monzon
 
План самообразования
План самообразованияПлан самообразования
План самообразования
koneqq
 
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service Providers
DataWorks Summit
 
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin BuzzwordsDataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku
 
Кроссворд "Грибы"
Кроссворд "Грибы"Кроссворд "Грибы"
Кроссворд "Грибы"
koneqq
 

Destaque (20)

Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...
 
Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...
Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...
Open-BDA Hadoop Summit 2014 - Mr. Krish Krishnan (Driving Business Value – Bi...
 
Getting Ahead of the Game: How a gaming studio gets customer insights from bi...
Getting Ahead of the Game: How a gaming studio gets customer insights from bi...Getting Ahead of the Game: How a gaming studio gets customer insights from bi...
Getting Ahead of the Game: How a gaming studio gets customer insights from bi...
 
Practical Placement
Practical PlacementPractical Placement
Practical Placement
 
Oxleas_Review_2012_website_version_1
Oxleas_Review_2012_website_version_1Oxleas_Review_2012_website_version_1
Oxleas_Review_2012_website_version_1
 
Тематическое планирование 8 класс
Тематическое планирование 8 классТематическое планирование 8 класс
Тематическое планирование 8 класс
 
Doctor Faustus by Micaela & Rocío
Doctor Faustus by Micaela & RocíoDoctor Faustus by Micaela & Rocío
Doctor Faustus by Micaela & Rocío
 
Aggietarium slideshow final
Aggietarium slideshow finalAggietarium slideshow final
Aggietarium slideshow final
 
Pay periods use show
Pay periods use showPay periods use show
Pay periods use show
 
Pure Storage Customer Business and IT Transformation
Pure Storage Customer Business and IT TransformationPure Storage Customer Business and IT Transformation
Pure Storage Customer Business and IT Transformation
 
План самообразования
План самообразованияПлан самообразования
План самообразования
 
Active and passive voice
Active and passive voice Active and passive voice
Active and passive voice
 
Step by step essay
Step by step essayStep by step essay
Step by step essay
 
Zachatie Bulgaria
Zachatie BulgariaZachatie Bulgaria
Zachatie Bulgaria
 
Enterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageEnterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble Storage
 
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service Providers
 
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin BuzzwordsDataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin Buzzwords
 
Auto loans
Auto loansAuto loans
Auto loans
 
Кроссворд "Грибы"
Кроссворд "Грибы"Кроссворд "Грибы"
Кроссворд "Грибы"
 
Earth science pptx
Earth science pptxEarth science pptx
Earth science pptx
 

Semelhante a Move Beyond ETL: Tapping the True Business Value of Hadoop

Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Jonathan Seidman
 
Gartner peer forum sept 2011 orbitz
Gartner peer forum sept 2011   orbitzGartner peer forum sept 2011   orbitz
Gartner peer forum sept 2011 orbitz
Raghu Kashyap
 
Web analyticsandbigdata techweek2011
Web analyticsandbigdata techweek2011Web analyticsandbigdata techweek2011
Web analyticsandbigdata techweek2011
Raghu Kashyap
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
Raul Chong
 
Hortonworks.HadoopPatternsOfUse.201304
Hortonworks.HadoopPatternsOfUse.201304Hortonworks.HadoopPatternsOfUse.201304
Hortonworks.HadoopPatternsOfUse.201304
James Kenney
 
Big data and Hadoop Training Brochure
Big data and Hadoop Training Brochure Big data and Hadoop Training Brochure
Big data and Hadoop Training Brochure
MCAL Management Consulting and Advanced Learning
 

Semelhante a Move Beyond ETL: Tapping the True Business Value of Hadoop (20)

Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
 
Gartner peer forum sept 2011 orbitz
Gartner peer forum sept 2011   orbitzGartner peer forum sept 2011   orbitz
Gartner peer forum sept 2011 orbitz
 
Search Analytics at Enterprise Search Summit Fall 2011
Search Analytics at Enterprise Search Summit Fall 2011Search Analytics at Enterprise Search Summit Fall 2011
Search Analytics at Enterprise Search Summit Fall 2011
 
Web analyticsandbigdata techweek2011
Web analyticsandbigdata techweek2011Web analyticsandbigdata techweek2011
Web analyticsandbigdata techweek2011
 
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
 
Hortonworks.HadoopPatternsOfUse.201304
Hortonworks.HadoopPatternsOfUse.201304Hortonworks.HadoopPatternsOfUse.201304
Hortonworks.HadoopPatternsOfUse.201304
 
Embracing Hadoop with a musical touch!
Embracing Hadoop with a musical touch!Embracing Hadoop with a musical touch!
Embracing Hadoop with a musical touch!
 
5 Tips to Bulletproof Your Analytics Implementation
5 Tips to Bulletproof Your Analytics Implementation5 Tips to Bulletproof Your Analytics Implementation
5 Tips to Bulletproof Your Analytics Implementation
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
 
Big data and Hadoop Training Brochure
Big data and Hadoop Training Brochure Big data and Hadoop Training Brochure
Big data and Hadoop Training Brochure
 
Keynote - Cloudera - Mike Olson - Hadoop World 2010
Keynote - Cloudera - Mike Olson - Hadoop World 2010Keynote - Cloudera - Mike Olson - Hadoop World 2010
Keynote - Cloudera - Mike Olson - Hadoop World 2010
 
Cloudera - Mike Olson - Hadoop World 2010
Cloudera - Mike Olson - Hadoop World 2010Cloudera - Mike Olson - Hadoop World 2010
Cloudera - Mike Olson - Hadoop World 2010
 
Operationalizing Data Analytics
Operationalizing Data AnalyticsOperationalizing Data Analytics
Operationalizing Data Analytics
 
Hadoop World 2011: Extending Enterprise Data Warehouse with Hadoop - Jonathan...
Hadoop World 2011: Extending Enterprise Data Warehouse with Hadoop - Jonathan...Hadoop World 2011: Extending Enterprise Data Warehouse with Hadoop - Jonathan...
Hadoop World 2011: Extending Enterprise Data Warehouse with Hadoop - Jonathan...
 
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
 
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
 
The Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value ThereafterThe Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value Thereafter
 
Finding business value in Big Data
Finding business value in Big DataFinding business value in Big Data
Finding business value in Big Data
 

Mais de DataWorks Summit

HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
DataWorks Summit
 

Mais de DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 

Move Beyond ETL: Tapping the True Business Value of Hadoop

Notas do Editor

  1. This talk is really about blind spots. I believe there are three that are ultimately keeping many of you from “tapping the true value of Hadoop.”
  2. How are we going to store all the information on the internet? Google File System (GFS) How are we going to analyze is? MapReduce (MR) How are we going to do something with it? BigTable (BT)
  3. “I, too, need to store large amounts of data!” These are technology companies The followers on this wave are in other businesses, but need to use technology to move forward They waited to see if these technologies would really work
  4. Three things the follower does not see: New use cases from a few early adopters What changes about the architectures to support new use cases Where the early adopters are ultimately going
  5. I don’t actually mean the use cases that are way out there. I mean the very next ones that you early adopters are doing now, and you should be doing next (this year or next year)
  6. We all know product recommendations
  7. Recommendations are not just for products. Recommend content Recommend people Recommend actions
  8. Auto-complete Recommendations within search Personalized search results Search within the enterprise
  9. Predict energy usage (Opower) Predict weather (Climate Corp) Predict device returns (Motorola)
  10. Deals to tablet and mobile devices
  11. Optimizing experiences on each channel The key ingredients here are: Data consolidation (get everything in one place so it is accessible) Experimentation (try different things on live traffic) Rapid iteration (optimize by making changes quickly)
  12. You should, too. At the very least, you should start doing “traditional BI” on big data.
  13. Next generation use cases are in two categories: Analysis: Now that we have data, and it is consolidated, let’s ask more questions. Action: Now that we have data, and it is consolidated, let’s put it to work.
  14. Followers (early majority) are at the Understand phase. Early adopters are going deep into Understand, or moving on to Act. I really want to talk about the last phase. What are the key ingredients?
  15. Early adopters are changing their system architectures: They are adding new-age tools They are removing and replacing outdated systems They are restructuring and shuffling components
  16. Review the difference between building upon understanding versus moving into action.
  17. You got data delivered back into the application, but did you include any of the key ingredients?
  18. Let’s focus on the early adopters who migrated into action. What have they done? We have already added the KVStore, HBase, to connect data back to the frontends. We can add a stream processing engine to get real-time. We can use the Lambda architecture to get all sorts of nice properties like immutable data sources, and make only incremental additions.
  19. What does it look like to go through this process of “going deep” into action? Add room for a stream processing system (Storm, Samza) Add a query layer on top to join the results from the batch layer from the speed layer
  20. You got data delivered back into the application, but did you include any of the key ingredients?
  21. To make a change to something you need to edit the batch layer, the speed layer, and potentially the query that joins the two.
  22. You don’t have enough data to see the future of where people are going.
  23. What’s next?
  24. What’s next?
  25. I don’t know how to quantify the business value. I’ll leave that to Gartner. But I hope that I can convince you that: The intrinsic value of each phase is greater than the previous. What good is collecting data if you don’t do anything with it? What good is it if you don’t understand it? The realized value to the business at each phase is even more extreme that what I’ve shown here. What good is understanding unless you do something with it? You can do something with it as a human being, but many more decisions now are made by machines, not humans.
  26. How long does this take? The testing, aka experiment design, development, and deployment is the bottleneck. Why are you spending so much money working on increasing the speed of these other phases?
  27. What you would design to solve the first three phases (up to understanding) is different from what you would build to solve “action.” We don’t know what’s coming next. Design for your problem. And do so without just blindly following the early adopters. Instead, start with your requirements, and design with purpose.