SlideShare uma empresa Scribd logo
1 de 23
The State of Big Data Adoption in the Enterprise Tony Baer [email_address] November 2011
[object Object],[object Object],[object Object],[object Object],Agenda
Survey sample by region Europe North America APAC 27% 33% 40% Sample size = 150 organizations Minimum > 1+ TByte data in enterprise DWs/analytic data stores
Company size & IT budget <$10m Don't Know $50m+ $10m - $50m 9% 7% 7% 77% Annual IT budget $50m - $250m <$10m >$1bn 22% 18% 17% 21% 22% Company size Big data is not limited to big companies! $250m -$1b $10m - $50m
Survey sample by vertical industry Other Media & Entertainment Mfg Public sector (government)  ICT Healthcare Financial services Retail 5% 16% 13% 5% 21% 20% 5% 9% Transportation 6%
[object Object],[object Object],[object Object],[object Object],Agenda
Mean analytic data store size Number of respondents 1 150 1000 3TB = mean 75 3000 5000 7000 Terabytes 2000 4000 6000 8000
Analytic data store size by data type 0% 10% 20% 30% No unstructured data 1 - 5 TBytes 5 - 10 TBytes 10 - 20 TBytes 20 - 50 TBytes 50 - 100 TBytes 100 - 500 TBytes 500 - 1000 Tbtyes Over 1000 TBytes Structured data Unstructured/ Variably structured data
Popular analytic data types 0% 30% 60% 90% B2B   transactional Supply Chain Mgmt Call Detail Records (CDRs) Internet search indices Legal/regulatory documentation Legacy Apps CRM ERP Email/Messaging Currently Planned
Variably structured data – current & future demand Weighted score Social Media Web Logs Sensory Graph Rich media  Text Time series All sectors Financial Svcs Healthcare Retail Public Sector Healthcare Healthcare Healthcare Healthcare FS FS Retail Retail Healthcare Public Sector FS FS FS Healthcare
[object Object],[object Object],[object Object],[object Object],Agenda
Business objectives Big Data analytic projects change the analytics, but not the objectives 0% 30% 60% Other Advanced analytics Competitive positioning Business agility Regulatory compliance ID hidden business trends Predictive analytic insights Business forecasting Customer service Strategic decision making Operational decision making
Business sponsors The players are currently the same 0% 30% 60% Supply   chain Other Customer management Internal operations Sales and marketing Finance
Advanced SQL analytic database use A 5-year head start on NoSQL 0% 2% 4% 6% 8% 10% 12% Oracle Exadata Sybase IQ Aster Data IBM Netezza Sand Greenplum Kognitio Teradata ParAccel Vertica Infobright
NoSQL platform use Considering Testing/ Evaluating In Production  0% 2% 4% 6% 8% 10% 12% Amazon SimpleDB Hadoop MongoDB Membase Cassandra CouchDB
Non-SQL languages/frameworks for analytic queries 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% Java None PHP C or C++ Other language Perl Other framework/technology? Python Ruby MapReduce Now Future
Big Data in the cloud? No Yes Considering in next 12 months Considering in next 12 - 36 months  24% 9% 9% 58%
Technical concerns Analytic/NoSQL databases technology immaturity Current DW investments not easily extended In-house analytic skills Increased capital investment Visualizing large datasets Technology infrastructure costs Analysis requires additional IT resources Data integration complexity Systems & network performance impact High data volatility/refresh cycles Information filtering technology Data storage issues & costs Performance issues & query response times Data quality & governance Not Important Important
[object Object],[object Object],[object Object],[object Object],Agenda
Who will deliver Big Data technologies & solutions? Plenty of room for new blood 0% 20% 40% 60% Existing data warehousing/ BI analytics supplier In-house Systems integrator Specialist provider Other technology provider
Big Data IT budget plans Based on 25% of respondents provided actual numbers Current IT budget Next year's IT budget Next 2 - 5 yrs Under $100,000 $100,000 - $1 million 1 - 5 million Over $5 million
Big Data budget plans 0% 10% 20% 30% 40% 27% 33% 44% Current IT budget Next year's IT budget In next 2 - 5 years Proportion of respondents
Thank you – any questions? Tony Baer Email:  [email_address] Twitter: @TonyBaer All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form by any means, electronic, mechanical, photocopying, recording or otherwise, without the prior permission of the publisher, Ovum (an Informa business). The facts of this report are believed to be correct at the time of publication but cannot be guaranteed. Please note that the findings, conclusions and recommendations that Ovum delivers will be based on information gathered in good faith from both primary and secondary sources, whose accuracy we are not always in a position to guarantee. As such Ovum can accept no liability whatever for actions taken based on any information that may subsequently prove to be incorrect.

Mais conteúdo relacionado

Mais procurados

Data and Analytics: A New Toolkit for Asset Managers
Data and Analytics: A New Toolkit for Asset ManagersData and Analytics: A New Toolkit for Asset Managers
Data and Analytics: A New Toolkit for Asset ManagersState Street
 
Chasing Alpha: Data and Analytics for Alternative Asset Managers
Chasing Alpha: Data and Analytics for Alternative Asset ManagersChasing Alpha: Data and Analytics for Alternative Asset Managers
Chasing Alpha: Data and Analytics for Alternative Asset ManagersState Street
 
INFOGRAPHIC: Making #BigData Work
INFOGRAPHIC: Making #BigData WorkINFOGRAPHIC: Making #BigData Work
INFOGRAPHIC: Making #BigData WorkCapgemini
 
The Innovator’s Journey: Asset Manager Insights
The Innovator’s Journey: Asset Manager InsightsThe Innovator’s Journey: Asset Manager Insights
The Innovator’s Journey: Asset Manager InsightsState Street
 
CRITICAL SUCCESS FACTORS IN IMPLEMENTING INFORMATION GOVERNANCE (IG)
CRITICAL SUCCESS FACTORS IN IMPLEMENTING INFORMATION GOVERNANCE (IG)CRITICAL SUCCESS FACTORS IN IMPLEMENTING INFORMATION GOVERNANCE (IG)
CRITICAL SUCCESS FACTORS IN IMPLEMENTING INFORMATION GOVERNANCE (IG)Irmbulldog
 
CIO Tech Poll: IT Economic Outlook 2018
CIO Tech Poll: IT Economic Outlook 2018CIO Tech Poll: IT Economic Outlook 2018
CIO Tech Poll: IT Economic Outlook 2018IDG
 
The Data-Driven Investor: How Technology Changes the Game for Today’s Asset O...
The Data-Driven Investor: How Technology Changes the Game for Today’s Asset O...The Data-Driven Investor: How Technology Changes the Game for Today’s Asset O...
The Data-Driven Investor: How Technology Changes the Game for Today’s Asset O...State Street
 
Data Mining 11-18-10
Data Mining 11-18-10Data Mining 11-18-10
Data Mining 11-18-10Ed Tobias
 
Preqin solutions and HKVCA: technology improvements in private equity
Preqin solutions and HKVCA: technology improvements in private equityPreqin solutions and HKVCA: technology improvements in private equity
Preqin solutions and HKVCA: technology improvements in private equityPreqin Solutions
 
The Evolution of Data and New Opportunities for Analytics
The Evolution of Data and New Opportunities for AnalyticsThe Evolution of Data and New Opportunities for Analytics
The Evolution of Data and New Opportunities for AnalyticsSAS Canada
 
CIO IT Budget & Staffing Survey
CIO IT Budget & Staffing SurveyCIO IT Budget & Staffing Survey
CIO IT Budget & Staffing SurveyAbbie Lundberg
 
Best Practices In Predictive Analytics
Best Practices In Predictive AnalyticsBest Practices In Predictive Analytics
Best Practices In Predictive AnalyticsCapgemini
 
Data Analytics for Finance
Data Analytics for FinanceData Analytics for Finance
Data Analytics for Financeellenica
 
Credit Card Analytics on a Connected Data Platform
Credit Card Analytics on a Connected Data PlatformCredit Card Analytics on a Connected Data Platform
Credit Card Analytics on a Connected Data PlatformHortonworks
 
Finance and Accounting BPM
Finance and Accounting BPMFinance and Accounting BPM
Finance and Accounting BPMBob Samuels
 
Big data &amp; analytics for banking new york lars hamberg
Big data &amp; analytics for banking new york   lars hambergBig data &amp; analytics for banking new york   lars hamberg
Big data &amp; analytics for banking new york lars hambergLars Hamberg
 
Ibm ofa ottawa_analytics_in_gov _campbell_robertson
Ibm  ofa ottawa_analytics_in_gov _campbell_robertsonIbm  ofa ottawa_analytics_in_gov _campbell_robertson
Ibm ofa ottawa_analytics_in_gov _campbell_robertsondawnrk
 
FulcrumWay GRC Solutions
FulcrumWay GRC SolutionsFulcrumWay GRC Solutions
FulcrumWay GRC SolutionsMantala
 
How advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sectorHow advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sectorMichael Haddad
 

Mais procurados (20)

Data and Analytics: A New Toolkit for Asset Managers
Data and Analytics: A New Toolkit for Asset ManagersData and Analytics: A New Toolkit for Asset Managers
Data and Analytics: A New Toolkit for Asset Managers
 
Chasing Alpha: Data and Analytics for Alternative Asset Managers
Chasing Alpha: Data and Analytics for Alternative Asset ManagersChasing Alpha: Data and Analytics for Alternative Asset Managers
Chasing Alpha: Data and Analytics for Alternative Asset Managers
 
INFOGRAPHIC: Making #BigData Work
INFOGRAPHIC: Making #BigData WorkINFOGRAPHIC: Making #BigData Work
INFOGRAPHIC: Making #BigData Work
 
The Innovator’s Journey: Asset Manager Insights
The Innovator’s Journey: Asset Manager InsightsThe Innovator’s Journey: Asset Manager Insights
The Innovator’s Journey: Asset Manager Insights
 
CRITICAL SUCCESS FACTORS IN IMPLEMENTING INFORMATION GOVERNANCE (IG)
CRITICAL SUCCESS FACTORS IN IMPLEMENTING INFORMATION GOVERNANCE (IG)CRITICAL SUCCESS FACTORS IN IMPLEMENTING INFORMATION GOVERNANCE (IG)
CRITICAL SUCCESS FACTORS IN IMPLEMENTING INFORMATION GOVERNANCE (IG)
 
CIO Tech Poll: IT Economic Outlook 2018
CIO Tech Poll: IT Economic Outlook 2018CIO Tech Poll: IT Economic Outlook 2018
CIO Tech Poll: IT Economic Outlook 2018
 
The Data-Driven Investor: How Technology Changes the Game for Today’s Asset O...
The Data-Driven Investor: How Technology Changes the Game for Today’s Asset O...The Data-Driven Investor: How Technology Changes the Game for Today’s Asset O...
The Data-Driven Investor: How Technology Changes the Game for Today’s Asset O...
 
Data Mining 11-18-10
Data Mining 11-18-10Data Mining 11-18-10
Data Mining 11-18-10
 
Preqin solutions and HKVCA: technology improvements in private equity
Preqin solutions and HKVCA: technology improvements in private equityPreqin solutions and HKVCA: technology improvements in private equity
Preqin solutions and HKVCA: technology improvements in private equity
 
Changing Tides
Changing TidesChanging Tides
Changing Tides
 
The Evolution of Data and New Opportunities for Analytics
The Evolution of Data and New Opportunities for AnalyticsThe Evolution of Data and New Opportunities for Analytics
The Evolution of Data and New Opportunities for Analytics
 
CIO IT Budget & Staffing Survey
CIO IT Budget & Staffing SurveyCIO IT Budget & Staffing Survey
CIO IT Budget & Staffing Survey
 
Best Practices In Predictive Analytics
Best Practices In Predictive AnalyticsBest Practices In Predictive Analytics
Best Practices In Predictive Analytics
 
Data Analytics for Finance
Data Analytics for FinanceData Analytics for Finance
Data Analytics for Finance
 
Credit Card Analytics on a Connected Data Platform
Credit Card Analytics on a Connected Data PlatformCredit Card Analytics on a Connected Data Platform
Credit Card Analytics on a Connected Data Platform
 
Finance and Accounting BPM
Finance and Accounting BPMFinance and Accounting BPM
Finance and Accounting BPM
 
Big data &amp; analytics for banking new york lars hamberg
Big data &amp; analytics for banking new york   lars hambergBig data &amp; analytics for banking new york   lars hamberg
Big data &amp; analytics for banking new york lars hamberg
 
Ibm ofa ottawa_analytics_in_gov _campbell_robertson
Ibm  ofa ottawa_analytics_in_gov _campbell_robertsonIbm  ofa ottawa_analytics_in_gov _campbell_robertson
Ibm ofa ottawa_analytics_in_gov _campbell_robertson
 
FulcrumWay GRC Solutions
FulcrumWay GRC SolutionsFulcrumWay GRC Solutions
FulcrumWay GRC Solutions
 
How advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sectorHow advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sector
 

Semelhante a Hadoop World 2011: The State of Big Data Adoption in the Enterprise - Tony Baer - Ovum

Accelerating transformation hmg - pa
Accelerating transformation   hmg - paAccelerating transformation   hmg - pa
Accelerating transformation hmg - paMatt Mandich
 
Accelerating IT Transformation with Data & Analytic s
Accelerating IT Transformation with Data & Analytic s  Accelerating IT Transformation with Data & Analytic s
Accelerating IT Transformation with Data & Analytic s Matt Mandich
 
MHR Analytics Summit 2018 | The Data Journey - Laura Timms
MHR Analytics Summit 2018 | The Data Journey - Laura TimmsMHR Analytics Summit 2018 | The Data Journey - Laura Timms
MHR Analytics Summit 2018 | The Data Journey - Laura TimmsMHR Analytics
 
Data Trends for 2019: Extracting Value from Data
Data Trends for 2019: Extracting Value from DataData Trends for 2019: Extracting Value from Data
Data Trends for 2019: Extracting Value from DataPrecisely
 
Stateofthe cio 2022 new sample slides
Stateofthe cio 2022 new sample slidesStateofthe cio 2022 new sample slides
Stateofthe cio 2022 new sample slidesIDG
 
Stateofthe cio 2022 sample slides
Stateofthe cio 2022 sample slides Stateofthe cio 2022 sample slides
Stateofthe cio 2022 sample slides IDG
 
Stateofthe cio 2022 sample slides
Stateofthe cio 2022 sample slides Stateofthe cio 2022 sample slides
Stateofthe cio 2022 sample slides IDG
 
DATA Inc. 2009 / 2010 Survey Results
DATA Inc. 2009 / 2010 Survey ResultsDATA Inc. 2009 / 2010 Survey Results
DATA Inc. 2009 / 2010 Survey ResultsDATA Inc.
 
Capturing the Attention of SMBs
Capturing the Attention of SMBsCapturing the Attention of SMBs
Capturing the Attention of SMBsBredin, Inc.
 
CIO Tech Priorities 2023_Sample Slides.pdf
CIO Tech Priorities 2023_Sample Slides.pdfCIO Tech Priorities 2023_Sample Slides.pdf
CIO Tech Priorities 2023_Sample Slides.pdfIDG
 
Data and Findings - Accelerating IT Transformation with DATA
Data and Findings - Accelerating IT Transformation with DATAData and Findings - Accelerating IT Transformation with DATA
Data and Findings - Accelerating IT Transformation with DATAMatt Mandich
 
2011 summit office of the cio presentation galit
2011 summit office of the cio presentation galit2011 summit office of the cio presentation galit
2011 summit office of the cio presentation galitGalit Fein
 
Go-To-Market with Capstone v3
Go-To-Market with Capstone v3Go-To-Market with Capstone v3
Go-To-Market with Capstone v3Tracy Hawkey
 
The Value of Pervasive Analytics
The Value of Pervasive AnalyticsThe Value of Pervasive Analytics
The Value of Pervasive AnalyticsCloudera, Inc.
 
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...Datameer
 
2015 BigInsights Big Data Study
2015 BigInsights Big Data Study   2015 BigInsights Big Data Study
2015 BigInsights Big Data Study BigInsights
 
Bad Data is Polluting Big Data
Bad Data is Polluting Big DataBad Data is Polluting Big Data
Bad Data is Polluting Big DataStreamsets Inc.
 
Is Bimodal IT Dead? July 2017
Is Bimodal IT Dead?  July 2017Is Bimodal IT Dead?  July 2017
Is Bimodal IT Dead? July 2017Pulse Q&A
 

Semelhante a Hadoop World 2011: The State of Big Data Adoption in the Enterprise - Tony Baer - Ovum (20)

Accelerating transformation hmg - pa
Accelerating transformation   hmg - paAccelerating transformation   hmg - pa
Accelerating transformation hmg - pa
 
Accelerating IT Transformation with Data & Analytic s
Accelerating IT Transformation with Data & Analytic s  Accelerating IT Transformation with Data & Analytic s
Accelerating IT Transformation with Data & Analytic s
 
MHR Analytics Summit 2018 | The Data Journey - Laura Timms
MHR Analytics Summit 2018 | The Data Journey - Laura TimmsMHR Analytics Summit 2018 | The Data Journey - Laura Timms
MHR Analytics Summit 2018 | The Data Journey - Laura Timms
 
Data Trends for 2019: Extracting Value from Data
Data Trends for 2019: Extracting Value from DataData Trends for 2019: Extracting Value from Data
Data Trends for 2019: Extracting Value from Data
 
Stateofthe cio 2022 new sample slides
Stateofthe cio 2022 new sample slidesStateofthe cio 2022 new sample slides
Stateofthe cio 2022 new sample slides
 
Stateofthe cio 2022 sample slides
Stateofthe cio 2022 sample slides Stateofthe cio 2022 sample slides
Stateofthe cio 2022 sample slides
 
Stateofthe cio 2022 sample slides
Stateofthe cio 2022 sample slides Stateofthe cio 2022 sample slides
Stateofthe cio 2022 sample slides
 
DATA Inc. 2009 / 2010 Survey Results
DATA Inc. 2009 / 2010 Survey ResultsDATA Inc. 2009 / 2010 Survey Results
DATA Inc. 2009 / 2010 Survey Results
 
Capturing the Attention of SMBs
Capturing the Attention of SMBsCapturing the Attention of SMBs
Capturing the Attention of SMBs
 
CIO Tech Priorities 2023_Sample Slides.pdf
CIO Tech Priorities 2023_Sample Slides.pdfCIO Tech Priorities 2023_Sample Slides.pdf
CIO Tech Priorities 2023_Sample Slides.pdf
 
Rob Bearden Keynote Hadoop Summit San Jose
Rob Bearden Keynote Hadoop Summit San JoseRob Bearden Keynote Hadoop Summit San Jose
Rob Bearden Keynote Hadoop Summit San Jose
 
Data and Findings - Accelerating IT Transformation with DATA
Data and Findings - Accelerating IT Transformation with DATAData and Findings - Accelerating IT Transformation with DATA
Data and Findings - Accelerating IT Transformation with DATA
 
2011 summit office of the cio presentation galit
2011 summit office of the cio presentation galit2011 summit office of the cio presentation galit
2011 summit office of the cio presentation galit
 
Data-Driven IT Automation: A Vision for the Modern CIO
Data-Driven IT Automation: A Vision for the Modern CIOData-Driven IT Automation: A Vision for the Modern CIO
Data-Driven IT Automation: A Vision for the Modern CIO
 
Go-To-Market with Capstone v3
Go-To-Market with Capstone v3Go-To-Market with Capstone v3
Go-To-Market with Capstone v3
 
The Value of Pervasive Analytics
The Value of Pervasive AnalyticsThe Value of Pervasive Analytics
The Value of Pervasive Analytics
 
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
 
2015 BigInsights Big Data Study
2015 BigInsights Big Data Study   2015 BigInsights Big Data Study
2015 BigInsights Big Data Study
 
Bad Data is Polluting Big Data
Bad Data is Polluting Big DataBad Data is Polluting Big Data
Bad Data is Polluting Big Data
 
Is Bimodal IT Dead? July 2017
Is Bimodal IT Dead?  July 2017Is Bimodal IT Dead?  July 2017
Is Bimodal IT Dead? July 2017
 

Mais de Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 

Mais de Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 

Último

Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 

Último (20)

DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 

Hadoop World 2011: The State of Big Data Adoption in the Enterprise - Tony Baer - Ovum

  • 1. The State of Big Data Adoption in the Enterprise Tony Baer [email_address] November 2011
  • 2.
  • 3. Survey sample by region Europe North America APAC 27% 33% 40% Sample size = 150 organizations Minimum > 1+ TByte data in enterprise DWs/analytic data stores
  • 4. Company size & IT budget <$10m Don't Know $50m+ $10m - $50m 9% 7% 7% 77% Annual IT budget $50m - $250m <$10m >$1bn 22% 18% 17% 21% 22% Company size Big data is not limited to big companies! $250m -$1b $10m - $50m
  • 5. Survey sample by vertical industry Other Media & Entertainment Mfg Public sector (government) ICT Healthcare Financial services Retail 5% 16% 13% 5% 21% 20% 5% 9% Transportation 6%
  • 6.
  • 7. Mean analytic data store size Number of respondents 1 150 1000 3TB = mean 75 3000 5000 7000 Terabytes 2000 4000 6000 8000
  • 8. Analytic data store size by data type 0% 10% 20% 30% No unstructured data 1 - 5 TBytes 5 - 10 TBytes 10 - 20 TBytes 20 - 50 TBytes 50 - 100 TBytes 100 - 500 TBytes 500 - 1000 Tbtyes Over 1000 TBytes Structured data Unstructured/ Variably structured data
  • 9. Popular analytic data types 0% 30% 60% 90% B2B transactional Supply Chain Mgmt Call Detail Records (CDRs) Internet search indices Legal/regulatory documentation Legacy Apps CRM ERP Email/Messaging Currently Planned
  • 10. Variably structured data – current & future demand Weighted score Social Media Web Logs Sensory Graph Rich media Text Time series All sectors Financial Svcs Healthcare Retail Public Sector Healthcare Healthcare Healthcare Healthcare FS FS Retail Retail Healthcare Public Sector FS FS FS Healthcare
  • 11.
  • 12. Business objectives Big Data analytic projects change the analytics, but not the objectives 0% 30% 60% Other Advanced analytics Competitive positioning Business agility Regulatory compliance ID hidden business trends Predictive analytic insights Business forecasting Customer service Strategic decision making Operational decision making
  • 13. Business sponsors The players are currently the same 0% 30% 60% Supply chain Other Customer management Internal operations Sales and marketing Finance
  • 14. Advanced SQL analytic database use A 5-year head start on NoSQL 0% 2% 4% 6% 8% 10% 12% Oracle Exadata Sybase IQ Aster Data IBM Netezza Sand Greenplum Kognitio Teradata ParAccel Vertica Infobright
  • 15. NoSQL platform use Considering Testing/ Evaluating In Production 0% 2% 4% 6% 8% 10% 12% Amazon SimpleDB Hadoop MongoDB Membase Cassandra CouchDB
  • 16. Non-SQL languages/frameworks for analytic queries 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% Java None PHP C or C++ Other language Perl Other framework/technology? Python Ruby MapReduce Now Future
  • 17. Big Data in the cloud? No Yes Considering in next 12 months Considering in next 12 - 36 months 24% 9% 9% 58%
  • 18. Technical concerns Analytic/NoSQL databases technology immaturity Current DW investments not easily extended In-house analytic skills Increased capital investment Visualizing large datasets Technology infrastructure costs Analysis requires additional IT resources Data integration complexity Systems & network performance impact High data volatility/refresh cycles Information filtering technology Data storage issues & costs Performance issues & query response times Data quality & governance Not Important Important
  • 19.
  • 20. Who will deliver Big Data technologies & solutions? Plenty of room for new blood 0% 20% 40% 60% Existing data warehousing/ BI analytics supplier In-house Systems integrator Specialist provider Other technology provider
  • 21. Big Data IT budget plans Based on 25% of respondents provided actual numbers Current IT budget Next year's IT budget Next 2 - 5 yrs Under $100,000 $100,000 - $1 million 1 - 5 million Over $5 million
  • 22. Big Data budget plans 0% 10% 20% 30% 40% 27% 33% 44% Current IT budget Next year's IT budget In next 2 - 5 years Proportion of respondents
  • 23. Thank you – any questions? Tony Baer Email: [email_address] Twitter: @TonyBaer All Rights Reserved. No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form by any means, electronic, mechanical, photocopying, recording or otherwise, without the prior permission of the publisher, Ovum (an Informa business). The facts of this report are believed to be correct at the time of publication but cannot be guaranteed. Please note that the findings, conclusions and recommendations that Ovum delivers will be based on information gathered in good faith from both primary and secondary sources, whose accuracy we are not always in a position to guarantee. As such Ovum can accept no liability whatever for actions taken based on any information that may subsequently prove to be incorrect.

Notas do Editor

  1. Apologies about the busy – but we’ll just focus on a few highlights. Weighted on a scale 1 to 4 not important to very important Unsure’s were a small% of responses. Some interesting patterns show up here – from sectors that had the largest representation in the sample. Among the highlights, we saw peaks of interest in: Text remains by far the most popular data type – as text analytics are fairly well-established practice. Ditto with rich media. Time series data from retail and financial services, which are highly transactional event-intensive businesses. Surprisingly, also interest from public sector. Surprisingly, healthcare was the most interested in social media. This indicates that healthcare providers are looking for data from patients themselves to fill a key missing link in quality-of-care (and reputational) data: data that comes from the patient or their family or friends. Surprisingly, very little interest in web logs – even from retail. Similarly, there was lesser interest in gathering and analyzing sensory data. As a group, these specific sectors had higher levels of interest in most of these data types compared with the entire sample. That discrepancy is likely a freak of sampling; because these groups were more highly represented, our survey team probably reached more organizations with active interest. This data hints at the differences that are likely to emerge as use of Big Data analytics matures. For instance, gas, water, and electric utilities have vital stake in monitoring devices on their transmission systems and at the points of consumption – especially if they are implementing smart metering grids or similar programs to manage resource demand. We would not expect the same demand from financial services. Similarly, manufacturers, retailers, and logistics providers have clear interest in tracking onboard GPS devices and external feeds regarding road and weather conditions for tracking trucking deliveries. Ovum believes that as Big Data analyses become more commonplace in the enterprise mainstream, that vertical industry differences regarding data types will eventually emerge. For instance: graph data will be most useful where the need is for tracking customer sentiment based on the interactions of social groups, or for traffic flow analysis that may be used by municipal transport agencies or trucking companies weblogs will be used by online sites seeking to understand and drive incoming traffic rich media data will be useful for a broader range of segments, such as entertainment companies that tag and track media assets and security and law enforcement agencies that track terrorist or criminal activity social media data will be useful for any sector that is consumer- (or, in the case of public sector, voter-) driven as it provides, in effect, the world&apos;s largest virtual focus group text and document data will be useful for organizations subject to regulatory compliance mandates.
  2. Supply chain under-represented because only half the respondents have supply chains; the other half were from the service industries Sales &amp; marketing highlighted because it came from the 80% who were private sector.
  3. When you add up the numbers, the total comes out to 45% of respondents; however, as a significant subset might be implementing multiple Advanced SQL platforms, the actual proportion of respondents will likely be lower.
  4. Reveals a lot of development of analytics as web apps, where web languages used in conjunction with (not instead ofg) SQL on back end.
  5. Weighted on a scale of 1 to 4