SlideShare a Scribd company logo
1 of 16
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
1 
Making Big Data a First-class 
Citizen in the Enterprise 
Tony Baer 
tony.baer@ovum.com 
IT014-002860 
January 24, 2014
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
2 
Contents 
 The premise 
 What we mean 
 Scope – focus on Hadoop, the most popular emerging Big Data 
platform 
 Addressing the enterprise 
 IT organization 
 Data center infrastructure, processes, policies, and practices 
 Creating value; competitive benefits to the business 
 The endgame
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
3 
The premise: Voting Big Data off the island! 
Big Data must become a first-class citizen in the enterprise and 
cannot exist on its own island.
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
4 
Hadoop user base and use cases are changing 
 Users are changing from Internet companies to mainstream 
enterprises. 
 Use cases are changing from Internet search, ad optimization 
to customer churn analysis, sales and promotions, and 
operations.
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
5 
What does it mean for Big Data to become a first-class 
citizen? 
 IT organization 
 No more SWAT teams! 
 Must map to existing people and skills 
 Data center 
 Must map to existing infrastructure, subject to same constraints 
 Enterprise 
 Must address real business problems, not abstract data science research
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
6 
Contents 
 The premise 
 Addressing the enterprise 
 IT organization 
 Data center infrastructure, processes, policies, and practices 
 Creating value; competitive benefits to the business 
 The endgame
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
7 
Mapping Big Data skills to the IT organization 
 Enrich, don’t replace, your existing app developers, DBAs, system 
administrators 
 Huge existing SQL skills base – you’re not going to replace them 
 Large Java developer base, lots of scripting language diversity 
 Popularity of JavaScript/JSON 
 Skills: 
 Technology – the easy part 
 Domain and data science – not so easy 
Don’t forget the people part!
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
8 
Extending the IT organization for Big Data 
 SQL and NoSQL/Hadoop platforms are converging 
 SQL access to Hadoop 
 Hadoop platform SQL support 
 BI tool Hadoop support 
 MapReduce approaches to Advanced SQL platforms 
 MongoDB, CouchDB, Riak 
 Empowering web JavaScript developers with familiar JSON 
 Data science? 
 The apps are coming… 
SQL on Hadoop and Big Data apps are works in progress…
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
9 
Addressing the data center 
 Like most Internet technologies, Hadoop conceived in zone of trust 
 Small, elite band of practitioners 
 Big concern? Getting access to available cluster resources elsewhere 
inside the firewall 
 Enterprise? 
 Security 
 Data stewardship 
 Coping with finite resource 
 Availability and reliability
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
10 
Data center: Big Data must be secure like any 
database management system or data warehouse 
 AAA enforced for access, authentication, authorization 
 Must become more granular by user, data 
 Must become more unified 
 Integrate with LDAP/Active Directory 
 Data privacy mandates 
 This is a policy, not a technology, issue 
 “Don’t be creepy” – don’t blindside your customers based on 
knowledge they didn't know you have 
 Regulation plays driving role for some sectors
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
11 
Data center: Big Data platforms must behave like any 
database management system or data warehouse 
 Data stewardship/lifecycle 
 Data quality, protection, lifecycle management, retention 
 Resource management 
 Capacity utilization critical 
 Availability/reliability 
 Performance management essential for large clusters 
Major change from early Internet adopters
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
12 
Address the business 
Good business cases count! 
 Do: 
 Focus on existing problems (the problems are often more obvious 
than you think…) 
 Identify key points of pain, like any new IT solution 
 Don’t 
 Concoct “interesting” data science problems for the heck of it 
 Get carried away with data (with lots of data, there are lots of chances 
for detecting irrelevant trends) 
 Give up after a few tries …. iterate! 
Don’t get caught up in a data science project
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
13 
Hadoop benefits: Solving familiar problems in new 
ways 
Customer 
holistic view 
Predictive churn 
analysis 
upsell/cross-sell, 
next-best-offer, 
cross-channel ID 
resolution 
Risk mitigation 
Fraud detection, 
counter-party risk 
management, 
credit scoring 
Operational 
efficiency 
Machine data for 
managing smart 
grids, smart urban 
infrastructure, 
supply chain 
logistics 
Not arbitrary data science
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
14 
Contents 
 The premise 
 What we mean 
 Scope – focus on Hadoop, the most popular emerging Big Data 
platform 
 Addressing the enterprise 
 IT organization 
 Data center infrastructure, processes, policies, and practices 
 Creating value; competitive benefits to the business 
 The endgame
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
15 
The endgame: What becoming a first-class citizen 
really means 
 Big Data – and emerging platforms like Hadoop – originated as 
specialized IT systems requiring specially skilled practitioners. 
 This model is not sustainable as Big Data crosses over to the 
enterprise. 
 Big Data must get off its island. 
 Big Data must be accessible to the IT organization, fit into the data 
center, and address real business problems.
© Copyright Ovum. All rights reserved. Ovum is an Informa business. 
16 
Big Data: Embrace and extend 
 IT organization 
 Embrace existing SQL, Java, and other programming language skills 
 Extend skills to understand handling of larger volumes and varieties of data and 
new analytic techniques to supplement SQL 
 Data center 
 Embrace existing policies and practices for data stewardship, resource 
management, security, performance management 
 Extend policies and practices to accommodate platform with different workload 
characteristics, and support of active archiving 
 Business 
 Embrace existing competitive problems; don’t look for new problems because the 
data and platform are different 
 Extend approaches to problem solving by incorporating new data types and new 
forms of analyses to deepen understanding and insights

More Related Content

What's hot

Telco Big Data 2012 Highlights
Telco Big Data 2012 HighlightsTelco Big Data 2012 Highlights
Telco Big Data 2012 HighlightsAlan Quayle
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataMatt Stubbs
 
Conflict in the Cloud – Issues & Solutions for Big Data
Conflict in the Cloud – Issues & Solutions for Big DataConflict in the Cloud – Issues & Solutions for Big Data
Conflict in the Cloud – Issues & Solutions for Big DataHalo BI
 
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learningCloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learningCloudera, Inc.
 
Real-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BIReal-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BIibi
 
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemInformatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemCapgemini
 
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...Capgemini
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataCloudera, Inc.
 
Optimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analyticsOptimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analyticsCloudera, Inc.
 
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...Edureka!
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonCapgemini
 
A #Pink14 Presentation: Optimizing for the #SDDC
A #Pink14 Presentation: Optimizing for the #SDDCA #Pink14 Presentation: Optimizing for the #SDDC
A #Pink14 Presentation: Optimizing for the #SDDCTeamQuest Corporation
 
Customer Case Studies of Self-Service Big Data Analytics
Customer Case Studies of Self-Service Big Data AnalyticsCustomer Case Studies of Self-Service Big Data Analytics
Customer Case Studies of Self-Service Big Data AnalyticsDatameer
 
Succeeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and TechnologySucceeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and Technologyibi
 
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Denodo
 
Managing Growing Transaction Volumes Using Hadoop
Managing Growing Transaction Volumes Using HadoopManaging Growing Transaction Volumes Using Hadoop
Managing Growing Transaction Volumes Using HadoopArvind Purushothaman
 
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017Cloudera, Inc.
 
Why Big Data Analytics Needs Business Intelligence Too
Why Big Data Analytics Needs Business Intelligence Too Why Big Data Analytics Needs Business Intelligence Too
Why Big Data Analytics Needs Business Intelligence Too Barry Devlin
 
Business unIntelligence, Chapter 5
Business unIntelligence, Chapter 5Business unIntelligence, Chapter 5
Business unIntelligence, Chapter 5Barry Devlin
 
Business unIntelligence - a Whistle Stop Tour
Business unIntelligence - a Whistle Stop TourBusiness unIntelligence - a Whistle Stop Tour
Business unIntelligence - a Whistle Stop TourBarry Devlin
 

What's hot (20)

Telco Big Data 2012 Highlights
Telco Big Data 2012 HighlightsTelco Big Data 2012 Highlights
Telco Big Data 2012 Highlights
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
 
Conflict in the Cloud – Issues & Solutions for Big Data
Conflict in the Cloud – Issues & Solutions for Big DataConflict in the Cloud – Issues & Solutions for Big Data
Conflict in the Cloud – Issues & Solutions for Big Data
 
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learningCloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learning
 
Real-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BIReal-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BI
 
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemInformatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake Ecosystem
 
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big Data
 
Optimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analyticsOptimize your cloud strategy for machine learning and analytics
Optimize your cloud strategy for machine learning and analytics
 
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 
A #Pink14 Presentation: Optimizing for the #SDDC
A #Pink14 Presentation: Optimizing for the #SDDCA #Pink14 Presentation: Optimizing for the #SDDC
A #Pink14 Presentation: Optimizing for the #SDDC
 
Customer Case Studies of Self-Service Big Data Analytics
Customer Case Studies of Self-Service Big Data AnalyticsCustomer Case Studies of Self-Service Big Data Analytics
Customer Case Studies of Self-Service Big Data Analytics
 
Succeeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and TechnologySucceeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and Technology
 
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
 
Managing Growing Transaction Volumes Using Hadoop
Managing Growing Transaction Volumes Using HadoopManaging Growing Transaction Volumes Using Hadoop
Managing Growing Transaction Volumes Using Hadoop
 
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017
 
Why Big Data Analytics Needs Business Intelligence Too
Why Big Data Analytics Needs Business Intelligence Too Why Big Data Analytics Needs Business Intelligence Too
Why Big Data Analytics Needs Business Intelligence Too
 
Business unIntelligence, Chapter 5
Business unIntelligence, Chapter 5Business unIntelligence, Chapter 5
Business unIntelligence, Chapter 5
 
Business unIntelligence - a Whistle Stop Tour
Business unIntelligence - a Whistle Stop TourBusiness unIntelligence - a Whistle Stop Tour
Business unIntelligence - a Whistle Stop Tour
 

Viewers also liked

The Cassandra Platform - Christos Diou
The Cassandra Platform - Christos Diou The Cassandra Platform - Christos Diou
The Cassandra Platform - Christos Diou Cassandra Project
 
Manual cassandra NoSQL
Manual cassandra NoSQLManual cassandra NoSQL
Manual cassandra NoSQLlignia
 
Pré-processamento em Big Data
Pré-processamento em Big DataPré-processamento em Big Data
Pré-processamento em Big DataJoão Gabriel Lima
 
Apache Cassandra - Base de datos
Apache Cassandra - Base de datosApache Cassandra - Base de datos
Apache Cassandra - Base de datosZteeven Zalinas
 
Fast Data:The Rebirth of Streaming Analytics
Fast Data:The Rebirth of Streaming AnalyticsFast Data:The Rebirth of Streaming Analytics
Fast Data:The Rebirth of Streaming AnalyticsTony Baer
 
Hadoop, SQL & NoSQL: No Longer an Either-or Question
Hadoop, SQL & NoSQL: No Longer an Either-or QuestionHadoop, SQL & NoSQL: No Longer an Either-or Question
Hadoop, SQL & NoSQL: No Longer an Either-or QuestionTony Baer
 
Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...
Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...
Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...Daniel Briian
 
Elytics - Construindo uma plataforma de big data
Elytics - Construindo uma plataforma de big data Elytics - Construindo uma plataforma de big data
Elytics - Construindo uma plataforma de big data Elo7
 
Great Visualizations and Analytics using Business Intelligence Open Source
Great Visualizations and Analytics using Business Intelligence Open SourceGreat Visualizations and Analytics using Business Intelligence Open Source
Great Visualizations and Analytics using Business Intelligence Open SourceStratebi
 
Hadoop MapReduce Streaming and Pipes
Hadoop MapReduce  Streaming and PipesHadoop MapReduce  Streaming and Pipes
Hadoop MapReduce Streaming and PipesHanborq Inc.
 
Manual apache cassandra y comandos en la shell
Manual apache cassandra y comandos en la shellManual apache cassandra y comandos en la shell
Manual apache cassandra y comandos en la shellKevin López
 
Building a Recommendation Engine Using Diverse Features by Divyanshu Vats
Building a Recommendation Engine Using Diverse Features by Divyanshu VatsBuilding a Recommendation Engine Using Diverse Features by Divyanshu Vats
Building a Recommendation Engine Using Diverse Features by Divyanshu VatsSpark Summit
 
Desenvolvimento Mobile: Android e iOS caminhando juntos
Desenvolvimento Mobile: Android e iOS caminhando juntosDesenvolvimento Mobile: Android e iOS caminhando juntos
Desenvolvimento Mobile: Android e iOS caminhando juntosElo7
 
PyCon APAC 2016 Keynote
PyCon APAC 2016 KeynotePyCon APAC 2016 Keynote
PyCon APAC 2016 KeynoteWes McKinney
 
Improving data interoperability in Python and R
Improving data interoperability in Python and RImproving data interoperability in Python and R
Improving data interoperability in Python and RWes McKinney
 

Viewers also liked (20)

Qcon Rio 2015 - Data Lakes Workshop
Qcon Rio 2015 - Data Lakes WorkshopQcon Rio 2015 - Data Lakes Workshop
Qcon Rio 2015 - Data Lakes Workshop
 
The Cassandra Platform - Christos Diou
The Cassandra Platform - Christos Diou The Cassandra Platform - Christos Diou
The Cassandra Platform - Christos Diou
 
Manual cassandra NoSQL
Manual cassandra NoSQLManual cassandra NoSQL
Manual cassandra NoSQL
 
All things py
All things pyAll things py
All things py
 
Nosql y cassandra
Nosql y cassandraNosql y cassandra
Nosql y cassandra
 
Pré-processamento em Big Data
Pré-processamento em Big DataPré-processamento em Big Data
Pré-processamento em Big Data
 
Apache Cassandra - Base de datos
Apache Cassandra - Base de datosApache Cassandra - Base de datos
Apache Cassandra - Base de datos
 
Fast Data:The Rebirth of Streaming Analytics
Fast Data:The Rebirth of Streaming AnalyticsFast Data:The Rebirth of Streaming Analytics
Fast Data:The Rebirth of Streaming Analytics
 
Hadoop, SQL & NoSQL: No Longer an Either-or Question
Hadoop, SQL & NoSQL: No Longer an Either-or QuestionHadoop, SQL & NoSQL: No Longer an Either-or Question
Hadoop, SQL & NoSQL: No Longer an Either-or Question
 
Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...
Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...
Instalacion,Configuracion y Creacion de Una Base de Datos en Apache Cassandra...
 
Elytics - Construindo uma plataforma de big data
Elytics - Construindo uma plataforma de big data Elytics - Construindo uma plataforma de big data
Elytics - Construindo uma plataforma de big data
 
Apache cassandra
Apache cassandraApache cassandra
Apache cassandra
 
Great Visualizations and Analytics using Business Intelligence Open Source
Great Visualizations and Analytics using Business Intelligence Open SourceGreat Visualizations and Analytics using Business Intelligence Open Source
Great Visualizations and Analytics using Business Intelligence Open Source
 
Hadoop MapReduce Streaming and Pipes
Hadoop MapReduce  Streaming and PipesHadoop MapReduce  Streaming and Pipes
Hadoop MapReduce Streaming and Pipes
 
Manual apache cassandra y comandos en la shell
Manual apache cassandra y comandos en la shellManual apache cassandra y comandos en la shell
Manual apache cassandra y comandos en la shell
 
Building a Recommendation Engine Using Diverse Features by Divyanshu Vats
Building a Recommendation Engine Using Diverse Features by Divyanshu VatsBuilding a Recommendation Engine Using Diverse Features by Divyanshu Vats
Building a Recommendation Engine Using Diverse Features by Divyanshu Vats
 
Up and running with pyspark
Up and running with pysparkUp and running with pyspark
Up and running with pyspark
 
Desenvolvimento Mobile: Android e iOS caminhando juntos
Desenvolvimento Mobile: Android e iOS caminhando juntosDesenvolvimento Mobile: Android e iOS caminhando juntos
Desenvolvimento Mobile: Android e iOS caminhando juntos
 
PyCon APAC 2016 Keynote
PyCon APAC 2016 KeynotePyCon APAC 2016 Keynote
PyCon APAC 2016 Keynote
 
Improving data interoperability in Python and R
Improving data interoperability in Python and RImproving data interoperability in Python and R
Improving data interoperability in Python and R
 

Similar to Making Big Data a First Class citizen in the enterprise

Getting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersGetting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersDatameer
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyCloudera, Inc.
 
Operationalizing Data Analytics
Operationalizing Data AnalyticsOperationalizing Data Analytics
Operationalizing Data AnalyticsVMware Tanzu
 
A better business case for big data with Hadoop
A better business case for big data with HadoopA better business case for big data with Hadoop
A better business case for big data with HadoopAptitude Software
 
From Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessFrom Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessNG DATA
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieSunil Ranka
 
HP Enterprise Software: Making your applications and information work for you
HP Enterprise Software: Making your applications and information work for youHP Enterprise Software: Making your applications and information work for you
HP Enterprise Software: Making your applications and information work for youHP Enterprise Italia
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataMatt Stubbs
 
5 big data at work linking discovery and bi to improve business outcomes from...
5 big data at work linking discovery and bi to improve business outcomes from...5 big data at work linking discovery and bi to improve business outcomes from...
5 big data at work linking discovery and bi to improve business outcomes from...Dr. Wilfred Lin (Ph.D.)
 
Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry  Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry Persontyle
 
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014pietvz
 
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REXHadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REXModern Data Stack France
 
Fight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsFight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsDatameer
 
Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?Dell World
 
The Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyThe Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyCloudera, Inc.
 
Rob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopRob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopGhassan Al-Yafie
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseJeff Kelly
 

Similar to Making Big Data a First Class citizen in the enterprise (20)

Getting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersGetting Started with Big Data for Business Managers
Getting Started with Big Data for Business Managers
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data Strategy
 
Operationalizing Data Analytics
Operationalizing Data AnalyticsOperationalizing Data Analytics
Operationalizing Data Analytics
 
The value of our data
The value of our dataThe value of our data
The value of our data
 
A better business case for big data with Hadoop
A better business case for big data with HadoopA better business case for big data with Hadoop
A better business case for big data with Hadoop
 
From Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessFrom Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your business
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
HP Enterprise Software: Making your applications and information work for you
HP Enterprise Software: Making your applications and information work for youHP Enterprise Software: Making your applications and information work for you
HP Enterprise Software: Making your applications and information work for you
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
 
5 big data at work linking discovery and bi to improve business outcomes from...
5 big data at work linking discovery and bi to improve business outcomes from...5 big data at work linking discovery and bi to improve business outcomes from...
5 big data at work linking discovery and bi to improve business outcomes from...
 
Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry  Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry
 
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
 
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REXHadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
 
Fight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsFight Fraud with Big Data Analytics
Fight Fraud with Big Data Analytics
 
Are you ready for Big Data 2.0? EMA Analyst Research
Are you ready for Big Data 2.0? EMA Analyst ResearchAre you ready for Big Data 2.0? EMA Analyst Research
Are you ready for Big Data 2.0? EMA Analyst Research
 
Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?
 
The Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyThe Five Markers on Your Big Data Journey
The Five Markers on Your Big Data Journey
 
Rob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopRob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoop
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouse
 

Recently uploaded

SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 

Recently uploaded (20)

SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 

Making Big Data a First Class citizen in the enterprise

  • 1. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 1 Making Big Data a First-class Citizen in the Enterprise Tony Baer tony.baer@ovum.com IT014-002860 January 24, 2014
  • 2. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 2 Contents  The premise  What we mean  Scope – focus on Hadoop, the most popular emerging Big Data platform  Addressing the enterprise  IT organization  Data center infrastructure, processes, policies, and practices  Creating value; competitive benefits to the business  The endgame
  • 3. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 3 The premise: Voting Big Data off the island! Big Data must become a first-class citizen in the enterprise and cannot exist on its own island.
  • 4. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 4 Hadoop user base and use cases are changing  Users are changing from Internet companies to mainstream enterprises.  Use cases are changing from Internet search, ad optimization to customer churn analysis, sales and promotions, and operations.
  • 5. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 5 What does it mean for Big Data to become a first-class citizen?  IT organization  No more SWAT teams!  Must map to existing people and skills  Data center  Must map to existing infrastructure, subject to same constraints  Enterprise  Must address real business problems, not abstract data science research
  • 6. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 6 Contents  The premise  Addressing the enterprise  IT organization  Data center infrastructure, processes, policies, and practices  Creating value; competitive benefits to the business  The endgame
  • 7. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 7 Mapping Big Data skills to the IT organization  Enrich, don’t replace, your existing app developers, DBAs, system administrators  Huge existing SQL skills base – you’re not going to replace them  Large Java developer base, lots of scripting language diversity  Popularity of JavaScript/JSON  Skills:  Technology – the easy part  Domain and data science – not so easy Don’t forget the people part!
  • 8. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 8 Extending the IT organization for Big Data  SQL and NoSQL/Hadoop platforms are converging  SQL access to Hadoop  Hadoop platform SQL support  BI tool Hadoop support  MapReduce approaches to Advanced SQL platforms  MongoDB, CouchDB, Riak  Empowering web JavaScript developers with familiar JSON  Data science?  The apps are coming… SQL on Hadoop and Big Data apps are works in progress…
  • 9. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 9 Addressing the data center  Like most Internet technologies, Hadoop conceived in zone of trust  Small, elite band of practitioners  Big concern? Getting access to available cluster resources elsewhere inside the firewall  Enterprise?  Security  Data stewardship  Coping with finite resource  Availability and reliability
  • 10. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 10 Data center: Big Data must be secure like any database management system or data warehouse  AAA enforced for access, authentication, authorization  Must become more granular by user, data  Must become more unified  Integrate with LDAP/Active Directory  Data privacy mandates  This is a policy, not a technology, issue  “Don’t be creepy” – don’t blindside your customers based on knowledge they didn't know you have  Regulation plays driving role for some sectors
  • 11. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 11 Data center: Big Data platforms must behave like any database management system or data warehouse  Data stewardship/lifecycle  Data quality, protection, lifecycle management, retention  Resource management  Capacity utilization critical  Availability/reliability  Performance management essential for large clusters Major change from early Internet adopters
  • 12. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 12 Address the business Good business cases count!  Do:  Focus on existing problems (the problems are often more obvious than you think…)  Identify key points of pain, like any new IT solution  Don’t  Concoct “interesting” data science problems for the heck of it  Get carried away with data (with lots of data, there are lots of chances for detecting irrelevant trends)  Give up after a few tries …. iterate! Don’t get caught up in a data science project
  • 13. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 13 Hadoop benefits: Solving familiar problems in new ways Customer holistic view Predictive churn analysis upsell/cross-sell, next-best-offer, cross-channel ID resolution Risk mitigation Fraud detection, counter-party risk management, credit scoring Operational efficiency Machine data for managing smart grids, smart urban infrastructure, supply chain logistics Not arbitrary data science
  • 14. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 14 Contents  The premise  What we mean  Scope – focus on Hadoop, the most popular emerging Big Data platform  Addressing the enterprise  IT organization  Data center infrastructure, processes, policies, and practices  Creating value; competitive benefits to the business  The endgame
  • 15. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 15 The endgame: What becoming a first-class citizen really means  Big Data – and emerging platforms like Hadoop – originated as specialized IT systems requiring specially skilled practitioners.  This model is not sustainable as Big Data crosses over to the enterprise.  Big Data must get off its island.  Big Data must be accessible to the IT organization, fit into the data center, and address real business problems.
  • 16. © Copyright Ovum. All rights reserved. Ovum is an Informa business. 16 Big Data: Embrace and extend  IT organization  Embrace existing SQL, Java, and other programming language skills  Extend skills to understand handling of larger volumes and varieties of data and new analytic techniques to supplement SQL  Data center  Embrace existing policies and practices for data stewardship, resource management, security, performance management  Extend policies and practices to accommodate platform with different workload characteristics, and support of active archiving  Business  Embrace existing competitive problems; don’t look for new problems because the data and platform are different  Extend approaches to problem solving by incorporating new data types and new forms of analyses to deepen understanding and insights