SlideShare uma empresa Scribd logo
1 de 37
1   © 2011 Forrester Research, Inc. Reproduction Prohibited
Forrester’s View On Big Data-Trends in the Enterprise
    Big data will disrupt the data management and analytics landscape



    Vanessa Alvarez, Analyst
    November 8, 2011




2   2   © 2010 Forrester Research, Inc. Reproduction Prohibited
          2009
         © 2011 Forrester Research, Inc. Reproduction Prohibited
2
Forrester’s View of Big Data




3   3   © 2010 Forrester Research, Inc. Reproduction Prohibited
         © 2011 Forrester Research, Inc. Reproduction Prohibited
3
Big Data makes extreme scale economical


Our View of Big Data:
The techniques and technologies that make capturing
value from data at extreme scale economical


    Big Data technology:
    Handles data at extreme scale.
    Can be characterized by:
            • Massive parallel computing to divide and conquer huge workloads.
            • Extremely flexible to allow unlimited data manipulation and
            transformation.
            • Massively scalable in terms of both technology and cost.



4    © 2011 Forrester Research, Inc. Reproduction Prohibited
Dimensions of Extreme Scale




5   © 2011 Forrester Research, Inc. Reproduction Prohibited
The Big Data has not cross the chasm

                                        • Social Sensitives       • Big Buckers
                  • Web 2.0s                                                         • Survivors      • One footers
                                        • Data crunchers          • The Awakened




                         • The Hadoop Crowd
                                                         • Proprietary Plays   • The “I do tooers”   • The “I guess we betters”
                       • The NoSQL Crowd

                                       • The biggest & fastest



    Source: Forrester Research and http://en.wikipedia.org/wiki/Crossing_the_Chasm_(book)

6       © 2011 Forrester Research, Inc. Reproduction Prohibited
What Does the Customer Look Like?




7   7   © 2010 Forrester Research, Inc. Reproduction Prohibited
         © 2011 Forrester Research, Inc. Reproduction Prohibited
7
The Big Data customer has multiple personality
disorder
Firms and excited, confused and skeptical all at the same time…

 Firms want more but don’t now what to do with what they have
 We sent our survey to thousands of clients – only got 60 completes

 The high number of partial completes indicates confusion
 Firms are experimenting with data they already have
 …but planning for new types of data

 They plan on keeping their data a long time




8   © 2011 Forrester Research, Inc. Reproduction Prohibited
They want more data, but aren’t sure how to
manage what they have

                          61% believe data will                   56% are concerned with
                          change how business                      the ability to manage
                               operates.                               current data.




                                     330                                      303




    Source: “Global Survey: The Business Impact of Big Data,” Avanade, November 2010


9   © 2011 Forrester Research, Inc. Reproduction Prohibited
Buyers really want this future…
                         Current                                              Future

                                                                 “No data is discarded anymore! U.S.
                                                                 xPress leverages a large scale of
                                                                 transaction data and a diversity of
                                                                 interaction data, now extended to
                                                                 perform big data processing like
     “Our oil rigs generate about 25,000
                                                                 Hadoop …We assess driver
     data points per second and we only
                                                                 performance with image files and pick
     use about 5% of that information.”
                                                                 up customer behaviors from texts by
                                                                 customer service reps. U.S. xPress
                     — super major energy                        saved millions of dollars per year by
                     company executive                           …augmenting our enterprise data with
                                                                 sensor, meter, RFID tags, and
                                                                 geospatial data.”

                                                                          — CTO, energy company




10     © 2011 Forrester Research, Inc. Reproduction Prohibited
Forrester’s initial assessment verticals




                                                                                                         ent
                                                                           nce
                                                       om




                                                                                                        le
                                                                                            ure ta inm



                                                                                                    e sa
                                                                       ura
                                                      c
                                                   ele




                                                                                                    g
                                                                    Ins




                                                                                                hol




                                                                                                                        or
                                                                                               urin
                                                                                                  r
                                                                                   & L , Ent e
                                                 &T




                                                                                                                    ect
                                                                                           &W
                                                               e&




                                                                                           act
                                                                              es




                                                                                                                      S
                                                   s




                                                                                        eis
                                                            anc
                                              litie




                                                                          v ic




                                                                                                                 blic
                                                                                      nuf
                                                                                      dia



                                                                                      tail
                                                                       Ser
                                                         Fin




                                                                                   Me




                                                                                   Ma
                                          Ut i
                  Data




                                                                                                               Pu
                                                                                   Re
                  Marketing
                  Operations

                  Sales
                  Risk
                  Management
                  IT Analytics
                  Finance
                  Product
                  Development
                  Customer
                  Service
                  Logistics

                              Innovators        Early Adopters Early Majority       Late Majority & Laggards


Source: Forrester research – this data represents our preliminary qualitative assessment based interviews and inquiries

  11      © 2011 Forrester Research, Inc. Reproduction Prohibited
Various types of data are addressed with marketing
    and operations being top
                                   “What enterprise areas does your Big Data initiative address?”

                            Marketing                                                                               45%

                          Operations                                                                               43%

                                  Sales                                                                      38%

                Risk management                                                                        35%

                         IT analytics                                                                 33%

           Product development                                                                   32%

                              Finance                                                            32%

                 Customer service                                                             30%

                             Logistics                                           22%

                                  Other                                12%

                                     HR                                12%

             Brand management                                     8%

                                                Base: 60 IT Professionals who are Forrester clients
Source: How Forrester Clients Are Using Big Data, September 2011 (Upcoming)
 12     © 2011 Forrester Research, Inc. Reproduction Prohibited
A ¾ are actually doing something with big data or at
       least evaluating it.

                                         “What is the status of your Big Data initiative?”

                                                           Other Don't know
                                                    Testing 2%      2%
                                                       2%



                                                       Piloting
                                                        18%


                                                                                Evaluating
                                                    In production                 53%
                                                         23%




                                                 Base: 60 IT Professionals who are Forrester clients


Source: How Forrester Clients Are Using Big Data, September 2011 (Upcoming)

  13     © 2011 Forrester Research, Inc. Reproduction Prohibited
What Does the Data Look Like?




14 14   © 2010 Forrester Research, Inc. Reproduction Prohibited
         © 2011 Forrester Research, Inc. Reproduction Prohibited
 1
Firms Are Storing Many Types Of Data
 “Overall, including the data/information on networked and direct-attached disk storage that you know about,
                        how much data/information does your firm currently maintain? “


          0 to 100GB             100GB to 999GB                   1TB to 9 TB     10TB to 99TB      100TB to 9PB           10PB or more
                                                                 1%
                                            Overall — company wide 5%                 22%          33%               29%         6%

                                                      Database systems          13%         23%         27%         18%     12%       2%

                     General file storage, including file servers 5%                  20%         34%           21%        13%        2%

       Server backups for disaster recovery and continuity                      10%    19%         31%              21%     12%       2%

  Archiving, all forms including compliance, discovery, and
                                                                                14%         19%     29%             18%    11%    2%
                   infrequently accessed data
Digital and Web content repositories, such as video, audio,
                                                                                      29%         22%         25%         10% 6% 1%
                 images, and Web pages

            PC backups for disaster recovery and continuity                             41%             20%     17%       10%4%

                                  Enterprise content management                       31%         20%         21%     11% 5% 1%

                                                                  Other data           35%         18%        17%    8%4% 1%


                   Base: 1,252 North American and European IT executives and technology decision-makers
       Source: Forrester’s Forrsights Hardware Survey, Q3 2010
 15     © 2011 Forrester Research, Inc. Reproduction Prohibited
Data Is Growing Faster Than Firms Can Deal With
      “For each data/information type, what do you expect will be the growth at your company over the
                                             next 12 months?”

                           More than 100%               50% to 100%          26% to 50%      11% to 25%     Grow 1% to 10%      No
                                                                                                                              growth
                                                                        2%
                                        Overall — company wide             7%   16%              42%                    31%    2%
                                                                      1%
                                                 Database systems 4% 11%                   34%                    44%          4%
                                                               1%
                   General file storage, including file servers 3% 10%                    33%                    46%           4%
                                                                        2%
  Archiving, all forms including compliance, discovery, and
                                                                         4% 11%           28%                   44%            9%
                   infrequently accessed data
                                                          1%
       Server backups for disaster recovery and continuity 3% 11%                         32%                   40%            10%
                                                            1%
Digital and Web content repositories, such as video, audio,
                                                             3% 10%                    24%                 44%                  15%
                 images, and Web pages
                                                           1%
                          Enterprise content management 3% 9%                       19%                   50%                  15%
                                                                        1%
                                                           Other data     4% 14%                  51%                          23%
                                                         1%
          PC backups for disaster recovery and continuity   5%                   18%               44%                          26%

                        Base: 1,252 North American and European IT executives and technology decision-makers
        Source: Forrester’s Forrsights Hardware Survey, Q3 2010
 16      © 2011 Forrester Research, Inc. Reproduction Prohibited
Buyers are interested in big data for a number of
 reasons
     “What are the main business requirements or inadequacies of earlier-generation BI/DW/ET
     technologies, applications, and architecture that are causing you to consider or implement
                                             Big Data?”
           In traditional BI and DW applications, requirements come first, applications come later. In other worlds requirements
           drive applications. Big Data turns this model upside down, where free form exploration using Big Data technology to
           prove a certain hypothesis or to find a pattern, often results in specifications for a more traditional BI/DW application

                                                                    Data volume                                        75%
                           Analysis-driven requirements (Big Data) vs.
                                                                                                                58%
                           requirements driven analysis (traditional …

                                                           Data diversity, variety                          52%
                           Velocity of change and scope/requirements
                                                                                                    38%
                                          unpredictability
                           Cost. Big Data solutions are less expensive
                                                                                                30%
                              than traditional ETL/DW/BI solutions

                                                                           Other          10%

                                                                     Don't know      3%


           Cost is also a factor, in many cases, dealing with data using big data technologies is simpliy cheaper and faster than
           other methods.

                                                                Base: 60 IT Professionals

     Source: How Forrester Clients Are Using Big Data, September 2011 (Upcoming)
17    © 2011 Forrester Research, Inc. Reproduction Prohibited
Most Current Production Implementations Use Existing
      Transactional Data Sources For Big Data Analysis

        “What types of data/records are you planning to analyze using Big Data technologies?”
             Most big data use cases hype its application for analysis of new, raw data from social media, sensors and web traffic,
             but we found that firms are being very practical, with early adopters using it for operating on enterprise data they
             already have.


                        Transactional data from enterprise applications                                                  72%

                                                   Sensor/machine/device data                            42%

                                    Unstructured content from email, office…                           35%

                                      Social media (Facebook, Twitter, etc)                            35%

                                                      Locational/geospatial data                 27%

                                                                     Clickstream                 27%

                                     Image (large video/photographic) data                13%

                                                          Scientific/genomic data        12%

                                                                           Other     7%

                                                                      Don't know    5%

                                                Base: 60 IT Professionals who are Forrester clients
Source: How Forrester Clients Are Using Big Data, September 2011 (Upcoming)
 18     © 2011 Forrester Research, Inc. Reproduction Prohibited
BI, ERP and CRM applications are the biggest
      recipients of big data insights

                                     “Do your Big Data applications stand on their own?”


                                     Business intelligence (BI), analytics                                    55%


                                     Enterprise applications (ERP, CRM)                                 28%


                                                                     Don't know               17%
                                                                                                              Lower responses for
                                                                                                              BPM and BRE
                                                          Business rules (BRE)                    17%         indicate that
                                                                                                              automation of
                                                                                                              processes using
                                                 Business processes (BPM)                         17%         insights from big data
                                                                                                              is not as common as it
                                                                                                              might eventually be.
                                                                          Other           12%


                                                                  (multiple responses accepted)
                                                Base: 60 IT Professionals who are Forrester clients
Source: How Forrester Clients Are Using Big Data, September 2011 (Upcoming)
 19     © 2011 Forrester Research, Inc. Reproduction Prohibited
A significant number of Forrester clients are still using
      commercial technology for Big Data



                                  Commercial source Big Data tools                                                      47%


             Open source Big Data technology (Hadoop,
            MapReduce, Cassandra, and the other Apache                                                            37%
                       open source specs)


                                                                        Don't know                          23%



                                                                              Other                   17%



                                                                  (multiple responses accepted)

                                                Base: 60 IT Professionals who are Forrester clients
Source: How Forrester Clients Are Using Big Data, September 2011 (Upcoming)
 20     © 2011 Forrester Research, Inc. Reproduction Prohibited
What are the Technology Concerns with
                     Big Data?




21 21   © 2010 Forrester Research, Inc. Reproduction Prohibited
         © 2011 Forrester Research, Inc. Reproduction Prohibited
 2
Customer challenges for securing Big Data

                                                                 • Customers are not actively talking about
       Awareness and                                               security concerns.

      understanding are                                          • Customers need help understanding threats
           lacking                                                 in a Big Data environment




                                                                 • Main considerations: Synchronizing retention
      Country policies                                             and disposition policies across jurisdictions,
                                                                   moving data across countries.
       and laws add
        complexity                                               • Customers need help navigating frameworks
                                                                   and changes.



                                                                       Key takeaway
                                                                       Customers are in need of
                                                                       education and guidance.
 22    © 2011 Forrester Research, Inc. Reproduction Prohibited
Storage Efficiency Challenges for Big Data


                                                                • Challenge: In most instances, data is
                                                                  random and inconsistent, not duplicated

                                                                • Opportunity: There is a need for more
      De-duplication                                              intelligent identification of data




                                                                • Challenge: Compression normally
                                                                  happens instead of de-duplication, yet, will
                                                                  compress duplicate data regardless

      Compression                                               • Opportunity: There is a need for an
                                                                  automated manner in doing both de-
                                                                  duplicating, and then compressing



 23   © 2011 Forrester Research, Inc. Reproduction Prohibited
A Conservative Perspective of Storage $ Increase
                   How do you expect your firm’s spending on the following IT infrastructure expenses to
                                             change over the next 12 months?

                        Stay about the same                        Increase 5%-10%                Increase more than 10%



                                                     Storage technologies            36%                40%                15%


                          IT infrastructure/operations staff salaries                      52%                   34%            4%


                                                   Systems management                       59%                    25%          4%


                                            Data center and IT facilities              49%                   30%            9%


                             Servers and server operating systems                     43%                  35%             9%


 Infrastructure systems integration and consulting services                                57%                  21%        6%


                                      PCs and PC operating systems                     48%                  29%            6%

     Infrastructure outsourcing and managed services (based
                                                                                           57%                 18%     5%
                           on fixed-price, long-term contracts)

Pay-per-use online hosted infrastructure services, such as
                                                                                           54%              16%      5%
                                                     cloud

                          Base: 1725 North American and European IT executives and technology decision-makers
         Source: Forrester’s Forrsights Hardware Survey, Q3 2010
24       © 2011 Forrester Research, Inc. Reproduction Prohibited
Commodity vs Specialized Hardware: Pros and Cons


                                                                • Opportunity: Cost efficient, scalable, open
        Commodity                                               • Challenge: reliability, availability




                                                                • Opportunity: performance, scalable
        Specialized
                                                                • Challenge: Cost, proprietary




           Source: Forrester Research
 25   © 2011 Forrester Research, Inc. Reproduction Prohibited
Potential future state architecture: ANY DATA

                                                    Analytics Tools


                                                               Hadoop
                                                                           •Compression
                                                                           •De-duplication
                                                                           •Encryption
                                      Intelligent Management               •Provisioning
                                                                           •Virtualization




               DAS                                              SAN       NAS
              Network                                          Network   Network

          Source: Forrester Research
26   © 2011 Forrester Research, Inc. Reproduction Prohibited
Moving Ahead




27 27   © 2010 Forrester Research, Inc. Reproduction Prohibited
         © 2011 Forrester Research, Inc. Reproduction Prohibited
 2
On-Premise vs Cloud: Pros and Cons


                                                                • Challenge: Capex, space
       On-Premise                                               • Opportunity: Security, ownership




                                                                • Challenge: Security, manageability
                 Cloud
                                                                • Opportunity: Cost efficient, scalable, opex




           Source: Forrester Research
 28   © 2011 Forrester Research, Inc. Reproduction Prohibited
The market is rapidly evolving to fill gaps

      Other file storage systems that capture some Hadoop
       market share.
      Other computational frameworks, beyond MapReduce
       that improve operational analytics.
      Advanced streaming and transactional processing that
       further support real-time.
      Business-friendly features that empower your
       workforce.
      Storage efficiency and security issues will be solved by
       somebody…(?)
29     © 2011 Forrester Research, Inc. Reproduction Prohibited
Gaps in the Big Data market mean opportunities

      Intelligent Management
         – Storage efficiency
         – Security

      Interactive analytics
         – – Hadoop is batch only at present
         – More real time analytics

      Analytics, Big Data as a Service
         – Leveraging cloud




30     © 2011 Forrester Research, Inc. Reproduction Prohibited
How Does This Role Evolve?


  Data ownership is everyone’s responsibility
  Important to bring together a team within IT as well as a business analyst

  Understand your peers in applications and database
  Train or hire a data analyst that can handle data lifecycle management




31   © 2011 Forrester Research, Inc. Reproduction Prohibited
Questions, comments, discussion?




 32   © 2011 Forrester Research, Inc. Reproduction Prohibited
Thank you

Vanessa Alvarez
+1 617.613.6259
valvarez@forrester.com
Twitter: @vanessaalvarez1
Blog: blog.forrester.com
www.forrester.com




   © 2009 Forrester Research, Inc. Reproduction Prohibited
Big Data Is Disruptive Not Incremental

 Big data is not incremental solutions to old problems
 where data has grown bigger
  Big data is
       – New techniques and technologies
       – To handle that we do not deal with today
       – Using scalable and parallel technologies
       – That trade off consistency for high availability and node failures.

  Forrester definition excludes some solutions that other might call “big
   data”. Examples:
       – Massively parallel data warehouses
       – In-memory databases




34   © 2011 Forrester Research, Inc. Reproduction Prohibited
…but the “why” is about speed to value




35   © 2011 Forrester Research, Inc. Reproduction Prohibited
Security considerations for Big Data are nascent
 Current customer focus:
     – What is Big Data, what can we do with it?
     – Concerns over stability, scale, cost, availability,
       unified management
     – Not security (not yet)

 Current storage and security vendor focus:
     – What do we need to know about Big Data?
     – What can be done from a security perspective?
     – Where do we fit?

 Another type of discussion emerging:
     – Big Data for security
     – Example – Zettaset Security Data Warehouse,
       using Big Data to identify security threats



36   © 2011 Forrester Research, Inc. Reproduction Prohibited
Key Pieces to the Big Data Puzzle


  The role of SSD
       – Prices coming down
       – Commoditization of hardware happens more at the SAN, NAS than SSD
       – Analytics require speed and performance, in order to achieve near real-time

  Storage Virtualization
       – Just like servers were virtualized, so will storage be, and a “storage hypervisor”
         will determine where storage capacity will come from, whether it be SAN, NAS
         or SSD (DAS)
               – Enterprises are looking to still leverage their existing infrastructure, instead
                 of ripping and replacing with “commodity infrastructure”




37   © 2011 Forrester Research, Inc. Reproduction Prohibited

Mais conteúdo relacionado

Semelhante a Hadoop World 2011: Hadoop Trends & Predictions - Vanessa Alverez, Forrester

Information Management in the Age of Big Data
Information Management in the Age of Big DataInformation Management in the Age of Big Data
Information Management in the Age of Big Databigdatasyd
 
Search Analytics Business Value & NoSQL Backend
Search Analytics Business Value & NoSQL BackendSearch Analytics Business Value & NoSQL Backend
Search Analytics Business Value & NoSQL BackendSematext Group, Inc.
 
How to Secure your Data Lake
How to Secure your Data LakeHow to Secure your Data Lake
How to Secure your Data LakeSarah Story
 
INTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOPINTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOPDr Geetha Mohan
 
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AIAWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AIAmazon Web Services
 
IW14 Session: Mike Gualtieri, Forrester Research
IW14 Session: Mike Gualtieri, Forrester ResearchIW14 Session: Mike Gualtieri, Forrester Research
IW14 Session: Mike Gualtieri, Forrester ResearchSoftware AG
 
Nov 2010 HUG: Business Intelligence for Big Data
Nov 2010 HUG: Business Intelligence for Big DataNov 2010 HUG: Business Intelligence for Big Data
Nov 2010 HUG: Business Intelligence for Big DataYahoo Developer Network
 
Enterprise Data Science at Scale
Enterprise Data Science at ScaleEnterprise Data Science at Scale
Enterprise Data Science at ScaleArtem Ervits
 
GET STARTED WITH R FOR DATA SCIENCE
GET STARTED WITH R FOR DATA SCIENCEGET STARTED WITH R FOR DATA SCIENCE
GET STARTED WITH R FOR DATA SCIENCEUSDSI
 
Analyzing Hadoop Using Hadoop
Analyzing Hadoop Using HadoopAnalyzing Hadoop Using Hadoop
Analyzing Hadoop Using HadoopDataWorks Summit
 
Apache spark empowering the real time data driven enterprise - StreamAnalytix...
Apache spark empowering the real time data driven enterprise - StreamAnalytix...Apache spark empowering the real time data driven enterprise - StreamAnalytix...
Apache spark empowering the real time data driven enterprise - StreamAnalytix...Impetus Technologies
 
AWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AIAWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AIAmazon Web Services
 
SNIA 2012 - Creating an Enterprise Hadoop Platform
SNIA 2012 - Creating an Enterprise Hadoop PlatformSNIA 2012 - Creating an Enterprise Hadoop Platform
SNIA 2012 - Creating an Enterprise Hadoop PlatformJoey Jablonski
 
Telco Big Data Workshop Sample
Telco Big Data Workshop SampleTelco Big Data Workshop Sample
Telco Big Data Workshop SampleAlan Quayle
 
BIG DATA | How to explain it & how to use it for your career?
BIG DATA | How to explain it & how to use it for your career?BIG DATA | How to explain it & how to use it for your career?
BIG DATA | How to explain it & how to use it for your career?Tuan Yang
 
Getting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersGetting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersDatameer
 
Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...
Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...
Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...Cloudera, Inc.
 
Big Data in Practice.pdf
Big Data in Practice.pdfBig Data in Practice.pdf
Big Data in Practice.pdfTom Tan
 

Semelhante a Hadoop World 2011: Hadoop Trends & Predictions - Vanessa Alverez, Forrester (20)

Information Management in the Age of Big Data
Information Management in the Age of Big DataInformation Management in the Age of Big Data
Information Management in the Age of Big Data
 
Search Analytics Business Value & NoSQL Backend
Search Analytics Business Value & NoSQL BackendSearch Analytics Business Value & NoSQL Backend
Search Analytics Business Value & NoSQL Backend
 
How to Secure your Data Lake
How to Secure your Data LakeHow to Secure your Data Lake
How to Secure your Data Lake
 
How to secure your data lake
How to secure your data lakeHow to secure your data lake
How to secure your data lake
 
INTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOPINTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOP
 
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AIAWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
 
IW14 Session: Mike Gualtieri, Forrester Research
IW14 Session: Mike Gualtieri, Forrester ResearchIW14 Session: Mike Gualtieri, Forrester Research
IW14 Session: Mike Gualtieri, Forrester Research
 
Nov 2010 HUG: Business Intelligence for Big Data
Nov 2010 HUG: Business Intelligence for Big DataNov 2010 HUG: Business Intelligence for Big Data
Nov 2010 HUG: Business Intelligence for Big Data
 
Enterprise Data Science at Scale
Enterprise Data Science at ScaleEnterprise Data Science at Scale
Enterprise Data Science at Scale
 
GET STARTED WITH R FOR DATA SCIENCE
GET STARTED WITH R FOR DATA SCIENCEGET STARTED WITH R FOR DATA SCIENCE
GET STARTED WITH R FOR DATA SCIENCE
 
Analyzing Hadoop Using Hadoop
Analyzing Hadoop Using HadoopAnalyzing Hadoop Using Hadoop
Analyzing Hadoop Using Hadoop
 
Apache spark empowering the real time data driven enterprise - StreamAnalytix...
Apache spark empowering the real time data driven enterprise - StreamAnalytix...Apache spark empowering the real time data driven enterprise - StreamAnalytix...
Apache spark empowering the real time data driven enterprise - StreamAnalytix...
 
CTAM Making Analytics Actionable RJA FINAL
CTAM Making Analytics Actionable RJA FINALCTAM Making Analytics Actionable RJA FINAL
CTAM Making Analytics Actionable RJA FINAL
 
AWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AIAWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AI
 
SNIA 2012 - Creating an Enterprise Hadoop Platform
SNIA 2012 - Creating an Enterprise Hadoop PlatformSNIA 2012 - Creating an Enterprise Hadoop Platform
SNIA 2012 - Creating an Enterprise Hadoop Platform
 
Telco Big Data Workshop Sample
Telco Big Data Workshop SampleTelco Big Data Workshop Sample
Telco Big Data Workshop Sample
 
BIG DATA | How to explain it & how to use it for your career?
BIG DATA | How to explain it & how to use it for your career?BIG DATA | How to explain it & how to use it for your career?
BIG DATA | How to explain it & how to use it for your career?
 
Getting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersGetting Started with Big Data for Business Managers
Getting Started with Big Data for Business Managers
 
Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...
Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...
Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...
 
Big Data in Practice.pdf
Big Data in Practice.pdfBig Data in Practice.pdf
Big Data in Practice.pdf
 

Mais de Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 

Mais de Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 

Último

Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 

Último (20)

Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 

Hadoop World 2011: Hadoop Trends & Predictions - Vanessa Alverez, Forrester

  • 1. 1 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 2. Forrester’s View On Big Data-Trends in the Enterprise Big data will disrupt the data management and analytics landscape Vanessa Alvarez, Analyst November 8, 2011 2 2 © 2010 Forrester Research, Inc. Reproduction Prohibited 2009 © 2011 Forrester Research, Inc. Reproduction Prohibited 2
  • 3. Forrester’s View of Big Data 3 3 © 2010 Forrester Research, Inc. Reproduction Prohibited © 2011 Forrester Research, Inc. Reproduction Prohibited 3
  • 4. Big Data makes extreme scale economical Our View of Big Data: The techniques and technologies that make capturing value from data at extreme scale economical Big Data technology: Handles data at extreme scale. Can be characterized by: • Massive parallel computing to divide and conquer huge workloads. • Extremely flexible to allow unlimited data manipulation and transformation. • Massively scalable in terms of both technology and cost. 4 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 5. Dimensions of Extreme Scale 5 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 6. The Big Data has not cross the chasm • Social Sensitives • Big Buckers • Web 2.0s • Survivors • One footers • Data crunchers • The Awakened • The Hadoop Crowd • Proprietary Plays • The “I do tooers” • The “I guess we betters” • The NoSQL Crowd • The biggest & fastest Source: Forrester Research and http://en.wikipedia.org/wiki/Crossing_the_Chasm_(book) 6 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 7. What Does the Customer Look Like? 7 7 © 2010 Forrester Research, Inc. Reproduction Prohibited © 2011 Forrester Research, Inc. Reproduction Prohibited 7
  • 8. The Big Data customer has multiple personality disorder Firms and excited, confused and skeptical all at the same time…  Firms want more but don’t now what to do with what they have  We sent our survey to thousands of clients – only got 60 completes  The high number of partial completes indicates confusion  Firms are experimenting with data they already have  …but planning for new types of data  They plan on keeping their data a long time 8 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 9. They want more data, but aren’t sure how to manage what they have 61% believe data will 56% are concerned with change how business the ability to manage operates. current data. 330 303 Source: “Global Survey: The Business Impact of Big Data,” Avanade, November 2010 9 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 10. Buyers really want this future… Current Future “No data is discarded anymore! U.S. xPress leverages a large scale of transaction data and a diversity of interaction data, now extended to perform big data processing like “Our oil rigs generate about 25,000 Hadoop …We assess driver data points per second and we only performance with image files and pick use about 5% of that information.” up customer behaviors from texts by customer service reps. U.S. xPress — super major energy saved millions of dollars per year by company executive …augmenting our enterprise data with sensor, meter, RFID tags, and geospatial data.” — CTO, energy company 10 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 11. Forrester’s initial assessment verticals ent nce om le ure ta inm e sa ura c ele g Ins hol or urin r & L , Ent e &T ect &W e& act es S s eis anc litie v ic blic nuf dia tail Ser Fin Me Ma Ut i Data Pu Re Marketing Operations Sales Risk Management IT Analytics Finance Product Development Customer Service Logistics Innovators Early Adopters Early Majority Late Majority & Laggards Source: Forrester research – this data represents our preliminary qualitative assessment based interviews and inquiries 11 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 12. Various types of data are addressed with marketing and operations being top “What enterprise areas does your Big Data initiative address?” Marketing 45% Operations 43% Sales 38% Risk management 35% IT analytics 33% Product development 32% Finance 32% Customer service 30% Logistics 22% Other 12% HR 12% Brand management 8% Base: 60 IT Professionals who are Forrester clients Source: How Forrester Clients Are Using Big Data, September 2011 (Upcoming) 12 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 13. A ¾ are actually doing something with big data or at least evaluating it. “What is the status of your Big Data initiative?” Other Don't know Testing 2% 2% 2% Piloting 18% Evaluating In production 53% 23% Base: 60 IT Professionals who are Forrester clients Source: How Forrester Clients Are Using Big Data, September 2011 (Upcoming) 13 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 14. What Does the Data Look Like? 14 14 © 2010 Forrester Research, Inc. Reproduction Prohibited © 2011 Forrester Research, Inc. Reproduction Prohibited 1
  • 15. Firms Are Storing Many Types Of Data “Overall, including the data/information on networked and direct-attached disk storage that you know about, how much data/information does your firm currently maintain? “ 0 to 100GB 100GB to 999GB 1TB to 9 TB 10TB to 99TB 100TB to 9PB 10PB or more 1% Overall — company wide 5% 22% 33% 29% 6% Database systems 13% 23% 27% 18% 12% 2% General file storage, including file servers 5% 20% 34% 21% 13% 2% Server backups for disaster recovery and continuity 10% 19% 31% 21% 12% 2% Archiving, all forms including compliance, discovery, and 14% 19% 29% 18% 11% 2% infrequently accessed data Digital and Web content repositories, such as video, audio, 29% 22% 25% 10% 6% 1% images, and Web pages PC backups for disaster recovery and continuity 41% 20% 17% 10%4% Enterprise content management 31% 20% 21% 11% 5% 1% Other data 35% 18% 17% 8%4% 1% Base: 1,252 North American and European IT executives and technology decision-makers Source: Forrester’s Forrsights Hardware Survey, Q3 2010 15 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 16. Data Is Growing Faster Than Firms Can Deal With “For each data/information type, what do you expect will be the growth at your company over the next 12 months?” More than 100% 50% to 100% 26% to 50% 11% to 25% Grow 1% to 10% No growth 2% Overall — company wide 7% 16% 42% 31% 2% 1% Database systems 4% 11% 34% 44% 4% 1% General file storage, including file servers 3% 10% 33% 46% 4% 2% Archiving, all forms including compliance, discovery, and 4% 11% 28% 44% 9% infrequently accessed data 1% Server backups for disaster recovery and continuity 3% 11% 32% 40% 10% 1% Digital and Web content repositories, such as video, audio, 3% 10% 24% 44% 15% images, and Web pages 1% Enterprise content management 3% 9% 19% 50% 15% 1% Other data 4% 14% 51% 23% 1% PC backups for disaster recovery and continuity 5% 18% 44% 26% Base: 1,252 North American and European IT executives and technology decision-makers Source: Forrester’s Forrsights Hardware Survey, Q3 2010 16 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 17. Buyers are interested in big data for a number of reasons “What are the main business requirements or inadequacies of earlier-generation BI/DW/ET technologies, applications, and architecture that are causing you to consider or implement Big Data?” In traditional BI and DW applications, requirements come first, applications come later. In other worlds requirements drive applications. Big Data turns this model upside down, where free form exploration using Big Data technology to prove a certain hypothesis or to find a pattern, often results in specifications for a more traditional BI/DW application Data volume 75% Analysis-driven requirements (Big Data) vs. 58% requirements driven analysis (traditional … Data diversity, variety 52% Velocity of change and scope/requirements 38% unpredictability Cost. Big Data solutions are less expensive 30% than traditional ETL/DW/BI solutions Other 10% Don't know 3% Cost is also a factor, in many cases, dealing with data using big data technologies is simpliy cheaper and faster than other methods. Base: 60 IT Professionals Source: How Forrester Clients Are Using Big Data, September 2011 (Upcoming) 17 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 18. Most Current Production Implementations Use Existing Transactional Data Sources For Big Data Analysis “What types of data/records are you planning to analyze using Big Data technologies?” Most big data use cases hype its application for analysis of new, raw data from social media, sensors and web traffic, but we found that firms are being very practical, with early adopters using it for operating on enterprise data they already have. Transactional data from enterprise applications 72% Sensor/machine/device data 42% Unstructured content from email, office… 35% Social media (Facebook, Twitter, etc) 35% Locational/geospatial data 27% Clickstream 27% Image (large video/photographic) data 13% Scientific/genomic data 12% Other 7% Don't know 5% Base: 60 IT Professionals who are Forrester clients Source: How Forrester Clients Are Using Big Data, September 2011 (Upcoming) 18 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 19. BI, ERP and CRM applications are the biggest recipients of big data insights “Do your Big Data applications stand on their own?” Business intelligence (BI), analytics 55% Enterprise applications (ERP, CRM) 28% Don't know 17% Lower responses for BPM and BRE Business rules (BRE) 17% indicate that automation of processes using Business processes (BPM) 17% insights from big data is not as common as it might eventually be. Other 12% (multiple responses accepted) Base: 60 IT Professionals who are Forrester clients Source: How Forrester Clients Are Using Big Data, September 2011 (Upcoming) 19 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 20. A significant number of Forrester clients are still using commercial technology for Big Data Commercial source Big Data tools 47% Open source Big Data technology (Hadoop, MapReduce, Cassandra, and the other Apache 37% open source specs) Don't know 23% Other 17% (multiple responses accepted) Base: 60 IT Professionals who are Forrester clients Source: How Forrester Clients Are Using Big Data, September 2011 (Upcoming) 20 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 21. What are the Technology Concerns with Big Data? 21 21 © 2010 Forrester Research, Inc. Reproduction Prohibited © 2011 Forrester Research, Inc. Reproduction Prohibited 2
  • 22. Customer challenges for securing Big Data • Customers are not actively talking about Awareness and security concerns. understanding are • Customers need help understanding threats lacking in a Big Data environment • Main considerations: Synchronizing retention Country policies and disposition policies across jurisdictions, moving data across countries. and laws add complexity • Customers need help navigating frameworks and changes. Key takeaway Customers are in need of education and guidance. 22 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 23. Storage Efficiency Challenges for Big Data • Challenge: In most instances, data is random and inconsistent, not duplicated • Opportunity: There is a need for more De-duplication intelligent identification of data • Challenge: Compression normally happens instead of de-duplication, yet, will compress duplicate data regardless Compression • Opportunity: There is a need for an automated manner in doing both de- duplicating, and then compressing 23 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 24. A Conservative Perspective of Storage $ Increase How do you expect your firm’s spending on the following IT infrastructure expenses to change over the next 12 months? Stay about the same Increase 5%-10% Increase more than 10% Storage technologies 36% 40% 15% IT infrastructure/operations staff salaries 52% 34% 4% Systems management 59% 25% 4% Data center and IT facilities 49% 30% 9% Servers and server operating systems 43% 35% 9% Infrastructure systems integration and consulting services 57% 21% 6% PCs and PC operating systems 48% 29% 6% Infrastructure outsourcing and managed services (based 57% 18% 5% on fixed-price, long-term contracts) Pay-per-use online hosted infrastructure services, such as 54% 16% 5% cloud Base: 1725 North American and European IT executives and technology decision-makers Source: Forrester’s Forrsights Hardware Survey, Q3 2010 24 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 25. Commodity vs Specialized Hardware: Pros and Cons • Opportunity: Cost efficient, scalable, open Commodity • Challenge: reliability, availability • Opportunity: performance, scalable Specialized • Challenge: Cost, proprietary Source: Forrester Research 25 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 26. Potential future state architecture: ANY DATA Analytics Tools Hadoop •Compression •De-duplication •Encryption Intelligent Management •Provisioning •Virtualization DAS SAN NAS Network Network Network Source: Forrester Research 26 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 27. Moving Ahead 27 27 © 2010 Forrester Research, Inc. Reproduction Prohibited © 2011 Forrester Research, Inc. Reproduction Prohibited 2
  • 28. On-Premise vs Cloud: Pros and Cons • Challenge: Capex, space On-Premise • Opportunity: Security, ownership • Challenge: Security, manageability Cloud • Opportunity: Cost efficient, scalable, opex Source: Forrester Research 28 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 29. The market is rapidly evolving to fill gaps  Other file storage systems that capture some Hadoop market share.  Other computational frameworks, beyond MapReduce that improve operational analytics.  Advanced streaming and transactional processing that further support real-time.  Business-friendly features that empower your workforce.  Storage efficiency and security issues will be solved by somebody…(?) 29 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 30. Gaps in the Big Data market mean opportunities  Intelligent Management – Storage efficiency – Security  Interactive analytics – – Hadoop is batch only at present – More real time analytics  Analytics, Big Data as a Service – Leveraging cloud 30 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 31. How Does This Role Evolve?  Data ownership is everyone’s responsibility  Important to bring together a team within IT as well as a business analyst  Understand your peers in applications and database  Train or hire a data analyst that can handle data lifecycle management 31 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 32. Questions, comments, discussion? 32 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 33. Thank you Vanessa Alvarez +1 617.613.6259 valvarez@forrester.com Twitter: @vanessaalvarez1 Blog: blog.forrester.com www.forrester.com © 2009 Forrester Research, Inc. Reproduction Prohibited
  • 34. Big Data Is Disruptive Not Incremental Big data is not incremental solutions to old problems where data has grown bigger  Big data is – New techniques and technologies – To handle that we do not deal with today – Using scalable and parallel technologies – That trade off consistency for high availability and node failures.  Forrester definition excludes some solutions that other might call “big data”. Examples: – Massively parallel data warehouses – In-memory databases 34 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 35. …but the “why” is about speed to value 35 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 36. Security considerations for Big Data are nascent  Current customer focus: – What is Big Data, what can we do with it? – Concerns over stability, scale, cost, availability, unified management – Not security (not yet)  Current storage and security vendor focus: – What do we need to know about Big Data? – What can be done from a security perspective? – Where do we fit?  Another type of discussion emerging: – Big Data for security – Example – Zettaset Security Data Warehouse, using Big Data to identify security threats 36 © 2011 Forrester Research, Inc. Reproduction Prohibited
  • 37. Key Pieces to the Big Data Puzzle  The role of SSD – Prices coming down – Commoditization of hardware happens more at the SAN, NAS than SSD – Analytics require speed and performance, in order to achieve near real-time  Storage Virtualization – Just like servers were virtualized, so will storage be, and a “storage hypervisor” will determine where storage capacity will come from, whether it be SAN, NAS or SSD (DAS) – Enterprises are looking to still leverage their existing infrastructure, instead of ripping and replacing with “commodity infrastructure” 37 © 2011 Forrester Research, Inc. Reproduction Prohibited