SlideShare a Scribd company logo
1 of 21
Unlocking Value in (Big) Data

Oscar Renalias, Accenture
oscar.renalias@accenture.com
About the presenter
Oscar Renalias
Oscar is a Technology Architect and has been working at
Accenture in the Helsinki office for the last 5 years. He holds a
Bachelor’s Degree in Computer Science from the Universitat
Politècnica de Catalunya (UPC), in Barcelona.
Oscar currently belongs to the global organization within
Accenture responsible for pushing technology
innovation, working with selected new and emerging
technologies together with clients to generate business value.
Hadoop/Big Data is one of those areas.
Oscar.renalias@accenture.com
+358407725915



                                           Copyright © 2012 Accenture All rights reserved.
Agenda

• Top 4 things about Big Data & Analytics
• What is Big Data?
• Big Data Analytics – what is it?
• What does it contain?
• How is it integrated?
• How do we manage it?
• What next?




                                     Copyright © 2012 Accenture All rights reserved.
Top 4 things about Big Data Analytics

  Resistance is futile,
      you will be assimilated

  Competitive advantage
  It’s different

  Data wants to be open

Copyright © 2012 Accenture All rights reserved.
Data is growing
It’s growing. Quickly. And it’s everywhere.

                                                          Data stored in Exabytes (1018)
                                                  9000
                                                                                                   7910
                                                  8000

                                                  7000

                                                  6000

                                                  5000

                                                  4000

                                                  3000

                                                  2000
                                                                                 1227
                                                  1000
                                                                130
                                                      0
                                                                2005             2010              2015

                                              Source: IDC’s Digital Universe Study (sponsored by EMC), June 2011



Copyright © 2012 Accenture All rights reserved.
New kinds of data
Structured data vs. Unstructured data growth




                                                         Complex, Unstructured
                                                                                                               Analysis
                                                                                                               gap


                                                                                                              Our ability
                                   Relational                                                                 to analyze




        Source: An   IDC White Paper - sponsored by EMC. As the Economy Contracts, the Digital Universe Expands. May 2009.
                                                                                                                         .
Copyright © 2012 Accenture All rights reserved.
Big Data Technologies
New technologies, new approaches




 Source: Wordle for Credit Suisse, Does Size Matter Only?, September 2011
Copyright © 2012 Accenture All rights reserved.
Where do analysts see Big Data?
Gartner’s Hype Cycle for Emerging Technologies 2011




Copyright © 2012 Accenture All rights reserved.
MapReduce and Hadoop
MapReduce revolutionized how we handle large amounts of
data, Hadoop made it simple and affordable

                                                  • Originally designed and first developed in
                                                    Google as part of their efforts to more
                                                    efficiently index the web
                                                  • MapReduce splits input data into smaller
                                                    chunk that can be processed in parallel
                                                  • Scales linearly with number of nodes


                                                  • Yahoo’s implementation of MapReduce
                                                  • Open source, top-level project in the
                                                    Apache Foundation
                                                  • Designed to run on commodity software
                                                    (Linux) and hardware (consumer-grade
                                                    computers with directly attached storage)
                                                  • Large ecosystem of additional
                                                    components (both open source and
                                                    commercial)
Copyright © 2012 Accenture All rights reserved.
Big Data Analytics
What is it?

Big Data Analytics is a shift in the mindset of how we
think about analytics as an internal component to the
organization


Focuses on letting data be productized in a way that
drives meaningful insights in a rapid fashion and
innovation to exploit missed opportunities in areas
previously unlooked…


… providing a path to competitive advantage




Copyright © 2012 Accenture All rights reserved.
Big Data Analytics vs. traditional analytics
Where do they differ?

                                Technology                      Skills                   Processes &
                                                                                         Organization
                        Assumes                        Basic knowledge of            “Siloed” data
                        condensed, structured, an      reporting and analysis        organizations
         Traditional
          Analytics




                        d feature rich datasets that   tools, few specialized
                        can be modeled: relational     resources                     Only specific “views” of
                        databases, data                                              data visible across the
                        warehouses, dashboards                                       enterprise




                       A stack of tools that           Advanced                      Data is productized and
                       enables an organization to      analytical, mathematical      shared across the
         Big Data
         Analytics




                       build a framework that          and statistical knowledge     enterprise
                       allows them to extract          required to develop new
                       useful features from a          models – the data scientist   Dedicated data
                       large dataset to further                                      organizations with well-
                       understand how to model                                       defined data management
                       their data.                                                   processes and ownership



Copyright © 2012 Accenture All rights reserved.
Everything will be analyzed
The three Vs


                        Real-time

                                                                                   Event
            In-                                                                processing, H
       memory, NoS                                                                adoop +
         QL, Event                                                                NoSQL
       processing, E
           DW


             Velocity

       Relational, ET
                                                                                Hadoop, ETL
              L

                             Batch



     Volume                              Structured             Unstructured

                                                      Variety                  Source: IDC

Copyright © 2012 Accenture All rights reserved.
Big Data and Analytics in the Enterprise
 Many technology choices in a rapidly changing environment.
 Which one is right for you?
Distributed Non-Relational Storage and Processing


Big Data-Enabled Intelligence and Analysis


Analytics-Focused Massively Parallel Processing
(MPP) Software Platforms

Hardware Optimized MPP Data Warehouses


Distributed In-memory

                                                    Cloud

 Copyright © 2012 Accenture All rights reserved.
Technology
Augmenting existing analytics with Big Data technologies



                                                   Emerging
                                                     Data
                                                  Technologies




                                                                 Big Data
                                                                 Analytics
                                                   Traditional
                                                     Tools




Copyright © 2012 Accenture All rights reserved.
SAS-Hadoop integration
An example of how traditional analytics tools are evolving to interoperate
with Hadoop
SAS/Access Interface to Hadoop
    • Enable SAS user to analyze data stored in Hadoop
    • Allow Hadoop data processing from SAS client software such as Data Integration Studio, Enterprise Guide and
       Enterprise Miner.
    • The Access Engine not only move data into and out of Hadoop, but you can also run data processing and have it
       “pushed-down” into Hadoop
SAS Data Integration Studio Transformation for Hadoop
    • New sets of Hadoop transformations that enable DI studio user to load and unload data from Hadoop faster than
       Sqoop (Can connect to Oracle)
    • Perform “ETL-like” processing with Hive and Pig.
    • Hadoop specific scoring transform that enable models to be developed with Enterprise Miner to be deployed to
       Hadoop via DI Studio.




Copyright © 2012 Accenture All rights reserved.
The impact of Big Data Analytics on our landscapes
Hybrid landscapes, where old and new converge


                Internal
                apps, customer-
                facing
                apps, mobile                                                               Analysis tools
                apps                                                                      (SAS, SPSS, R,
                                       Data Services (REST, WS)                              Tableau)

                                                                                         Relational DBs
                                          Pig           Hive
                                                                         HBase
                                              MapReduce

                                                    HDFS                                   Enterprise
                                                                                              DW

                                                                 ETL                     Real-time analytics



                                                        Time
                                                        Series   Files   Social   Logs
                  Web            ERP              CRM




Copyright © 2012 Accenture All rights reserved.
Data Science and the skill gap
Closing the loop – it’s not just about technology skills


Data science
“The sexy job in the next 10 years will
be statisticians”
 – Hal Varian, Chief Economist at
Google


Data scientists are the next-generation
analytics professional, responsible for
turning the data into insight



Copyright © 2012 Accenture All rights reserved.
Big Data Analytics Management
How does Big Data Analytics Management Style Differ?
In big data analytics resources generally have a hybrid cross between
Software Engineering and Advanced Statistics. This dynamic of skill sets
produces a challenge in project methodology.


                                                  Analytics Methodologies
           Software Methodologies




Copyright © 2012 Accenture All rights reserved.
Wrapping up
Big Data is challenging current patterns of thought




   Cost-effective
                                                      Data
  computing and                                                           Big Data and Analytics
                                                   “explosion”
     storage
  Everything can be                               Data everywhere:     Resistance is futile
  stored                                          structured, unstru
                                                  ctured, other        Are the path to competitive
                                                                       advantage and create value
  Cheap large scale                               people’s
  computing power                                 data, geolocation    Compared to traditional
  readily available                               data                 analytics, they’re different; adapt
                                                                       or become irrelevant

                                                                       Open your data


Copyright © 2012 Accenture All rights reserved.
Wrapping up
How to get started
• Identify business processes that you could do
  more effectively with the help of big data and
  analytics


• Start with well-funded but small trials and proof-of-
  concepts, evolve towards a solid roadmap


• Open up your data, transformation towards a “data
  as a service” architecture


• Acquire or grow the needed technology and
  analytical skills

Copyright © 2012 Accenture All rights reserved.
Accenture Technology Vision
Strong advice on data for 2012




              http://bit.ly/accenturetechnologyvision2012

                                   Copyright © 2012 Accenture All rights reserved.

More Related Content

What's hot

Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects FailSense Corp
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieSunil Ranka
 
Big Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to ForesightBig Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to ForesightSunil Ranka
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analyticsThe Marketing Distillery
 
Big agendas for big data analytics projects
Big agendas for big data analytics projectsBig agendas for big data analytics projects
Big agendas for big data analytics projectsThe Marketing Distillery
 
Drive your business with predictive analytics
Drive your business with predictive analyticsDrive your business with predictive analytics
Drive your business with predictive analyticsThe Marketing Distillery
 
Self-Service Analytics Framework - Connected Brains 2018
Self-Service Analytics Framework - Connected Brains 2018Self-Service Analytics Framework - Connected Brains 2018
Self-Service Analytics Framework - Connected Brains 2018LoQutus
 
Data Analytics Strategy
Data Analytics StrategyData Analytics Strategy
Data Analytics StrategyeHealthCareers
 
What are the 6 elements of a project
What are the 6 elements of a projectWhat are the 6 elements of a project
What are the 6 elements of a projectRichardPierce28
 
Strategy session 5 - unlocking the data dividend - andy steer
Strategy   session 5 - unlocking the data dividend - andy steerStrategy   session 5 - unlocking the data dividend - andy steer
Strategy session 5 - unlocking the data dividend - andy steerAndy Steer
 
Top 10 trends in business intelligence for 2015
Top 10 trends in business intelligence for 2015Top 10 trends in business intelligence for 2015
Top 10 trends in business intelligence for 2015Tableau Software
 
Slides: Go Beyond Dashboards With the Next Generation of Analytics
Slides: Go Beyond Dashboards With the Next Generation of AnalyticsSlides: Go Beyond Dashboards With the Next Generation of Analytics
Slides: Go Beyond Dashboards With the Next Generation of AnalyticsDATAVERSITY
 
Modern Data Management
Modern Data ManagementModern Data Management
Modern Data ManagementSAP Technology
 
Valuing the data asset
Valuing the data assetValuing the data asset
Valuing the data assetBala Iyer
 
Data strategy in a Big Data world
Data strategy in a Big Data worldData strategy in a Big Data world
Data strategy in a Big Data worldCraig Milroy
 
Slides: The Automated Business Glossary
Slides: The Automated Business GlossarySlides: The Automated Business Glossary
Slides: The Automated Business GlossaryDATAVERSITY
 
Data and analytics strategy PUBLIC - ADHB 2021
Data and analytics strategy PUBLIC - ADHB 2021Data and analytics strategy PUBLIC - ADHB 2021
Data and analytics strategy PUBLIC - ADHB 2021Ali Khan
 
Importance of data analytics for business
Importance of data analytics for businessImportance of data analytics for business
Importance of data analytics for businessBranliticSocial
 

What's hot (20)

Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects Fail
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
Big Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to ForesightBig Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to Foresight
 
Big Data at a Glance
Big Data at a GlanceBig Data at a Glance
Big Data at a Glance
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analytics
 
Big agendas for big data analytics projects
Big agendas for big data analytics projectsBig agendas for big data analytics projects
Big agendas for big data analytics projects
 
Drive your business with predictive analytics
Drive your business with predictive analyticsDrive your business with predictive analytics
Drive your business with predictive analytics
 
Self-Service Analytics Framework - Connected Brains 2018
Self-Service Analytics Framework - Connected Brains 2018Self-Service Analytics Framework - Connected Brains 2018
Self-Service Analytics Framework - Connected Brains 2018
 
Data Analytics Strategy
Data Analytics StrategyData Analytics Strategy
Data Analytics Strategy
 
What are the 6 elements of a project
What are the 6 elements of a projectWhat are the 6 elements of a project
What are the 6 elements of a project
 
Strategy session 5 - unlocking the data dividend - andy steer
Strategy   session 5 - unlocking the data dividend - andy steerStrategy   session 5 - unlocking the data dividend - andy steer
Strategy session 5 - unlocking the data dividend - andy steer
 
Top 10 trends in business intelligence for 2015
Top 10 trends in business intelligence for 2015Top 10 trends in business intelligence for 2015
Top 10 trends in business intelligence for 2015
 
Slides: Go Beyond Dashboards With the Next Generation of Analytics
Slides: Go Beyond Dashboards With the Next Generation of AnalyticsSlides: Go Beyond Dashboards With the Next Generation of Analytics
Slides: Go Beyond Dashboards With the Next Generation of Analytics
 
Modern Data Management
Modern Data ManagementModern Data Management
Modern Data Management
 
Valuing the data asset
Valuing the data assetValuing the data asset
Valuing the data asset
 
Data strategy in a Big Data world
Data strategy in a Big Data worldData strategy in a Big Data world
Data strategy in a Big Data world
 
Slides: The Automated Business Glossary
Slides: The Automated Business GlossarySlides: The Automated Business Glossary
Slides: The Automated Business Glossary
 
Data and analytics strategy PUBLIC - ADHB 2021
Data and analytics strategy PUBLIC - ADHB 2021Data and analytics strategy PUBLIC - ADHB 2021
Data and analytics strategy PUBLIC - ADHB 2021
 
Importance of data analytics for business
Importance of data analytics for businessImportance of data analytics for business
Importance of data analytics for business
 

Viewers also liked

Nick Patience, Director Product Marketing & Strategy at Recommind - Big Data:...
Nick Patience, Director Product Marketing & Strategy at Recommind - Big Data:...Nick Patience, Director Product Marketing & Strategy at Recommind - Big Data:...
Nick Patience, Director Product Marketing & Strategy at Recommind - Big Data:...Global Business Events
 
Accenture Data Science Hackathon Presentation
Accenture Data Science Hackathon PresentationAccenture Data Science Hackathon Presentation
Accenture Data Science Hackathon Presentationams345
 
Big Data Everywhere Chicago: SQL on Hadoop
Big Data Everywhere Chicago: SQL on Hadoop Big Data Everywhere Chicago: SQL on Hadoop
Big Data Everywhere Chicago: SQL on Hadoop BigDataEverywhere
 
Self service data exploration with apache drill
Self service data exploration with apache drillSelf service data exploration with apache drill
Self service data exploration with apache drillMapR Technologies
 
Democritization of Data v2
Democritization of Data v2Democritization of Data v2
Democritization of Data v2Sandy Strauss
 
Tendencias Storage
Tendencias StorageTendencias Storage
Tendencias StorageFran Navarro
 
McKinsey Global Institute - Big data: The next frontier for innovation, compe...
McKinsey Global Institute - Big data: The next frontier for innovation, compe...McKinsey Global Institute - Big data: The next frontier for innovation, compe...
McKinsey Global Institute - Big data: The next frontier for innovation, compe...Path of the Blue Eye Project
 
The Reality Behind Industrial Augmented Reality Wearables
The Reality Behind Industrial Augmented Reality WearablesThe Reality Behind Industrial Augmented Reality Wearables
The Reality Behind Industrial Augmented Reality WearablesDAQRI
 
4 voc비정형분석 문종영
4 voc비정형분석 문종영4 voc비정형분석 문종영
4 voc비정형분석 문종영Saltlux Inc.
 
National Hackathon - Problem Statements
National Hackathon - Problem StatementsNational Hackathon - Problem Statements
National Hackathon - Problem StatementsZaki Haider
 
Accenture Greenlight Insights Conference November 1st 2016
Accenture Greenlight Insights Conference November 1st 2016Accenture Greenlight Insights Conference November 1st 2016
Accenture Greenlight Insights Conference November 1st 2016Sunny Webb
 
Creating Revenue from Customer Data
Creating Revenue from Customer DataCreating Revenue from Customer Data
Creating Revenue from Customer Dataaccenture
 
Big data and value creation
Big data and value creationBig data and value creation
Big data and value creationRichard Vidgen
 
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...Cynthia Saracco
 
Everything you want to know about sales growth but are afraid to ask
Everything you want to know about sales growth but are afraid to askEverything you want to know about sales growth but are afraid to ask
Everything you want to know about sales growth but are afraid to askMcKinsey on Marketing & Sales
 
“Haves” and “have-mores”: the accelerating digitization of the US economy
“Haves” and “have-mores”: the accelerating digitization of the US economy“Haves” and “have-mores”: the accelerating digitization of the US economy
“Haves” and “have-mores”: the accelerating digitization of the US economyMcKinsey on Marketing & Sales
 

Viewers also liked (20)

Nick Patience, Director Product Marketing & Strategy at Recommind - Big Data:...
Nick Patience, Director Product Marketing & Strategy at Recommind - Big Data:...Nick Patience, Director Product Marketing & Strategy at Recommind - Big Data:...
Nick Patience, Director Product Marketing & Strategy at Recommind - Big Data:...
 
Accenture Data Science Hackathon Presentation
Accenture Data Science Hackathon PresentationAccenture Data Science Hackathon Presentation
Accenture Data Science Hackathon Presentation
 
Big Data Everywhere Chicago: SQL on Hadoop
Big Data Everywhere Chicago: SQL on Hadoop Big Data Everywhere Chicago: SQL on Hadoop
Big Data Everywhere Chicago: SQL on Hadoop
 
Self service data exploration with apache drill
Self service data exploration with apache drillSelf service data exploration with apache drill
Self service data exploration with apache drill
 
Democritization of Data v2
Democritization of Data v2Democritization of Data v2
Democritization of Data v2
 
Tendencias Storage
Tendencias StorageTendencias Storage
Tendencias Storage
 
McKinsey Global Institute - Big data: The next frontier for innovation, compe...
McKinsey Global Institute - Big data: The next frontier for innovation, compe...McKinsey Global Institute - Big data: The next frontier for innovation, compe...
McKinsey Global Institute - Big data: The next frontier for innovation, compe...
 
The Reality Behind Industrial Augmented Reality Wearables
The Reality Behind Industrial Augmented Reality WearablesThe Reality Behind Industrial Augmented Reality Wearables
The Reality Behind Industrial Augmented Reality Wearables
 
4 voc비정형분석 문종영
4 voc비정형분석 문종영4 voc비정형분석 문종영
4 voc비정형분석 문종영
 
National Hackathon - Problem Statements
National Hackathon - Problem StatementsNational Hackathon - Problem Statements
National Hackathon - Problem Statements
 
Accenture Greenlight Insights Conference November 1st 2016
Accenture Greenlight Insights Conference November 1st 2016Accenture Greenlight Insights Conference November 1st 2016
Accenture Greenlight Insights Conference November 1st 2016
 
Creating Revenue from Customer Data
Creating Revenue from Customer DataCreating Revenue from Customer Data
Creating Revenue from Customer Data
 
Big data and value creation
Big data and value creationBig data and value creation
Big data and value creation
 
Sales Growth: Quotes from select interviews
Sales Growth: Quotes from select interviewsSales Growth: Quotes from select interviews
Sales Growth: Quotes from select interviews
 
Sales 2.0
Sales 2.0Sales 2.0
Sales 2.0
 
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
 
Everything you want to know about sales growth but are afraid to ask
Everything you want to know about sales growth but are afraid to askEverything you want to know about sales growth but are afraid to ask
Everything you want to know about sales growth but are afraid to ask
 
“Haves” and “have-mores”: the accelerating digitization of the US economy
“Haves” and “have-mores”: the accelerating digitization of the US economy“Haves” and “have-mores”: the accelerating digitization of the US economy
“Haves” and “have-mores”: the accelerating digitization of the US economy
 
Big Data and Advanced Analytics
Big Data and Advanced AnalyticsBig Data and Advanced Analytics
Big Data and Advanced Analytics
 
Digital Business - Accenture
Digital Business - AccentureDigital Business - Accenture
Digital Business - Accenture
 

Similar to Unlocking value in your (big) data

Simplifying Big Data Analytics for the Business
Simplifying Big Data Analytics for the BusinessSimplifying Big Data Analytics for the Business
Simplifying Big Data Analytics for the BusinessTeradata Aster
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureOdinot Stanislas
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data SolutionsMark Kromer
 
ActuateOne for Utility Analytics
ActuateOne for Utility AnalyticsActuateOne for Utility Analytics
ActuateOne for Utility Analyticskatsoulis
 
Exploring Big Data value for your business
Exploring Big Data value for your businessExploring Big Data value for your business
Exploring Big Data value for your businessAcunu
 
Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869Edgar Alejandro Villegas
 
Présentation on radoop
Présentation on radoop   Présentation on radoop
Présentation on radoop siliconsudipt
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendCaserta
 
Investigative Analytics- What's in a Data Scientists Toolbox
Investigative Analytics- What's in a Data Scientists ToolboxInvestigative Analytics- What's in a Data Scientists Toolbox
Investigative Analytics- What's in a Data Scientists ToolboxData Science London
 
Necto BI 3.0 presentation
Necto BI 3.0 presentationNecto BI 3.0 presentation
Necto BI 3.0 presentationstudio7design
 
OSC2012: Big Data Using Open Source: Netapp Project - Technical
OSC2012: Big Data Using Open Source: Netapp Project - TechnicalOSC2012: Big Data Using Open Source: Netapp Project - Technical
OSC2012: Big Data Using Open Source: Netapp Project - TechnicalAccenture the Netherlands
 
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)Ajay Ohri
 
Ibm big data ibm marriage of hadoop and data warehousing
Ibm big dataibm marriage of hadoop and data warehousingIbm big dataibm marriage of hadoop and data warehousing
Ibm big data ibm marriage of hadoop and data warehousing DataWorks Summit
 
Big Data World Forum
Big Data World ForumBig Data World Forum
Big Data World Forumbigdatawf
 
Innovate Analytics with Oracle Data Mining & Oracle R
Innovate Analytics with Oracle Data Mining & Oracle RInnovate Analytics with Oracle Data Mining & Oracle R
Innovate Analytics with Oracle Data Mining & Oracle RCapgemini
 
Augmented Analytics and Automation in the Age of the Data Scientist
Augmented Analytics and Automation in the Age of the Data ScientistAugmented Analytics and Automation in the Age of the Data Scientist
Augmented Analytics and Automation in the Age of the Data ScientistWhereScape
 

Similar to Unlocking value in your (big) data (20)

Simplifying Big Data Analytics for the Business
Simplifying Big Data Analytics for the BusinessSimplifying Big Data Analytics for the Business
Simplifying Big Data Analytics for the Business
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the Future
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
 
The New Enterprise Data Platform
The New Enterprise Data PlatformThe New Enterprise Data Platform
The New Enterprise Data Platform
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
ActuateOne for Utility Analytics
ActuateOne for Utility AnalyticsActuateOne for Utility Analytics
ActuateOne for Utility Analytics
 
Exploring Big Data value for your business
Exploring Big Data value for your businessExploring Big Data value for your business
Exploring Big Data value for your business
 
Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869
 
Présentation on radoop
Présentation on radoop   Présentation on radoop
Présentation on radoop
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
 
Investigative Analytics- What's in a Data Scientists Toolbox
Investigative Analytics- What's in a Data Scientists ToolboxInvestigative Analytics- What's in a Data Scientists Toolbox
Investigative Analytics- What's in a Data Scientists Toolbox
 
Barak regev
Barak regevBarak regev
Barak regev
 
Necto BI 3.0 presentation
Necto BI 3.0 presentationNecto BI 3.0 presentation
Necto BI 3.0 presentation
 
OSC2012: Big Data Using Open Source: Netapp Project - Technical
OSC2012: Big Data Using Open Source: Netapp Project - TechnicalOSC2012: Big Data Using Open Source: Netapp Project - Technical
OSC2012: Big Data Using Open Source: Netapp Project - Technical
 
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
 
Ibm big data ibm marriage of hadoop and data warehousing
Ibm big dataibm marriage of hadoop and data warehousingIbm big dataibm marriage of hadoop and data warehousing
Ibm big data ibm marriage of hadoop and data warehousing
 
Big Data World Forum
Big Data World ForumBig Data World Forum
Big Data World Forum
 
Innovate Analytics with Oracle Data Mining & Oracle R
Innovate Analytics with Oracle Data Mining & Oracle RInnovate Analytics with Oracle Data Mining & Oracle R
Innovate Analytics with Oracle Data Mining & Oracle R
 
Augmented Analytics and Automation in the Age of the Data Scientist
Augmented Analytics and Automation in the Age of the Data ScientistAugmented Analytics and Automation in the Age of the Data Scientist
Augmented Analytics and Automation in the Age of the Data Scientist
 
Sybase IQ Big Data
Sybase IQ Big DataSybase IQ Big Data
Sybase IQ Big Data
 

More from Oscar Renalias

Enterprise Open Source
Enterprise Open SourceEnterprise Open Source
Enterprise Open SourceOscar Renalias
 
DockerCon EU 2017 - Containers are not just for microservices
DockerCon EU 2017 - Containers are not just for microservicesDockerCon EU 2017 - Containers are not just for microservices
DockerCon EU 2017 - Containers are not just for microservicesOscar Renalias
 
Containers aren’t just for microservices – Containerizing Legacy Workloads
Containers aren’t just for microservices – Containerizing Legacy WorkloadsContainers aren’t just for microservices – Containerizing Legacy Workloads
Containers aren’t just for microservices – Containerizing Legacy WorkloadsOscar Renalias
 
50 production deployments a day, at least
50 production deployments a day, at least50 production deployments a day, at least
50 production deployments a day, at leastOscar Renalias
 
DockerCon 2016 - Structured Container Delivery
DockerCon 2016 - Structured Container DeliveryDockerCon 2016 - Structured Container Delivery
DockerCon 2016 - Structured Container DeliveryOscar Renalias
 
Containerize everything - Wildcardconf 2015
Containerize everything - Wildcardconf 2015Containerize everything - Wildcardconf 2015
Containerize everything - Wildcardconf 2015Oscar Renalias
 
Next-generation JavaScript - OpenSlava 2014
Next-generation JavaScript - OpenSlava 2014Next-generation JavaScript - OpenSlava 2014
Next-generation JavaScript - OpenSlava 2014Oscar Renalias
 
Node.js, for architects - OpenSlava 2013
Node.js, for architects - OpenSlava 2013Node.js, for architects - OpenSlava 2013
Node.js, for architects - OpenSlava 2013Oscar Renalias
 
OpenSlava 2013 - Dynamic Languages
OpenSlava 2013 - Dynamic LanguagesOpenSlava 2013 - Dynamic Languages
OpenSlava 2013 - Dynamic LanguagesOscar Renalias
 
Asynchronous web apps with the Play Framework 2.0
Asynchronous web apps with the Play Framework 2.0Asynchronous web apps with the Play Framework 2.0
Asynchronous web apps with the Play Framework 2.0Oscar Renalias
 
ScalaCheck Cookbook v1.0
ScalaCheck Cookbook v1.0ScalaCheck Cookbook v1.0
ScalaCheck Cookbook v1.0Oscar Renalias
 

More from Oscar Renalias (11)

Enterprise Open Source
Enterprise Open SourceEnterprise Open Source
Enterprise Open Source
 
DockerCon EU 2017 - Containers are not just for microservices
DockerCon EU 2017 - Containers are not just for microservicesDockerCon EU 2017 - Containers are not just for microservices
DockerCon EU 2017 - Containers are not just for microservices
 
Containers aren’t just for microservices – Containerizing Legacy Workloads
Containers aren’t just for microservices – Containerizing Legacy WorkloadsContainers aren’t just for microservices – Containerizing Legacy Workloads
Containers aren’t just for microservices – Containerizing Legacy Workloads
 
50 production deployments a day, at least
50 production deployments a day, at least50 production deployments a day, at least
50 production deployments a day, at least
 
DockerCon 2016 - Structured Container Delivery
DockerCon 2016 - Structured Container DeliveryDockerCon 2016 - Structured Container Delivery
DockerCon 2016 - Structured Container Delivery
 
Containerize everything - Wildcardconf 2015
Containerize everything - Wildcardconf 2015Containerize everything - Wildcardconf 2015
Containerize everything - Wildcardconf 2015
 
Next-generation JavaScript - OpenSlava 2014
Next-generation JavaScript - OpenSlava 2014Next-generation JavaScript - OpenSlava 2014
Next-generation JavaScript - OpenSlava 2014
 
Node.js, for architects - OpenSlava 2013
Node.js, for architects - OpenSlava 2013Node.js, for architects - OpenSlava 2013
Node.js, for architects - OpenSlava 2013
 
OpenSlava 2013 - Dynamic Languages
OpenSlava 2013 - Dynamic LanguagesOpenSlava 2013 - Dynamic Languages
OpenSlava 2013 - Dynamic Languages
 
Asynchronous web apps with the Play Framework 2.0
Asynchronous web apps with the Play Framework 2.0Asynchronous web apps with the Play Framework 2.0
Asynchronous web apps with the Play Framework 2.0
 
ScalaCheck Cookbook v1.0
ScalaCheck Cookbook v1.0ScalaCheck Cookbook v1.0
ScalaCheck Cookbook v1.0
 

Recently uploaded

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

Recently uploaded (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Unlocking value in your (big) data

  • 1. Unlocking Value in (Big) Data Oscar Renalias, Accenture oscar.renalias@accenture.com
  • 2. About the presenter Oscar Renalias Oscar is a Technology Architect and has been working at Accenture in the Helsinki office for the last 5 years. He holds a Bachelor’s Degree in Computer Science from the Universitat Politècnica de Catalunya (UPC), in Barcelona. Oscar currently belongs to the global organization within Accenture responsible for pushing technology innovation, working with selected new and emerging technologies together with clients to generate business value. Hadoop/Big Data is one of those areas. Oscar.renalias@accenture.com +358407725915 Copyright © 2012 Accenture All rights reserved.
  • 3. Agenda • Top 4 things about Big Data & Analytics • What is Big Data? • Big Data Analytics – what is it? • What does it contain? • How is it integrated? • How do we manage it? • What next? Copyright © 2012 Accenture All rights reserved.
  • 4. Top 4 things about Big Data Analytics Resistance is futile, you will be assimilated Competitive advantage It’s different Data wants to be open Copyright © 2012 Accenture All rights reserved.
  • 5. Data is growing It’s growing. Quickly. And it’s everywhere. Data stored in Exabytes (1018) 9000 7910 8000 7000 6000 5000 4000 3000 2000 1227 1000 130 0 2005 2010 2015 Source: IDC’s Digital Universe Study (sponsored by EMC), June 2011 Copyright © 2012 Accenture All rights reserved.
  • 6. New kinds of data Structured data vs. Unstructured data growth Complex, Unstructured Analysis gap Our ability Relational to analyze Source: An IDC White Paper - sponsored by EMC. As the Economy Contracts, the Digital Universe Expands. May 2009. . Copyright © 2012 Accenture All rights reserved.
  • 7. Big Data Technologies New technologies, new approaches Source: Wordle for Credit Suisse, Does Size Matter Only?, September 2011 Copyright © 2012 Accenture All rights reserved.
  • 8. Where do analysts see Big Data? Gartner’s Hype Cycle for Emerging Technologies 2011 Copyright © 2012 Accenture All rights reserved.
  • 9. MapReduce and Hadoop MapReduce revolutionized how we handle large amounts of data, Hadoop made it simple and affordable • Originally designed and first developed in Google as part of their efforts to more efficiently index the web • MapReduce splits input data into smaller chunk that can be processed in parallel • Scales linearly with number of nodes • Yahoo’s implementation of MapReduce • Open source, top-level project in the Apache Foundation • Designed to run on commodity software (Linux) and hardware (consumer-grade computers with directly attached storage) • Large ecosystem of additional components (both open source and commercial) Copyright © 2012 Accenture All rights reserved.
  • 10. Big Data Analytics What is it? Big Data Analytics is a shift in the mindset of how we think about analytics as an internal component to the organization Focuses on letting data be productized in a way that drives meaningful insights in a rapid fashion and innovation to exploit missed opportunities in areas previously unlooked… … providing a path to competitive advantage Copyright © 2012 Accenture All rights reserved.
  • 11. Big Data Analytics vs. traditional analytics Where do they differ? Technology Skills Processes & Organization Assumes Basic knowledge of “Siloed” data condensed, structured, an reporting and analysis organizations Traditional Analytics d feature rich datasets that tools, few specialized can be modeled: relational resources Only specific “views” of databases, data data visible across the warehouses, dashboards enterprise A stack of tools that Advanced Data is productized and enables an organization to analytical, mathematical shared across the Big Data Analytics build a framework that and statistical knowledge enterprise allows them to extract required to develop new useful features from a models – the data scientist Dedicated data large dataset to further organizations with well- understand how to model defined data management their data. processes and ownership Copyright © 2012 Accenture All rights reserved.
  • 12. Everything will be analyzed The three Vs Real-time Event In- processing, H memory, NoS adoop + QL, Event NoSQL processing, E DW Velocity Relational, ET Hadoop, ETL L Batch Volume Structured Unstructured Variety Source: IDC Copyright © 2012 Accenture All rights reserved.
  • 13. Big Data and Analytics in the Enterprise Many technology choices in a rapidly changing environment. Which one is right for you? Distributed Non-Relational Storage and Processing Big Data-Enabled Intelligence and Analysis Analytics-Focused Massively Parallel Processing (MPP) Software Platforms Hardware Optimized MPP Data Warehouses Distributed In-memory Cloud Copyright © 2012 Accenture All rights reserved.
  • 14. Technology Augmenting existing analytics with Big Data technologies Emerging Data Technologies Big Data Analytics Traditional Tools Copyright © 2012 Accenture All rights reserved.
  • 15. SAS-Hadoop integration An example of how traditional analytics tools are evolving to interoperate with Hadoop SAS/Access Interface to Hadoop • Enable SAS user to analyze data stored in Hadoop • Allow Hadoop data processing from SAS client software such as Data Integration Studio, Enterprise Guide and Enterprise Miner. • The Access Engine not only move data into and out of Hadoop, but you can also run data processing and have it “pushed-down” into Hadoop SAS Data Integration Studio Transformation for Hadoop • New sets of Hadoop transformations that enable DI studio user to load and unload data from Hadoop faster than Sqoop (Can connect to Oracle) • Perform “ETL-like” processing with Hive and Pig. • Hadoop specific scoring transform that enable models to be developed with Enterprise Miner to be deployed to Hadoop via DI Studio. Copyright © 2012 Accenture All rights reserved.
  • 16. The impact of Big Data Analytics on our landscapes Hybrid landscapes, where old and new converge Internal apps, customer- facing apps, mobile Analysis tools apps (SAS, SPSS, R, Data Services (REST, WS) Tableau) Relational DBs Pig Hive HBase MapReduce HDFS Enterprise DW ETL Real-time analytics Time Series Files Social Logs Web ERP CRM Copyright © 2012 Accenture All rights reserved.
  • 17. Data Science and the skill gap Closing the loop – it’s not just about technology skills Data science “The sexy job in the next 10 years will be statisticians” – Hal Varian, Chief Economist at Google Data scientists are the next-generation analytics professional, responsible for turning the data into insight Copyright © 2012 Accenture All rights reserved.
  • 18. Big Data Analytics Management How does Big Data Analytics Management Style Differ? In big data analytics resources generally have a hybrid cross between Software Engineering and Advanced Statistics. This dynamic of skill sets produces a challenge in project methodology. Analytics Methodologies Software Methodologies Copyright © 2012 Accenture All rights reserved.
  • 19. Wrapping up Big Data is challenging current patterns of thought Cost-effective Data computing and Big Data and Analytics “explosion” storage Everything can be Data everywhere: Resistance is futile stored structured, unstru ctured, other Are the path to competitive advantage and create value Cheap large scale people’s computing power data, geolocation Compared to traditional readily available data analytics, they’re different; adapt or become irrelevant Open your data Copyright © 2012 Accenture All rights reserved.
  • 20. Wrapping up How to get started • Identify business processes that you could do more effectively with the help of big data and analytics • Start with well-funded but small trials and proof-of- concepts, evolve towards a solid roadmap • Open up your data, transformation towards a “data as a service” architecture • Acquire or grow the needed technology and analytical skills Copyright © 2012 Accenture All rights reserved.
  • 21. Accenture Technology Vision Strong advice on data for 2012 http://bit.ly/accenturetechnologyvision2012 Copyright © 2012 Accenture All rights reserved.

Editor's Notes

  1. We’llbuildontheseduringthepresentation
  2. Thebadnews? It’snotgoing stop.Largeamounts of data bring a whole set of new challenges, howshouldwegoaboutthem?
  3. It’s not just growing volumes of existing data, it’s also:The recognition of value in previously throw-away dataNew kinds of “data exhaust” – by-product data generated as part of other processes, currently ignored or thrown awayNew kinds of “intentional” dataThe combination of previously separate data
  4. Big Data isnot so muchaboutthe “big”, butaboutfinding new waystohandle and analyze data thatwerenotpossiblebefore. There are a wholelot of new technologiesthat can be usedtodealwithbig data. Are familiar withall of them? Whichoneismostsuitableforyour case?
  5. Source: http://www.gartner.com/it/page.jsp?id=1763814
  6. Let’s stopfor a secondto look at thekeyenablertechnologies in Big Data.MapReduceOriginallydesigned and firstdeveloped in Google as part of theireffortsto more efficientlyindexthe webMapReduce splits input data into smaller chunk that can be processed in parallel.Scales linearly with number of nodes.HadoopOpen sourceimplementation of MapReduce, basedonGoogle’swhitepaper. Started in Yahoo, nowan top-levelproject in the Apache Foundation.Runsoncommodity software (Linux) and hardware (consumer-grade computerswithdirectlyattachedstorage)Ratherstraightforwardtoinstall and administrateLargeecosystem of additional open sourcecomponents: Pig, Hive, Oozie, FlumeLargeecosystem of commercialofferings (bothclosed and open source)
  7. Big Data AnalyticsTechnologyMultiple tools and technologies, sometimes for the same purpose: Hadoop, NoSQL databases, in-memory analytics)Time to information is critical to extract value from data sources that include mobile devices, RFID, the web and a growing list of automated sensory technologiestraditional data warehousing processes are too slow and limited in scalabilityability to converge data from multiple data sources, both structured and unstructureddecreased that time to informationSkillsThere’sonly so muchwe can do withexploratoryprocesses; theonlywaystoeffectivelyanalyzebig data requiremathematical and statisticalconceptswithwhich more traditionalanalysts are not familiarBusinessanalystsusedto be abletomanagewith Excel and basic SQL knowledge; nowwith data thatdoesnotfollowany particular model (it’sunstructuredafterall), thereis a needto look foranalysisthat are comfortablewithstatisticals and mathematicalconcepts, who are abletodevisetheirownmodelstofindpatters and insightswherethereapparentlywerenone.Processes & OrganizationData must be open and sharedacrosstheenterprise, supportedbyorganizationsthat “own” itData must be madeavailableacrosstheenterprise (i.e. wecan’tfindtrends in data thatwe do nothave)
  8. Source: “Big Data Analytics:Future Architectures,Skills and Roadmapsfor the CIO”, IDC 2012 (http://www.sas.com/resources/asset/BigDataAnalytics-FutureArchitectures-Skills-RomapsfortheCIO.pdf)Thethree Vs:Velocity, Volume and VarietyEverythingwill be analyzed, buthowmuch do wehave, howsoon do weneedit and howfast can we do it?
  9. MapReduce and Hadoop is currently seen as a low-level paradigm on top of which high-level tools must be built that are more intuitive and easy to use non-programmer types (business analysts, data scientists)Big Data technologies have not reached maturity yet and will continue to evolve over the next coming years. IT decision makers must still be realistic about the limits of what can be achieved via these technologies, sometimes waiting instead for the next generation of data technologies.There is also a lot of start-up activity happening (Scalar, MapR). Also, “traditional” large vendors do not want to be left behind: Microsoft SQL Server 2012 will be able to read and write data from Hadoop and HDFS or run Hadoop on Microsoft’s Azure PaaS, IBM has a version of InfoSphereBigInsights ready to be run on their SmartCloud solution and Oracle has recently introduced its own appliance of both a software and hardware solution with Hadoop and in-memory capabilities for handling large amounts of data.
  10. Big Data Analytics is anaugmentation to existinganalytical infrastructure that willallow to scale and drive insights beyond “current capabilities”So the question becomes:how do we add these capabilities to interoperate with traditional tools?
  11. The worlds of structured and unstructured data are rapidly converging. Architects and CIOs must find ways to manage this convergence and enable all forms of datamanagement to coexist, sometimes using bridge technologies, such as using Hadoop to process and import data into traditional systems in ways that wouldn’t be possible with just the RDBMS approach. “Hybrid” landscapes are justthat, where Hadoop isintegratedwithexisting data warehouses, traditionalrelationaldatabases and applications in a waythattheimpactontheenterpriseisminimized.The reality is that the EDW is evolving into a virtualized cloud ecosystem in which all of these database architectures can and will coexist in a pluggable “Big Data” storage layer alongside HDFS, HBase (Hadoop’s columnar database), Cassandra (a sibling Apache project that supports peer-to-peer persistence for complex event processing and other real-time applications), graph databases, and other “NoSQL” platforms behind an abstraction layer with MapReduce as its focusBig Data is not necessarily about its “bigness.” Very few organizations are going to need the type of scale that often makes the Big Data headlines. So, far from rendering the relational database obsolete, the new advances will be incorporated over time into the traditional databases, extending their performance.Adding Hadoop to the enterprise provides a cost effective place to store vast quantities of structured data from operational systems and combine it with both internal and externally sourced unstructured / semi-structured data.Also advanced MapReduce analytical methods can be used directly against that store, or through Hive / Hbase more traditional BI tools can be used to analyze the data.
  12. We’veseenthetools,butwhoisgoingtobuild, run and maintainallthis?TechnologyskillsTheemergence of big data isbasedon new technologiesthatrequireeither training orsourcingadditionalexpertiseData scienceTraditionalanalyticalmodels do notgenerallyscalewelltothetypical “big data-like” volumes; new ways of thinking are needed, waysthathelpfindwhatwewantedtofind as well as whatwedidnotknowwecouldfindData scientists are thenextgeneration of businessanalysts, withstrongstatisticalskills and abletothink “outside of the box” lookingfor new analyticalmodels.
  13. Agile software developmentmethodologies are one of thepotentialanswerstothis.A data strategyisrequired, butwithanapproachthatisaboutmodelingless and iterating more (justlike agile).
  14. Require new tools and technologyBig Data doesn’talwaysgetitright,withorwithoutanalytics (wacky iTunes and Spotifyrecommendations, weirdLinkedInsuggestions)Require new skills in yourworkforceResistanceisfutile – Big Data and analytics are inescapableTheycreatebusinessvalueforthebottom-lineItisthepathtocompetitiveadvantageBig Data isnotonlytransforming IT, itisalsotransformingbusinesses and industries: retailrecommendations, smart meter/gridanalytics
  15. How do wegetstartedwithallthis?Identifywhichbusinessprocessescouldbenefitthemostfromimprovedhandling and processing of largeamounts of data – what are thebusinessdecisionsthatwemakeeachday and thatwe’dliketomake more efficiently and more effectively?Productize data acrossthecompany, makeit a “firstclasscitizen” and providesomekind of data servicelayer so that data isaccessiblethroughouttheenterpriseIdentifytheskill and technology gaps and decide whethertogroworacquire new talent and technologyforthecompany (withorwithoutthecloud)Itisclearthatthisrequiresaninvestment; itisthepath forward, butitrequiresthatyou as decision-makersmake a commitmenttogrowbig data in yourcompany.
  16. Source: http://www.accenture.com/us-en/technology/technology-labs/Pages/insight-accenture-technology-vision-2012.aspx (http://bit.ly/accenturetechvision2012 and http://bit.ly/accenturetechnologyvision2012)