SlideShare uma empresa Scribd logo
1 de 40
<Insert Picture Here>




Oracle Big Data Appliance and Solutions
Jean-Pierre Dijcks
Hadoop World – Nov 8th, 2012
The following is intended to outline our general product
direction. It is intended for information purposes only, and
may not be incorporated into any contract. It is not a
commitment to deliver any material, code, or functionality,
and should not be relied upon in making purchasing
decisions.
The development, release, and timing of any features or
functionality described for Oracle’s products remain at the
sole discretion of Oracle.
Case: On-line Ads and Content

                                                       Real-time: Determine
 Low                                                     best ad to place
Latency             Lookup user                        on page for this user
                       profile
                Add user    NoSQL                Expert
             if not present   DB    Input into   System
                                                        Actual
  HDFS                                 Predictions
                                                         ads
                                       on browsing
          Web                                           served
          logs
                            High scale                                    Batch
                          data reductions          BI and
                                                            Billing
 NoSQL DB                                         Analytics

      Profiles
Agenda


• Big Data Technology
• Oracle Big Data Appliance
• Big Data Applications
• Summary
• Q&A
<Insert Picture Here>

Big Data Technology
Big Data: Infrastructure Requirements


   Acquire                       Organize                     Analyze


• Low, predictable Latency
• High Transaction Volume                                    • Deep Analytics
• Flexible Data Structures                                   • Agile Development
                                                             • Massive Scalability
                             • High Throughput
                                                             • Real Time Results
                             • In-Place Preparation
                             • All Data Sources/Structures
Divided Solution Spectrum
  Data
 Variety



            Distributed                                     NoSQL
Dynamic     File Systems                                     Flexible
                                 MapReduce
Schema                                                     Specialized
                                  Solutions
           Transaction                                     Developer
            (Key-Value)                                      Centric
              Stores




                                                              SQL
Schema     DBMS                   DBMS        Advanced       Trusted
                           ETL                 Analytics
            (OLTP)                 (DW)                      Secure
                                                           Administered



           Acquire         Organize             Analyze
Oracle Integrated Software Solution Stack

          Data
         Variety


                                                                 HDFS                                Hadoop                                    In-DB
                                                                                                                                              Analytics
    Dynamic                                                                                     Oracle Loader
    Schema                                                                                                                                     “R”
                                                     Oracle NoSQL                                for Hadoop                                   Mining
                                                           DB
                                                                                                                                               Text
                                                                                                Oracle
                                                                                            Data Integrator                                   Graph
                                                                                                                                              Spatial

                                                          Oracle                                                                        Oracle
    Schema                                               Database                                                                      Database    Oracle
                                                          (OLTP)                                                                        (DW)       BI EE



                                                       Acquire                                         Organize                                Analyze


8   Copyright © 2011, Oracle and/or its affiliates. All rights      Insert Information Protection Policy Classification from Slide 8
    reserved.
Oracle Engineered Solutions
        Data
       Variety

                                                             Big Data Appliance
                                                              HDFS         Hadoop                                                    In-DB
                                                        • Hadoop                                                                    Analytics
    Dynamic                                             • NoSQL Database Loader
                                                                       Oracle
    Schema                                              • Oracle Loader for hadoop
                                                     Oracle NoSQL                                                                    “R”
                                                                         for Hadoop
                                                        • Oracle Data Integrator
                                                           DB                                                                       Mining      Exalytics
                                                                          Oracle                                                     Text       • Speed of
                                                                     Data Integrator                                                Graph         Thought
                                                                                                                                    Spatial       Analytics
                                                          Oracle                            Oracle Exadata  Oracle
    Schema                                               Database                           • OLTP & DW Database                         Oracle
                                                          (OLTP)                                             (DW)
                                                                                            • Data Mining & Oracle R                     BI EE
                                                                                              • Semantics
                                                                                              • Spatial



                                                       Acquire                                      Organize                         Analyze

9   Copyright © 2011, Oracle and/or its affiliates. All rights   Insert Information Protection Policy Classification from Slide 8
    reserved.
Big Data Appliance
Batch Usage Model



           Oracle                        Oracle              Oracle
      Big Data Appliance                 Exadata            Exalytics




                           InfiniBand              InfiniBand




     Acquire         Organize           Analyze
Why build a Hadoop Appliance?




                                             • Time to Build?
                                             • Required Expertise?
                                             • Cost and Difficulty Maintaining?

11   Copyright © 2011, Oracle and/or its affiliates. All rights   Insert Information Protection Policy Classification from Slide 8
     reserved.
Oracle Big Data Appliance Hardware


•18 Sun X4270 M2 Servers
  – 48 GB memory per node = 864 GB memory
  – 12 Intel cores per node = 216 cores
  – 24 TB storage per node = 432 TB storage
•40 Gb p/sec InfiniBand
•10 Gb p/sec Ethernet
Big Data Appliance
  Cluster of industry standard servers for Hadoop and NoSQL Database
  • Focus on Scalability and Availability at low cost


InfiniBand Network
                                                 Compute and Storage
• Redundant 40Gb/s switches
                                             • 18 High-performance low-cost
• IB connectivity to Exadata
                                               servers acting as Hadoop
                                               nodes



10GigE Network                               •   24 TB Capacity per node
• 8 10GigE ports                             •   2 6-core CPUs per node
• Datacenter connectivity                    •   Hadoop triple replication
                                             •   NoSQL Database triple
                                                 replication
Scale Out to Infinity




         Scale out by connecting racks
         to each other using Infiniband
          • Expand up to eight racks without
            additional switches
          • Scale beyond eight racks by adding
            an additional switch
Oracle Big Data Appliance Software

  •Oracle Linux 5.6
  •Java Hotspot VM
  •Apache Hadoop Distribution v0.20.x
  •R Distribution
  •Oracle NoSQL Database Enterprise
   Edition
  •Oracle Data Integrator Application
   Adapter for Hadoop
  •Oracle Loader for Hadoop
Why Open-Source Apache Hadoop?


• Fast evolution in critical features
  • Built by the Hadoop experts in the community
  • Practical instead of esoteric
  • Focus on what is needed for large clusters
• Proven at very large scale
  • In production at all the large consumers of Hadoop
  • Extremely stable in those environments
  • Well-understood by practitioners
Software Layout
           • Node 1:
             • M: Name Node, Balancer & HBase Master
             • S: HDFS Data Node, NoSQL DB Storage Node
           • Node 2:
             • M: Secondary Name Node, Management,
               Zookeeper, MySQL Slave
             • S: HDFS Data Node, NoSQL DB Storage Node
           • Node 3:
             • M: JobTracker, MySQL Master, ODI Agent,
               Hive Server
             • S: HDFS Data Node, NoSQL DB Storage Node
           • Node 4 – 18:
             • S: HDFS Data Nodes, Task Tracker, HBase
               Region Server, NoSQL DB Storage Nodes
             • Your MapReduce runs here!
Big Data Appliance
  Big Data for the Enterprise


• Optimized and Complete
  • Everything you need to store and integrate
    your lower information density data
• Integrated with Oracle Exadata
  • Analyze all your data
• Easy to Deploy
  • Risk Free, Quick Installation and Setup
• Single Vendor Support
  • Full Oracle support for the entire system and
    software set
<Insert Picture Here>

Oracle NoSQL Database
Key-Value Store Workloads

• Large dynamic schema based data repositories

• Data capture
  • Web applications
  • Online retail
  • Sensor/statistics/network capture/Mobile Devices
• Data services
  •   Scalable authentication
  •   Real-time communication (MMS, SMS, routing)
  •   Personalization / Localization
  •   Social Networks
Oracle NoSQL DB
  A distributed, scalable key-value database

• Simple Data Model
   • Key-value pair with major+sub-key paradigm
   • Read/insert/update/delete operations                    Application      Application

• Scalability                                              NoSQLDB Driver   NoSQLDB Driver

   • Dynamic data partitioning and distribution
   • Optimized data access via intelligent driver
• High availability
   • One or more replicas
   • Disaster recovery through location of replicas
   • Resilient to partition master failures
   • No single point of failure
                                                      Storage Nodes             Storage Nodes
• Transparent load balancing                                                     Data Center B
                                                       Data Center A
   • Reads from master or replicas
   • Driver is network topology & latency aware
Resolving a Request
     Operation + Key[M,m] + Value + Transaction Policy
                                                                 Client


Hash Major Key to determine
Partition id

   Use Partition Map to map Partition                    • Operation result
   id to a Rep Group                                     • New Partition Map
                                                         • RepNodeStorageTable
      Use State Table to determine eligible              information
      Storage Node(s) within Rep Group

         Use Load Balancer to select best
         eligible Rep Node

             Contact Rep Node directly
ACID Transactions
Transaction Policy                             Transaction Policy
Write Durability                               Read Consistency
• Configurable per-operation,                  • Configurable per-operation,
  application can set defaults                   application can set defaults
• Write Transaction Durability consists        • Read Consistency specified as
  of both
                                                 Absolute, Time-based, Version or
    a) Sync policy (on Master and               None
       Replica)
                                                 • Absolute  Read from the master
      • Sync – force to disk
      • Write No Sync – force to OS              • Time-based  Read from any
        buffer                                     replica that is within <time-
      • No Sync – write to local log buffer,       interval> of master or better
        flush when convenient                    • Version  Read from any replica
    b) Replica Acknowledgement Policy              that is current with <transaction-
      • All                                        token> or higher
      • Simple Majority                          • None  Read from any replica
      • None
Oracle NoSQL DB Differentiation

• Commercial Grade Software and Support
  • General-purpose
  • Reliable – Based on proven Berkeley DB JE HA
  • Easy to install and configure
• Scalable throughput, bounded latency
• Simple Programming and Operational Model
  • Simple Major + Sub key and Value data structure
  • ACID transactions
  • Configurable consistency & durability
• Easy Management
  • Web-based console, API accessible
  • Manages and Monitors: Topology; Load; Performance; Events; Alerts
• Completes Oracle large scale data storage offerings
Try NoSQL Database on OTN




 Oracle NoSQL Database:
 • Community Edition is available as a software
   only distribution
 • Enterprise Edition is available as a separately
   licensable product or as part of Big Data Appliance
<Insert Picture Here>

Oracle Loader for Hadoop
Oracle Loader for Hadoop Features

     • Load data into a partitioned or non-partitioned table
           – Single level, composite or interval partitioned table
           – Support for scalar datatypes of Oracle Database
           – Load into Oracle Database 11g Release 2


     • Runs as a Hadoop job and supports standard options

     • Pre-partitions and sorts data on Hadoop

     • Online and offline load modes




27   Copyright © 2011, Oracle and/or its affiliates. All rights
     reserved.
Oracle Loader for Hadoop

INPUT
  1
                  MAP                                             MAP
                                                                                           ORACLE LOADER FOR HADOOP

                  MAP                                   REDUCE                    REDUCE

                                                                                            MAP

                  MAP                                   REDUCE    MAP                                         REDUCE

                                                                                            MAP

                  MAP                                   REDUCE                    REDUCE                      REDUCE
                                                                                                    SHUFFLE
                                                                                            MAP      /SORT
                                    SHUFFLE
                  MAP                /SORT                        MAP




                  MAP                                             MAP             REDUCE



                  MAP                                   REDUCE                              MAP               REDUCE



                  MAP                                   REDUCE    MAP                       MAP               REDUCE


                                    SHUFFLE                                                         SHUFFLE
                  MAP                /SORT                                                  MAP      /SORT    REDUCE
                                                                        SHUFFLE
                                                                         /SORT
INPUT
  2




28   Copyright © 2011, Oracle and/or its affiliates. All rights
     reserved.
Oracle Loader for Hadoop: Online Option

                   Read target table metadata                             Perform
                                                                        ORACLE LOADER FOR HADOOP                 Connect to the database
                    from the database                                      partitioning, sorting, and             from reducer nodes, load
                                                                           data conversion                        into database partitions in
                                                                                                                  parallel
                                                                  MAP

                                                                                                        REDUCE

                                                                  MAP

                                                                                                        REDUCE
                                                                                   SHUFFLE
                                                                  MAP
                                                                                    /SORT




                                                                  MAP                                   REDUCE



                                                                  MAP                                   REDUCE



                                                                                   SHUFFLE
                                                                  MAP                                   REDUCE
                                                                                    /SORT




29   Copyright © 2011, Oracle and/or its affiliates. All rights
     reserved.
Oracle Loader for Hadoop: Offline Option

                   Read target table metadata                             Perform
                                                                        ORACLE LOADER FOR HADOOP
                                                                                                                 Write from reducer nodes to
                    from the database                                      partitioning, sorting, and             Oracle Data Pump files
                                                                           data conversion

                                                                  MAP
                                                                                                                                   Import into the database in
                                                                                                        REDUCE                       parallel using external table
                                                                  MAP                                                                mechanism

                                                                                                        REDUCE
                                                                                   SHUFFLE
                                                                  MAP
                                                                                    /SORT




                                                                  MAP                                   REDUCE



                                                                  MAP                                   REDUCE



                                                                                   SHUFFLE
                                                                  MAP                                   REDUCE
                                                                                    /SORT




30   Copyright © 2011, Oracle and/or its affiliates. All rights
     reserved.
Oracle Loader for Hadoop Advantages


     • Offload database server processing to Hadoop:
           – Convert input data to final database format
           – Compute table partition for row
           – Sort rows by primary key within a table partition
     • Generate binary datapump files
     • Balance partition groups across reducers




31   Copyright © 2011, Oracle and/or its affiliates. All rights
     reserved.
Input and Output Formats

Input Formats                                                      Output Formats
                                                                   Online Mode
• Delimited text                                                   • Load directly from Hadoop nodes to
                                                                     Oracle database
• Hive tables                                                        – JDBC
  – Managed and external tables                                      – Parallel direct path
  – Native and non-native tables

                                                                   Offline Mode
• Write your own input format                                      • Datapump format
                                                                     – Create binary files for external tables
                                                                     – Import data into the database from the
                                                                       external table with a SQL statement
                                                                   • CSV, delimited text
                                                                     – Load through SQL*Loader or external
                                                                       table mechanism

 32   Copyright © 2011, Oracle and/or its affiliates. All rights
      reserved.
Selection Output Option for Use Case

     Oracle Loader for Hadoop
                                                                  Use Case Characteristics
     Output Option
     Online load with JDBC                                        The simplest use case for non
                                                                  partitioned tables
     Online load with Direct Path                                 Fast online load for partitioned
                                                                  tables
     Offline load with datapump files                             Fastest load method for external
                                                                  tables
     On Oracle Big Data Appliance                                 Leave data on HDFS
     Direct HDFS                                                  Parallel access from database
                                                                  Import into database when
                                                                  needed



33   Copyright © 2011, Oracle and/or its affiliates. All rights
     reserved.
Invoking Oracle Loader for Hadoop


     • Command line
           $ hadoop jar oraloader.jar oracle.hadoop.loader.OraLoader
                      -libjars <library jar files>
                      -D <configuration properties>



            $HADOOP_HOME/bin/hadoop jar oraloader.jar oracle.hadoop.loader.oraLoader
               -libjars avro-1.4.1.jar, commons-math-2.2.jar
               -conf connection.xml
               -D mapreduce.inputformat.class=oracle.hadoop.loader.lib.input.DelimitedTextInputFormat
               -D mapreduce.outputformat.class=oracle.hadoop.loader.lib.output.JDBCOutputFormat




34   Copyright © 2011, Oracle and/or its affiliates. All rights
     reserved.
Automate Usage of Oracle Loader for Hadoop
     Oracle Data Integrator (ODI)

     • ODI has knowledge modules to
           – Generate data transformation code to run on Hive/Hadoop
           – Invoke Oracle Loader for Hadoop


     • Use the drag-and-drop interface in ODI to
           – Include invocation of Oracle Loader for Hadoop in any ODI
             packaged flow




36   Copyright © 2011, Oracle and/or its affiliates. All rights
     reserved.
37   Copyright © 2011, Oracle and/or its affiliates. All rights
     reserved.
<Insert Picture Here>

Summary
Big Data Appliance
  Big Data for the Enterprise


• Optimized and Complete
  • Everything you need to store and integrate your lower
    information density data
• Integrated with Oracle Exadata
  • Analyze all your data
• Easy to Deploy
  • Risk Free, Quick Installation and Setup
• Single Vendor Support
  • Full Oracle support for the entire system and software
    set
Big Data Appliance and Exadata
Big Data for the Enterprise


     NoSQL DB
                     
        HDFS
                     
      Hadoop
                     
      RDBMS          
Questions

Mais conteúdo relacionado

Mais procurados

IBM Power Systems Announcement Update
IBM Power Systems Announcement UpdateIBM Power Systems Announcement Update
IBM Power Systems Announcement UpdateDavid Spurway
 
Trusted advisory on technology comparison --exadata, hana, db2
Trusted advisory on technology comparison --exadata, hana, db2Trusted advisory on technology comparison --exadata, hana, db2
Trusted advisory on technology comparison --exadata, hana, db2Ajay Kumar Uppal
 
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...Amr Awadallah
 
Accelerating Business Intelligence Solutions with Microsoft Azure pass
Accelerating Business Intelligence Solutions with Microsoft Azure   passAccelerating Business Intelligence Solutions with Microsoft Azure   pass
Accelerating Business Intelligence Solutions with Microsoft Azure passJason Strate
 
Modern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemModern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemJames Serra
 
Oracle Database Appliance, ODA, X7-2 portfolio.
Oracle Database Appliance, ODA, X7-2 portfolio.Oracle Database Appliance, ODA, X7-2 portfolio.
Oracle Database Appliance, ODA, X7-2 portfolio.Daryll Whyte
 
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data avanttic Consultoría Tecnológica
 
9/ IBM POWER @ OPEN'16
9/ IBM POWER @ OPEN'169/ IBM POWER @ OPEN'16
9/ IBM POWER @ OPEN'16Kangaroot
 
Understanding the IBM Power Systems Advantage
Understanding the IBM Power Systems AdvantageUnderstanding the IBM Power Systems Advantage
Understanding the IBM Power Systems AdvantageIBM Power Systems
 
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15Dave Segleau
 
Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...
Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...
Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...Charlie Berger
 
Open Innovation with Power Systems
Open Innovation with Power Systems Open Innovation with Power Systems
Open Innovation with Power Systems IBM Power Systems
 
Red hat ceph storage customer presentation
Red hat ceph storage customer presentationRed hat ceph storage customer presentation
Red hat ceph storage customer presentationRodrigo Missiaggia
 
Nordic infrastructure Conference 2017 - SQL Server on Linux Overview
Nordic infrastructure Conference 2017 - SQL Server on Linux OverviewNordic infrastructure Conference 2017 - SQL Server on Linux Overview
Nordic infrastructure Conference 2017 - SQL Server on Linux OverviewTravis Wright
 
EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator
EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics AcceleratorEDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator
EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics AcceleratorDaniel Martin
 
Machine Learning for z/OS
Machine Learning for z/OSMachine Learning for z/OS
Machine Learning for z/OSCuneyt Goksu
 
IBM Power leading Cognitive Systems
IBM Power leading Cognitive SystemsIBM Power leading Cognitive Systems
IBM Power leading Cognitive SystemsHugo Blanco
 
SUSE, Hadoop and Big Data Update. Stephen Mogg, SUSE UK
SUSE, Hadoop and Big Data Update. Stephen Mogg, SUSE UKSUSE, Hadoop and Big Data Update. Stephen Mogg, SUSE UK
SUSE, Hadoop and Big Data Update. Stephen Mogg, SUSE UKhuguk
 

Mais procurados (20)

IBM Power Systems Announcement Update
IBM Power Systems Announcement UpdateIBM Power Systems Announcement Update
IBM Power Systems Announcement Update
 
Trusted advisory on technology comparison --exadata, hana, db2
Trusted advisory on technology comparison --exadata, hana, db2Trusted advisory on technology comparison --exadata, hana, db2
Trusted advisory on technology comparison --exadata, hana, db2
 
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
 
Accelerating Business Intelligence Solutions with Microsoft Azure pass
Accelerating Business Intelligence Solutions with Microsoft Azure   passAccelerating Business Intelligence Solutions with Microsoft Azure   pass
Accelerating Business Intelligence Solutions with Microsoft Azure pass
 
Modern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemModern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform System
 
Oracle Database Appliance, ODA, X7-2 portfolio.
Oracle Database Appliance, ODA, X7-2 portfolio.Oracle Database Appliance, ODA, X7-2 portfolio.
Oracle Database Appliance, ODA, X7-2 portfolio.
 
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
 
9/ IBM POWER @ OPEN'16
9/ IBM POWER @ OPEN'169/ IBM POWER @ OPEN'16
9/ IBM POWER @ OPEN'16
 
IBM Power8 announce
IBM Power8 announceIBM Power8 announce
IBM Power8 announce
 
Understanding the IBM Power Systems Advantage
Understanding the IBM Power Systems AdvantageUnderstanding the IBM Power Systems Advantage
Understanding the IBM Power Systems Advantage
 
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
 
Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...
Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...
Oracle’s Advanced Analytics & Machine Learning 12.2c New Features & Road Map;...
 
Developer day v2
Developer day v2Developer day v2
Developer day v2
 
Open Innovation with Power Systems
Open Innovation with Power Systems Open Innovation with Power Systems
Open Innovation with Power Systems
 
Red hat ceph storage customer presentation
Red hat ceph storage customer presentationRed hat ceph storage customer presentation
Red hat ceph storage customer presentation
 
Nordic infrastructure Conference 2017 - SQL Server on Linux Overview
Nordic infrastructure Conference 2017 - SQL Server on Linux OverviewNordic infrastructure Conference 2017 - SQL Server on Linux Overview
Nordic infrastructure Conference 2017 - SQL Server on Linux Overview
 
EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator
EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics AcceleratorEDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator
EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator
 
Machine Learning for z/OS
Machine Learning for z/OSMachine Learning for z/OS
Machine Learning for z/OS
 
IBM Power leading Cognitive Systems
IBM Power leading Cognitive SystemsIBM Power leading Cognitive Systems
IBM Power leading Cognitive Systems
 
SUSE, Hadoop and Big Data Update. Stephen Mogg, SUSE UK
SUSE, Hadoop and Big Data Update. Stephen Mogg, SUSE UKSUSE, Hadoop and Big Data Update. Stephen Mogg, SUSE UK
SUSE, Hadoop and Big Data Update. Stephen Mogg, SUSE UK
 

Semelhante a Hadoop World 2011: Unlocking the Value of Big Data with Oracle - Jean-Pierre Dijcks - Oracle

Big Data Real Time Applications
Big Data Real Time ApplicationsBig Data Real Time Applications
Big Data Real Time ApplicationsDataWorks Summit
 
Oracle Advanced Analytics
Oracle Advanced AnalyticsOracle Advanced Analytics
Oracle Advanced Analyticsaghosh_us
 
Business Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache HadoopBusiness Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache HadoopCloudera, Inc.
 
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...Cloudera, Inc.
 
The IBM Netezza datawarehouse appliance
The IBM Netezza datawarehouse applianceThe IBM Netezza datawarehouse appliance
The IBM Netezza datawarehouse applianceIBM Danmark
 
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Cloudera, Inc.
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data SolutionsMark Kromer
 
Processing Big Data
Processing Big DataProcessing Big Data
Processing Big Datacwensel
 
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQL
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQLChoosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQL
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQLScaleBase
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendCaserta
 
A unified data modeler in the world of big data
A unified data modeler in the world of big dataA unified data modeler in the world of big data
A unified data modeler in the world of big dataWilliam Luk
 
From the Big Data keynote at InCSIghts 2012
From the Big Data keynote at InCSIghts 2012From the Big Data keynote at InCSIghts 2012
From the Big Data keynote at InCSIghts 2012Anand Deshpande
 
Big data hadoop ecosystem and nosql
Big data hadoop ecosystem and nosqlBig data hadoop ecosystem and nosql
Big data hadoop ecosystem and nosqlKhanderao Kand
 
Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013Jonathan Seidman
 

Semelhante a Hadoop World 2011: Unlocking the Value of Big Data with Oracle - Jean-Pierre Dijcks - Oracle (20)

Big Data Real Time Applications
Big Data Real Time ApplicationsBig Data Real Time Applications
Big Data Real Time Applications
 
Oracle Advanced Analytics
Oracle Advanced AnalyticsOracle Advanced Analytics
Oracle Advanced Analytics
 
Business Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache HadoopBusiness Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache Hadoop
 
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
 
The IBM Netezza datawarehouse appliance
The IBM Netezza datawarehouse applianceThe IBM Netezza datawarehouse appliance
The IBM Netezza datawarehouse appliance
 
Sql no sql
Sql no sqlSql no sql
Sql no sql
 
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
 
Processing Big Data
Processing Big DataProcessing Big Data
Processing Big Data
 
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQL
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQLChoosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQL
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQL
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
 
A unified data modeler in the world of big data
A unified data modeler in the world of big dataA unified data modeler in the world of big data
A unified data modeler in the world of big data
 
Cloud computing era
Cloud computing eraCloud computing era
Cloud computing era
 
From the Big Data keynote at InCSIghts 2012
From the Big Data keynote at InCSIghts 2012From the Big Data keynote at InCSIghts 2012
From the Big Data keynote at InCSIghts 2012
 
Big data hadoop ecosystem and nosql
Big data hadoop ecosystem and nosqlBig data hadoop ecosystem and nosql
Big data hadoop ecosystem and nosql
 
Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Drill njhug -19 feb2013
Drill njhug -19 feb2013Drill njhug -19 feb2013
Drill njhug -19 feb2013
 
Cosbench apac
Cosbench apacCosbench apac
Cosbench apac
 

Mais de Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 

Mais de Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 

Último

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 

Último (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 

Hadoop World 2011: Unlocking the Value of Big Data with Oracle - Jean-Pierre Dijcks - Oracle

  • 1. <Insert Picture Here> Oracle Big Data Appliance and Solutions Jean-Pierre Dijcks Hadoop World – Nov 8th, 2012
  • 2. The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remain at the sole discretion of Oracle.
  • 3. Case: On-line Ads and Content Real-time: Determine Low best ad to place Latency Lookup user on page for this user profile Add user NoSQL Expert if not present DB Input into System Actual HDFS Predictions ads on browsing Web served logs High scale Batch data reductions BI and Billing NoSQL DB Analytics Profiles
  • 4. Agenda • Big Data Technology • Oracle Big Data Appliance • Big Data Applications • Summary • Q&A
  • 5. <Insert Picture Here> Big Data Technology
  • 6. Big Data: Infrastructure Requirements Acquire Organize Analyze • Low, predictable Latency • High Transaction Volume • Deep Analytics • Flexible Data Structures • Agile Development • Massive Scalability • High Throughput • Real Time Results • In-Place Preparation • All Data Sources/Structures
  • 7. Divided Solution Spectrum Data Variety Distributed NoSQL Dynamic File Systems Flexible MapReduce Schema Specialized Solutions Transaction Developer (Key-Value) Centric Stores SQL Schema DBMS DBMS Advanced Trusted ETL Analytics (OLTP) (DW) Secure Administered Acquire Organize Analyze
  • 8. Oracle Integrated Software Solution Stack Data Variety HDFS Hadoop In-DB Analytics Dynamic Oracle Loader Schema “R” Oracle NoSQL for Hadoop Mining DB Text Oracle Data Integrator Graph Spatial Oracle Oracle Schema Database Database Oracle (OLTP) (DW) BI EE Acquire Organize Analyze 8 Copyright © 2011, Oracle and/or its affiliates. All rights Insert Information Protection Policy Classification from Slide 8 reserved.
  • 9. Oracle Engineered Solutions Data Variety Big Data Appliance HDFS Hadoop In-DB • Hadoop Analytics Dynamic • NoSQL Database Loader Oracle Schema • Oracle Loader for hadoop Oracle NoSQL “R” for Hadoop • Oracle Data Integrator DB Mining Exalytics Oracle Text • Speed of Data Integrator Graph Thought Spatial Analytics Oracle Oracle Exadata Oracle Schema Database • OLTP & DW Database Oracle (OLTP) (DW) • Data Mining & Oracle R BI EE • Semantics • Spatial Acquire Organize Analyze 9 Copyright © 2011, Oracle and/or its affiliates. All rights Insert Information Protection Policy Classification from Slide 8 reserved.
  • 10. Big Data Appliance Batch Usage Model Oracle Oracle Oracle Big Data Appliance Exadata Exalytics InfiniBand InfiniBand Acquire Organize Analyze
  • 11. Why build a Hadoop Appliance? • Time to Build? • Required Expertise? • Cost and Difficulty Maintaining? 11 Copyright © 2011, Oracle and/or its affiliates. All rights Insert Information Protection Policy Classification from Slide 8 reserved.
  • 12. Oracle Big Data Appliance Hardware •18 Sun X4270 M2 Servers – 48 GB memory per node = 864 GB memory – 12 Intel cores per node = 216 cores – 24 TB storage per node = 432 TB storage •40 Gb p/sec InfiniBand •10 Gb p/sec Ethernet
  • 13. Big Data Appliance Cluster of industry standard servers for Hadoop and NoSQL Database • Focus on Scalability and Availability at low cost InfiniBand Network Compute and Storage • Redundant 40Gb/s switches • 18 High-performance low-cost • IB connectivity to Exadata servers acting as Hadoop nodes 10GigE Network • 24 TB Capacity per node • 8 10GigE ports • 2 6-core CPUs per node • Datacenter connectivity • Hadoop triple replication • NoSQL Database triple replication
  • 14. Scale Out to Infinity Scale out by connecting racks to each other using Infiniband • Expand up to eight racks without additional switches • Scale beyond eight racks by adding an additional switch
  • 15. Oracle Big Data Appliance Software •Oracle Linux 5.6 •Java Hotspot VM •Apache Hadoop Distribution v0.20.x •R Distribution •Oracle NoSQL Database Enterprise Edition •Oracle Data Integrator Application Adapter for Hadoop •Oracle Loader for Hadoop
  • 16. Why Open-Source Apache Hadoop? • Fast evolution in critical features • Built by the Hadoop experts in the community • Practical instead of esoteric • Focus on what is needed for large clusters • Proven at very large scale • In production at all the large consumers of Hadoop • Extremely stable in those environments • Well-understood by practitioners
  • 17. Software Layout • Node 1: • M: Name Node, Balancer & HBase Master • S: HDFS Data Node, NoSQL DB Storage Node • Node 2: • M: Secondary Name Node, Management, Zookeeper, MySQL Slave • S: HDFS Data Node, NoSQL DB Storage Node • Node 3: • M: JobTracker, MySQL Master, ODI Agent, Hive Server • S: HDFS Data Node, NoSQL DB Storage Node • Node 4 – 18: • S: HDFS Data Nodes, Task Tracker, HBase Region Server, NoSQL DB Storage Nodes • Your MapReduce runs here!
  • 18. Big Data Appliance Big Data for the Enterprise • Optimized and Complete • Everything you need to store and integrate your lower information density data • Integrated with Oracle Exadata • Analyze all your data • Easy to Deploy • Risk Free, Quick Installation and Setup • Single Vendor Support • Full Oracle support for the entire system and software set
  • 20. Key-Value Store Workloads • Large dynamic schema based data repositories • Data capture • Web applications • Online retail • Sensor/statistics/network capture/Mobile Devices • Data services • Scalable authentication • Real-time communication (MMS, SMS, routing) • Personalization / Localization • Social Networks
  • 21. Oracle NoSQL DB A distributed, scalable key-value database • Simple Data Model • Key-value pair with major+sub-key paradigm • Read/insert/update/delete operations Application Application • Scalability NoSQLDB Driver NoSQLDB Driver • Dynamic data partitioning and distribution • Optimized data access via intelligent driver • High availability • One or more replicas • Disaster recovery through location of replicas • Resilient to partition master failures • No single point of failure Storage Nodes Storage Nodes • Transparent load balancing Data Center B Data Center A • Reads from master or replicas • Driver is network topology & latency aware
  • 22. Resolving a Request Operation + Key[M,m] + Value + Transaction Policy Client Hash Major Key to determine Partition id Use Partition Map to map Partition • Operation result id to a Rep Group • New Partition Map • RepNodeStorageTable Use State Table to determine eligible information Storage Node(s) within Rep Group Use Load Balancer to select best eligible Rep Node Contact Rep Node directly
  • 23. ACID Transactions Transaction Policy Transaction Policy Write Durability Read Consistency • Configurable per-operation, • Configurable per-operation, application can set defaults application can set defaults • Write Transaction Durability consists • Read Consistency specified as of both Absolute, Time-based, Version or a) Sync policy (on Master and None Replica) • Absolute  Read from the master • Sync – force to disk • Write No Sync – force to OS • Time-based  Read from any buffer replica that is within <time- • No Sync – write to local log buffer, interval> of master or better flush when convenient • Version  Read from any replica b) Replica Acknowledgement Policy that is current with <transaction- • All token> or higher • Simple Majority • None  Read from any replica • None
  • 24. Oracle NoSQL DB Differentiation • Commercial Grade Software and Support • General-purpose • Reliable – Based on proven Berkeley DB JE HA • Easy to install and configure • Scalable throughput, bounded latency • Simple Programming and Operational Model • Simple Major + Sub key and Value data structure • ACID transactions • Configurable consistency & durability • Easy Management • Web-based console, API accessible • Manages and Monitors: Topology; Load; Performance; Events; Alerts • Completes Oracle large scale data storage offerings
  • 25. Try NoSQL Database on OTN Oracle NoSQL Database: • Community Edition is available as a software only distribution • Enterprise Edition is available as a separately licensable product or as part of Big Data Appliance
  • 26. <Insert Picture Here> Oracle Loader for Hadoop
  • 27. Oracle Loader for Hadoop Features • Load data into a partitioned or non-partitioned table – Single level, composite or interval partitioned table – Support for scalar datatypes of Oracle Database – Load into Oracle Database 11g Release 2 • Runs as a Hadoop job and supports standard options • Pre-partitions and sorts data on Hadoop • Online and offline load modes 27 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
  • 28. Oracle Loader for Hadoop INPUT 1 MAP MAP ORACLE LOADER FOR HADOOP MAP REDUCE REDUCE MAP MAP REDUCE MAP REDUCE MAP MAP REDUCE REDUCE REDUCE SHUFFLE MAP /SORT SHUFFLE MAP /SORT MAP MAP MAP REDUCE MAP REDUCE MAP REDUCE MAP REDUCE MAP MAP REDUCE SHUFFLE SHUFFLE MAP /SORT MAP /SORT REDUCE SHUFFLE /SORT INPUT 2 28 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
  • 29. Oracle Loader for Hadoop: Online Option Read target table metadata Perform ORACLE LOADER FOR HADOOP Connect to the database from the database partitioning, sorting, and from reducer nodes, load data conversion into database partitions in parallel MAP REDUCE MAP REDUCE SHUFFLE MAP /SORT MAP REDUCE MAP REDUCE SHUFFLE MAP REDUCE /SORT 29 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
  • 30. Oracle Loader for Hadoop: Offline Option Read target table metadata Perform ORACLE LOADER FOR HADOOP Write from reducer nodes to from the database partitioning, sorting, and Oracle Data Pump files data conversion MAP Import into the database in REDUCE parallel using external table MAP mechanism REDUCE SHUFFLE MAP /SORT MAP REDUCE MAP REDUCE SHUFFLE MAP REDUCE /SORT 30 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
  • 31. Oracle Loader for Hadoop Advantages • Offload database server processing to Hadoop: – Convert input data to final database format – Compute table partition for row – Sort rows by primary key within a table partition • Generate binary datapump files • Balance partition groups across reducers 31 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
  • 32. Input and Output Formats Input Formats Output Formats Online Mode • Delimited text • Load directly from Hadoop nodes to Oracle database • Hive tables – JDBC – Managed and external tables – Parallel direct path – Native and non-native tables Offline Mode • Write your own input format • Datapump format – Create binary files for external tables – Import data into the database from the external table with a SQL statement • CSV, delimited text – Load through SQL*Loader or external table mechanism 32 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
  • 33. Selection Output Option for Use Case Oracle Loader for Hadoop Use Case Characteristics Output Option Online load with JDBC The simplest use case for non partitioned tables Online load with Direct Path Fast online load for partitioned tables Offline load with datapump files Fastest load method for external tables On Oracle Big Data Appliance Leave data on HDFS Direct HDFS Parallel access from database Import into database when needed 33 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
  • 34. Invoking Oracle Loader for Hadoop • Command line $ hadoop jar oraloader.jar oracle.hadoop.loader.OraLoader -libjars <library jar files> -D <configuration properties> $HADOOP_HOME/bin/hadoop jar oraloader.jar oracle.hadoop.loader.oraLoader -libjars avro-1.4.1.jar, commons-math-2.2.jar -conf connection.xml -D mapreduce.inputformat.class=oracle.hadoop.loader.lib.input.DelimitedTextInputFormat -D mapreduce.outputformat.class=oracle.hadoop.loader.lib.output.JDBCOutputFormat 34 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
  • 35. Automate Usage of Oracle Loader for Hadoop Oracle Data Integrator (ODI) • ODI has knowledge modules to – Generate data transformation code to run on Hive/Hadoop – Invoke Oracle Loader for Hadoop • Use the drag-and-drop interface in ODI to – Include invocation of Oracle Loader for Hadoop in any ODI packaged flow 36 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
  • 36. 37 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
  • 38. Big Data Appliance Big Data for the Enterprise • Optimized and Complete • Everything you need to store and integrate your lower information density data • Integrated with Oracle Exadata • Analyze all your data • Easy to Deploy • Risk Free, Quick Installation and Setup • Single Vendor Support • Full Oracle support for the entire system and software set
  • 39. Big Data Appliance and Exadata Big Data for the Enterprise NoSQL DB  HDFS  Hadoop  RDBMS 

Notas do Editor

  1. Changed Count to Volume =&gt;
  2. Is Developer Centric the right word? Should we hyphenate, or put comma’s
  3. Benefits for Online Mode: No need to write to disk after Hadoop job Simpler management for use cases with lots of nodes generating output filesBenefits for Offline Mode (DP Files): Import operation can be parallelized in the database Fastest option for external tables
  4. Direct HDFS:Access data on HDFS through the external table mechanismBenefitsData on HDFS can be queried from the databaseImport into the database as needed