SlideShare uma empresa Scribd logo
1 de 28
Baixar para ler offline
Slidedeck Mehr als Reporting - Datenanalysen mit Oracle R Enterprise - DOAG Development and DOAG SIG BigData 2014
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Mehr als Reporting –
Datenanalysen mit Oracle R Enterprise
Dr. Nadine Schöne
Sales Consultant
Oracle Direct, Sales Consulting
Dr. Michael Haupt
Principal Member of Technical Staff
Oracle Labs, Virtual Machine Research Group
25. September 2014
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Safe Harbor Statement
The following is intended to outline our general product direction. It is intended for
information purposes only, and may not be incorporated into any contract. It is not a
commitment to deliver any material, code, or functionality, and should not be relied upon
in making purchasing decisions. The development, release, and timing of any features or
functionality described for Oracle’s products remains at the sole discretion of Oracle.
3
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Agenda
Mehr als Standard Reporting?
Weiterführende Datenanalysen
R und Oracle R Enterprise (ORE)
Demo
Benefits
Ausblick: Mehr Performance für R
1
2
3
4
5
4
6
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Mehr als Standard Reporting?
5
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Reporting
6
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Weiterführende Datenanalysen
7
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | 8
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Sensordaten-Analyse I
9
200.000 Haushalte
3 Jahre
1 Messung/Stunde
5.256 Mrd. Messwerte
(2.628 Messwerte/Kunde)
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Sensordaten-Analyse II
10
10 s/Modell
200.000 Haushalte
➔
200.000 Modelle
23 Tage + 4 Stunden 4,3 Stunden
Oracle R
Enterprise
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
R Screenshots
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Oracle Advanced Analytics
• Data Understanding & Visualization
– Summary & Descriptive Statistics
– Histograms, scatter plots, box plots, bar charts
– R graphics: 3-D plots, link plots, special R graph types
– Cross tabulations
– Tests for Correlations (t-test, Pearson’s, ANOVA)
– Selected Base SAS equivalents
• Data Selection, Preparation and Transformations
– Joins, Tables, Views, Data Selection, Data Filter, SQL time windows, Multiple
schemas
– Sampling techniques
– Re-coding, Missing values
– Aggregations
– Spatial data
– R to SQL transparency and push down
• Classification Models
– Logistic Regression (GLM)
– Naive Bayes
– Decision Trees
– Support Vector Machines (SVM)
– Neural Networks (NNs)
• Regression Models
– Multiple Regression (GLM)
– Support Vector Machines
Große Bandbreite an In-Database Data Mining und statistischen Funktionen
 Clustering
– Hierarchical K-means
– Orthogonal Partitioning
– Expectation Maximization
 Anomaly Detection
– Special case Support Vector Machine (1-Class SVM)
 Associations / Market Basket Analysis
– A Priori algorithm
 Feature Selection and Reduction
– Attribute Importance (Minimum Description Length)
– Principal Components Analysis (PCA)
– Non-negative Matrix Factorization
– Singular Vector Decomposition
 Text Mining
– Most OAA algorithms support unstructured data (i.e. customer
comments, email, abstracts, etc.)
 Transactional Data
– Most OAA algorithms support transactional data (i.e. purchase
transactions, repeated measures over time)
 R packages—ability to run open source
– Broad range of R CRAN packages can be run as part of database
process via R to SQL transparency and/or via Embedded R mode
* included in every Oracle Database
Deskriptive Datenanalyse & Visualization
Klassifikations- & Regressions Modelle
Clustering
Verwendung von
Open Source R packages
Daten Aufbereitung & Transformationen
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Wichtige Themen für Enterprise Data Analytics
1. Skalierbarkeit
2. Performance
3. Entwicklung &
Produktion
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
R und Oracle R Enterprise (ORE)
14
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Aspekte herkömmlicher R/Datenbank-Interaktion
15
R logo © R Foundation, vonhttp://www.r-project.org
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
R Engine andere
R-Packages
Oracle R Enterprise Packages
User R Engine (Dektop)1
User-Tabellen
Oracle DBSQL
Ergebnisse
Datenbank Compute Engine2
R Engine andere
R-Packages
Oracle R Enterprise Packages
R Engine(s) verwaltet durch Oracle DB
R
Ergebnisse
3
Post-Processing
der Ergebnisse
Analysen, die in der Oracle
DB nicht verfügbar sind
Ausführung in Collaboration
mit der Oracle DB
„Collaborative Execution“-Modell
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Oracles R Technologien
•Oracle R Distribution
•ROracle
•Oracle R Enterprise
•Oracle R Advanced Analytics for Hadoop
Für R Comunity frei verfügbar
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Demo
18
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Benefits
19
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Benefits I
5.881 R-Packages
20
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Benefits II
21
Integration
Performance & Scalability
Performante Enterprise
Predictive Analytics Applikationen
Geringe Total Costs of Ownership
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Ausblick: Mehr Performance für R
22
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
FastR
• Neuimplementierung von R in Java
– Verwendung von Graal (Compiler) und Truffle (AST-Interpreter)
– Dynamische Compilierung, Skalierung auf heterogenen Architekturen
– Beteiligt: Oracle Labs (Deutschland, USA, Österreich),
JKU Linz,
Purdue University,
TU Dortmund
23
U
U U
U
U I
I I
G
G I
I I
G
G
Node Rewriting
for Profiling Feedback
AST Interpreter
Rewritten Nodes
AST Interpreter
Uninitialized Nodes
Compilation using
Partial Evaluation
Compiled Code
Node Transitions
S
U
I
D
G
Uninitialized Integer
Generic
DoubleString
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
“R is a powerful and interesting tool for
data analysis! ORE brings R into a
scalable DB engine (solving problems
of data management, analysis and
scalability). We actually can obtain
information and added value from not
so actively used data.”
– Stefano Alberto Russo, Researcher at CERN Openlab
24
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Weitere Informationen
25
ORE-Diskussionsforum:
https://community.oracle.com/community/developer/english/business_intelligence/data_warehousing/r
Oracle Advanced Analytics:
http://www.oracle.com/technetwork/database/options/advanced-analytics/index.html
ORE-Blog:
https://blogs.oracle.com/R/
FastR:
https://bitbucket.org/allR/fastR
Graal/Truffle:
https://wiki.openjdk.java.net/display/Graal/Main
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |
Kontakt
Dr. Nadine Schöne| Sales Consultant
Email: nadine.schoene@oracle.com
Tel: +49 331 200 7190
ORACLE Deutschland B.V. & Co. KG
Schiffbauergasse 14
14467 Potsdam
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | 27
Slidedeck Mehr als Reporting - Datenanalysen mit Oracle R Enterprise - DOAG Development and DOAG SIG BigData 2014

Mais conteúdo relacionado

Mais procurados

Oracle Spatial Studio: Fast and Easy Spatial Analytics and Maps
Oracle Spatial Studio:  Fast and Easy Spatial Analytics and MapsOracle Spatial Studio:  Fast and Easy Spatial Analytics and Maps
Oracle Spatial Studio: Fast and Easy Spatial Analytics and MapsJean Ihm
 
Build Knowledge Graphs with Oracle RDF to Extract More Value from Your Data
Build Knowledge Graphs with Oracle RDF to Extract More Value from Your DataBuild Knowledge Graphs with Oracle RDF to Extract More Value from Your Data
Build Knowledge Graphs with Oracle RDF to Extract More Value from Your DataJean Ihm
 
Powerful Spatial Features You Never Knew Existed in Oracle Spatial and Graph ...
Powerful Spatial Features You Never Knew Existed in Oracle Spatial and Graph ...Powerful Spatial Features You Never Knew Existed in Oracle Spatial and Graph ...
Powerful Spatial Features You Never Knew Existed in Oracle Spatial and Graph ...Jean Ihm
 
Pivotal Data Labs - Technology and Tools in our Data Scientist's Arsenal
Pivotal Data Labs - Technology and Tools in our Data Scientist's Arsenal Pivotal Data Labs - Technology and Tools in our Data Scientist's Arsenal
Pivotal Data Labs - Technology and Tools in our Data Scientist's Arsenal Srivatsan Ramanujam
 
8th TUC Meeting - Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
8th TUC Meeting -  Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...8th TUC Meeting -  Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
8th TUC Meeting - Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...LDBC council
 
How To Visualize Graphs
How To Visualize GraphsHow To Visualize Graphs
How To Visualize GraphsJean Ihm
 
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataOrca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataEMC
 
Basics of Digital Design and Verilog
Basics of Digital Design and VerilogBasics of Digital Design and Verilog
Basics of Digital Design and VerilogGanesan Narayanasamy
 
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...Jean Ihm
 
Apache HAWQ and Apache MADlib: Journey to Apache
Apache HAWQ and Apache MADlib: Journey to ApacheApache HAWQ and Apache MADlib: Journey to Apache
Apache HAWQ and Apache MADlib: Journey to ApachePivotalOpenSourceHub
 
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics EngineLDBC council
 
Intro to R for SAS and SPSS User Webinar
Intro to R for SAS and SPSS User WebinarIntro to R for SAS and SPSS User Webinar
Intro to R for SAS and SPSS User WebinarRevolution Analytics
 
Sangam 18 - The New Optimizer in Oracle 12c
Sangam 18 - The New Optimizer in Oracle 12cSangam 18 - The New Optimizer in Oracle 12c
Sangam 18 - The New Optimizer in Oracle 12cConnor McDonald
 
Introduction to Property Graph Features (AskTOM Office Hours part 1)
Introduction to Property Graph Features (AskTOM Office Hours part 1) Introduction to Property Graph Features (AskTOM Office Hours part 1)
Introduction to Property Graph Features (AskTOM Office Hours part 1) Jean Ihm
 
The network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 finalThe network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 finalRevolution Analytics
 
An Introduction to Spark with Scala
An Introduction to Spark with ScalaAn Introduction to Spark with Scala
An Introduction to Spark with ScalaChetan Khatri
 
Building Custom Advanced Analytics Applications with SAP HANA
Building Custom Advanced Analytics Applications with SAP HANABuilding Custom Advanced Analytics Applications with SAP HANA
Building Custom Advanced Analytics Applications with SAP HANASAP Technology
 
Apache Hadoop Crash Course
Apache Hadoop Crash CourseApache Hadoop Crash Course
Apache Hadoop Crash CourseDataWorks Summit
 

Mais procurados (18)

Oracle Spatial Studio: Fast and Easy Spatial Analytics and Maps
Oracle Spatial Studio:  Fast and Easy Spatial Analytics and MapsOracle Spatial Studio:  Fast and Easy Spatial Analytics and Maps
Oracle Spatial Studio: Fast and Easy Spatial Analytics and Maps
 
Build Knowledge Graphs with Oracle RDF to Extract More Value from Your Data
Build Knowledge Graphs with Oracle RDF to Extract More Value from Your DataBuild Knowledge Graphs with Oracle RDF to Extract More Value from Your Data
Build Knowledge Graphs with Oracle RDF to Extract More Value from Your Data
 
Powerful Spatial Features You Never Knew Existed in Oracle Spatial and Graph ...
Powerful Spatial Features You Never Knew Existed in Oracle Spatial and Graph ...Powerful Spatial Features You Never Knew Existed in Oracle Spatial and Graph ...
Powerful Spatial Features You Never Knew Existed in Oracle Spatial and Graph ...
 
Pivotal Data Labs - Technology and Tools in our Data Scientist's Arsenal
Pivotal Data Labs - Technology and Tools in our Data Scientist's Arsenal Pivotal Data Labs - Technology and Tools in our Data Scientist's Arsenal
Pivotal Data Labs - Technology and Tools in our Data Scientist's Arsenal
 
8th TUC Meeting - Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
8th TUC Meeting -  Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...8th TUC Meeting -  Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
8th TUC Meeting - Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
 
How To Visualize Graphs
How To Visualize GraphsHow To Visualize Graphs
How To Visualize Graphs
 
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataOrca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big Data
 
Basics of Digital Design and Verilog
Basics of Digital Design and VerilogBasics of Digital Design and Verilog
Basics of Digital Design and Verilog
 
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
 
Apache HAWQ and Apache MADlib: Journey to Apache
Apache HAWQ and Apache MADlib: Journey to ApacheApache HAWQ and Apache MADlib: Journey to Apache
Apache HAWQ and Apache MADlib: Journey to Apache
 
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine
 
Intro to R for SAS and SPSS User Webinar
Intro to R for SAS and SPSS User WebinarIntro to R for SAS and SPSS User Webinar
Intro to R for SAS and SPSS User Webinar
 
Sangam 18 - The New Optimizer in Oracle 12c
Sangam 18 - The New Optimizer in Oracle 12cSangam 18 - The New Optimizer in Oracle 12c
Sangam 18 - The New Optimizer in Oracle 12c
 
Introduction to Property Graph Features (AskTOM Office Hours part 1)
Introduction to Property Graph Features (AskTOM Office Hours part 1) Introduction to Property Graph Features (AskTOM Office Hours part 1)
Introduction to Property Graph Features (AskTOM Office Hours part 1)
 
The network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 finalThe network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 final
 
An Introduction to Spark with Scala
An Introduction to Spark with ScalaAn Introduction to Spark with Scala
An Introduction to Spark with Scala
 
Building Custom Advanced Analytics Applications with SAP HANA
Building Custom Advanced Analytics Applications with SAP HANABuilding Custom Advanced Analytics Applications with SAP HANA
Building Custom Advanced Analytics Applications with SAP HANA
 
Apache Hadoop Crash Course
Apache Hadoop Crash CourseApache Hadoop Crash Course
Apache Hadoop Crash Course
 

Destaque (20)

Samplels & Sampling Techniques
Samplels & Sampling TechniquesSamplels & Sampling Techniques
Samplels & Sampling Techniques
 
Common sampling techniques
Common sampling techniquesCommon sampling techniques
Common sampling techniques
 
Sampling
SamplingSampling
Sampling
 
Chapter 8-SAMPLE & SAMPLING TECHNIQUES
Chapter 8-SAMPLE & SAMPLING TECHNIQUESChapter 8-SAMPLE & SAMPLING TECHNIQUES
Chapter 8-SAMPLE & SAMPLING TECHNIQUES
 
Lo Hei Lunch
Lo Hei LunchLo Hei Lunch
Lo Hei Lunch
 
Top 10 Most Streamed Songs Of All Time
Top 10 Most Streamed Songs Of All TimeTop 10 Most Streamed Songs Of All Time
Top 10 Most Streamed Songs Of All Time
 
La Peninsula Iberica entre s.VIII y XI
La Peninsula Iberica entre s.VIII y XILa Peninsula Iberica entre s.VIII y XI
La Peninsula Iberica entre s.VIII y XI
 
Etica y moral (haiderma jose)
Etica y moral (haiderma jose)Etica y moral (haiderma jose)
Etica y moral (haiderma jose)
 
Plagio certificacion
Plagio certificacionPlagio certificacion
Plagio certificacion
 
Apps für Technische Redakteure
Apps für Technische RedakteureApps für Technische Redakteure
Apps für Technische Redakteure
 
deber
deberdeber
deber
 
El proceso de Conversión - Escuela Virtual RCC Colombia - mar 14 de 2013
El proceso de Conversión - Escuela Virtual RCC Colombia - mar 14 de 2013El proceso de Conversión - Escuela Virtual RCC Colombia - mar 14 de 2013
El proceso de Conversión - Escuela Virtual RCC Colombia - mar 14 de 2013
 
Trabajo
TrabajoTrabajo
Trabajo
 
Quinto de-primaria
Quinto de-primariaQuinto de-primaria
Quinto de-primaria
 
Testbericht zum Jura impressa z9
Testbericht zum Jura impressa z9Testbericht zum Jura impressa z9
Testbericht zum Jura impressa z9
 
Flip Chart MonaVie Mexico
Flip Chart MonaVie MexicoFlip Chart MonaVie Mexico
Flip Chart MonaVie Mexico
 
Nuevo presentación de microsoft power point
Nuevo presentación de microsoft power pointNuevo presentación de microsoft power point
Nuevo presentación de microsoft power point
 
Actividad 1.1
Actividad 1.1Actividad 1.1
Actividad 1.1
 
Web2.0 junior medina
Web2.0 junior medinaWeb2.0 junior medina
Web2.0 junior medina
 
Ejercicios 2
Ejercicios 2Ejercicios 2
Ejercicios 2
 

Semelhante a Slidedeck Mehr als Reporting - Datenanalysen mit Oracle R Enterprise - DOAG Development and DOAG SIG BigData 2014

Slidedeck Datenanalysen auf Enterprise-Niveau mit Oracle R Enterprise - DOAG2014
Slidedeck Datenanalysen auf Enterprise-Niveau mit Oracle R Enterprise - DOAG2014Slidedeck Datenanalysen auf Enterprise-Niveau mit Oracle R Enterprise - DOAG2014
Slidedeck Datenanalysen auf Enterprise-Niveau mit Oracle R Enterprise - DOAG2014Nadine Schoene
 
Slidedeck Datenanalyse mit Oracle R Enterprise for Beginners - DOAG2015
Slidedeck Datenanalyse mit Oracle R Enterprise for Beginners - DOAG2015Slidedeck Datenanalyse mit Oracle R Enterprise for Beginners - DOAG2015
Slidedeck Datenanalyse mit Oracle R Enterprise for Beginners - DOAG2015Nadine Schoene
 
Slidedeck Datenanalysen auf Speed - Oracle R Enterprise (ORE) Demo - DOAG Big...
Slidedeck Datenanalysen auf Speed - Oracle R Enterprise (ORE) Demo - DOAG Big...Slidedeck Datenanalysen auf Speed - Oracle R Enterprise (ORE) Demo - DOAG Big...
Slidedeck Datenanalysen auf Speed - Oracle R Enterprise (ORE) Demo - DOAG Big...Nadine Schoene
 
Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)
Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)
Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)Jeffrey T. Pollock
 
A practical introduction to Oracle NoSQL Database - OOW2014
A practical introduction to Oracle NoSQL Database - OOW2014A practical introduction to Oracle NoSQL Database - OOW2014
A practical introduction to Oracle NoSQL Database - OOW2014Anuj Sahni
 
Oracle super cluster for oracle e business suite
Oracle super cluster for oracle e business suiteOracle super cluster for oracle e business suite
Oracle super cluster for oracle e business suiteOTN Systems Hub
 
Tapping into the Big Data Reservoir (CON7934)
Tapping into the Big Data Reservoir (CON7934)Tapping into the Big Data Reservoir (CON7934)
Tapping into the Big Data Reservoir (CON7934)Jeffrey T. Pollock
 
Introducing new AIOps innovations in Oracle 19c - San Jose AICUG
Introducing new AIOps innovations in Oracle 19c - San Jose AICUGIntroducing new AIOps innovations in Oracle 19c - San Jose AICUG
Introducing new AIOps innovations in Oracle 19c - San Jose AICUGSandesh Rao
 
Oracle Warehouse Builder to Oracle Data Integrator 12c Migration Utility
Oracle Warehouse Builder to Oracle Data Integrator 12c Migration UtilityOracle Warehouse Builder to Oracle Data Integrator 12c Migration Utility
Oracle Warehouse Builder to Oracle Data Integrator 12c Migration UtilityNoel Sidebotham
 
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - AI and Machine Learning
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - AI and Machine LearningAUSOUG - NZOUG-GroundBreakers-Jun 2019 - AI and Machine Learning
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - AI and Machine LearningSandesh Rao
 
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...jdijcks
 
All of the Performance Tuning Features in Oracle SQL Developer
All of the Performance Tuning Features in Oracle SQL DeveloperAll of the Performance Tuning Features in Oracle SQL Developer
All of the Performance Tuning Features in Oracle SQL DeveloperJeff Smith
 
Graal and Truffle: Modularity and Separation of Concerns as Cornerstones for ...
Graal and Truffle: Modularity and Separation of Concerns as Cornerstones for ...Graal and Truffle: Modularity and Separation of Concerns as Cornerstones for ...
Graal and Truffle: Modularity and Separation of Concerns as Cornerstones for ...Thomas Wuerthinger
 
Tame Big Data with Oracle Data Integration
Tame Big Data with Oracle Data IntegrationTame Big Data with Oracle Data Integration
Tame Big Data with Oracle Data IntegrationMichael Rainey
 
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...DataWorks Summit
 
Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster Fran Navarro
 
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data avanttic Consultoría Tecnológica
 
Nida event oracle business analytics 1 sep2016
Nida event   oracle business analytics 1 sep2016Nida event   oracle business analytics 1 sep2016
Nida event oracle business analytics 1 sep2016BAINIDA
 
Reducing the Risks of Migrating Off Oracle
Reducing the Risks of Migrating Off OracleReducing the Risks of Migrating Off Oracle
Reducing the Risks of Migrating Off OracleEDB
 

Semelhante a Slidedeck Mehr als Reporting - Datenanalysen mit Oracle R Enterprise - DOAG Development and DOAG SIG BigData 2014 (20)

Slidedeck Datenanalysen auf Enterprise-Niveau mit Oracle R Enterprise - DOAG2014
Slidedeck Datenanalysen auf Enterprise-Niveau mit Oracle R Enterprise - DOAG2014Slidedeck Datenanalysen auf Enterprise-Niveau mit Oracle R Enterprise - DOAG2014
Slidedeck Datenanalysen auf Enterprise-Niveau mit Oracle R Enterprise - DOAG2014
 
Slidedeck Datenanalyse mit Oracle R Enterprise for Beginners - DOAG2015
Slidedeck Datenanalyse mit Oracle R Enterprise for Beginners - DOAG2015Slidedeck Datenanalyse mit Oracle R Enterprise for Beginners - DOAG2015
Slidedeck Datenanalyse mit Oracle R Enterprise for Beginners - DOAG2015
 
Slidedeck Datenanalysen auf Speed - Oracle R Enterprise (ORE) Demo - DOAG Big...
Slidedeck Datenanalysen auf Speed - Oracle R Enterprise (ORE) Demo - DOAG Big...Slidedeck Datenanalysen auf Speed - Oracle R Enterprise (ORE) Demo - DOAG Big...
Slidedeck Datenanalysen auf Speed - Oracle R Enterprise (ORE) Demo - DOAG Big...
 
Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)
Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)
Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)
 
A practical introduction to Oracle NoSQL Database - OOW2014
A practical introduction to Oracle NoSQL Database - OOW2014A practical introduction to Oracle NoSQL Database - OOW2014
A practical introduction to Oracle NoSQL Database - OOW2014
 
Oracle super cluster for oracle e business suite
Oracle super cluster for oracle e business suiteOracle super cluster for oracle e business suite
Oracle super cluster for oracle e business suite
 
Tapping into the Big Data Reservoir (CON7934)
Tapping into the Big Data Reservoir (CON7934)Tapping into the Big Data Reservoir (CON7934)
Tapping into the Big Data Reservoir (CON7934)
 
Introducing new AIOps innovations in Oracle 19c - San Jose AICUG
Introducing new AIOps innovations in Oracle 19c - San Jose AICUGIntroducing new AIOps innovations in Oracle 19c - San Jose AICUG
Introducing new AIOps innovations in Oracle 19c - San Jose AICUG
 
Oracle Warehouse Builder to Oracle Data Integrator 12c Migration Utility
Oracle Warehouse Builder to Oracle Data Integrator 12c Migration UtilityOracle Warehouse Builder to Oracle Data Integrator 12c Migration Utility
Oracle Warehouse Builder to Oracle Data Integrator 12c Migration Utility
 
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - AI and Machine Learning
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - AI and Machine LearningAUSOUG - NZOUG-GroundBreakers-Jun 2019 - AI and Machine Learning
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - AI and Machine Learning
 
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
 
Anuragh Ravindran
Anuragh RavindranAnuragh Ravindran
Anuragh Ravindran
 
All of the Performance Tuning Features in Oracle SQL Developer
All of the Performance Tuning Features in Oracle SQL DeveloperAll of the Performance Tuning Features in Oracle SQL Developer
All of the Performance Tuning Features in Oracle SQL Developer
 
Graal and Truffle: Modularity and Separation of Concerns as Cornerstones for ...
Graal and Truffle: Modularity and Separation of Concerns as Cornerstones for ...Graal and Truffle: Modularity and Separation of Concerns as Cornerstones for ...
Graal and Truffle: Modularity and Separation of Concerns as Cornerstones for ...
 
Tame Big Data with Oracle Data Integration
Tame Big Data with Oracle Data IntegrationTame Big Data with Oracle Data Integration
Tame Big Data with Oracle Data Integration
 
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...
 
Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster
 
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
 
Nida event oracle business analytics 1 sep2016
Nida event   oracle business analytics 1 sep2016Nida event   oracle business analytics 1 sep2016
Nida event oracle business analytics 1 sep2016
 
Reducing the Risks of Migrating Off Oracle
Reducing the Risks of Migrating Off OracleReducing the Risks of Migrating Off Oracle
Reducing the Risks of Migrating Off Oracle
 

Último

SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024Becky Burwell
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationGiorgio Carbone
 
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Guido X Jansen
 
Virtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product IntroductionVirtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product Introductionsanjaymuralee1
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructuresonikadigital1
 
The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerPavel Šabatka
 
Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Vladislav Solodkiy
 
Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxVenkatasubramani13
 
AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)Data & Analytics Magazin
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptaigil2
 
Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...PrithaVashisht1
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityAggregage
 
How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?sonikadigital1
 
CI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionCI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionajayrajaganeshkayala
 
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxTINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxDwiAyuSitiHartinah
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best PracticesDataArchiva
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.JasonViviers2
 

Último (17)

SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - Presentation
 
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
 
Virtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product IntroductionVirtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product Introduction
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructure
 
The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayer
 
Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023
 
Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptx
 
AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .ppt
 
Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
 
How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?
 
CI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionCI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual intervention
 
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxTINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.
 

Slidedeck Mehr als Reporting - Datenanalysen mit Oracle R Enterprise - DOAG Development and DOAG SIG BigData 2014

  • 2. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Mehr als Reporting – Datenanalysen mit Oracle R Enterprise Dr. Nadine Schöne Sales Consultant Oracle Direct, Sales Consulting Dr. Michael Haupt Principal Member of Technical Staff Oracle Labs, Virtual Machine Research Group 25. September 2014
  • 3. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle. 3
  • 4. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Agenda Mehr als Standard Reporting? Weiterführende Datenanalysen R und Oracle R Enterprise (ORE) Demo Benefits Ausblick: Mehr Performance für R 1 2 3 4 5 4 6
  • 5. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Mehr als Standard Reporting? 5
  • 6. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Reporting 6
  • 7. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Weiterführende Datenanalysen 7
  • 8. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | 8
  • 9. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Sensordaten-Analyse I 9 200.000 Haushalte 3 Jahre 1 Messung/Stunde 5.256 Mrd. Messwerte (2.628 Messwerte/Kunde)
  • 10. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Sensordaten-Analyse II 10 10 s/Modell 200.000 Haushalte ➔ 200.000 Modelle 23 Tage + 4 Stunden 4,3 Stunden Oracle R Enterprise
  • 11. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | R Screenshots
  • 12. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Oracle Advanced Analytics • Data Understanding & Visualization – Summary & Descriptive Statistics – Histograms, scatter plots, box plots, bar charts – R graphics: 3-D plots, link plots, special R graph types – Cross tabulations – Tests for Correlations (t-test, Pearson’s, ANOVA) – Selected Base SAS equivalents • Data Selection, Preparation and Transformations – Joins, Tables, Views, Data Selection, Data Filter, SQL time windows, Multiple schemas – Sampling techniques – Re-coding, Missing values – Aggregations – Spatial data – R to SQL transparency and push down • Classification Models – Logistic Regression (GLM) – Naive Bayes – Decision Trees – Support Vector Machines (SVM) – Neural Networks (NNs) • Regression Models – Multiple Regression (GLM) – Support Vector Machines Große Bandbreite an In-Database Data Mining und statistischen Funktionen  Clustering – Hierarchical K-means – Orthogonal Partitioning – Expectation Maximization  Anomaly Detection – Special case Support Vector Machine (1-Class SVM)  Associations / Market Basket Analysis – A Priori algorithm  Feature Selection and Reduction – Attribute Importance (Minimum Description Length) – Principal Components Analysis (PCA) – Non-negative Matrix Factorization – Singular Vector Decomposition  Text Mining – Most OAA algorithms support unstructured data (i.e. customer comments, email, abstracts, etc.)  Transactional Data – Most OAA algorithms support transactional data (i.e. purchase transactions, repeated measures over time)  R packages—ability to run open source – Broad range of R CRAN packages can be run as part of database process via R to SQL transparency and/or via Embedded R mode * included in every Oracle Database Deskriptive Datenanalyse & Visualization Klassifikations- & Regressions Modelle Clustering Verwendung von Open Source R packages Daten Aufbereitung & Transformationen
  • 13. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Wichtige Themen für Enterprise Data Analytics 1. Skalierbarkeit 2. Performance 3. Entwicklung & Produktion
  • 14. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | R und Oracle R Enterprise (ORE) 14
  • 15. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Aspekte herkömmlicher R/Datenbank-Interaktion 15 R logo © R Foundation, vonhttp://www.r-project.org
  • 16. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | R Engine andere R-Packages Oracle R Enterprise Packages User R Engine (Dektop)1 User-Tabellen Oracle DBSQL Ergebnisse Datenbank Compute Engine2 R Engine andere R-Packages Oracle R Enterprise Packages R Engine(s) verwaltet durch Oracle DB R Ergebnisse 3 Post-Processing der Ergebnisse Analysen, die in der Oracle DB nicht verfügbar sind Ausführung in Collaboration mit der Oracle DB „Collaborative Execution“-Modell
  • 17. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Oracles R Technologien •Oracle R Distribution •ROracle •Oracle R Enterprise •Oracle R Advanced Analytics for Hadoop Für R Comunity frei verfügbar
  • 18. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Demo 18
  • 19. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Benefits 19
  • 20. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Benefits I 5.881 R-Packages 20
  • 21. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Benefits II 21 Integration Performance & Scalability Performante Enterprise Predictive Analytics Applikationen Geringe Total Costs of Ownership
  • 22. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Ausblick: Mehr Performance für R 22
  • 23. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | FastR • Neuimplementierung von R in Java – Verwendung von Graal (Compiler) und Truffle (AST-Interpreter) – Dynamische Compilierung, Skalierung auf heterogenen Architekturen – Beteiligt: Oracle Labs (Deutschland, USA, Österreich), JKU Linz, Purdue University, TU Dortmund 23 U U U U U I I I G G I I I G G Node Rewriting for Profiling Feedback AST Interpreter Rewritten Nodes AST Interpreter Uninitialized Nodes Compilation using Partial Evaluation Compiled Code Node Transitions S U I D G Uninitialized Integer Generic DoubleString
  • 24. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | “R is a powerful and interesting tool for data analysis! ORE brings R into a scalable DB engine (solving problems of data management, analysis and scalability). We actually can obtain information and added value from not so actively used data.” – Stefano Alberto Russo, Researcher at CERN Openlab 24
  • 25. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Weitere Informationen 25 ORE-Diskussionsforum: https://community.oracle.com/community/developer/english/business_intelligence/data_warehousing/r Oracle Advanced Analytics: http://www.oracle.com/technetwork/database/options/advanced-analytics/index.html ORE-Blog: https://blogs.oracle.com/R/ FastR: https://bitbucket.org/allR/fastR Graal/Truffle: https://wiki.openjdk.java.net/display/Graal/Main
  • 26. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Kontakt Dr. Nadine Schöne| Sales Consultant Email: nadine.schoene@oracle.com Tel: +49 331 200 7190 ORACLE Deutschland B.V. & Co. KG Schiffbauergasse 14 14467 Potsdam
  • 27. Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | 27