Revolution R Enterprise: 100% R and More (14 Mar 2013)
1. Revolution Confidential
Revolution R:
100% R and More
Presented by:
David Smith @revodavid
VP Marketing and Community
Revolution Analytics
3. March 13, 2013: Welcome!
Thanks for coming.
Slides and replay available (soon) at:
http://bit.ly/YbfQo1
David Smith
VP Marketing & Community, Revolution Analytics
Editor, Revolutions blog
http://blog.revolutionanalytics.com
Twitter: @revodavid
4. In today’s webcast:
About Revolution Analytics and R
How Revolution R Enterprise enhances R
Resources for getting more from R
Q&A
5. Revolution Analytics is the leading commercial provider of software and support for the open-source R statistical computing language
Revolution R Enterprise is:
Enterprise-ready
Multi-platform
Scalable from desktop to big data
Delivers high performance analytics
Easier to build and deploy analytic applications
6. What is R?
Download the white paper "R is Hot": bit.ly/r-is-hot
Data analysis software
A powerful programming language
Development platform designed by and for statisticians
A complete environment
Huge library of algorithms for data access, data manipulation, analysis and graphics
An open-source software project
Free, open, and active
A vibrant community
Thousands of contributors, 2 million users
Resources and help in every domain
7. R is exploding in popularity and functionality
“I’ve been astonished by the rate at which R has been adopted. Four years ago, everyone in my economics department [at the University of Chicago] was using Stata; now, as far as I can tell, R is the standard tool, and students learn it first.”
Deputy Editor for New Products at Forbes
“A key benefit of R is that it provides near-instant availability of new and experimental methods created by its user base - without waiting for the development/release cycle of commercial software. SAS recognizes the value of R to our customer base…”
Product Marketing Manager, SAS Institute, Inc.
Source: http://r4stats.com/popularity; “Why R is a name to know in 2011”, Forbes; number of packages is now 4,250
8. A Vibrant R User Community
More: The R Ecosystem, bit.ly/R-ecosystem
Local R User Groups (93)
Groups (102)
9. Revolution Analytics Scales R to the Enterprise
Revolution R Enterprise
Power: Distributed high performance analytics
Productivity: Build & deploy analytics applications easily
Enterprise Readiness: Enterprise landscape; full-service customer support, consulting and training
10. Revolution R Enterprise: High Performance, Multi-Platform Analytics Platform
Revolution R Enterprise
DevelopR: Integrated Development Environment
DeployR: Web Services
ScaleR: High Performance Big Data Analytics
RevoR: Performance Enhanced Open Source R
Open Source R packages
PlatformR: Parallel Distributed Computing (IBM/Netezza, IBM/Platform LSF, MS HPC Server, MS Azure Burst)
ConnectR: High Speed Connectors (HDFS, HBase, ODBC, SAS)
11. Revolution R Enterprise:
Performance Enhancements
Greater Productivity & Ease of Use
Tackle “Big Data”
IT-Friendly Enterprise Deployment
On-Call Experts
[Diagram: Open Source R surrounded by Performance, Productivity, Big Data Analysis, Enterprise Deployment, Technical Support, Training & Consulting]
13. The standard R interface
14. DevelopR Integrated Development Environment
Script with type ahead and code snippets
Solutions window for organizing code and data
Sophisticated debugging with breakpoints, variable values etc.
Objects loaded in the R Environment
Packages installed and loaded
Object details
http://www.revolutionanalytics.com/demos/revolution-productivity-environment/demo.htm
16. Performance: Multi-threaded Math
Computation (4-core laptop)        Open Source R   Revolution R   Speedup
Linear Algebra (1)
Matrix Multiply                    176 sec         9.3 sec        18x
Cholesky Factorization             25.5 sec        1.3 sec        19x
Linear Discriminant Analysis       189 sec         74 sec         3x
General R Benchmarks (2)
R Benchmarks (Matrix Functions)    22 sec          3.5 sec        5x
R Benchmarks (Program Control)     5.6 sec         5.4 sec        Not appreciable
(1) http://www.revolutionanalytics.com/why-revolution-r/benchmarks.php
(2) http://r.research.att.com/benchmarks/
18. RevoScaleR brings the power of Big Data to R
[Diagram: R Language Interface, Distributed Statistical Algorithms, Communications Framework, Data Source API]
Parallel External Memory Algorithms exploit available compute resources (cores & computers) independent of platform.
Abstracted communications layer provides portability of code between platforms: server, cluster, or in-database.
Use the high-speed local data mart (XDF), or stream data from SAS, ODBC, HDFS or other remote data sources.
Familiar, high-productivity programming environment for R users.
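To make the idea of a parallel external memory algorithm concrete, here is a minimal Python sketch. It is not ScaleR code and does not use its API. Data is read one chunk at a time, each chunk is reduced to a small summary, and the summaries are merged, so the full data set never has to fit in memory. The names `Moments`, `summarize_chunk`, `combine`, `read_chunks` and `chunked_moments` are hypothetical, and an in-memory list stands in for an on-disk source such as an XDF file.

```python
from dataclasses import dataclass
from typing import Iterator, List


@dataclass
class Moments:
    """Small, mergeable summary of a block of numbers."""
    n: int = 0
    mean: float = 0.0
    m2: float = 0.0  # sum of squared deviations from the mean

    def variance(self) -> float:
        # Sample variance; undefined for fewer than two observations.
        return self.m2 / (self.n - 1) if self.n > 1 else float("nan")


def summarize_chunk(chunk: List[float]) -> Moments:
    """Reduce one chunk to its count, mean and sum of squared deviations."""
    n = len(chunk)
    if n == 0:
        return Moments()
    mean = sum(chunk) / n
    m2 = sum((x - mean) ** 2 for x in chunk)
    return Moments(n, mean, m2)


def combine(a: Moments, b: Moments) -> Moments:
    """Merge two summaries without revisiting the underlying data."""
    if a.n == 0:
        return b
    if b.n == 0:
        return a
    n = a.n + b.n
    delta = b.mean - a.mean
    mean = a.mean + delta * b.n / n
    m2 = a.m2 + b.m2 + delta * delta * a.n * b.n / n
    return Moments(n, mean, m2)


def read_chunks(values: List[float], chunk_size: int) -> Iterator[List[float]]:
    """Stand-in for reading fixed-size blocks from disk or a remote source."""
    for start in range(0, len(values), chunk_size):
        yield values[start:start + chunk_size]


def chunked_moments(values: List[float], chunk_size: int) -> Moments:
    """Process the data chunk by chunk, keeping only a running summary."""
    total = Moments()
    for chunk in read_chunks(values, chunk_size):
        total = combine(total, summarize_chunk(chunk))
    return total


stats = chunked_moments([float(i) for i in range(1, 11)], chunk_size=3)
print(stats.n, stats.mean, stats.variance())
```

Because `combine` gives the same result however the chunks are grouped (up to floating-point rounding), the per-chunk summaries could also be computed on separate cores or machines and merged afterwards. That is the sense in which such algorithms can use whatever compute resources are available.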
19. ScaleR Addresses Performance and Capacity Limitations of Open Source R
20. High Performance Big Data Analytics with ScaleR
R Data Step
Descriptive Statistics
Statistical Tests
Sampling
Predictive Models
Data Visualization
Machine Learning
Simulation
21. Revolution R Enterprise ScaleR: High Performance Big Data Analytics
Data Prep, Distillation & Descriptive Analytics
R Data Step: Data import (Delimited, Fixed, SAS, SPSS, ODBC); Variable creation & transformation; Recode variables; Factor variables; Missing value handling; Sort; Merge; Split; Aggregate by category (means, sums)
Descriptive Statistics: Min / Max; Mean; Median (approx.); Quantiles (approx.); Standard Deviation; Variance; Correlation; Covariance; Sum of Squares (cross product matrix for set variables); Pairwise Cross tabs; Risk Ratio & Odds Ratio; Cross-Tabulation of Data (standard tables & long form); Marginal Summaries of Cross Tabulations
Statistical Tests: Chi Square Test; Kendall Rank Correlation; Fisher’s Exact Test; Student’s t-Test
Sampling: Subsample (observations & variables); Random Sampling
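The "(approx.)" next to Median and Quantiles is worth a short illustration. An exact quantile requires ordering all the data, which is awkward when the data is processed in chunks. One common workaround is to bin the values into a fixed-width histogram and interpolate inside the bin that contains the target rank. The slides do not say how ScaleR computes these statistics, so the Python sketch below shows only the general technique. The names `approx_quantile` and `make_chunks` are hypothetical.

```python
from typing import Callable, Iterable, List


def approx_quantile(
    make_chunks: Callable[[], Iterable[List[float]]],
    q: float,
    bins: int = 1000,
) -> float:
    """Approximate the q-th quantile with two passes over chunked data."""
    if not 0.0 <= q <= 1.0:
        raise ValueError("q must be between 0 and 1")

    # Pass 1: find the range and the number of observations.
    lo, hi, n = float("inf"), float("-inf"), 0
    for chunk in make_chunks():
        for x in chunk:
            lo = min(lo, x)
            hi = max(hi, x)
            n += 1
    if n == 0:
        raise ValueError("no data")
    if lo == hi:
        return lo

    # Pass 2: count observations per fixed-width bin.
    width = (hi - lo) / bins
    counts = [0] * bins
    for chunk in make_chunks():
        for x in chunk:
            index = min(int((x - lo) / width), bins - 1)
            counts[index] += 1

    # Walk the cumulative counts and interpolate within the target bin.
    target = q * n
    cumulative = 0
    for i, count in enumerate(counts):
        if count > 0 and cumulative + count >= target:
            fraction = (target - cumulative) / count
            return lo + (i + fraction) * width
        cumulative += count
    return hi


data = [float(i) for i in range(1001)]


def make_chunks() -> Iterable[List[float]]:
    # Stand-in for re-reading the data source in blocks of 100 rows.
    return (data[i:i + 100] for i in range(0, len(data), 100))


median = approx_quantile(make_chunks, 0.5, bins=100)
print(median)
```

Memory use depends on the number of bins rather than the number of rows. The result lies in the same bin as the exact order statistic, so the error is bounded by one bin width.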
22. Revolution R Enterprise ScaleR: High Performance Big Data Analytics
Statistical Modeling
Predictive Models: Sum of Squares (cross product matrix for set variables); Multiple Linear Regression; Generalized Linear Models (GLM), with all exponential family distributions (binomial, Gaussian, inverse Gaussian, Poisson, Tweedie), standard link functions (including cauchit, identity, log, logit, probit) and user defined distributions & link functions; Covariance & Correlation Matrices; Logistic Regression; Classification & Regression Trees; Predictions/scoring for models; Residuals for all models
Data Visualization: Histogram; Line Plot; Scatter Plot; Lorenz Curve; ROC Curves (actual data and predicted values)
Machine Learning
Cluster Analysis: K-Means
Classification: Decision Trees
Simulation: Monte Carlo
24. Create custom, on-demand analytics applications
Some examples:
On-demand sales forecasting
Real-time social media sentiment analysis
Leveraging the power of R from Microsoft tools
25. Revolution R Enterprise DeployR integrates R with applications
[Diagram: R / Statistical Modeling Expert, DeployR, Deployment Expert; targets: Data Analysis, Business Intelligence, Mobile Web Apps, Cloud / SaaS]
Seamless: Bring the power of R to any web enabled application
Simple: Leverage common APIs including JS, Java, .NET
Scalable: Robustly scale user and compute workloads
Secure: Manage enterprise security with LDAP & SSO
26. Revolution R Enterprise Architecture
Use a connected MPP server or cluster for:
Data exploration
On-demand R applications
Big-data predictive models
Offline (batch) operations
Code generation for real-time deployment
27. ConnectR for Hadoop: Stream data from Hadoop to Revolution R Enterprise
28. On-Call Technical Support
Consulting
Migration | Analytics | Applications | Validation
Training
R | Revolution R | Statistical Topics
Systems Integration
BI | ERP | Databases | Cloud
30. Why customers choose Revolution R Enterprise
INNOVATION
MULTI-PLATFORM
TIME-to-VALUE
VALUE
31. Thank You!
Download slides, replay
http://bit.ly/YbfQo1
Resources for getting started with R
http://bit.ly/ZnZGt2
Get Revolution R Enterprise
Contact Sales: http://bit.ly/hey-revo
Free to Academics:
www.revolutionanalytics.com/academic
We’re Hiring!
www.revolutionanalytics.com/careers
32. Thank you.
The leading commercial provider of software and support for the popular
open source R statistics language.
www.revolutionanalytics.com 650.646.9545 Twitter: @RevolutionR