3. Dec ember 14, 2011: Welc ome! Revolution Confidential
Thanks for coming.
Slides and replay available (soon) at:
http://bit.ly/rOSvwK
David Smith
VP Marketing, Revolution Analytics
Editor, Revolutions blog
http://blog.revolutionanalytics.com
Twitter: @revodavid
3
4. In today’s webc as t: Revolution Confidential
About Revolution Analytics and R
What Revolution R adds to R
Resources for getting more from R
Q&A
Introducing Revolution R 4
5. What is R ? Download the White PaperConfidential
R is Hot
Revolution
bit.ly/r-is-hot
Data analysis software
A programming language
Development platform designed by and for statisticians
An environment
Huge library of algorithms for data access, data
manipulation, analysis and graphics
An open-source software project
Free, open, and active
A community
Thousands of contributors, 2 million users
Resources and help in every domain
5
6. R is exploding in popularity and
func tionality Revolution Confidential
Scholarly Activity
Google Scholar hits (’05-’09 CAGR)
R 46% “I’ve been astonished by the rate at which
R has been adopted. Four years ago,
SAS -11%
everyone in my economics department [at
SPSS -27%
the University of Chicago] was using
Stata; now, as far as I can tell, R is the
S-Plus 0% standard tool, and students learn it first.”
Stata 10%
Deputy Editor for New Products at Forbes
Package Growth
Number of R packages listed on CRAN
“A key benefit of R is that it provides near-
instant availability of new and
experimental methods created by its user
base — without waiting for the
development/release cycle of commercial
software. SAS recognizes the value of R
to our customer base…”
Product Marketing Manager SAS Institute, Inc.
2002 2004 2006 2008 2010
Source: http://r4stats.com/popularity 6
7. “ R is the mos t powerful & flexible s tatis tic al
Revolution Confidential
programming language in the world” 1
Capabilities
Sophisticated
statistical analyses
Predictive analytics
Data visualization
Applications
Real-time trading MSFT [2009-
Last 29.29
Finance 30
Risk assessment 25
Forecasting 20
Bio-technology 15
Drug development
Social networks
.. and more
1. Norman Nie, multiple interviews 7
8. From: The R Ecosystem
R Us er C ommunity bit.ly/R-ecosystem
8
10. R evolution R E nterpris e is Revolution Confidential
10
11. R P roduc tivity E nvironment (Windows )
Revolution Confidential
Script with type
ahead and code Solutions window
snippets for organizing
code and data
Sophisticated
debugging with
breakpoints , variable Objects
values etc. loaded in the
R
Environment
Packages Object
installed and details
loaded
http://www.revolutionanalytics.com/demos/revolution-productivity-environment/demo.htm
11
12. Interac tive Debugging Revolution Confidential
One-click to set a breakpoint in an R script
Step in/out/over, inspect variables
Eliminate the edit -> browser -> repair cycle
12
13. P erformanc e: Multi-threaded Math Revolution Confidential
Open Revolution R
Source R Enterprise
Computation (4-core laptop) Open Source R Revolution R Speedup
Linear Algebra1
Matrix Multiply 327 sec 13.4 sec 23x
Cholesky Factorization 31.3 sec 1.8 sec 17x
Linear Discriminant Analysis 216 sec 74.6 sec 2x
General R Benchmarks2
R Benchmarks (Matrix Functions) 22 sec 3.5 sec 5x
R Benchmarks (Program Control) 5.6 sec 5.4 sec Not appreciable
1. http://www.revolutionanalytics.com/why-revolution-r/benchmarks.php
2. http://r.research.att.com/benchmarks/
13
14. T hree P aradigms for B ig Data Revolution Confidential
Standard R engine is constrained by
capacity and performance
Revolution R Enterprise offers three
methods for big data with R:
Off-line: high-performance file-based analytics
Off-line, parallel & distributed analytics
On-line, in-database analytics
Hadoop
Netezza
14
15. R evolution R E nterpris e with R evoS c aleR
B ig Data S tatis tic s in R Revolution Confidential
www.revolutionanalytics.com/bigdata
Every US airline
departure and arrival,
1987-2008
File: AirlineData87to08.xdf
Rows: 123.5 million
Variables: 29
Size on disk: 13.2Gb
arrDelayLm2 <- rxLinMod(ArrDelay ~ DayOfWeek:F(CRSDepTime),cube=TRUE)
15
16. E xample: Old Wives C ens us A nalys is Revolution Confidential
http://info.revolutionanalytics.com/Cen
susOldWivesWhitePaper.html
16
17. R evoS c aleR – Dis tributed C omputing Revolution Confidential
Compute • Portions of the data source are
Data Node made available to each compute
Partition (RevoScaleR) node
• RevoScaleR on the master node
Compute assigns a task to each compute
Data Node node
Partition (RevoScaleR)
Master • Each compute node independently
Node processes its data, and returns its
Compute (RevoScaleR) intermediate results back to the
Data Node master node
Partition (RevoScaleR)
• master node aggregates all of the
intermediate results from each
Compute compute node and produces the
Data Node final result
Partition (RevoScaleR)
*Available now for Microsoft HPC Server
Video demo: http://bit.ly/ugQ9KR
17
18. R evolution A nalytic s with Netezza A pplianc e
Revolution Confidential
More info: http://bit.ly/R-Netezza
18
19. R evoC onnec tR for Hadoop Revolution Confidential
Write Map-Reduce analytics using
HBASE only R code with these R
packages:
HDFS
rhdfs - R and HDFS
R
Thrift rhbase - R and HBASE
Map or
Reduce
rmr - R and MapReduce
Task rhbase
rhdfs
Node
Revolution R More information at:
Job Client bit.ly/r-hadoop
Tracker rmr
19
20. E nterpris e R eadines s :
R evolution R E nterpris e S erver Revolution Confidential
Multi-User Support
Production Applications
Integrate R analytics into Web based applications
Data Analysis and Visualization
Reporting
Dashboards
Interactive applications
Revolution R Enterprise Server with RevoDeployR
20
21. Deployment with R evolution R E nterpris e Revolution Confidential
End User Desktop Business
Interactive Web
Applications Intelligence
Applications
(e.g. Excel) (e.g. Jaspersoft)
Application
Client libraries (JavaScript, Java, .NET)
Developer
HTTP/HTTPS – JSON/XML
R RevoDeployR Web Services
Programmer
Session Data/Script
Authentication Administration
Management Management
R
21
22. C oming s oon: R evolution R G UI Revolution Confidential
Accessible
Powerful
Extensible
11
23. T he A dvanc ed A nalytic s S tac k Revolution Confidential
Deployment / Consumption
Advanced Analytics
ETL
Data / Infrastructure
“Open Analytics Stack” White Paper: bit.ly/lC43Kw
23
24. Revolution Confidential
On-Call Technical Support
Consulting
Migration | Analytics | Applications | Validation
Training
R | Revolution R | Statistical Topics
Systems Integration
BI | ERP | Databases | Cloud
24
26. Why R ? Revolution Confidential
Every data analysis technique at your fingertips
Create beautiful and unique data visualizations
Get better results faster
Draw on the talents of data scientists worldwide
R is hot, and growing fast
26
27. R evolution R E nterpris e Revolution Confidential
Production-Grade Statistical Analysis for the Workplace
High-performance R for multiprocessor systems
Modern Integrated Development Environment
Statistical Analysis of Terabyte-Class Data Sets
In-database R analytics with Hadoop and Netezza
Deploy R Applications via Web Services
Telephone and email technical support
Training and consulting services
100% compatible with R packages
Easy-to-Use GUI1
1 Coming Soon 27
28. F urther R eading Revolution Confidential
http://bit.ly/revo-r-pdf http://bit.ly/r-is-hot
28
29. R evolution R E nterpris e: F ree to A c ademia Revolution Confidential
Personal use
Research
Teaching
Package development
Free Academic Download
www.revolutionanalytics.com/downloads/free-academic.php
Discounted Technical Support Subscriptions Available
29
30. T hank You! Revolution Confidential
Download slides, replay (from Oct 20)
http://bit.ly/rOSvwK
Learn more about Revolution R
revolutionanalytics.com/products
Contact Revolution Analytics
http://bit.ly/hey-revo
Dec 20: Big Data Analysis Starts with R
A 30-minute executive webinar to find out how companies of all types and sizes can
integrate “R” into their “big data” analytics infrastructure strategy.
www.revolutionanalytics.com/news-events/free-webinars
30
32. Revolution Confidential
The leading commercial provider of software and support for the
popular open source R statistics language.
www.revolutionanalytics.com
+1 (650) 646 9545
Twitter: @RevolutionR
32