Revolution Confidential




Revolution R:
100% R and More



Presented by:
David Smith
VP Marketing, Revolution Analytics

Poll Question
    Which stats package do you use most?

December 14, 2011: Welcome!

• Thanks for coming.
• Slides and replay available (soon) at:
   • http://bit.ly/rOSvwK

David Smith
VP Marketing, Revolution Analytics
Editor, Revolutions blog: http://blog.revolutionanalytics.com
Twitter: @revodavid

In today’s webcast:

• About Revolution Analytics and R
• What Revolution R adds to R
• Resources for getting more from R
• Q&A

Introducing Revolution R

What is R?

Download the White Paper “R is Hot”: bit.ly/r-is-hot

• Data analysis software
• A programming language
   • Development platform designed by and for statisticians
• An environment
   • Huge library of algorithms for data access, data manipulation, analysis and graphics
• An open-source software project
   • Free, open, and active
• A community
   • Thousands of contributors, 2 million users
   • Resources and help in every domain

R is exploding in popularity and functionality

Scholarly Activity
Google Scholar hits (’05-’09 CAGR)

   R         46%
   SAS      -11%
   SPSS     -27%
   S-Plus     0%
   Stata     10%

“I’ve been astonished by the rate at which R has been adopted. Four years ago, everyone in my economics department [at the University of Chicago] was using Stata; now, as far as I can tell, R is the standard tool, and students learn it first.”
   Deputy Editor for New Products at Forbes

Package Growth
Number of R packages listed on CRAN
[Chart: x-axis 2002 to 2010]

“A key benefit of R is that it provides near-instant availability of new and experimental methods created by its user base, without waiting for the development/release cycle of commercial software. SAS recognizes the value of R to our customer base…”
   Product Marketing Manager, SAS Institute, Inc.

Source: http://r4stats.com/popularity

“R is the most powerful & flexible statistical programming language in the world” [1]

• Capabilities
   • Sophisticated statistical analyses
   • Predictive analytics
   • Data visualization
• Applications
   • Real-time trading
   • Finance
   • Risk assessment
   • Forecasting
   • Bio-technology
   • Drug development
   • Social networks
   • .. and more

[Chart: MSFT, from 2009; last 29.29]

1. Norman Nie, multiple interviews

R User Community
From: The R Ecosystem, bit.ly/R-ecosystem

Poll Question
    If you're not using R today, what
    would you most like to use R for?

Revolution R Enterprise is

R Productivity Environment (Windows)

[Screenshot callouts]
• Script with type ahead and code snippets
• Solutions window for organizing code and data
• Sophisticated debugging with breakpoints, variable values etc.
• Objects loaded in the R Environment
• Packages installed and loaded
• Object details

http://www.revolutionanalytics.com/demos/revolution-productivity-environment/demo.htm

Interactive Debugging

• One-click to set a breakpoint in an R script
• Step in/out/over, inspect variables
• Eliminate the edit -> browser -> repair cycle

Performance: Multi-threaded Math

Computation (4-core laptop)              Open Source R    Revolution R    Speedup
Linear Algebra [1]
   Matrix Multiply                       327 sec          13.4 sec        23x
   Cholesky Factorization                31.3 sec         1.8 sec         17x
   Linear Discriminant Analysis          216 sec          74.6 sec        2x
General R Benchmarks [2]
   R Benchmarks (Matrix Functions)       22 sec           3.5 sec         5x
   R Benchmarks (Program Control)        5.6 sec          5.4 sec         Not appreciable

1. http://www.revolutionanalytics.com/why-revolution-r/benchmarks.php
2. http://r.research.att.com/benchmarks/

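Timings like those in the table depend on the math library that R is linked against, so it can help to measure the same kinds of operations on your own machine. The following is a minimal sketch using only base R. The matrix size and the `speedup` helper are assumptions made for illustration, and this is not the benchmark script behind the footnoted URLs.

```r
# Minimal timing sketch using only base R; the size is an illustrative assumption.
set.seed(42)
n <- 300
a <- matrix(rnorm(n * n), nrow = n)

# Time a dense matrix multiply, t(a) %*% a (the kind of work a
# multi-threaded math library can spread across cores).
t_mult <- system.time(b <- crossprod(a))[["elapsed"]]

# Time a Cholesky factorization of the symmetric positive-definite product.
t_chol <- system.time(r <- chol(b))[["elapsed"]]

# Hypothetical helper: speedup = baseline time / accelerated time.
speedup <- function(baseline_sec, accelerated_sec) baseline_sec / accelerated_sec

cat(sprintf("multiply: %.3f sec, cholesky: %.3f sec\n", t_mult, t_chol))
```

Running the same script under two R installations and passing the two elapsed times to `speedup()` gives a ratio of the kind reported in the last column. For example, `speedup(31.3, 1.8)` is about 17.4, in line with the Cholesky row.
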
Three Paradigms for Big Data

• Standard R engine is constrained by capacity and performance
• Revolution R Enterprise offers three methods for big data with R:
   • Off-line: high-performance file-based analytics
   • Off-line, parallel & distributed analytics
   • On-line, in-database analytics
      • Hadoop
      • Netezza

Revolution R Enterprise with RevoScaleR
Big Data Statistics in R

www.revolutionanalytics.com/bigdata

Every US airline departure and arrival, 1987-2008

File: AirlineData87to08.xdf
Rows: 123.5 million
Variables: 29
Size on disk: 13.2 GB

                 arrDelayLm2 <- rxLinMod(ArrDelay ~ DayOfWeek:F(CRSDepTime),
                                         data = "AirlineData87to08.xdf", cube = TRUE)

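The formula in the rxLinMod call on this slide crosses day of week with scheduled departure time, so the fit amounts to one estimate per (day, time) cell. The sketch below shows the same model shape in open-source R with `lm()`, on a small simulated data set. The data values, the three departure hours, and the use of `factor()` in place of RevoScaleR's `F()` are all assumptions made for illustration. It does not reproduce rxLinMod's output or performance.

```r
# Simulated stand-in for the airline data; all values here are made up.
set.seed(1)
days <- c("Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun")
flights <- expand.grid(
  DayOfWeek  = factor(days, levels = days),
  CRSDepTime = c(6, 12, 18),   # scheduled departure hour
  rep        = 1:20            # 20 flights per cell so every cell is populated
)
flights$ArrDelay <- rnorm(nrow(flights),
                          mean = 5 + flights$CRSDepTime / 3, sd = 10)

# Interaction-only model without an intercept: one coefficient per
# (day of week, departure hour) cell, i.e. the mean delay in that cell.
fit <- lm(ArrDelay ~ 0 + DayOfWeek:factor(CRSDepTime), data = flights)

# The same cell means computed directly, for comparison.
cell_means <- tapply(flights$ArrDelay,
                     list(flights$DayOfWeek, factor(flights$CRSDepTime)),
                     mean)
```

With every cell populated, the 21 fitted coefficients match the 21 entries of `cell_means`. That is why this model shape suits very large data sets: each cell only needs a running sum and a count.
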
Example: Old Wives Census Analysis

http://info.revolutionanalytics.com/CensusOldWivesWhitePaper.html

RevoScaleR – Distributed Computing

[Diagram: four data partitions, each attached to a compute node (RevoScaleR), all connected to one master node (RevoScaleR)]

•   Portions of the data source are made available to each compute node
•   RevoScaleR on the master node assigns a task to each compute node
•   Each compute node independently processes its data, and returns its intermediate results back to the master node
•   The master node aggregates all of the intermediate results from each compute node and produces the final result

*Available now for Microsoft HPC Server
Video demo: http://bit.ly/ugQ9KR

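The partition, process, and aggregate pattern described on this slide can be sketched on a single machine with base R. The example below fits a linear regression by having each "node" compute the intermediate results X'X and X'y on its own partition, then summing them on the "master". The simulated data, the `node_task` function, and the use of `lapply()` in place of a real cluster are assumptions made for illustration. This is not RevoScaleR's implementation.

```r
# Simulated data; the partitions stand in for data held on separate compute nodes.
set.seed(7)
n <- 1000
dat <- data.frame(x1 = rnorm(n), x2 = rnorm(n))
dat$y <- 2 + 3 * dat$x1 - 1.5 * dat$x2 + rnorm(n)

# Portions of the data source are made available to each of four nodes.
partitions <- split(dat, rep(1:4, length.out = n))

# Work done independently on each node: intermediate results X'X and X'y.
node_task <- function(part) {
  X <- cbind(Intercept = 1, x1 = part$x1, x2 = part$x2)
  list(xtx = crossprod(X), xty = crossprod(X, part$y))
}
intermediate <- lapply(partitions, node_task)  # lapply stands in for the cluster

# Master node: add up the intermediate results and solve for the coefficients.
xtx  <- Reduce(`+`, lapply(intermediate, `[[`, "xtx"))
xty  <- Reduce(`+`, lapply(intermediate, `[[`, "xty"))
beta <- drop(solve(xtx, xty))
```

Because the sums of X'X and X'y over the partitions equal the full-data X'X and X'y, `beta` agrees with `coef(lm(y ~ x1 + x2, data = dat))` up to numerical precision, and only small matrices travel between nodes.
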
Revolution Analytics with Netezza Appliance

More info: http://bit.ly/R-Netezza

RevoConnectR for Hadoop

Write Map-Reduce analytics using only R code with these R packages:
• rhdfs - R and HDFS
• rhbase - R and HBASE
• rmr - R and MapReduce

[Diagram: R Map or Reduce tasks on a Task Node, HDFS, HBASE (via Thrift), and a Job Tracker, connected to the Revolution R Client through rhdfs, rhbase, and rmr]

More information at: bit.ly/r-hadoop

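For readers new to Map-Reduce, the map, shuffle, and reduce steps can be sketched in plain base R without Hadoop. The example below computes the mean delay per carrier. It deliberately does not use the rmr, rhdfs, or rhbase APIs. The toy records and the names `map_fn` and `reduce_fn` are hypothetical and chosen only for illustration.

```r
# Toy records; in Hadoop these would be blocks of a file in HDFS.
records <- data.frame(
  carrier = c("AA", "UA", "AA", "DL", "UA", "AA"),
  delay   = c(10, 5, 20, 0, 15, 30),
  stringsAsFactors = FALSE
)
chunks <- split(records, c(1, 1, 2, 2, 3, 3))  # three input splits

# Map: emit one (key, value) pair per record, value = c(sum, count).
map_fn <- function(chunk) {
  lapply(seq_len(nrow(chunk)), function(i) {
    list(key = chunk$carrier[i], value = c(sum = chunk$delay[i], n = 1))
  })
}

# Reduce: combine all values that share a key by adding them.
reduce_fn <- function(values) Reduce(`+`, values)

pairs   <- unlist(lapply(chunks, map_fn), recursive = FALSE)  # map phase
keys    <- vapply(pairs, function(p) p$key, character(1))
grouped <- split(lapply(pairs, function(p) p$value), keys)    # shuffle by key
totals  <- lapply(grouped, reduce_fn)                         # reduce phase
mean_delay <- vapply(totals, function(v) v[["sum"]] / v[["n"]], numeric(1))
```

Here `mean_delay` holds 20 for AA, 0 for DL, and 10 for UA. Emitting a (sum, count) pair rather than a mean keeps the reduce step associative, so partial results can be combined in any order.
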
Enterprise Readiness:
Revolution R Enterprise Server

• Multi-User Support
• Production Applications
• Integrate R analytics into Web-based applications
   • Data Analysis and Visualization
   • Reporting
   • Dashboards
   • Interactive applications
• Revolution R Enterprise Server with RevoDeployR

Deployment with Revolution R Enterprise

End User:               Desktop Applications (e.g. Excel) | Business Intelligence (e.g. Jaspersoft) | Interactive Web Applications
Application Developer:  Client libraries (JavaScript, Java, .NET)
                        HTTP/HTTPS – JSON/XML
R Programmer:           RevoDeployR Web Services
                        Session Management | Authentication | Data/Script Management | Administration
                        R

Coming soon: Revolution R GUI

• Accessible
• Powerful
• Extensible

The Advanced Analytics Stack

       Deployment / Consumption
       Advanced Analytics
       ETL
       Data / Infrastructure

“Open Analytics Stack” White Paper: bit.ly/lC43Kw

• On-Call Technical Support
• Consulting
   • Migration | Analytics | Applications | Validation
• Training
   • R | Revolution R | Statistical Topics
• Systems Integration
   • BI | ERP | Databases | Cloud

Wrapping Up

Why R?

• Every data analysis technique at your fingertips
• Create beautiful and unique data visualizations
• Get better results faster
• Draw on the talents of data scientists worldwide
• R is hot, and growing fast

Revolution R Enterprise

Production-Grade Statistical Analysis for the Workplace

• High-performance R for multiprocessor systems
• Modern Integrated Development Environment
• Statistical Analysis of Terabyte-Class Data Sets
• In-database R analytics with Hadoop and Netezza
• Deploy R Applications via Web Services
• Telephone and email technical support
• Training and consulting services
• 100% compatible with R packages
• Easy-to-Use GUI [1]

1. Coming Soon

Further Reading

http://bit.ly/revo-r-pdf
http://bit.ly/r-is-hot

Revolution R Enterprise: Free to Academia

• Personal use
• Research
• Teaching
• Package development

Free Academic Download
www.revolutionanalytics.com/downloads/free-academic.php
Discounted Technical Support Subscriptions Available

Thank You!

• Download slides, replay (from Oct 20)
   • http://bit.ly/rOSvwK
• Learn more about Revolution R
   • revolutionanalytics.com/products
• Contact Revolution Analytics
   • http://bit.ly/hey-revo

Dec 20: Big Data Analysis Starts with R
A 30-minute executive webinar to find out how companies of all types and sizes can integrate “R” into their “big data” analytics infrastructure strategy.
www.revolutionanalytics.com/news-events/free-webinars

Poll Question
    What interests you most about
    Revolution R Enterprise?

The leading commercial provider of software and support for the
popular open source R statistics language.

www.revolutionanalytics.com
+1 (650) 646 9545
Twitter: @RevolutionR

Revolution R Enterprise - 100% R and More

  • 1. Revolution Confidential R evolution R : 100% R and More P res ented by: David S mith V P Marketing, R evolution A nalytic s
  • 2. Revolution Confidential P oll Ques tion Which stats package do you use most?
  • 3. Dec ember 14, 2011: Welc ome! Revolution Confidential  Thanks for coming.  Slides and replay available (soon) at:  http://bit.ly/rOSvwK David Smith VP Marketing, Revolution Analytics Editor, Revolutions blog http://blog.revolutionanalytics.com Twitter: @revodavid 3
  • 4. In today’s webc as t: Revolution Confidential  About Revolution Analytics and R  What Revolution R adds to R  Resources for getting more from R  Q&A Introducing Revolution R 4
  • 5. What is R ? Download the White PaperConfidential R is Hot Revolution bit.ly/r-is-hot  Data analysis software  A programming language  Development platform designed by and for statisticians  An environment  Huge library of algorithms for data access, data manipulation, analysis and graphics  An open-source software project  Free, open, and active  A community  Thousands of contributors, 2 million users  Resources and help in every domain 5
  • 6. R is exploding in popularity and func tionality Revolution Confidential Scholarly Activity Google Scholar hits (’05-’09 CAGR) R 46% “I’ve been astonished by the rate at which R has been adopted. Four years ago, SAS -11% everyone in my economics department [at SPSS -27% the University of Chicago] was using Stata; now, as far as I can tell, R is the S-Plus 0% standard tool, and students learn it first.” Stata 10% Deputy Editor for New Products at Forbes Package Growth Number of R packages listed on CRAN “A key benefit of R is that it provides near- instant availability of new and experimental methods created by its user base — without waiting for the development/release cycle of commercial software. SAS recognizes the value of R to our customer base…” Product Marketing Manager SAS Institute, Inc. 2002 2004 2006 2008 2010 Source: http://r4stats.com/popularity 6
  • 7. “R is the most powerful & flexible statistical programming language in the world” [1]. Capabilities: sophisticated statistical analyses; predictive analytics; data visualization. Applications: real-time trading; finance; risk assessment; forecasting; bio-technology; drug development; social networks; .. and more. [Chart: MSFT, last 29.29] [1] Norman Nie, multiple interviews
  • 8. R User Community. From: The R Ecosystem, bit.ly/R-ecosystem
  • 9. Poll Question: If you're not using R today, what would you most like to use R for?
  • 10. Revolution R Enterprise is
  • 11. R Productivity Environment (Windows). Script with type ahead and code snippets; Solutions window for organizing code and data; sophisticated debugging with breakpoints, variable values etc.; objects loaded in the R environment; packages installed and loaded; object details. Demo: http://www.revolutionanalytics.com/demos/revolution-productivity-environment/demo.htm
  • 12. Interactive Debugging: one click to set a breakpoint in an R script; step in/out/over, inspect variables; eliminate the edit -> browser -> repair cycle
  • 13. Performance: Multi-threaded Math. Open Source R vs. Revolution R Enterprise (computation on a 4-core laptop). Linear Algebra [1]: Matrix Multiply, 327 sec vs. 13.4 sec (23x speedup); Cholesky Factorization, 31.3 sec vs. 1.8 sec (17x); Linear Discriminant Analysis, 216 sec vs. 74.6 sec (2x). General R Benchmarks [2]: R Benchmarks (Matrix Functions), 22 sec vs. 3.5 sec (5x); R Benchmarks (Program Control), 5.6 sec vs. 5.4 sec (not appreciable). [1] http://www.revolutionanalytics.com/why-revolution-r/benchmarks.php [2] http://r.research.att.com/benchmarks/
  • 14. Three Paradigms for Big Data. Standard R engine is constrained by capacity and performance. Revolution R Enterprise offers three methods for big data with R: (1) off-line, high-performance file-based analytics; (2) off-line, parallel & distributed analytics; (3) on-line, in-database analytics (Hadoop, Netezza)
  • 15. Revolution R Enterprise with RevoScaleR: Big Data Statistics in R (www.revolutionanalytics.com/bigdata). Data: every US airline departure and arrival, 1987-2008. File: AirlineData87to08.xdf; Rows: 123.5 million; Variables: 29; Size on disk: 13.2 GB. Code: arrDelayLm2 <- rxLinMod(ArrDelay ~ DayOfWeek:F(CRSDepTime), cube=TRUE)
  • 16. Example: Old Wives Census Analysis. http://info.revolutionanalytics.com/CensusOldWivesWhitePaper.html
  • 17. RevoScaleR: Distributed Computing. Portions of the data source are made available to each compute node; RevoScaleR on the master node assigns a task to each compute node; each compute node independently processes its data and returns its intermediate results to the master node; the master node aggregates the intermediate results from all compute nodes and produces the final result. [Diagram: a master node (RevoScaleR) connected to four compute nodes (RevoScaleR), each with its own data partition.] *Available now for Microsoft HPC Server. Video demo: http://bit.ly/ugQ9KR
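The master/compute-node pattern on slide 17 can be illustrated with a small sketch. It is written in Python rather than R and does not use or imitate the RevoScaleR API; the `Partial` class, the function names, and the toy data are all hypothetical. It assumes the "intermediate results" are running sums, which is enough to fit a one-variable linear regression without any node seeing the whole data set.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class Partial:
    """Intermediate result returned by one compute node (hypothetical)."""
    n: int = 0
    sx: float = 0.0
    sy: float = 0.0
    sxx: float = 0.0
    sxy: float = 0.0


def process_partition(rows: Sequence[Tuple[float, float]]) -> Partial:
    """Compute-node step: summarize one data partition independently."""
    p = Partial()
    for x, y in rows:
        p.n += 1
        p.sx += x
        p.sy += y
        p.sxx += x * x
        p.sxy += x * y
    return p


def aggregate(partials: List[Partial]) -> Tuple[float, float]:
    """Master-node step: combine partial sums into (intercept, slope)."""
    n = sum(p.n for p in partials)
    sx = sum(p.sx for p in partials)
    sy = sum(p.sy for p in partials)
    sxx = sum(p.sxx for p in partials)
    sxy = sum(p.sxy for p in partials)
    slope = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    intercept = (sy - slope * sx) / n
    return intercept, slope


# Toy data following y = 2x + 1, split into three partitions.
data = [(float(x), 2.0 * x + 1.0) for x in range(12)]
partitions = [data[0:4], data[4:8], data[8:12]]

# On a real cluster each call would run on a separate node; here they run in turn.
partials = [process_partition(part) for part in partitions]
intercept, slope = aggregate(partials)
print(f"intercept={intercept:.3f}, slope={slope:.3f}")
```

Because each partition contributes only five numbers, the traffic back to the master stays small however large the partitions are.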
  • 18. Revolution Analytics with Netezza Appliance. More info: http://bit.ly/R-Netezza
  • 19. RevoConnectR for Hadoop. Write Map-Reduce analytics using only R code with these R packages: rhdfs (R and HDFS); rhbase (R and HBASE); rmr (R and MapReduce). [Diagram labels: HBASE, HDFS, Thrift, Job Tracker, Map or Reduce Task Node, Revolution R Client, rhbase, rhdfs, rmr.] More information at: bit.ly/r-hadoop
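For readers new to the Map-Reduce model named on slide 19, the sketch below shows its three phases in plain Python. It is not the rmr, rhdfs, or rhbase API; the record layout (day of week, arrival delay in minutes) and every function name are assumptions made for illustration only.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

# Hypothetical record: (day of week, arrival delay in minutes).
Record = Tuple[str, float]


def map_step(record: Record) -> Iterator[Tuple[str, float]]:
    """Map: emit one (key, value) pair per input record."""
    day, delay = record
    yield day, delay


def shuffle(pairs: Iterable[Tuple[str, float]]) -> Dict[str, List[float]]:
    """Group values by key, as the framework does between map and reduce."""
    groups: Dict[str, List[float]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_step(key: str, values: List[float]) -> Tuple[str, float]:
    """Reduce: collapse all values for one key into a mean delay."""
    return key, sum(values) / len(values)


records: List[Record] = [("Mon", 10.0), ("Tue", 4.0), ("Mon", 20.0), ("Tue", 8.0)]
pairs = [pair for record in records for pair in map_step(record)]
mean_delay = dict(reduce_step(k, v) for k, v in shuffle(pairs).items())
print(mean_delay)
```

In a Hadoop job the map and reduce functions would run on separate task nodes and the grouping would be done by the framework; only the two user-written functions change from one analysis to the next.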
  • 20. Enterprise Readiness: Revolution R Enterprise Server. Multi-user support; production applications; integrate R analytics into Web-based applications (data analysis and visualization, reporting, dashboards, interactive applications); Revolution R Enterprise Server with RevoDeployR
  • 21. Deployment with Revolution R Enterprise. [Diagram, top to bottom] End User: desktop applications (e.g. Excel), business intelligence (e.g. Jaspersoft), interactive web applications. Application Developer: client libraries (JavaScript, Java, .NET). HTTP/HTTPS with JSON/XML. R Programmer: RevoDeployR Web Services (session management, authentication, data/script management, administration). R
  • 22. Coming soon: Revolution R GUI. Accessible; Powerful; Extensible
  • 23. The Advanced Analytics Stack: Deployment / Consumption; Advanced Analytics; ETL; Data / Infrastructure. “Open Analytics Stack” White Paper: bit.ly/lC43Kw
  • 24. On-Call Technical Support; Consulting (Migration | Analytics | Applications | Validation); Training (R | Revolution R | Statistical Topics); Systems Integration (BI | ERP | Databases | Cloud)
  • 26. Why R? Every data analysis technique at your fingertips; create beautiful and unique data visualizations; get better results faster; draw on the talents of data scientists worldwide; R is hot, and growing fast
  • 27. Revolution R Enterprise: Production-Grade Statistical Analysis for the Workplace. High-performance R for multiprocessor systems; modern integrated development environment; statistical analysis of terabyte-class data sets; in-database R analytics with Hadoop and Netezza; deploy R applications via Web services; telephone and email technical support; training and consulting services; 100% compatible with R packages; easy-to-use GUI (coming soon)
  • 28. Further Reading: http://bit.ly/revo-r-pdf and http://bit.ly/r-is-hot
  • 29. Revolution R Enterprise: Free to Academia. Personal use; research; teaching; package development. Free academic download: www.revolutionanalytics.com/downloads/free-academic.php (discounted technical support subscriptions available)
  • 30. Thank You! Download slides, replay (from Oct 20): http://bit.ly/rOSvwK | Learn more about Revolution R: revolutionanalytics.com/products | Contact Revolution Analytics: http://bit.ly/hey-revo | Dec 20: Big Data Analysis Starts with R, a 30-minute executive webinar to find out how companies of all types and sizes can integrate “R” into their “big data” analytics infrastructure strategy: www.revolutionanalytics.com/news-events/free-webinars
  • 31. Poll Question: What interests you most about Revolution R Enterprise?
  • 32. The leading commercial provider of software and support for the popular open source R statistics language. www.revolutionanalytics.com | +1 (650) 646 9545 | Twitter: @RevolutionR