SlideShare uma empresa Scribd logo
1 de 16
Baixar para ler offline
Execution of R Built
             Predictive Solutions
             Alex Guazzelli, PhD
             VP, Analytics - Zementis, Inc.

             useR! 2010




Zementis ©
Exporting Models from R




        Memory
                         Why?
               Speed                         Freedom
                       Transparency

    Interoperability             Accessibility

                       Because you can!




  Zementis ©                                           2
Exporting Models from R




               How?
               PMML

  Zementis ©              3
PMML


               Predictive Model Markup Language (PMML)
               PMML is an XML-based language to
 PMML
                   Define data mining models
                   Share models between compliant applications
               Standard for exchange of models to
                   Avoid proprietary issues and incompatibilities
                   Easily put models to work
               Clear separation of tasks
                   Model development vs. model execution
                   Scientists focus on building the best model
                   Eliminates need for custom model deployment




 Zementis ©                                                          4
PMML Structure



                                      • A Data Dictionary defines all the raw data
                                        fields (including missing value strategy and
                                        outlier treatment).
                  Models
                                      • Several Data Transformations strategies allow
                                       for intelligent extraction of feature detectors from
                                       raw data (“data massaging”).
    Transformations
                                      • A comprehensive list of Data-Mining Models
                                       offers power and flexibility.

                                      • Post-processing of results allow for tailored
PMML defines a standard not only to     decisions.
represent data-mining models, but
also data handling and data           • Model Explanation allows for performance
transformations (pre- and post-         evaluation.
processing)



     Zementis ©                                                                           5
PMML
Industry Support


                    Matured and Supported by Industry
                    Data Mining Group http://www.dmg.org
   PMML
                    Vendor independent consortium
                    Mature standard
                         Current version 4.0
                         Active group and constant enhancements
                    Industry supporters
                         Major Players: IBM/SPSS, Oracle, SAP, Microsoft
                         Analytics: KXEN, SAS, Salford, Togaware, Zementis
                         BI: Microstrategy, Teradata, Tibco, Pentaho
                         Open Source: R, KNIME, Rapid-I
                         Others: Equifax, FICO, Open Data Group, Visa,
                           Pervasive, NASA




   Zementis ©                                                                 6
Using the PMML package to export
                  a Neural Network model.




Zementis ©                                      7
Model is readily exported in PMML and ready to be used.




Zementis ©                                                             8
From R to PMML
Supported Packages/Objects




 nnet: Neural Networks          lm/glm: Regression

    hclust: Clustering           kmeans: Clustering

rpart: Decision Trees                ksvm: SVMs

    arules: Association Rules     randomForest




   Zementis ©                                         9
Got Models…




              Data Analysis

             Statistical Model

              PMML Export




              What Now?

 Zementis ©                       10
ADAPA


    ADAPA by Zementis
             Predictive Decisioning Platform
             PMML-based
             Drools to integrate business logic
             Scalable execution platform
             Real-time integration into business processes
             Accessible from anywhere
             Not a model development environment

   ADAPA on Amazon Elastic Compute Cloud
            Software as a service
            Up/Down scaling as needed
            Pay-as-you-go
                 Amazon Payments ($.99 per hour)
            Amazon experience & reliability


     Zementis ©
From Model Building to Model Deployment
 Model Building              Model Deployment




 Zementis ©                                     12
Model Execution




  Zementis ©
Model Execution via iPhone




  Zementis ©
Zementis Contributions


 • ADAPA: A decision engine that deploys models expressed in PMML and executes
 them in real-time. Available for on-site and cloud deployments.
 • Excel Add-in: Allows for scoring in ADAPA directly from within Excel.
 • PMML Converter: Validates, converts, and corrects old and new PMML code.
 Available at the DMG website and at http://www.zementis.com/pmml.htm.
 • Contributing Member of the DMG: Submitted several proposals for PMML 4.0 and
 already working with other members on PMML 4.1.
 • Code contributor for the R PMML package (available on CRAN).
 • PMML Articles: R Journal and SIGKDD Explorations Newsletter. Available for
 downloading at http://www.zementis.com/manual.htm
 • PMML Book: Available on Amazon.com.
 • PMML Blogs: Several blogs on PMML topics (http://adapasupport.zementis.com and
 http://www.predictive-analytics.info).



   Zementis ©                                                                       15
Thank You!


                          E-mail: info@zementis.com



    U.S.A Headquarters             Asia Office

   6125 Cornerstone Court East     19/F., Unit A
   Suite 250                       Ho Lee Commercial Building
   San Diego, CA, 92121            38-44 D’Aguilar Street
                                   Central, Hong Kong (S.A.R.)

   Tel: +1 619 330-0780            Tel: +852 2868-0878
   Fax: +1 858 535-0227            Fax: +852 2845-6027




Zementis ©                                                       16

Mais conteúdo relacionado

Semelhante a PMML Execution of R Built Predictive Solutions

Predictive Analytics - Big Data Warehousing Meetup, Zementis
Predictive Analytics - Big Data Warehousing Meetup, ZementisPredictive Analytics - Big Data Warehousing Meetup, Zementis
Predictive Analytics - Big Data Warehousing Meetup, ZementisCaserta
 
Zeller Edm Summit Agile Deployment Of Predictive Analytics
Zeller Edm Summit   Agile Deployment Of Predictive AnalyticsZeller Edm Summit   Agile Deployment Of Predictive Analytics
Zeller Edm Summit Agile Deployment Of Predictive AnalyticsRonald.Ramos
 
Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Kun Le
 
The Impact of Cloud Computing on Predictive Analytics 7-29-09 v5
The Impact of Cloud Computing on Predictive Analytics 7-29-09 v5The Impact of Cloud Computing on Predictive Analytics 7-29-09 v5
The Impact of Cloud Computing on Predictive Analytics 7-29-09 v5Robert Grossman
 
Evaluation of meta modeling tools for domain specific modeling language chnjl
Evaluation of meta modeling   tools for domain specific modeling language chnjlEvaluation of meta modeling   tools for domain specific modeling language chnjl
Evaluation of meta modeling tools for domain specific modeling language chnjlPG Scholar
 
Kamanja: Driving Business Value through Real-Time Decisioning Solutions
Kamanja: Driving Business Value through Real-Time Decisioning SolutionsKamanja: Driving Business Value through Real-Time Decisioning Solutions
Kamanja: Driving Business Value through Real-Time Decisioning SolutionsGreg Makowski
 
Application services in Private Clouds - IBM Smarter Business 2013
Application services in Private Clouds - IBM Smarter Business 2013Application services in Private Clouds - IBM Smarter Business 2013
Application services in Private Clouds - IBM Smarter Business 2013IBM Sverige
 
엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...
엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...
엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...Amazon Web Services Korea
 
Why z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIsWhy z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIsTeodoro Cipresso
 
Introduction to WebSphere Message Broker
Introduction to WebSphere Message BrokerIntroduction to WebSphere Message Broker
Introduction to WebSphere Message BrokerAnt Phillips
 
SPSS Modeler 16 What's New!?
SPSS Modeler 16 What's New!?SPSS Modeler 16 What's New!?
SPSS Modeler 16 What's New!?Chris Sparshott
 
Building a Hybrid Cloud The Real Deal
Building a Hybrid Cloud The Real DealBuilding a Hybrid Cloud The Real Deal
Building a Hybrid Cloud The Real DealRightScale
 
Practical Artificial Intelligence: Deep Learning Beyond Cats and Cars
Practical Artificial Intelligence: Deep Learning Beyond Cats and CarsPractical Artificial Intelligence: Deep Learning Beyond Cats and Cars
Practical Artificial Intelligence: Deep Learning Beyond Cats and CarsAlexey Rybakov
 
Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...
Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...
Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...Aaron Zornes
 
Ds roi tc_world
Ds roi tc_worldDs roi tc_world
Ds roi tc_worldvsrtwin
 
Google cloud Study Jam 2023.pptx
Google cloud Study Jam 2023.pptxGoogle cloud Study Jam 2023.pptx
Google cloud Study Jam 2023.pptxGDSCNiT
 
Zementis hortonworks-webinar-2014-09
Zementis hortonworks-webinar-2014-09Zementis hortonworks-webinar-2014-09
Zementis hortonworks-webinar-2014-09Hortonworks
 

Semelhante a PMML Execution of R Built Predictive Solutions (20)

Predictive Analytics - Big Data Warehousing Meetup, Zementis
Predictive Analytics - Big Data Warehousing Meetup, ZementisPredictive Analytics - Big Data Warehousing Meetup, Zementis
Predictive Analytics - Big Data Warehousing Meetup, Zementis
 
Zeller Edm Summit Agile Deployment Of Predictive Analytics
Zeller Edm Summit   Agile Deployment Of Predictive AnalyticsZeller Edm Summit   Agile Deployment Of Predictive Analytics
Zeller Edm Summit Agile Deployment Of Predictive Analytics
 
Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...
 
The Impact of Cloud Computing on Predictive Analytics 7-29-09 v5
The Impact of Cloud Computing on Predictive Analytics 7-29-09 v5The Impact of Cloud Computing on Predictive Analytics 7-29-09 v5
The Impact of Cloud Computing on Predictive Analytics 7-29-09 v5
 
Evaluation of meta modeling tools for domain specific modeling language chnjl
Evaluation of meta modeling   tools for domain specific modeling language chnjlEvaluation of meta modeling   tools for domain specific modeling language chnjl
Evaluation of meta modeling tools for domain specific modeling language chnjl
 
Kamanja: Driving Business Value through Real-Time Decisioning Solutions
Kamanja: Driving Business Value through Real-Time Decisioning SolutionsKamanja: Driving Business Value through Real-Time Decisioning Solutions
Kamanja: Driving Business Value through Real-Time Decisioning Solutions
 
Application services in Private Clouds - IBM Smarter Business 2013
Application services in Private Clouds - IBM Smarter Business 2013Application services in Private Clouds - IBM Smarter Business 2013
Application services in Private Clouds - IBM Smarter Business 2013
 
엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...
엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...
엔터프라이즈의 AI/ML 활용을 돕는 Paxata 지능형 데이터 전처리 플랫폼 (최문규 이사, PAXATA) :: AWS Techforum...
 
Why z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIsWhy z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIs
 
Introduction to WebSphere Message Broker
Introduction to WebSphere Message BrokerIntroduction to WebSphere Message Broker
Introduction to WebSphere Message Broker
 
SPSS Modeler 16 What's New!?
SPSS Modeler 16 What's New!?SPSS Modeler 16 What's New!?
SPSS Modeler 16 What's New!?
 
2009-dec-10 Architectuur en HL7
2009-dec-10 Architectuur en HL72009-dec-10 Architectuur en HL7
2009-dec-10 Architectuur en HL7
 
Building a Hybrid Cloud The Real Deal
Building a Hybrid Cloud The Real DealBuilding a Hybrid Cloud The Real Deal
Building a Hybrid Cloud The Real Deal
 
Practical Artificial Intelligence: Deep Learning Beyond Cats and Cars
Practical Artificial Intelligence: Deep Learning Beyond Cats and CarsPractical Artificial Intelligence: Deep Learning Beyond Cats and Cars
Practical Artificial Intelligence: Deep Learning Beyond Cats and Cars
 
Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...
Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...
Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...
 
Ds roi tc_world
Ds roi tc_worldDs roi tc_world
Ds roi tc_world
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Google cloud Study Jam 2023.pptx
Google cloud Study Jam 2023.pptxGoogle cloud Study Jam 2023.pptx
Google cloud Study Jam 2023.pptx
 
Zementis hortonworks-webinar-2014-09
Zementis hortonworks-webinar-2014-09Zementis hortonworks-webinar-2014-09
Zementis hortonworks-webinar-2014-09
 
Ameya_Kasbekar_Resume
Ameya_Kasbekar_ResumeAmeya_Kasbekar_Resume
Ameya_Kasbekar_Resume
 

PMML Execution of R Built Predictive Solutions

  • 1. Execution of R Built Predictive Solutions Alex Guazzelli, PhD VP, Analytics - Zementis, Inc. useR! 2010 Zementis ©
  • 2. Exporting Models from R Memory Why? Speed Freedom Transparency Interoperability Accessibility Because you can! Zementis © 2
  • 3. Exporting Models from R How? PMML Zementis © 3
  • 4. PMML Predictive Model Markup Language (PMML)  PMML is an XML-based language to PMML  Define data mining models  Share models between compliant applications  Standard for exchange of models to  Avoid proprietary issues and incompatibilities  Easily put models to work  Clear separation of tasks  Model development vs. model execution  Scientists focus on building the best model  Eliminates need for custom model deployment Zementis © 4
  • 5. PMML Structure • A Data Dictionary defines all the raw data fields (including missing value strategy and outlier treatment). Models • Several Data Transformations strategies allow for intelligent extraction of feature detectors from raw data (“data massaging”). Transformations • A comprehensive list of Data-Mining Models offers power and flexibility. • Post-processing of results allow for tailored PMML defines a standard not only to decisions. represent data-mining models, but also data handling and data • Model Explanation allows for performance transformations (pre- and post- evaluation. processing) Zementis © 5
  • 6. PMML Industry Support Matured and Supported by Industry  Data Mining Group http://www.dmg.org PMML  Vendor independent consortium  Mature standard  Current version 4.0  Active group and constant enhancements  Industry supporters  Major Players: IBM/SPSS, Oracle, SAP, Microsoft  Analytics: KXEN, SAS, Salford, Togaware, Zementis  BI: Microstrategy, Teradata, Tibco, Pentaho  Open Source: R, KNIME, Rapid-I  Others: Equifax, FICO, Open Data Group, Visa, Pervasive, NASA Zementis © 6
  • 7. Using the PMML package to export a Neural Network model. Zementis © 7
  • 8. Model is readily exported in PMML and ready to be used. Zementis © 8
  • 9. From R to PMML Supported Packages/Objects nnet: Neural Networks lm/glm: Regression hclust: Clustering kmeans: Clustering rpart: Decision Trees ksvm: SVMs arules: Association Rules randomForest Zementis © 9
  • 10. Got Models…  Data Analysis  Statistical Model  PMML Export What Now? Zementis © 10
  • 11. ADAPA  ADAPA by Zementis  Predictive Decisioning Platform  PMML-based  Drools to integrate business logic  Scalable execution platform  Real-time integration into business processes  Accessible from anywhere  Not a model development environment  ADAPA on Amazon Elastic Compute Cloud  Software as a service  Up/Down scaling as needed  Pay-as-you-go  Amazon Payments ($.99 per hour)  Amazon experience & reliability Zementis ©
  • 12. From Model Building to Model Deployment Model Building Model Deployment Zementis © 12
  • 13. Model Execution Zementis ©
  • 14. Model Execution via iPhone Zementis ©
  • 15. Zementis Contributions • ADAPA: A decision engine that deploys models expressed in PMML and executes them in real-time. Available for on-site and cloud deployments. • Excel Add-in: Allows for scoring in ADAPA directly from within Excel. • PMML Converter: Validates, converts, and corrects old and new PMML code. Available at the DMG website and at http://www.zementis.com/pmml.htm. • Contributing Member of the DMG: Submitted several proposals for PMML 4.0 and already working with other members on PMML 4.1. • Code contributor for the R PMML package (available on CRAN). • PMML Articles: R Journal and SIGKDD Explorations Newsletter. Available for downloading at http://www.zementis.com/manual.htm • PMML Book: Available on Amazon.com. • PMML Blogs: Several blogs on PMML topics (http://adapasupport.zementis.com and http://www.predictive-analytics.info). Zementis © 15
  • 16. Thank You! E-mail: info@zementis.com U.S.A Headquarters Asia Office 6125 Cornerstone Court East 19/F., Unit A Suite 250 Ho Lee Commercial Building San Diego, CA, 92121 38-44 D’Aguilar Street Central, Hong Kong (S.A.R.) Tel: +1 619 330-0780 Tel: +852 2868-0878 Fax: +1 858 535-0227 Fax: +852 2845-6027 Zementis © 16