SlideShare uma empresa Scribd logo
1 de 15
Baixar para ler offline
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
Extending the Reach of R
to the Enterprise
TIBCOEnterpriseRuntimeforR
Lou Bajuk-Yorgan, Sr. Director, Product Management
lbajuk@tibco.com
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
Extending the Reach of R to the Enterprise
• TIBCO, S+, R and me
• Why (and how) did we embrace R?
• Challenges of R for Enterprise applications
• TIBCO Enterprise Runtime for R (TERR)
• Benefits for organizations (and individuals) who use R
• Examples of TERR integration and performance
• Learn more and try it yourself
- 2 -
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
Our (and My) Journey to TERR
• John Chambers developed the S language at Bell Labs
– Starting in the mid 70’s
• Insightful (Mathsoft/ Statsci) founded to commercial S as
S+ in 1987
– The “plus”: statistical libraries, documentation, and support
– Later focus on commercial users, ease of use, server integration
– I started using S+ ~1993, joined Insightful in 1996
• R: development begun by Ross Ihaka and Robert
Gentleman at University of Auckland in 1995
• Insightful acquired by TIBCO in 2008
– Spotfire (for Data Discovery and Visualization) acquired in 2007
• Focus shifted on integration of advanced analytics to
enterprise applications
– Step 1: Embrace R
- 3 -
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
Why embrace R?
• Focus of energy and innovation in the statistical
community
– Agility and power of the S/R language
– Provides a freely available platform for anyone to use
– Easily and develop and share new analytic methods as packages
– Hugely successful
• Many users, many contributed packages
• Default way that people communicate in statistics
• R provides tremendous benefits to statisticians
– But large enterprises are often challenged to leverage that value
beyond the individual statistician
- 4 -
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
What does “Embracing R” look like?
• TIBCO Spotfire Statistics Services
– Integration of R & S+ into Spotfire (and
other) applications
– Easily provide targeted, relevant
advanced analytics to larger community
of users, combined with the interactive
visualization of Spotfire
– Statistician can ensure compliance &
proper usage
– Later integrated with SAS® and
MATLAB® as well
• Contribute to the R community
– Sponsor useR conferences, contribute
to R Foundation
– Contribute bug reports and fixes to R
core
– Contribute packages to community
• Well received—but our Enterprise
customers need more
- 5 -
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
Enterprise Challenges for R
• R struggles with Big Data
– Customers don’t use R, or reimplement R code in specialized libraries
or other languages
– Lose agility & consistency, delay time to production, lose opportunities
• R was not built for enterprise usage and integration
– Built as an academic tool for research and teaching
– Some R Core members exploring different directions
– Software vendors attempting to use R in ways it was never intended
• GPL has been great for statisticians, but limits enterprise innovation
and investment
– Viral open source licensing risks IP
– Only attracts innovations people are willing to give away
– Large vendors avoid tight integration due to open source concerns
• Free to acquire, but costly to maintain
– Version incompatibilities, variable quality in packages
– Lack of enterprise-level technical support
– Difficult to validate
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
TIBCO Enterprise Runtime for R (TERR)
• Unique, enterprise-quality runtime for the R language
– Fundamentally different from anything else on the market
• Developed from the ground up, based on our long history and expertise with
S+
– Brings scalability & stability to agile R language
– Faster, more robust and more memory-efficient than R
– TIBCO IP: Not open source/GPL
• Licensable for embedding and redistribution by partners
– Brand new architecture, new foundation for future investment
– Broad support for R functions and hundreds of CRAN packages
• Ongoing effort to broaden our coverage of R
• Extends the Reach of R to the Enterprise
– Develop in Open Source R, deploy on TERR
– Agility of R
– Rapidly iterate prototyping to production without recoding/retesting
– Easily integrate predictive analytics consistently across organization
• Initial Focus: Integration with Spotfire platform
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
Providing value for organizations who use R
Embeddable
TERR Engine
Custom (tight) integration, batch, existing grids, etc.
• Faster than R, more robust, better memory management, fully
supported
• Low level APIs for tight integration
• Basic data i/o (text files, JDBC, etc.)
• Non GPL
TERR in
Statistics Services
Distributed analytics
• Managed pools of engines
• Load balancing, queuing, failover, parallelization, etc.
• High level APIs for loose integration, data i/o (C#, Java)
• Central management of analytics, R packages
TERR in Spotfire
Ad hoc tools and interactive applications powered by advanced analytics
• Spotfire Analytics platform: interactive visualization & data discovery,
easily build and share applications, broad data access, etc.
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
Providing Value for individuals who use R
• Not seeking to displace R from statistician’s desktops
– Enterprise platform for the deployment and integration of your
work—without having to rewrite it!
• Contribute packages to CRAN
– As we port from S+ or develop for TERR
• Supports “Develop in OS R, Deploy on TERR”
• Now available on CRAN
– splusTimeSeries, splusTimeDate, sjdbc
– tibbrConnector
• TERR Developer Edition
– Full version of TERR engine for testing code prior to deployment
• Console only—expect you to use your favorite R IDE
– Perpetual License for free, non-production use
– Supported through Community site
- 9 -
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
TERR Examples
• TERR vs. R raw performance
• TERR in Spotfire—powering Predictive Analytics tools
and applications for Data Discovery
• TERR and real-time event processing
- 10
-
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
TERR vs. R Raw Performance
One specific example
• Non-optimal, non-vectorized, real-world R script
• For loop with row by row processing
for (i in seq(1,length=nrow(df))) {
…process each customer record…
}
Results
• TERR is ~35x faster for 50K rows, 150x faster for 500K rows
• No code modification required
We are looking for more real-world performance tests!
• On average 2-10x faster than R in microtests
- 11 -
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
TERR in Spotfire: Predictive Modeling
Predictive Modeling Tools
• Regression
• Classification
• Model Diagnostics
Build and shared Spotfire
applications with
embedded analytics
Power of predictive
analytics in Spotfire
• For non-R programmer
• Leverage the interactive
visualizations of Spotfire
• Powered by embedded
TERR engine
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
Real time Fraud Detection
• TIBCO Business Events
– Complex Event Processing system
• Example integration with TERR
– Deploy R models on TERR engine for real-time scoring in
response to complex events
– Random forest model, scoring online transactions for fraud
• Testing on a small group of servers
– Total of 26 CPU cores
• Results
– 2200 transactions scored per second, using random forest
model in integrated TERR engine
- 13
-
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
Extending the Reach of R to the Enterprise
• TIBCO, S+, R and me
• Why (and how) did we embrace R?
• Challenges of R for Enterprise applications
• TIBCO Enterprise Runtime for R (TERR)
• Benefits for organizations (and individuals) who use R
• Examples of TERR integration and performance
• Learn more and try it yourself
- 14
-
Click to edit Master title style
© Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information.
Learn more and Try it yourself
• useR presentation: Memory Management in TERR
– By Michael Sannella, Friday 11:50
• TERR Community now available at TIBCOmmunity.com
– Resources, FAQs, Forums
– Details of R coverage
– Product documentation & download
– More info at spotfire.tibco.com/terr
• TERR Developer Edition
– Full version of TERR engine for testing code prior to deployment
– Supported through TIBCOmmunity, download via tap.tibco.com
• We want your feedback and input!
– Real world performance tests
– Package & R coverage prioritization
– Via TERR Community, or contact me lbajuk@tibco.com or
@loubajuk

Mais conteúdo relacionado

Semelhante a Extend the Reach of R to the Enterprise (for useR! 2013)

Extending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and SpotfireExtending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and SpotfireLou Bajuk
 
Extending the R language to BI and Real-time Applications JSM 2015
Extending the R language to BI and Real-time Applications JSM 2015Extending the R language to BI and Real-time Applications JSM 2015
Extending the R language to BI and Real-time Applications JSM 2015Lou Bajuk
 
TERR in BI and Real Time applications
TERR in BI and Real Time applicationsTERR in BI and Real Time applications
TERR in BI and Real Time applicationsLou Bajuk
 
Using the R Language in BI and Real Time Applications (useR 2015)
Using the R Language in BI and Real Time Applications (useR 2015)Using the R Language in BI and Real Time Applications (useR 2015)
Using the R Language in BI and Real Time Applications (useR 2015)Lou Bajuk
 
Deploying R in BI and Real time Applications
Deploying R in BI and Real time ApplicationsDeploying R in BI and Real time Applications
Deploying R in BI and Real time ApplicationsLou Bajuk
 
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...Data Con LA
 
R in BI and Streaming Applications for useR 2016
R in BI and Streaming Applications for useR 2016R in BI and Streaming Applications for useR 2016
R in BI and Streaming Applications for useR 2016Lou Bajuk
 
Applying the R Language to BI and Real Time Applications
Applying the R Language to BI and Real Time ApplicationsApplying the R Language to BI and Real Time Applications
Applying the R Language to BI and Real Time ApplicationsLou Bajuk
 
Applying R in BI and Real Time applications EARL London 2015
Applying R in BI and Real Time applications EARL London 2015Applying R in BI and Real Time applications EARL London 2015
Applying R in BI and Real Time applications EARL London 2015Lou Bajuk
 
Beyond PowerPlay: Choose the Right OLAP Tool for Your BI Environment (Cognos...
 Beyond PowerPlay: Choose the Right OLAP Tool for Your BI Environment (Cognos... Beyond PowerPlay: Choose the Right OLAP Tool for Your BI Environment (Cognos...
Beyond PowerPlay: Choose the Right OLAP Tool for Your BI Environment (Cognos...Senturus
 
TIBCO Advanced Analytics Meetup (TAAM) November 2015
TIBCO Advanced Analytics Meetup (TAAM) November 2015TIBCO Advanced Analytics Meetup (TAAM) November 2015
TIBCO Advanced Analytics Meetup (TAAM) November 2015Bipin Singh
 
Engineering Effectiveness
Engineering EffectivenessEngineering Effectiveness
Engineering EffectivenessMarcio Sete
 
Houston Energy Data Science Meet up_TIBCO Slides
Houston Energy Data Science Meet up_TIBCO SlidesHouston Energy Data Science Meet up_TIBCO Slides
Houston Energy Data Science Meet up_TIBCO SlidesJennifer Walsh
 
Framework and Product Comparison for Big Data Log Analytics and ITOA
Framework and Product Comparison for Big Data Log Analytics and ITOA Framework and Product Comparison for Big Data Log Analytics and ITOA
Framework and Product Comparison for Big Data Log Analytics and ITOA Kai Wähner
 
Real time applications using the R Language
Real time applications using the R LanguageReal time applications using the R Language
Real time applications using the R LanguageLou Bajuk
 
Embedded Analytics: Build or Buy
Embedded Analytics: Build or BuyEmbedded Analytics: Build or Buy
Embedded Analytics: Build or BuyTIBCO Jaspersoft
 
Building A Business-Facing Mobile Developer Community
Building A Business-Facing Mobile Developer CommunityBuilding A Business-Facing Mobile Developer Community
Building A Business-Facing Mobile Developer CommunityProgrammableWeb
 
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...Chief Analytics Officer Forum
 
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025Nicola Sandoli
 

Semelhante a Extend the Reach of R to the Enterprise (for useR! 2013) (20)

Extending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and SpotfireExtending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and Spotfire
 
Extending the R language to BI and Real-time Applications JSM 2015
Extending the R language to BI and Real-time Applications JSM 2015Extending the R language to BI and Real-time Applications JSM 2015
Extending the R language to BI and Real-time Applications JSM 2015
 
TERR in BI and Real Time applications
TERR in BI and Real Time applicationsTERR in BI and Real Time applications
TERR in BI and Real Time applications
 
Using the R Language in BI and Real Time Applications (useR 2015)
Using the R Language in BI and Real Time Applications (useR 2015)Using the R Language in BI and Real Time Applications (useR 2015)
Using the R Language in BI and Real Time Applications (useR 2015)
 
Deploying R in BI and Real time Applications
Deploying R in BI and Real time ApplicationsDeploying R in BI and Real time Applications
Deploying R in BI and Real time Applications
 
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
 
R in BI and Streaming Applications for useR 2016
R in BI and Streaming Applications for useR 2016R in BI and Streaming Applications for useR 2016
R in BI and Streaming Applications for useR 2016
 
Applying the R Language to BI and Real Time Applications
Applying the R Language to BI and Real Time ApplicationsApplying the R Language to BI and Real Time Applications
Applying the R Language to BI and Real Time Applications
 
Applying R in BI and Real Time applications EARL London 2015
Applying R in BI and Real Time applications EARL London 2015Applying R in BI and Real Time applications EARL London 2015
Applying R in BI and Real Time applications EARL London 2015
 
Beyond PowerPlay: Choose the Right OLAP Tool for Your BI Environment (Cognos...
 Beyond PowerPlay: Choose the Right OLAP Tool for Your BI Environment (Cognos... Beyond PowerPlay: Choose the Right OLAP Tool for Your BI Environment (Cognos...
Beyond PowerPlay: Choose the Right OLAP Tool for Your BI Environment (Cognos...
 
TIBCO Advanced Analytics Meetup (TAAM) November 2015
TIBCO Advanced Analytics Meetup (TAAM) November 2015TIBCO Advanced Analytics Meetup (TAAM) November 2015
TIBCO Advanced Analytics Meetup (TAAM) November 2015
 
Engineering Effectiveness
Engineering EffectivenessEngineering Effectiveness
Engineering Effectiveness
 
Houston Energy Data Science Meet up_TIBCO Slides
Houston Energy Data Science Meet up_TIBCO SlidesHouston Energy Data Science Meet up_TIBCO Slides
Houston Energy Data Science Meet up_TIBCO Slides
 
Framework and Product Comparison for Big Data Log Analytics and ITOA
Framework and Product Comparison for Big Data Log Analytics and ITOA Framework and Product Comparison for Big Data Log Analytics and ITOA
Framework and Product Comparison for Big Data Log Analytics and ITOA
 
Real time applications using the R Language
Real time applications using the R LanguageReal time applications using the R Language
Real time applications using the R Language
 
Embedded Analytics: Build or Buy
Embedded Analytics: Build or BuyEmbedded Analytics: Build or Buy
Embedded Analytics: Build or Buy
 
Buy vs build_nam
Buy vs build_namBuy vs build_nam
Buy vs build_nam
 
Building A Business-Facing Mobile Developer Community
Building A Business-Facing Mobile Developer CommunityBuilding A Business-Facing Mobile Developer Community
Building A Business-Facing Mobile Developer Community
 
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
 
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
 

Mais de Lou Bajuk

R Consortium update for EARL Boston Oct 2017
R Consortium update for EARL Boston Oct 2017R Consortium update for EARL Boston Oct 2017
R Consortium update for EARL Boston Oct 2017Lou Bajuk
 
Reusing and Managing R models in an Enterprise
Reusing and Managing  R models in an EnterpriseReusing and Managing  R models in an Enterprise
Reusing and Managing R models in an EnterpriseLou Bajuk
 
R consortium update EARL London Sept 2017
R consortium update EARL London Sept 2017R consortium update EARL London Sept 2017
R consortium update EARL London Sept 2017Lou Bajuk
 
Making Data Science accessible to a wider audience
Making Data Science accessible to a wider audienceMaking Data Science accessible to a wider audience
Making Data Science accessible to a wider audienceLou Bajuk
 
R Consortium Update for EARL June 2017
R Consortium Update for EARL June 2017R Consortium Update for EARL June 2017
R Consortium Update for EARL June 2017Lou Bajuk
 
Streaming analytics overview for R
Streaming analytics overview for RStreaming analytics overview for R
Streaming analytics overview for RLou Bajuk
 
Tibco streaming analytics overview and roadmap
Tibco streaming analytics overview and roadmapTibco streaming analytics overview and roadmap
Tibco streaming analytics overview and roadmapLou Bajuk
 
Embracing data science for smarter analytics apps
Embracing data science for smarter analytics appsEmbracing data science for smarter analytics apps
Embracing data science for smarter analytics appsLou Bajuk
 
EARL Sept 2016 R consortium
EARL Sept 2016 R consortiumEARL Sept 2016 R consortium
EARL Sept 2016 R consortiumLou Bajuk
 
The Importance of an Analytics Platform
The Importance of an Analytics PlatformThe Importance of an Analytics Platform
The Importance of an Analytics PlatformLou Bajuk
 
Software Testing and the R language
Software Testing and the R languageSoftware Testing and the R language
Software Testing and the R languageLou Bajuk
 
Sannella use r2013-terr-memory-management
Sannella use r2013-terr-memory-managementSannella use r2013-terr-memory-management
Sannella use r2013-terr-memory-managementLou Bajuk
 

Mais de Lou Bajuk (12)

R Consortium update for EARL Boston Oct 2017
R Consortium update for EARL Boston Oct 2017R Consortium update for EARL Boston Oct 2017
R Consortium update for EARL Boston Oct 2017
 
Reusing and Managing R models in an Enterprise
Reusing and Managing  R models in an EnterpriseReusing and Managing  R models in an Enterprise
Reusing and Managing R models in an Enterprise
 
R consortium update EARL London Sept 2017
R consortium update EARL London Sept 2017R consortium update EARL London Sept 2017
R consortium update EARL London Sept 2017
 
Making Data Science accessible to a wider audience
Making Data Science accessible to a wider audienceMaking Data Science accessible to a wider audience
Making Data Science accessible to a wider audience
 
R Consortium Update for EARL June 2017
R Consortium Update for EARL June 2017R Consortium Update for EARL June 2017
R Consortium Update for EARL June 2017
 
Streaming analytics overview for R
Streaming analytics overview for RStreaming analytics overview for R
Streaming analytics overview for R
 
Tibco streaming analytics overview and roadmap
Tibco streaming analytics overview and roadmapTibco streaming analytics overview and roadmap
Tibco streaming analytics overview and roadmap
 
Embracing data science for smarter analytics apps
Embracing data science for smarter analytics appsEmbracing data science for smarter analytics apps
Embracing data science for smarter analytics apps
 
EARL Sept 2016 R consortium
EARL Sept 2016 R consortiumEARL Sept 2016 R consortium
EARL Sept 2016 R consortium
 
The Importance of an Analytics Platform
The Importance of an Analytics PlatformThe Importance of an Analytics Platform
The Importance of an Analytics Platform
 
Software Testing and the R language
Software Testing and the R languageSoftware Testing and the R language
Software Testing and the R language
 
Sannella use r2013-terr-memory-management
Sannella use r2013-terr-memory-managementSannella use r2013-terr-memory-management
Sannella use r2013-terr-memory-management
 

Último

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 

Último (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 

Extend the Reach of R to the Enterprise (for useR! 2013)

  • 1. © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. Extending the Reach of R to the Enterprise TIBCOEnterpriseRuntimeforR Lou Bajuk-Yorgan, Sr. Director, Product Management lbajuk@tibco.com
  • 2. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. Extending the Reach of R to the Enterprise • TIBCO, S+, R and me • Why (and how) did we embrace R? • Challenges of R for Enterprise applications • TIBCO Enterprise Runtime for R (TERR) • Benefits for organizations (and individuals) who use R • Examples of TERR integration and performance • Learn more and try it yourself - 2 -
  • 3. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. Our (and My) Journey to TERR • John Chambers developed the S language at Bell Labs – Starting in the mid 70’s • Insightful (Mathsoft/ Statsci) founded to commercial S as S+ in 1987 – The “plus”: statistical libraries, documentation, and support – Later focus on commercial users, ease of use, server integration – I started using S+ ~1993, joined Insightful in 1996 • R: development begun by Ross Ihaka and Robert Gentleman at University of Auckland in 1995 • Insightful acquired by TIBCO in 2008 – Spotfire (for Data Discovery and Visualization) acquired in 2007 • Focus shifted on integration of advanced analytics to enterprise applications – Step 1: Embrace R - 3 -
  • 4. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. Why embrace R? • Focus of energy and innovation in the statistical community – Agility and power of the S/R language – Provides a freely available platform for anyone to use – Easily and develop and share new analytic methods as packages – Hugely successful • Many users, many contributed packages • Default way that people communicate in statistics • R provides tremendous benefits to statisticians – But large enterprises are often challenged to leverage that value beyond the individual statistician - 4 -
  • 5. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. What does “Embracing R” look like? • TIBCO Spotfire Statistics Services – Integration of R & S+ into Spotfire (and other) applications – Easily provide targeted, relevant advanced analytics to larger community of users, combined with the interactive visualization of Spotfire – Statistician can ensure compliance & proper usage – Later integrated with SAS® and MATLAB® as well • Contribute to the R community – Sponsor useR conferences, contribute to R Foundation – Contribute bug reports and fixes to R core – Contribute packages to community • Well received—but our Enterprise customers need more - 5 -
  • 6. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. Enterprise Challenges for R • R struggles with Big Data – Customers don’t use R, or reimplement R code in specialized libraries or other languages – Lose agility & consistency, delay time to production, lose opportunities • R was not built for enterprise usage and integration – Built as an academic tool for research and teaching – Some R Core members exploring different directions – Software vendors attempting to use R in ways it was never intended • GPL has been great for statisticians, but limits enterprise innovation and investment – Viral open source licensing risks IP – Only attracts innovations people are willing to give away – Large vendors avoid tight integration due to open source concerns • Free to acquire, but costly to maintain – Version incompatibilities, variable quality in packages – Lack of enterprise-level technical support – Difficult to validate
  • 7. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. TIBCO Enterprise Runtime for R (TERR) • Unique, enterprise-quality runtime for the R language – Fundamentally different from anything else on the market • Developed from the ground up, based on our long history and expertise with S+ – Brings scalability & stability to agile R language – Faster, more robust and more memory-efficient than R – TIBCO IP: Not open source/GPL • Licensable for embedding and redistribution by partners – Brand new architecture, new foundation for future investment – Broad support for R functions and hundreds of CRAN packages • Ongoing effort to broaden our coverage of R • Extends the Reach of R to the Enterprise – Develop in Open Source R, deploy on TERR – Agility of R – Rapidly iterate prototyping to production without recoding/retesting – Easily integrate predictive analytics consistently across organization • Initial Focus: Integration with Spotfire platform
  • 8. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. Providing value for organizations who use R Embeddable TERR Engine Custom (tight) integration, batch, existing grids, etc. • Faster than R, more robust, better memory management, fully supported • Low level APIs for tight integration • Basic data i/o (text files, JDBC, etc.) • Non GPL TERR in Statistics Services Distributed analytics • Managed pools of engines • Load balancing, queuing, failover, parallelization, etc. • High level APIs for loose integration, data i/o (C#, Java) • Central management of analytics, R packages TERR in Spotfire Ad hoc tools and interactive applications powered by advanced analytics • Spotfire Analytics platform: interactive visualization & data discovery, easily build and share applications, broad data access, etc.
  • 9. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. Providing Value for individuals who use R • Not seeking to displace R from statistician’s desktops – Enterprise platform for the deployment and integration of your work—without having to rewrite it! • Contribute packages to CRAN – As we port from S+ or develop for TERR • Supports “Develop in OS R, Deploy on TERR” • Now available on CRAN – splusTimeSeries, splusTimeDate, sjdbc – tibbrConnector • TERR Developer Edition – Full version of TERR engine for testing code prior to deployment • Console only—expect you to use your favorite R IDE – Perpetual License for free, non-production use – Supported through Community site - 9 -
  • 10. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. TERR Examples • TERR vs. R raw performance • TERR in Spotfire—powering Predictive Analytics tools and applications for Data Discovery • TERR and real-time event processing - 10 -
  • 11. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. TERR vs. R Raw Performance One specific example • Non-optimal, non-vectorized, real-world R script • For loop with row by row processing for (i in seq(1,length=nrow(df))) { …process each customer record… } Results • TERR is ~35x faster for 50K rows, 150x faster for 500K rows • No code modification required We are looking for more real-world performance tests! • On average 2-10x faster than R in microtests - 11 -
  • 12. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. TERR in Spotfire: Predictive Modeling Predictive Modeling Tools • Regression • Classification • Model Diagnostics Build and shared Spotfire applications with embedded analytics Power of predictive analytics in Spotfire • For non-R programmer • Leverage the interactive visualizations of Spotfire • Powered by embedded TERR engine
  • 13. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. Real time Fraud Detection • TIBCO Business Events – Complex Event Processing system • Example integration with TERR – Deploy R models on TERR engine for real-time scoring in response to complex events – Random forest model, scoring online transactions for fraud • Testing on a small group of servers – Total of 26 CPU cores • Results – 2200 transactions scored per second, using random forest model in integrated TERR engine - 13 -
  • 14. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. Extending the Reach of R to the Enterprise • TIBCO, S+, R and me • Why (and how) did we embrace R? • Challenges of R for Enterprise applications • TIBCO Enterprise Runtime for R (TERR) • Benefits for organizations (and individuals) who use R • Examples of TERR integration and performance • Learn more and try it yourself - 14 -
  • 15. Click to edit Master title style © Copyright 2000-2013 TIBCO Software Inc. All rights reserved. TIBCO Confidential & Proprietary Information. Learn more and Try it yourself • useR presentation: Memory Management in TERR – By Michael Sannella, Friday 11:50 • TERR Community now available at TIBCOmmunity.com – Resources, FAQs, Forums – Details of R coverage – Product documentation & download – More info at spotfire.tibco.com/terr • TERR Developer Edition – Full version of TERR engine for testing code prior to deployment – Supported through TIBCOmmunity, download via tap.tibco.com • We want your feedback and input! – Real world performance tests – Package & R coverage prioritization – Via TERR Community, or contact me lbajuk@tibco.com or @loubajuk