SlideShare uma empresa Scribd logo
1 de 47
Baixar para ler offline
Actuarial Science as Data Science
Actuarial Modeling in R


Revolution Analytics Webinar   Jim Guszcza, FCAS, MAAA

                               Deloitte Consulting LLP
                               University of Wisconsin-Madison
March 28, 2012
About Your Presenter




•    James Guszcza, PhD, FCAS, MAAA
•    National Predictive Analytics Lead – Deloitte Consulting Actuarial, Risk, Analytics practice
•    Assistant professor of actuarial science & risk management – U. Wisconsin-Madison
•    PhD in Philosophy – The University of Chicago
•    Fellow of the Casualty Actuarial Society
•    Lots experience building predictive models / analyzing data in and outside of insurance

    jguszcza@deloitte.com
    jguszcza@bus.wisc.edu
     2   Deloitte Analytics Institute                                                      © 2011 Deloitte LLP
Agenda
         Introduction

         Actuarial Science and Data Science

         R Background

         Case Studies

         • Fitting a complex size of loss model

         • Loss Reserving

         • Bayesian Hierarchical Modeling

         • Revolution: Tweedie Regression on big data
Actuarial Science and
Data Science
Not Just Hype


“Perhaps the most important cultural trend today: The
 explosion of data about every aspect of our world and
 the rise of applied math gurus who know how to use it.”

    -- Chris Anderson, editor-in-chief of Wired

• So behavioral economics is important in insurance for two
  classes of reasons:

     • Decision-makers at insurance companies are human
     • People making insurance purchasing decisions are human




5    Deloitte Analytics Institute                             © 2010 Deloitte LLP
Brave New World With Such Algorithms In IT

• The analysis of data affects:

• What we buy

• What we read

• What we watch

• How we network

• How we socialize

• The opinions we form

• Whom we date and marry!

6   Deloitte Analytics Institute             © 2010 Deloitte LLP
Clinical vs Actuarial Judgment – the Motion Picture




7   Deloitte Analytics Institute                      © 2010 Deloitte LLP
Analytics Everywhere

                 • Neural net models are used to predict movie box-office returns based on
                   features of their scripts

                 • Decision tree models are used to help ER doctors better triage patients
                   complaining of chest pain.

                 • Predictive models are used to predict the price of different wine vintages
                   based on variables about the growing season.

                 • Predictive models to help commercial insurance underwriters better select
                   and price risks.

                 • Predict which non-custodial parents are at highest risk of falling into
                   arrears on their child support.

                 • Predicting which job candidates will successfully make it through the
                   interviewing / recruiting process… and which candidates will subsequently
                   retain and perform well on the job.

                 • Predicting which doctors are at highest risk of being sued for malpractice.

                 • Predicting the ultimate severity of injury claims.
8   Deloitte Analytics Institute                                        (Deloitte applications in green)
                                                                                             © 2010 Deloitte LLP
At the Center of It All: Data Science
      Or: “The Collision between Statistics and Computation”



• Today the analytics world is
  different largely due to
  exponential growth in
  computing power.

• The skill set underlying
  business analytics is
  increasingly called
  data science.

• Data science goes beyond:
    • Traditional statistics
    • Business intelligence [BI]
                                                       Image borrowed from Drew Conway’s blog
    • Information technology                http://www.dataists.com/2010/09/the-data-science-venn-diagram

9   Deloitte Analytics Institute                                                             © 2010 Deloitte LLP
Where Do We Want to Be?




       •Here?

                                               Image borrowed from Drew Conway’s blog
                                    http://www.dataists.com/2010/09/the-data-science-venn-diagram

10   Deloitte Analytics Institute                                                    © 2010 Deloitte LLP
Where Do We Want to Be?




       •Or Here?

                                               Image borrowed from Drew Conway’s blog
                                    http://www.dataists.com/2010/09/the-data-science-venn-diagram

11   Deloitte Analytics Institute                                                    © 2010 Deloitte LLP
On then, on to R




12   Deloitte Analytics Institute   © 2010 Deloitte LLP
R Background
R Overview

     R is an open-source, object-oriented statistical programming language.
     In the past decade, it has become the global lingua franca of statistics.


• History:
     • R is based on the S statistical programming language developed by
       John Chambers at Bell labs in the 1980’s
     • R is an open-source implementation of the S language
     • Developed by Robert Gentlemen and Ross Ihaka at U Auckland
     • Revolution R is a commercially supported, scalable implementation
       of R, with parallel processing and big data capabilities


• Features:
     • R is an interactive, object-oriented programming environment
     • R has advanced graphical capabilities
     • Statisticians around the world contribute add-on packages


14   Deloitte Analytics Institute                                          © 2010 Deloitte LLP
On the Shoulders of Giants

• … therefore prominent people tend say things like this:




http://www.nytimes.com/2009/01/07/technology/business-computing/07program.html?pagewanted=all




15   Deloitte Analytics Institute                                                               © 2010 Deloitte LLP
Facets of R

• In a recent article John Chambers discussed 6 “Facets of R”
     1.    An interface to computational procedures of many kinds
     2.    Interactive, hands-on in real time
     3.    Functional in its model of programming
     4.    Object-oriented, “everything is an object”
     5.    Modular, built from standardized pieces
     6.    Collaborative, a world-wide, open-source effort


• Interactive interface: Chambers was influenced by APL
     • In the days before spreadsheets, APL was very popular in the actuarial
       community
     • One of the rare interactive scientific computing environments
     • Gives user ability to express novel computations
     • Heavy emphasis on matrices and arrays
     • But: unlike R, APL had no interface to procedures

16   Deloitte Analytics Institute                                          © 2010 Deloitte LLP
A Network ExteRnality

• Hal Varian’s “giant” has grown at
  an exponential rate.

• The open-source nature of R
  has encouraged top researchers
  from around the world to
  contribute new, often highly
  advanced, packages.

• Result: a powerful “network
  effect”.
     • The value of a product increases as
       more people use it.


• R has become something like
  the Wikipedia of the statistics
  world.
17   Deloitte Analytics Institute            © 2010 Deloitte LLP
Adoption in the Actuarial World




18   Deloitte Analytics Institute   © 2010 Deloitte LLP
Free from Frees

• Jed Frees at the University of Wisconsin-Madison has made R integral to
  his new book on regression and time series. He maintains a nice website
  containing R instructions, data, and code.




     http://instruction.bus.wisc.edu/jfrees/jfreesbooks/Regression%20Modeling/BookWebDec2010/learnR.html
19    Deloitte Analytics Institute                                                         © 2010 Deloitte LLP
Case Studies
Some Everyday Uses of R

• Free-form Exploratory Data Analysis
     • ad hoc data munging, data visualizations, fitting simple models on the fly
     • Loss models (“exam 4/C”)


• Unsupervised Learning
     • Correlation analysis, principal component / factor analysis, variable clustering,
       k-means and hierarchical clustering, self-organizing maps, association rules
       (aka “market basket analysis”), Latent Dirichlet Analysis


• Supervised Learning
     • “statistics paradigm”: GLM, Multilevel/Hierarchical models, quantile
       regression
     • “machine learning paradigm: CART, MARS, Random Forests, Neural
       Networks, Support Vector Machines
     • Bayesian data analysis (MCMC simulation), causal analysis


• Optimization
21   Deloitte Analytics Institute                                              © 2010 Deloitte LLP
Case Study #1
Loss Distribution Modeling
Modeling a Non-Trivial Loss Distribution


• A typical actuarial
  problem: modeling a
  highly skew and
  ambiguous loss




                                    8 e-06
  distribution




                                    6 e-06
• Traditional medium of
  analysis: spreadsheets.

                                    4 e-06
• Why limit ourselves?              2 e-06
                                    0 e+00




                                             0 e+00   1 e+06   2 e+06     3 e+06   4 e+06         5 e+06

                                                                   loss
23   Deloitte Analytics Institute                                                  © 2010 Deloitte LLP
Case Study #2
Loss Reserving
Three Approaches to Loss Reserving

    • A garden-variety loss triangle:
                                               Cumulative   Losses in 1000's
     AY           premium             12      24     36        48      60    72      84      96     108     120   CL Ult   CL LR         CL res
    1988            2,609            404     986 1,342      1,582 1,736 1,833     1,907   1,967   2,006   2,036    2,036     0.78             0
    1989            2,694            387     964 1,336      1,580 1,726 1,823     1,903   1,949   1,987            2,017     0.75            29
    1990            2,594            421   1,037 1,401      1,604 1,729 1,821     1,878   1,919                    1,986     0.77            67
    1991            2,609            338     753 1,029      1,195 1,326 1,395     1,446                            1,535     0.59            89
    1992            2,077            257     569    754       892     958 1,007                                    1,110     0.53           103
    1993            1,703            193     423    589       661     713                                            828     0.49           115
    1994            1,438            142     361    463       533                                                    675     0.47           142
    1995            1,093            160     312    408                                                              601     0.55           193
    1996            1,012            131     352                                                                     702     0.69           350
    1997             976             122                                                                             576     0.59           454

chain link                         2.365 1.354 1.164 1.090 1.054 1.038 1.026 1.020 1.015  1.000                   12,067                   1,543
chain ldf                          4.720 1.996 1.473 1.266 1.162 1.102 1.062 1.035 1.015  1.000
growth curve                      21.2% 50.1% 67.9% 79.0% 86.1% 90.7% 94.2% 96.6% 98.5% 100.0%




    • Let’s use R to forecast outstanding losses using three methods:
           • Replicate the above chain-ladder spreadsheet calculation – easy!
           • Use the Over-dispersed Poisson GLM model
           • Longitudinal data analysis using growth curves

    25     Deloitte Analytics Institute                                                                                     © 2010 Deloitte LLP
What Do You See?

    • Let’s look at the loss triangle with fresh eyes.

    • We would like to do stochastic reserving the “right” way.

    • What considerations come to mind?


                                               Cumulative   Losses in 1000's
     AY           premium             12      24     36        48      60    72      84      96     108     120   CL Ult   CL LR         CL res
    1988            2,609            404     986 1,342      1,582 1,736 1,833     1,907   1,967   2,006   2,036    2,036     0.78             0
    1989            2,694            387     964 1,336      1,580 1,726 1,823     1,903   1,949   1,987            2,017     0.75            29
    1990            2,594            421   1,037 1,401      1,604 1,729 1,821     1,878   1,919                    1,986     0.77            67
    1991            2,609            338     753 1,029      1,195 1,326 1,395     1,446                            1,535     0.59            89
    1992            2,077            257     569    754       892     958 1,007                                    1,110     0.53           103
    1993            1,703            193     423    589       661     713                                            828     0.49           115
    1994            1,438            142     361    463       533                                                    675     0.47           142
    1995            1,093            160     312    408                                                              601     0.55           193
    1996            1,012            131     352                                                                     702     0.69           350
    1997             976             122                                                                             576     0.59           454

chain link                         2.365 1.354 1.164 1.090 1.054 1.038 1.026 1.020 1.015  1.000                   12,067                   1,543
chain ldf                          4.720 1.996 1.473 1.266 1.162 1.102 1.062 1.035 1.015  1.000
growth curve                      21.2% 50.1% 67.9% 79.0% 86.1% 90.7% 94.2% 96.6% 98.5% 100.0%
    26     Deloitte Analytics Institute                                                                                     © 2010 Deloitte LLP
Some Essential Features of Loss Reserving
                                                                                                 Cumulative   Losses in 1000's
                                                                  AY        premium     12      24     36        48      60    72      84      96     108     120   CL Ult   CL LR    CL res
                                                                 1988         2,609    404     986 1,342      1,582 1,736 1,833     1,907   1,967   2,006   2,036    2,036     0.78        0
                                                                 1989         2,694    387     964 1,336      1,580 1,726 1,823     1,903   1,949   1,987            2,017     0.75       29
                                                                 1990         2,594    421   1,037 1,401      1,604 1,729 1,821     1,878   1,919                    1,986     0.77       67
                                                                 1991         2,609    338     753 1,029      1,195 1,326 1,395     1,446                            1,535     0.59       89




• Repeated measures
                                                                 1992         2,077    257     569    754       892     958 1,007                                    1,110     0.53      103
                                                                 1993         1,703    193     423    589       661     713                                            828     0.49      115
                                                                 1994         1,438    142     361    463       533                                                    675     0.47      142
                                                                 1995         1,093    160     312    408                                                              601     0.55      193
                                                                 1996         1,012    131     352                                                                     702     0.69      350
                                                                 1997          976     122                                                                             576     0.59      454




     • The dataset is inherently longitudinal in nature.     chain link
                                                             chain ldf
                                                             growth curve
                                                                                       2.365 1.354 1.164 1.090 1.054 1.038 1.026 1.020 1.015
                                                                                       4.720 1.996 1.473 1.266 1.162 1.102 1.062 1.035 1.015
                                                                                                                                              1.000
                                                                                                                                              1.000
                                                                                      21.2% 50.1% 67.9% 79.0% 86.1% 90.7% 94.2% 96.6% 98.5% 100.0%
                                                                                                                                                                    12,067             1,543




• A “Bundle” of time series
     • Loss triangle: a collection of time series that are “related” to one another…
     • … no guarantee that the same development pattern is appropriate to each one


• Non-linear
     • Each year’s loss development pattern in inherently non-linear
     • Ultimate loss (ratio) is an asymptote


• Incomplete information
     • Few loss triangles contain all of the information needed to make forecasts
     • Most reserving exercises must incorporate judgment and/or background
       information
      Loss reserving is inherently Bayesian
27   Deloitte Analytics Institute                                                                                                       © 2010 Deloitte LLP
Origin of the Approach: Dave’s Idea + Random Effects




                                    +




                                        =



28   Deloitte Analytics Institute                  © 2010 Deloitte LLP
And Now it’s Bayesian




• Fully Bayesian model
• Provides posterior credible
  intervals (“range of reasonable
  reserves”)
• Add further hierarchical structure
  to simultaneously model loss
  development for multiple
  companies. (Wayne’s idea!)
   29   Deloitte Analytics Institute   © 2010 Deloitte LLP
Case Study #3
Hierarchical Bayes Ratemaking
Workers Comp Ratemaking

• We have 7 years of Workers Comp data
     •   Data from Klugman [1992 Bayes book]
     •   128 workers comp classes (types of business)
     •   7 years of summarized data
     •   Given: total payroll, claim count by class
     •   (payroll is a measure of “exposure” in this domain)

     • Problem: use years 1-6 data to predict year 7




31   Deloitte Analytics Institute                              © 2010 Deloitte LLP
Empirical Bayes “Credibility” Approach

• Naïve approach:
     • Calculate average year 1-6 claim frequency by class
     • Use these 128 averages as estimates for year 7.



• Better approach: build empirical Bayes hierarchical model.
     • “Bühlmann-Straub credibility model”
     • “Shrinks” low-credibility classes towards the grand mean
     • Use Douglas Bates’ lme4 package (UW-Madison again!)




                               clmcnti ~ Poi ( payrolli λ j[ i ] )
                                               (
                                    λ j ~ N µλ , σ λ
                                                   2
                                                         )
32   Deloitte Analytics Institute                                    © 2010 Deloitte LLP
Shrinkage Effect of Empirical Bayes Model

• Top row: estimated claim
  frequencies from un-pooled                                     Modeled Claim Frequency by C
  model.                                                         Poisson Models: No Pooling and Simple

     • Separately calculate
       #claims/payroll by class         no pool



• Bottom row: estimated
  claim frequencies from
  Poisson hierarchical
  (credibility) model.

• Credibility estimates are
  “shrunk” towards the grand
  mean.                                 hierach
     • Dotted line: shrinkage between
       5=10%.
     • Solid line: shrinkage > 10%           0.00   grand mean   0.05                              0.10

                                                                 Claim Frequency
33   Deloitte Analytics Institute                                                   © 2010 Deloitte LLP
clmcnti ~ Poi ( payrolli λ j[ i ] )
Now Specify a Fully Bayesian Model                                                             (
                                                                                    λ j ~ N µλ , σ λ
                                                                                                   2
                                                                                                             )

• Here we specify a fully Bayesian model.
     • Use the rjags package
     • JAGS: Just Another Gibbs Sampler
          • We’re standing on the shoulders of giants named David Spiegelhalter, Martyn Plummer, …




34   Deloitte Analytics Institute                                                                  © 2010 Deloitte LLP
clmcnti ~ Poi ( payrolli λ j[ i ] )
Now Specify a Fully Bayesian Model                          (
                                                 λ j ~ N µλ , σ λ
                                                                2
                                                                          )

• Here we specify a fully Bayesian model.
     • Poisson regression with an offset




35   Deloitte Analytics Institute                               © 2010 Deloitte LLP
clmcnti ~ Poi ( payrolli λ j[ i ] )
Now Specify a Fully Bayesian Model                          (
                                                 λ j ~ N µλ , σ λ
                                                                2
                                                                          )

• Here we specify a fully Bayesian model.
     • Allow for overdispersion




36   Deloitte Analytics Institute                               © 2010 Deloitte LLP
clmcnti ~ Poi ( payrolli λ j[ i ] )
Now Specify a Fully Bayesian Model                          (
                                                 λ j ~ N µλ , σ λ
                                                                2
                                                                          )

• Here we specify a fully Bayesian model.
     • Allow for overdispersion




37   Deloitte Analytics Institute                               © 2010 Deloitte LLP
clmcnti ~ Poi ( payrolli λ j[ i ] )
Now Specify a Fully Bayesian Model                                                (
                                                                       λ j ~ N µλ , σ λ
                                                                                      2
                                                                                                )

• Here we specify a fully Bayesian model.
     • “Credibility weighting” (aka shrinkage) results from giving class-level intercepts
       a probability sub-model.




38   Deloitte Analytics Institute                                                     © 2010 Deloitte LLP
clmcnti ~ Poi ( payrolli λ j[ i ] )
Now Specify a Fully Bayesian Model                                         (
                                                                λ j ~ N µλ , σ λ
                                                                               2
                                                                                         )

• Here we specify a fully Bayesian model.
     • Put a diffuse prior on all of the hyperparameters
     • Fully Bayesian model
     • Bayes or Bust!




39   Deloitte Analytics Institute                                              © 2010 Deloitte LLP
clmcnti ~ Poi ( payrolli λ j[ i ] )
Now Specify a Fully Bayesian Model                                          (
                                                                 λ j ~ N µλ , σ λ
                                                                                2
                                                                                          )

• Here we specify a fully Bayesian model.
     • Replace year-7 actual values with missing values
     • We model the year-7 results … produce 128 posterior density estimates
     • Can compare actual claims with Bayesian posterior probabilities




40   Deloitte Analytics Institute                                               © 2010 Deloitte LLP
A Credible Result

• Let’s rank the top 30
  WC classes by the
  median of the
  posterior predictive
  density of year-7
  claim count.

• 87% of the top 30
  classes have actual
  year-7 claim count
  falling within the
  90% posterior
  credible interval.



41   Deloitte Analytics Institute   © 2010 Deloitte LLP
Case Study #4
Big Data in Revolution R
Big Data Headed Our Way

• Credibility concerns and a Bayesian outlook
  are part and parcel of actuarial science.

• But for many actuaries, working with “big
  data” is a much more pressing concern.
     • Many millions of personal lines policy terms
     • Premium, loss, credit, billing transactions
     • Telematics data
     • … much more to come


• Base R handles data in memory
     • This is beautiful for “small data” problems like doing loss
       reserving on summarized data
     • But breaks down for many industrial datasets


• So on to Revolution-R



43   Deloitte Analytics Institute                                    © 2011 Deloitte LLP
The kaggle Allstate Claim Prediction Challenge Data




44   Deloitte Analytics Institute                     © 2011 Deloitte LLP
Loading the Data

•    Data volume:
     • 13M rows
     • ~ 40 cols


• Took about 6-7
  minutes to load
     • Perform some
       variable
       transformations
       on the fly to
       minimize passes
       though the data.


• Data saved on
  disk in “xdf” file
  format for easy
  access and
  interactive
  modeling.

45   Deloitte Analytics Institute   © 2011 Deloitte LLP
Viewing the Data

•     Data characteristics:
      • 13,184,290 rows
      • A few dozen predictive variables (mostly blinded)
      • Target variable: claim amount


•    kaggle competition goal: build a model that segments well out-of-sample
•    Let’s use the 2005-6 data to predict the 2007 data
•    (Just a quick model to get a sense of Revolution R’s scalability)
•    Tweedie regression models fit in seconds




46     Deloitte Analytics Institute                                        © 2011 Deloitte LLP
Helpful Resources

•    Edward (Jed) Frees – Regression modeling with actuarial and financial
     applications http://www.amazon.com/Regression-Actuarial-Financial-Applications-
     International/dp/0521135966
•    Andrew Gelman / Jennifer Hill - Data Analysis using Regression and
     Multilevel/Hierarchical Models http://www.amazon.com/Analysis-Regression-Multilevel-
     Hierarchical-
     Models/dp/052168689X/ref=sr_1_1?s=books&ie=UTF8&qid=1332961819&sr=1-1
•    Venables and Ripley – Modern Applied Statistics in S http://www.amazon.com/Modern-
     Applied-Statistics-
     Computing/dp/1441930086/ref=sr_1_1?s=books&ie=UTF8&qid=1332961867&sr=1-1
•    Hastie, Tibshirani, Friedman – the Elements of Statistical
     Learning http://www.amazon.com/The-Elements-Statistical-Learning-
     Prediction/dp/0387848576/ref=sr_1_1?s=books&ie=UTF8&qid=1332961913&sr=1-1
•    Gelman, Carlin, Stern, Ruin – Bayesian Data Analysis http://www.amazon.com/Bayesian-
     Analysis-Edition-Chapman-Statistical/dp/158488388X/ref=tag_dpp_lp_edpp_ttl_in
•    John Kruschke – Doing Bayesian Data Analysis http://www.amazon.com/Doing-Bayesian-
     Data-Analysis-
     Tutorial/dp/0123814855/ref=sr_1_3?s=books&ie=UTF8&qid=1332961975&sr=1-3




47   Deloitte Analytics Institute                                               © 2011 Deloitte LLP

Mais conteúdo relacionado

Mais procurados

Technology Innovation Trends In Insurance | Navdeep Arora
Technology Innovation Trends In Insurance | Navdeep Arora Technology Innovation Trends In Insurance | Navdeep Arora
Technology Innovation Trends In Insurance | Navdeep Arora Navdeep Arora
 
Salesforce Basecamp Helsinki 8.5.2018 - Boston Consulting Group
Salesforce Basecamp Helsinki 8.5.2018 - Boston Consulting GroupSalesforce Basecamp Helsinki 8.5.2018 - Boston Consulting Group
Salesforce Basecamp Helsinki 8.5.2018 - Boston Consulting GroupSalesforce Finland
 
What Does Good Risk Culture Actually Look Like?
What Does Good Risk Culture Actually Look Like?What Does Good Risk Culture Actually Look Like?
What Does Good Risk Culture Actually Look Like?accenture
 
Applied Innovation for the Unorganization
Applied Innovation for the UnorganizationApplied Innovation for the Unorganization
Applied Innovation for the UnorganizationCapgemini
 
Data Disruption for Insurance - Perspective from th
Data Disruption for Insurance - Perspective from thData Disruption for Insurance - Perspective from th
Data Disruption for Insurance - Perspective from thDataiku
 
The Next Era of Data Economy 8.5.2019
The Next Era of Data Economy 8.5.2019The Next Era of Data Economy 8.5.2019
The Next Era of Data Economy 8.5.2019Sitra / Hyvinvointi
 
The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...
The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...
The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...Capgemini
 
Can Financial Institutions be the next Digital Masters? Capgemini says YES
Can Financial Institutions be the next Digital Masters? Capgemini says YESCan Financial Institutions be the next Digital Masters? Capgemini says YES
Can Financial Institutions be the next Digital Masters? Capgemini says YESCapgemini
 
Guide to Data Analytics: The Trend That's Reshaping the Insurance Industry
 Guide to Data Analytics: The Trend That's Reshaping the Insurance Industry Guide to Data Analytics: The Trend That's Reshaping the Insurance Industry
Guide to Data Analytics: The Trend That's Reshaping the Insurance IndustryApplied Systems
 
Big Data BlackOut: Are Utilities Powering Up Their Data Analytics?
Big Data BlackOut: Are Utilities Powering Up Their Data Analytics?Big Data BlackOut: Are Utilities Powering Up Their Data Analytics?
Big Data BlackOut: Are Utilities Powering Up Their Data Analytics?Capgemini
 
10 WealthTech podcasts every wealth advisor should listen to
10 WealthTech podcasts every wealth advisor should listen to10 WealthTech podcasts every wealth advisor should listen to
10 WealthTech podcasts every wealth advisor should listen toIBM Analytics
 
8 Ways to Win Against Next-Gen Insurers
8 Ways to Win Against Next-Gen Insurers8 Ways to Win Against Next-Gen Insurers
8 Ways to Win Against Next-Gen InsurersNuxeo
 
Augmented Intelligence for EmTech May 2016 (Anand-final-Without Video) Presen...
Augmented Intelligence for EmTech May 2016 (Anand-final-Without Video) Presen...Augmented Intelligence for EmTech May 2016 (Anand-final-Without Video) Presen...
Augmented Intelligence for EmTech May 2016 (Anand-final-Without Video) Presen...Anand Rao
 
Close the AI Action Gap in Financial Services
Close the AI Action Gap in Financial ServicesClose the AI Action Gap in Financial Services
Close the AI Action Gap in Financial ServicesCognizant
 
Madison Park Group Member Management Software Market Update - Nonprofit & Ass...
Madison Park Group Member Management Software Market Update - Nonprofit & Ass...Madison Park Group Member Management Software Market Update - Nonprofit & Ass...
Madison Park Group Member Management Software Market Update - Nonprofit & Ass...Madison Park Group
 
Governance (Corporate And Technology)
Governance (Corporate And Technology)Governance (Corporate And Technology)
Governance (Corporate And Technology)Jerald Burget
 
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdfIbm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdfdawnrk
 

Mais procurados (20)

Technology Innovation Trends In Insurance | Navdeep Arora
Technology Innovation Trends In Insurance | Navdeep Arora Technology Innovation Trends In Insurance | Navdeep Arora
Technology Innovation Trends In Insurance | Navdeep Arora
 
Big Data in Global Retail Market 2021
Big Data in Global Retail Market 2021Big Data in Global Retail Market 2021
Big Data in Global Retail Market 2021
 
Salesforce Basecamp Helsinki 8.5.2018 - Boston Consulting Group
Salesforce Basecamp Helsinki 8.5.2018 - Boston Consulting GroupSalesforce Basecamp Helsinki 8.5.2018 - Boston Consulting Group
Salesforce Basecamp Helsinki 8.5.2018 - Boston Consulting Group
 
What Does Good Risk Culture Actually Look Like?
What Does Good Risk Culture Actually Look Like?What Does Good Risk Culture Actually Look Like?
What Does Good Risk Culture Actually Look Like?
 
Applied Innovation for the Unorganization
Applied Innovation for the UnorganizationApplied Innovation for the Unorganization
Applied Innovation for the Unorganization
 
Data Disruption for Insurance - Perspective from th
Data Disruption for Insurance - Perspective from thData Disruption for Insurance - Perspective from th
Data Disruption for Insurance - Perspective from th
 
CIO Pulse Sept 2008
CIO Pulse Sept 2008CIO Pulse Sept 2008
CIO Pulse Sept 2008
 
The Next Era of Data Economy 8.5.2019
The Next Era of Data Economy 8.5.2019The Next Era of Data Economy 8.5.2019
The Next Era of Data Economy 8.5.2019
 
The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...
The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...
The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...
 
Can Financial Institutions be the next Digital Masters? Capgemini says YES
Can Financial Institutions be the next Digital Masters? Capgemini says YESCan Financial Institutions be the next Digital Masters? Capgemini says YES
Can Financial Institutions be the next Digital Masters? Capgemini says YES
 
Guide to Data Analytics: The Trend That's Reshaping the Insurance Industry
 Guide to Data Analytics: The Trend That's Reshaping the Insurance Industry Guide to Data Analytics: The Trend That's Reshaping the Insurance Industry
Guide to Data Analytics: The Trend That's Reshaping the Insurance Industry
 
Big Data BlackOut: Are Utilities Powering Up Their Data Analytics?
Big Data BlackOut: Are Utilities Powering Up Their Data Analytics?Big Data BlackOut: Are Utilities Powering Up Their Data Analytics?
Big Data BlackOut: Are Utilities Powering Up Their Data Analytics?
 
10 WealthTech podcasts every wealth advisor should listen to
10 WealthTech podcasts every wealth advisor should listen to10 WealthTech podcasts every wealth advisor should listen to
10 WealthTech podcasts every wealth advisor should listen to
 
8 Ways to Win Against Next-Gen Insurers
8 Ways to Win Against Next-Gen Insurers8 Ways to Win Against Next-Gen Insurers
8 Ways to Win Against Next-Gen Insurers
 
Augmented Intelligence for EmTech May 2016 (Anand-final-Without Video) Presen...
Augmented Intelligence for EmTech May 2016 (Anand-final-Without Video) Presen...Augmented Intelligence for EmTech May 2016 (Anand-final-Without Video) Presen...
Augmented Intelligence for EmTech May 2016 (Anand-final-Without Video) Presen...
 
Close the AI Action Gap in Financial Services
Close the AI Action Gap in Financial ServicesClose the AI Action Gap in Financial Services
Close the AI Action Gap in Financial Services
 
Madison Park Group Member Management Software Market Update - Nonprofit & Ass...
Madison Park Group Member Management Software Market Update - Nonprofit & Ass...Madison Park Group Member Management Software Market Update - Nonprofit & Ass...
Madison Park Group Member Management Software Market Update - Nonprofit & Ass...
 
Governance (Corporate And Technology)
Governance (Corporate And Technology)Governance (Corporate And Technology)
Governance (Corporate And Technology)
 
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdfIbm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
 
The hospital of the future
The hospital of the futureThe hospital of the future
The hospital of the future
 

Destaque

Insurance pricing
Insurance pricingInsurance pricing
Insurance pricingLincy PT
 
Advanced Pricing in General Insurance
Advanced Pricing in General InsuranceAdvanced Pricing in General Insurance
Advanced Pricing in General InsuranceSyed Danish Ali
 
Machine Learning for Actuaries
Machine Learning for ActuariesMachine Learning for Actuaries
Machine Learning for ActuariesArthur Charpentier
 
Principles and practices insc. n theory of rate making agents trg. laju
Principles and practices insc. n theory of rate making agents trg. lajuPrinciples and practices insc. n theory of rate making agents trg. laju
Principles and practices insc. n theory of rate making agents trg. lajuLajpat Ray Chandnani
 
Marine and Energy Pricing Paper - FINAL
Marine and Energy Pricing Paper - FINALMarine and Energy Pricing Paper - FINAL
Marine and Energy Pricing Paper - FINALDarren Farr
 
10 minute statistics and actuarial science il session
10 minute statistics and actuarial science il session10 minute statistics and actuarial science il session
10 minute statistics and actuarial science il sessionSandra Gallipeau
 
meteodyn WT CFD modeling of forest canopy flows: input parameters, calibratio...
meteodyn WT CFD modeling of forest canopy flows: input parameters, calibratio...meteodyn WT CFD modeling of forest canopy flows: input parameters, calibratio...
meteodyn WT CFD modeling of forest canopy flows: input parameters, calibratio...Jean-Claude Meteodyn
 
R Programming: Mathematical Functions In R
R Programming: Mathematical Functions In RR Programming: Mathematical Functions In R
R Programming: Mathematical Functions In RRsquared Academy
 
Predictive Modeling using R
Predictive Modeling using RPredictive Modeling using R
Predictive Modeling using RRachit Jauhari
 
Multi-Scale Effects of Landscape Heterogeneity and Forest Management on Ungul...
Multi-Scale Effects of Landscape Heterogeneity and Forest Management on Ungul...Multi-Scale Effects of Landscape Heterogeneity and Forest Management on Ungul...
Multi-Scale Effects of Landscape Heterogeneity and Forest Management on Ungul...National Institute of Food and Agriculture
 
Latest seo news, tips and tricks website lists
Latest seo news, tips and tricks website listsLatest seo news, tips and tricks website lists
Latest seo news, tips and tricks website listsManickam Srinivasan
 
Olrac SPS Predictive Insurance Solutions
Olrac SPS Predictive Insurance SolutionsOlrac SPS Predictive Insurance Solutions
Olrac SPS Predictive Insurance SolutionsJustin Shanks
 
Feeling Lucky? Multi-armed Bandits for Ordering Judgements in Pooling-based E...
Feeling Lucky? Multi-armed Bandits for Ordering Judgements in Pooling-based E...Feeling Lucky? Multi-armed Bandits for Ordering Judgements in Pooling-based E...
Feeling Lucky? Multi-armed Bandits for Ordering Judgements in Pooling-based E...David Losada
 
Non-life Insurance Industry in India: An Overview
Non-life Insurance Industry in India: An OverviewNon-life Insurance Industry in India: An Overview
Non-life Insurance Industry in India: An OverviewFinMitra
 
Combining Linear and Non Linear Modeling Techniques
Combining Linear and Non Linear Modeling Techniques Combining Linear and Non Linear Modeling Techniques
Combining Linear and Non Linear Modeling Techniques Salford Systems
 
NITOL INSURANCE COMPANY LIMITED
NITOL INSURANCE COMPANY LIMITEDNITOL INSURANCE COMPANY LIMITED
NITOL INSURANCE COMPANY LIMITEDBushra Tanannum
 
Electron Configuration
Electron ConfigurationElectron Configuration
Electron Configurationcrumpjason
 

Destaque (20)

Princing insurance contracts with R
Princing insurance contracts with RPrincing insurance contracts with R
Princing insurance contracts with R
 
Insurance pricing
Insurance pricingInsurance pricing
Insurance pricing
 
Advanced Pricing in General Insurance
Advanced Pricing in General InsuranceAdvanced Pricing in General Insurance
Advanced Pricing in General Insurance
 
Machine Learning for Actuaries
Machine Learning for ActuariesMachine Learning for Actuaries
Machine Learning for Actuaries
 
Principles and practices insc. n theory of rate making agents trg. laju
Principles and practices insc. n theory of rate making agents trg. lajuPrinciples and practices insc. n theory of rate making agents trg. laju
Principles and practices insc. n theory of rate making agents trg. laju
 
Slides amsterdam-2013
Slides amsterdam-2013Slides amsterdam-2013
Slides amsterdam-2013
 
Claims Reserving
Claims ReservingClaims Reserving
Claims Reserving
 
Marine and Energy Pricing Paper - FINAL
Marine and Energy Pricing Paper - FINALMarine and Energy Pricing Paper - FINAL
Marine and Energy Pricing Paper - FINAL
 
10 minute statistics and actuarial science il session
10 minute statistics and actuarial science il session10 minute statistics and actuarial science il session
10 minute statistics and actuarial science il session
 
meteodyn WT CFD modeling of forest canopy flows: input parameters, calibratio...
meteodyn WT CFD modeling of forest canopy flows: input parameters, calibratio...meteodyn WT CFD modeling of forest canopy flows: input parameters, calibratio...
meteodyn WT CFD modeling of forest canopy flows: input parameters, calibratio...
 
R Programming: Mathematical Functions In R
R Programming: Mathematical Functions In RR Programming: Mathematical Functions In R
R Programming: Mathematical Functions In R
 
Predictive Modeling using R
Predictive Modeling using RPredictive Modeling using R
Predictive Modeling using R
 
Multi-Scale Effects of Landscape Heterogeneity and Forest Management on Ungul...
Multi-Scale Effects of Landscape Heterogeneity and Forest Management on Ungul...Multi-Scale Effects of Landscape Heterogeneity and Forest Management on Ungul...
Multi-Scale Effects of Landscape Heterogeneity and Forest Management on Ungul...
 
Latest seo news, tips and tricks website lists
Latest seo news, tips and tricks website listsLatest seo news, tips and tricks website lists
Latest seo news, tips and tricks website lists
 
Olrac SPS Predictive Insurance Solutions
Olrac SPS Predictive Insurance SolutionsOlrac SPS Predictive Insurance Solutions
Olrac SPS Predictive Insurance Solutions
 
Feeling Lucky? Multi-armed Bandits for Ordering Judgements in Pooling-based E...
Feeling Lucky? Multi-armed Bandits for Ordering Judgements in Pooling-based E...Feeling Lucky? Multi-armed Bandits for Ordering Judgements in Pooling-based E...
Feeling Lucky? Multi-armed Bandits for Ordering Judgements in Pooling-based E...
 
Non-life Insurance Industry in India: An Overview
Non-life Insurance Industry in India: An OverviewNon-life Insurance Industry in India: An Overview
Non-life Insurance Industry in India: An Overview
 
Combining Linear and Non Linear Modeling Techniques
Combining Linear and Non Linear Modeling Techniques Combining Linear and Non Linear Modeling Techniques
Combining Linear and Non Linear Modeling Techniques
 
NITOL INSURANCE COMPANY LIMITED
NITOL INSURANCE COMPANY LIMITEDNITOL INSURANCE COMPANY LIMITED
NITOL INSURANCE COMPANY LIMITED
 
Electron Configuration
Electron ConfigurationElectron Configuration
Electron Configuration
 

Semelhante a Actuarial Modeling in R: Data Science for Insurance

10. fri 1130 1230 soni - analytics in academia
10. fri 1130 1230 soni - analytics in academia10. fri 1130 1230 soni - analytics in academia
10. fri 1130 1230 soni - analytics in academiaJon Hedlund
 
Building Data Science Teams
Building Data Science TeamsBuilding Data Science Teams
Building Data Science TeamsEMC
 
Telco Big Data Workshop Sample
Telco Big Data Workshop SampleTelco Big Data Workshop Sample
Telco Big Data Workshop SampleAlan Quayle
 
US SEC Mandates, Python, and Financial Modeling
US SEC Mandates, Python, and Financial ModelingUS SEC Mandates, Python, and Financial Modeling
US SEC Mandates, Python, and Financial ModelingActiveState
 
SNW Spring 10 Presentation
SNW Spring 10 PresentationSNW Spring 10 Presentation
SNW Spring 10 PresentationJeff Kubacki
 
State of Florida Neo4J Graph Briefing - Keynote
State of Florida Neo4J Graph Briefing - KeynoteState of Florida Neo4J Graph Briefing - Keynote
State of Florida Neo4J Graph Briefing - KeynoteNeo4j
 
Keynote: Graphs in Government_Lance Walter, CMO
Keynote:  Graphs in Government_Lance Walter, CMOKeynote:  Graphs in Government_Lance Walter, CMO
Keynote: Graphs in Government_Lance Walter, CMONeo4j
 
Original: Lean Data Model Storming for the Agile Enterprise
Original: Lean Data Model Storming for the Agile EnterpriseOriginal: Lean Data Model Storming for the Agile Enterprise
Original: Lean Data Model Storming for the Agile EnterpriseDaniel Upton
 
Newcastle Intro 2015
Newcastle Intro 2015Newcastle Intro 2015
Newcastle Intro 2015Lee Schlenker
 
If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...
If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...
If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...Dell World
 
Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...
Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...
Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...Data Science Society
 
Tech Trends 2015: The fusion of business and IT
Tech Trends 2015: The fusion of business and ITTech Trends 2015: The fusion of business and IT
Tech Trends 2015: The fusion of business and ITDeloitte United States
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsSri Ambati
 
S.P.A.C.E. Exploration for Software Engineering
 S.P.A.C.E. Exploration for Software Engineering S.P.A.C.E. Exploration for Software Engineering
S.P.A.C.E. Exploration for Software EngineeringCS, NcState
 

Semelhante a Actuarial Modeling in R: Data Science for Insurance (20)

10. fri 1130 1230 soni - analytics in academia
10. fri 1130 1230 soni - analytics in academia10. fri 1130 1230 soni - analytics in academia
10. fri 1130 1230 soni - analytics in academia
 
Building Data Science Teams
Building Data Science TeamsBuilding Data Science Teams
Building Data Science Teams
 
Telco Big Data Workshop Sample
Telco Big Data Workshop SampleTelco Big Data Workshop Sample
Telco Big Data Workshop Sample
 
US SEC Mandates, Python, and Financial Modeling
US SEC Mandates, Python, and Financial ModelingUS SEC Mandates, Python, and Financial Modeling
US SEC Mandates, Python, and Financial Modeling
 
Moving Big Data to Big Value
Moving Big Data to Big ValueMoving Big Data to Big Value
Moving Big Data to Big Value
 
SNW Spring 10 Presentation
SNW Spring 10 PresentationSNW Spring 10 Presentation
SNW Spring 10 Presentation
 
Promise notes
Promise notesPromise notes
Promise notes
 
State of Florida Neo4J Graph Briefing - Keynote
State of Florida Neo4J Graph Briefing - KeynoteState of Florida Neo4J Graph Briefing - Keynote
State of Florida Neo4J Graph Briefing - Keynote
 
NBSintro2013
NBSintro2013NBSintro2013
NBSintro2013
 
Keynote: Graphs in Government_Lance Walter, CMO
Keynote:  Graphs in Government_Lance Walter, CMOKeynote:  Graphs in Government_Lance Walter, CMO
Keynote: Graphs in Government_Lance Walter, CMO
 
Original: Lean Data Model Storming for the Agile Enterprise
Original: Lean Data Model Storming for the Agile EnterpriseOriginal: Lean Data Model Storming for the Agile Enterprise
Original: Lean Data Model Storming for the Agile Enterprise
 
Newcastle Intro 2015
Newcastle Intro 2015Newcastle Intro 2015
Newcastle Intro 2015
 
resume - CV
resume - CVresume - CV
resume - CV
 
If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...
If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...
If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...
 
Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...
Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...
Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...
 
Tech Trends 2015: The fusion of business and IT
Tech Trends 2015: The fusion of business and ITTech Trends 2015: The fusion of business and IT
Tech Trends 2015: The fusion of business and IT
 
Data is not the new snake oil
Data is not the new snake oilData is not the new snake oil
Data is not the new snake oil
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data Scientists
 
Introduction 2014
Introduction 2014Introduction 2014
Introduction 2014
 
S.P.A.C.E. Exploration for Software Engineering
 S.P.A.C.E. Exploration for Software Engineering S.P.A.C.E. Exploration for Software Engineering
S.P.A.C.E. Exploration for Software Engineering
 

Mais de Revolution Analytics

Speeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the CloudSpeeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the CloudRevolution Analytics
 
Migrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to AzureMigrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to AzureRevolution Analytics
 
Speed up R with parallel programming in the Cloud
Speed up R with parallel programming in the CloudSpeed up R with parallel programming in the Cloud
Speed up R with parallel programming in the CloudRevolution Analytics
 
Predicting Loan Delinquency at One Million Transactions per Second
Predicting Loan Delinquency at One Million Transactions per SecondPredicting Loan Delinquency at One Million Transactions per Second
Predicting Loan Delinquency at One Million Transactions per SecondRevolution Analytics
 
The Value of Open Source Communities
The Value of Open Source CommunitiesThe Value of Open Source Communities
The Value of Open Source CommunitiesRevolution Analytics
 
Building a scalable data science platform with R
Building a scalable data science platform with RBuilding a scalable data science platform with R
Building a scalable data science platform with RRevolution Analytics
 
The Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data ScienceThe Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data ScienceRevolution Analytics
 
Taking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the CloudTaking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the CloudRevolution Analytics
 
The Network structure of R packages on CRAN & BioConductor
The Network structure of R packages on CRAN & BioConductorThe Network structure of R packages on CRAN & BioConductor
The Network structure of R packages on CRAN & BioConductorRevolution Analytics
 
The network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 finalThe network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 finalRevolution Analytics
 
Simple Reproducibility with the checkpoint package
Simple Reproducibilitywith the checkpoint packageSimple Reproducibilitywith the checkpoint package
Simple Reproducibility with the checkpoint packageRevolution Analytics
 

Mais de Revolution Analytics (20)

Speeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the CloudSpeeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the Cloud
 
Migrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to AzureMigrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to Azure
 
R in Minecraft
R in Minecraft R in Minecraft
R in Minecraft
 
The case for R for AI developers
The case for R for AI developersThe case for R for AI developers
The case for R for AI developers
 
Speed up R with parallel programming in the Cloud
Speed up R with parallel programming in the CloudSpeed up R with parallel programming in the Cloud
Speed up R with parallel programming in the Cloud
 
The R Ecosystem
The R EcosystemThe R Ecosystem
The R Ecosystem
 
R Then and Now
R Then and NowR Then and Now
R Then and Now
 
Predicting Loan Delinquency at One Million Transactions per Second
Predicting Loan Delinquency at One Million Transactions per SecondPredicting Loan Delinquency at One Million Transactions per Second
Predicting Loan Delinquency at One Million Transactions per Second
 
Reproducible Data Science with R
Reproducible Data Science with RReproducible Data Science with R
Reproducible Data Science with R
 
The Value of Open Source Communities
The Value of Open Source CommunitiesThe Value of Open Source Communities
The Value of Open Source Communities
 
The R Ecosystem
The R EcosystemThe R Ecosystem
The R Ecosystem
 
R at Microsoft (useR! 2016)
R at Microsoft (useR! 2016)R at Microsoft (useR! 2016)
R at Microsoft (useR! 2016)
 
Building a scalable data science platform with R
Building a scalable data science platform with RBuilding a scalable data science platform with R
Building a scalable data science platform with R
 
R at Microsoft
R at MicrosoftR at Microsoft
R at Microsoft
 
The Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data ScienceThe Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data Science
 
Taking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the CloudTaking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the Cloud
 
The Network structure of R packages on CRAN & BioConductor
The Network structure of R packages on CRAN & BioConductorThe Network structure of R packages on CRAN & BioConductor
The Network structure of R packages on CRAN & BioConductor
 
The network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 finalThe network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 final
 
Simple Reproducibility with the checkpoint package
Simple Reproducibilitywith the checkpoint packageSimple Reproducibilitywith the checkpoint package
Simple Reproducibility with the checkpoint package
 
R at Microsoft
R at MicrosoftR at Microsoft
R at Microsoft
 

Último

VIP Kolkata Call Girl Jodhpur Park 👉 8250192130 Available With Room
VIP Kolkata Call Girl Jodhpur Park 👉 8250192130  Available With RoomVIP Kolkata Call Girl Jodhpur Park 👉 8250192130  Available With Room
VIP Kolkata Call Girl Jodhpur Park 👉 8250192130 Available With Roomdivyansh0kumar0
 
The Economic History of the U.S. Lecture 19.pdf
The Economic History of the U.S. Lecture 19.pdfThe Economic History of the U.S. Lecture 19.pdf
The Economic History of the U.S. Lecture 19.pdfGale Pooley
 
Log your LOA pain with Pension Lab's brilliant campaign
Log your LOA pain with Pension Lab's brilliant campaignLog your LOA pain with Pension Lab's brilliant campaign
Log your LOA pain with Pension Lab's brilliant campaignHenry Tapper
 
06_Joeri Van Speybroek_Dell_MeetupDora&Cybersecurity.pdf
06_Joeri Van Speybroek_Dell_MeetupDora&Cybersecurity.pdf06_Joeri Van Speybroek_Dell_MeetupDora&Cybersecurity.pdf
06_Joeri Van Speybroek_Dell_MeetupDora&Cybersecurity.pdfFinTech Belgium
 
Booking open Available Pune Call Girls Shivane 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Shivane  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Shivane  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Shivane 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...
Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...
Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...ssifa0344
 
03_Emmanuel Ndiaye_Degroof Petercam.pptx
03_Emmanuel Ndiaye_Degroof Petercam.pptx03_Emmanuel Ndiaye_Degroof Petercam.pptx
03_Emmanuel Ndiaye_Degroof Petercam.pptxFinTech Belgium
 
VIP Kolkata Call Girl Serampore 👉 8250192130 Available With Room
VIP Kolkata Call Girl Serampore 👉 8250192130  Available With RoomVIP Kolkata Call Girl Serampore 👉 8250192130  Available With Room
VIP Kolkata Call Girl Serampore 👉 8250192130 Available With Roomdivyansh0kumar0
 
Andheri Call Girls In 9825968104 Mumbai Hot Models
Andheri Call Girls In 9825968104 Mumbai Hot ModelsAndheri Call Girls In 9825968104 Mumbai Hot Models
Andheri Call Girls In 9825968104 Mumbai Hot Modelshematsharma006
 
Q3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast SlidesQ3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast SlidesMarketing847413
 
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsHigh Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Call US 📞 9892124323 ✅ Kurla Call Girls In Kurla ( Mumbai ) secure service
Call US 📞 9892124323 ✅ Kurla Call Girls In Kurla ( Mumbai ) secure serviceCall US 📞 9892124323 ✅ Kurla Call Girls In Kurla ( Mumbai ) secure service
Call US 📞 9892124323 ✅ Kurla Call Girls In Kurla ( Mumbai ) secure servicePooja Nehwal
 
Call Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptxOAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptxhiddenlevers
 
Russian Call Girls In Gtb Nagar (Delhi) 9711199012 💋✔💕😘 Naughty Call Girls Se...
Russian Call Girls In Gtb Nagar (Delhi) 9711199012 💋✔💕😘 Naughty Call Girls Se...Russian Call Girls In Gtb Nagar (Delhi) 9711199012 💋✔💕😘 Naughty Call Girls Se...
Russian Call Girls In Gtb Nagar (Delhi) 9711199012 💋✔💕😘 Naughty Call Girls Se...shivangimorya083
 
The Economic History of the U.S. Lecture 23.pdf
The Economic History of the U.S. Lecture 23.pdfThe Economic History of the U.S. Lecture 23.pdf
The Economic History of the U.S. Lecture 23.pdfGale Pooley
 
02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx
02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx
02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptxFinTech Belgium
 
The Economic History of the U.S. Lecture 17.pdf
The Economic History of the U.S. Lecture 17.pdfThe Economic History of the U.S. Lecture 17.pdf
The Economic History of the U.S. Lecture 17.pdfGale Pooley
 
Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...
Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...
Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...ssifa0344
 

Último (20)

VIP Kolkata Call Girl Jodhpur Park 👉 8250192130 Available With Room
VIP Kolkata Call Girl Jodhpur Park 👉 8250192130  Available With RoomVIP Kolkata Call Girl Jodhpur Park 👉 8250192130  Available With Room
VIP Kolkata Call Girl Jodhpur Park 👉 8250192130 Available With Room
 
The Economic History of the U.S. Lecture 19.pdf
The Economic History of the U.S. Lecture 19.pdfThe Economic History of the U.S. Lecture 19.pdf
The Economic History of the U.S. Lecture 19.pdf
 
Log your LOA pain with Pension Lab's brilliant campaign
Log your LOA pain with Pension Lab's brilliant campaignLog your LOA pain with Pension Lab's brilliant campaign
Log your LOA pain with Pension Lab's brilliant campaign
 
06_Joeri Van Speybroek_Dell_MeetupDora&Cybersecurity.pdf
06_Joeri Van Speybroek_Dell_MeetupDora&Cybersecurity.pdf06_Joeri Van Speybroek_Dell_MeetupDora&Cybersecurity.pdf
06_Joeri Van Speybroek_Dell_MeetupDora&Cybersecurity.pdf
 
Booking open Available Pune Call Girls Shivane 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Shivane  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Shivane  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Shivane 6297143586 Call Hot Indian Gi...
 
Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...
Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...
Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...
 
03_Emmanuel Ndiaye_Degroof Petercam.pptx
03_Emmanuel Ndiaye_Degroof Petercam.pptx03_Emmanuel Ndiaye_Degroof Petercam.pptx
03_Emmanuel Ndiaye_Degroof Petercam.pptx
 
VIP Kolkata Call Girl Serampore 👉 8250192130 Available With Room
VIP Kolkata Call Girl Serampore 👉 8250192130  Available With RoomVIP Kolkata Call Girl Serampore 👉 8250192130  Available With Room
VIP Kolkata Call Girl Serampore 👉 8250192130 Available With Room
 
Andheri Call Girls In 9825968104 Mumbai Hot Models
Andheri Call Girls In 9825968104 Mumbai Hot ModelsAndheri Call Girls In 9825968104 Mumbai Hot Models
Andheri Call Girls In 9825968104 Mumbai Hot Models
 
Q3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast SlidesQ3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast Slides
 
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsHigh Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
 
Call US 📞 9892124323 ✅ Kurla Call Girls In Kurla ( Mumbai ) secure service
Call US 📞 9892124323 ✅ Kurla Call Girls In Kurla ( Mumbai ) secure serviceCall US 📞 9892124323 ✅ Kurla Call Girls In Kurla ( Mumbai ) secure service
Call US 📞 9892124323 ✅ Kurla Call Girls In Kurla ( Mumbai ) secure service
 
Call Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur Escorts
 
Commercial Bank Economic Capsule - April 2024
Commercial Bank Economic Capsule - April 2024Commercial Bank Economic Capsule - April 2024
Commercial Bank Economic Capsule - April 2024
 
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptxOAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
 
Russian Call Girls In Gtb Nagar (Delhi) 9711199012 💋✔💕😘 Naughty Call Girls Se...
Russian Call Girls In Gtb Nagar (Delhi) 9711199012 💋✔💕😘 Naughty Call Girls Se...Russian Call Girls In Gtb Nagar (Delhi) 9711199012 💋✔💕😘 Naughty Call Girls Se...
Russian Call Girls In Gtb Nagar (Delhi) 9711199012 💋✔💕😘 Naughty Call Girls Se...
 
The Economic History of the U.S. Lecture 23.pdf
The Economic History of the U.S. Lecture 23.pdfThe Economic History of the U.S. Lecture 23.pdf
The Economic History of the U.S. Lecture 23.pdf
 
02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx
02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx
02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx
 
The Economic History of the U.S. Lecture 17.pdf
The Economic History of the U.S. Lecture 17.pdfThe Economic History of the U.S. Lecture 17.pdf
The Economic History of the U.S. Lecture 17.pdf
 
Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...
Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...
Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...
 

Actuarial Modeling in R: Data Science for Insurance

  • 1. Actuarial Science as Data Science Actuarial Modeling in R Revolution Analytics Webinar Jim Guszcza, FCAS, MAAA Deloitte Consulting LLP University of Wisconsin-Madison March 28, 2012
  • 2. About Your Presenter • James Guszcza, PhD, FCAS, MAAA • National Predictive Analytics Lead – Deloitte Consulting Actuarial, Risk, Analytics practice • Assistant professor of actuarial science & risk management – U. Wisconsin-Madison • PhD in Philosophy – The University of Chicago • Fellow of the Casualty Actuarial Society • Lots experience building predictive models / analyzing data in and outside of insurance jguszcza@deloitte.com jguszcza@bus.wisc.edu 2 Deloitte Analytics Institute © 2011 Deloitte LLP
  • 3. Agenda Introduction Actuarial Science and Data Science R Background Case Studies • Fitting a complex size of loss model • Loss Reserving • Bayesian Hierarchical Modeling • Revolution: Tweedie Regression on big data
  • 5. Not Just Hype “Perhaps the most important cultural trend today: The explosion of data about every aspect of our world and the rise of applied math gurus who know how to use it.” -- Chris Anderson, editor-in-chief of Wired • So behavioral economics is important in insurance for two classes of reasons: • Decision-makers at insurance companies are human • People making insurance purchasing decisions are human 5 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 6. Brave New World With Such Algorithms In IT • The analysis of data affects: • What we buy • What we read • What we watch • How we network • How we socialize • The opinions we form • Whom we date and marry! 6 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 7. Clinical vs Actuarial Judgment – the Motion Picture 7 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 8. Analytics Everywhere • Neural net models are used to predict movie box-office returns based on features of their scripts • Decision tree models are used to help ER doctors better triage patients complaining of chest pain. • Predictive models are used to predict the price of different wine vintages based on variables about the growing season. • Predictive models to help commercial insurance underwriters better select and price risks. • Predict which non-custodial parents are at highest risk of falling into arrears on their child support. • Predicting which job candidates will successfully make it through the interviewing / recruiting process… and which candidates will subsequently retain and perform well on the job. • Predicting which doctors are at highest risk of being sued for malpractice. • Predicting the ultimate severity of injury claims. 8 Deloitte Analytics Institute (Deloitte applications in green) © 2010 Deloitte LLP
  • 9. At the Center of It All: Data Science Or: “The Collision between Statistics and Computation” • Today the analytics world is different largely due to exponential growth in computing power. • The skill set underlying business analytics is increasingly called data science. • Data science goes beyond: • Traditional statistics • Business intelligence [BI] Image borrowed from Drew Conway’s blog • Information technology http://www.dataists.com/2010/09/the-data-science-venn-diagram 9 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 10. Where Do We Want to Be? •Here? Image borrowed from Drew Conway’s blog http://www.dataists.com/2010/09/the-data-science-venn-diagram 10 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 11. Where Do We Want to Be? •Or Here? Image borrowed from Drew Conway’s blog http://www.dataists.com/2010/09/the-data-science-venn-diagram 11 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 12. On then, on to R 12 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 14. R Overview R is an open-source, object-oriented statistical programming language. In the past decade, it has become the global lingua franca of statistics. • History: • R is based on the S statistical programming language developed by John Chambers at Bell labs in the 1980’s • R is an open-source implementation of the S language • Developed by Robert Gentlemen and Ross Ihaka at U Auckland • Revolution R is a commercially supported, scalable implementation of R, with parallel processing and big data capabilities • Features: • R is an interactive, object-oriented programming environment • R has advanced graphical capabilities • Statisticians around the world contribute add-on packages 14 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 15. On the Shoulders of Giants • … therefore prominent people tend say things like this: http://www.nytimes.com/2009/01/07/technology/business-computing/07program.html?pagewanted=all 15 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 16. Facets of R • In a recent article John Chambers discussed 6 “Facets of R” 1. An interface to computational procedures of many kinds 2. Interactive, hands-on in real time 3. Functional in its model of programming 4. Object-oriented, “everything is an object” 5. Modular, built from standardized pieces 6. Collaborative, a world-wide, open-source effort • Interactive interface: Chambers was influenced by APL • In the days before spreadsheets, APL was very popular in the actuarial community • One of the rare interactive scientific computing environments • Gives user ability to express novel computations • Heavy emphasis on matrices and arrays • But: unlike R, APL had no interface to procedures 16 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 17. A Network ExteRnality • Hal Varian’s “giant” has grown at an exponential rate. • The open-source nature of R has encouraged top researchers from around the world to contribute new, often highly advanced, packages. • Result: a powerful “network effect”. • The value of a product increases as more people use it. • R has become something like the Wikipedia of the statistics world. 17 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 18. Adoption in the Actuarial World 18 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 19. Free from Frees • Jed Frees at the University of Wisconsin-Madison has made R integral to his new book on regression and time series. He maintains a nice website containing R instructions, data, and code. http://instruction.bus.wisc.edu/jfrees/jfreesbooks/Regression%20Modeling/BookWebDec2010/learnR.html 19 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 21. Some Everyday Uses of R • Free-form Exploratory Data Analysis • ad hoc data munging, data visualizations, fitting simple models on the fly • Loss models (“exam 4/C”) • Unsupervised Learning • Correlation analysis, principal component / factor analysis, variable clustering, k-means and hierarchical clustering, self-organizing maps, association rules (aka “market basket analysis”), Latent Dirichlet Analysis • Supervised Learning • “statistics paradigm”: GLM, Multilevel/Hierarchical models, quantile regression • “machine learning paradigm: CART, MARS, Random Forests, Neural Networks, Support Vector Machines • Bayesian data analysis (MCMC simulation), causal analysis • Optimization 21 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 22. Case Study #1 Loss Distribution Modeling
  • 23. Modeling a Non-Trivial Loss Distribution • A typical actuarial problem: modeling a highly skew and ambiguous loss 8 e-06 distribution 6 e-06 • Traditional medium of analysis: spreadsheets. 4 e-06 • Why limit ourselves? 2 e-06 0 e+00 0 e+00 1 e+06 2 e+06 3 e+06 4 e+06 5 e+06 loss 23 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 24. Case Study #2 Loss Reserving
  • 25. Three Approaches to Loss Reserving • A garden-variety loss triangle: Cumulative Losses in 1000's AY premium 12 24 36 48 60 72 84 96 108 120 CL Ult CL LR CL res 1988 2,609 404 986 1,342 1,582 1,736 1,833 1,907 1,967 2,006 2,036 2,036 0.78 0 1989 2,694 387 964 1,336 1,580 1,726 1,823 1,903 1,949 1,987 2,017 0.75 29 1990 2,594 421 1,037 1,401 1,604 1,729 1,821 1,878 1,919 1,986 0.77 67 1991 2,609 338 753 1,029 1,195 1,326 1,395 1,446 1,535 0.59 89 1992 2,077 257 569 754 892 958 1,007 1,110 0.53 103 1993 1,703 193 423 589 661 713 828 0.49 115 1994 1,438 142 361 463 533 675 0.47 142 1995 1,093 160 312 408 601 0.55 193 1996 1,012 131 352 702 0.69 350 1997 976 122 576 0.59 454 chain link 2.365 1.354 1.164 1.090 1.054 1.038 1.026 1.020 1.015 1.000 12,067 1,543 chain ldf 4.720 1.996 1.473 1.266 1.162 1.102 1.062 1.035 1.015 1.000 growth curve 21.2% 50.1% 67.9% 79.0% 86.1% 90.7% 94.2% 96.6% 98.5% 100.0% • Let’s use R to forecast outstanding losses using three methods: • Replicate the above chain-ladder spreadsheet calculation – easy! • Use the Over-dispersed Poisson GLM model • Longitudinal data analysis using growth curves 25 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 26. What Do You See? • Let’s look at the loss triangle with fresh eyes. • We would like to do stochastic reserving the “right” way. • What considerations come to mind? Cumulative Losses in 1000's AY premium 12 24 36 48 60 72 84 96 108 120 CL Ult CL LR CL res 1988 2,609 404 986 1,342 1,582 1,736 1,833 1,907 1,967 2,006 2,036 2,036 0.78 0 1989 2,694 387 964 1,336 1,580 1,726 1,823 1,903 1,949 1,987 2,017 0.75 29 1990 2,594 421 1,037 1,401 1,604 1,729 1,821 1,878 1,919 1,986 0.77 67 1991 2,609 338 753 1,029 1,195 1,326 1,395 1,446 1,535 0.59 89 1992 2,077 257 569 754 892 958 1,007 1,110 0.53 103 1993 1,703 193 423 589 661 713 828 0.49 115 1994 1,438 142 361 463 533 675 0.47 142 1995 1,093 160 312 408 601 0.55 193 1996 1,012 131 352 702 0.69 350 1997 976 122 576 0.59 454 chain link 2.365 1.354 1.164 1.090 1.054 1.038 1.026 1.020 1.015 1.000 12,067 1,543 chain ldf 4.720 1.996 1.473 1.266 1.162 1.102 1.062 1.035 1.015 1.000 growth curve 21.2% 50.1% 67.9% 79.0% 86.1% 90.7% 94.2% 96.6% 98.5% 100.0% 26 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 27. Some Essential Features of Loss Reserving Cumulative Losses in 1000's AY premium 12 24 36 48 60 72 84 96 108 120 CL Ult CL LR CL res 1988 2,609 404 986 1,342 1,582 1,736 1,833 1,907 1,967 2,006 2,036 2,036 0.78 0 1989 2,694 387 964 1,336 1,580 1,726 1,823 1,903 1,949 1,987 2,017 0.75 29 1990 2,594 421 1,037 1,401 1,604 1,729 1,821 1,878 1,919 1,986 0.77 67 1991 2,609 338 753 1,029 1,195 1,326 1,395 1,446 1,535 0.59 89 • Repeated measures 1992 2,077 257 569 754 892 958 1,007 1,110 0.53 103 1993 1,703 193 423 589 661 713 828 0.49 115 1994 1,438 142 361 463 533 675 0.47 142 1995 1,093 160 312 408 601 0.55 193 1996 1,012 131 352 702 0.69 350 1997 976 122 576 0.59 454 • The dataset is inherently longitudinal in nature. chain link chain ldf growth curve 2.365 1.354 1.164 1.090 1.054 1.038 1.026 1.020 1.015 4.720 1.996 1.473 1.266 1.162 1.102 1.062 1.035 1.015 1.000 1.000 21.2% 50.1% 67.9% 79.0% 86.1% 90.7% 94.2% 96.6% 98.5% 100.0% 12,067 1,543 • A “Bundle” of time series • Loss triangle: a collection of time series that are “related” to one another… • … no guarantee that the same development pattern is appropriate to each one • Non-linear • Each year’s loss development pattern in inherently non-linear • Ultimate loss (ratio) is an asymptote • Incomplete information • Few loss triangles contain all of the information needed to make forecasts • Most reserving exercises must incorporate judgment and/or background information  Loss reserving is inherently Bayesian 27 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 28. Origin of the Approach: Dave’s Idea + Random Effects + = 28 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 29. And Now it’s Bayesian • Fully Bayesian model • Provides posterior credible intervals (“range of reasonable reserves”) • Add further hierarchical structure to simultaneously model loss development for multiple companies. (Wayne’s idea!) 29 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 30. Case Study #3 Hierarchical Bayes Ratemaking
  • 31. Workers Comp Ratemaking • We have 7 years of Workers Comp data • Data from Klugman [1992 Bayes book] • 128 workers comp classes (types of business) • 7 years of summarized data • Given: total payroll, claim count by class • (payroll is a measure of “exposure” in this domain) • Problem: use years 1-6 data to predict year 7 31 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 32. Empirical Bayes “Credibility” Approach • Naïve approach: • Calculate average year 1-6 claim frequency by class • Use these 128 averages as estimates for year 7. • Better approach: build empirical Bayes hierarchical model. • “Bühlmann-Straub credibility model” • “Shrinks” low-credibility classes towards the grand mean • Use Douglas Bates’ lme4 package (UW-Madison again!) clmcnti ~ Poi ( payrolli λ j[ i ] ) ( λ j ~ N µλ , σ λ 2 ) 32 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 33. Shrinkage Effect of Empirical Bayes Model • Top row: estimated claim frequencies from un-pooled Modeled Claim Frequency by C model. Poisson Models: No Pooling and Simple • Separately calculate #claims/payroll by class no pool • Bottom row: estimated claim frequencies from Poisson hierarchical (credibility) model. • Credibility estimates are “shrunk” towards the grand mean. hierach • Dotted line: shrinkage between 5=10%. • Solid line: shrinkage > 10% 0.00 grand mean 0.05 0.10 Claim Frequency 33 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 34. clmcnti ~ Poi ( payrolli λ j[ i ] ) Now Specify a Fully Bayesian Model ( λ j ~ N µλ , σ λ 2 ) • Here we specify a fully Bayesian model. • Use the rjags package • JAGS: Just Another Gibbs Sampler • We’re standing on the shoulders of giants named David Spiegelhalter, Martyn Plummer, … 34 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 35. clmcnti ~ Poi ( payrolli λ j[ i ] ) Now Specify a Fully Bayesian Model ( λ j ~ N µλ , σ λ 2 ) • Here we specify a fully Bayesian model. • Poisson regression with an offset 35 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 36. clmcnti ~ Poi ( payrolli λ j[ i ] ) Now Specify a Fully Bayesian Model ( λ j ~ N µλ , σ λ 2 ) • Here we specify a fully Bayesian model. • Allow for overdispersion 36 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 37. clmcnti ~ Poi ( payrolli λ j[ i ] ) Now Specify a Fully Bayesian Model ( λ j ~ N µλ , σ λ 2 ) • Here we specify a fully Bayesian model. • Allow for overdispersion 37 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 38. clmcnti ~ Poi ( payrolli λ j[ i ] ) Now Specify a Fully Bayesian Model ( λ j ~ N µλ , σ λ 2 ) • Here we specify a fully Bayesian model. • “Credibility weighting” (aka shrinkage) results from giving class-level intercepts a probability sub-model. 38 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 39. clmcnti ~ Poi ( payrolli λ j[ i ] ) Now Specify a Fully Bayesian Model ( λ j ~ N µλ , σ λ 2 ) • Here we specify a fully Bayesian model. • Put a diffuse prior on all of the hyperparameters • Fully Bayesian model • Bayes or Bust! 39 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 40. clmcnti ~ Poi ( payrolli λ j[ i ] ) Now Specify a Fully Bayesian Model ( λ j ~ N µλ , σ λ 2 ) • Here we specify a fully Bayesian model. • Replace year-7 actual values with missing values • We model the year-7 results … produce 128 posterior density estimates • Can compare actual claims with Bayesian posterior probabilities 40 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 41. A Credible Result • Let’s rank the top 30 WC classes by the median of the posterior predictive density of year-7 claim count. • 87% of the top 30 classes have actual year-7 claim count falling within the 90% posterior credible interval. 41 Deloitte Analytics Institute © 2010 Deloitte LLP
  • 42. Case Study #4 Big Data in Revolution R
  • 43. Big Data Headed Our Way • Credibility concerns and a Bayesian outlook are part and parcel of actuarial science. • But for many actuaries, working with “big data” is a much more pressing concern. • Many millions of personal lines policy terms • Premium, loss, credit, billing transactions • Telematics data • … much more to come • Base R handles data in memory • This is beautiful for “small data” problems like doing loss reserving on summarized data • But breaks down for many industrial datasets • So on to Revolution-R 43 Deloitte Analytics Institute © 2011 Deloitte LLP
  • 44. The kaggle Allstate Claim Prediction Challenge Data 44 Deloitte Analytics Institute © 2011 Deloitte LLP
  • 45. Loading the Data • Data volume: • 13M rows • ~ 40 cols • Took about 6-7 minutes to load • Perform some variable transformations on the fly to minimize passes though the data. • Data saved on disk in “xdf” file format for easy access and interactive modeling. 45 Deloitte Analytics Institute © 2011 Deloitte LLP
  • 46. Viewing the Data • Data characteristics: • 13,184,290 rows • A few dozen predictive variables (mostly blinded) • Target variable: claim amount • kaggle competition goal: build a model that segments well out-of-sample • Let’s use the 2005-6 data to predict the 2007 data • (Just a quick model to get a sense of Revolution R’s scalability) • Tweedie regression models fit in seconds 46 Deloitte Analytics Institute © 2011 Deloitte LLP
  • 47. Helpful Resources • Edward (Jed) Frees – Regression modeling with actuarial and financial applications http://www.amazon.com/Regression-Actuarial-Financial-Applications- International/dp/0521135966 • Andrew Gelman / Jennifer Hill - Data Analysis using Regression and Multilevel/Hierarchical Models http://www.amazon.com/Analysis-Regression-Multilevel- Hierarchical- Models/dp/052168689X/ref=sr_1_1?s=books&ie=UTF8&qid=1332961819&sr=1-1 • Venables and Ripley – Modern Applied Statistics in S http://www.amazon.com/Modern- Applied-Statistics- Computing/dp/1441930086/ref=sr_1_1?s=books&ie=UTF8&qid=1332961867&sr=1-1 • Hastie, Tibshirani, Friedman – the Elements of Statistical Learning http://www.amazon.com/The-Elements-Statistical-Learning- Prediction/dp/0387848576/ref=sr_1_1?s=books&ie=UTF8&qid=1332961913&sr=1-1 • Gelman, Carlin, Stern, Ruin – Bayesian Data Analysis http://www.amazon.com/Bayesian- Analysis-Edition-Chapman-Statistical/dp/158488388X/ref=tag_dpp_lp_edpp_ttl_in • John Kruschke – Doing Bayesian Data Analysis http://www.amazon.com/Doing-Bayesian- Data-Analysis- Tutorial/dp/0123814855/ref=sr_1_3?s=books&ie=UTF8&qid=1332961975&sr=1-3 47 Deloitte Analytics Institute © 2011 Deloitte LLP