Mais conteúdo relacionado Semelhante a SAS_AE16_Shootout_Team39 (20) SAS_AE16_Shootout_Team391. #analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Baby Boomers and the Aging Population
Team 39: Yaotong Cai, Chunla He,
Wenhao Pan, Richard Ross , Shiyu Ye
Advisor: Dr. Dan Hall
2. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Problem Summary
Oldest Baby Boomers Reaching Retirement Age
Aging population will carry new challenges
How will the incidence of common diseases shift
Labor force age demographics
Social Security solvency
Potential Impacts
Vaccines
Policy Changes
3. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Results
Workflow
Age Handling
Modeling - Disease and Labor Force
Uncertainty
Outline
4. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Cancer is expected to continue to decline in incidence
Asthma is expected to increase in incidence for females
through 2060, but has already peaked for males (2002)
Alzheimer’s Disease & Dementia, Osteoarthritis, and
Septicemia are predicted to increase over the next 45 years
without reaching a peak
Results – Hospitalization Rates
5. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Disease Trajectories
6. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Results – Labor Force
Dominating Age Groups:
Management, professional and related: 25-34 and 35-44 will make up more
than half of this industry by 2045
Natural Resources, construction, and maintenance: 45-54 age group will hold
a majority of this industry by 2050.
Production, transportation and material moving: 20-24 and 25-34 age group
will hold a majority of this industry by 2050.
Sales and Office: 25-34, 55-64, and 65 years and over age group will hold
over 70% of this industry by 2050.
Service: 20-24 and 25-34 age group will hold a majority of this industry by
2045.
7. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Results – Labor Force
8. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Results – Labor Force
Industry level: (1) Natural Resources, Construction, and Maintenance,
(2) Production, Transportation and Material Moving, and
(3) Sales and Office
were most affected by economic recession.
Age Group level: younger workers were hit hardest by
recession
9. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Health Care Cost Reduction: Vaccine
Suppose Septicemia Vaccines developed in 2025
Hospitalizations decreased by 20%, costs per hospitalization
decreased by 25%
Savings Projections:
Septicemia Rate projected to increase greatly
$40B savings in 2025
$2.5 T savings in 2060
10. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Projected Revenue and Expenditures – Current System
Estimate of payment: 117,000 × 12.4%
Estimate of cost: $ 1503 × 12 for over 67 age population
Result: Society Security Payments (inflow) will be less than expenditures
(outflow) after 2020
Social Security system will reach insolvency by 2035, consistent with
the Social Security Administration's current estimates
Forecasted $8 trillion deficit by 2060
Forecast on Sustainability of Social Security
& Tax System
11. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Projected Revenue and Expenditures – Policy Change Strategies
1) Increase the retirement age to 70
Balanced until 2030, solvent through 2060
2) Reduce the benefit amount to $1300/month
Balanced until 2025, solvent through 2060
3) Increase the tax rate from 12.4% to 14.5%
Balanced until 2025, solvent through 2060
Forecast on Sustainability of Social Security
& Tax System
12. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Workflow: Analysis Process
EDA &
Feature
Engineering
Aggregation &
Matching of Age
Group:
(1). Simple
Combination
(2). Age Mapping
Disease Prevalence
Study
--Poisson GLMM
--2D P-spline Models
Labor Force Demand
Study
--Multinomial Logistic
GLMM with P-splines
Projected population
in industries;
Solutions for Social
Security & Tax
sustainability
Projected trends of
disease incidence;
Implications of
Vaccines
Missing Data
Handling
13. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Age Categories in Three Data Sets
14. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Simple combination
Arithmetic aggregation or disaggregation for some age groups
Smoothing
Reconcile the remaining Age groups
based Rizzi et al. (2015)’s work to
disaggregate and re-aggregate age
in one-year bins, while adding cohort
effects (smoothing age and cohort)
Overall Strategy – Matching age groups
15. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Smoothing Results
16. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Labor force and disease modeling problems have similar
elements:
Stratified annual count data for multiple age groups over relatively
short time spans, with goal of forecasting far into future.
Wanted to impose little structure on data (smoothing) and borrow
strength across age strata (mixed models).
Use Generalized linear mixed models (GLMMs) to implement
P-spline smoothing (PROC GLIMMIX) to model observed
trends and make forecasts.
Prediction intervals important. Must recognize huge uncertainty.
Framework accommodates missing data easily.
Overall Modeling Strategy:
17. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Primary analysis based on Poisson GLMMs with log link and P-
splines for time effects.
Models included age stratum-specific intercepts and slopes on time but
smoothing parameters common to all age groups.
For strata with anomalous behavior/small sample size, model
constraints used to link adjacent age strata and avoid unrealistic results.
Secondary analysis based on 2-D P splines (Currie et al. 2006,
MortalitySmooth package in R).
Smoothing over time and age (counts disaggregated by age).
Results similar to primary analysis
Disease Models:
18. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Natural framework is multinomial logit model (each worker in one of five
industries or other).
Again, P-splines to smooth effects of time, with age-specific intercepts and
slopes on time → multinomial logit GLMM.
By Begg and Gray, 1984, if large prevalence “other” category used as
reference level, binomial error distributions can be used to fit separately
for each industry w/ little efficiency loss. Therefore used binomial logit
GLMMs in PROC GLIMMIX.
Models included log-time effect truncated at year 2006 to allow effects of
economic recession on labor force to dissipate. Avoids short term effects
of recession having undue effect on long term forecasts.
Labor Force Models
19. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
Uncertainty in modeling diseases data
1. Predicting rates several decades into the future based on 10-
15 years data.
2. Training data covers a time period with a major anomalous
event
3. Age matching and missing data
Limitations
20. Copyr ight © 2016, SAS Institute Inc. All rights reser ved.
#analyticsx
Notas do Editor Cancer decreases
Asthma curves down and then up
Dementia / Septicemia / Osteoarthritis skyrockets Examples of Cohort Effects: Number of 23-year-olds in 2020 is related to the number of 28-year-olds in 2025.
GLMM help us to capture non-linear effects through time
P-splines were used because they are well suited for modeling rates through time.
Both GLMM and P-splines provided similar results, which helps us to feel confident that our predictions are reasonable
Yaotong’s Comment: For 2D P-spline model, I think it would be better if we talk about how we utilize results in age group disaggregation to generate future predictions. What do you mean here?
GLMM help us to capture non-linear effects through time
P-splines were used because they are well suited for modeling rates through time.
Both GLMM and P-splines provided similar results, which helps us to feel confident that our predictions are reasonable
Yaotong’s Comment: For 2D P-spline model, I think it would be better if we talk about how we utilize results in age group disaggregation to generate future predictions. What do you mean here?