Meta-regression with DisMod-MR:
how robust is the model?
June 18, 2013
Hannah M Peterson
Post-Bachelor Fellow
Global Burden of Disease Study 2010
YLDs
• Measures morbidity
• Requires age-specific prevalence
o For 291 outcomes
o For 2 sexes
o For 187 countries
o For 3 years
Is the negative-binomial distribution
the best choice?
DisMod-MR
Alternative distributions
• Normal
• Lognormal
• Binomial
• Negative-binomial
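For reference, the textbook forms of the four candidate distributions are sketched below. These are standard results, not a transcription of the slide's own density column. The symbols μ, σ, N, x, and p follow the speaker notes; the overdispersion symbol δ and the mean-dispersion form of the negative-binomial are assumptions made for illustration. For the lognormal, μ and σ describe ln x rather than x itself.

```latex
\begin{align*}
\text{Normal:} \quad & f(x \mid \mu, \sigma) = \frac{1}{\sigma\sqrt{2\pi}}
    \exp\!\left(-\frac{(x-\mu)^2}{2\sigma^2}\right) \\
\text{Lognormal:} \quad & f(x \mid \mu, \sigma) = \frac{1}{x\,\sigma\sqrt{2\pi}}
    \exp\!\left(-\frac{(\ln x-\mu)^2}{2\sigma^2}\right), \quad x > 0 \\
\text{Binomial:} \quad & P(x \mid N, p) = \binom{N}{x}\, p^{x} (1-p)^{N-x} \\
\text{Negative-binomial:} \quad & P(x \mid N, p, \delta) =
    \frac{\Gamma(x+\delta)}{\Gamma(\delta)\, x!}
    \left(\frac{\delta}{Np+\delta}\right)^{\delta}
    \left(\frac{Np}{Np+\delta}\right)^{x}
\end{align*}
```

In this form the negative-binomial has mean Np and variance Np + (Np)²/δ, so a finite δ lets the spread exceed the binomial's, and δ → ∞ recovers a Poisson with mean Np.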
Potential experimental frameworks
• Data collection
o Ideal
o Impractical
• Simulation
o Impossible to know true data distribution
• Out-of-sample cross validation
o Do not have to choose distribution
Out-of-sample cross validation
Out-of-sample predictive validity
• Randomly select 25% of data to use as “test data”
• Fit the remaining 75% of data (“training data”)
• Use fit to calculate statistics for test data
• Repeat the steps above:
o For each distribution
o For 1000 test-train splits
o For each disease data set
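The procedure above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the DisMod-MR implementation: `fit_mean_model` is a hypothetical stand-in for a real model fit, each data set is reduced to a plain list of prevalence values, and all function names are invented for the example.

```python
import random
import statistics


def train_test_split(rows, test_fraction=0.25, rng=None):
    """Randomly withhold a fraction of rows as test data; return (train, test)."""
    rng = rng or random.Random()
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_test = max(1, round(test_fraction * len(shuffled)))
    return shuffled[n_test:], shuffled[:n_test]


def fit_mean_model(train):
    """Hypothetical stand-in for a model fit: always predicts the training mean."""
    mean = statistics.fmean(train)
    return lambda row: mean


def cross_validate(data_sets, fitters, n_splits=1000, seed=0):
    """Collect (observed, predicted) pairs for every data set, split, and model."""
    rng = random.Random(seed)
    results = {}
    for name, rows in data_sets.items():
        for split in range(n_splits):
            train, test = train_test_split(rows, rng=rng)
            # Reuse the same split for every candidate so the comparison is fair.
            for label, fitter in fitters.items():
                predict = fitter(train)
                results[(name, split, label)] = [(obs, predict(obs)) for obs in test]
    return results


# Toy usage: one made-up data set, one candidate model, 10 splits.
data_sets = {"example": [0.01, 0.02, 0.015, 0.03, 0.025, 0.02, 0.01, 0.04]}
results = cross_validate(data_sets, {"mean": fit_mean_model}, n_splits=10)
```

Reusing one split across all candidate distributions mirrors the speaker notes, which state that the same test-train splits are fit for each distribution.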
Comparing distributions
How to determine the best distribution?
Metrics of evaluation
• Bias
• Median absolute error (MAE)
• Percent coverage (PC)
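As a rough sketch, the three metrics named in the speaker notes (bias, median absolute error, and percent coverage) could be computed from withheld observations as follows. The function names, the sign convention for bias (prediction minus observation), and the toy numbers are assumptions for illustration only.

```python
import statistics


def bias(observed, predicted):
    """Mean signed difference between predictions and withheld observations."""
    return statistics.fmean([p - o for o, p in zip(observed, predicted)])


def median_absolute_error(observed, predicted):
    """Median of the absolute prediction errors; robust to a few large outliers."""
    return statistics.median([abs(p - o) for o, p in zip(observed, predicted)])


def percent_coverage(observed, lower, upper):
    """Percent of observations that fall inside their uncertainty interval."""
    hits = sum(lo <= o <= hi for o, lo, hi in zip(observed, lower, upper))
    return 100.0 * hits / len(observed)


# Toy usage with made-up prevalence values and interval bounds.
observed = [0.10, 0.20, 0.30, 0.40]
predicted = [0.12, 0.18, 0.33, 0.40]
lower = [0.05, 0.15, 0.31, 0.35]
upper = [0.15, 0.25, 0.40, 0.45]

b = bias(observed, predicted)
mae = median_absolute_error(observed, predicted)
pc = percent_coverage(observed, lower, upper)
```

With 95% uncertainty intervals, a well-calibrated model would have a percent coverage close to 95.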
Results
Percent of wins (%)
Distribution        Bias   MAE    PC     Total
Normal              22.1   20.6   34.6   25.7
Lognormal           29.7   13.0   36.5   26.4
Binomial            26.3   48.3    1.9   25.5
Negative-binomial   21.9   18.1   27.1   22.4
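A "percent of wins" figure like those in the table can be tallied by picking, for each disease data set and split, the distribution with the best metric value. The sketch below assumes a hypothetical `scores` layout, made-up numbers, and a tie-breaking rule (first listed distribution wins) that the slides do not specify.

```python
from collections import Counter


def percent_wins(scores, badness):
    """Percent of (data set, split) pairs won by each distribution.

    scores:  {(data_set, split): {distribution: metric value}}
    badness: maps a metric value to a number where smaller is better,
             e.g. abs for bias, or lambda pc: abs(pc - 95) for coverage.
    """
    wins = Counter()
    for by_distribution in scores.values():
        # Ties go to the first distribution listed (an assumption).
        winner = min(by_distribution, key=lambda d: badness(by_distribution[d]))
        wins[winner] += 1
    total = sum(wins.values())
    return {d: 100.0 * n / total for d, n in wins.items()}


# Toy usage: made-up bias values for two distributions over four splits.
scores = {
    ("a", 0): {"normal": 0.02, "lognormal": -0.01},
    ("a", 1): {"normal": 0.005, "lognormal": 0.03},
    ("b", 0): {"normal": -0.04, "lognormal": 0.01},
    ("b", 1): {"normal": 0.02, "lognormal": -0.015},
}
bias_wins = percent_wins(scores, abs)
```

Counting wins discards the size of each difference, which is why the speaker notes caution that the underlying gaps between distributions are small.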
Conclusions
• Choice of distribution doesn’t greatly influence results
• Best overall performance: lognormal distribution
o Contingent on method to adjust data whose value is 0
• Further investigate when each distribution performs best
o Dependent on number of covariates, priors, amount of data?
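The zero adjustment that the lognormal result depends on can be sketched as below. The speaker notes say only that values of 0 were changed to 1 observation; reading that as "treat a study with no positive cases as having one positive case" is an assumption, as are the function names.

```python
import math


def adjust_zero_prevalence(positives, sample_size):
    """Return a prevalence that can be log-transformed.

    Assumed reading of the adjustment: a count of 0 positive cases is
    replaced by 1 case, so the prevalence becomes 1 / sample_size.
    """
    if sample_size <= 0:
        raise ValueError("sample_size must be positive")
    cases = max(positives, 1)
    return cases / sample_size


def log_prevalence(positives, sample_size):
    """Natural log of the zero-adjusted prevalence."""
    return math.log(adjust_zero_prevalence(positives, sample_size))


# Toy usage: a study of 200 people with no positive cases.
adjusted = adjust_zero_prevalence(0, 200)
logged = log_prevalence(0, 200)
```

An offset lognormal, which the notes mention as another option, would instead add a small constant to every value before taking the log.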
Thank you
Hannah Peterson
peterhm@uw.edu
www.healthmetricsandevaluation.org


Editor's Notes

  1. The Global Burden of Disease Study 2010 (GBD) is a huge endeavor to measure health loss from diseases, injuries, and risk factors using the disability-adjusted life year (DALY).
     - It is coarsely described in this 18-step process.
     - I am just going to focus on a small subsection: the calculation of DALYs for injuries and diseases.
     - I further narrow the focus to the calculation of YLDs.
     Figure: Murray, Ezzati, et al. 2013. “GBD 2010: design, definitions, and metrics”. The Lancet. 380(9859):2063-2066.
  2. YLDs measure morbidity, or years lived in less than full health.
     - The YLD calculation needs age-specific prevalence estimates. For GBD, this means estimates:
       - for 291 outcomes
       - for 2 sexes
       - for 187 countries
       - for 3 years
     - However, prevalence data is often less than ideal. Examples, showing all available data in Western Europe for the GBD 2010 Study:
       - sparse (fungal diseases)
       - noisy (lower back pain)
       - sparse and noisy (cannabis dependence)
     - To calculate age-specific prevalence, GBD used a tool called DisMod-MR.
  3. DisMod-MR is designed to address missing data and inconsistency.
     - It uses epidemiologic data and covariate data to calculate the age-specific prevalence based on a negative-binomial distribution.
     - It assumes all epidemiologic data follows a negative-binomial distribution.
     - Is that really the best distribution to model the epidemiologic data?
     Figure: Vos, Flaxman, et al. 2013. “Years lived with disability (YLDs) for 1160 sequelae of 289 diseases and injuries 1990-2010: a systematic analysis for the Global Burden of Disease Study 2010”. The Lancet. 380(9859):2163-2196.
  4. Normal (μ = mean, σ = standard deviation)
     - Mathematically convenient.
     - PROBLEM: allows negative estimates of prevalence, which are physiologically impossible.
  5. Lognormal (μ = mean, σ = standard deviation)
     - Bounds estimates at 0.
     - PROBLEM: doesn’t allow prevalence to be 0, because you can’t take the log of 0.
       - Changed values of 0 to be 1 observation.
       - Another option would be to use an offset lognormal distribution.
       - Either way, estimates of 0 have to be worked around somehow.
  6. Binomial (N = individuals tested, x = tested positive, p = probability), which Dr. Flaxman already discussed.
     - Discrete model.
  7. Negative-binomial (N = individuals tested, x = tested positive, p = probability)
     - Discrete model.
     - A transformation yields an overdispersion parameter, which allows the standard deviation to vary.
  8. There are several ways to test which distribution is the best.
     - Data collection (ideal)
       - Actually go to the country (or region?) and measure age-specific prevalence.
       - Expensive and impractical.
     - Simulation
       - Great for testing, not for validation.
       - Problem: you have to choose the distribution from which the simulated data/measurements come.
         - This is what we’re testing.
         - Simulation can show whatever you want.
         - It is impossible to know from what distribution the real measurements come.
     - Out-of-sample cross validation
       - A way to evaluate and compare distributions.
       - Shows how the model performs in real life.
         - Can test out-of-sample predictive validity.
         - Don’t have to choose a data distribution.
       - Concerns
         - Unstable with sparse data:
           - not just the epidemiologic data
           - also covariates and priors
  9. This experiment:
     - Used 57 different disease data sets that
       - met the inclusion criterion of more than 4 prevalence points in Western Europe
       - were not birth conditions, meaning conditions whose prevalence data is only at age 0
     - Was restricted to Western Europe.
     To explain out-of-sample cross validation, I used an example from GBD 2010: fungal diseases.
  10. Randomly select 25% of data to withhold as test data. The test data is used to evaluate results.
  11. Test data is withheld from DisMod-MR
  12. And the remaining data is fit
  13. From the fit, estimates are compared to the test data. This comparison of the estimates to the test data is where the statistics are calculated. The same test-train splits are fit for each of the distributions so we can make a comparison.
  14. The process is repeated 1000 times with different test-train splits, and repeated for each of the 57 disease/injury data sets that met the inclusion criteria:
      - more than 4 prevalence points in Western Europe
      - not a birth condition, meaning a condition whose prevalence data is only at age 0
  15. The metrics capture different aspects of model performance. We want a model that is precise, accurate, and well-calibrated.
      - Precise: bias
        - Measures the average difference between the test data and the prediction.
      - Accurate: median absolute error (MAE)
        - Measure of overall error.
        - Many small errors create one large number.
        - Sensitive to mean and scale.
        - Less sensitive to outliers.
      - Calibrated: percent coverage (PC)
        - Calibrated means that our estimates are in the correct range of values.
          - If we aim for 95% uncertainty, we expect 95% of our uncertainty intervals to contain the observation.
          - More than that and the uncertainty intervals are too wide (the model is underconfident).
          - Less than that and the intervals are too narrow (the model is overconfident), so the model isn’t very good.
        - Percent of the time the uncertainty interval of the prediction contains the observation.
        - Sensitive to discrete distributions.
      To determine which distribution performed the best, we counted the winner for each disease data set and split.
  16. For different metrics, different distributions are superior.
      - This makes sense, since each distribution has its strengths and weaknesses.
      - Smallest bias: lognormal.
      - Minimum MAE: binomial.
      - Closest percent coverage: lognormal.
      A concern about reporting the most frequent winner rather than raw numbers is that the differences are small:
      - bias: ten-thousandths (E-4); average bias is negative binomial
      - MAE: hundredths
      Overall winner: lognormal.
  17. As previously seen, distribution choice doesn’t greatly influence DisMod-MR’s estimates of age-specific prevalence.
      - Results differ by metric.
      - Best overall performance: lognormal distribution.
        - STRESS: contingent on the method used to adjust data whose value is 0.
      - Further investigate when each distribution performs best.
        - Dependent on number of covariates, priors, amount of data?
      DisMod-MR is robust in that the choice of distribution for epidemiological values does not greatly influence estimates, but one distribution performs the best most frequently.