SlideShare uma empresa Scribd logo
1 de 36
Baixar para ler offline
1
Abstract
Claims coming from human medical observational studies, when tested rigorously,
most often fail to replicate. Whereas randomized clinical trials replicate over 80%
of the time, medical observational studies replicate only 10 to 20% of the time. Multiple
re-test studies reported JAMA failed to replicate. For example in the early 1990s,
Vitamin E was reported to protect against heart attacks. Large, well-conducted
randomized clinical trials did not replicate this claim. The claim that Type A Personality
leads to heart attacks failed to replicate in two separate studies, yet the myth still lives.
Clearly, there are systematic problems with how observational studies are conducted
and analyzed that need to be identified and fixed. Edwards Deming, the most famous
quality expert ever, says that any problem with a failed process is not the fault of the
workers, scientists conducting observational studies, but of management. Funding
agencies and journal editors need to fix a clearly broken process. Technical problems
are identified. Tough management solution are proposed. A simple statistical analysis
strategy is presented. Many human health problems can only be examined using
observational data. Our proposals, technical and managerial, should lead to more
reliable claims along with fair ways to judge their reliability.
NISS
22
Contact
Information
Stan Young
National Institute of Statistical Sciences
www.niss.org
young@niss.org
919 685 9328
NISS
NISS 3
Hayek, 1974, Nobel Lecture
It is often difficult enough for the expert,
and certainly in many instances impossible
for the layman, to distinguish between
legitimate and illegitimate claims advanced
in the name of science.
It is often difficult enough for the expert,
and certainly in many instances impossible
for the layman, to distinguish between
legitimate and illegitimate claims advanced
in the name of science.
NISS 4
Hayek (2)
...much effort will have to be directed toward debunking
such arrogations*, some of which have by now become
the vested interests of established university departments.
*Claims without proper foundation
5
Reliability of Literature Claims
S. Stanley Young
National Institute of Statistical Sciences
Young@niss.org, 919 685 9328
VIP Lecture
NISS
66
Science point of view
What is the meaning of life?
What is real?
What is reproducible?
Fooled (fooling) by randomness?
NISS
77
The Players
1. The workers – scientists
2. The communicators –
a. PR people
b. Bloggers
c. Reporters
d. Science writers
3. The consumers – public, regulatory
agencies, trial lawyers
4. The management – funding agencies,
journal editors
NISS
88
The Worker is not the Problem.
W. Edwards Deming,
the most visionary innovator ever on quality control, said
The worker is not the problem.
The problem is at the top! Management!
To Deming, blaming the workers—individual researchers—
is as incorrect as it is useless.
Bringing the system under control is the responsibility
of those managing it.
NISS
9
Problems with observational studies
“Everything is dangerous”
1. Data staging
2. No written analysis protocol
3. Multiple testing
4. Multiple modeling
5. Uncorrected bias
6. Self-serving paper writing
7. Self-serving press release
8. Actually believe the claims
9NISS
Assertion : Every study is positive
Data Staging
 Bias
Multiple testing
Multiple model searching
Any or all will lead to essentially all
observational studies being positive!
10NISS
11
First, data staging
Stan:
Why do you think data staging is a big issue?
Because it can be done in myriad ways, is
rarely documented, and is usually not
reproducible?
David Madigan
11NISS
12
Multiple Testing: P-value, t-test
Population,
real or theoretical
Two samples,
random
NISS
10-sided dice experiment
12/25/12
NISS
14
How do you get a “p < 0.05”?
Answer: Ask lots of questions.
61 questions
95% chance of
a positive study!
NISS
1515
Let’s run an epidemiology study!
10-sided dice simulation:
Coffee causes X.
NISS
1616
P-value plot – 60 p-values.
NISS
17
Cereal determines human gender
Really?????
17NISS
NISS
19
2 Cancer types, 48 pesticies, 96 questions
Three claims made, only one appears valid.
NISS
Multiple Modeling/Bias
Take a simple difference
Paper, data, claim
American Cancer Society Cancer Prevention Study II
No association with CV deaths, corrected for PM2.5.
Ozone associated with respiratory deaths.
21
Jerrett et al. Large Search Space
22
Covariate Adjustment
27
= 128
23
Large search space
32 x 128 = 4,096
The data used in this paper is not available.
We are asked to trust that analysis decisions
were good and claims are robust.
Any adjustment for multiple testing
and/or multiple modeling renders p-values NS.
24
2525
Crisis in science? 2011, 2012
Nature, 2012
Significance, 2011
NISS
26
Claims from observational studies tested in RCTs
27
What can funding agencies do?
Fund data generation and analysis
separately.
Fund replication studies.
Require data used in publication be
posted on publication.
28
What can journal editors do?
Quality by inspection, p-value < 0.05, is not working.
(Many workers are gaming the system.)
Management needs to re-design the system to build
quality into the product.
Papers following good manufacturing procedures and
addressing important questions, should be accepted
without regard to statistical significance.
Require data used in publication be posted on
publication.
29
What can you, the consumer, do?
(not much)
1. Be skeptical of observational study claims.
2. Read the actual paper.
3. Count the claims under consideration.
4. Ask for the data set.
5. Letter to editor : voodoo stats and trust me
science. (Educate editors.)
6. Write to funding agency.
7. Write to congressman.
29NISS
30
“New” p-value plot, - log10
(p-value), Dmitri Zaykin
31
Conclusions
Most science claims do not replicate.
Deming: Don't blame the worker
(or expect them to adopt different methods).
Funding agencies and journal editors have been AWOL.
Require data to be placed in depository on publication.
32
One irate study evaluator, 2012
Mens Sana Monograph, 2012
3333
Contact
Information
Stan Young
National Institute of Statistical Sciences
www.niss.org
young@niss.org
919 685 9328
NISS
Ozone/PM2.5 Acute Deaths LA
34
3535
Suggestions for effective management
of observational studies
No funding / publication without:
1. Public posting protocol before study initiation.
2. Public posting of data set on publication.
3. Clear statement of questions under consideration.
4. Conform to “Reproducible Research” guidelines.
5. Any claims must be independently replicated.
NISS
3636
Congressional Management:
True Science Transparency Act
Any federal agency proposing rule-making or legislation
shall specifically name each document used to support
the proposed rule-making or legislation and provide
all data used in said document for viewing by the public.
See also OSTP memorandum, 22Feb2013.
NISS

Mais conteúdo relacionado

Mais procurados

Probing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
Probing with Severity: Beyond Bayesian Probabilism and Frequentist PerformanceProbing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
Probing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
jemille6
 
Exploratory Research is More Reliable Than Confirmatory Research
Exploratory Research is More Reliable Than Confirmatory ResearchExploratory Research is More Reliable Than Confirmatory Research
Exploratory Research is More Reliable Than Confirmatory Research
jemille6
 
The Scientific Method
The Scientific Method The Scientific Method
The Scientific Method
simonandisa
 
Oom not doom a novel method for improving psychological science, Bradley Woods
Oom not doom   a novel method for improving psychological science, Bradley WoodsOom not doom   a novel method for improving psychological science, Bradley Woods
Oom not doom a novel method for improving psychological science, Bradley Woods
NZ Psychological Society
 
Effect of fMRI Scan Presentation on Perceptions of Homosexuality
Effect of fMRI Scan Presentation on Perceptions of HomosexualityEffect of fMRI Scan Presentation on Perceptions of Homosexuality
Effect of fMRI Scan Presentation on Perceptions of Homosexuality
Jacob Wilson
 

Mais procurados (20)

Mayo: Evidence as Passing a Severe Test (How it Gets You Beyond the Statistic...
Mayo: Evidence as Passing a Severe Test (How it Gets You Beyond the Statistic...Mayo: Evidence as Passing a Severe Test (How it Gets You Beyond the Statistic...
Mayo: Evidence as Passing a Severe Test (How it Gets You Beyond the Statistic...
 
Mayo: Day #2 slides
Mayo: Day #2 slidesMayo: Day #2 slides
Mayo: Day #2 slides
 
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
 
Probing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
Probing with Severity: Beyond Bayesian Probabilism and Frequentist PerformanceProbing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
Probing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
 
D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy
 
Gelman psych crisis_2
Gelman psych crisis_2Gelman psych crisis_2
Gelman psych crisis_2
 
Replication Crises and the Statistics Wars: Hidden Controversies
Replication Crises and the Statistics Wars: Hidden ControversiesReplication Crises and the Statistics Wars: Hidden Controversies
Replication Crises and the Statistics Wars: Hidden Controversies
 
Severe Testing: The Key to Error Correction
Severe Testing: The Key to Error CorrectionSevere Testing: The Key to Error Correction
Severe Testing: The Key to Error Correction
 
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
 
D. G. Mayo: Your data-driven claims must still be probed severely
D. G. Mayo: Your data-driven claims must still be probed severelyD. G. Mayo: Your data-driven claims must still be probed severely
D. G. Mayo: Your data-driven claims must still be probed severely
 
Mayo &amp; parker spsp 2016 june 16
Mayo &amp; parker   spsp 2016 june 16Mayo &amp; parker   spsp 2016 june 16
Mayo &amp; parker spsp 2016 june 16
 
AstroLecture2
AstroLecture2AstroLecture2
AstroLecture2
 
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
 
Exploratory Research is More Reliable Than Confirmatory Research
Exploratory Research is More Reliable Than Confirmatory ResearchExploratory Research is More Reliable Than Confirmatory Research
Exploratory Research is More Reliable Than Confirmatory Research
 
Senn repligate
Senn repligateSenn repligate
Senn repligate
 
Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively
 
Improve your study with pre-registration
Improve your study with pre-registrationImprove your study with pre-registration
Improve your study with pre-registration
 
The Scientific Method
The Scientific Method The Scientific Method
The Scientific Method
 
Oom not doom a novel method for improving psychological science, Bradley Woods
Oom not doom   a novel method for improving psychological science, Bradley WoodsOom not doom   a novel method for improving psychological science, Bradley Woods
Oom not doom a novel method for improving psychological science, Bradley Woods
 
Effect of fMRI Scan Presentation on Perceptions of Homosexuality
Effect of fMRI Scan Presentation on Perceptions of HomosexualityEffect of fMRI Scan Presentation on Perceptions of Homosexuality
Effect of fMRI Scan Presentation on Perceptions of Homosexuality
 

Semelhante a 02 young vpi lecture 2014

MAKING SENSE OFSTATISTICSWhat statistics tell you an.docx
MAKING SENSE OFSTATISTICSWhat statistics tell you an.docxMAKING SENSE OFSTATISTICSWhat statistics tell you an.docx
MAKING SENSE OFSTATISTICSWhat statistics tell you an.docx
smile790243
 
Biomedical Research_House of Cards_Editorial
Biomedical Research_House of Cards_EditorialBiomedical Research_House of Cards_Editorial
Biomedical Research_House of Cards_Editorial
Rathnam Chaguturu
 
Research methodology
Research methodologyResearch methodology
Research methodology
Tosif Ahmad
 
SDR 24_2 proof
SDR 24_2 proofSDR 24_2 proof
SDR 24_2 proof
minamorris
 
TITLE OF THE PAPER121Report on GeriatricsPro
TITLE OF THE PAPER121Report on GeriatricsProTITLE OF THE PAPER121Report on GeriatricsPro
TITLE OF THE PAPER121Report on GeriatricsPro
TakishaPeck109
 

Semelhante a 02 young vpi lecture 2014 (20)

Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatistician
 
Machine learning applied in health
Machine learning applied in healthMachine learning applied in health
Machine learning applied in health
 
MAKING SENSE OFSTATISTICSWhat statistics tell you an.docx
MAKING SENSE OFSTATISTICSWhat statistics tell you an.docxMAKING SENSE OFSTATISTICSWhat statistics tell you an.docx
MAKING SENSE OFSTATISTICSWhat statistics tell you an.docx
 
Principles of data_science
Principles of data_sciencePrinciples of data_science
Principles of data_science
 
Biomedical Research_House of Cards_Editorial
Biomedical Research_House of Cards_EditorialBiomedical Research_House of Cards_Editorial
Biomedical Research_House of Cards_Editorial
 
The Many Lives of Data
The Many Lives of DataThe Many Lives of Data
The Many Lives of Data
 
Big Data: Learning from MIMIC- Celi
Big Data: Learning from MIMIC- CeliBig Data: Learning from MIMIC- Celi
Big Data: Learning from MIMIC- Celi
 
Nurses and Data Science
Nurses and Data ScienceNurses and Data Science
Nurses and Data Science
 
Alopecia Areata Research Summit Summary
Alopecia Areata Research Summit SummaryAlopecia Areata Research Summit Summary
Alopecia Areata Research Summit Summary
 
Research questions
Research questions Research questions
Research questions
 
Sun==big data analytics for health care
Sun==big data analytics for health careSun==big data analytics for health care
Sun==big data analytics for health care
 
Questions for knowledge creators
Questions for knowledge creatorsQuestions for knowledge creators
Questions for knowledge creators
 
MIT Medical Evidence Bootcamp for Journalists 2012
MIT Medical Evidence Bootcamp for Journalists 2012MIT Medical Evidence Bootcamp for Journalists 2012
MIT Medical Evidence Bootcamp for Journalists 2012
 
Published Research, Flawed, Misleading, Nefarious - Use of Reporting Guidelin...
Published Research, Flawed, Misleading, Nefarious - Use of Reporting Guidelin...Published Research, Flawed, Misleading, Nefarious - Use of Reporting Guidelin...
Published Research, Flawed, Misleading, Nefarious - Use of Reporting Guidelin...
 
Research methodology
Research methodologyResearch methodology
Research methodology
 
SDR 24_2 proof
SDR 24_2 proofSDR 24_2 proof
SDR 24_2 proof
 
Believable Science
Believable ScienceBelievable Science
Believable Science
 
TITLE OF THE PAPER121Report on GeriatricsPro
TITLE OF THE PAPER121Report on GeriatricsProTITLE OF THE PAPER121Report on GeriatricsPro
TITLE OF THE PAPER121Report on GeriatricsPro
 
Title of the paper121 report on geriatricspro
Title of the paper121 report on geriatricsproTitle of the paper121 report on geriatricspro
Title of the paper121 report on geriatricspro
 
A Guide To Using Qualitative Research Methodology
A Guide To Using Qualitative Research MethodologyA Guide To Using Qualitative Research Methodology
A Guide To Using Qualitative Research Methodology
 

Mais de jemille6

What are questionable research practices?
What are questionable research practices?What are questionable research practices?
What are questionable research practices?
jemille6
 
What's the question?
What's the question? What's the question?
What's the question?
jemille6
 
On Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the TwoOn Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the Two
jemille6
 
Comparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple TestingComparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple Testing
jemille6
 
Good Data Dredging
Good Data DredgingGood Data Dredging
Good Data Dredging
jemille6
 
The Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of ProbabilityThe Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of Probability
jemille6
 
Error Control and Severity
Error Control and SeverityError Control and Severity
Error Control and Severity
jemille6
 
The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)
jemille6
 
The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)
jemille6
 
The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (
jemille6
 

Mais de jemille6 (20)

“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”
 
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and ProbabilismStatistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
 
D. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdfD. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdf
 
reid-postJSM-DRC.pdf
reid-postJSM-DRC.pdfreid-postJSM-DRC.pdf
reid-postJSM-DRC.pdf
 
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
 
Causal inference is not statistical inference
Causal inference is not statistical inferenceCausal inference is not statistical inference
Causal inference is not statistical inference
 
What are questionable research practices?
What are questionable research practices?What are questionable research practices?
What are questionable research practices?
 
What's the question?
What's the question? What's the question?
What's the question?
 
The neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and MetascienceThe neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and Metascience
 
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
 
On Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the TwoOn Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the Two
 
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
 
Comparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple TestingComparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple Testing
 
Good Data Dredging
Good Data DredgingGood Data Dredging
Good Data Dredging
 
The Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of ProbabilityThe Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of Probability
 
Error Control and Severity
Error Control and SeverityError Control and Severity
Error Control and Severity
 
The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)
 
The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)
 
On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...
 
The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (
 

Último

Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Último (20)

Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 

02 young vpi lecture 2014

  • 1. 1 Abstract Claims coming from human medical observational studies, when tested rigorously, most often fail to replicate. Whereas randomized clinical trials replicate over 80% of the time, medical observational studies replicate only 10 to 20% of the time. Multiple re-test studies reported JAMA failed to replicate. For example in the early 1990s, Vitamin E was reported to protect against heart attacks. Large, well-conducted randomized clinical trials did not replicate this claim. The claim that Type A Personality leads to heart attacks failed to replicate in two separate studies, yet the myth still lives. Clearly, there are systematic problems with how observational studies are conducted and analyzed that need to be identified and fixed. Edwards Deming, the most famous quality expert ever, says that any problem with a failed process is not the fault of the workers, scientists conducting observational studies, but of management. Funding agencies and journal editors need to fix a clearly broken process. Technical problems are identified. Tough management solution are proposed. A simple statistical analysis strategy is presented. Many human health problems can only be examined using observational data. Our proposals, technical and managerial, should lead to more reliable claims along with fair ways to judge their reliability. NISS
  • 2. 22 Contact Information Stan Young National Institute of Statistical Sciences www.niss.org young@niss.org 919 685 9328 NISS
  • 3. NISS 3 Hayek, 1974, Nobel Lecture It is often difficult enough for the expert, and certainly in many instances impossible for the layman, to distinguish between legitimate and illegitimate claims advanced in the name of science. It is often difficult enough for the expert, and certainly in many instances impossible for the layman, to distinguish between legitimate and illegitimate claims advanced in the name of science.
  • 4. NISS 4 Hayek (2) ...much effort will have to be directed toward debunking such arrogations*, some of which have by now become the vested interests of established university departments. *Claims without proper foundation
  • 5. 5 Reliability of Literature Claims S. Stanley Young National Institute of Statistical Sciences Young@niss.org, 919 685 9328 VIP Lecture NISS
  • 6. 66 Science point of view What is the meaning of life? What is real? What is reproducible? Fooled (fooling) by randomness? NISS
  • 7. 77 The Players 1. The workers – scientists 2. The communicators – a. PR people b. Bloggers c. Reporters d. Science writers 3. The consumers – public, regulatory agencies, trial lawyers 4. The management – funding agencies, journal editors NISS
  • 8. 88 The Worker is not the Problem. W. Edwards Deming, the most visionary innovator ever on quality control, said The worker is not the problem. The problem is at the top! Management! To Deming, blaming the workers—individual researchers— is as incorrect as it is useless. Bringing the system under control is the responsibility of those managing it. NISS
  • 9. 9 Problems with observational studies “Everything is dangerous” 1. Data staging 2. No written analysis protocol 3. Multiple testing 4. Multiple modeling 5. Uncorrected bias 6. Self-serving paper writing 7. Self-serving press release 8. Actually believe the claims 9NISS
  • 10. Assertion : Every study is positive Data Staging  Bias Multiple testing Multiple model searching Any or all will lead to essentially all observational studies being positive! 10NISS
  • 11. 11 First, data staging Stan: Why do you think data staging is a big issue? Because it can be done in myriad ways, is rarely documented, and is usually not reproducible? David Madigan 11NISS
  • 12. 12 Multiple Testing: P-value, t-test Population, real or theoretical Two samples, random NISS
  • 14. 14 How do you get a “p < 0.05”? Answer: Ask lots of questions. 61 questions 95% chance of a positive study! NISS
  • 15. 1515 Let’s run an epidemiology study! 10-sided dice simulation: Coffee causes X. NISS
  • 16. 1616 P-value plot – 60 p-values. NISS
  • 17. 17 Cereal determines human gender Really????? 17NISS
  • 18. NISS
  • 19. 19 2 Cancer types, 48 pesticies, 96 questions Three claims made, only one appears valid.
  • 21. Paper, data, claim American Cancer Society Cancer Prevention Study II No association with CV deaths, corrected for PM2.5. Ozone associated with respiratory deaths. 21
  • 22. Jerrett et al. Large Search Space 22
  • 24. Large search space 32 x 128 = 4,096 The data used in this paper is not available. We are asked to trust that analysis decisions were good and claims are robust. Any adjustment for multiple testing and/or multiple modeling renders p-values NS. 24
  • 25. 2525 Crisis in science? 2011, 2012 Nature, 2012 Significance, 2011 NISS
  • 26. 26 Claims from observational studies tested in RCTs
  • 27. 27 What can funding agencies do? Fund data generation and analysis separately. Fund replication studies. Require data used in publication be posted on publication.
  • 28. 28 What can journal editors do? Quality by inspection, p-value < 0.05, is not working. (Many workers are gaming the system.) Management needs to re-design the system to build quality into the product. Papers following good manufacturing procedures and addressing important questions, should be accepted without regard to statistical significance. Require data used in publication be posted on publication.
  • 29. 29 What can you, the consumer, do? (not much) 1. Be skeptical of observational study claims. 2. Read the actual paper. 3. Count the claims under consideration. 4. Ask for the data set. 5. Letter to editor : voodoo stats and trust me science. (Educate editors.) 6. Write to funding agency. 7. Write to congressman. 29NISS
  • 30. 30 “New” p-value plot, - log10 (p-value), Dmitri Zaykin
  • 31. 31 Conclusions Most science claims do not replicate. Deming: Don't blame the worker (or expect them to adopt different methods). Funding agencies and journal editors have been AWOL. Require data to be placed in depository on publication.
  • 32. 32 One irate study evaluator, 2012 Mens Sana Monograph, 2012
  • 33. 3333 Contact Information Stan Young National Institute of Statistical Sciences www.niss.org young@niss.org 919 685 9328 NISS
  • 35. 3535 Suggestions for effective management of observational studies No funding / publication without: 1. Public posting protocol before study initiation. 2. Public posting of data set on publication. 3. Clear statement of questions under consideration. 4. Conform to “Reproducible Research” guidelines. 5. Any claims must be independently replicated. NISS
  • 36. 3636 Congressional Management: True Science Transparency Act Any federal agency proposing rule-making or legislation shall specifically name each document used to support the proposed rule-making or legislation and provide all data used in said document for viewing by the public. See also OSTP memorandum, 22Feb2013. NISS