D. Mayo JSM slides v2.pdf

J
Sir David Cox’s Statistical Philosophy and
Its Relevance to Today’s Statistical
Controversies
0
15 July 1924 – 18 January 2022
Deborah G. Mayo
Virginia Tech August 8, 2023
JSM 2023, Toronto
Cox is celebrated for his pioneering work for
which he received many awards…
• International Prize in Statistics in 2016.
• Copley Medal, the Royal Society’s highest award, 2010.
• Knighted in 1985.
• Gold Medal for Cancer Research for the "Proportional
Hazard Regression Model" in 1990.
1
2
Importance of statistical foundations
Alongside his contributions that have transformed
applied statistics, Cox focused on the foundations of
statistical inference
“Without some systematic structure statistical methods
for the analysis of data become a collection of tricks…”
(2006, xiii)
3
How does a philosopher of statistics wind up
working with a giant in statistics?
In late summer 2003 (nearly 20 years ago today), I
(boldly) invited him to be in a session I was forming on
philosophy of statistics
[Optimality: The Second Erich L. Lehmann
Symposium, May 19–22, 2004; Rice University, Texas]
4
Discussions over the next several years:
• “Frequentist Statistics as a Theory of Inductive
Inference” (Mayo and Cox 2006)
• “Objectivity and Conditionality in Frequentist
Inference” (Cox and Mayo 2010)
5
“A Statistical Scientist Meets a Philosopher
of Science”
Sir David Cox: “…we both think foundations of
statistical inference are important; why do you think
that is?”
Mayo: “…in statistics …we invariably cross into
philosophical questions about empirical knowledge
and inductive inference.” (Cox and Mayo 2011)
6
Key Feature of Cox’s Statistical Philosophy
• We need to calibrate methods: how they would
behave in (actual or hypothetical) repeated sampling
“any proposed method of analysis that in repeated
application would mostly give misleading answers is
fatally flawed” (Cox 2006, 198)
Weak repeated sampling principle
7
Two philosophical questions
(still controversial):
1. How can the frequentist calibration be used as
an evidential or inferential assessment
(epistemological use)?
2. How can we ensure: “that the hypothetical long
run used in calibration is relevant to the specific
data” (Cox 2006, 198)
8
“…we begin with the core elements of significance
testing in a version very strongly related to but in
some respect different from both Fisherian and
Neyman-Pearson approaches…” (2006, 80)
Mayo & Cox 2006
“Frequentist statistics as a theory of
inductive inference”
R. A. Fisher J. Neyman E. Pearson
Simple statistical significance tests
9
“…to test the conformity of the particular data under
analysis with H0 in some respect:
[We find a] test statistic T, such that
• the larger the value of T the more inconsistent are
the data with H0;
…the p-value Pr(T ≥ tobs; H0)” (81)
10
Testing rationales: Inductive behavior vs
inductive inference:
Small p-values are taken as inconsistency with H0 (in the
direction being probed)
• Behavioristic: To follow a rule with low error rates.
• Evidential: The specific data set provides evidence
of discrepancy from H0. Why?
(FEV) Frequentist Principle of Evidence:
Mayo and Cox (2006)
11
“FEV(i): y is (strong) evidence against H0 i.e., (strong)
evidence of discrepancy from H0 if and only if [were H0
correct] then, with high probability
[(1-p)] this would have resulted in a less discordant result
than [observed in] y” (82)
Inferential interpretation
12
“a statistical version of the valid form…modus tollens
[falsification]
[here] probability arises …to characterize the ‘riskiness’ or
probativeness or severity of the tests to which hypotheses
are put…reminiscent of the philosophy of Karl Popper”
(82)
13
FEV (ii) for interpreting non-statistically significant
results, avoiding classic fallacies
14
Move away from a binary
“significant / not significant” report
• One way: consider testing several discrepancies from
a reference hypothesis H0 (i.e., several Hi’s) and
reporting how well or poorly warranted they are by
the data
This gets beyond long-standing and recent
problems leading some to suggest restricting
statistical significance tests to quality control
decisions (behavioristic use)
15
Analogy with measuring devices
16
“The significance test is a measuring device for
accordance with a specified hypothesis calibrated …by
its performance in repeated applications,
…we employ the performance features to make
inferences about aspects of the particular thing that is
measured, aspects that the measuring tool is
appropriately capable of revealing.” (84)
17
Relevance
“in ensuring that the … calibration is relevant to
the specific data under analysis, often taking due
account of how the data were obtained.”
(Cox 2006, 198)
18
Selection effects
Selectively reporting effects, optional stopping, trying and
trying again etc. may result in failing the weak repeated
sampling principle.
19
“the probability of finding some such discordance or other
may be high even under the null. Thus, following FEV(i),
we would not have genuine evidence of discordance with
the null, and unless the 𝑝-value is modified appropriately,
the inference would be misleading.” (92)
Selection effects
20
• “The probability is high we would not obtain a match
with person i, if i were not the criminal;
• By FEV, the match is good evidence that i is the
criminal.
• A non-match virtually excludes the person.” (94)
Not all cases with selection or multiplicity call
for adjustments
Searching a full database for a DNA match:
21
Calibration is the source of objectivity:
“Objectivity and Conditionality in Frequentist
Inference” (Cox & Mayo 2010):
“Frequentist methods achieve an objective connection
to hypotheses about the data-generating process by
being constrained and calibrated by the method’s error
probabilities” (Cox and Mayo 2010, 277)
22
“Objectivity and Conditionality in
Frequentist Inference” (Cox & Mayo 2010)
When Cox proposed this second paper include
“conditioning", I hesitated….
Good long-run performance isn’t enough
Cox’s (1958) “two measuring instruments” (“weighing
machine” example): toss a fair coin to test the mean of a
Normal distribution with either a very large, or a small,
variance (both known); and you know which was used.
23
Good long-run performance isn’t enough
As Cox (1958) argued, once you know the precision of
the tool used
Weak conditionality principle
24
“the p-value assessment should be made conditional on
the experiment actually run.” (Cox and Mayo 2010, 296)
25
• The “weighing machine” example caused “a subtle
earthquake” in statistical foundations. Why?
• The “best” test on long-run optimality grounds is the
unconditional test.
26
Does “conditioning for relevance” lead to the
unique case?
“[Many argue a frequentist] “is faced with the following
dilemma: either to deny the appropriateness of
conditioning on the precision of the tool chosen by the
toss of a coin, or else to embrace [a] principle* … that
frequentist sampling distributions are irrelevant to
inference [postdata]. This is a false dilemma. Conditioning
is warranted in achieving objective frequentist goals.”
(Cox & Mayo 2010, 298)
-
*Strong Likelihood Principle
27
• The (2010) paper with Cox was the basis for my
later article in Statistical Science debunking the
argument that appears to lead to the dilemma:
Mayo (2014)
28
“Principles of Statistical Inference” (2006)
“The object of the present book is to [describe the main]
controversies over more foundational issues that have
rumbled on …for more than 200 years”.
• Cox (1958) “Some problems connected with statistical
inference”
• Cox (2020) “Statistical significance”
29
“His writing on statistical significance and p-values
seemed to need repeating for each new generation”
(Heather Battey and Nancy Reid 2022, IMF obituary for
Sir David Cox)
Cox advanced a great many other
foundational issues I have not discussed:
• relating statistical and scientific inference;
• testing assumptions of models;
• rival paradigms for interpreting probability,
frequentist and Bayesian
30
David R. Cox Foundations of Statistics Award
Wednesday, August 9: 10:30-12:20
Metro Toronto Convention Centre Room: CC-701A
Chair: Ron Wasserstein (ASA)
Inaugural recipient: Nancy Reid, University of Toronto
“The Importance of Foundations in Statistical Science”
31
References
• Battey, H. and Reid, N. (2022). “Obituary: David Cox 1924-2022”. Institute of
Mathematical Statistics homepage, April 1, 2022.
• Cox, D.R. (1958). Some problems connected with statistical inference. Ann. Math. Statis
• Cox, D. R. (2006). Principles of Statistical Inference, CUP.
• Cox, D. R. (2018) In gentle praise of significance tests. 2018 RSS annual meeting. (Link
to Youtube video here.)
• Cox, D. R. & Mayo D. G. (2010). Objectivity and Conditionality in Frequentist Inference.
In Mayo & Spanos (eds), Error and Inference: Recent Exchanges on Experimental
Reasoning, Reliability and the Objectivity and Rationality of Science, pp. 276-304. CUP
• Cox, D. R. and Mayo, D. G. (2011). A Statistical Scientist Meets a Philosopher of
Science: A Conversation between Sir David Cox and Deborah Mayo. Rationality,
Markets and Morals (RMM) 2, 103–14.
• Mayo, D. G. (1996). Error and the Growth of Experimental Knowledge, Chicago:
Chicago University Press. (1998 Lakatos Prize)
32
• Mayo, D. G. (2014). “On the Birnbaum Argument for the Strong Likelihood Principle,”
(with discussion) Statistical Science 29(2) pp. 227-239, 261-266.
• Mayo, D. G. (2018). Statistical Inference as Severe Testing: How to Get Beyond the
Statistics Wars, Cambridge: Cambridge University Press.
• Mayo, D. G. (2022). A Remembrance of Sir David Cox: ‘”In celebrating Cox’s
immense contributions, we should recognise how much there is yet to learn from
him”. Significance. (April 2022: 36). [Link to all 4 remembrances.]
• Mayo, D.G. and Cox, D. R. (2006) “Frequentist Statistics as a Theory of Inductive
Inference,” Optimality: The Second Erich L. Lehmann Symposium (ed. J. Rojo),
Lecture Notes-Monograph series, IMS, Vol. 49: 77-97. [Reprincted in Mayo and
Spanos 2010.
• Reid, N. and Cox, D. R. (2015). On Some Principles of Statistical Inference.
International Statistical Review 83(2): 293-308.
33
1 de 34

Recomendados

Mayod@psa 21(na) por
Mayod@psa 21(na)Mayod@psa 21(na)
Mayod@psa 21(na)DeborahMayo4
74 visualizações48 slides
D. Mayo: Philosophical Interventions in the Statistics Wars por
D. Mayo: Philosophical Interventions in the Statistics WarsD. Mayo: Philosophical Interventions in the Statistics Wars
D. Mayo: Philosophical Interventions in the Statistics Warsjemille6
2.2K visualizações50 slides
Data Science definition por
Data Science definitionData Science definition
Data Science definitionCarloLauro1
24 visualizações49 slides
Let's talk about Data Science por
Let's talk about Data ScienceLet's talk about Data Science
Let's talk about Data ScienceCarlo Lauro
76 visualizações49 slides
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica... por
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...
Statistical "Reforms": Fixing Science or Threats to Replication and Falsifica...jemille6
931 visualizações45 slides
D. G. Mayo: Your data-driven claims must still be probed severely por
D. G. Mayo: Your data-driven claims must still be probed severelyD. G. Mayo: Your data-driven claims must still be probed severely
D. G. Mayo: Your data-driven claims must still be probed severelyjemille6
8.1K visualizações41 slides

Mais conteúdo relacionado

Similar a D. Mayo JSM slides v2.pdf

The Statistics Wars and Their Casualties (w/refs) por
The Statistics Wars and Their Casualties (w/refs)The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)jemille6
876 visualizações69 slides
Frequentist Statistics as a Theory of Inductive Inference (2/27/14) por
Frequentist Statistics as a Theory of Inductive Inference (2/27/14)Frequentist Statistics as a Theory of Inductive Inference (2/27/14)
Frequentist Statistics as a Theory of Inductive Inference (2/27/14)jemille6
6.7K visualizações47 slides
P-Value "Reforms": Fixing Science or Threat to Replication and Falsification por
P-Value "Reforms": Fixing Science or Threat to Replication and FalsificationP-Value "Reforms": Fixing Science or Threat to Replication and Falsification
P-Value "Reforms": Fixing Science or Threat to Replication and Falsificationjemille6
272 visualizações43 slides
State of the Art Informatics for Research Reproducibility, Reliability, and... por
 State of the Art  Informatics for Research Reproducibility, Reliability, and... State of the Art  Informatics for Research Reproducibility, Reliability, and...
State of the Art Informatics for Research Reproducibility, Reliability, and...Micah Altman
2.4K visualizações59 slides
SMMULs por
SMMULsSMMULs
SMMULsCharles Lance
304 visualizações40 slides
D. Mayo: Philosophy of Statistics & the Replication Crisis in Science por
D. Mayo: Philosophy of Statistics & the Replication Crisis in ScienceD. Mayo: Philosophy of Statistics & the Replication Crisis in Science
D. Mayo: Philosophy of Statistics & the Replication Crisis in Sciencejemille6
5K visualizações76 slides

Similar a D. Mayo JSM slides v2.pdf(20)

The Statistics Wars and Their Casualties (w/refs) por jemille6
The Statistics Wars and Their Casualties (w/refs)The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)
jemille6876 visualizações
Frequentist Statistics as a Theory of Inductive Inference (2/27/14) por jemille6
Frequentist Statistics as a Theory of Inductive Inference (2/27/14)Frequentist Statistics as a Theory of Inductive Inference (2/27/14)
Frequentist Statistics as a Theory of Inductive Inference (2/27/14)
jemille66.7K visualizações
P-Value "Reforms": Fixing Science or Threat to Replication and Falsification por jemille6
P-Value "Reforms": Fixing Science or Threat to Replication and FalsificationP-Value "Reforms": Fixing Science or Threat to Replication and Falsification
P-Value "Reforms": Fixing Science or Threat to Replication and Falsification
jemille6272 visualizações
State of the Art Informatics for Research Reproducibility, Reliability, and... por Micah Altman
 State of the Art  Informatics for Research Reproducibility, Reliability, and... State of the Art  Informatics for Research Reproducibility, Reliability, and...
State of the Art Informatics for Research Reproducibility, Reliability, and...
Micah Altman2.4K visualizações
SMMULs por Charles Lance
SMMULsSMMULs
SMMULs
Charles Lance304 visualizações
D. Mayo: Philosophy of Statistics & the Replication Crisis in Science por jemille6
D. Mayo: Philosophy of Statistics & the Replication Crisis in ScienceD. Mayo: Philosophy of Statistics & the Replication Crisis in Science
D. Mayo: Philosophy of Statistics & the Replication Crisis in Science
jemille65K visualizações
D. G. Mayo Columbia slides for Workshop on Probability &Learning por jemille6
D. G. Mayo Columbia slides for Workshop on Probability &LearningD. G. Mayo Columbia slides for Workshop on Probability &Learning
D. G. Mayo Columbia slides for Workshop on Probability &Learning
jemille6137 visualizações
West-Vanderbilt-Talk--Revised-22March2017.ppt por kait23
West-Vanderbilt-Talk--Revised-22March2017.pptWest-Vanderbilt-Talk--Revised-22March2017.ppt
West-Vanderbilt-Talk--Revised-22March2017.ppt
kait231 visão
Meeting #1 Slides Phil 6334/Econ 6614 SP2019 por jemille6
Meeting #1 Slides Phil 6334/Econ 6614 SP2019Meeting #1 Slides Phil 6334/Econ 6614 SP2019
Meeting #1 Slides Phil 6334/Econ 6614 SP2019
jemille65.3K visualizações
Computational Social Science:The Collaborative Futures of Big Data, Computer ... por Academia Sinica
Computational Social Science:The Collaborative Futures of Big Data, Computer ...Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Academia Sinica1.4K visualizações
"Reproducibility from the Informatics Perspective" por Micah Altman
"Reproducibility from the Informatics Perspective""Reproducibility from the Informatics Perspective"
"Reproducibility from the Informatics Perspective"
Micah Altman566 visualizações
Grounded theory methodology of qualitative data analysis por Dr. Shiv S Tripathi
Grounded theory methodology of qualitative data analysisGrounded theory methodology of qualitative data analysis
Grounded theory methodology of qualitative data analysis
Dr. Shiv S Tripathi25.7K visualizações
Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio... por Matthew Lease
Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...
Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collectio...
Matthew Lease614 visualizações
The Importance Of Quantitative Research Designs por Nicole Savoie
The Importance Of Quantitative Research DesignsThe Importance Of Quantitative Research Designs
The Importance Of Quantitative Research Designs
Nicole Savoie2 visualizações
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES por Micah Altman
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCESBROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
Micah Altman741 visualizações
Does preregistration improve the interpretability and credibility of research... por Mark Rubin
Does preregistration improve the interpretability and credibility of research...Does preregistration improve the interpretability and credibility of research...
Does preregistration improve the interpretability and credibility of research...
Mark Rubin39 visualizações
05 astrostat feigelson por Marco Quartulli
05 astrostat feigelson05 astrostat feigelson
05 astrostat feigelson
Marco Quartulli745 visualizações
see CV por butest
see CVsee CV
see CV
butest382 visualizações

Mais de jemille6

reid-postJSM-DRC.pdf por
reid-postJSM-DRC.pdfreid-postJSM-DRC.pdf
reid-postJSM-DRC.pdfjemille6
176 visualizações72 slides
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022 por
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022jemille6
23 visualizações51 slides
Causal inference is not statistical inference por
Causal inference is not statistical inferenceCausal inference is not statistical inference
Causal inference is not statistical inferencejemille6
2.5K visualizações36 slides
What are questionable research practices? por
What are questionable research practices?What are questionable research practices?
What are questionable research practices?jemille6
2.6K visualizações30 slides
What's the question? por
What's the question? What's the question?
What's the question? jemille6
2.5K visualizações48 slides
The neglected importance of complexity in statistics and Metascience por
The neglected importance of complexity in statistics and MetascienceThe neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and Metasciencejemille6
2.5K visualizações45 slides

Mais de jemille6(20)

reid-postJSM-DRC.pdf por jemille6
reid-postJSM-DRC.pdfreid-postJSM-DRC.pdf
reid-postJSM-DRC.pdf
jemille6176 visualizações
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022 por jemille6
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
jemille623 visualizações
Causal inference is not statistical inference por jemille6
Causal inference is not statistical inferenceCausal inference is not statistical inference
Causal inference is not statistical inference
jemille62.5K visualizações
What are questionable research practices? por jemille6
What are questionable research practices?What are questionable research practices?
What are questionable research practices?
jemille62.6K visualizações
What's the question? por jemille6
What's the question? What's the question?
What's the question?
jemille62.5K visualizações
The neglected importance of complexity in statistics and Metascience por jemille6
The neglected importance of complexity in statistics and MetascienceThe neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and Metascience
jemille62.5K visualizações
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a... por jemille6
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
jemille62.5K visualizações
On Severity, the Weight of Evidence, and the Relationship Between the Two por jemille6
On Severity, the Weight of Evidence, and the Relationship Between the TwoOn Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the Two
jemille62.5K visualizações
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel... por jemille6
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
jemille62.5K visualizações
Comparing Frequentists and Bayesian Control of Multiple Testing por jemille6
Comparing Frequentists and Bayesian Control of Multiple TestingComparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple Testing
jemille6757 visualizações
Good Data Dredging por jemille6
Good Data DredgingGood Data Dredging
Good Data Dredging
jemille6791 visualizações
The Duality of Parameters and the Duality of Probability por jemille6
The Duality of Parameters and the Duality of ProbabilityThe Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of Probability
jemille6728 visualizações
Error Control and Severity por jemille6
Error Control and SeverityError Control and Severity
Error Control and Severity
jemille6731 visualizações
On the interpretation of the mathematical characteristics of statistical test... por jemille6
On the interpretation of the mathematical characteristics of statistical test...On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...
jemille65.4K visualizações
The role of background assumptions in severity appraisal ( por jemille6
The role of background assumptions in severity appraisal (The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (
jemille65.4K visualizações
The two statistical cornerstones of replicability: addressing selective infer... por jemille6
The two statistical cornerstones of replicability: addressing selective infer...The two statistical cornerstones of replicability: addressing selective infer...
The two statistical cornerstones of replicability: addressing selective infer...
jemille65.5K visualizações
The replication crisis: are P-values the problem and are Bayes factors the so... por jemille6
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...
jemille65.4K visualizações
The ASA president Task Force Statement on Statistical Significance and Replic... por jemille6
The ASA president Task Force Statement on Statistical Significance and Replic...The ASA president Task Force Statement on Statistical Significance and Replic...
The ASA president Task Force Statement on Statistical Significance and Replic...
jemille610.4K visualizações
D. G. Mayo jan 11 slides por jemille6
D. G. Mayo jan 11 slides D. G. Mayo jan 11 slides
D. G. Mayo jan 11 slides
jemille616.2K visualizações
T. Pradeu & M. Lemoine: Philosophy in Science: Definition and Boundaries por jemille6
T. Pradeu & M. Lemoine: Philosophy in Science: Definition and BoundariesT. Pradeu & M. Lemoine: Philosophy in Science: Definition and Boundaries
T. Pradeu & M. Lemoine: Philosophy in Science: Definition and Boundaries
jemille62.1K visualizações

Último

Psychology KS4 por
Psychology KS4Psychology KS4
Psychology KS4WestHatch
98 visualizações4 slides
ICS3211_lecture 08_2023.pdf por
ICS3211_lecture 08_2023.pdfICS3211_lecture 08_2023.pdf
ICS3211_lecture 08_2023.pdfVanessa Camilleri
231 visualizações30 slides
Psychology KS5 por
Psychology KS5Psychology KS5
Psychology KS5WestHatch
119 visualizações5 slides
Jibachha publishing Textbook.docx por
Jibachha publishing Textbook.docxJibachha publishing Textbook.docx
Jibachha publishing Textbook.docxDrJibachhaSahVetphys
51 visualizações14 slides
Education and Diversity.pptx por
Education and Diversity.pptxEducation and Diversity.pptx
Education and Diversity.pptxDrHafizKosar
193 visualizações16 slides
Gross Anatomy of the Liver por
Gross Anatomy of the LiverGross Anatomy of the Liver
Gross Anatomy of the Liverobaje godwin sunday
61 visualizações12 slides

Último(20)

Psychology KS4 por WestHatch
Psychology KS4Psychology KS4
Psychology KS4
WestHatch98 visualizações
ICS3211_lecture 08_2023.pdf por Vanessa Camilleri
ICS3211_lecture 08_2023.pdfICS3211_lecture 08_2023.pdf
ICS3211_lecture 08_2023.pdf
Vanessa Camilleri231 visualizações
Psychology KS5 por WestHatch
Psychology KS5Psychology KS5
Psychology KS5
WestHatch119 visualizações
Jibachha publishing Textbook.docx por DrJibachhaSahVetphys
Jibachha publishing Textbook.docxJibachha publishing Textbook.docx
Jibachha publishing Textbook.docx
DrJibachhaSahVetphys51 visualizações
Education and Diversity.pptx por DrHafizKosar
Education and Diversity.pptxEducation and Diversity.pptx
Education and Diversity.pptx
DrHafizKosar193 visualizações
Gross Anatomy of the Liver por obaje godwin sunday
Gross Anatomy of the LiverGross Anatomy of the Liver
Gross Anatomy of the Liver
obaje godwin sunday61 visualizações
Women from Hackney’s History: Stoke Newington by Sue Doe por History of Stoke Newington
Women from Hackney’s History: Stoke Newington by Sue DoeWomen from Hackney’s History: Stoke Newington by Sue Doe
Women from Hackney’s History: Stoke Newington by Sue Doe
History of Stoke Newington163 visualizações
Relationship of psychology with other subjects. por palswagata2003
Relationship of psychology with other subjects.Relationship of psychology with other subjects.
Relationship of psychology with other subjects.
palswagata200352 visualizações
S1_SD_Resources Walkthrough.pptx por LAZAROAREVALO1
S1_SD_Resources Walkthrough.pptxS1_SD_Resources Walkthrough.pptx
S1_SD_Resources Walkthrough.pptx
LAZAROAREVALO164 visualizações
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB... por Nguyen Thanh Tu Collection
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
Nguyen Thanh Tu Collection88 visualizações
CUNY IT Picciano.pptx por apicciano
CUNY IT Picciano.pptxCUNY IT Picciano.pptx
CUNY IT Picciano.pptx
apicciano54 visualizações
ICS3211_lecture 09_2023.pdf por Vanessa Camilleri
ICS3211_lecture 09_2023.pdfICS3211_lecture 09_2023.pdf
ICS3211_lecture 09_2023.pdf
Vanessa Camilleri115 visualizações
Java Simplified: Understanding Programming Basics por Akshaj Vadakkath Joshy
Java Simplified: Understanding Programming BasicsJava Simplified: Understanding Programming Basics
Java Simplified: Understanding Programming Basics
Akshaj Vadakkath Joshy322 visualizações
The basics - information, data, technology and systems.pdf por JonathanCovena1
The basics - information, data, technology and systems.pdfThe basics - information, data, technology and systems.pdf
The basics - information, data, technology and systems.pdf
JonathanCovena1146 visualizações
MIXING OF PHARMACEUTICALS.pptx por Anupkumar Sharma
MIXING OF PHARMACEUTICALS.pptxMIXING OF PHARMACEUTICALS.pptx
MIXING OF PHARMACEUTICALS.pptx
Anupkumar Sharma95 visualizações
Narration lesson plan por TARIQ KHAN
Narration lesson planNarration lesson plan
Narration lesson plan
TARIQ KHAN61 visualizações
Class 9 lesson plans por TARIQ KHAN
Class 9 lesson plansClass 9 lesson plans
Class 9 lesson plans
TARIQ KHAN51 visualizações

D. Mayo JSM slides v2.pdf

  • 1. Sir David Cox’s Statistical Philosophy and Its Relevance to Today’s Statistical Controversies 0 15 July 1924 – 18 January 2022 Deborah G. Mayo Virginia Tech August 8, 2023 JSM 2023, Toronto
  • 2. Cox is celebrated for his pioneering work for which he received many awards… • International Prize in Statistics in 2016. • Copley Medal, the Royal Society’s highest award, 2010. • Knighted in 1985. • Gold Medal for Cancer Research for the "Proportional Hazard Regression Model" in 1990. 1
  • 3. 2 Importance of statistical foundations Alongside his contributions that have transformed applied statistics, Cox focused on the foundations of statistical inference “Without some systematic structure statistical methods for the analysis of data become a collection of tricks…” (2006, xiii)
  • 4. 3 How does a philosopher of statistics wind up working with a giant in statistics? In late summer 2003 (nearly 20 years ago today), I (boldly) invited him to be in a session I was forming on philosophy of statistics [Optimality: The Second Erich L. Lehmann Symposium, May 19–22, 2004; Rice University, Texas]
  • 5. 4 Discussions over the next several years: • “Frequentist Statistics as a Theory of Inductive Inference” (Mayo and Cox 2006) • “Objectivity and Conditionality in Frequentist Inference” (Cox and Mayo 2010)
  • 6. 5 “A Statistical Scientist Meets a Philosopher of Science” Sir David Cox: “…we both think foundations of statistical inference are important; why do you think that is?” Mayo: “…in statistics …we invariably cross into philosophical questions about empirical knowledge and inductive inference.” (Cox and Mayo 2011)
  • 7. 6 Key Feature of Cox’s Statistical Philosophy • We need to calibrate methods: how they would behave in (actual or hypothetical) repeated sampling “any proposed method of analysis that in repeated application would mostly give misleading answers is fatally flawed” (Cox 2006, 198) Weak repeated sampling principle
  • 8. 7 Two philosophical questions (still controversial): 1. How can the frequentist calibration be used as an evidential or inferential assessment (epistemological use)? 2. How can we ensure: “that the hypothetical long run used in calibration is relevant to the specific data” (Cox 2006, 198)
  • 9. 8 “…we begin with the core elements of significance testing in a version very strongly related to but in some respect different from both Fisherian and Neyman-Pearson approaches…” (2006, 80) Mayo & Cox 2006 “Frequentist statistics as a theory of inductive inference” R. A. Fisher J. Neyman E. Pearson
  • 10. Simple statistical significance tests 9 “…to test the conformity of the particular data under analysis with H0 in some respect: [We find a] test statistic T, such that • the larger the value of T the more inconsistent are the data with H0; …the p-value Pr(T ≥ tobs; H0)” (81)
  • 11. 10 Testing rationales: Inductive behavior vs inductive inference: Small p-values are taken as inconsistency with H0 (in the direction being probed) • Behavioristic: To follow a rule with low error rates. • Evidential: The specific data set provides evidence of discrepancy from H0. Why?
  • 12. (FEV) Frequentist Principle of Evidence: Mayo and Cox (2006) 11 “FEV(i): y is (strong) evidence against H0 i.e., (strong) evidence of discrepancy from H0 if and only if [were H0 correct] then, with high probability [(1-p)] this would have resulted in a less discordant result than [observed in] y” (82)
  • 13. Inferential interpretation 12 “a statistical version of the valid form…modus tollens [falsification] [here] probability arises …to characterize the ‘riskiness’ or probativeness or severity of the tests to which hypotheses are put…reminiscent of the philosophy of Karl Popper” (82)
  • 14. 13 FEV (ii) for interpreting non-statistically significant results, avoiding classic fallacies
  • 15. 14 Move away from a binary “significant / not significant” report • One way: consider testing several discrepancies from a reference hypothesis H0 (i.e., several Hi’s) and reporting how well or poorly warranted they are by the data
  • 16. This gets beyond long-standing and recent problems leading some to suggest restricting statistical significance tests to quality control decisions (behavioristic use) 15
  • 17. Analogy with measuring devices 16 “The significance test is a measuring device for accordance with a specified hypothesis calibrated …by its performance in repeated applications, …we employ the performance features to make inferences about aspects of the particular thing that is measured, aspects that the measuring tool is appropriately capable of revealing.” (84)
  • 18. 17 Relevance “in ensuring that the … calibration is relevant to the specific data under analysis, often taking due account of how the data were obtained.” (Cox 2006, 198)
  • 19. 18 Selection effects Selectively reporting effects, optional stopping, trying and trying again etc. may result in failing the weak repeated sampling principle.
  • 20. 19 “the probability of finding some such discordance or other may be high even under the null. Thus, following FEV(i), we would not have genuine evidence of discordance with the null, and unless the 𝑝-value is modified appropriately, the inference would be misleading.” (92) Selection effects
  • 21. 20 • “The probability is high we would not obtain a match with person i, if i were not the criminal; • By FEV, the match is good evidence that i is the criminal. • A non-match virtually excludes the person.” (94) Not all cases with selection or multiplicity call for adjustments Searching a full database for a DNA match:
  • 22. 21 Calibration is the source of objectivity: “Objectivity and Conditionality in Frequentist Inference” (Cox & Mayo 2010): “Frequentist methods achieve an objective connection to hypotheses about the data-generating process by being constrained and calibrated by the method’s error probabilities” (Cox and Mayo 2010, 277)
  • 23. 22 “Objectivity and Conditionality in Frequentist Inference” (Cox & Mayo 2010) When Cox proposed this second paper include “conditioning", I hesitated….
  • 24. Good long-run performance isn’t enough Cox’s (1958) “two measuring instruments” (“weighing machine” example): toss a fair coin to test the mean of a Normal distribution with either a very large, or a small, variance (both known); and you know which was used. 23
  • 25. Good long-run performance isn’t enough As Cox (1958) argued, once you know the precision of the tool used Weak conditionality principle 24 “the p-value assessment should be made conditional on the experiment actually run.” (Cox and Mayo 2010, 296)
  • 26. 25 • The “weighing machine” example caused “a subtle earthquake” in statistical foundations. Why? • The “best” test on long-run optimality grounds is the unconditional test.
  • 27. 26 Does “conditioning for relevance” lead to the unique case? “[Many argue a frequentist] “is faced with the following dilemma: either to deny the appropriateness of conditioning on the precision of the tool chosen by the toss of a coin, or else to embrace [a] principle* … that frequentist sampling distributions are irrelevant to inference [postdata]. This is a false dilemma. Conditioning is warranted in achieving objective frequentist goals.” (Cox & Mayo 2010, 298) - *Strong Likelihood Principle
  • 28. 27 • The (2010) paper with Cox was the basis for my later article in Statistical Science debunking the argument that appears to lead to the dilemma: Mayo (2014)
  • 29. 28 “Principles of Statistical Inference” (2006) “The object of the present book is to [describe the main] controversies over more foundational issues that have rumbled on …for more than 200 years”. • Cox (1958) “Some problems connected with statistical inference” • Cox (2020) “Statistical significance”
  • 30. 29 “His writing on statistical significance and p-values seemed to need repeating for each new generation” (Heather Battey and Nancy Reid 2022, IMF obituary for Sir David Cox)
  • 31. Cox advanced a great many other foundational issues I have not discussed: • relating statistical and scientific inference; • testing assumptions of models; • rival paradigms for interpreting probability, frequentist and Bayesian 30
  • 32. David R. Cox Foundations of Statistics Award Wednesday, August 9: 10:30-12:20 Metro Toronto Convention Centre Room: CC-701A Chair: Ron Wasserstein (ASA) Inaugural recipient: Nancy Reid, University of Toronto “The Importance of Foundations in Statistical Science” 31
  • 33. References • Battey, H. and Reid, N. (2022). “Obituary: David Cox 1924-2022”. Institute of Mathematical Statistics homepage, April 1, 2022. • Cox, D.R. (1958). Some problems connected with statistical inference. Ann. Math. Statis • Cox, D. R. (2006). Principles of Statistical Inference, CUP. • Cox, D. R. (2018) In gentle praise of significance tests. 2018 RSS annual meeting. (Link to Youtube video here.) • Cox, D. R. & Mayo D. G. (2010). Objectivity and Conditionality in Frequentist Inference. In Mayo & Spanos (eds), Error and Inference: Recent Exchanges on Experimental Reasoning, Reliability and the Objectivity and Rationality of Science, pp. 276-304. CUP • Cox, D. R. and Mayo, D. G. (2011). A Statistical Scientist Meets a Philosopher of Science: A Conversation between Sir David Cox and Deborah Mayo. Rationality, Markets and Morals (RMM) 2, 103–14. • Mayo, D. G. (1996). Error and the Growth of Experimental Knowledge, Chicago: Chicago University Press. (1998 Lakatos Prize) 32
  • 34. • Mayo, D. G. (2014). “On the Birnbaum Argument for the Strong Likelihood Principle,” (with discussion) Statistical Science 29(2) pp. 227-239, 261-266. • Mayo, D. G. (2018). Statistical Inference as Severe Testing: How to Get Beyond the Statistics Wars, Cambridge: Cambridge University Press. • Mayo, D. G. (2022). A Remembrance of Sir David Cox: ‘”In celebrating Cox’s immense contributions, we should recognise how much there is yet to learn from him”. Significance. (April 2022: 36). [Link to all 4 remembrances.] • Mayo, D.G. and Cox, D. R. (2006) “Frequentist Statistics as a Theory of Inductive Inference,” Optimality: The Second Erich L. Lehmann Symposium (ed. J. Rojo), Lecture Notes-Monograph series, IMS, Vol. 49: 77-97. [Reprincted in Mayo and Spanos 2010. • Reid, N. and Cox, D. R. (2015). On Some Principles of Statistical Inference. International Statistical Review 83(2): 293-308. 33