SlideShare uma empresa Scribd logo
1 de 13
Making ANOVA & Tukey HSD testing
clearer with Compact Letter Display
Gaetan Lion, September 3, 2022
1
Introduction
ANOVA is an incomplete test because it only tells you if several variables, or factors, have
different Means. But, it does not tell you which specific ones are truly different. Maybe
out of 5 variables (A, B, C, D, E) only E is truly different. And, this sole variable causes the
ANOVA F test to be statistically significant. The other 4 variables could have similar Means.
The Tukey Highly Significant Difference test (Tukey HSD) remedies the above situation. This
is a post-ANOVA test that tests whether each variable is different from any of the other
ones. And, Tukey HSD is conducted on a one-on-one matched variable basis just like an
unpaired t test. So, Tukey HSD tests the difference in Means for A vs. B, A vs. C, A vs. D, etc.
While the Tukey HSD test provides an abundance of supplementary information to ANOVA,
its output is overwhelming for non-statisticians.
Compact Letter Display (CLD) dramatically improves the clarity of the ANOVA & Tukey HSD
test output.
2
CLD basics
1. CLD identifies where the statistical significant differences are.
Each variable that shares a Mean that is not statistically different from another one will share the same letter.
For examples:
”a” “ab” “b”
The above indicates that the first variable “a” has a Mean that is statistically different from the third one “b”.
But, the second variable “ab” has a Mean that is not statistically different from either the first or third
variable.
”a” “ab” “bc” “c”
The above indicates that the first variable “a” has a Mean that is statistically different from the third variable
“bc” and the fourth one “c”. But, this first variable “a” is not statistically different from the second one “ab”.
2. CLD also ranks the variables in descending Mean order.
So, the variable with the highest Mean will be named “a” (if it is statistically different from all the others).
And, the variable with the lowest Mean will have the highest letter.
3
Working through an Example
We are going to test if the average rainfall in 5 West Coast cities is statistically
different. These cities are:
Eugene (OR)
Portland (OR)
San Francisco (CA)
Seattle (WA)
Spokane (WA)
The data is annual rainfall (1951 – 2021). The data source is NOAA.
4
The basic data summary
5
ANOVA F test.
So, we know that the Cities have statistically different Average Rainfall
But, as shown this ANOVA F test really does not tell you much if anything.
6
Tukey HSD test identifies the difference between specific matched cities
Tukey HSD test output
We can observe that two pairs of matched cities
have non-statistically difference in Means.
These are:
Portland – Seattle. p-value 0.54
San Francisco – Spokane. p-value 0.08
San Francisco – Spokane is not quite statistically
significant, when using an alpha level of 0.05.
7
Using a Box Plot to visualize this data
8
Top whisker = ~ 99.7th percentile
Top of box = 75th percentile
Line near middle of box =
Median or 50th percentile
Bottom of box = 25th percentile
Bottom whisker = ~ 0.3d
percentile
Box Plot explanation
This box plot has a lot of information. But, it is a bit challenging to readily identify the cities’ rainfall levels
that are different from each other vs. the ones that are similar.
There is more info on Box Plot visual
interpretation on the last slide in the Appendix
section.
Putting an information package together
Tukey HSD test output
All the info is there. But, it is
rather challenging to interpret.
9
Rearranging the basic data with CLD
The original data set just sorted in
alphabetical order is not that
informative.
The revised data set using CLD is a lot
more informative. The cities are
ranked by Mean rainfall descending
order. And, the CLD identifies readily
which cities have statistically
significant Mean differences, and
which do not.
Eugene has a statistically significant higher Mean rainfall than all the other cities. So, it is “a”.
Seattle and Portland have similar Mean rainfall (not statistically different). So, they both come in as “b”.
San Francisco and Spokane have less rainfall than the other cities. And, their respective rainfall levels
are similar. So, they come in as “c”.
10
Rearranging the Box Plots using CLD
Within this Box Plot, it is challenging to differentiate
cities’ rainfall relative levels and to figure out which
ones are similar vs. dissimilar.
This Box Plot using CLD is more informative. The cities’ rainfall
levels are sorted in descending order. The color intensity is tiered
with more dense texture reflecting higher rainfall levels. And,
the CLD letters identify which cities have similar rainfall levels
and which do not. 11
12
Upgraded information package with CLD
Using CLD, you can readily
identify the cities with similar
vs. dissimilar rainfall levels.
Appendix: Box Plot explanation
13

Mais conteúdo relacionado

Mais procurados

WETLAND MAPPING USING RS AND GIS
WETLAND MAPPING USING RS AND GISWETLAND MAPPING USING RS AND GIS
WETLAND MAPPING USING RS AND GIS
Abhiram Kanigolla
 

Mais procurados (10)

ssrassignment-180211113637 (1).pptx
ssrassignment-180211113637 (1).pptxssrassignment-180211113637 (1).pptx
ssrassignment-180211113637 (1).pptx
 
Cytogenetics iscn
Cytogenetics iscnCytogenetics iscn
Cytogenetics iscn
 
2014 mendelian-genetics
2014 mendelian-genetics2014 mendelian-genetics
2014 mendelian-genetics
 
Genetics and its history with gregor mendel law
Genetics and its history with  gregor mendel lawGenetics and its history with  gregor mendel law
Genetics and its history with gregor mendel law
 
Remote Sensing for Assessing Crop Residue Cover and Soil Tillage Intensity
Remote Sensing for Assessing Crop Residue Cover and Soil Tillage IntensityRemote Sensing for Assessing Crop Residue Cover and Soil Tillage Intensity
Remote Sensing for Assessing Crop Residue Cover and Soil Tillage Intensity
 
Male sterility in Plants
Male sterility in PlantsMale sterility in Plants
Male sterility in Plants
 
Spatial analysis and Analysis Tools
Spatial analysis and Analysis ToolsSpatial analysis and Analysis Tools
Spatial analysis and Analysis Tools
 
Linkage
LinkageLinkage
Linkage
 
WETLAND MAPPING USING RS AND GIS
WETLAND MAPPING USING RS AND GISWETLAND MAPPING USING RS AND GIS
WETLAND MAPPING USING RS AND GIS
 
10.3 gene pools and speciation
10.3 gene pools and speciation10.3 gene pools and speciation
10.3 gene pools and speciation
 

Semelhante a Compact Letter Display (CLD). How it works

WEEK 7 – HW 7 FALL 2017CORRELATIONREGRESSION PROBLEMS BASED O.docx
WEEK 7 – HW 7   FALL 2017CORRELATIONREGRESSION PROBLEMS BASED O.docxWEEK 7 – HW 7   FALL 2017CORRELATIONREGRESSION PROBLEMS BASED O.docx
WEEK 7 – HW 7 FALL 2017CORRELATIONREGRESSION PROBLEMS BASED O.docx
cockekeshia
 
For this assignment, use the aschooltest.sav dataset.The d
For this assignment, use the aschooltest.sav dataset.The dFor this assignment, use the aschooltest.sav dataset.The d
For this assignment, use the aschooltest.sav dataset.The d
MerrileeDelvalle969
 
chapter 9CorrelationLearning ObjectivesAfter readi.docx
chapter 9CorrelationLearning ObjectivesAfter readi.docxchapter 9CorrelationLearning ObjectivesAfter readi.docx
chapter 9CorrelationLearning ObjectivesAfter readi.docx
mccormicknadine86
 
Estimators for structural equation models of Likert scale data
Estimators for structural equation models of Likert scale dataEstimators for structural equation models of Likert scale data
Estimators for structural equation models of Likert scale data
Nick Stauner
 
Cs221 lecture3-fall11
Cs221 lecture3-fall11Cs221 lecture3-fall11
Cs221 lecture3-fall11
darwinrlo
 

Semelhante a Compact Letter Display (CLD). How it works (11)

10 Must-Know Statistical Concepts for Data Scientists.docx
10 Must-Know Statistical Concepts for Data Scientists.docx10 Must-Know Statistical Concepts for Data Scientists.docx
10 Must-Know Statistical Concepts for Data Scientists.docx
 
Exploring australian economy and diversity
Exploring australian economy and diversityExploring australian economy and diversity
Exploring australian economy and diversity
 
WEEK 7 – HW 7 FALL 2017CORRELATIONREGRESSION PROBLEMS BASED O.docx
WEEK 7 – HW 7   FALL 2017CORRELATIONREGRESSION PROBLEMS BASED O.docxWEEK 7 – HW 7   FALL 2017CORRELATIONREGRESSION PROBLEMS BASED O.docx
WEEK 7 – HW 7 FALL 2017CORRELATIONREGRESSION PROBLEMS BASED O.docx
 
Analyzing Compare and Contrast Essays: DNA Profiling
Analyzing Compare and Contrast Essays: DNA ProfilingAnalyzing Compare and Contrast Essays: DNA Profiling
Analyzing Compare and Contrast Essays: DNA Profiling
 
For this assignment, use the aschooltest.sav dataset.The d
For this assignment, use the aschooltest.sav dataset.The dFor this assignment, use the aschooltest.sav dataset.The d
For this assignment, use the aschooltest.sav dataset.The d
 
chapter 9CorrelationLearning ObjectivesAfter readi.docx
chapter 9CorrelationLearning ObjectivesAfter readi.docxchapter 9CorrelationLearning ObjectivesAfter readi.docx
chapter 9CorrelationLearning ObjectivesAfter readi.docx
 
Quantitative analysis: A brief introduction
Quantitative analysis: A brief introductionQuantitative analysis: A brief introduction
Quantitative analysis: A brief introduction
 
Estimators for structural equation models of Likert scale data
Estimators for structural equation models of Likert scale dataEstimators for structural equation models of Likert scale data
Estimators for structural equation models of Likert scale data
 
A brief introduction to quantitative analysis
A brief introduction to quantitative analysisA brief introduction to quantitative analysis
A brief introduction to quantitative analysis
 
Cs221 lecture3-fall11
Cs221 lecture3-fall11Cs221 lecture3-fall11
Cs221 lecture3-fall11
 
Presenting data
Presenting dataPresenting data
Presenting data
 

Mais de Gaetan Lion

Mais de Gaetan Lion (20)

DRU projections testing.pptx
DRU projections testing.pptxDRU projections testing.pptx
DRU projections testing.pptx
 
Climate Change in 24 US Cities
Climate Change in 24 US CitiesClimate Change in 24 US Cities
Climate Change in 24 US Cities
 
CalPERS pensions vs. Social Security
CalPERS pensions vs. Social SecurityCalPERS pensions vs. Social Security
CalPERS pensions vs. Social Security
 
Recessions.pptx
Recessions.pptxRecessions.pptx
Recessions.pptx
 
Inequality in the United States
Inequality in the United StatesInequality in the United States
Inequality in the United States
 
Housing Price Models
Housing Price ModelsHousing Price Models
Housing Price Models
 
Global Aging.pdf
Global Aging.pdfGlobal Aging.pdf
Global Aging.pdf
 
Cryptocurrencies as an asset class
Cryptocurrencies as an asset classCryptocurrencies as an asset class
Cryptocurrencies as an asset class
 
Can you Deep Learn the Stock Market?
Can you Deep Learn the Stock Market?Can you Deep Learn the Stock Market?
Can you Deep Learn the Stock Market?
 
Can Treasury Inflation Protected Securities predict Inflation?
Can Treasury Inflation Protected Securities predict Inflation?Can Treasury Inflation Protected Securities predict Inflation?
Can Treasury Inflation Protected Securities predict Inflation?
 
How overvalued is the Stock Market?
How overvalued is the Stock Market? How overvalued is the Stock Market?
How overvalued is the Stock Market?
 
The relationship between the Stock Market and Interest Rates
The relationship between the Stock Market and Interest RatesThe relationship between the Stock Market and Interest Rates
The relationship between the Stock Market and Interest Rates
 
Life expectancy
Life expectancyLife expectancy
Life expectancy
 
Comparing R vs. Python for data visualization
Comparing R vs. Python for data visualizationComparing R vs. Python for data visualization
Comparing R vs. Python for data visualization
 
Will Stock Markets survive in 200 years?
Will Stock Markets survive in 200 years?Will Stock Markets survive in 200 years?
Will Stock Markets survive in 200 years?
 
Standardization
StandardizationStandardization
Standardization
 
Is Tom Brady the greatest quarterback?
Is Tom Brady the greatest quarterback?Is Tom Brady the greatest quarterback?
Is Tom Brady the greatest quarterback?
 
Regularization why you should avoid them
Regularization why you should avoid themRegularization why you should avoid them
Regularization why you should avoid them
 
Basketball the 3 pt game
Basketball the 3 pt gameBasketball the 3 pt game
Basketball the 3 pt game
 
Japan vs. US comparison on numerous dimensions
Japan vs. US comparison on numerous dimensionsJapan vs. US comparison on numerous dimensions
Japan vs. US comparison on numerous dimensions
 

Último

Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
chadhar227
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
HyderabadDolls
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
Health
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
gajnagarg
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
vexqp
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
ranjankumarbehera14
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
HyderabadDolls
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
nirzagarg
 

Último (20)

Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
 
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat  9332606886Call Girls Advance Cash On Delivery Ser...Top Call Girls in Balaghat  9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 

Compact Letter Display (CLD). How it works

  • 1. Making ANOVA & Tukey HSD testing clearer with Compact Letter Display Gaetan Lion, September 3, 2022 1
  • 2. Introduction ANOVA is an incomplete test because it only tells you if several variables, or factors, have different Means. But, it does not tell you which specific ones are truly different. Maybe out of 5 variables (A, B, C, D, E) only E is truly different. And, this sole variable causes the ANOVA F test to be statistically significant. The other 4 variables could have similar Means. The Tukey Highly Significant Difference test (Tukey HSD) remedies the above situation. This is a post-ANOVA test that tests whether each variable is different from any of the other ones. And, Tukey HSD is conducted on a one-on-one matched variable basis just like an unpaired t test. So, Tukey HSD tests the difference in Means for A vs. B, A vs. C, A vs. D, etc. While the Tukey HSD test provides an abundance of supplementary information to ANOVA, its output is overwhelming for non-statisticians. Compact Letter Display (CLD) dramatically improves the clarity of the ANOVA & Tukey HSD test output. 2
  • 3. CLD basics 1. CLD identifies where the statistical significant differences are. Each variable that shares a Mean that is not statistically different from another one will share the same letter. For examples: ”a” “ab” “b” The above indicates that the first variable “a” has a Mean that is statistically different from the third one “b”. But, the second variable “ab” has a Mean that is not statistically different from either the first or third variable. ”a” “ab” “bc” “c” The above indicates that the first variable “a” has a Mean that is statistically different from the third variable “bc” and the fourth one “c”. But, this first variable “a” is not statistically different from the second one “ab”. 2. CLD also ranks the variables in descending Mean order. So, the variable with the highest Mean will be named “a” (if it is statistically different from all the others). And, the variable with the lowest Mean will have the highest letter. 3
  • 4. Working through an Example We are going to test if the average rainfall in 5 West Coast cities is statistically different. These cities are: Eugene (OR) Portland (OR) San Francisco (CA) Seattle (WA) Spokane (WA) The data is annual rainfall (1951 – 2021). The data source is NOAA. 4
  • 5. The basic data summary 5
  • 6. ANOVA F test. So, we know that the Cities have statistically different Average Rainfall But, as shown this ANOVA F test really does not tell you much if anything. 6
  • 7. Tukey HSD test identifies the difference between specific matched cities Tukey HSD test output We can observe that two pairs of matched cities have non-statistically difference in Means. These are: Portland – Seattle. p-value 0.54 San Francisco – Spokane. p-value 0.08 San Francisco – Spokane is not quite statistically significant, when using an alpha level of 0.05. 7
  • 8. Using a Box Plot to visualize this data 8 Top whisker = ~ 99.7th percentile Top of box = 75th percentile Line near middle of box = Median or 50th percentile Bottom of box = 25th percentile Bottom whisker = ~ 0.3d percentile Box Plot explanation This box plot has a lot of information. But, it is a bit challenging to readily identify the cities’ rainfall levels that are different from each other vs. the ones that are similar. There is more info on Box Plot visual interpretation on the last slide in the Appendix section.
  • 9. Putting an information package together Tukey HSD test output All the info is there. But, it is rather challenging to interpret. 9
  • 10. Rearranging the basic data with CLD The original data set just sorted in alphabetical order is not that informative. The revised data set using CLD is a lot more informative. The cities are ranked by Mean rainfall descending order. And, the CLD identifies readily which cities have statistically significant Mean differences, and which do not. Eugene has a statistically significant higher Mean rainfall than all the other cities. So, it is “a”. Seattle and Portland have similar Mean rainfall (not statistically different). So, they both come in as “b”. San Francisco and Spokane have less rainfall than the other cities. And, their respective rainfall levels are similar. So, they come in as “c”. 10
  • 11. Rearranging the Box Plots using CLD Within this Box Plot, it is challenging to differentiate cities’ rainfall relative levels and to figure out which ones are similar vs. dissimilar. This Box Plot using CLD is more informative. The cities’ rainfall levels are sorted in descending order. The color intensity is tiered with more dense texture reflecting higher rainfall levels. And, the CLD letters identify which cities have similar rainfall levels and which do not. 11
  • 12. 12 Upgraded information package with CLD Using CLD, you can readily identify the cities with similar vs. dissimilar rainfall levels.
  • 13. Appendix: Box Plot explanation 13