SlideShare a Scribd company logo
1 of 10
The Chi Square Test
• A statistical method used to determine
goodness of fit
– Goodness of fit refers to how close the observed
data are to those predicted from a hypothesis

• Note:
– The chi square test does not prove that a
hypothesis is correct
• It evaluates to what extent the data and the hypothesis
have a good fit
The Chi Square Test (we will cover this
in lab; the following slides will be useful to
review after that lab)

• The general formula is

χ2 = Σ

(O – E)2
E

• where
– O = observed data in each category
– E = observed data in each category based on the
experimenter’s hypothesis
Σ = Sum of the calculations for each category
• Consider the following example in Drosophila
melanogaster
• Gene affecting wing shape
– c+ = Normal wing
– c = Curved wing

• Gene affecting body color
– e+ = Normal (gray)
– e = ebony

• Note:
– The wild-type allele is designated with a + sign
– Recessive mutant alleles are designated with lowercase
letters

• The Cross:
– A cross is made between two true-breeding flies (c+c+e+e+
and ccee). The flies of the F1 generation are then allowed
to mate with each other to produce an F2 generation.
• The outcome
– F1 generation
• All offspring have straight wings and gray bodies
– F2 generation
•
•
•
•
•

193 straight wings, gray bodies
69 straight wings, ebony bodies
64 curved wings, gray bodies
26 curved wings, ebony bodies
352 total flies

• Applying the chi square test
– Step 1: Propose a null hypothesis (Ho) that allows us to
calculate the expected values based on Mendel’s laws
• The two traits are independently assorting
– Step 2: Calculate the expected values of the four
phenotypes, based on the hypothesis
• According to our hypothesis, there should be a
9:3:3:1 ratio on the F2 generation
Phenotype

Expected
probability

Expected
number

Observed number

straight wings,
gray bodies

9/16

9/16 X 352 = 198

193

straight wings,
ebony bodies

3/16

3/16 X 352 = 66

64

curved wings,
gray bodies

3/16

3/16 X 352 = 66

62

curved wings,
ebony bodies

1/16

1/16 X 352 = 22

24
– Step 3: Apply the chi square formula

χ2 =
χ2 =

(O1 – E1)2
E1
(193 – 198)2
198

+

+

(O2 – E2)2
E2
(69 – 66)2
66

χ2 = 0.13 + 0.14 + 0.06 + 0.73
χ = 1.06
2

+

+

(O3 – E3)2
E3
(64 – 66)2
66

+

+

(O4 – E4)2
E4
(26 – 22)2
22

Expected
number

Observed
number

198

193

66

64

66

62

22

24
• Step 4: Interpret the chi square value
– The calculated chi square value can be used to obtain
probabilities, or P values, from a chi square table
• These probabilities allow us to determine the likelihood that the
observed deviations are due to random chance alone

– Low chi square values indicate a high probability that the
observed deviations could be due to random chance alone
– High chi square values indicate a low probability that the
observed deviations are due to random chance alone
– If the chi square value results in a probability that is less than
0.05 (ie: less than 5%) it is considered statistically
significant
• The hypothesis is rejected
• Step 4: Interpret the chi square value
– Before we can use the chi square table, we have to
determine the degrees of freedom (df)
• The df is a measure of the number of categories that are
independent of each other
• If you know the 3 of the 4 categories you can deduce the
4th (total number of progeny – categories 1-3)
• df = n – 1
– where n = total number of categories
• In our experiment, there are four phenotypes/categories
– Therefore, df = 4 – 1 = 3
– Refer to Table 2.1
1.06
• Step 4: Interpret the chi square value
– With df = 3, the chi square value of 1.06 is slightly greater
than 1.005 (which corresponds to P-value = 0.80)
– P-value = 0.80 means that Chi-square values equal to or
greater than 1.005 are expected to occur 80% of the time
due to random chance alone; that is, when the null
hypothesis is true.
– Therefore, it is quite probable that the deviations between
the observed and expected values in this experiment can be
explained by random sampling error and the null hypothesis
is not rejected. What was the null hypothesis?

More Related Content

What's hot

Research method ch08 statistical methods 2 anova
Research method ch08 statistical methods 2 anovaResearch method ch08 statistical methods 2 anova
Research method ch08 statistical methods 2 anova
naranbatn
 
Inferential statistics.ppt
Inferential statistics.pptInferential statistics.ppt
Inferential statistics.ppt
Nursing Path
 

What's hot (20)

Chi square Test
Chi square TestChi square Test
Chi square Test
 
Goodness of-fit
Goodness of-fit  Goodness of-fit
Goodness of-fit
 
Chi squared test
Chi squared testChi squared test
Chi squared test
 
Standard error-Biostatistics
Standard error-BiostatisticsStandard error-Biostatistics
Standard error-Biostatistics
 
Student's T-Test
Student's T-TestStudent's T-Test
Student's T-Test
 
Stat 3203 -pps sampling
Stat 3203 -pps samplingStat 3203 -pps sampling
Stat 3203 -pps sampling
 
Research method ch08 statistical methods 2 anova
Research method ch08 statistical methods 2 anovaResearch method ch08 statistical methods 2 anova
Research method ch08 statistical methods 2 anova
 
Student's T-test, Paired T-Test, ANOVA & Proportionate Test
Student's T-test, Paired T-Test, ANOVA & Proportionate TestStudent's T-test, Paired T-Test, ANOVA & Proportionate Test
Student's T-test, Paired T-Test, ANOVA & Proportionate Test
 
Chi – square test
Chi – square testChi – square test
Chi – square test
 
Normal distribution curve
Normal distribution curveNormal distribution curve
Normal distribution curve
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statistics
 
Sample Size Determination
Sample Size DeterminationSample Size Determination
Sample Size Determination
 
Unit 4
Unit 4Unit 4
Unit 4
 
Chi square test final
Chi square test finalChi square test final
Chi square test final
 
Basic Concepts of Probability
Basic Concepts of ProbabilityBasic Concepts of Probability
Basic Concepts of Probability
 
Chisquare
ChisquareChisquare
Chisquare
 
Paired t Test
Paired t TestPaired t Test
Paired t Test
 
Inferential statistics.ppt
Inferential statistics.pptInferential statistics.ppt
Inferential statistics.ppt
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Confidence interval
Confidence intervalConfidence interval
Confidence interval
 

Viewers also liked

Statistical concepts
Statistical conceptsStatistical concepts
Statistical concepts
Carlo Magno
 
Significance tests
Significance testsSignificance tests
Significance tests
Jinho Choi
 
Quantitative techniques in research
Quantitative techniques in researchQuantitative techniques in research
Quantitative techniques in research
Carlo Magno
 
Aron chpt 11 ed (2)
Aron chpt 11 ed (2)Aron chpt 11 ed (2)
Aron chpt 11 ed (2)
Sandra Nicks
 
Spearman Rank Correlation Presentation
Spearman Rank Correlation PresentationSpearman Rank Correlation Presentation
Spearman Rank Correlation Presentation
cae_021
 

Viewers also liked (20)

Chi square test
Chi square testChi square test
Chi square test
 
The Chi Square Test
The Chi Square TestThe Chi Square Test
The Chi Square Test
 
Statistical concepts
Statistical conceptsStatistical concepts
Statistical concepts
 
Significance tests
Significance testsSignificance tests
Significance tests
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Lesson2
Lesson2Lesson2
Lesson2
 
Quantitative techniques in research
Quantitative techniques in researchQuantitative techniques in research
Quantitative techniques in research
 
Chi square
Chi squareChi square
Chi square
 
Aron chpt 11 ed (2)
Aron chpt 11 ed (2)Aron chpt 11 ed (2)
Aron chpt 11 ed (2)
 
Reporting chi square goodness of fit test of independence in apa
Reporting chi square goodness of fit test of independence in apaReporting chi square goodness of fit test of independence in apa
Reporting chi square goodness of fit test of independence in apa
 
Friedman Test- A Presentation
Friedman Test- A PresentationFriedman Test- A Presentation
Friedman Test- A Presentation
 
Ang Mapahitas-on nga Mariposa
Ang Mapahitas-on nga MariposaAng Mapahitas-on nga Mariposa
Ang Mapahitas-on nga Mariposa
 
Chi squared test
Chi squared testChi squared test
Chi squared test
 
Chi square analysis
Chi square analysisChi square analysis
Chi square analysis
 
Chi-Square Test of Independence
Chi-Square Test of IndependenceChi-Square Test of Independence
Chi-Square Test of Independence
 
Reporting Pearson Correlation Test of Independence in APA
Reporting Pearson Correlation Test of Independence in APAReporting Pearson Correlation Test of Independence in APA
Reporting Pearson Correlation Test of Independence in APA
 
GCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s RankGCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s Rank
 
Spearman Rank Correlation Presentation
Spearman Rank Correlation PresentationSpearman Rank Correlation Presentation
Spearman Rank Correlation Presentation
 
Correlation
CorrelationCorrelation
Correlation
 
Pearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionPearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear Regression
 

Similar to The chi square_test

Str t-test1
Str   t-test1Str   t-test1
Str t-test1
iamkim
 
Chi square[1]
Chi square[1]Chi square[1]
Chi square[1]
sbarkanic
 
Geneticschapter2part2 140126121602-phpapp02
Geneticschapter2part2 140126121602-phpapp02Geneticschapter2part2 140126121602-phpapp02
Geneticschapter2part2 140126121602-phpapp02
Cleophas Rwemera
 

Similar to The chi square_test (20)

Chi sequare
Chi sequareChi sequare
Chi sequare
 
The Chi Square Test
The Chi Square TestThe Chi Square Test
The Chi Square Test
 
Chi square test
Chi square test Chi square test
Chi square test
 
Probability Theory for Data Scientists
Probability Theory for Data ScientistsProbability Theory for Data Scientists
Probability Theory for Data Scientists
 
Chi square distribution and analysis of frequencies.pptx
Chi square distribution and analysis of frequencies.pptxChi square distribution and analysis of frequencies.pptx
Chi square distribution and analysis of frequencies.pptx
 
random variable and distribution
random variable and distributionrandom variable and distribution
random variable and distribution
 
Goodness of fit (ppt)
Goodness of fit (ppt)Goodness of fit (ppt)
Goodness of fit (ppt)
 
Business research methods 2
Business research methods 2Business research methods 2
Business research methods 2
 
Str t-test1
Str   t-test1Str   t-test1
Str t-test1
 
Chapter7ppt.pdf
Chapter7ppt.pdfChapter7ppt.pdf
Chapter7ppt.pdf
 
Top schools in India | Delhi NCR | Noida |
Top schools in India | Delhi NCR | Noida | Top schools in India | Delhi NCR | Noida |
Top schools in India | Delhi NCR | Noida |
 
Chi square[1]
Chi square[1]Chi square[1]
Chi square[1]
 
Chi square test
Chi square testChi square test
Chi square test
 
Random Error Theory
Random Error TheoryRandom Error Theory
Random Error Theory
 
Geneticschapter2part2 140126121602-phpapp02
Geneticschapter2part2 140126121602-phpapp02Geneticschapter2part2 140126121602-phpapp02
Geneticschapter2part2 140126121602-phpapp02
 
Probability and Statistics - Week 1
Probability and Statistics - Week 1Probability and Statistics - Week 1
Probability and Statistics - Week 1
 
Probablity
ProbablityProbablity
Probablity
 
Top schools in ghaziabad
Top schools in ghaziabadTop schools in ghaziabad
Top schools in ghaziabad
 
Probability
ProbabilityProbability
Probability
 
Lect w7 t_test_amp_chi_test
Lect w7 t_test_amp_chi_testLect w7 t_test_amp_chi_test
Lect w7 t_test_amp_chi_test
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

The chi square_test

  • 1. The Chi Square Test • A statistical method used to determine goodness of fit – Goodness of fit refers to how close the observed data are to those predicted from a hypothesis • Note: – The chi square test does not prove that a hypothesis is correct • It evaluates to what extent the data and the hypothesis have a good fit
  • 2. The Chi Square Test (we will cover this in lab; the following slides will be useful to review after that lab) • The general formula is χ2 = Σ (O – E)2 E • where – O = observed data in each category – E = observed data in each category based on the experimenter’s hypothesis Σ = Sum of the calculations for each category
  • 3. • Consider the following example in Drosophila melanogaster • Gene affecting wing shape – c+ = Normal wing – c = Curved wing • Gene affecting body color – e+ = Normal (gray) – e = ebony • Note: – The wild-type allele is designated with a + sign – Recessive mutant alleles are designated with lowercase letters • The Cross: – A cross is made between two true-breeding flies (c+c+e+e+ and ccee). The flies of the F1 generation are then allowed to mate with each other to produce an F2 generation.
  • 4. • The outcome – F1 generation • All offspring have straight wings and gray bodies – F2 generation • • • • • 193 straight wings, gray bodies 69 straight wings, ebony bodies 64 curved wings, gray bodies 26 curved wings, ebony bodies 352 total flies • Applying the chi square test – Step 1: Propose a null hypothesis (Ho) that allows us to calculate the expected values based on Mendel’s laws • The two traits are independently assorting
  • 5. – Step 2: Calculate the expected values of the four phenotypes, based on the hypothesis • According to our hypothesis, there should be a 9:3:3:1 ratio on the F2 generation Phenotype Expected probability Expected number Observed number straight wings, gray bodies 9/16 9/16 X 352 = 198 193 straight wings, ebony bodies 3/16 3/16 X 352 = 66 64 curved wings, gray bodies 3/16 3/16 X 352 = 66 62 curved wings, ebony bodies 1/16 1/16 X 352 = 22 24
  • 6. – Step 3: Apply the chi square formula χ2 = χ2 = (O1 – E1)2 E1 (193 – 198)2 198 + + (O2 – E2)2 E2 (69 – 66)2 66 χ2 = 0.13 + 0.14 + 0.06 + 0.73 χ = 1.06 2 + + (O3 – E3)2 E3 (64 – 66)2 66 + + (O4 – E4)2 E4 (26 – 22)2 22 Expected number Observed number 198 193 66 64 66 62 22 24
  • 7. • Step 4: Interpret the chi square value – The calculated chi square value can be used to obtain probabilities, or P values, from a chi square table • These probabilities allow us to determine the likelihood that the observed deviations are due to random chance alone – Low chi square values indicate a high probability that the observed deviations could be due to random chance alone – High chi square values indicate a low probability that the observed deviations are due to random chance alone – If the chi square value results in a probability that is less than 0.05 (ie: less than 5%) it is considered statistically significant • The hypothesis is rejected
  • 8. • Step 4: Interpret the chi square value – Before we can use the chi square table, we have to determine the degrees of freedom (df) • The df is a measure of the number of categories that are independent of each other • If you know the 3 of the 4 categories you can deduce the 4th (total number of progeny – categories 1-3) • df = n – 1 – where n = total number of categories • In our experiment, there are four phenotypes/categories – Therefore, df = 4 – 1 = 3 – Refer to Table 2.1
  • 10. • Step 4: Interpret the chi square value – With df = 3, the chi square value of 1.06 is slightly greater than 1.005 (which corresponds to P-value = 0.80) – P-value = 0.80 means that Chi-square values equal to or greater than 1.005 are expected to occur 80% of the time due to random chance alone; that is, when the null hypothesis is true. – Therefore, it is quite probable that the deviations between the observed and expected values in this experiment can be explained by random sampling error and the null hypothesis is not rejected. What was the null hypothesis?