SlideShare uma empresa Scribd logo
1 de 41
11.1
Nonstatistical Hypothesis Testing…
A criminal trial is an example of hypothesis testing without
the statistics.
In a trial a jury must decide between two hypotheses. The
null hypothesis is
H0: The defendant is innocent
The alternative hypothesis or research hypothesis is
H1: The defendant is guilty
The jury does not know which hypothesis is true. They must
make a decision on the basis of evidence presented.
11.2
Nonstatistical Hypothesis Testing…
In the language of statistics convicting the defendant is called rejecting
the null hypothesis in favor of the alternative hypothesis. That is, the
jury is saying that there is enough evidence to conclude that the
defendant is guilty (i.e., there is enough evidence to support the
alternative hypothesis).
If the jury acquits it is stating that there is not enough evidence to
support the alternative hypothesis. Notice that the jury is not saying
that the defendant is innocent, only that there is not enough evidence to
support the alternative hypothesis. That is why we never say that we
accept the null hypothesis, although most people in industry will say
“We accept the null hypothesis”
11.3
Nonstatistical Hypothesis Testing…
There are two possible errors.
A Type I error occurs when we reject a true null hypothesis.
That is, a Type I error occurs when the jury convicts an
innocent person. We would want the probability of this type
of error [maybe 0.001 – beyond a reasonable doubt] to be
very small for a criminal trial where a conviction results in
the death penalty, whereas for a civil trial, where conviction
might result in someone having to “pay for damages to a
wrecked auto”,we would be willing for the probability to be
larger [0.49 – preponderance of the evidence ]
P(Type I error) = α [usually 0.05 or 0.01]
11.4
Nonstatistical Hypothesis Testing…
A Type II error occurs when we don’t reject a false null
hypothesis [accept the null hypothesis]. That occurs when a
guilty defendant is acquitted.
In practice, this type of error is by far the most serious
mistake we normally make. For example, if we test the
hypothesis that the amount of medication in a heart pill is
equal to a value which will cure your heart problem and
“accept the hull hypothesis that the amount is ok”. Later on
we find out that the average amount is WAY too large and
people die from “too much medication” [I wish we had
rejected the hypothesis and threw the pills in the trash can],
it’s too late because we shipped the pills to the public.
11.5
Nonstatistical Hypothesis Testing…
The probability of a Type I error is denoted as α (Greek
letter alpha). The probability of a type II error is β (Greek
letter beta).
The two probabilities are inversely related. Decreasing one
increases the other, for a fixed sample size.
In other words, you can’t have α and β both real small for
any old sample size. You may have to take a much larger
sample size, or in the court example, you need much more
evidence.
11.6
Types of Errors…
A Type I error occurs when we reject a true null hypothesis
(i.e. Reject H0 when it is TRUE)
A Type II error occurs when we don’t reject a false null
hypothesis (i.e. Do NOT reject H0 when it is FALSE)
H0 T F
Reject I
Reject II
11.7
Nonstatistical Hypothesis Testing…
The critical concepts are theses:
1. There are two hypotheses, the null and the alternative hypotheses.
2. The procedure begins with the assumption that the null hypothesis is
true.
3. The goal is to determine whether there is enough evidence to infer
that the alternative hypothesis is true, or the null is not likely to be
true.
4. There are two possible decisions:
Conclude that there is enough evidence to support the alternative
hypothesis. Reject the null.
Conclude that there is not enough evidence to support the
alternative hypothesis. Fail to reject the null.
11.8
Concepts of Hypothesis Testing (1)…
The two hypotheses are called the null hypothesis and the
other the alternative or research hypothesis. The usual
notation is:
H0: — the ‘null’ hypothesis
H1: — the ‘alternative’ or ‘research’ hypothesis
The null hypothesis (H0) will always state that the parameter
equals the value specified in the alternative hypothesis (H1)
pronounced
H “nought”
11.9
Concepts of Hypothesis Testing…
Consider mean demand for computers during assembly lead
time. Rather than estimate the mean demand, our operations
manager wants to know whether the mean is different from
350 units. In other words, someone is claiming that the mean
time is 350 units and we want to check this claim out to see
if it appears reasonable. We can rephrase this request into a
test of the hypothesis:
H0: = 350
Thus, our research hypothesis becomes:
H1: 350≠
Recall that the standard deviation [σ]was assumed to be 75,
the sample size [n] was 25, and the sample mean [ ] was
calculated to be 370.16
11.10
Concepts of Hypothesis Testing…
For example, if we’re trying to decide whether the mean is
not equal to 350, a large value of (say, 600) would provide
enough evidence.
If is close to 350 (say, 355) we could not say that this
provides a great deal of evidence to infer that the population
mean is different than 350.
11.11
Concepts of Hypothesis Testing (4)…
The two possible decisions that can be made:
Conclude that there is enough evidence to support the alternative
hypothesis
(also stated as: reject the null hypothesis in favor of the alternative)
Conclude that there is not enough evidence to support the
alternative hypothesis
(also stated as: failing to reject the null hypothesis in favor of the
alternative)
NOTE: we do not say that we accept the null hypothesis if a
statistician is around…
11.12
Concepts of Hypothesis Testing (2)…
The testing procedure begins with the assumption that the
null hypothesis is true.
Thus, until we have further statistical evidence, we will
assume:
H0: = 350 (assumed to be TRUE)
The next step will be to determine the sampling distribution
of the sample mean assuming the true mean is 350.
is normal with 350
75/SQRT(25) = 15
11.13
Is the Sample Mean in the Guts of the Sampling Distribution??
11.14
Three ways to determine this: First way
1. Unstandardized test statistic: Is in the guts of the
sampling distribution? Depends on what you define as
the “guts” of the sampling distribution.
If we define the guts as the center 95% of the distribution
[this means α = 0.05], then the critical values that define
the guts will be 1.96 standard deviations of X-Bar on
either side of the mean of the sampling distribution
[350], or
UCV = 350 + 1.96*15 = 350 + 29.4 = 379.4
LCV = 350 – 1.96*15 = 350 – 29.4 = 320.6
11.15
1. Unstandardized Test Statistic Approach
11.16
Three ways to determine this: Second way
2. Standardized test statistic: Since we defined the “guts” of
the sampling distribution to be the center 95% [α = 0.05],
If the Z-Score for the sample mean is greater than
1.96, we know that will be in the reject region on the right
side or
If the Z-Score for the sample mean is less than -1.97,
we know that will be in the reject region on the left side.
Z = ( - )/ = (370.16 – 350)/15 = 1.344
Is this Z-Score in the guts of the sampling distribution???
11.17
2. Standardized Test Statistic Approach
11.18
Three ways to determine this: Third way
3. The p-value approach (which is generally used with a computer and
statistical software): Increase the “Rejection Region” until it
“captures” the sample mean.
For this example, since is to the right of the mean, calculate
P( > 370.16) = P(Z > 1.344) = 0.0901
Since this is a two tailed test, you must double this area for the p-value.
p-value = 2*(0.0901) = 0.1802
Since we defined the guts as the center 95% [α = 0.05], the reject
region is the other 5%. Since our sample mean, , is in the 18.02%
region, it cannot be in our 5% rejection region [α = 0.05].
11.19
3. p-value approach
11.20
Statistical Conclusions:
Unstandardized Test Statistic:
Since LCV (320.6) < (370.16) < UCV (379.4), we
reject the null hypothesis at a 5% level of significance.
Standardized Test Statistic:
Since -Zα/2(-1.96) < Z(1.344) < Zα/2 (1.96), we fail to reject
the null hypothesis at a 5% level of significance.
P-value:
Since p-value (0.1802) > 0.05 [α], we fail to reject the
hull hypothesis at a 5% level of significance.
11.21
Example…
A department store manager determines that a new billing
system will be cost-effective only if the mean monthly
account is more than $170.
A random sample of 400 monthly accounts is drawn, for
which the sample mean is $178. The accounts are
approximately normally distributed with a standard deviation
of $65.
Can we conclude that the new system will be cost-effective?
11.22
Example…
The system will be cost effective if the mean account
balance for all customers is greater than $170.
We express this belief as a our research hypothesis, that is:
H1: > 170 (this is what we want to determine)
Thus, our null hypothesis becomes:
H0: = 170 (this specifies a single value for the
parameter of interest) – Actually H0: μ < 170
11.23
Example…
What we want to show:
H1: > 170
H0: < 170 (we’ll assume this is true)
Normally we put Ho first.
We know:
n = 400,
= 178, and
= 65
= 65/SQRT(400) = 3.25
α = 0.05
11.24
Example… Rejection Region…
The rejection region is a range of values such that if the test
statistic falls into that range, we decide to reject the null
hypothesis in favor of the alternative hypothesis.
is the critical value of to reject H0.
11.25
Example…
At a 5% significance level (i.e. =0.05), we get [all α in one tail]
Zα = Z0.05 = 1.645
Therefore, UCV = 170 + 1.645*3.25 = 175.35
Since our sample mean (178) is greater than the critical value we
calculated (175.35), we reject the null hypothesis in favor of H1
OR
(>1.645) Reject null
OR
p-value = P( > 178) = P(Z > 2.46) = 0.0069 < 0.05 Reject null
11.26
Example… The Big Picture…
=175.34
=178
H1: > 170
H0: = 170
Reject H0 in favor of
11.27
Interpreting the p-value…
The smaller the p-value, the more statistical evidence exists
to support the alternative hypothesis.
•If the p-value is less than 1%, there is overwhelming
evidence that supports the alternative hypothesis.
•If the p-value is between 1% and 5%, there is a strong
evidence that supports the alternative hypothesis.
•If the p-value is between 5% and 10% there is a weak
evidence that supports the alternative hypothesis.
•If the p-value exceeds 10%, there is no evidence that
supports the alternative hypothesis.
We observe a p-value of .0069, hence there is
overwhelming evidence to support H1: > 170.
11.28
Interpreting the p-value…
Overwhelming Evidence
(Highly Significant)
Strong Evidence
(Significant)
Weak Evidence
(Not Significant)
No Evidence
(Not Significant)
0 .01 .05 .10
p=.0069
11.29
Conclusions of a Test of Hypothesis…
If we reject the null hypothesis, we conclude that there is
enough evidence to infer that the alternative hypothesis is
true.
If we fail to reject the null hypothesis, we conclude that there
is not enough statistical evidence to infer that the alternative
hypothesis is true. This does not mean that we have proven
that the null hypothesis is true!
Keep in mind that committing a Type I error OR a Type II
error can be VERY bad depending on the problem.
11.30
One tail test with rejection region on right
The last example was a one tail test, because the rejection
region is located in only one tail of the sampling distribution:
More correctly, this was an example of a right tail test.
H1: μ > 170
H0: μ < 170
11.31
One tail test with rejection region on left
The rejection region will be in the left tail.
11.32
Two tail test with rejection region in both tails
The rejection region is split equally between the two tails.
11.33
Example… Students work
AT&T’s argues that its rates are such that customers won’t
see a difference in their phone bills between them and their
competitors. They calculate the mean and standard deviation
for all their customers at $17.09 and $3.87 (respectively).
Note: Don’t know the true value for σ, so we estimate σ from
the data [σ ~ s = 3.87] – large sample so don’t worry.
They then sample 100 customers at random and recalculate a
monthly phone bill based on competitor’s rates.
Our null and alternative hypotheses are
H1: 17.09. We do this by assuming that:≠
H0: = 17.09
11.34
Example…
The rejection region is set up so we can reject the null
hypothesis when the test statistic is large or when it is small.
That is, we set up a two-tail rejection region. The total area
in the rejection region must sum to , so we divide α by 2.
stat is “small” stat is “large”
11.35
Example…
At a 5% significance level (i.e. = .05), we have
/2 = .025. Thus, z.025 = 1.96 and our rejection region is:
z < –1.96 -or- z > 1.96
z-z.025 +z.0250
11.36
Example…
From the data, we calculate = 17.55
Using our standardized test statistic:
We find that:
Since z = 1.19 is not greater than 1.96, nor less than –1.96
we cannot reject the null hypothesis in favor of H1. That is
“there is insufficient evidence to infer that there is a
difference between the bills of AT&T and the competitor.”
11.37
Summary of One- and Two-Tail Tests…
One-Tail Test
(left tail)
Two-Tail Test One-Tail Test
(right tail)
11.38
Probability of a Type II Error –
A Type II error occurs when a false null hypothesis is not
rejected or “you accept the null when it is not true” but don’t
say it this way if a statistician is around.
In practice, this is by far the most serious error you can make
in most cases, especially in the “quality field”.
11.39
Judging the Test…
A statistical test of hypothesis is effectively defined by the
significance level ( ) and the sample size (n), both of
which are selected by the statistics practitioner.
Therefore, if the probability of a Type II error ( ) is too
large [we have insufficient power], we can reduce it by
increasing , and/or
increasing the sample size, n.
11.40
Judging the Test…
The power of a test is defined as 1– .
It represents the probability of rejecting the null hypothesis when it is
false and the true mean is something other than the null value for the
mean.
If we are testing the hypothesis that the average amount of medication
in blood pressure pills is equal to 6 mg (which is good), and we “fail to
reject” the null hypothesis, ship the pills to patients worldwide, only to
find out later that the “true” average amount of medication is really 8
mg and people die, we get in trouble. This occurred because the
P(reject the null / true mean = 7 mg) = 0.32 which would mean that we
have a 68% chance on not rejecting the null for these BAD pills and
shipping to patients worldwide.
11.41
Probability you ship pills whose mean amount of medication is 7 mg approximately 67%

Mais conteúdo relacionado

Mais procurados

Algorithm and pseudo codes
Algorithm and pseudo codesAlgorithm and pseudo codes
Algorithm and pseudo codes
hermiraguilar
 

Mais procurados (20)

Algorithm and pseudo codes
Algorithm and pseudo codesAlgorithm and pseudo codes
Algorithm and pseudo codes
 
Inheritance OOP Concept in C++.
Inheritance OOP Concept in C++.Inheritance OOP Concept in C++.
Inheritance OOP Concept in C++.
 
Structure
StructureStructure
Structure
 
Storage Class in C Progrmming
Storage Class in C Progrmming Storage Class in C Progrmming
Storage Class in C Progrmming
 
Functions in c
Functions in cFunctions in c
Functions in c
 
Les5e ppt 07
Les5e ppt 07Les5e ppt 07
Les5e ppt 07
 
Unit II chapter 4 Loops in C
Unit II chapter 4 Loops in CUnit II chapter 4 Loops in C
Unit II chapter 4 Loops in C
 
Introduction to java programming part 1
Introduction to java programming part 1Introduction to java programming part 1
Introduction to java programming part 1
 
Lecture 5 - Structured Programming Language
Lecture 5 - Structured Programming Language Lecture 5 - Structured Programming Language
Lecture 5 - Structured Programming Language
 
Python Programming Essentials - M25 - os and sys modules
Python Programming Essentials - M25 - os and sys modulesPython Programming Essentials - M25 - os and sys modules
Python Programming Essentials - M25 - os and sys modules
 
MMW (Data Management)-Part 1 for ULO 2 (1).pptx
MMW (Data Management)-Part 1 for ULO 2 (1).pptxMMW (Data Management)-Part 1 for ULO 2 (1).pptx
MMW (Data Management)-Part 1 for ULO 2 (1).pptx
 
Looping Statements and Control Statements in Python
Looping Statements and Control Statements in PythonLooping Statements and Control Statements in Python
Looping Statements and Control Statements in Python
 
Loops c++
Loops c++Loops c++
Loops c++
 
Rules for Variable Declaration
Rules for Variable DeclarationRules for Variable Declaration
Rules for Variable Declaration
 
Data types in C
Data types in CData types in C
Data types in C
 
The compilation process
The compilation processThe compilation process
The compilation process
 
Decision Making and Looping
Decision Making and LoopingDecision Making and Looping
Decision Making and Looping
 
Unit 2. Elements of C
Unit 2. Elements of CUnit 2. Elements of C
Unit 2. Elements of C
 
Unit ii chapter 1 operator and expressions in c
Unit ii chapter 1 operator and expressions in cUnit ii chapter 1 operator and expressions in c
Unit ii chapter 1 operator and expressions in c
 
Conditional Statement in C Language
Conditional Statement in C LanguageConditional Statement in C Language
Conditional Statement in C Language
 

Destaque (20)

Week6 narrativewriting
Week6 narrativewritingWeek6 narrativewriting
Week6 narrativewriting
 
Week 2
Week 2Week 2
Week 2
 
Week11 eng1060.ppt
Week11 eng1060.pptWeek11 eng1060.ppt
Week11 eng1060.ppt
 
Math3010 week 5
Math3010 week 5Math3010 week 5
Math3010 week 5
 
Math3010 week 6
Math3010 week 6Math3010 week 6
Math3010 week 6
 
Math3010 week 3
Math3010 week 3Math3010 week 3
Math3010 week 3
 
Eng 1060 week 4
Eng 1060 week 4Eng 1060 week 4
Eng 1060 week 4
 
Santrock essentials4e ppt_ch16
Santrock essentials4e ppt_ch16Santrock essentials4e ppt_ch16
Santrock essentials4e ppt_ch16
 
Ch016
Ch016Ch016
Ch016
 
Chapter6
Chapter6Chapter6
Chapter6
 
Week 6
Week 6Week 6
Week 6
 
Chapter 8
Chapter 8Chapter 8
Chapter 8
 
Efr ch13 ethicsinternational_sr2.4
Efr ch13 ethicsinternational_sr2.4Efr ch13 ethicsinternational_sr2.4
Efr ch13 ethicsinternational_sr2.4
 
Week5
Week5Week5
Week5
 
Ch3
Ch3Ch3
Ch3
 
Chapter 020 1
Chapter 020 1Chapter 020 1
Chapter 020 1
 
Living evidence 3
Living evidence 3Living evidence 3
Living evidence 3
 
Chapter 4 ppt eval &amp; testing 4e formatted 01.10 mo edits
Chapter 4 ppt eval &amp; testing 4e formatted 01.10 mo editsChapter 4 ppt eval &amp; testing 4e formatted 01.10 mo edits
Chapter 4 ppt eval &amp; testing 4e formatted 01.10 mo edits
 
Santrock essentials 3e_ppt_ch14
Santrock essentials 3e_ppt_ch14Santrock essentials 3e_ppt_ch14
Santrock essentials 3e_ppt_ch14
 
Nur 3062 chapter 8
Nur 3062 chapter 8Nur 3062 chapter 8
Nur 3062 chapter 8
 

Semelhante a Math3010 week 4

Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01
mandrewmartin
 
Testing of hypothesis
Testing of hypothesisTesting of hypothesis
Testing of hypothesis
Jags Jagdish
 

Semelhante a Math3010 week 4 (20)

6 estimation hypothesis testing t test
6 estimation hypothesis testing t test6 estimation hypothesis testing t test
6 estimation hypothesis testing t test
 
Top schools in delhi ncr
Top schools in delhi ncrTop schools in delhi ncr
Top schools in delhi ncr
 
chapter11.ppt
chapter11.pptchapter11.ppt
chapter11.ppt
 
Chapter8 Introduction to Estimation Hypothesis Testing.pdf
Chapter8 Introduction to Estimation Hypothesis Testing.pdfChapter8 Introduction to Estimation Hypothesis Testing.pdf
Chapter8 Introduction to Estimation Hypothesis Testing.pdf
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
eMba i qt unit-5.1_hypothesis testing
eMba i qt unit-5.1_hypothesis testingeMba i qt unit-5.1_hypothesis testing
eMba i qt unit-5.1_hypothesis testing
 
B.tech admission in india
B.tech admission in indiaB.tech admission in india
B.tech admission in india
 
Chapter 9 Fundamental of Hypothesis Testing.ppt
Chapter 9 Fundamental of Hypothesis Testing.pptChapter 9 Fundamental of Hypothesis Testing.ppt
Chapter 9 Fundamental of Hypothesis Testing.ppt
 
Chapter11 (1)
Chapter11 (1)Chapter11 (1)
Chapter11 (1)
 
hypothesis test
 hypothesis test hypothesis test
hypothesis test
 
Testing of hypotheses
Testing of hypothesesTesting of hypotheses
Testing of hypotheses
 
Test of hypothesis
Test of hypothesisTest of hypothesis
Test of hypothesis
 
hypothesis testing overview
hypothesis testing overviewhypothesis testing overview
hypothesis testing overview
 
More Statistics
More StatisticsMore Statistics
More Statistics
 
Unit 4 Tests of Significance
Unit 4 Tests of SignificanceUnit 4 Tests of Significance
Unit 4 Tests of Significance
 
Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01
 
20200519073328de6dca404c.pdfkshhjejhehdhd
20200519073328de6dca404c.pdfkshhjejhehdhd20200519073328de6dca404c.pdfkshhjejhehdhd
20200519073328de6dca404c.pdfkshhjejhehdhd
 
hypothesis-tesing.pdf
hypothesis-tesing.pdfhypothesis-tesing.pdf
hypothesis-tesing.pdf
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
Testing of hypothesis
Testing of hypothesisTesting of hypothesis
Testing of hypothesis
 

Mais de stanbridge

Mais de stanbridge (20)

Micro Lab 3 Lecture
Micro Lab 3 LectureMicro Lab 3 Lecture
Micro Lab 3 Lecture
 
Creating a poster v2
Creating a poster v2Creating a poster v2
Creating a poster v2
 
Creating a poster
Creating a posterCreating a poster
Creating a poster
 
Sample poster
Sample posterSample poster
Sample poster
 
OT 5018 Thesis Dissemination
OT 5018 Thesis DisseminationOT 5018 Thesis Dissemination
OT 5018 Thesis Dissemination
 
Ot5101 005 week 5
Ot5101 005 week 5Ot5101 005 week 5
Ot5101 005 week 5
 
Ot5101 005 week4
Ot5101 005 week4Ot5101 005 week4
Ot5101 005 week4
 
Compliance, motivation, and health behaviors
Compliance, motivation, and health behaviors Compliance, motivation, and health behaviors
Compliance, motivation, and health behaviors
 
Ch 5 developmental stages of the learner
Ch 5   developmental stages of the learnerCh 5   developmental stages of the learner
Ch 5 developmental stages of the learner
 
OT 5101 week2 theory policy
OT 5101 week2 theory policyOT 5101 week2 theory policy
OT 5101 week2 theory policy
 
OT 5101 week3 planning needs assessment
OT 5101 week3 planning needs assessmentOT 5101 week3 planning needs assessment
OT 5101 week3 planning needs assessment
 
Ot5101 week1
Ot5101 week1Ot5101 week1
Ot5101 week1
 
NUR 304 Chapter005
NUR 304 Chapter005NUR 304 Chapter005
NUR 304 Chapter005
 
NUR 3043 Chapter007
NUR 3043 Chapter007NUR 3043 Chapter007
NUR 3043 Chapter007
 
NUR 3043 Chapter006
NUR 3043 Chapter006NUR 3043 Chapter006
NUR 3043 Chapter006
 
NUR 3043 Chapter004
NUR 3043 Chapter004NUR 3043 Chapter004
NUR 3043 Chapter004
 
3043 Chapter009
3043 Chapter0093043 Chapter009
3043 Chapter009
 
3043 Chapter008
 3043 Chapter008 3043 Chapter008
3043 Chapter008
 
Melnyk ppt chapter_21
Melnyk ppt chapter_21Melnyk ppt chapter_21
Melnyk ppt chapter_21
 
Melnyk ppt chapter_22
Melnyk ppt chapter_22Melnyk ppt chapter_22
Melnyk ppt chapter_22
 

Último

Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 

Último (20)

IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 

Math3010 week 4

  • 1. 11.1 Nonstatistical Hypothesis Testing… A criminal trial is an example of hypothesis testing without the statistics. In a trial a jury must decide between two hypotheses. The null hypothesis is H0: The defendant is innocent The alternative hypothesis or research hypothesis is H1: The defendant is guilty The jury does not know which hypothesis is true. They must make a decision on the basis of evidence presented.
  • 2. 11.2 Nonstatistical Hypothesis Testing… In the language of statistics convicting the defendant is called rejecting the null hypothesis in favor of the alternative hypothesis. That is, the jury is saying that there is enough evidence to conclude that the defendant is guilty (i.e., there is enough evidence to support the alternative hypothesis). If the jury acquits it is stating that there is not enough evidence to support the alternative hypothesis. Notice that the jury is not saying that the defendant is innocent, only that there is not enough evidence to support the alternative hypothesis. That is why we never say that we accept the null hypothesis, although most people in industry will say “We accept the null hypothesis”
  • 3. 11.3 Nonstatistical Hypothesis Testing… There are two possible errors. A Type I error occurs when we reject a true null hypothesis. That is, a Type I error occurs when the jury convicts an innocent person. We would want the probability of this type of error [maybe 0.001 – beyond a reasonable doubt] to be very small for a criminal trial where a conviction results in the death penalty, whereas for a civil trial, where conviction might result in someone having to “pay for damages to a wrecked auto”,we would be willing for the probability to be larger [0.49 – preponderance of the evidence ] P(Type I error) = α [usually 0.05 or 0.01]
  • 4. 11.4 Nonstatistical Hypothesis Testing… A Type II error occurs when we don’t reject a false null hypothesis [accept the null hypothesis]. That occurs when a guilty defendant is acquitted. In practice, this type of error is by far the most serious mistake we normally make. For example, if we test the hypothesis that the amount of medication in a heart pill is equal to a value which will cure your heart problem and “accept the hull hypothesis that the amount is ok”. Later on we find out that the average amount is WAY too large and people die from “too much medication” [I wish we had rejected the hypothesis and threw the pills in the trash can], it’s too late because we shipped the pills to the public.
  • 5. 11.5 Nonstatistical Hypothesis Testing… The probability of a Type I error is denoted as α (Greek letter alpha). The probability of a type II error is β (Greek letter beta). The two probabilities are inversely related. Decreasing one increases the other, for a fixed sample size. In other words, you can’t have α and β both real small for any old sample size. You may have to take a much larger sample size, or in the court example, you need much more evidence.
  • 6. 11.6 Types of Errors… A Type I error occurs when we reject a true null hypothesis (i.e. Reject H0 when it is TRUE) A Type II error occurs when we don’t reject a false null hypothesis (i.e. Do NOT reject H0 when it is FALSE) H0 T F Reject I Reject II
  • 7. 11.7 Nonstatistical Hypothesis Testing… The critical concepts are theses: 1. There are two hypotheses, the null and the alternative hypotheses. 2. The procedure begins with the assumption that the null hypothesis is true. 3. The goal is to determine whether there is enough evidence to infer that the alternative hypothesis is true, or the null is not likely to be true. 4. There are two possible decisions: Conclude that there is enough evidence to support the alternative hypothesis. Reject the null. Conclude that there is not enough evidence to support the alternative hypothesis. Fail to reject the null.
  • 8. 11.8 Concepts of Hypothesis Testing (1)… The two hypotheses are called the null hypothesis and the other the alternative or research hypothesis. The usual notation is: H0: — the ‘null’ hypothesis H1: — the ‘alternative’ or ‘research’ hypothesis The null hypothesis (H0) will always state that the parameter equals the value specified in the alternative hypothesis (H1) pronounced H “nought”
  • 9. 11.9 Concepts of Hypothesis Testing… Consider mean demand for computers during assembly lead time. Rather than estimate the mean demand, our operations manager wants to know whether the mean is different from 350 units. In other words, someone is claiming that the mean time is 350 units and we want to check this claim out to see if it appears reasonable. We can rephrase this request into a test of the hypothesis: H0: = 350 Thus, our research hypothesis becomes: H1: 350≠ Recall that the standard deviation [σ]was assumed to be 75, the sample size [n] was 25, and the sample mean [ ] was calculated to be 370.16
  • 10. 11.10 Concepts of Hypothesis Testing… For example, if we’re trying to decide whether the mean is not equal to 350, a large value of (say, 600) would provide enough evidence. If is close to 350 (say, 355) we could not say that this provides a great deal of evidence to infer that the population mean is different than 350.
  • 11. 11.11 Concepts of Hypothesis Testing (4)… The two possible decisions that can be made: Conclude that there is enough evidence to support the alternative hypothesis (also stated as: reject the null hypothesis in favor of the alternative) Conclude that there is not enough evidence to support the alternative hypothesis (also stated as: failing to reject the null hypothesis in favor of the alternative) NOTE: we do not say that we accept the null hypothesis if a statistician is around…
  • 12. 11.12 Concepts of Hypothesis Testing (2)… The testing procedure begins with the assumption that the null hypothesis is true. Thus, until we have further statistical evidence, we will assume: H0: = 350 (assumed to be TRUE) The next step will be to determine the sampling distribution of the sample mean assuming the true mean is 350. is normal with 350 75/SQRT(25) = 15
  • 13. 11.13 Is the Sample Mean in the Guts of the Sampling Distribution??
  • 14. 11.14 Three ways to determine this: First way 1. Unstandardized test statistic: Is in the guts of the sampling distribution? Depends on what you define as the “guts” of the sampling distribution. If we define the guts as the center 95% of the distribution [this means α = 0.05], then the critical values that define the guts will be 1.96 standard deviations of X-Bar on either side of the mean of the sampling distribution [350], or UCV = 350 + 1.96*15 = 350 + 29.4 = 379.4 LCV = 350 – 1.96*15 = 350 – 29.4 = 320.6
  • 15. 11.15 1. Unstandardized Test Statistic Approach
  • 16. 11.16 Three ways to determine this: Second way 2. Standardized test statistic: Since we defined the “guts” of the sampling distribution to be the center 95% [α = 0.05], If the Z-Score for the sample mean is greater than 1.96, we know that will be in the reject region on the right side or If the Z-Score for the sample mean is less than -1.97, we know that will be in the reject region on the left side. Z = ( - )/ = (370.16 – 350)/15 = 1.344 Is this Z-Score in the guts of the sampling distribution???
  • 17. 11.17 2. Standardized Test Statistic Approach
  • 18. 11.18 Three ways to determine this: Third way 3. The p-value approach (which is generally used with a computer and statistical software): Increase the “Rejection Region” until it “captures” the sample mean. For this example, since is to the right of the mean, calculate P( > 370.16) = P(Z > 1.344) = 0.0901 Since this is a two tailed test, you must double this area for the p-value. p-value = 2*(0.0901) = 0.1802 Since we defined the guts as the center 95% [α = 0.05], the reject region is the other 5%. Since our sample mean, , is in the 18.02% region, it cannot be in our 5% rejection region [α = 0.05].
  • 20. 11.20 Statistical Conclusions: Unstandardized Test Statistic: Since LCV (320.6) < (370.16) < UCV (379.4), we reject the null hypothesis at a 5% level of significance. Standardized Test Statistic: Since -Zα/2(-1.96) < Z(1.344) < Zα/2 (1.96), we fail to reject the null hypothesis at a 5% level of significance. P-value: Since p-value (0.1802) > 0.05 [α], we fail to reject the hull hypothesis at a 5% level of significance.
  • 21. 11.21 Example… A department store manager determines that a new billing system will be cost-effective only if the mean monthly account is more than $170. A random sample of 400 monthly accounts is drawn, for which the sample mean is $178. The accounts are approximately normally distributed with a standard deviation of $65. Can we conclude that the new system will be cost-effective?
  • 22. 11.22 Example… The system will be cost effective if the mean account balance for all customers is greater than $170. We express this belief as a our research hypothesis, that is: H1: > 170 (this is what we want to determine) Thus, our null hypothesis becomes: H0: = 170 (this specifies a single value for the parameter of interest) – Actually H0: μ < 170
  • 23. 11.23 Example… What we want to show: H1: > 170 H0: < 170 (we’ll assume this is true) Normally we put Ho first. We know: n = 400, = 178, and = 65 = 65/SQRT(400) = 3.25 α = 0.05
  • 24. 11.24 Example… Rejection Region… The rejection region is a range of values such that if the test statistic falls into that range, we decide to reject the null hypothesis in favor of the alternative hypothesis. is the critical value of to reject H0.
  • 25. 11.25 Example… At a 5% significance level (i.e. =0.05), we get [all α in one tail] Zα = Z0.05 = 1.645 Therefore, UCV = 170 + 1.645*3.25 = 175.35 Since our sample mean (178) is greater than the critical value we calculated (175.35), we reject the null hypothesis in favor of H1 OR (>1.645) Reject null OR p-value = P( > 178) = P(Z > 2.46) = 0.0069 < 0.05 Reject null
  • 26. 11.26 Example… The Big Picture… =175.34 =178 H1: > 170 H0: = 170 Reject H0 in favor of
  • 27. 11.27 Interpreting the p-value… The smaller the p-value, the more statistical evidence exists to support the alternative hypothesis. •If the p-value is less than 1%, there is overwhelming evidence that supports the alternative hypothesis. •If the p-value is between 1% and 5%, there is a strong evidence that supports the alternative hypothesis. •If the p-value is between 5% and 10% there is a weak evidence that supports the alternative hypothesis. •If the p-value exceeds 10%, there is no evidence that supports the alternative hypothesis. We observe a p-value of .0069, hence there is overwhelming evidence to support H1: > 170.
  • 28. 11.28 Interpreting the p-value… Overwhelming Evidence (Highly Significant) Strong Evidence (Significant) Weak Evidence (Not Significant) No Evidence (Not Significant) 0 .01 .05 .10 p=.0069
  • 29. 11.29 Conclusions of a Test of Hypothesis… If we reject the null hypothesis, we conclude that there is enough evidence to infer that the alternative hypothesis is true. If we fail to reject the null hypothesis, we conclude that there is not enough statistical evidence to infer that the alternative hypothesis is true. This does not mean that we have proven that the null hypothesis is true! Keep in mind that committing a Type I error OR a Type II error can be VERY bad depending on the problem.
  • 30. 11.30 One tail test with rejection region on right The last example was a one tail test, because the rejection region is located in only one tail of the sampling distribution: More correctly, this was an example of a right tail test. H1: μ > 170 H0: μ < 170
  • 31. 11.31 One tail test with rejection region on left The rejection region will be in the left tail.
  • 32. 11.32 Two tail test with rejection region in both tails The rejection region is split equally between the two tails.
  • 33. 11.33 Example… Students work AT&T’s argues that its rates are such that customers won’t see a difference in their phone bills between them and their competitors. They calculate the mean and standard deviation for all their customers at $17.09 and $3.87 (respectively). Note: Don’t know the true value for σ, so we estimate σ from the data [σ ~ s = 3.87] – large sample so don’t worry. They then sample 100 customers at random and recalculate a monthly phone bill based on competitor’s rates. Our null and alternative hypotheses are H1: 17.09. We do this by assuming that:≠ H0: = 17.09
  • 34. 11.34 Example… The rejection region is set up so we can reject the null hypothesis when the test statistic is large or when it is small. That is, we set up a two-tail rejection region. The total area in the rejection region must sum to , so we divide α by 2. stat is “small” stat is “large”
  • 35. 11.35 Example… At a 5% significance level (i.e. = .05), we have /2 = .025. Thus, z.025 = 1.96 and our rejection region is: z < –1.96 -or- z > 1.96 z-z.025 +z.0250
  • 36. 11.36 Example… From the data, we calculate = 17.55 Using our standardized test statistic: We find that: Since z = 1.19 is not greater than 1.96, nor less than –1.96 we cannot reject the null hypothesis in favor of H1. That is “there is insufficient evidence to infer that there is a difference between the bills of AT&T and the competitor.”
  • 37. 11.37 Summary of One- and Two-Tail Tests… One-Tail Test (left tail) Two-Tail Test One-Tail Test (right tail)
  • 38. 11.38 Probability of a Type II Error – A Type II error occurs when a false null hypothesis is not rejected or “you accept the null when it is not true” but don’t say it this way if a statistician is around. In practice, this is by far the most serious error you can make in most cases, especially in the “quality field”.
  • 39. 11.39 Judging the Test… A statistical test of hypothesis is effectively defined by the significance level ( ) and the sample size (n), both of which are selected by the statistics practitioner. Therefore, if the probability of a Type II error ( ) is too large [we have insufficient power], we can reduce it by increasing , and/or increasing the sample size, n.
  • 40. 11.40 Judging the Test… The power of a test is defined as 1– . It represents the probability of rejecting the null hypothesis when it is false and the true mean is something other than the null value for the mean. If we are testing the hypothesis that the average amount of medication in blood pressure pills is equal to 6 mg (which is good), and we “fail to reject” the null hypothesis, ship the pills to patients worldwide, only to find out later that the “true” average amount of medication is really 8 mg and people die, we get in trouble. This occurred because the P(reject the null / true mean = 7 mg) = 0.32 which would mean that we have a 68% chance on not rejecting the null for these BAD pills and shipping to patients worldwide.
  • 41. 11.41 Probability you ship pills whose mean amount of medication is 7 mg approximately 67%