Item Analysis,
Test Validity and Reliability
Prepared by:
Rovel A. Aparicio
Mathematics Teacher
THERE IS always A
better WAY
Stages in Test Construction
I. Planning the Test
A. Determining the Objectives
B. Preparing the Table of Specifications
C. Selecting the Appropriate Item Format
D. Writing the Test items
E. Editing the Test items
Stages in Test Construction
II. Trying Out the Test

A. Administering the test
B. Item analysis
C. Preparing the Final Form of the Test
Stages in Test Construction
III. Establishing Test Validity

IV. Establishing Test Reliability

V. Interpreting the Test Scores
Item Analysis

GOAL: Improve the test.
IMPORTANCE: Measures the effectiveness of each individual test item.

DIFFICULTY INDEX
The percentage of the pupils who got the item right.
Interpreted as how easy or how difficult an item is.

DISCRIMINATION INDEX
Refers to the degree to which success or failure on an item
indicates possession of the achievement being measured.
ACTIVITY NO.1

COMPUTE THE DIFFICULTY
INDEX AND
DISCRIMINATION INDEX OF
PERIODICAL TEST.
U-L INDEX METHOD
(STEPS)

1. Score the papers and rank them from
highest
to lowest according to the total score.
2. Separate the top 27% and the bottom 27%
of the papers.
3. Tally the responses made to each test item
by each individual in the upper 27% group.
4. Tally the responses made to each test item
by each individual in the lower 27% group.
U-L INDEX METHOD
(STEPS)

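5. Compute the difficulty index.
[d = (U + L)/(nu + nl)]
6. Compute the discrimination index.
[D = (U - L)/nu] or [D = (U - L)/nl]
Here U and L are the numbers of pupils in the upper and lower
groups who got the item right, and nu and nl are the numbers of
pupils in the upper and lower groups.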
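The six steps can be sketched in Python as follows. This is a minimal sketch, not a definitive implementation: the function name `ul_item_analysis` and the input layout (one list of 0/1 item scores per pupil, 1 meaning the item was answered correctly) are assumptions made for illustration, and only correct responses are tallied rather than every option chosen.

```python
def ul_item_analysis(responses, fraction=0.27):
    """Return (item no., U, L, d, D) for every item.

    responses: one list per pupil of 0/1 item scores (assumed layout).
    """
    # Step 1: rank the papers from highest to lowest total score.
    ranked = sorted(responses, key=sum, reverse=True)
    # Step 2: separate the top 27% and the bottom 27% of the papers.
    n = max(1, round(len(ranked) * fraction))
    upper, lower = ranked[:n], ranked[-n:]
    results = []
    for item in range(len(ranked[0])):
        # Steps 3-4: tally the correct responses in each group.
        U = sum(paper[item] for paper in upper)
        L = sum(paper[item] for paper in lower)
        # Step 5: difficulty index d = (U + L)/(nu + nl), with nu = nl = n.
        d = (U + L) / (2 * n)
        # Step 6: discrimination index D = (U - L)/nu.
        D = (U - L) / n
        results.append((item + 1, U, L, d, D))
    return results
```

Because both groups are cut with the same fraction, nu and nl are equal here, so the two forms of the discrimination formula give the same value. Papers with tied total scores keep their original order, which is an arbitrary choice.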
Item no.   Upper 27%   Lower 27%   Difficulty Index   Discrimination Index   Remarks
1          14          12          0.81                0.13                  Revised
2          10          6           0.50                0.25                  Retained
3          11          7           0.56                0.25                  Retained
4          9           2           0.34                0.44                  Retained
5          12          6           0.56                0.38                  Retained
6          6           14          0.63               -0.50                  Rejected
7          13          4           0.53                0.56                  Retained
8          3           10          0.41               -0.44                  Rejected
9          13          12          0.78                0.06                  Rejected
10         8           6           0.44                0.13                  Revised

No. of pupils tested: 60
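The index values in the table can be checked from the tallies alone. The sketch below assumes each group holds 16 papers (27% of 60 pupils, rounded to a whole number) and that values are rounded half up to two decimals; both are assumptions inferred from the figures, and the helper names are hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP


def round_half_up(value, places="0.01"):
    # Python's built-in round() rounds exact ties to the even digit,
    # so Decimal is used to round ties upward instead.
    return float(Decimal(str(value)).quantize(Decimal(places), rounding=ROUND_HALF_UP))


def indices_from_counts(U, L, group_size):
    d = (U + L) / (2 * group_size)   # difficulty index
    D = (U - L) / group_size         # discrimination index
    return round_half_up(d), round_half_up(D)


group_size = 16  # assumption: 27% of 60 pupils, rounded
item_1 = indices_from_counts(14, 12, group_size)
item_6 = indices_from_counts(6, 14, group_size)
```

A negative discrimination index, as for item 6, means more pupils in the lower group than in the upper group answered the item correctly.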
Item Analysis

DIFFICULTY INDEX
.00-.20    Very Difficult
.21-.80    Moderately Difficult
.81-1.00   Very Easy

DISCRIMINATION INDEX
.09 and below   Poor items (Reject)
.10-.39         Reasonably Good (Revise)
.40-1.00        Very Good items (Retain)
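These criteria can be expressed as two small lookup functions. This is a sketch under stated assumptions: the function names are hypothetical, the indices are assumed to be rounded to two decimals before classification, and any discrimination index below .10 (including negative values) is treated as a poor item.

```python
def describe_difficulty(d):
    # Ranges follow the difficulty index criteria listed above.
    if d <= 0.20:
        return "Very Difficult"
    if d <= 0.80:
        return "Moderately Difficult"
    return "Very Easy"


def describe_discrimination(D):
    # Assumption: everything below .10, including negatives, is rejected.
    if D < 0.10:
        return "Poor item (Reject)"
    if D < 0.40:
        return "Reasonably Good (Revise)"
    return "Very Good item (Retain)"
```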
Establishing Test Validity

Types of Test Validity
1. Content Validity
2. Construct Validity
3. Criterion-related Validity
Establishing Test Validity

Type of Validity: 1. Content Validity
Meaning: How well the sample test tasks represent the domain of
tasks to be measured.
Procedure: Compare test tasks with test specifications describing
the task domain under consideration (non-statistical).
Establishing Test Validity

Type of Validity: 2. Construct Validity
Meaning: How test performance can be described psychologically.
Procedure: Experimentally determine what factors influence scores
on the test. The procedure may be logical and statistical, using
correlations and other statistical methods.
Establishing Test Validity

Type of Validity: 3. Criterion-related Validity
Meaning: How well test performance predicts future performance or
estimates current performance on some valued measure other than
the test itself.
Procedure: Compare test scores with a measure of performance
(grade) obtained at a later date (for prediction), or with another
measure of performance obtained concurrently (for estimating
present status). Primarily statistical: correlate test results
with an outside criterion.
Establishing Test Reliability

Types of Reliability Measure
1. Measure of Stability
2. Measure of Equivalence
3. Measure of Stability and Equivalence
4. Measure of Internal Consistency
Establishing Test Reliability

Type of Reliability Measure: 1. Measure of Stability
Method of Estimating Reliability: Test-retest method
Procedure: Give a test twice to the same group, with any time
interval between tests from several minutes to several years.
(Pearson r)
Establishing Test Reliability

Type of Reliability Measure: 2. Measure of Equivalence
Method of Estimating Reliability: Equivalent forms method
Procedure: Give two forms of a test to the same group in close
succession. (Pearson r)
Establishing Test Reliability

Type of Reliability Measure: 3. Measure of Stability and Equivalence
Method of Estimating Reliability: Test-retest with equivalent forms
Procedure: Give two forms of a test to the same group with
increased time intervals between forms. (Pearson r)
Establishing Test Reliability

Type of Reliability Measure: 4. Measure of Internal Consistency
Method of Estimating Reliability: Kuder-Richardson method
Procedure: Give a test once. Score the total test and apply the
Kuder-Richardson formula.
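The slide does not say which Kuder-Richardson formula is meant. As an illustration only, the sketch below assumes formula 20 (KR-20), r = (k/(k-1)) x (1 - sum of pq / variance of total scores), where k is the number of items, p is the proportion of pupils answering an item correctly and q = 1 - p. The function name `kr20`, the 0/1 input layout, and the use of the population variance (dividing by N) are assumptions.

```python
def kr20(responses):
    """KR-20 estimate from one list of 0/1 item scores per pupil (assumed layout)."""
    n = len(responses)       # number of pupils
    k = len(responses[0])    # number of items
    totals = [sum(paper) for paper in responses]
    mean_total = sum(totals) / n
    # Population variance of the total scores (divide by N).
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    # Sum of p * q over all items.
    sum_pq = 0.0
    for item in range(k):
        p = sum(paper[item] for paper in responses) / n
        sum_pq += p * (1 - p)
    return (k / (k - 1)) * (1 - sum_pq / var_total)
```

The function needs at least two items and some spread in the total scores; if every pupil has the same total, the variance is zero and the formula is undefined.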
Establishing Test Reliability

Type of Reliability Measure: 4. Measure of Internal Consistency
Method of Estimating Reliability: Split-half method
Procedure: Give a test once. Score equivalent halves of the test
(e.g. odd- and even-numbered items). (Pearson r and Spearman-Brown
formula)
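A minimal sketch of the split-half procedure, assuming an odd/even split and one list of 0/1 item scores per pupil. The function names are hypothetical. The Spearman-Brown step uses the standard form for doubling test length, r_full = 2r / (1 + r), where r is the correlation between the two half scores.

```python
from math import sqrt


def pearson_r(xs, ys):
    # Pearson r from deviation scores.
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def split_half_reliability(responses):
    # Score the two halves separately for every pupil.
    odd = [sum(paper[0::2]) for paper in responses]    # items 1, 3, 5, ...
    even = [sum(paper[1::2]) for paper in responses]   # items 2, 4, 6, ...
    r_half = pearson_r(odd, even)
    # Spearman-Brown correction: estimate reliability of the full-length test.
    return 2 * r_half / (1 + r_half)
```

The correction is needed because the correlation between two half-tests understates the reliability of the whole test, which is twice as long.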
ACTIVITY NO.2

TEST THE RELIABILITY OF
PERIODICAL TEST.
Pearson r Standard Scores
(Directions)

1. Begin by writing the pairs of scores to be
studied in two columns. Be sure that the pair of
scores for each pupil is in the same row.
Label one set of scores X, the other Y.
2. Get the sum (∑) of the scores for each
column. Divide the sum by the number of
scores (N) in each column to get the mean.
3. Subtract the mean of X from each score in
column X. Write the difference in column x. Be
sure to include the algebraic sign.
Pearson r Standard Scores
(Directions)

4. Subtract the mean of Y from each score in
column Y. Write the difference in column y. Don't
forget the sign.
5. Square each score in column X. Enter each
result under X².
6. Square each score in column Y. Enter each
result under Y².
7. Compute the standard deviations of X and Y
and enter the results under the columns SDx
and SDy, respectively.
Pearson r Standard Scores
(Directions)

8. Divide each entry in columns x and y by the
standard deviations SDx and SDy, respectively,
and enter the results under Zx and Zy.
9. Multiply Zx and Zy and enter the result
under ZxZy.
10. Get the sum (∑) of ZxZy.
11. Apply the formula r = (∑ZxZy) / N.
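The directions can be followed in code as below. This is a sketch, not the presenter's worksheet: the function name is hypothetical, and the standard deviations are computed from the deviation scores with N in the denominator, an assumption needed so that r = (∑ZxZy) / N gives the usual Pearson r.

```python
from math import sqrt


def pearson_r_standard_scores(X, Y):
    N = len(X)                                       # step 1: paired scores, same order
    mean_x, mean_y = sum(X) / N, sum(Y) / N          # step 2: column means
    x = [xi - mean_x for xi in X]                    # step 3: deviations of X
    y = [yi - mean_y for yi in Y]                    # step 4: deviations of Y
    sd_x = sqrt(sum(d * d for d in x) / N)           # step 7: SDx (divide by N)
    sd_y = sqrt(sum(d * d for d in y) / N)           # step 7: SDy (divide by N)
    zx = [d / sd_x for d in x]                       # step 8: standard scores Zx
    zy = [d / sd_y for d in y]                       # step 8: standard scores Zy
    return sum(a * b for a, b in zip(zx, zy)) / N    # steps 9-11: r = sum(ZxZy) / N
```

For a reliability check, X and Y would be the two sets of scores from the same pupils, for example the first and second administration in the test-retest method.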
Interpretation of
Coefficient of Correlation

Correlation is a measure of relationship
between two variables.
Magnitude or size of
Relationship
0.8 and above means
high correlation
0.5 means moderate
correlation
0.3 and below means
low correlation

Direction of Relationship
Negative coefficient
means that as one variable
increases, the other
decreases.
Positive coefficient
means that as one variable
increases, the other also
increases.
Interpretation of
Coefficient of Variation

Coefficient of variation is defined as the
ratio of the standard deviation to the mean,
usually expressed in percent.

c.v. = (s.d./mean) x 100

Criteria:
less than 10% - Homogeneous
greater than 10% - Heterogeneous
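A short sketch of the computation, following the definition (standard deviation divided by the mean, times 100). The function names are hypothetical, the population standard deviation (dividing by N) is an assumption, and a value of exactly 10% is treated as heterogeneous because the criteria do not cover that case.

```python
from math import sqrt


def coefficient_of_variation(scores):
    n = len(scores)
    mean = sum(scores) / n
    # Population standard deviation (divide by N); an assumption.
    sd = sqrt(sum((s - mean) ** 2 for s in scores) / n)
    return sd / mean * 100


def describe_variation(cv):
    # Exactly 10% is not covered by the criteria; it is grouped with "greater than".
    return "Homogeneous" if cv < 10 else "Heterogeneous"
```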
REMEMBER:

1. Use item analysis procedures to check the quality
of the test. The item analysis should be interpreted
with care and caution.
2. A test is valid when it measures what it is supposed
to measure.
3. A test is reliable when it is consistent.
Test validity

  • 1. Item Analysis, Test Validity and Reliability Prepared by: Rovel A. Aparicio Mathematics Teacher
  • 2. THERE IS always A better WAY
  • 3. Stages in Test Construction. I. Planning the Test: A. Determining the Objectives; B. Preparing the Table of Specifications; C. Selecting the Appropriate Item Format; D. Writing the Test Items; E. Editing the Test Items.
  • 4. Stages in Test Construction. II. Trying Out the Test: A. Administering the Test; B. Item Analysis; C. Preparing the Final Form of the Test.
  • 5. Stages in Test Construction. III. Establishing Test Validity. IV. Establishing Test Reliability. V. Interpreting the Test Scores.
  • 6. Item Analysis. GOAL: Improve the test. IMPORTANCE: Measure the effectiveness of each individual test item. DIFFICULTY INDEX: the percentage of the pupils who got the item right; interpreted as how easy or how difficult an item is. DISCRIMINATION INDEX: the degree to which success or failure on an item indicates possession of the achievement being measured.
  • 7. ACTIVITY NO.1 COMPUTE THE DIFFICULTY INDEX AND DISCRIMINATION INDEX OF PERIODICAL TEST.
  • 8. U-L INDEX METHOD (STEPS) 1. Score the papers and rank them from highest to lowest according to the total score. 2. Separate the top 27% and the bottom 27% of the papers. 3. Tally the responses made to each test item by each individual in the upper 27% group. 4. Tally the responses made to each test item by each individual in the lower 27% group.
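  • 9. U-L INDEX METHOD (STEPS) 5. Compute the difficulty index: d = (U + L)/(n_u + n_l). 6. Compute the discrimination index: D = (U - L)/n_u or D = (U - L)/n_l. Here U and L are the numbers of pupils in the upper and lower groups who answered the item correctly, and n_u and n_l are the numbers of pupils in those groups.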
  • 11. Item Analysis. DIFFICULTY INDEX: .00-.20 Very Difficult; .21-.80 Moderately Difficult; .81-1.00 Very Easy. DISCRIMINATION INDEX: < .09 Poor items (Reject); .10-.39 Reasonably Good (Revise); .40-1.00 Very Good items (Retain).
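
The two formulas and the interpretation bands above can be sketched in Python. This is a minimal illustration, not part of the original deck: the function names are hypothetical, the two groups are assumed to be the same size (n_u = n_l), and values that fall between the printed bands (for example a discrimination index between .09 and .10) are assigned by assumption as noted in the comments. The sample figures are item 1 of the deck's worked table (60 pupils tested, so about 16 papers per 27% group).

```python
def item_indices(upper_correct, lower_correct, group_size):
    """Return (difficulty, discrimination) for one test item.

    upper_correct / lower_correct: pupils in the upper / lower 27% group
    who answered the item correctly (U and L in the slide formulas).
    group_size: number of papers in each group (assumed equal, n_u == n_l).
    """
    if group_size <= 0:
        raise ValueError("group_size must be positive")
    difficulty = (upper_correct + lower_correct) / (2 * group_size)
    discrimination = (upper_correct - lower_correct) / group_size
    return difficulty, discrimination


def describe_difficulty(d):
    # Bands from the slide: .00-.20, .21-.80, .81-1.00.
    # Assumption: a value between two printed bands goes to the higher band.
    if d <= 0.20:
        return "Very Difficult"
    if d <= 0.80:
        return "Moderately Difficult"
    return "Very Easy"


def describe_discrimination(D):
    # Assumption: anything below .10 (including negatives) is a poor item.
    if D < 0.10:
        return "Poor (Reject)"
    if D < 0.40:
        return "Reasonably Good (Revise)"
    return "Very Good (Retain)"


# Item 1 of the worked table: 14 correct in the upper group, 12 in the lower.
d, D = item_indices(upper_correct=14, lower_correct=12, group_size=16)
print(d, describe_difficulty(d))
print(D, describe_discrimination(D))
```

The classification is applied to the unrounded indices; rounding to two decimals first could move a borderline item into a different band.
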
  • 13. Establishing Test Validity. Type of Validity: 1. Content Validity. Meaning: How well the sample of test tasks represents the domain of tasks to be measured. Procedure: Compare the test tasks with the test specifications describing the task domain under consideration (non-statistical).
  • 14. Establishing Test Validity. Type of Validity: 2. Construct Validity. Meaning: How test performance can be described psychologically. Procedure: Experimentally determine what factors influence scores on the test. The procedure may be logical and statistical, using correlations and other statistical methods.
  • 15. Establishing Test Validity. Type of Validity: 3. Criterion-related Validity. Meaning: How well test performance predicts future performance or estimates current performance on some valued measure other than the test itself. Procedure: Compare test scores with a measure of performance (grade) obtained at a later date (for prediction), or with another measure of performance obtained concurrently (for estimating present status). Primarily statistical: correlate test results with an outside criterion.
  • 16. Establishing Test Reliability. Types of Reliability Measure: Measure of Stability; Measure of Equivalence; Measure of Stability and Equivalence; Measure of Internal Consistency.
  • 17. Establishing Test Reliability. Type of Reliability Measure: 1. Measure of Stability. Method of Estimating Reliability: Test-retest method. Procedure: Give a test twice to the same group with any time interval between tests, from several minutes to several years. (Pearson r)
  • 18. Establishing Test Reliability. Type of Reliability Measure: 2. Measure of Equivalence. Method of Estimating Reliability: Equivalent forms method. Procedure: Give two forms of a test to the same group in close succession. (Pearson r)
  • 19. Establishing Test Reliability. Type of Reliability Measure: 3. Measure of Stability and Equivalence. Method of Estimating Reliability: Test-retest with equivalent forms. Procedure: Give two forms of a test to the same group with an increased time interval between forms. (Pearson r)
  • 20. Establishing Test Reliability. Type of Reliability Measure: 4. Measure of Internal Consistency. Method of Estimating Reliability: Kuder-Richardson method. Procedure: Give a test once. Score the total test and apply the Kuder-Richardson formula.
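
The slide does not say which Kuder-Richardson formula is meant. As a hedged sketch, the following assumes KR-20, r = (k/(k-1)) × (1 - Σpq/σ²), where k is the number of items, p is the proportion of pupils answering an item correctly, q = 1 - p, and σ² is the variance of the total scores (taken here as the population variance, which is also an assumption). The function name and the sample data are hypothetical.

```python
def kr20(item_scores):
    """Kuder-Richardson 20 estimate for dichotomous (0/1) items.

    item_scores: one row per pupil, one 0/1 entry per item.
    """
    n_pupils = len(item_scores)
    n_items = len(item_scores[0])
    if n_pupils < 2 or n_items < 2:
        raise ValueError("need at least two pupils and two items")

    # Variance of the total scores (population form, dividing by N).
    totals = [sum(row) for row in item_scores]
    mean_total = sum(totals) / n_pupils
    variance = sum((t - mean_total) ** 2 for t in totals) / n_pupils
    if variance == 0:
        raise ValueError("total scores have zero variance")

    # Sum of p*q over the items.
    sum_pq = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in item_scores) / n_pupils
        sum_pq += p * (1 - p)

    return (n_items / (n_items - 1)) * (1 - sum_pq / variance)


# Hypothetical responses of four pupils to three items.
responses = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
print(kr20(responses))
```
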
  • 21. Establishing Test Reliability. Type of Reliability Measure: 4. Measure of Internal Consistency. Method of Estimating Reliability: Split-half method. Procedure: Give a test once. Score equivalent halves of the test (e.g., odd- and even-numbered items). (Pearson r and Spearman-Brown formula)
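
The split-half procedure can be sketched as follows. This is an illustrative example under stated assumptions: items are scored 0/1, the halves are the odd- and even-numbered items, and the half-test correlation is stepped up with the Spearman-Brown formula r_full = 2r/(1 + r). Function names and data are hypothetical.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two lists of the same length (at least 2)")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sum_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sum_xx = sum((x - mean_x) ** 2 for x in xs)
    sum_yy = sum((y - mean_y) ** 2 for y in ys)
    if sum_xx == 0 or sum_yy == 0:
        raise ValueError("scores in one list do not vary")
    return sum_xy / sqrt(sum_xx * sum_yy)


def spearman_brown(r_half):
    """Estimate full-test reliability from the half-test correlation."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(item_scores):
    """item_scores: one row per pupil, one 0/1 entry per item."""
    # Index 0 holds item 1, so even indices are the odd-numbered items.
    odd_half = [sum(row[0::2]) for row in item_scores]
    even_half = [sum(row[1::2]) for row in item_scores]
    return spearman_brown(pearson_r(odd_half, even_half))


# Hypothetical responses of four pupils to four items.
responses = [
    [1, 1, 1, 1],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [1, 0, 1, 0],
]
print(split_half_reliability(responses))
```
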
  • 22. ACTIVITY NO.2 TEST THE RELIABILITY OF PERIODICAL TEST.
  • 23. Pearson r Standard Scores (Directions) 1. Begin by writing the pairs of scores to be studied in two columns. Be sure that the pair of scores for each pupil is in the same row. Label one set of scores X, the other Y. 2. Get the sum (∑) of the scores for each column. Divide the sum by the number of scores (N) in each column to get the mean. 3. Subtract the mean of X from each score in column X. Write the difference in column x. Be sure to include the algebraic sign.
  • 24. Pearson r Standard Scores (Directions) 4. Subtract the mean of Y from each score in column Y. Write the difference in column y. Don't forget the sign. 5. Square each score in column X. Enter each result under X². 6. Square each score in column Y. Enter each result under Y². 7. Compute the standard deviations of X and Y and enter the results under the columns SDx and SDy respectively.
  • 25. Pearson r Standard Scores (Directions) 8. Divide each entry in columns x and y by the standard deviations SDx and SDy respectively, and enter the results under Zx and Zy respectively. 9. Multiply Zx and Zy and enter the result under ZxZy. 10. Get the sum (∑) of ZxZy. 11. Apply the formula r = (∑ZxZy)/N.
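
The eleven directions above can be condensed into a short Python sketch. It assumes the standard deviations are computed in the population form (dividing by N), which is what makes r = (∑ZxZy)/N range from -1 to +1; the function name and the sample scores are hypothetical.

```python
from math import sqrt


def pearson_r_standard_scores(xs, ys):
    """Pearson r computed as the mean product of paired z-scores."""
    n = len(xs)
    if n != len(ys) or n == 0:
        raise ValueError("X and Y must be non-empty and the same length")

    # Step 2: means of each column.
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    # Steps 3-4: signed deviations from the mean (columns x and y).
    dev_x = [x - mean_x for x in xs]
    dev_y = [y - mean_y for y in ys]

    # Step 7: standard deviations (population form, dividing by N).
    sd_x = sqrt(sum(d * d for d in dev_x) / n)
    sd_y = sqrt(sum(d * d for d in dev_y) / n)
    if sd_x == 0 or sd_y == 0:
        raise ValueError("scores in one column do not vary")

    # Step 8: standard scores Zx and Zy.
    z_x = [d / sd_x for d in dev_x]
    z_y = [d / sd_y for d in dev_y]

    # Steps 9-11: sum the products ZxZy and divide by N.
    return sum(zx * zy for zx, zy in zip(z_x, z_y)) / n


# Hypothetical first and second administrations of a test to five pupils.
first = [10, 12, 15, 18, 20]
second = [11, 13, 14, 19, 21]
print(pearson_r_standard_scores(first, second))
```
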
  • 26. Interpretation of Coefficient of Correlation. Correlation is a measure of the relationship between two variables. Magnitude or size of relationship: 0.8 and above means high correlation; 0.5 means moderate correlation; 0.3 and below means low correlation. Direction of relationship: a negative coefficient means that as one variable increases, the other decreases; a positive coefficient means that as one variable increases, the other also increases.
  • 27. Interpretation of Coefficient of Variation. The coefficient of variation is defined as the ratio of the standard deviation to the mean and is usually expressed in percent: c.v. = (s.d./mean) x 100. Criteria: less than 10%, homogeneous; greater than 10%, heterogeneous.
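
A brief sketch of this criterion, following the definition above (standard deviation divided by the mean, expressed in percent). The population standard deviation is assumed, a value of exactly 10% is treated as heterogeneous by assumption because the slide does not cover it, and the function names and scores are hypothetical.

```python
from statistics import mean, pstdev


def coefficient_of_variation(scores):
    """Return the coefficient of variation of the scores, in percent."""
    average = mean(scores)
    if average == 0:
        raise ValueError("mean is zero; coefficient of variation is undefined")
    # pstdev is the population standard deviation (divides by N).
    return pstdev(scores) / average * 100


def describe_spread(cv_percent):
    # Assumption: exactly 10% is grouped with "heterogeneous".
    return "Homogeneous" if cv_percent < 10 else "Heterogeneous"


# Hypothetical test scores for eight pupils.
scores = [2, 4, 4, 4, 5, 5, 7, 9]
cv = coefficient_of_variation(scores)
print(cv, describe_spread(cv))
```
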
  • 28. REMEMBER: 1. Use item analysis procedures to check the quality of the test. The item analysis should be interpreted with care and caution. 2. A test is valid when it measures what it is supposed to measure. 3. A test is reliable when it is consistent.