SlideShare a Scribd company logo
1 of 17
RELIABILITY
DEFINITION OF TERMS
• Reliability: consistency in measurement
• Reliability Coefficient: the ratio between true score
variance on a test and the total variance
• Measurement Error: factors associated with the process
of measuring some variable other than what is being
measured
• Random Error: error due to measuring a targeted variable
caused by unpredictable fluctuations and inconsistencies
of other variables in the measurement process
• Systematic Error: error due to measuring a variable
typically constant or proportionate to what is presumed to
be the true value of variable being measured
SOURCES OF ERROR
• Test Construction
• Variation among items within a test and between tests
• Test Interpretation
• Error variances occurring test administration can cause
alterations in attention or motivation
• Test Scoring
• Objective tests have well-documented reliability
• Subjective tests can be subject to source of error
variabce
CLASSICAL TEST THEORY
• It assumes that each person would have a true score
obtained if there were no errors in measurement
• Errors of measurement are random
• True scores will not change with repeated
applications of the same test
• X (observed true score) = T (true score) + E (error)
DOMAIN SAMPLING
THEORY
• Uses a limited number of items to represent a larger
and more complicated construct
• Reliability is the ratio of variance on shorter test and
variance of the long-run true score
• The greater the number of items, the greater the
reliability
ITEM RESPONSE THEORY
(LATENT TRAIT THEORY)
• Models the probability that a person with X ability
will be able to perform at a level of Y
• Discrimination: how an item differentiates among
people with higher or lower levels of whatever is
being measured
GENERALIZABILITY
THEORY
• Universe score
• Facets: number of items, amount of training raters
had, and purpose of the test; must be similar
• Generalizability Study: how much impact the
different facets have on the test score; how can
scores be generalized in different situations
• Decision Study: usefulness of test in helping the user
to make decisions
TEST-RETEST
RELIABILITY
• It is an estimate of reliability obtained by correlating
pairs of scores from the same people on different
administrations
• It is appropriate when the test measures something
that is relatively stable over time
• Coefficient of Stability: when the interval between
the obtained measures was greater than six months.
ALTERNATE-FORMS &
PARALLEL-FORMS RELIABILITY
• Parallel-Forms Reliability: the extent to which item
sampling and other errors have affected the test
scores on versions of the same test; means and the
variances of observed test scores are equal.
• Alternate-Forms Reliability: the estimate by which
the different forms of the test have been affected by
item sampling error or others
• Coefficient of Equivalence: the degree of the
relationship between various forms of the test
SPLIT-HALF RELIABILITY
• Obtain through the correlation of two pairs of scores from
equivalent halves of a test that was administered once
• Also known as the odd-even reliability
• Spearman-Brown Formula: estimates the correlation
between two-halves if each had been the length of the
whole test. This can be used to estimate the effect of
shortening the items onto the reliability of the test and
determine the number of items needed to attained a
desired level of reliability
𝑐𝑜𝑟𝑟𝑒𝑐𝑡𝑒𝑑 𝑟 =
2𝑟
1 + 𝑟- r is the Pearson’s r between two halves
SPLIT-HALF RELIABILITY
• Cronbach’s Alpha: when two-halves have unequal
variances.
𝛼 =
2 𝜎𝑥
2
− 𝜎 𝑦1
2
𝜎 𝑦2
2
𝜎𝑥
2
- 𝜎 𝑥
2
variance of scores on the whole test
- 𝜎 𝑦1
2 𝜎 𝑦2
2 variance for two separate halves of the test
• Coefficient Omega: used to estimate the extent by
which all items measure the same underlying trait.
INTER-ITEM CONSISTENCY
• The degree of correlation among all items on a scale
• Can be used to measure the degree of homogeneity
• Kuder-Richardson Formula: measures the inter-item
consistency for dichotomous items
𝐾𝑅20 =
𝑘
𝑘 − 1
1 −
𝑝𝑞
𝜎2
INTER-ITEM CONSISTENCY
• Coefficient Alpha: measures the inter-item
consistency of nondichotomous items; mean of all
possible split-half correlations
𝛼 =
𝑁
𝑁 − 1
𝑆2
− Σ𝑆ⅈ2
𝑆2
• Average Proportional Distance: focuses on the
difference between item scores. A value of <0.20 is an
indicator of excellent internal consistency
INTER-SCORER RELIABILITY
• The degree of agreement between scorers with
regard to particular measures
• Fleiss Kappa: the actual agreement proportion of the
potential agreement following correction for chance
agreement
STANDARD ERROR OF
MEASUREMENT
• Provides a measure of precision of an observed test
score.
• The higher the test reliability is, the lower the
standard error of measurement
𝑆𝐸𝑀 = 𝜎 1 − 𝑟
STANDARD ERROR OF
MEASUREMENT
• Standard error of measurements are used for
constructing confidence intervals around specific
observed scores in the attempt to inform the
probability that the true score lies within the range of
scores
• Z-scores are commonly used
• Reporting:
“Given the assessee obtained a score of ____, there are
two out of three chances that the assessee’s true score
would fall between ____ and ___”
IMPROVING RELIABILITY
• Quality of test items
• Adequate sampling of content domains
• Longer assessment
• Develop a scoring plan
• Ensure validity

More Related Content

What's hot

Test Reliability and Validity
Test Reliability and ValidityTest Reliability and Validity
Test Reliability and ValidityBrian Ebie
 
Presentation validity
Presentation validityPresentation validity
Presentation validityAshMusavi
 
Validity and Reliability
Validity and ReliabilityValidity and Reliability
Validity and ReliabilityMaury Martinez
 
Validity, its types, measurement & factors.
Validity, its types, measurement & factors.Validity, its types, measurement & factors.
Validity, its types, measurement & factors.Maheen Iftikhar
 
Reliability and validity
Reliability and validityReliability and validity
Reliability and validityKaimrc_Rss_Jd
 
What is Reliability and its Types?
What is Reliability and its Types? What is Reliability and its Types?
What is Reliability and its Types? Dr. Amjad Ali Arain
 
Reliability
ReliabilityReliability
ReliabilityRoi Xcel
 
Validity and reliability in assessment.
Validity and reliability in assessment. Validity and reliability in assessment.
Validity and reliability in assessment. Tarek Tawfik Amin
 
Meaning and Methods of Estimating Reliability of Test.pptx
Meaning and Methods of Estimating Reliability of Test.pptxMeaning and Methods of Estimating Reliability of Test.pptx
Meaning and Methods of Estimating Reliability of Test.pptxsarat68
 
VALIDITY
VALIDITYVALIDITY
VALIDITYANCYBS
 
Psychological test adaptation
Psychological test adaptationPsychological test adaptation
Psychological test adaptationCarlo Magno
 
reliablity and validity in social sciences research
reliablity and validity  in social sciences researchreliablity and validity  in social sciences research
reliablity and validity in social sciences researchSourabh Sharma
 
Presentation Validity & Reliability
Presentation Validity & ReliabilityPresentation Validity & Reliability
Presentation Validity & Reliabilitysongoten77
 

What's hot (20)

Validity &amp; reliability
Validity &amp; reliabilityValidity &amp; reliability
Validity &amp; reliability
 
Test Reliability and Validity
Test Reliability and ValidityTest Reliability and Validity
Test Reliability and Validity
 
Reliability and validity
Reliability and validityReliability and validity
Reliability and validity
 
Presentation validity
Presentation validityPresentation validity
Presentation validity
 
Reliability
Reliability Reliability
Reliability
 
Validity and Reliability
Validity and ReliabilityValidity and Reliability
Validity and Reliability
 
Validity, its types, measurement & factors.
Validity, its types, measurement & factors.Validity, its types, measurement & factors.
Validity, its types, measurement & factors.
 
Reliability and validity
Reliability and validityReliability and validity
Reliability and validity
 
What is Reliability and its Types?
What is Reliability and its Types? What is Reliability and its Types?
What is Reliability and its Types?
 
Norms[1]
Norms[1]Norms[1]
Norms[1]
 
Reliability
ReliabilityReliability
Reliability
 
Validity and reliability
Validity and reliabilityValidity and reliability
Validity and reliability
 
Validity and reliability in assessment.
Validity and reliability in assessment. Validity and reliability in assessment.
Validity and reliability in assessment.
 
Meaning and Methods of Estimating Reliability of Test.pptx
Meaning and Methods of Estimating Reliability of Test.pptxMeaning and Methods of Estimating Reliability of Test.pptx
Meaning and Methods of Estimating Reliability of Test.pptx
 
VALIDITY
VALIDITYVALIDITY
VALIDITY
 
Reliability types
Reliability typesReliability types
Reliability types
 
Psychological test adaptation
Psychological test adaptationPsychological test adaptation
Psychological test adaptation
 
Validity in Assessment
Validity in AssessmentValidity in Assessment
Validity in Assessment
 
reliablity and validity in social sciences research
reliablity and validity  in social sciences researchreliablity and validity  in social sciences research
reliablity and validity in social sciences research
 
Presentation Validity & Reliability
Presentation Validity & ReliabilityPresentation Validity & Reliability
Presentation Validity & Reliability
 

Similar to Reliability

unit 9 measurements presentation- short.ppt
unit 9 measurements presentation- short.pptunit 9 measurements presentation- short.ppt
unit 9 measurements presentation- short.pptMitikuTeka1
 
Reliability of test
Reliability of testReliability of test
Reliability of testSarat Rout
 
Reliability and validity1
Reliability and validity1Reliability and validity1
Reliability and validity1MMIHS
 
Evaluation of Measurement Instruments.ppt
Evaluation of Measurement Instruments.pptEvaluation of Measurement Instruments.ppt
Evaluation of Measurement Instruments.pptCityComputers3
 
RELIABILITY AND VALIDITY
RELIABILITY AND VALIDITYRELIABILITY AND VALIDITY
RELIABILITY AND VALIDITYJoydeep Singh
 
Reliability and Validity.pptx
Reliability and Validity.pptxReliability and Validity.pptx
Reliability and Validity.pptxVandanaGaur8
 
Validity and reliability of the instrument
Validity and reliability of the instrumentValidity and reliability of the instrument
Validity and reliability of the instrumentBhumi Patel
 
Characteristics of a good test
Characteristics of a good testCharacteristics of a good test
Characteristics of a good testcyrilcoscos
 
Louzel Report - Reliability & validity
Louzel Report - Reliability & validity Louzel Report - Reliability & validity
Louzel Report - Reliability & validity Louzel Linejan
 
JC-16-23June2021-rel-val.pptx
JC-16-23June2021-rel-val.pptxJC-16-23June2021-rel-val.pptx
JC-16-23June2021-rel-val.pptxsaurami
 
Establishing the English Language Test Reliability
 Establishing the  English Language Test Reliability  Establishing the  English Language Test Reliability
Establishing the English Language Test Reliability Djihad .B
 
Reliability by Vartika Verma .pdf
Reliability by Vartika Verma .pdfReliability by Vartika Verma .pdf
Reliability by Vartika Verma .pdfVartika Verma
 
Qualities of Good Test.pdf
Qualities of Good Test.pdfQualities of Good Test.pdf
Qualities of Good Test.pdfFaheemGul17
 
Chapter 2 The Science of Psychological Measurement (Alivio, Ansula).pptx
Chapter 2 The Science of Psychological Measurement (Alivio, Ansula).pptxChapter 2 The Science of Psychological Measurement (Alivio, Ansula).pptx
Chapter 2 The Science of Psychological Measurement (Alivio, Ansula).pptxHazelLansula1
 
Lesson 11 Understanding Data and Ways to Systematically Collect Data.ppt
Lesson 11 Understanding Data and Ways to Systematically Collect Data.pptLesson 11 Understanding Data and Ways to Systematically Collect Data.ppt
Lesson 11 Understanding Data and Ways to Systematically Collect Data.pptGerfelChan1
 

Similar to Reliability (20)

unit 9 measurements presentation- short.ppt
unit 9 measurements presentation- short.pptunit 9 measurements presentation- short.ppt
unit 9 measurements presentation- short.ppt
 
Reliability of test
Reliability of testReliability of test
Reliability of test
 
Monika seminar
Monika seminarMonika seminar
Monika seminar
 
Monika seminar
Monika seminarMonika seminar
Monika seminar
 
Reliability and validity1
Reliability and validity1Reliability and validity1
Reliability and validity1
 
Evaluation of Measurement Instruments.ppt
Evaluation of Measurement Instruments.pptEvaluation of Measurement Instruments.ppt
Evaluation of Measurement Instruments.ppt
 
RELIABILITY AND VALIDITY
RELIABILITY AND VALIDITYRELIABILITY AND VALIDITY
RELIABILITY AND VALIDITY
 
variables cont
variables contvariables cont
variables cont
 
Reliability and Validity.pptx
Reliability and Validity.pptxReliability and Validity.pptx
Reliability and Validity.pptx
 
Validity and reliability of the instrument
Validity and reliability of the instrumentValidity and reliability of the instrument
Validity and reliability of the instrument
 
Characteristics of a good test
Characteristics of a good testCharacteristics of a good test
Characteristics of a good test
 
Louzel Report - Reliability & validity
Louzel Report - Reliability & validity Louzel Report - Reliability & validity
Louzel Report - Reliability & validity
 
JC-16-23June2021-rel-val.pptx
JC-16-23June2021-rel-val.pptxJC-16-23June2021-rel-val.pptx
JC-16-23June2021-rel-val.pptx
 
Edm 202
Edm 202Edm 202
Edm 202
 
Establishing the English Language Test Reliability
 Establishing the  English Language Test Reliability  Establishing the  English Language Test Reliability
Establishing the English Language Test Reliability
 
Reliability by Vartika Verma .pdf
Reliability by Vartika Verma .pdfReliability by Vartika Verma .pdf
Reliability by Vartika Verma .pdf
 
Qualities of Good Test.pdf
Qualities of Good Test.pdfQualities of Good Test.pdf
Qualities of Good Test.pdf
 
Chapter 2 The Science of Psychological Measurement (Alivio, Ansula).pptx
Chapter 2 The Science of Psychological Measurement (Alivio, Ansula).pptxChapter 2 The Science of Psychological Measurement (Alivio, Ansula).pptx
Chapter 2 The Science of Psychological Measurement (Alivio, Ansula).pptx
 
Lesson 11 Understanding Data and Ways to Systematically Collect Data.ppt
Lesson 11 Understanding Data and Ways to Systematically Collect Data.pptLesson 11 Understanding Data and Ways to Systematically Collect Data.ppt
Lesson 11 Understanding Data and Ways to Systematically Collect Data.ppt
 
Assessment ppt
Assessment pptAssessment ppt
Assessment ppt
 

More from Martin Vince Cruz, RPm (20)

Multivariatetechniques01
Multivariatetechniques01Multivariatetechniques01
Multivariatetechniques01
 
Late adulthood
Late adulthoodLate adulthood
Late adulthood
 
Emerging and Early Adulthood
Emerging and Early  AdulthoodEmerging and Early  Adulthood
Emerging and Early Adulthood
 
Middle and Late Childhood
Middle and Late ChildhoodMiddle and Late Childhood
Middle and Late Childhood
 
infancy
infancyinfancy
infancy
 
Introto lifespandevt
Introto lifespandevtIntroto lifespandevt
Introto lifespandevt
 
Feminist therapy
Feminist therapyFeminist therapy
Feminist therapy
 
Paraphilias
ParaphiliasParaphilias
Paraphilias
 
Somatic sexdysphoria
Somatic sexdysphoriaSomatic sexdysphoria
Somatic sexdysphoria
 
Anxiety disorders
Anxiety disordersAnxiety disorders
Anxiety disorders
 
Person centered therapy
Person centered therapyPerson centered therapy
Person centered therapy
 
Organizational culture
Organizational cultureOrganizational culture
Organizational culture
 
Anxiety disorders
Anxiety disordersAnxiety disorders
Anxiety disorders
 
Counselor: Person and Professional
Counselor: Person and ProfessionalCounselor: Person and Professional
Counselor: Person and Professional
 
Abnormal Behavior in the Historical Context
Abnormal Behavior in the Historical ContextAbnormal Behavior in the Historical Context
Abnormal Behavior in the Historical Context
 
George kelly
George kellyGeorge kelly
George kelly
 
Raymond cattell
Raymond cattellRaymond cattell
Raymond cattell
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis Testing
 
Using SPSS: A Tutorial
Using SPSS: A TutorialUsing SPSS: A Tutorial
Using SPSS: A Tutorial
 
Review of Statistics
Review of StatisticsReview of Statistics
Review of Statistics
 

Recently uploaded

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxnelietumpap1
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxMaryGraceBautista27
 

Recently uploaded (20)

YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptx
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptx
 

Reliability

  • 2. DEFINITION OF TERMS • Reliability: consistency in measurement • Reliability Coefficient: the ratio between true score variance on a test and the total variance • Measurement Error: factors associated with the process of measuring some variable other than what is being measured • Random Error: error due to measuring a targeted variable caused by unpredictable fluctuations and inconsistencies of other variables in the measurement process • Systematic Error: error due to measuring a variable typically constant or proportionate to what is presumed to be the true value of variable being measured
  • 3. SOURCES OF ERROR • Test Construction • Variation among items within a test and between tests • Test Interpretation • Error variances occurring test administration can cause alterations in attention or motivation • Test Scoring • Objective tests have well-documented reliability • Subjective tests can be subject to source of error variabce
  • 4. CLASSICAL TEST THEORY • It assumes that each person would have a true score obtained if there were no errors in measurement • Errors of measurement are random • True scores will not change with repeated applications of the same test • X (observed true score) = T (true score) + E (error)
  • 5. DOMAIN SAMPLING THEORY • Uses a limited number of items to represent a larger and more complicated construct • Reliability is the ratio of variance on shorter test and variance of the long-run true score • The greater the number of items, the greater the reliability
  • 6. ITEM RESPONSE THEORY (LATENT TRAIT THEORY) • Models the probability that a person with X ability will be able to perform at a level of Y • Discrimination: how an item differentiates among people with higher or lower levels of whatever is being measured
  • 7. GENERALIZABILITY THEORY • Universe score • Facets: number of items, amount of training raters had, and purpose of the test; must be similar • Generalizability Study: how much impact the different facets have on the test score; how can scores be generalized in different situations • Decision Study: usefulness of test in helping the user to make decisions
  • 8. TEST-RETEST RELIABILITY • It is an estimate of reliability obtained by correlating pairs of scores from the same people on different administrations • It is appropriate when the test measures something that is relatively stable over time • Coefficient of Stability: when the interval between the obtained measures was greater than six months.
  • 9. ALTERNATE-FORMS & PARALLEL-FORMS RELIABILITY • Parallel-Forms Reliability: the extent to which item sampling and other errors have affected the test scores on versions of the same test; means and the variances of observed test scores are equal. • Alternate-Forms Reliability: the estimate by which the different forms of the test have been affected by item sampling error or others • Coefficient of Equivalence: the degree of the relationship between various forms of the test
  • 10. SPLIT-HALF RELIABILITY • Obtain through the correlation of two pairs of scores from equivalent halves of a test that was administered once • Also known as the odd-even reliability • Spearman-Brown Formula: estimates the correlation between two-halves if each had been the length of the whole test. This can be used to estimate the effect of shortening the items onto the reliability of the test and determine the number of items needed to attained a desired level of reliability 𝑐𝑜𝑟𝑟𝑒𝑐𝑡𝑒𝑑 𝑟 = 2𝑟 1 + 𝑟- r is the Pearson’s r between two halves
  • 11. SPLIT-HALF RELIABILITY • Cronbach’s Alpha: when two-halves have unequal variances. 𝛼 = 2 𝜎𝑥 2 − 𝜎 𝑦1 2 𝜎 𝑦2 2 𝜎𝑥 2 - 𝜎 𝑥 2 variance of scores on the whole test - 𝜎 𝑦1 2 𝜎 𝑦2 2 variance for two separate halves of the test • Coefficient Omega: used to estimate the extent by which all items measure the same underlying trait.
  • 12. INTER-ITEM CONSISTENCY • The degree of correlation among all items on a scale • Can be used to measure the degree of homogeneity • Kuder-Richardson Formula: measures the inter-item consistency for dichotomous items 𝐾𝑅20 = 𝑘 𝑘 − 1 1 − 𝑝𝑞 𝜎2
  • 13. INTER-ITEM CONSISTENCY • Coefficient Alpha: measures the inter-item consistency of nondichotomous items; mean of all possible split-half correlations 𝛼 = 𝑁 𝑁 − 1 𝑆2 − Σ𝑆ⅈ2 𝑆2 • Average Proportional Distance: focuses on the difference between item scores. A value of <0.20 is an indicator of excellent internal consistency
  • 14. INTER-SCORER RELIABILITY • The degree of agreement between scorers with regard to particular measures • Fleiss Kappa: the actual agreement proportion of the potential agreement following correction for chance agreement
  • 15. STANDARD ERROR OF MEASUREMENT • Provides a measure of precision of an observed test score. • The higher the test reliability is, the lower the standard error of measurement 𝑆𝐸𝑀 = 𝜎 1 − 𝑟
  • 16. STANDARD ERROR OF MEASUREMENT • Standard error of measurements are used for constructing confidence intervals around specific observed scores in the attempt to inform the probability that the true score lies within the range of scores • Z-scores are commonly used • Reporting: “Given the assessee obtained a score of ____, there are two out of three chances that the assessee’s true score would fall between ____ and ___”
  • 17. IMPROVING RELIABILITY • Quality of test items • Adequate sampling of content domains • Longer assessment • Develop a scoring plan • Ensure validity