SlideShare uma empresa Scribd logo
1 de 30
The Art and Science of Test Development—Part F

Psychometric/technical statistical analysis: Internal


                             Kevin S. McGrew, PhD.

                              Educational Psychologist

                             Research Director
                         Woodcock-Muñoz Foundation




   The basic structure and content of this presentation is grounded extensively on the test
               development procedures developed by Dr. Richard Woodcock
“In god we trust….all others must show data”
               (unknown source)




                                 Test authors and
                                  publishers have
                                 standards-based
                              responsibility to provide
                              supporting psychometric
                              technical information re:
                                 tests and battery

                             Typically in the form of a series of
                             technical chapters in manual or a
                                separate technical manual
Calculate psychometric/measurement
statistics for technical manual/chapters




  Use Joint Test Standards as a guide
Calculate summary statistics (n, means, SDs, SEM) and
reliabilities for all tests and clusters by technical age groups



                                                                   etc…




etc…
Special reliability analyses required for speeded tests



        Traditional test-retest reliability analysis
Special reliability analyses for all tests
    More complex repeated measures reliability analysis
(McArdle and Woodcock, 1989—see WJ-R Technical Manual)
Provide evidence based on internal
    structure (internal validity)
Structural (Internal) Stage of Test Development

Purpose                Examine the internal relations among the measures used to
                       operationalize the theoretical construct domain (i.e., intelligence
                       or cognitive abilities)
Questions asked        Do the observed measures “behave” in a manner consistent
                       with the theoretical domain definition of intelligence?
Method and concepts    • Internal domain studies
                       • Item/subscale intercorrelations
                       • Exploratory/confirmatory factor analysis
Characteristics of     • Measures co-vary in a manner consistent with the intended
strong test validity     theoretical structure
program                • Factors reflect trait rather than method variance
                       • Items/measures are representative of the empirical domain
Structural/internal validity evidence: Test and cluster
        inter-correlation matrices by technical age groups




                                                                 etc…




etc…
Structural/internal
     validity

Confirmatory factor
 analysis by major
   age groups

(exploratory factor
   analysis if not
 theory-driven test
     blueprint)
Structural/internal validity Confirmatory factor
             analysis by major age groups

(exploratory factor analysis if not theory-driven test blueprint)




                                                  .53

                                                   .67
                                                   .40
                                                 .42

                                                  .43
Structural (Internal) Stage of Test Development

Purpose                Examine the internal relations among the measures used to
                       operationalize the theoretical construct domain (i.e., intelligence
                       or cognitive abilities)
Questions asked        Do the observed measures “behave” in a manner consistent
                       with the theoretical domain definition of intelligence?
Method and concepts    • Exploratory/confirmatory factor analysis

Characteristics of     • The theoretical/empirical model is deemed plausible
strong test validity     (especially when compared against other competing models)
program                  based on substantive and statistical criteria
Structural/internal validity: Confirmatory factor
                    analysis model comparisons by major age groups



              The WJ III factor structure model provided the best fit to the
                   data when compared to six alternative models




                                                        Fit Statistics
                Models              Chi-square      df       AIC                RMSEA
WJ III CHC 7-factor                 13189.16       536    13377.16         0.056 (0.055-0.057)
Gc/Gsm/Gs/Gv+Gf (WAIS 4-factor)     15113.99       537    15301.00         0.060 (0.059-0.061)
Gc/Gsm/Gq/Gv+Gf (SB IV 4-factor)    20379.58       537    20565.58         0.070 (0.069-0.071)
Gf-Gc Dichotomous (KAIT)            23145.12       549    23307.12         0.074 (0.073-0.075)
PASS 4-factor *                     25198.46       542    25374.46         0.077 (0.078-0.079)
g single factor                     65314.78       1170   65524.78         0.086 (0.085-0.086)
Null model                          215827.54      1219  215939.54         0.153 (0.153-0.154)


              The conclusion was the same across 5 age-differentiated samples
Internal validity evidence example: g-loadings for
differentially weighted General Intellectual Ability cluster
Provide evidence based on internal
structure: Developmental evidence?
Developmental evidence in the form of
differential growth curves of measures
Provide Test Fairness Evidence
Structural/internal validity

Evaluating structural invariance with Multiple Group CFA




                          =




 White                                        Non-White
Structural/internal validity

Evaluating structural invariance with Multiple Group CFA




                          =




 Male                                         Female
Structural/internal validity

Evaluating structural invariance with Multiple Group CFA




                          =




 Hispanic                                     Non-Hispanic
Test fairness evidence: Item Level Analyses:
      Differential Item Functioning (DIF)




                                           •Male/Female

                                         •White/Non-White

                                          •Hispanic/Non-
                                              Hispanic
Test fairness evidence: Item Level Analyses:
      Differential Item Functioning (DIF)




                                               •Male/Female

                                               •White/Non-
                                                 White

                                           •Hispanic/Non-
                                               Hispanic



                                       Results combined
                                      with results from Bias
                                       Sensitivity Review
                                              Panels
Lack of rigor and quality control in all prior/earlier stages will “rattle through the
data” and rear its ugly head when performing the final statistical analysis

Shorts cuts in prior stages will “bite you in in the ____” as you attempt to
perform final statistical analysis

Data screening, data screening, data screening!!!!……. prior to do performing
final statistical analysis
     • Compute extensive descriptive statistical analysis for all variables (e.g.,
     histograms, scatterplots, box-whisker plots, etc.)

     • More than means and SD’s. Also calculate median, skew, kurtosis, n-tiles,
     etc.

Deliberately planned and sophisticated “front end” data collection short-cuts
(e.g., matrix sampling) introduce an extreme level of “back end” complexity to
routine statistical/psychometric analysis

Know your limits, level of expertise, and skills. Even those with extensive test
development experience often need access to trusted measurement/statistical
consultants                                                              (cont. next slide)
Don’t be seduced and completely reliant on factor analysis as the primary internal/structural
validity tool

     • An example: Inability of CFA to differentiate closely related latent constructs (e.g., Gc and
     Reading/Writing—Grw) doesn’t prove they are the same. Need to examine other evidence
     (e.g., very different developmental growth curves for Gc and Grw)

Published statistics/psychometric information needs to be based on final publication length
tests
     • Often need to use test-length correction formula’s (e.g., KR-21) for test reliabilities

     • Correlations between short /and or long norming versions of a test, that differ in test length
     (number of items) from publication length test, may need special adjustments/corrections.

Back up, back up, back up!!!!!!!!!! Don’t let a dead hard drive or computer destroy your work
and progress. Do it constantly. Build redundancy into your files and people skill sets

Sad fact: Majority of test users do NOT pay attention to the fancy and special
psychometric/statistical analysis you report in technical chapters or manuals. Be prepared
for post-publication education via other methods.

Post-manual publication technical reports of special/sophisticated analyses are good when
publication time-line pressures dictate making difficult decisions.
Exploratory-driven confirmatory factor analysis is often used by test
developers to explore unexpected characteristics of tests (often called
“model generation modeling” in SEM/CFA literature)

Different approaches to DIF (differential item functioning)

Multiple group CFA to test invariance (by age, by gender, by……..)
     • Different degrees of measurement invariance can be tested

Traditional definition of psychometric bias and appropriate/inappropriate
statistical methods

Equating (e.g., Form A/B) methods and evidence

Methods for calculating prediction models that account for regression to the
mean and that are sensitive to developmental (age) X content interactions

Complex repeated measures reliability analyses to tease out test stability,
internal consistency, and trait stability sources of score variance (see WJ-R
Technical Manual)
End of Part F
  Additional steps in test development process will be
presented in subsequent modules as they are developed
Applied Psych Test Design: Part F--Psychometric/technical statistical analysis:  Internal
Applied Psych Test Design: Part F--Psychometric/technical statistical analysis:  Internal
Applied Psych Test Design: Part F--Psychometric/technical statistical analysis:  Internal

Mais conteúdo relacionado

Semelhante a Applied Psych Test Design: Part F--Psychometric/technical statistical analysis: Internal

Selecting an Ideal Survey Instrument for a Quantitative Study
Selecting an Ideal Survey Instrument for a Quantitative StudySelecting an Ideal Survey Instrument for a Quantitative Study
Selecting an Ideal Survey Instrument for a Quantitative StudyStatistics Solutions
 
General Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative MethodologyGeneral Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative MethodologyStatistics Solutions
 
Carma internet research module scale development
Carma internet research module   scale developmentCarma internet research module   scale development
Carma internet research module scale developmentSyracuse University
 
QualitativeAnalysis_W2015.ppt
QualitativeAnalysis_W2015.pptQualitativeAnalysis_W2015.ppt
QualitativeAnalysis_W2015.pptRabinThapa27
 
reliability and validity psychology 1234
reliability and validity psychology 1234reliability and validity psychology 1234
reliability and validity psychology 1234MajaAiraBumatay
 
Assessment Centers in recruitment & selection
Assessment Centers in recruitment & selectionAssessment Centers in recruitment & selection
Assessment Centers in recruitment & selectionRahila Narejo
 
Applied Psych Test Design: Part A--Planning, development frameworks & domain/...
Applied Psych Test Design: Part A--Planning, development frameworks & domain/...Applied Psych Test Design: Part A--Planning, development frameworks & domain/...
Applied Psych Test Design: Part A--Planning, development frameworks & domain/...Kevin McGrew
 
FOCUSING YOUR RESEARCH EFFORTS Planning Your Research
FOCUSING YOUR RESEARCH EFFORTS Planning Your Research FOCUSING YOUR RESEARCH EFFORTS Planning Your Research
FOCUSING YOUR RESEARCH EFFORTS Planning Your Research ShainaBoling829
 
Week 9 validity and reliability
Week 9 validity and reliabilityWeek 9 validity and reliability
Week 9 validity and reliabilitywawaaa789
 
Psychometrics for Clinical Skills Assessment
Psychometrics for Clinical Skills AssessmentPsychometrics for Clinical Skills Assessment
Psychometrics for Clinical Skills AssessmentINSPIRE_Network
 
Quantitative Research: Surveys and Experiments
Quantitative Research: Surveys and ExperimentsQuantitative Research: Surveys and Experiments
Quantitative Research: Surveys and ExperimentsMartin Kretzer
 
NG BB 33 Hypothesis Testing Basics
NG BB 33 Hypothesis Testing BasicsNG BB 33 Hypothesis Testing Basics
NG BB 33 Hypothesis Testing BasicsLeanleaders.org
 
NG BB 33 Hypothesis Testing Basics
NG BB 33 Hypothesis Testing BasicsNG BB 33 Hypothesis Testing Basics
NG BB 33 Hypothesis Testing BasicsLeanleaders.org
 
Psychological assessments and tests.pptx
Psychological assessments and tests.pptxPsychological assessments and tests.pptx
Psychological assessments and tests.pptxCafeWandererNoida
 

Semelhante a Applied Psych Test Design: Part F--Psychometric/technical statistical analysis: Internal (20)

Selecting an Ideal Survey Instrument for a Quantitative Study
Selecting an Ideal Survey Instrument for a Quantitative StudySelecting an Ideal Survey Instrument for a Quantitative Study
Selecting an Ideal Survey Instrument for a Quantitative Study
 
General Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative MethodologyGeneral Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative Methodology
 
Carma internet research module scale development
Carma internet research module   scale developmentCarma internet research module   scale development
Carma internet research module scale development
 
QualitativeAnalysis_W2015.ppt
QualitativeAnalysis_W2015.pptQualitativeAnalysis_W2015.ppt
QualitativeAnalysis_W2015.ppt
 
reliability and validity psychology 1234
reliability and validity psychology 1234reliability and validity psychology 1234
reliability and validity psychology 1234
 
Ch05 instrumentation
Ch05 instrumentationCh05 instrumentation
Ch05 instrumentation
 
Assessment Centers in recruitment & selection
Assessment Centers in recruitment & selectionAssessment Centers in recruitment & selection
Assessment Centers in recruitment & selection
 
Applied Psych Test Design: Part A--Planning, development frameworks & domain/...
Applied Psych Test Design: Part A--Planning, development frameworks & domain/...Applied Psych Test Design: Part A--Planning, development frameworks & domain/...
Applied Psych Test Design: Part A--Planning, development frameworks & domain/...
 
FOCUSING YOUR RESEARCH EFFORTS Planning Your Research
FOCUSING YOUR RESEARCH EFFORTS Planning Your Research FOCUSING YOUR RESEARCH EFFORTS Planning Your Research
FOCUSING YOUR RESEARCH EFFORTS Planning Your Research
 
data analysis.ppt
data analysis.pptdata analysis.ppt
data analysis.ppt
 
data analysis.pptx
data analysis.pptxdata analysis.pptx
data analysis.pptx
 
Week 9 validity and reliability
Week 9 validity and reliabilityWeek 9 validity and reliability
Week 9 validity and reliability
 
Psychometrics for Clinical Skills Assessment
Psychometrics for Clinical Skills AssessmentPsychometrics for Clinical Skills Assessment
Psychometrics for Clinical Skills Assessment
 
ES_140_METHODS_OF_RESEARCH.pdf
ES_140_METHODS_OF_RESEARCH.pdfES_140_METHODS_OF_RESEARCH.pdf
ES_140_METHODS_OF_RESEARCH.pdf
 
Quantitative Research: Surveys and Experiments
Quantitative Research: Surveys and ExperimentsQuantitative Research: Surveys and Experiments
Quantitative Research: Surveys and Experiments
 
NG BB 33 Hypothesis Testing Basics
NG BB 33 Hypothesis Testing BasicsNG BB 33 Hypothesis Testing Basics
NG BB 33 Hypothesis Testing Basics
 
NG BB 33 Hypothesis Testing Basics
NG BB 33 Hypothesis Testing BasicsNG BB 33 Hypothesis Testing Basics
NG BB 33 Hypothesis Testing Basics
 
Psychological assessments and tests.pptx
Psychological assessments and tests.pptxPsychological assessments and tests.pptx
Psychological assessments and tests.pptx
 
Psychological assessments and tests.pptx
Psychological assessments and tests.pptxPsychological assessments and tests.pptx
Psychological assessments and tests.pptx
 
Vilnius pres dianne lalancette
Vilnius pres dianne lalancetteVilnius pres dianne lalancette
Vilnius pres dianne lalancette
 

Mais de Kevin McGrew

The Model of Achievement Competence Motivation (MACM) Part E: Crossing the R...
The Model of Achievement Competence Motivation (MACM) Part E:  Crossing the R...The Model of Achievement Competence Motivation (MACM) Part E:  Crossing the R...
The Model of Achievement Competence Motivation (MACM) Part E: Crossing the R...Kevin McGrew
 
The Model of Achievement Competence Motivation (MACM): Part D: The volition ...
The Model of Achievement Competence Motivation (MACM): Part D:  The volition ...The Model of Achievement Competence Motivation (MACM): Part D:  The volition ...
The Model of Achievement Competence Motivation (MACM): Part D: The volition ...Kevin McGrew
 
The Model of Achievement Competence Motivation (MACM) Part C: The motivation...
The Model of Achievement Competence Motivation (MACM) Part C:  The motivation...The Model of Achievement Competence Motivation (MACM) Part C:  The motivation...
The Model of Achievement Competence Motivation (MACM) Part C: The motivation...Kevin McGrew
 
The Model of Achievement Competence Motivation (MACM): Part B - An overview ...
The Model of Achievement Competence Motivation (MACM):  Part B - An overview ...The Model of Achievement Competence Motivation (MACM):  Part B - An overview ...
The Model of Achievement Competence Motivation (MACM): Part B - An overview ...Kevin McGrew
 
The Model of Achievement Competence Motivation (MACM): Part A Introduction o...
The Model of Achievement Competence Motivation (MACM):  Part A Introduction o...The Model of Achievement Competence Motivation (MACM):  Part A Introduction o...
The Model of Achievement Competence Motivation (MACM): Part A Introduction o...Kevin McGrew
 
The WJ IV Cognitive GIA in iintellectual disability (ID) assessment
The WJ IV Cognitive GIA in iintellectual disability (ID) assessmentThe WJ IV Cognitive GIA in iintellectual disability (ID) assessment
The WJ IV Cognitive GIA in iintellectual disability (ID) assessmentKevin McGrew
 
The Evolution of the Cattell-Horn-Carrol (CHC) Theory of Intelligence: Schne...
The Evolution of the Cattell-Horn-Carrol (CHC) Theory of Intelligence:  Schne...The Evolution of the Cattell-Horn-Carrol (CHC) Theory of Intelligence:  Schne...
The Evolution of the Cattell-Horn-Carrol (CHC) Theory of Intelligence: Schne...Kevin McGrew
 
Beyond cognitive abilities: An integrative model of learning-related persona...
Beyond cognitive abilities:  An integrative model of learning-related persona...Beyond cognitive abilities:  An integrative model of learning-related persona...
Beyond cognitive abilities: An integrative model of learning-related persona...Kevin McGrew
 
What about executive functions and CHC theory: New research for discussion
What about executive functions and CHC theory:  New research for discussionWhat about executive functions and CHC theory:  New research for discussion
What about executive functions and CHC theory: New research for discussionKevin McGrew
 
"Intelligent" intelligence testing with the WJ IV COG: Why do some individua...
"Intelligent" intelligence testing with the WJ IV COG:  Why do some individua..."Intelligent" intelligence testing with the WJ IV COG:  Why do some individua...
"Intelligent" intelligence testing with the WJ IV COG: Why do some individua...Kevin McGrew
 
CHC model of inteligence revised (v2.4). Has Glr been incorrectly conceptual...
CHC model of inteligence revised (v2.4).  Has Glr been incorrectly conceptual...CHC model of inteligence revised (v2.4).  Has Glr been incorrectly conceptual...
CHC model of inteligence revised (v2.4). Has Glr been incorrectly conceptual...Kevin McGrew
 
What is "intelligent" intelligence testing
What is "intelligent" intelligence testingWhat is "intelligent" intelligence testing
What is "intelligent" intelligence testingKevin McGrew
 
"intelligent" intelligence testing: Why do some individuals obtain markedly ...
"intelligent" intelligence testing:  Why do some individuals obtain markedly ..."intelligent" intelligence testing:  Why do some individuals obtain markedly ...
"intelligent" intelligence testing: Why do some individuals obtain markedly ...Kevin McGrew
 
"intelligent" intelligence testing: Evaluating wihtin CHC domain test score ...
"intelligent" intelligence testing:  Evaluating wihtin CHC domain test score ..."intelligent" intelligence testing:  Evaluating wihtin CHC domain test score ...
"intelligent" intelligence testing: Evaluating wihtin CHC domain test score ...Kevin McGrew
 
How to evaulate the unusualness (base rate) of WJ IV cluster or test score di...
How to evaulate the unusualness (base rate) of WJ IV cluster or test score di...How to evaulate the unusualness (base rate) of WJ IV cluster or test score di...
How to evaulate the unusualness (base rate) of WJ IV cluster or test score di...Kevin McGrew
 
The WJ IV and Beyond CHC Theory: Kevin McGrew's NASP mini-skills workshop
The WJ IV and Beyond CHC Theory:  Kevin McGrew's NASP mini-skills workshopThe WJ IV and Beyond CHC Theory:  Kevin McGrew's NASP mini-skills workshop
The WJ IV and Beyond CHC Theory: Kevin McGrew's NASP mini-skills workshopKevin McGrew
 
The WJ IV Measurement of Auditory Processing (Ga)
The WJ IV Measurement of Auditory Processing (Ga)The WJ IV Measurement of Auditory Processing (Ga)
The WJ IV Measurement of Auditory Processing (Ga)Kevin McGrew
 
Overview of the WJ IV Cognitive Battery: GIA and CHC Clusters
Overview of the WJ IV Cognitive Battery: GIA and CHC ClustersOverview of the WJ IV Cognitive Battery: GIA and CHC Clusters
Overview of the WJ IV Cognitive Battery: GIA and CHC ClustersKevin McGrew
 
CHC Theory Codebook 2: Cognitive definitions
CHC Theory Codebook 2:  Cognitive definitionsCHC Theory Codebook 2:  Cognitive definitions
CHC Theory Codebook 2: Cognitive definitionsKevin McGrew
 
CHC Theory Codebook 1: Cognitive definitions
CHC Theory Codebook 1:  Cognitive definitionsCHC Theory Codebook 1:  Cognitive definitions
CHC Theory Codebook 1: Cognitive definitionsKevin McGrew
 

Mais de Kevin McGrew (20)

The Model of Achievement Competence Motivation (MACM) Part E: Crossing the R...
The Model of Achievement Competence Motivation (MACM) Part E:  Crossing the R...The Model of Achievement Competence Motivation (MACM) Part E:  Crossing the R...
The Model of Achievement Competence Motivation (MACM) Part E: Crossing the R...
 
The Model of Achievement Competence Motivation (MACM): Part D: The volition ...
The Model of Achievement Competence Motivation (MACM): Part D:  The volition ...The Model of Achievement Competence Motivation (MACM): Part D:  The volition ...
The Model of Achievement Competence Motivation (MACM): Part D: The volition ...
 
The Model of Achievement Competence Motivation (MACM) Part C: The motivation...
The Model of Achievement Competence Motivation (MACM) Part C:  The motivation...The Model of Achievement Competence Motivation (MACM) Part C:  The motivation...
The Model of Achievement Competence Motivation (MACM) Part C: The motivation...
 
The Model of Achievement Competence Motivation (MACM): Part B - An overview ...
The Model of Achievement Competence Motivation (MACM):  Part B - An overview ...The Model of Achievement Competence Motivation (MACM):  Part B - An overview ...
The Model of Achievement Competence Motivation (MACM): Part B - An overview ...
 
The Model of Achievement Competence Motivation (MACM): Part A Introduction o...
The Model of Achievement Competence Motivation (MACM):  Part A Introduction o...The Model of Achievement Competence Motivation (MACM):  Part A Introduction o...
The Model of Achievement Competence Motivation (MACM): Part A Introduction o...
 
The WJ IV Cognitive GIA in iintellectual disability (ID) assessment
The WJ IV Cognitive GIA in iintellectual disability (ID) assessmentThe WJ IV Cognitive GIA in iintellectual disability (ID) assessment
The WJ IV Cognitive GIA in iintellectual disability (ID) assessment
 
The Evolution of the Cattell-Horn-Carrol (CHC) Theory of Intelligence: Schne...
The Evolution of the Cattell-Horn-Carrol (CHC) Theory of Intelligence:  Schne...The Evolution of the Cattell-Horn-Carrol (CHC) Theory of Intelligence:  Schne...
The Evolution of the Cattell-Horn-Carrol (CHC) Theory of Intelligence: Schne...
 
Beyond cognitive abilities: An integrative model of learning-related persona...
Beyond cognitive abilities:  An integrative model of learning-related persona...Beyond cognitive abilities:  An integrative model of learning-related persona...
Beyond cognitive abilities: An integrative model of learning-related persona...
 
What about executive functions and CHC theory: New research for discussion
What about executive functions and CHC theory:  New research for discussionWhat about executive functions and CHC theory:  New research for discussion
What about executive functions and CHC theory: New research for discussion
 
"Intelligent" intelligence testing with the WJ IV COG: Why do some individua...
"Intelligent" intelligence testing with the WJ IV COG:  Why do some individua..."Intelligent" intelligence testing with the WJ IV COG:  Why do some individua...
"Intelligent" intelligence testing with the WJ IV COG: Why do some individua...
 
CHC model of inteligence revised (v2.4). Has Glr been incorrectly conceptual...
CHC model of inteligence revised (v2.4).  Has Glr been incorrectly conceptual...CHC model of inteligence revised (v2.4).  Has Glr been incorrectly conceptual...
CHC model of inteligence revised (v2.4). Has Glr been incorrectly conceptual...
 
What is "intelligent" intelligence testing
What is "intelligent" intelligence testingWhat is "intelligent" intelligence testing
What is "intelligent" intelligence testing
 
"intelligent" intelligence testing: Why do some individuals obtain markedly ...
"intelligent" intelligence testing:  Why do some individuals obtain markedly ..."intelligent" intelligence testing:  Why do some individuals obtain markedly ...
"intelligent" intelligence testing: Why do some individuals obtain markedly ...
 
"intelligent" intelligence testing: Evaluating wihtin CHC domain test score ...
"intelligent" intelligence testing:  Evaluating wihtin CHC domain test score ..."intelligent" intelligence testing:  Evaluating wihtin CHC domain test score ...
"intelligent" intelligence testing: Evaluating wihtin CHC domain test score ...
 
How to evaulate the unusualness (base rate) of WJ IV cluster or test score di...
How to evaulate the unusualness (base rate) of WJ IV cluster or test score di...How to evaulate the unusualness (base rate) of WJ IV cluster or test score di...
How to evaulate the unusualness (base rate) of WJ IV cluster or test score di...
 
The WJ IV and Beyond CHC Theory: Kevin McGrew's NASP mini-skills workshop
The WJ IV and Beyond CHC Theory:  Kevin McGrew's NASP mini-skills workshopThe WJ IV and Beyond CHC Theory:  Kevin McGrew's NASP mini-skills workshop
The WJ IV and Beyond CHC Theory: Kevin McGrew's NASP mini-skills workshop
 
The WJ IV Measurement of Auditory Processing (Ga)
The WJ IV Measurement of Auditory Processing (Ga)The WJ IV Measurement of Auditory Processing (Ga)
The WJ IV Measurement of Auditory Processing (Ga)
 
Overview of the WJ IV Cognitive Battery: GIA and CHC Clusters
Overview of the WJ IV Cognitive Battery: GIA and CHC ClustersOverview of the WJ IV Cognitive Battery: GIA and CHC Clusters
Overview of the WJ IV Cognitive Battery: GIA and CHC Clusters
 
CHC Theory Codebook 2: Cognitive definitions
CHC Theory Codebook 2:  Cognitive definitionsCHC Theory Codebook 2:  Cognitive definitions
CHC Theory Codebook 2: Cognitive definitions
 
CHC Theory Codebook 1: Cognitive definitions
CHC Theory Codebook 1:  Cognitive definitionsCHC Theory Codebook 1:  Cognitive definitions
CHC Theory Codebook 1: Cognitive definitions
 

Último

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 

Último (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 

Applied Psych Test Design: Part F--Psychometric/technical statistical analysis: Internal

  • 1. The Art and Science of Test Development—Part F Psychometric/technical statistical analysis: Internal Kevin S. McGrew, PhD. Educational Psychologist Research Director Woodcock-Muñoz Foundation The basic structure and content of this presentation is grounded extensively on the test development procedures developed by Dr. Richard Woodcock
  • 2. “In god we trust….all others must show data” (unknown source) Test authors and publishers have standards-based responsibility to provide supporting psychometric technical information re: tests and battery Typically in the form of a series of technical chapters in manual or a separate technical manual
  • 3. Calculate psychometric/measurement statistics for technical manual/chapters Use Joint Test Standards as a guide
  • 4. Calculate summary statistics (n, means, SDs, SEM) and reliabilities for all tests and clusters by technical age groups etc… etc…
  • 5. Special reliability analyses required for speeded tests Traditional test-retest reliability analysis
  • 6. Special reliability analyses for all tests More complex repeated measures reliability analysis (McArdle and Woodcock, 1989—see WJ-R Technical Manual)
  • 7. Provide evidence based on internal structure (internal validity)
  • 8. Structural (Internal) Stage of Test Development Purpose Examine the internal relations among the measures used to operationalize the theoretical construct domain (i.e., intelligence or cognitive abilities) Questions asked Do the observed measures “behave” in a manner consistent with the theoretical domain definition of intelligence? Method and concepts • Internal domain studies • Item/subscale intercorrelations • Exploratory/confirmatory factor analysis Characteristics of • Measures co-vary in a manner consistent with the intended strong test validity theoretical structure program • Factors reflect trait rather than method variance • Items/measures are representative of the empirical domain
  • 9. Structural/internal validity evidence: Test and cluster inter-correlation matrices by technical age groups etc… etc…
  • 10. Structural/internal validity Confirmatory factor analysis by major age groups (exploratory factor analysis if not theory-driven test blueprint)
  • 11. Structural/internal validity Confirmatory factor analysis by major age groups (exploratory factor analysis if not theory-driven test blueprint) .53 .67 .40 .42 .43
  • 12.
  • 13. Structural (Internal) Stage of Test Development Purpose Examine the internal relations among the measures used to operationalize the theoretical construct domain (i.e., intelligence or cognitive abilities) Questions asked Do the observed measures “behave” in a manner consistent with the theoretical domain definition of intelligence? Method and concepts • Exploratory/confirmatory factor analysis Characteristics of • The theoretical/empirical model is deemed plausible strong test validity (especially when compared against other competing models) program based on substantive and statistical criteria
  • 14. Structural/internal validity: Confirmatory factor analysis model comparisons by major age groups The WJ III factor structure model provided the best fit to the data when compared to six alternative models Fit Statistics Models Chi-square df AIC RMSEA WJ III CHC 7-factor 13189.16 536 13377.16 0.056 (0.055-0.057) Gc/Gsm/Gs/Gv+Gf (WAIS 4-factor) 15113.99 537 15301.00 0.060 (0.059-0.061) Gc/Gsm/Gq/Gv+Gf (SB IV 4-factor) 20379.58 537 20565.58 0.070 (0.069-0.071) Gf-Gc Dichotomous (KAIT) 23145.12 549 23307.12 0.074 (0.073-0.075) PASS 4-factor * 25198.46 542 25374.46 0.077 (0.078-0.079) g single factor 65314.78 1170 65524.78 0.086 (0.085-0.086) Null model 215827.54 1219 215939.54 0.153 (0.153-0.154) The conclusion was the same across 5 age-differentiated samples
  • 15. Internal validity evidence example: g-loadings for differentially weighted General Intellectual Ability cluster
  • 16. Provide evidence based on internal structure: Developmental evidence?
  • 17. Developmental evidence in the form of differential growth curves of measures
  • 19. Structural/internal validity Evaluating structural invariance with Multiple Group CFA = White Non-White
  • 20. Structural/internal validity Evaluating structural invariance with Multiple Group CFA = Male Female
  • 21. Structural/internal validity Evaluating structural invariance with Multiple Group CFA = Hispanic Non-Hispanic
  • 22. Test fairness evidence: Item Level Analyses: Differential Item Functioning (DIF) •Male/Female •White/Non-White •Hispanic/Non- Hispanic
  • 23. Test fairness evidence: Item Level Analyses: Differential Item Functioning (DIF) •Male/Female •White/Non- White •Hispanic/Non- Hispanic Results combined with results from Bias Sensitivity Review Panels
  • 24. Lack of rigor and quality control in all prior/earlier stages will “rattle through the data” and rear its ugly head when performing the final statistical analysis Shorts cuts in prior stages will “bite you in in the ____” as you attempt to perform final statistical analysis Data screening, data screening, data screening!!!!……. prior to do performing final statistical analysis • Compute extensive descriptive statistical analysis for all variables (e.g., histograms, scatterplots, box-whisker plots, etc.) • More than means and SD’s. Also calculate median, skew, kurtosis, n-tiles, etc. Deliberately planned and sophisticated “front end” data collection short-cuts (e.g., matrix sampling) introduce an extreme level of “back end” complexity to routine statistical/psychometric analysis Know your limits, level of expertise, and skills. Even those with extensive test development experience often need access to trusted measurement/statistical consultants (cont. next slide)
  • 25. Don’t be seduced and completely reliant on factor analysis as the primary internal/structural validity tool • An example: Inability of CFA to differentiate closely related latent constructs (e.g., Gc and Reading/Writing—Grw) doesn’t prove they are the same. Need to examine other evidence (e.g., very different developmental growth curves for Gc and Grw) Published statistics/psychometric information needs to be based on final publication length tests • Often need to use test-length correction formula’s (e.g., KR-21) for test reliabilities • Correlations between short /and or long norming versions of a test, that differ in test length (number of items) from publication length test, may need special adjustments/corrections. Back up, back up, back up!!!!!!!!!! Don’t let a dead hard drive or computer destroy your work and progress. Do it constantly. Build redundancy into your files and people skill sets Sad fact: Majority of test users do NOT pay attention to the fancy and special psychometric/statistical analysis you report in technical chapters or manuals. Be prepared for post-publication education via other methods. Post-manual publication technical reports of special/sophisticated analyses are good when publication time-line pressures dictate making difficult decisions.
  • 26. Exploratory-driven confirmatory factor analysis is often used by test developers to explore unexpected characteristics of tests (often called “model generation modeling” in SEM/CFA literature) Different approaches to DIF (differential item functioning) Multiple group CFA to test invariance (by age, by gender, by……..) • Different degrees of measurement invariance can be tested Traditional definition of psychometric bias and appropriate/inappropriate statistical methods Equating (e.g., Form A/B) methods and evidence Methods for calculating prediction models that account for regression to the mean and that are sensitive to developmental (age) X content interactions Complex repeated measures reliability analyses to tease out test stability, internal consistency, and trait stability sources of score variance (see WJ-R Technical Manual)
  • 27. End of Part F Additional steps in test development process will be presented in subsequent modules as they are developed