Principles of test design
Objectives, measurement qualities, challenges
KAYSERI, 29-31 JANUARY 2014

Aims
• Defining our assessment objectives
• Measurement qualities: “test usefulness”
• Testing challenges
• Considering impact

Testing: a definition
• “the systematic gathering of information for the purpose of making decisions” (Weiss 1972)
• “the collection of reliable and relevant information” (Bachman 1990)
• What information?
• What decisions?

Defining our test objectives
1) What kind of test is it going to be?
2) What decisions do we need information for?
3) What abilities are tested?
4) How much detail do we need?
5) How accurate should our results be?
6) Is there any washback effect?
7) What are the practical constraints?

Assessment in the classroom
• examinations
• formal tests
• oral tests
• home assignments
• term reports
• continuous assessment
• project work, research work
• student self-evaluation

Purposes of tests
• for placement or level testing: to decide on the level of learners
• for diagnostic testing: to identify individual strengths or weaknesses
• to evaluate progress
• to evaluate skills for specific needs
• to give marks for performance
• to evaluate and update syllabus and objectives
• also to prepare/train for exams

Types of tests (Madsen, 1983)
Contrasting categories of ESL tests
• Knowledge vs. Performance (or skills)
• Subjective vs. Objective
• Productive vs. Receptive
• Language sub-skills vs. Communication skills
• Norm-referenced vs. Criterion-referenced
• Discrete-point vs. Integrative
• Proficiency vs. Achievement (or progress)

Key principles of testing
• A correspondence between language test performance and language use
  ◦ “In order for a particular language test to be useful for its intended purposes, test performance must correspond in demonstrable ways to language use in non-test situations.”
• A clear and explicit definition of qualities of test usefulness
  ◦ “Usefulness = Reliability + Construct validity + Authenticity + Interactiveness + Impact + Practicality”
(Bachman & Palmer, 1996)

Qualities of test usefulness: reliability
• consistent across populations
• consistent within the same test
irrespective of:
  ◦ setting
  ◦ marker
  ◦ item set
When conditions of the test remain unchanged, it always produces
essentially the same results.
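
One common way to put a number on "consistent irrespective of marker" is to correlate the marks two markers give to the same scripts. The sketch below is an illustration only, not something taken from the slides: the Pearson correlation is just one possible index, the helper name `pearson` is made up for this example, and the marks are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of scores."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 scores")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of co-deviations and sums of squared deviations from each mean.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("scores must vary for the correlation to be defined")
    return cov / sqrt(var_x * var_y)


# Hypothetical marks for the same five scripts from two different markers.
marker_a = [12, 15, 9, 18, 14]
marker_b = [13, 15, 10, 17, 14]

agreement = pearson(marker_a, marker_b)
print(f"inter-marker correlation: {agreement:.2f}")
```

A value close to 1 suggests the two markers rank the scripts in much the same way. The same calculation could be applied to two sittings of the same test (setting) or to two parallel item sets, again as a rough indication rather than a full reliability study.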

Qualities of test usefulness: validity
Types of validity (Alderson et al., 1995)
• internal validity
  ◦ face validity
  ◦ content validity
  ◦ response validity
• external validity
  ◦ concurrent validity
  ◦ predictive validity
• construct validity
meaningful? appropriate? representative?

Construct validity
CHECKLIST FOR VALID TESTS
Tests what it claims to test?
Best format for what you want to test?
Content relevant to student’s real-life needs?
Items typical/representative of learner use?
Items don’t accidentally test other things?
Test skill, NOT knowledge, creativity or logic?
Clear and appropriate marking criteria?

Testing competence
from Language Testing in Practice by Bachman & Palmer (1996, OUP)

Misconceptions
• one best test for a given situation
• misunderstanding the nature of testing and test development
• unreasonable expectations
• blind faith in measurement

Resulting problems
• inappropriate tests
• failure to meet specific needs
• uninformed use of popular testing methods
• frustration about not finding the perfect test
• loss of faith in self → testing can only be done by experts
• defending the indefensible (stakeholders’ expectations)

Challenges in testing
• testing = “real life”? “authentic”?
• “communicative language testing”?
• defining abilities in a test
• selecting tasks and items → competence?
• measurement principles: can “unidimensional scores for locally independent test items reflect authentic language use”? (Bachman)
• outside factors → performance?

Thank you!

zoltan.rezmuves@digimedia.hu
consonantvoiced.blogspot.com

1.2 principles of test design: plenary CTS-Academic
