Learning Analytics for the Evaluation of
Competencies and Behaviors in Serious Games
José A. Ruipérez Valiente — @JoseARuiperez — jruiperez@um.es
Introductions
Where are you coming from?
playful.mit.edu
@playfulMIT
What we believe in
Who we are
Main contributors to this research
José A. Ruipérez-Valiente
BEng Telecommunications Systems (UCAM),
MEng Telecommunications, MSc and PhD
Telematics (UC3M), Postdoc (MIT)
6 years working in learning analytics across
many objectives and contexts
Currently focused on large-scale trends in
MOOCs and game-based assessment
Juan de la Cierva Researcher at UMU and
affiliate at MIT Playful Journey Lab
YJ (Yoon Jeon) Kim
Executive Director of the Playful
Journey Lab at
MIT Open Learning
Assessment scientist
Focus on games and
playful approaches for
assessment
Topics related to this talk
- Games for Learning
- Game-based Assessment
- Learning Analytics
- … and Design (which cuts across numerous areas and applications)
Motivations
Why and how are we doing this?
A game is a voluntary interactive
activity, in which one or more players
follow rules that constrain their
behavior, enacting an artificial conflict
that ends in a quantifiable outcome.
~Eric Zimmerman (2004)
Why Games?
● Games are “flexible enough for players to
inhabit and explore through meaningful
play” (Salen & Zimmerman) (deep learning)
● The majority of children grow up playing games
● Learners have more freedom over how much
effort they choose to expend and how often
they fail and try again (Osterweil, 2014)
(real life)
Assessment is a process of reasoning
from evidence. Therefore, an
assessment is a tool designed to
observe students’ behavior and
produce data that can be used to draw
reasonable inferences about what
students know.
~ Bob Mislevy
Why Games for Assessment?
● Games incorporate multiple pathways to solution(s) where learners can make
meaningful choices and demonstrate multiple ways of solving problems
● Use complex and authentic problems → hard-to-measure constructs
o We need to assess 21st century skills
● Games are motivating and engaging → accurate assessment (Sundre &
Wise, 2003)
● It doesn’t feel like assessment (i.e. stealth assessment)
o Less stressful situations for students
Metaphor
The Broad view of Learning Analytics
…collection, analysis and reporting of data about learners and
their contexts, for purposes of understanding and optimising
learning and the environments in which it occurs…
Source: First Learning Analytics
and Knowledge Conference
The Learning Analytics data-driven Process
Raw data generation
Feature engineering
Visualizations
Recommendation
Report generator
Meaningful features
Which raw data is necessary?
What to do with the processed data?
What to obtain and how to do it?
Technology as an engine to enhance learning
Exploration, correlation, clustering, prediction, causes…
Learning environments
Conclusions generate feedback and close the LA loop
Game-based Assessment
Design, model implementation and evaluation process
Design, Development and Evaluation Process of Game-based
Assessment
Design
● Design and implementation of game system
○ Game mechanics that can generate evidence
about the constructs and a data infrastructure that
effectively stores that evidence
○ The most iterative step of the process with very
frequent playtesting
1. Start with paper prototypes
2. Move to rough digital prototypes
3. End with advanced digital prototypes
● Data collection
○ Diverse audiences and contexts
○ Very important for game mechanics and the technical side
○ Face-to-face playtesting
○ Amazon MTurk
Face-to-face playtesting
Amazon Mechanical Turk as part of the design process
Balance between Game Design and Assessment Design
Meet Shadowspect!
More at https://shadowspect.org/
Model development
● Implementation of the assessment machinery:
○ Process of turning evidence into constructs
○ Content knowledge assessment: Following a
traditional Evidence-centered Design
○ Cognitive and behavioral assessment: Combining
knowledge engineering process and ML with expert
labelling
● Data collection:
○ Same high school context, age, and settings
○ Two sessions of one hour each
○ Around 10 US high school classes and more than 200
students
Model development:
Content knowledge assessment
Implementation via Evidence-centered Design
Common Core Geometry Standards
● Competency model: We focus on the Common Core geometry standards
o MG.A.1: Use geometric shapes, their measures, and their properties to describe
objects (e.g., modeling a tree trunk or a human torso as a cylinder)
o GMD.B.4: Identify the shapes of two-dimensional cross-sections of
three-dimensional objects, and identify three-dimensional objects generated by rotations
of two-dimensional objects
o CO.A.5: Given a geometric figure and a rotation, reflection, or translation, draw the
transformed figure
o CO.B.6: Use geometric descriptions of rigid motions to transform figures and to
predict the effect of a given rigid motion on a given figure
ECD Summary for the Common Core Geometry Standards Assessment
● Collaboration between a geometry specialist, a game designer and an assessment designer
○ Evidence model: We design puzzles that generate evidence about the Common Core geometry standards
○ Task model: We map the relationship (none, weak or strong) of each puzzle to each standard
○ Assembly model: We put all the evidence from a student together to assess their content knowledge
○ Presentation & Delivery model: Reports and dashboards by student/standard. Difficulty by exercise
Puzzle      MG.A.1   GMD.B.4   …
Puzzle 1    Weak     Weak      …
Puzzle 2    None     None      …
…           …        …         …

Student     Puzzle 1         Puzzle 2           …
Student 1   OK, 1 attempt    OK, 3 attempts     …
Student 2   NA               Fail, 5 attempts   …
…           …                …                  …
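The task and assembly models above can be sketched in code. This is a minimal illustration rather than the project's implementation: the numeric weights for none/weak/strong, the entries for the hypothetical puzzle_3, and the scoring rule (weighted share of solved puzzles per standard) are all assumptions made for the example.

```python
# Hypothetical numeric weights for the none / weak / strong relationship levels.
WEIGHTS = {"none": 0.0, "weak": 0.5, "strong": 1.0}

# Task model: relationship of each puzzle to each standard.
# puzzle_1 and puzzle_2 follow the table above; puzzle_3 is invented.
PUZZLE_STANDARD_MAP = {
    "puzzle_1": {"MG.A.1": "weak", "GMD.B.4": "weak"},
    "puzzle_2": {"MG.A.1": "none", "GMD.B.4": "none"},
    "puzzle_3": {"MG.A.1": "strong", "GMD.B.4": "weak"},
}


def standard_scores(solved):
    """Assembly model sketch: weighted share of solved puzzles per standard.

    `solved` maps puzzle id -> True/False. Puzzles the student never
    attempted are left out and count neither for nor against them.
    """
    totals, earned = {}, {}
    for puzzle, was_solved in solved.items():
        for standard, level in PUZZLE_STANDARD_MAP[puzzle].items():
            weight = WEIGHTS[level]
            totals[standard] = totals.get(standard, 0.0) + weight
            if was_solved:
                earned[standard] = earned.get(standard, 0.0) + weight
    # A standard without any weighted evidence gets None instead of a score.
    return {
        standard: (earned.get(standard, 0.0) / total if total > 0 else None)
        for standard, total in totals.items()
    }


student_1 = {"puzzle_1": True, "puzzle_2": False, "puzzle_3": False}
print(standard_scores(student_1))
```

A per-student, per-standard dictionary like this is the kind of structure a report or dashboard by student/standard could be built on.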
Our simplified case scenario right now
Evidence → (map) → Standards
Model development:
Cognitive and Behavioral Assessment
Implementation via a Learning Analytics Knowledge Engineering Process
Knowledge Engineering Process
● We acquire knowledge about the construct that we want to measure
1. Reading about the construct
2. Conducting interviews with experts
3. Reviewing related scientific literature
● We algorithmically implement features that use the data/evidence that can inform the
construct that we want to measure
Our simplified case scenario now updates to:
[Diagram: Evidence, Constructs, Data and Features, connected by arrows labelled map, data schema, algorithms and inform]
Efficiency construct
- Efficiency is the ability to do things well, successfully, and without waste. It
often specifically comprises the capability of a specific application of effort
to produce a specific outcome with a minimum amount or quantity of
waste, expense, or unnecessary effort (Wikipedia)
Evidence in Shadowspect related to efficiency
● Ability to do things well:
○ Solving puzzles correctly
● Expense or effort:
○ Time invested
○ Number of attempts to solve a problem
Mapping evidence into necessary data in Shadowspect
● We need: puzzles solved correctly, time invested and attempts
○ Necessary types of events for that:
■ puzzle_start (timestamp, student, puzzle_id)
■ leave_to_menu (timestamp, student, puzzle_id)
■ puzzle_attempt (timestamp, student, puzzle_id, correct)
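The three event types can be written down as typed records. The sketch below is only one possible representation: the field names come from the list above, while the `Event` dataclass, the integer timestamps in seconds, and the validation rules are assumptions for illustration.

```python
from dataclasses import dataclass
from typing import Optional

EVENT_TYPES = {"puzzle_start", "leave_to_menu", "puzzle_attempt"}


@dataclass(frozen=True)
class Event:
    type: str
    timestamp: int  # assumed unit: seconds since the session started
    student: str
    puzzle_id: str
    correct: Optional[bool] = None  # only meaningful for puzzle_attempt

    def __post_init__(self):
        if self.type not in EVENT_TYPES:
            raise ValueError(f"unknown event type: {self.type}")
        # 'correct' must be present for attempts and absent otherwise.
        if (self.type == "puzzle_attempt") != (self.correct is not None):
            raise ValueError("'correct' must be set only for puzzle_attempt")


log = [
    Event("puzzle_attempt", 95, "s1", "p1", correct=True),
    Event("puzzle_start", 0, "s1", "p1"),
    Event("leave_to_menu", 120, "s1", "p1"),
]
# Features are computed in time order, so sort the log first.
log.sort(key=lambda e: e.timestamp)
print([e.type for e in log])
```

Validating events when they are created keeps malformed rows out of the later feature computation.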
What does data in Shadowspect actually look like?
Algorithm to compute features from data (simplified Python)
# Note: this is a VERY simplified version that does not aim to be the most efficient implementation.
# getStudentEvents is assumed to return the student's events as dicts, sorted by timestamp.
def computeEfficiencyFeatures(student):
    student_events = getStudentEvents(student)
    correct_puzzles = set()  # distinct puzzles solved correctly
    number_attempts = 0
    total_time = 0
    puzzle_start_event = None
    for event in student_events:
        if event['type'] == 'puzzle_start':
            puzzle_start_event = event
        elif event['type'] == 'leave_to_menu':
            # Only count time when a matching puzzle_start was seen
            if puzzle_start_event is not None:
                total_time += event['timestamp'] - puzzle_start_event['timestamp']
            puzzle_start_event = None
        elif event['type'] == 'puzzle_attempt':
            number_attempts += 1
            if event['correct']:
                correct_puzzles.add(event['puzzle_id'])
    n_correct = len(correct_puzzles)
    if n_correct == 0:
        return None, None  # ratios are undefined without a correct puzzle
    attempts_per_correct_problem = number_attempts / n_correct
    time_per_correct_problem = total_time / n_correct
    return attempts_per_correct_problem, time_per_correct_problem
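To see the two features on concrete numbers, the same logic can be run on a hand-made event list without the game backend. This is a sketch under stated assumptions: the function name `efficiency_features`, the dict-based events, the timestamps in seconds, and the choice to divide attempts and time by the number of distinct correct puzzles (so that the values match the feature names) are all illustrative.

```python
def efficiency_features(events):
    """Return (attempts per correct puzzle, time per correct puzzle)."""
    correct_puzzles = set()
    attempts = 0
    total_time = 0
    started_at = None
    for event in sorted(events, key=lambda e: e["timestamp"]):
        if event["type"] == "puzzle_start":
            started_at = event["timestamp"]
        elif event["type"] == "leave_to_menu" and started_at is not None:
            total_time += event["timestamp"] - started_at
            started_at = None
        elif event["type"] == "puzzle_attempt":
            attempts += 1
            if event["correct"]:
                correct_puzzles.add(event["puzzle_id"])
    if not correct_puzzles:
        return None, None  # ratios are undefined without a correct puzzle
    n_correct = len(correct_puzzles)
    return attempts / n_correct, total_time / n_correct


# Made-up log: one student, two puzzles, three attempts, 160 seconds in puzzles.
events = [
    {"type": "puzzle_start", "timestamp": 0, "student": "s1", "puzzle_id": "p1"},
    {"type": "puzzle_attempt", "timestamp": 40, "student": "s1", "puzzle_id": "p1", "correct": False},
    {"type": "puzzle_attempt", "timestamp": 90, "student": "s1", "puzzle_id": "p1", "correct": True},
    {"type": "leave_to_menu", "timestamp": 100, "student": "s1", "puzzle_id": "p1"},
    {"type": "puzzle_start", "timestamp": 110, "student": "s1", "puzzle_id": "p2"},
    {"type": "puzzle_attempt", "timestamp": 150, "student": "s1", "puzzle_id": "p2", "correct": True},
    {"type": "leave_to_menu", "timestamp": 170, "student": "s1", "puzzle_id": "p2"},
]
print(efficiency_features(events))  # (1.5, 80.0)
```

Lower values on both features would point to higher efficiency: fewer attempts and less time per solved puzzle.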
The previous general scenario
[Diagram: Evidence, Constructs, Data and Features, connected by arrows labelled map, data schema, algorithms and inform]
Model for efficiency in Shadowspect
Evidence
● Correct puzzles
● Time
● Number of attempts
Data
● puzzle_start
● leave_to_menu
● puzzle_attempt
Algorithms
● computeEfficiencyFeatures(student)
Features
● attempts_per_correct_problem
● time_per_correct_problem
Construct
● Efficiency
[Diagram arrows between the boxes are labelled map, data schema and inform]
Model development:
Cognitive and Behavioral Assessment
Implementation via Learning Analytics with Experts and Machine Learning
Expert Labelling and Machine Learning Process
● Two or more experts label text or video replays that can be visually assessed
○ We divide all level interactions into replays that can be labelled
○ Experts review replays and label them for each construct that we want to measure
■ They might use rubrics, and we look for inter-rater agreement between experts (Cohen’s kappa)
○ We implement a supervised machine learning assessment model based on these labels
● Challenges here include achieving good inter-rater agreement, technical logistics, replay
resolution and final implementation of the ML model
Example of simplified text replay: 1. Start puzzle – 2. Create shape square – 3. Move square – 4. Create cone –
5. Rotate cone – 6. Change perspective – 7. Snapshot – 8. Move cone – 9. Submit – 10. Puzzle correct
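Agreement between two experts can be quantified with Cohen's kappa, which corrects the observed agreement for the agreement expected by chance. The sketch below implements the standard two-rater formula in plain Python; the label values ("high"/"low") and the ten example replays are made up for illustration.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(labels_a)
    # Share of items on which both raters gave the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement from each rater's own label frequencies.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label for every item
    return (observed - expected) / (1 - expected)


# Hypothetical labels from two experts for ten replays of one construct.
expert_1 = ["high", "high", "low", "low", "high", "low", "high", "low", "high", "high"]
expert_2 = ["high", "high", "low", "high", "high", "low", "low", "low", "high", "high"]
print(round(cohens_kappa(expert_1, expert_2), 3))  # 0.583
```

Here the experts agree on 8 of 10 replays (0.8), chance agreement is 0.52, and kappa is (0.8 - 0.52) / (1 - 0.52) ≈ 0.583.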
Expert Labelling and Machine Learning Process
[Diagram: the Evidence, Constructs, Data and Features scenario (arrows labelled map, data schema, algorithms and inform), extended with expert assessment and ML/AI]
Evaluation
● We are not here yet! Future plans:
● Data collection:
○ Implementation as part of the curriculum in high
school classes
○ Demographic and school data with external measures
● Game analytics: How is the game being used by
students? Improvements, enjoyment…
● Model performance evaluation: How are the
models working? What do teachers think about
the models?
● Psychometric evaluation: Are our models
correlated with other external tests, e.g. traditional
geometry tests or validated spatial reasoning
instruments?
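One simple way to check whether a game-based measure is related to an external test is a Pearson correlation over students. The sketch below is only an illustration of that idea: the function name `pearson_r` and both score lists are made up, and a real psychometric evaluation would involve much more than one coefficient.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equally long lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equally long lists with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / sqrt(var_x * var_y)


# Made-up numbers, one entry per student.
game_scores = [0.2, 0.5, 0.4, 0.9, 0.7]  # hypothetical model output
test_scores = [55, 60, 58, 85, 80]       # hypothetical external test
print(round(pearson_r(game_scores, test_scores), 2))
```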
It’s time to say goodbye
But let’s conclude before that
Conclusions
● Alternative assessment method with great potential
○ Focus on complex constructs, can focus on the process (not only outcomes), is less stressful
and more enjoyable for students
● Highly challenging and multidisciplinary field, main problems:
○ Cost, scalability and generalization across GBA tools, model validity, trustworthiness, and
teacher literacy
● Some companies are already using GBA as part of pre-hiring
● Difference between Assessment and assessment
● Opportunities for collaboration!
Thank you!
José A. Ruipérez Valiente — @JoseARuiperez — jruiperez@um.es

Learning Analytics for the Evaluation of Competencies and Behaviors in Serious Games

  • 1. Learning Analytics for the Evaluation of Competencies and Behaviors in Serious Games José A. Ruipérez Valiente — @JoseARuiperez — jruiperez@um.es
  • 6. Main contributors to this research José A. Ruipérez-Valiente BEng Telecomunications Systems (UCAM), MEng Telecomunications, MSc y PhD Telematics (UC3M), Postdoc (MIT) 6 years working in learning analytics across many objectives and contexts Currently focused in large scale trends in MOOCs and game-based assessment Juan de la Cierva Researcher at UMU and affiliate at MIT Playful Journey Lab YJ (Yoon Jeon) Kim Executive Director Playful Journey Lab located at MIT Open Learning Assessment scientist Focus on games and playful approaches for assessment
  • 7. Topics related to this talk - Games for Learning - Game-based Assessment - Learning Analytics - … and Design (which is transverse to numerous areas and applications)
  • 8. Motivations Why and how are we doing this?
  • 9. A game is a voluntary interactive activity, in which one or more players follow rules that constrain their behavior, enacting an artificial conflict that ends in a quantifiable outcome. ~Eric Zimmerman (2004)
  • 10. Why Games? ● Games are “flexible enough for players to inhabit and explore through meaningful play” (Salen & Zimmerman) (deep learning) ● Majority of children grow up playing games ● Learners have more freedom related to how much effort they choose to expend, how often they fail and try again (Osterweil, 2014) (real life)
  • 11. Assessment is a process of reasoning from evidence. Therefore, an assessment is a tool designed to observe students’ behavior and produce data that can be used to draw reasonable inferences about what students know. ~ Bob Mislevy
  • 12. Why Games for Assessment? ● Games incorporate multiple pathways to solution(s) where learners can make meaningful choices and demonstrate multiple ways of solving problems ● Use complex and authentic problems → hard-to-measure constructs o We need to assess 21st century skills ● Games are motivating and engaging → accurate assessment (Sundre & Wise, 2003) ● It doesn’t feel like assessment (i.e. stealth assessment) o Less stresful situations for students
  • 14. The Broad view of Learning Analytics …collection, analysis and reporting of data about learners and their contexts, for purposes of understanding and optimising learning and the environments in which it occurs… Source: First Learning Analytics and Knowledge Conference
  • 15. The Learning Analytics data-driven Process Raw data generation Feature engineering Visualizations Recommendation Report generator Meaningful features Which raw data is necessary? What to do with the processed data? What to obtain and How to do it? Technology as an engine to enhance learning Exploration, Correlation, clustering, prediction, causes… Learning environments Conclusions generate feedback and close the LA loop
  • 16. Game-based Assessment Design, model implementation and evaluation process
  • 17. Design, Development and Evaluation Process of Game-based Assessment
  • 18. Design ● Design and implementation of game system ○ Game mechanics that can generate evidence from the constructs and a data infrastructure that effectively stores that evidence ○ The most iterative step of the process with very frequent playtesting 1. Start with paper prototypes 2. Move to drafty digital prototypes 3. End with advanced digital prototypes ● Data collection ○ Diverse audiences and contexts ○ Very important for game mechanics and tech side ○ Face-to-face playtesting ○ Amazon MTurk
  • 20. Amazon Mechanical Turk as part of the design process
  • 21. Amazon Mechanical Turk as part of the design process
  • 22. Balance between Game Design and Assessment Design
  • 23. Meet Shadowspect! More at https://shadowspect.org/
  • 24. Model development ● Implementation of the assessment machinery: ○ Process of turning evidence into constructs ○ Content knowledge assessment: Following a traditional Evidence-centered Design ○ Cognitive and behavioral assessment: Combining knowledge engineering process and ML with expert labelling ● Data collection: ○ Same high school context, age, and settings ○ Two sessions of one hour each ○ Around 10 US high school classes and more than 200 students
  • 25. Model development: Content knowledge assessment Implementation via Evidence-centered Design
  • 26.
  • 27. Common Core Geometry Standards ● Competency model: We focus on the Common Core geometry standards o MG.A.1: Use geometric shapes, their measures, and their properties to describe objects (e.g., modeling a tree trunk or a human torso as a cylinder) o GMD.B.4: Identify the shapes of two-dimensional cross-sections of three-dimensional objects, and identify three-dimensional objects generated by rotations of two-dimensional objects o CO.A.5: Given a geometric figure and a rotation, reflection, or translation, draw the transformed figure o CO.B.6: Use geometric descriptions of rigid motions to transform figures and to predict the effect of a given rigid motion on a given figure
  • 28. ECD Summary for Geometry Common Standards Assessment ● Collaboration with geometry specialist, game designer and assessment designer ○ Evidence model: We generate puzzles that generate evidence from the Geometry Common Standards ○ Task model: We map the relationship (none, weak or strong) of each puzzle with the common standard ○ Assembly model: We put all the evidence from a student together to assess their content knowledge ○ Presentation & Delivery model: Reports and dashboards by student/standard. Difficulty by exercise
        Puzzle    | MG.A.1 | GMD.B.4 | …
        Puzzle 1  | Weak   | Weak    | …
        Puzzle 2  | None   | None    | …
        …         | …      | …       | …

        Student   | Puzzle 1        | Puzzle 2          | …
        Student 1 | OK, # 1 attempt | OK, # 3 attempts  | …
        Student 1 | NA              | Fail, # 5 attempt | …
        …         | …               | …                 | …
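One way the assembly step could combine a student's puzzle outcomes with the puzzle-to-standard map is sketched below. This is only an illustration under stated assumptions, not the model used in Shadowspect: the numeric weights for none/weak/strong, the "Puzzle 3" row, the sample outcomes, and the `standard_scores` helper are all hypothetical.

```python
# Hypothetical numeric weights for the none/weak/strong relationship levels.
RELATION_WEIGHT = {"none": 0.0, "weak": 0.5, "strong": 1.0}

# Puzzle 1 and Puzzle 2 follow the table on the slide; Puzzle 3 is made up.
puzzle_standard_map = {
    "Puzzle 1": {"MG.A.1": "weak", "GMD.B.4": "weak"},
    "Puzzle 2": {"MG.A.1": "none", "GMD.B.4": "none"},
    "Puzzle 3": {"MG.A.1": "strong", "GMD.B.4": "weak"},
}


def standard_scores(outcomes, mapping):
    """Weighted share of solved evidence per standard.

    outcomes maps each attempted puzzle to True (solved) or False (failed).
    Returns None for a standard with no weighted evidence.
    """
    solved, attempted = {}, {}
    for puzzle, is_solved in outcomes.items():
        for standard, relation in mapping[puzzle].items():
            weight = RELATION_WEIGHT[relation]
            attempted[standard] = attempted.get(standard, 0.0) + weight
            if is_solved:
                solved[standard] = solved.get(standard, 0.0) + weight
    return {
        standard: (solved.get(standard, 0.0) / total if total > 0 else None)
        for standard, total in attempted.items()
    }


# Made-up outcomes for one student.
student_outcomes = {"Puzzle 1": True, "Puzzle 2": False, "Puzzle 3": False}
scores = standard_scores(student_outcomes, puzzle_standard_map)
```

A weighted ratio is only the simplest possible aggregation; the number of attempts per puzzle, which the slide also reports, is ignored here.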
  • 29. Our simplified case scenario right now Evidence Standards map
  • 30. Model development: Cognitive and Behavioral Assessment Implementation via a Learning Analytics Knowledge Engineering Process
  • 31. Knowledge Engineering Process ● We acquire knowledge about the construct that we want to measure 1. Reading about the construct 2. Conducting interviews with experts 3. Reviewing related scientific literature ● We algorithmically implement features that use the data/evidence that can inform the construct that we want to measure
  • 32. Our simplified case scenario now updates to: Evidence Constructs map Data Features data schema inform algorithms
  • 33. Efficiency construct - Efficiency is the ability to do things well, successfully, and without waste. It often specifically comprises the capability of a specific application of effort to produce a specific outcome with a minimum amount or quantity of waste, expense, or unnecessary effort (Wikipedia)
  • 34. Evidence in Shadowspect related to efficiency ● Ability to do things well: ○ Solving puzzles correctly ● Expense or effort: ○ Time invested ○ Number of attempts to solve a problem
  • 35. Mapping evidence into necessary data in Shadowspect ● We need: puzzles solved correctly, time invested and attempts ○ Necessary types of events for that: ■ puzzle_start (timestamp, student, puzzle_id) ■ leave_to_menu (timestamp, student, puzzle_id) ■ puzzle_attempt (timestamp, student, puzzle_id, correct)
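The three event types above could be represented in code roughly as follows. This is a minimal sketch, assuming timestamps are numeric seconds and that `correct` is only present on `puzzle_attempt` events; the `Event` class, the `validate` helper, and the sample log are hypothetical and do not reflect the actual Shadowspect data schema.

```python
from dataclasses import dataclass
from typing import Optional

EVENT_TYPES = {"puzzle_start", "leave_to_menu", "puzzle_attempt"}


@dataclass(frozen=True)
class Event:
    type: str                       # one of EVENT_TYPES
    timestamp: float                # seconds; the unit is an assumption
    student: str
    puzzle_id: str
    correct: Optional[bool] = None  # only set for 'puzzle_attempt'


def validate(event: Event) -> bool:
    """Check that an event carries exactly the fields its type requires."""
    if event.type not in EVENT_TYPES:
        return False
    if event.type == "puzzle_attempt":
        return event.correct is not None
    return event.correct is None


# Made-up log: one student starts a puzzle, fails once, succeeds, and leaves.
sample_log = [
    Event("puzzle_start", 0.0, "s1", "p1"),
    Event("puzzle_attempt", 40.0, "s1", "p1", correct=False),
    Event("puzzle_attempt", 75.0, "s1", "p1", correct=True),
    Event("leave_to_menu", 80.0, "s1", "p1"),
]
```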
  • 36. What does data in Shadowspect actually look like?
  • 37. Algorithm to compute features from data (Python-style sketch). Note: this is a VERY simplified version that does not aim to be the most effective implementation of this algorithm.

        def computeEfficiencyFeatures(student):
            # getStudentEvents is assumed to return the student's events in time order
            student_events = getStudentEvents(student)
            correct_puzzles = set()      # unique puzzles solved correctly
            number_attempts = 0
            total_time = 0
            puzzle_start_event = None
            for event in student_events:
                if event['type'] == 'puzzle_start':
                    puzzle_start_event = event
                elif event['type'] == 'leave_to_menu':
                    # only count time when a matching puzzle_start was seen
                    if puzzle_start_event is not None:
                        total_time += event['timestamp'] - puzzle_start_event['timestamp']
                    puzzle_start_event = None
                elif event['type'] == 'puzzle_attempt':
                    number_attempts += 1
                    if event['correct']:
                        correct_puzzles.add(event['puzzle_id'])
            if not correct_puzzles:
                return (None, None)      # features undefined without a solved puzzle
            attempts_per_correct_problem = number_attempts / len(correct_puzzles)
            time_per_correct_problem = total_time / len(correct_puzzles)
            return (attempts_per_correct_problem, time_per_correct_problem)
  • 38. The previous general scenario Evidence Constructs map Data Features data schema inform algorithms
  • 39. Model for efficiency in Shadowspect Evidence ● Correct puzzles ● Time ● Number attempts Data ● puzzle_start ● leave_to_menu ● puzzle_attempt data schema inform computeEfficiencyFeatures(student) Construct Efficiency Features attempts_per_correct_problem time_per_correct_problem map
  • 40. Model development: Cognitive and Behavioral Assessment Implementation via Learning Analytics with Experts and Machine Learning
  • 41. Expert Labelling and Machine Learning Process ● Two or more experts label text or video replays that can be visually assessed ○ We divide all level interactions in replays that can be labelled ○ Experts review replays and label them for each construct that we want to measure ■ They might use rubrics and we are looking for expert inter-agreement (Cohen’s kappa) ○ We implement a supervised machine learning assessment model based on these labels ● Challenges here include achieving good inter-agreement, technical logistics, replay resolution and final implementation of the ML model Example of simplified text replay: 1. Start puzzle – 2. Create shape square – 3. Move square – 4. Create cone – 5. Rotate cone – 6. Change perspective – 7. Snapshot – 8. Move cone – 9. Submit – 10. Puzzle correct
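The inter-agreement check mentioned above can be illustrated with a small computation of Cohen's kappa for two raters: observed agreement corrected for the agreement expected by chance. This is a sketch with made-up data; the "high"/"low" labels and the two expert label lists are hypothetical and not taken from the study.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(labels_a)
    # Proportion of items on which the two raters agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from each rater's label frequencies.
    counts_a, counts_b = Counter(labels_a), Counter(labels_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label for every item
    return (observed - expected) / (1 - expected)


# Made-up labels from two experts for six replays.
expert_1 = ["high", "high", "low", "low", "high", "low"]
expert_2 = ["high", "low", "low", "low", "high", "low"]
kappa = cohens_kappa(expert_1, expert_2)
```

Here the experts agree on five of six replays while chance agreement is 0.5, so kappa comes out at about 0.67.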
  • 42. Expert Labelling and Machine Learning Process Evidence Constructs map Data Features data schema inform algorithms expert assessment ML/AI
  • 43. Evaluation ● We are not here yet! Future plans: ● Data collection: ○ Implementation as part of the curriculum in high school classes ○ Demographic and school data with external measures ● Game analytics: How is the game being used by students? Improvements, enjoyment… ● Model performance evaluation: How are the models working? What do teachers think about models? ● Psychometric evaluation: Are our models correlated with other external tests, e.g. traditional geometry tests or validated spatial reasoning instruments?
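The planned psychometric check, whether model outputs correlate with an external test, could start from something as simple as a Pearson correlation. The sketch below is illustrative only: the `pearson` helper and both score lists are made up, and a real validation study would need far more than a single coefficient.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Made-up scores: game-based model output vs. an external geometry test.
model_scores = [0.2, 0.5, 0.4, 0.9, 0.7]
external_test = [55, 60, 68, 90, 82]
r = pearson(model_scores, external_test)
```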
  • 44. It’s time to say goodbye But let’s conclude before that
  • 45. Conclusions ● Alternative assessment method with great potential ○ Focus on complex constructs, can focus on the process (not only outcomes), is less stressful and more enjoyable for students ● Highly challenging and multidisciplinary field, main problems: ○ Cost, scalability and generalization across GBA tools, model validity, trustworthiness, and teacher literacy ● Some companies are already using GBA as part of pre-hiring ● Difference between Assessment and assessment ● Opportunities for collaboration!
  • 46. Thank you! José A. Ruipérez Valiente — @JoseARuiperez — jruiperez@um.es

Editor's Notes

  1. begins by identifying what should be assessed in terms of knowledge, skills, or other learner attributes. These variables cannot be observed directly, so behaviors and performances that demonstrate these variables need to be identified instead. The next step is determining the types of tasks or situations that would draw out such behaviors or performances. Example around simple math knowledge in a game: