A Usability Analysis System for e-Learning Authoring Tools

Manuel Freire Morán (manuel.freire@fdi.ucm.es)
eMadrid seminar on Adaptive Systems - 7 October 2011
Presentation

Contents
   Introduction
   Story-line authoring
   Ratings and outcomes
   Time-lapse visualization
   Future work
   Concluding remarks

1/20
Introduction: authoring & tool evaluation

   System adoption requires content
      authoring
      reuse

   Better authoring requires creativity support
      imagine, create, play, share, reflect
         creative thinking spiral - Resnick, 2008
      low threshold, high ceiling, wide walls
      make it as simple as possible - and maybe simpler
         Shneiderman et al, 2006
      evaluate your tools

2/20
Story-line authoring: adventure game authoring

3/20
Story-line authoring: Weev

4/20
Story-line authoring: initial question

5/20
Ratings and outcomes: A-B testing with ratings

   A-B testing: very popular in web usability
      Split participants randomly into two groups
      Each group uses one interface variant (A or B)
      Outcomes of each group are compared

   Outcome of a creative task?
      Objective measures: correctness, completeness
      Subjective measures: questionnaires (author satisfaction), ratings (comparative quality)

6/20
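
The random split described above can be sketched in a few lines. This is a minimal illustration, not the code used in the experiment; the function name `split_ab` and the `seed` parameter are assumptions introduced here.

```python
import random

def split_ab(participants, seed=None):
    """Randomly split participants into two groups of (nearly) equal size.

    Hypothetical helper: the slides only state that the split is random.
    """
    shuffled = list(participants)
    random.Random(seed).shuffle(shuffled)
    half = len(shuffled) // 2
    return {"A": shuffled[:half], "B": shuffled[half:]}

# Example: 20 participants, 10 per interface variant.
groups = split_ab(range(20), seed=3)
```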
Ratings and outcomes: evaluating ratings

   N users are requested to rate M users each
      All users rated same number of times
      No user rated twice by same rater
      No user rates self

   Unknown individual rating distribution. How to determine if results are significant?
      H0: count(A rated better than B) is a fair coin toss
      one-sided binomial test

7/20
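
One way to satisfy the three assignment constraints is a shuffled circular shift: place the users in random order and let each one rate the next M users in that order. This is only a sketch of one possible construction; the slides do not say how the assignment was actually generated, and `assign_ratings` is a hypothetical name.

```python
import random

def assign_ratings(users, m, seed=None):
    """Return {rater: [users to rate]} via a shuffled circular shift.

    Assumed construction, not taken from the slides. With 0 < m < n:
    - every user is rated exactly m times,
    - no rater sees the same user twice,
    - no user rates their own work.
    """
    n = len(users)
    if not 0 < m < n:
        raise ValueError("need 0 < m < number of users")
    order = list(users)
    random.Random(seed).shuffle(order)
    return {
        order[i]: [order[(i + k) % n] for k in range(1, m + 1)]
        for i in range(n)
    }

# Example: 20 users, each rating 4 others.
assignment = assign_ratings([f"u{i}" for i in range(20)], m=4, seed=1)
```

A circular shift is simple to verify but gives neighbouring raters heavily overlapping sets; a real experiment may prefer a less regular design.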
Ratings and outcomes: statistical treatment

   [Figure: example ratings, each labelled with its interface variant]
      5/10 (a)   4/10 (b)   7/10 (a)   8/10 (a)   6/10 (b)
      1 x b>a
      4 x a>b

8/20
Ratings and outcomes: experiment

   Experiment with
      20 users (first-time Weev users)
      5-minute tutorial on tool use (applicable to A&B)
      "Little Red Riding-Hood" script and resources
      1 hour's time; after editing, rating screen

   Outcome
      70 total A vs B comparisons; in 40, B > A
      p-value: 0.141: does not reject null hypothesis

9/20
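
The reported outcome can be checked with an exact one-sided binomial tail, assuming the test counts each of the 70 comparisons as an independent fair coin toss under H0. The helper name `binomial_tail` is introduced here for illustration; the result should land close to the 0.141 on the slide.

```python
from math import comb

def binomial_tail(successes, trials, p=0.5):
    """P(X >= successes) for X ~ Binomial(trials, p)."""
    return sum(
        comb(trials, k) * p**k * (1 - p) ** (trials - k)
        for k in range(successes, trials + 1)
    )

# 70 comparisons, B rated better than A in 40 of them.
p_value = binomial_tail(40, 70)
print(f"one-sided p-value: {p_value:.3f}")
```

At the usual 0.05 level this value does not reject the null hypothesis, which matches the slide's conclusion.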
Ratings and outcomes: simulations

10/20
Ratings and outcomes: more simulations

   [Figure panels: "Increase ratings-per-user" and "Increase number of users"]

11/20
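
The slides do not describe how the simulations were run. A plausible sketch, under the assumption that they estimate how often the one-sided binomial test rejects H0 as the number of comparisons grows, is a small Monte Carlo loop. The true preference probability `q`, the 0.05 level, and both function names are hypothetical.

```python
import random
from math import comb

def critical_count(trials, alpha=0.05):
    """Smallest k with P(X >= k) <= alpha for X ~ Binomial(trials, 0.5)."""
    tail = 0.0
    for k in range(trials, -1, -1):
        tail += comb(trials, k) / 2**trials
        if tail > alpha:
            return k + 1
    return 0

def rejection_rate(trials, q, runs=2000, alpha=0.05, seed=0):
    """Fraction of simulated experiments in which H0 is rejected.

    Each comparison independently favours B with probability q
    (an assumed model; real ratings from one rater are correlated).
    """
    rng = random.Random(seed)
    threshold = critical_count(trials, alpha)
    rejected = 0
    for _ in range(runs):
        wins = sum(rng.random() < q for _ in range(trials))
        rejected += wins >= threshold
    return rejected / runs

# More comparisons (more users, or more ratings per user) raise the
# chance of detecting the same underlying preference.
for n in (70, 140, 280):
    print(n, rejection_rate(n, q=0.57, runs=1000))
```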
Ratings and outcomes: increasing ratings

12/20
Ratings and outcomes: increasing users

13/20
Time-lapse visualization: idea

   UI instrumentation sends data to server
      server timestamps records for each user
         UI screen captures
         Action logs
      last screen capture sample used for rating
      ratings-file is just one more record type

   How do you begin to analyze this data?
      Build videos from screen-captures
      Visualization tool to scan time-lapse data

14/20
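
The record model above can be sketched as a small in-memory store. This is an illustration only: the class names, field names, and record kinds ("capture", "log", "rating") are assumptions, and the real server is not described in the slides beyond the bullet points.

```python
import time
from dataclasses import dataclass, field

@dataclass
class Record:
    user: str
    kind: str         # assumed kinds: "capture", "log", "rating"
    payload: bytes
    timestamp: float  # assigned by the server on arrival, not by the client

@dataclass
class RecordStore:
    records: dict = field(default_factory=dict)  # user -> list of Record

    def receive(self, user, kind, payload, now=None):
        """Timestamp an incoming record and file it under its user."""
        stamp = time.time() if now is None else now
        record = Record(user, kind, payload, stamp)
        self.records.setdefault(user, []).append(record)
        return record

    def last_capture(self, user):
        """Latest screen capture for a user: the sample shown to raters."""
        captures = [r for r in self.records.get(user, []) if r.kind == "capture"]
        return max(captures, key=lambda r: r.timestamp, default=None)

store = RecordStore()
store.receive("u1", "capture", b"png-1", now=10.0)
store.receive("u1", "log", b"add node", now=12.0)
store.receive("u1", "capture", b"png-2", now=20.0)
final = store.last_capture("u1")
```

Treating ratings as one more record kind means the same `receive` path handles them, which is the point made on the slide.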
Time-lapse visualization: interface

   [Screenshot with labelled areas:]
      user / record selection
      time-lapse view of records for first selected user
      time-lapse view for next selected user
      detail / difference view

15/20
Time-lapse visualization: insights

   [Screenshots labelled:]
      User is stuck
      Creative flow
      Rectangular placement

16/20
Time-lapse visualization: tasks and directions

   Tasks for UI evaluation
      What did this user do?
       - glance at timeline
      What happened between here and there?
       - use difference function

   Still missing
      Variable-length time lapses ("time zoom")
      Graphical display of textual log data
      Filter & Sort

17/20
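
The slides do not say how the difference function works. One simple reading is a per-pixel comparison of two screen captures; the sketch below assumes captures are available as equally sized 2D lists of grayscale values, and both function names are hypothetical.

```python
def difference(before, after, threshold=0):
    """Return a same-sized mask that is True where two captures differ.

    Assumes captures are 2D lists of grayscale values (0-255).
    """
    same_shape = len(before) == len(after) and all(
        len(a) == len(b) for a, b in zip(before, after)
    )
    if not same_shape:
        raise ValueError("captures must have the same dimensions")
    return [
        [abs(a - b) > threshold for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(before, after)
    ]

def changed_fraction(mask):
    """Share of pixels that changed between the two captures."""
    cells = [cell for row in mask for cell in row]
    return sum(cells) / len(cells) if cells else 0.0

before = [[0, 0, 0], [0, 0, 0]]
after = [[0, 255, 0], [0, 0, 0]]
mask = difference(before, after)
```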
Future work: next experiment

   Server
      Finer-grained data collection
      Better rating distributions, more ratings
      Support for experimenter note-taking

   Analysis tool improvements

   Test new automatic layout assistant
      Do people use it?
      Do people rate "automatically-improved" storylines as better than before?

18/20
Concluding remarks

   A/B usability testing methodology for creative/aesthetic tasks

   Reusable server
      Collects images, logs, rating results
      Configurable rating, allows real-time monitoring

   Reusable client
      Specific handling for each record-type (e.g. logs)

19/20
Questions or comments?

20/20
