SlideShare uma empresa Scribd logo
1 de 13
Outliers in usability testing:
How to treat usability problems found for only
one test participant
      Asbjørn Følstad, SINTEF
      Effie Lai-Chong Law, University of Leicester
      Kasper Hornbæk, University of Copenhagen

      NordiCHI 2012
Content

1   Single-user problems
2   Yes, they are abundant
3   … but how to deal with them?
4   Current practices – straight from the horse's mouth
5   Recommendations




                                                     2
Single-user problems


                       The problems backed up
                       with data from only a
                       single participant in
                       usability test




                                                3
Single-user problems

                       Are they relevant?
                       May be infrequent usability
                       problems
                       - Point estimate: .25 (LaPlace)
                       - 95% conf. int.: .01-.58 (Adj. Wald)


                       Are they valid?
                       May be an artefact of the test
                       situation
                       "there is always a risk of being misled
                       by the spurious behavior of a single
                       person" (Nielsen, 2000 – useit.com)

                                                         4
Single-user problems are abundant

                                             Office system eval.
                Content
                                                15 participants                         Nielsen and
              management                                                                  Landauer
              system eval.                        77 of 145                                  (1993)
                                               problems single-
             17 participants
                                                user problems
            41 of 88 problems
Law and        single-user
Hvannberg       problems
(2004)

                                Law, E.L.-C. Hvannberg, E.T. Analysis of Combinatorial User Effect in
                                International Usability Tests. In Proc. CHI '04, ACM Press (2004), 9-16.

                                Nielsen, J., Landauer, T.K. A mathematical model of the finding of usability
                                problems. In Proc. CHI '93, ACM Press (1993), 206-213.                 5
Advice on how to deal with them is scarce
                               View "unique
  Short discussion of          problems as noise                          Kjeldskov, Skov
  single-user problems.        rather than real                           and Stage
  Recommend to report          usability problems"                        (2004)
  these as outliers            in study of instant
                               data analysis

                               Report single user
                               problems as real
                               problems in a stress-
                               test of problem                            Woolrych
                               predictions                                and Cockton
                                                                          (2001)

                          Kjeldskov J., Skov M. B., Stage J. Instant Data Analysis: Evaluating
                          Usability in a Day. In Proc. NordiCHI '04, ACM Press (2004), 233-240.

                          Woolrych, A., Cockton, G. Why and when five test users aren’t enough.
                          In Proc. IHM-HCI 2001, Cépadèus Éditions (2001), 105-108.      6
Asking the practitioners for current practices
Opportunity: Larger survey on
analysis practices in usability
evaluation

Included question on single-user
problems

89 usability practitioners answered
this particular question

Median 6 yrs. work experience

17 different countries

Usability tests with median of 8
user participants

                                                 7
Potential outcomes for single-user problems


                                 8 accept

Participants as divided as the
                                 4 classify as low priority
little advice provided in the
literature                       4 record as outlier

                                 6 reject

                                 (22 items total on this theme)




                                                                  8
Relevant conditions when making the call
                             18 Problem severity

                             9 Test participants' profile

A range of conditions        6 Sample size
reported as relevant. But
some maybe deserving to be
reported more often?         6 Artifact of the test situation?

                             5 Task importance

                             5 Other

                             (49 items total on this theme)

                                                              9
Resources and strategies when making the call
                             9 Discuss with experts or team
                             members
20 reported to rely on own   9 New/extended evaluations
professional knowledge and   8 Check against heuristics /
experience.
                             guidelines / principles
However, several potential
useful resources and         […]
strategies were reported
                             3 Specific process or policy
                             2 Confirmed hypotheses /
                             previous experiences
                             2 Debrief with users
                             (63 items total on this theme)
                                                              10
When considering the study findings …


  How did you handle
  single-user problems
  in your latest
  usability test?




                                        11
Recommendations

1   Procedure for handling single-user problems
2   Pay particular attention to sample size

3
    Check against knowledge resources –
    guidelines, heuristics, or previous evaluations
4
    Seek advice - from experts or team members
5   Be alert: Artefact of the test situation?



                                                      12
Thank you




            13

Mais conteúdo relacionado

Semelhante a Single-user problems

Failure analysis integrated multi stakeholder mental model and project life c...
Failure analysis integrated multi stakeholder mental model and project life c...Failure analysis integrated multi stakeholder mental model and project life c...
Failure analysis integrated multi stakeholder mental model and project life c...Piriya Uraiwong
 
Human Computer Interaction - Heuristic Evaluation
Human Computer Interaction - Heuristic EvaluationHuman Computer Interaction - Heuristic Evaluation
Human Computer Interaction - Heuristic Evaluationemmadmd
 
Mobile Apps Usability - Intro
Mobile Apps Usability - IntroMobile Apps Usability - Intro
Mobile Apps Usability - IntroSoleh Al Ayubi
 
2 Studies UX types should know about (Straub UXPA unconference13)
2 Studies UX types should know about (Straub UXPA unconference13)2 Studies UX types should know about (Straub UXPA unconference13)
2 Studies UX types should know about (Straub UXPA unconference13)Kath Straub
 
Do you really need to test with only 5 users
Do you really need to test with only 5 usersDo you really need to test with only 5 users
Do you really need to test with only 5 usersVictoria Bondarchuk
 
The role of systems analysis in co-learning. Walter Rossing
The role of systems analysis in co-learning. Walter RossingThe role of systems analysis in co-learning. Walter Rossing
The role of systems analysis in co-learning. Walter RossingJoanna Hicks
 
WQD2011 – INNOVATION – BRONZE WINNER – Tawam Hospital - Innovative Approaches...
WQD2011 – INNOVATION – BRONZE WINNER – Tawam Hospital - Innovative Approaches...WQD2011 – INNOVATION – BRONZE WINNER – Tawam Hospital - Innovative Approaches...
WQD2011 – INNOVATION – BRONZE WINNER – Tawam Hospital - Innovative Approaches...Dubai Quality Group
 
Research methods for socio-technical systems analysis (LSCITS EngD 2012)
Research methods for socio-technical systems analysis (LSCITS EngD 2012)Research methods for socio-technical systems analysis (LSCITS EngD 2012)
Research methods for socio-technical systems analysis (LSCITS EngD 2012)Ian Sommerville
 
Bug debug keynote - Present problems and future solutions
Bug debug keynote - Present problems and future solutionsBug debug keynote - Present problems and future solutions
Bug debug keynote - Present problems and future solutionsRIA RUI Society
 
IITSEC Presentation on Learning in Virtual Worlds
IITSEC Presentation on Learning in Virtual WorldsIITSEC Presentation on Learning in Virtual Worlds
IITSEC Presentation on Learning in Virtual Worldstaoirene
 
Approaches to Preservation Storage Technologies
Approaches to Preservation Storage Technologies Approaches to Preservation Storage Technologies
Approaches to Preservation Storage Technologies Micah Altman
 
June brownbagpressurvey
June brownbagpressurveyJune brownbagpressurvey
June brownbagpressurveyMicah Altman
 
Conducting Expert Reviews Using the VIMM Model
Conducting Expert Reviews Using the VIMM ModelConducting Expert Reviews Using the VIMM Model
Conducting Expert Reviews Using the VIMM ModelMichael Rawlins
 
Experiments on Pattern-based Ontology Design
Experiments on Pattern-based Ontology DesignExperiments on Pattern-based Ontology Design
Experiments on Pattern-based Ontology Designevabl444
 
Ontologies for Crisis Management: A Review of State of the Art in Ontology De...
Ontologies for Crisis Management: A Review of State of the Art in Ontology De...Ontologies for Crisis Management: A Review of State of the Art in Ontology De...
Ontologies for Crisis Management: A Review of State of the Art in Ontology De...streamspotter
 
Experimental, Quasi experimental, Single-Case, and Internet-based Researches ...
Experimental, Quasi experimental, Single-Case, and Internet-based Researches ...Experimental, Quasi experimental, Single-Case, and Internet-based Researches ...
Experimental, Quasi experimental, Single-Case, and Internet-based Researches ...Hatice Çilsalar
 
Modeling Difficulty in Recommender Systems
Modeling Difficulty in Recommender SystemsModeling Difficulty in Recommender Systems
Modeling Difficulty in Recommender Systemskib_83
 

Semelhante a Single-user problems (20)

Failure analysis integrated multi stakeholder mental model and project life c...
Failure analysis integrated multi stakeholder mental model and project life c...Failure analysis integrated multi stakeholder mental model and project life c...
Failure analysis integrated multi stakeholder mental model and project life c...
 
Human Computer Interaction - Heuristic Evaluation
Human Computer Interaction - Heuristic EvaluationHuman Computer Interaction - Heuristic Evaluation
Human Computer Interaction - Heuristic Evaluation
 
Mobile Apps Usability - Intro
Mobile Apps Usability - IntroMobile Apps Usability - Intro
Mobile Apps Usability - Intro
 
2 Studies UX types should know about (Straub UXPA unconference13)
2 Studies UX types should know about (Straub UXPA unconference13)2 Studies UX types should know about (Straub UXPA unconference13)
2 Studies UX types should know about (Straub UXPA unconference13)
 
Do you really need to test with only 5 users
Do you really need to test with only 5 usersDo you really need to test with only 5 users
Do you really need to test with only 5 users
 
The role of systems analysis in co-learning. Walter Rossing
The role of systems analysis in co-learning. Walter RossingThe role of systems analysis in co-learning. Walter Rossing
The role of systems analysis in co-learning. Walter Rossing
 
WQD2011 – INNOVATION – BRONZE WINNER – Tawam Hospital - Innovative Approaches...
WQD2011 – INNOVATION – BRONZE WINNER – Tawam Hospital - Innovative Approaches...WQD2011 – INNOVATION – BRONZE WINNER – Tawam Hospital - Innovative Approaches...
WQD2011 – INNOVATION – BRONZE WINNER – Tawam Hospital - Innovative Approaches...
 
Research methods for socio-technical systems analysis (LSCITS EngD 2012)
Research methods for socio-technical systems analysis (LSCITS EngD 2012)Research methods for socio-technical systems analysis (LSCITS EngD 2012)
Research methods for socio-technical systems analysis (LSCITS EngD 2012)
 
Erfaringer med Remote Usability Testing af Jan Stage, AAU
Erfaringer med Remote Usability Testing af Jan Stage, AAUErfaringer med Remote Usability Testing af Jan Stage, AAU
Erfaringer med Remote Usability Testing af Jan Stage, AAU
 
Bug debug keynote - Present problems and future solutions
Bug debug keynote - Present problems and future solutionsBug debug keynote - Present problems and future solutions
Bug debug keynote - Present problems and future solutions
 
IITSEC Presentation on Learning in Virtual Worlds
IITSEC Presentation on Learning in Virtual WorldsIITSEC Presentation on Learning in Virtual Worlds
IITSEC Presentation on Learning in Virtual Worlds
 
Approaches to Preservation Storage Technologies
Approaches to Preservation Storage Technologies Approaches to Preservation Storage Technologies
Approaches to Preservation Storage Technologies
 
June brownbagpressurvey
June brownbagpressurveyJune brownbagpressurvey
June brownbagpressurvey
 
CSMR06b.ppt
CSMR06b.pptCSMR06b.ppt
CSMR06b.ppt
 
Conducting Expert Reviews Using the VIMM Model
Conducting Expert Reviews Using the VIMM ModelConducting Expert Reviews Using the VIMM Model
Conducting Expert Reviews Using the VIMM Model
 
Calibration of weights in surveys with nonresponse and frame imperfections
Calibration of weights in surveys with nonresponse and frame imperfectionsCalibration of weights in surveys with nonresponse and frame imperfections
Calibration of weights in surveys with nonresponse and frame imperfections
 
Experiments on Pattern-based Ontology Design
Experiments on Pattern-based Ontology DesignExperiments on Pattern-based Ontology Design
Experiments on Pattern-based Ontology Design
 
Ontologies for Crisis Management: A Review of State of the Art in Ontology De...
Ontologies for Crisis Management: A Review of State of the Art in Ontology De...Ontologies for Crisis Management: A Review of State of the Art in Ontology De...
Ontologies for Crisis Management: A Review of State of the Art in Ontology De...
 
Experimental, Quasi experimental, Single-Case, and Internet-based Researches ...
Experimental, Quasi experimental, Single-Case, and Internet-based Researches ...Experimental, Quasi experimental, Single-Case, and Internet-based Researches ...
Experimental, Quasi experimental, Single-Case, and Internet-based Researches ...
 
Modeling Difficulty in Recommender Systems
Modeling Difficulty in Recommender SystemsModeling Difficulty in Recommender Systems
Modeling Difficulty in Recommender Systems
 

Mais de Asbjørn Følstad

Sosiale medier og innovasjon i offentlig sektor web
Sosiale medier og innovasjon i offentlig sektor webSosiale medier og innovasjon i offentlig sektor web
Sosiale medier og innovasjon i offentlig sektor webAsbjørn Følstad
 
Chi2012 analysis in practical usability evaluation web
Chi2012 analysis in practical usability evaluation webChi2012 analysis in practical usability evaluation web
Chi2012 analysis in practical usability evaluation webAsbjørn Følstad
 
Nettet som kanal for brukerinvolvering - erfaringer fra living lab 20120620-web
Nettet som kanal for brukerinvolvering - erfaringer fra living lab 20120620-webNettet som kanal for brukerinvolvering - erfaringer fra living lab 20120620-web
Nettet som kanal for brukerinvolvering - erfaringer fra living lab 20120620-webAsbjørn Følstad
 
Sosiale medier i offentlig sektor
Sosiale medier i offentlig sektorSosiale medier i offentlig sektor
Sosiale medier i offentlig sektorAsbjørn Følstad
 
Usability evaluation in exclusive domains_presentation
Usability evaluation in exclusive domains_presentationUsability evaluation in exclusive domains_presentation
Usability evaluation in exclusive domains_presentationAsbjørn Følstad
 
Usability evaluation in exclusive domains
Usability evaluation in exclusive domainsUsability evaluation in exclusive domains
Usability evaluation in exclusive domainsAsbjørn Følstad
 
The relevance of UX models and measures
The relevance of UX models and measuresThe relevance of UX models and measures
The relevance of UX models and measuresAsbjørn Følstad
 
The relevance of UX models and measures
The relevance of UX models and measuresThe relevance of UX models and measures
The relevance of UX models and measuresAsbjørn Følstad
 
Participatory design 2.0 - brukerinvolvering gjennom sosiale medier
Participatory design 2.0 - brukerinvolvering gjennom sosiale medierParticipatory design 2.0 - brukerinvolvering gjennom sosiale medier
Participatory design 2.0 - brukerinvolvering gjennom sosiale medierAsbjørn Følstad
 
Bedre offentlige tjenester ved å engasjere brukerne i sosiale medier
Bedre offentlige tjenester ved å engasjere brukerne i sosiale medierBedre offentlige tjenester ved å engasjere brukerne i sosiale medier
Bedre offentlige tjenester ved å engasjere brukerne i sosiale medierAsbjørn Følstad
 
Sosial tilbakemelding på design
Sosial tilbakemelding på designSosial tilbakemelding på design
Sosial tilbakemelding på designAsbjørn Følstad
 

Mais de Asbjørn Følstad (13)

Sosiale medier og innovasjon i offentlig sektor web
Sosiale medier og innovasjon i offentlig sektor webSosiale medier og innovasjon i offentlig sektor web
Sosiale medier og innovasjon i offentlig sektor web
 
Chi2012 analysis in practical usability evaluation web
Chi2012 analysis in practical usability evaluation webChi2012 analysis in practical usability evaluation web
Chi2012 analysis in practical usability evaluation web
 
Nettet som kanal for brukerinvolvering - erfaringer fra living lab 20120620-web
Nettet som kanal for brukerinvolvering - erfaringer fra living lab 20120620-webNettet som kanal for brukerinvolvering - erfaringer fra living lab 20120620-web
Nettet som kanal for brukerinvolvering - erfaringer fra living lab 20120620-web
 
Sosiale medier i offentlig sektor
Sosiale medier i offentlig sektorSosiale medier i offentlig sektor
Sosiale medier i offentlig sektor
 
Byråkratrollen i endring
Byråkratrollen i endringByråkratrollen i endring
Byråkratrollen i endring
 
Usability evaluation in exclusive domains_presentation
Usability evaluation in exclusive domains_presentationUsability evaluation in exclusive domains_presentation
Usability evaluation in exclusive domains_presentation
 
Usability evaluation in exclusive domains
Usability evaluation in exclusive domainsUsability evaluation in exclusive domains
Usability evaluation in exclusive domains
 
The relevance of UX models and measures
The relevance of UX models and measuresThe relevance of UX models and measures
The relevance of UX models and measures
 
The relevance of UX models and measures
The relevance of UX models and measuresThe relevance of UX models and measures
The relevance of UX models and measures
 
Participatory design 2.0 - brukerinvolvering gjennom sosiale medier
Participatory design 2.0 - brukerinvolvering gjennom sosiale medierParticipatory design 2.0 - brukerinvolvering gjennom sosiale medier
Participatory design 2.0 - brukerinvolvering gjennom sosiale medier
 
Bedre offentlige tjenester ved å engasjere brukerne i sosiale medier
Bedre offentlige tjenester ved å engasjere brukerne i sosiale medierBedre offentlige tjenester ved å engasjere brukerne i sosiale medier
Bedre offentlige tjenester ved å engasjere brukerne i sosiale medier
 
Sosial tilbakemelding på design
Sosial tilbakemelding på designSosial tilbakemelding på design
Sosial tilbakemelding på design
 
Wud2009 Folstad Til Web
Wud2009 Folstad Til WebWud2009 Folstad Til Web
Wud2009 Folstad Til Web
 

Single-user problems

  • 1. Outliers in usability testing: How to treat usability problems found for only one test participant Asbjørn Følstad, SINTEF Effie Lai-Chong Law, University of Leicester Kasper Hornbæk, University of Copenhagen NordiCHI 2012
  • 2. Content 1 Single-user problems 2 Yes, they are abundant 3 … but how to deal with them? 4 Current practices – straight from the horse's mouth 5 Recommendations 2
  • 3. Single-user problems The problems backed up with data from only a single participant in usability test 3
  • 4. Single-user problems Are they relevant? May be infrequent usability problems - Point estimate: .25 (LaPlace) - 95% conf. int.: .01-.58 (Adj. Wald) Are they valid? May be an artefact of the test situation "there is always a risk of being misled by the spurious behavior of a single person" (Nielsen, 2000 – useit.com) 4
  • 5. Single-user problems are abundant Office system eval. Content 15 participants Nielsen and management Landauer system eval. 77 of 145 (1993) problems single- 17 participants user problems 41 of 88 problems Law and single-user Hvannberg problems (2004) Law, E.L.-C. Hvannberg, E.T. Analysis of Combinatorial User Effect in International Usability Tests. In Proc. CHI '04, ACM Press (2004), 9-16. Nielsen, J., Landauer, T.K. A mathematical model of the finding of usability problems. In Proc. CHI '93, ACM Press (1993), 206-213. 5
  • 6. Advice on how to deal with them is scarce View "unique Short discussion of problems as noise Kjeldskov, Skov single-user problems. rather than real and Stage Recommend to report usability problems" (2004) these as outliers in study of instant data analysis Report single user problems as real problems in a stress- test of problem Woolrych predictions and Cockton (2001) Kjeldskov J., Skov M. B., Stage J. Instant Data Analysis: Evaluating Usability in a Day. In Proc. NordiCHI '04, ACM Press (2004), 233-240. Woolrych, A., Cockton, G. Why and when five test users aren’t enough. In Proc. IHM-HCI 2001, Cépadèus Éditions (2001), 105-108. 6
  • 7. Asking the practitioners for current practices Opportunity: Larger survey on analysis practices in usability evaluation Included question on single-user problems 89 usability practitioners answered this particular question Median 6 yrs. work experience 17 different countries Usability tests with median of 8 user participants 7
  • 8. Potential outcomes for single-user problems 8 accept Participants as divided as the 4 classify as low priority little advice provided in the literature 4 record as outlier 6 reject (22 items total on this theme) 8
  • 9. Relevant conditions when making the call 18 Problem severity 9 Test participants' profile A range of conditions 6 Sample size reported as relevant. But some maybe deserving to be reported more often? 6 Artifact of the test situation? 5 Task importance 5 Other (49 items total on this theme) 9
  • 10. Resources and strategies when making the call 9 Discuss with experts or team members 20 reported to rely on own 9 New/extended evaluations professional knowledge and 8 Check against heuristics / experience. guidelines / principles However, several potential useful resources and […] strategies were reported 3 Specific process or policy 2 Confirmed hypotheses / previous experiences 2 Debrief with users (63 items total on this theme) 10
  • 11. When considering the study findings … How did you handle single-user problems in your latest usability test? 11
  • 12. Recommendations 1 Procedure for handling single-user problems 2 Pay particular attention to sample size 3 Check against knowledge resources – guidelines, heuristics, or previous evaluations 4 Seek advice - from experts or team members 5 Be alert: Artefact of the test situation? 12
  • 13. Thank you 13