SlideShare uma empresa Scribd logo
1 de 26
Baixar para ler offline
iSearch: brugerbehov og
testresultater i integreret søgning


Marianne Lykke, professor, PhD.
Aalborg University
Institut for Kommunikation, eLearning Lab
Dagens emner
•   Projektets formål og problemstilling
•   Testsamlingen
•   Resultater vedr. testsamling
•   Resultater vedr. informationsbehov
•   Resultater vedr. søgeperformance
•   Opsamling og det videre arbejde
Formål og problemstilling


      ‘Why is Google so easy
      and the library so hard?’
       (Claire Duddy - student)



                                  United Kingdom Serials Group
                                  2009 Annual conference
Formål og problemstilling

Haves:
Mange digitale samlinger med forskellige medier, genrer,
  dokumenttyper, metadata og vokabularer

Ønskes:
En søgemaskine, der kan finde de mest relevante resultater
  på tværs af det hele
Formål og problemstilling

Integreret søgning er en mulig løsning:

•   Relevante informationsobjekter høstes
•   Posterne normaliseres til et enkelt metadata format
•   De indekseres i et stort fælles indeks
•   Der tilføjes en Google-like søgeboks og rangering

• (…en slags federated search 2.0)
Formål og problemstilling

• Overordnet handler projektet Værktøjer til integreret
  søgning om at teste og evaluere, hvilke søgealgoritmer
  og indekseringskonfigurationer, der giver de bedste
  søgeresultater ved integreret søgning

• I praksis har det største arbejde været at udvikle og
  etablere et realistisk testmiljø, hvor søge- og
  rangordningsalgoritmer kan evalueres systematisk og
  metodisk korrekt
iSearch testsamling
En testsamling til at evaluere værktøjer til integreret søgning
 kræver som minimum:

• En samling af forskellige dokumenttyper, med forskellige
  dokumentrepræsentationer
• Et sæt søgeopgaver
• Relevansvurderinger i forhold til søgeopgaver

• Formålet er at foretage både realistiske og kontrollerede
  testsøgninger
iSearch testsamling
• Testsamling indenfor fagområdet fysik
• Testsamling bestående af:
   – 18.841 bogposter
   – 291,244 artikelposter, inkl. abstrakts
   – 143,569 fuldtekst artikler
• Varieret sæt af metadata formater og vokabularer
• 65 realistiske informationsbehov
• 4-skala relevansbedømmelser
• Demografisk information om fysikere
• Holdningsinformation om metadata, søgning,
  relevansbedømmelse, tilfredshed med søgeresultat
Informationsbehov
 • 65 søgeopgaver
 • Indsamlet gennem 3 workshops afholdt på DTU, KU og
   AAU
 • 23 fysikere, seniorforskere, phd’ere,
   specialestuderende)
 • Spørgeskema
     • Beskrivelse af informationsbehov ud fra 5
       perspektiver
     • Demografiske data
     • Holdningsdata
 • Op til 200 relevansbedømmelser per opgave
 • 4 skalaer: highly, faily, marginally and not relevant
Informationsbehov
5 perspektiver
Perspective                   Question
a) Current information need   What are you looking for?
b) Work task situation        Why are you looking for this?
                              What is you background
c) Current knowledge state
                              knowledge of this topic?
                              What should am ideal answer
d) Ideal answer
                              contain to solve problem or task?
                              Which central search terms would
e) Adequate search terms      you use to express situation and
                              information need?
iSearch information need No. 49
a) Information Need: Information on characterization by photo luminescence of highly
   doped ZnO films
b) Work Task: For my master thesis I work with characterization of ZnO films by photo
   luminescence. The films are manufactured by RF magnetron sputtering and have
   thicknesses of approximately 100 nm. The films are either intrinsic or doped with Al.
   Green luminescence are of particular interest, but other defect modes are also of
   interest. The aim is to document a simple way of characterizing films in a non intrusive
   manor, and maybe to implement the technique in the production to monitor film growth.
   In particular information on sub band gab excitation is interesting as only a 405 nm
   laser is readily available at the institute
c) Background: I have worked with the topic for a year and a half. We have made
   experiments with photo luminescence and have observed green luminescence. I have
   read quite a lot of review articles on the subject and have been seeking articles with
   comparable parameters
d) Ideal Answer: An article containing examples of luminescence from samples made by
   rf magnetron sputtering. Graphs with photoluminescence data from ZnO films are
   essential. Ideally Al doped ZnO films would be featured in the article
e) Search terms: ZnO, rf magnetron sputtering, photo luminescence, al doped, green
   luminescence
Informationsbehov
Formål

Task purpose             %

Theoretical background   54

Previous results         26

Research methodology     20
Informationsbehov
Søgefacetter
Search facets         Task description questions
                a)    b)      c)     d)    e)       All
Common topic    316   545    310    234    242     1647
Method          47    73     66     37     48      271
Info type       38    26     29     145     5      243
Application      1     7      1      1      1       11
Other           n/a   15      11     5     n/a     31
Informationsbehov
Andre facetter
   • Research groups
   • Specific reference
   • Source
   • Year
   • Location
   • Disciplinary field
   • Audience
Informationsbehov
Antal søgetermer
                    Task description questions (average)
Task purpose       a)     b)     c)     d)     e)     All
Theoretical
                   6.7   10.5   6.1    7.1    4.8    35.2
background
Previous results   5.5   10.1   6.1    5.4    4.7    31.8
Research
                   5.8   9.6    6.9    6.3    3.8    32.4
methodology
All                6.2   10.2   6.4    6.5    4.6    33.8
Relevansbedømmelser


                            No. of tasks
  Range of relevant docs.     N = 65
           > 100                 9
         75 - 100                3
          50 - 74                8
          25 - 49                13
          15 - 24                12
          10 - 14                8
            < 10                 12
Søgeperformance - dokumenttyper
               0,50

               0,45

               0,40
                                                        Alle rel.
               0,35
                                                        rel. Bogposter
               0,30                                     rel. Fuldtekst
                                                        rel. Metadata
   Præcision




               0,25

               0,20

               0,15

               0,10

               0,05

               0,00
                   0,00   0,20   0,40            0,60               0,80   1,00

                                        Recall
Søgeperformance - fusion
               0,50

               0,45

               0,40                                                   Alle rel.
                                                                      rel. Bogposter
               0,35
                                                                      rel. Fuldtekst
               0,30                                                   rel. Metadata
               0,25
  Precisiopn




               0,20

               0,15

               0,10

               0,05

               0,00
                   0,00   0,10   0,20   0,30   0,40    0,50    0,60    0,70   0,80     0,90   1,00

                                                      Recall
Søgeperformance - metadata
              0,50

              0,45

              0,40
                                                               description    fulltext
              0,35

              0,30
                                                               subject        title
              0,25
 Precisiopn




              0,20

              0,15

              0,10

              0,05

              0,00
                  0,00   0,10   0,20   0,30   0,40    0,50    0,60   0,70    0,80     0,90   1,00

                                                     Recall
Relevansbedømmelse - metadata
 80,0
                               Extremely useful   Somewhat useful   Not at all useful

 70,0



 60,0



 50,0



 40,0



 30,0



 20,0



 10,0



  0,0
        Author   Description            Date           Comment           Subjects
Søgeperformance – description
                 0,50

                 0,45

                 0,40                                     Alle rel.
                 0,35                                     rel. Bogposter
                 0,30                                     rel. Fuldtekst

                 0,25
                                                          rel. Metadata
    Precisiopn




                 0,20

                 0,15

                 0,10

                 0,05

                 0,00
                     0,00   0,20   0,40            0,60        0,80        1,00

                                          Recall
Søgeperformance - dokumenttyper
              NDCG for different task purposes and document types
        0,6

        0,5

        0,4
 NDCG




        0,3

        0,2

        0,1

         0
               Previous results   Design methodology Theory & background
                       All document types Book records   knowledge
                       Full text articles   Metadata records
Søgeperformance - søgetermer
                     NDCG for different task lengths
                         and document types
       1,0

       0,8

       0,6
NDCG




       0,4

       0,2

       0,0
             Short length (2-4 terms)        Long length (5-12 terms)
                   All document types   Book records
                   Full text articles   Metadata records
Opsamling og videre arbejde
 • Oprettelse af testsamling til evaluering af integreret
   søgning
 • Realistisk og kontrolleret test setting
 • Informationssøgning indenfor fagområdet Fysik:
    – 3 overordnede typer informationsbehov
    – Muligt at identificere 5 perspektiver på informationsbehov med
      forskelligt antal facetter
    – BT/NT, forkortelser, få andre synonyme variationer
 • Søgeeffektivitet varierer på tværs af
   informationsbehovstyper og dokumentrepræsentationer
 • Fremtidigt arbejde:
    – Analyser af sammenhæng mellem informationsbehovstyper,
      facetter, dokumenttyper, vokabularer og
      dokumentrepræsentationer
    – Kvalitative, forklarende analyser
    – Analyser i ”virkelighedens verden”
Erfaringer
• Hensyntagen til dokumenttyper med hensyn til
  vægtning og fusion

• Analysere informationsbehovstyper, længde og
  struktur
  – Opsætning
  – Interface og brugter-systemdialog
  – Undervisning
Litteratur
Ingwersen, P., Lykke, M., Bogers, T., Larsen, B. & Lund, H. (2010): Assessors'
    Search Result Satisfaction Associated with Relevance in a Scientific Domain.
    In: Belkin, N. J. & Kelly, D. (reds.) IIiX'10 Proceeding of the Third Symposium
    on Information Interaction in Context, New Brunswick, NJ, USA, August 18-21,
    2010.New York: ACM, p. 283-287.
Ingwersen, P., Toine, B. & Lykke, M. (2010). Does degree of work task completion
   influence retrieval performance. In: Proceedings of the 73rd ASIS&T Annual
   Meeting. Grove, A. (red.). 47. American Society for Information Science and
   Technology
Lykke, M., Ingwersen, P., Bogers, T., Lund, H. & Larsen, B. (2010): Physicists'
    Information Tasks: Structure, Length and Retrieval Performance. In: Belkin, N.
    J. & Kelly, D. (reds.) IIiX'10 Proceeding of the Third Symposium on Information
    Interaction in Context, New Brunswick, NJ, USA, August 18-21, 2010.New
    York: ACM, p. 347-351.
Lykke, M., Larsen, B., Lund, H. & Ingwersen, P. (2010): Developing a Test
    Collection for the Evaluation of Integrated Search. In: Gurrin, C. & al. eds.
    Advances in Information Retrieval, 32nd European Conference on IR
    Research, ECIR 2010, Milton Keynes, UK, March 28-31, 2010, Proceedings.
    Berlin: Springer, p. 627-630. (Lecture Notes in Computer Science ; 5993)

Mais conteúdo relacionado

Destaque

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

Destaque (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

iSearch: brugerbehov og testresultater i integreret søgning

  • 1. iSearch: brugerbehov og testresultater i integreret søgning Marianne Lykke, professor, PhD. Aalborg University Institut for Kommunikation, eLearning Lab
  • 2. Dagens emner • Projektets formål og problemstilling • Testsamlingen • Resultater vedr. testsamling • Resultater vedr. informationsbehov • Resultater vedr. søgeperformance • Opsamling og det videre arbejde
  • 3. Formål og problemstilling ‘Why is Google so easy and the library so hard?’ (Claire Duddy - student) United Kingdom Serials Group 2009 Annual conference
  • 4. Formål og problemstilling Haves: Mange digitale samlinger med forskellige medier, genrer, dokumenttyper, metadata og vokabularer Ønskes: En søgemaskine, der kan finde de mest relevante resultater på tværs af det hele
  • 5. Formål og problemstilling Integreret søgning er en mulig løsning: • Relevante informationsobjekter høstes • Posterne normaliseres til et enkelt metadata format • De indekseres i et stort fælles indeks • Der tilføjes en Google-like søgeboks og rangering • (…en slags federated search 2.0)
  • 6. Formål og problemstilling • Overordnet handler projektet Værktøjer til integreret søgning om at teste og evaluere, hvilke søgealgoritmer og indekseringskonfigurationer, der giver de bedste søgeresultater ved integreret søgning • I praksis har det største arbejde været at udvikle og etablere et realistisk testmiljø, hvor søge- og rangordningsalgoritmer kan evalueres systematisk og metodisk korrekt
  • 7. iSearch testsamling En testsamling til at evaluere værktøjer til integreret søgning kræver som minimum: • En samling af forskellige dokumenttyper, med forskellige dokumentrepræsentationer • Et sæt søgeopgaver • Relevansvurderinger i forhold til søgeopgaver • Formålet er at foretage både realistiske og kontrollerede testsøgninger
  • 8. iSearch testsamling • Testsamling indenfor fagområdet fysik • Testsamling bestående af: – 18.841 bogposter – 291,244 artikelposter, inkl. abstrakts – 143,569 fuldtekst artikler • Varieret sæt af metadata formater og vokabularer • 65 realistiske informationsbehov • 4-skala relevansbedømmelser • Demografisk information om fysikere • Holdningsinformation om metadata, søgning, relevansbedømmelse, tilfredshed med søgeresultat
  • 9. Informationsbehov • 65 søgeopgaver • Indsamlet gennem 3 workshops afholdt på DTU, KU og AAU • 23 fysikere, seniorforskere, phd’ere, specialestuderende) • Spørgeskema • Beskrivelse af informationsbehov ud fra 5 perspektiver • Demografiske data • Holdningsdata • Op til 200 relevansbedømmelser per opgave • 4 skalaer: highly, faily, marginally and not relevant
  • 10. Informationsbehov 5 perspektiver Perspective Question a) Current information need What are you looking for? b) Work task situation Why are you looking for this? What is you background c) Current knowledge state knowledge of this topic? What should am ideal answer d) Ideal answer contain to solve problem or task? Which central search terms would e) Adequate search terms you use to express situation and information need?
  • 11. iSearch information need No. 49 a) Information Need: Information on characterization by photo luminescence of highly doped ZnO films b) Work Task: For my master thesis I work with characterization of ZnO films by photo luminescence. The films are manufactured by RF magnetron sputtering and have thicknesses of approximately 100 nm. The films are either intrinsic or doped with Al. Green luminescence are of particular interest, but other defect modes are also of interest. The aim is to document a simple way of characterizing films in a non intrusive manor, and maybe to implement the technique in the production to monitor film growth. In particular information on sub band gab excitation is interesting as only a 405 nm laser is readily available at the institute c) Background: I have worked with the topic for a year and a half. We have made experiments with photo luminescence and have observed green luminescence. I have read quite a lot of review articles on the subject and have been seeking articles with comparable parameters d) Ideal Answer: An article containing examples of luminescence from samples made by rf magnetron sputtering. Graphs with photoluminescence data from ZnO films are essential. Ideally Al doped ZnO films would be featured in the article e) Search terms: ZnO, rf magnetron sputtering, photo luminescence, al doped, green luminescence
  • 12. Informationsbehov Formål Task purpose % Theoretical background 54 Previous results 26 Research methodology 20
  • 13. Informationsbehov Søgefacetter Search facets Task description questions a) b) c) d) e) All Common topic 316 545 310 234 242 1647 Method 47 73 66 37 48 271 Info type 38 26 29 145 5 243 Application 1 7 1 1 1 11 Other n/a 15 11 5 n/a 31
  • 14. Informationsbehov Andre facetter • Research groups • Specific reference • Source • Year • Location • Disciplinary field • Audience
  • 15. Informationsbehov Antal søgetermer Task description questions (average) Task purpose a) b) c) d) e) All Theoretical 6.7 10.5 6.1 7.1 4.8 35.2 background Previous results 5.5 10.1 6.1 5.4 4.7 31.8 Research 5.8 9.6 6.9 6.3 3.8 32.4 methodology All 6.2 10.2 6.4 6.5 4.6 33.8
  • 16. Relevansbedømmelser No. of tasks Range of relevant docs. N = 65 > 100 9 75 - 100 3 50 - 74 8 25 - 49 13 15 - 24 12 10 - 14 8 < 10 12
  • 17. Søgeperformance - dokumenttyper 0,50 0,45 0,40 Alle rel. 0,35 rel. Bogposter 0,30 rel. Fuldtekst rel. Metadata Præcision 0,25 0,20 0,15 0,10 0,05 0,00 0,00 0,20 0,40 0,60 0,80 1,00 Recall
  • 18. Søgeperformance - fusion 0,50 0,45 0,40 Alle rel. rel. Bogposter 0,35 rel. Fuldtekst 0,30 rel. Metadata 0,25 Precisiopn 0,20 0,15 0,10 0,05 0,00 0,00 0,10 0,20 0,30 0,40 0,50 0,60 0,70 0,80 0,90 1,00 Recall
  • 19. Søgeperformance - metadata 0,50 0,45 0,40 description fulltext 0,35 0,30 subject title 0,25 Precisiopn 0,20 0,15 0,10 0,05 0,00 0,00 0,10 0,20 0,30 0,40 0,50 0,60 0,70 0,80 0,90 1,00 Recall
  • 20. Relevansbedømmelse - metadata 80,0 Extremely useful Somewhat useful Not at all useful 70,0 60,0 50,0 40,0 30,0 20,0 10,0 0,0 Author Description Date Comment Subjects
  • 21. Søgeperformance – description 0,50 0,45 0,40 Alle rel. 0,35 rel. Bogposter 0,30 rel. Fuldtekst 0,25 rel. Metadata Precisiopn 0,20 0,15 0,10 0,05 0,00 0,00 0,20 0,40 0,60 0,80 1,00 Recall
  • 22. Søgeperformance - dokumenttyper NDCG for different task purposes and document types 0,6 0,5 0,4 NDCG 0,3 0,2 0,1 0 Previous results Design methodology Theory & background All document types Book records knowledge Full text articles Metadata records
  • 23. Søgeperformance - søgetermer NDCG for different task lengths and document types 1,0 0,8 0,6 NDCG 0,4 0,2 0,0 Short length (2-4 terms) Long length (5-12 terms) All document types Book records Full text articles Metadata records
  • 24. Opsamling og videre arbejde • Oprettelse af testsamling til evaluering af integreret søgning • Realistisk og kontrolleret test setting • Informationssøgning indenfor fagområdet Fysik: – 3 overordnede typer informationsbehov – Muligt at identificere 5 perspektiver på informationsbehov med forskelligt antal facetter – BT/NT, forkortelser, få andre synonyme variationer • Søgeeffektivitet varierer på tværs af informationsbehovstyper og dokumentrepræsentationer • Fremtidigt arbejde: – Analyser af sammenhæng mellem informationsbehovstyper, facetter, dokumenttyper, vokabularer og dokumentrepræsentationer – Kvalitative, forklarende analyser – Analyser i ”virkelighedens verden”
  • 25. Erfaringer • Hensyntagen til dokumenttyper med hensyn til vægtning og fusion • Analysere informationsbehovstyper, længde og struktur – Opsætning – Interface og brugter-systemdialog – Undervisning
  • 26. Litteratur Ingwersen, P., Lykke, M., Bogers, T., Larsen, B. & Lund, H. (2010): Assessors' Search Result Satisfaction Associated with Relevance in a Scientific Domain. In: Belkin, N. J. & Kelly, D. (reds.) IIiX'10 Proceeding of the Third Symposium on Information Interaction in Context, New Brunswick, NJ, USA, August 18-21, 2010.New York: ACM, p. 283-287. Ingwersen, P., Toine, B. & Lykke, M. (2010). Does degree of work task completion influence retrieval performance. In: Proceedings of the 73rd ASIS&T Annual Meeting. Grove, A. (red.). 47. American Society for Information Science and Technology Lykke, M., Ingwersen, P., Bogers, T., Lund, H. & Larsen, B. (2010): Physicists' Information Tasks: Structure, Length and Retrieval Performance. In: Belkin, N. J. & Kelly, D. (reds.) IIiX'10 Proceeding of the Third Symposium on Information Interaction in Context, New Brunswick, NJ, USA, August 18-21, 2010.New York: ACM, p. 347-351. Lykke, M., Larsen, B., Lund, H. & Ingwersen, P. (2010): Developing a Test Collection for the Evaluation of Integrated Search. In: Gurrin, C. & al. eds. Advances in Information Retrieval, 32nd European Conference on IR Research, ECIR 2010, Milton Keynes, UK, March 28-31, 2010, Proceedings. Berlin: Springer, p. 627-630. (Lecture Notes in Computer Science ; 5993)