SlideShare uma empresa Scribd logo
1 de 45
www.polimedia.nl
Building the PoliMedia system;
data- and user-driven
Who are we?
Laura Hollink
• Assistant professor at VU
• Modeling, linking and enrichment
of data
• Data-driven research
• @laurahollink
Max Kemman
• Junior researcher at EUR
• Human-Computer Interaction
• User-driven research
• @MaxJ_K
eHumanities group - PoliMedia 2
PoliMedia team
Henri Beunders (EUR)
Jaap Blom (NISV)
Laura Hollink (VU)
Geert-Jan Houben (TU Delft)
Funded by CLARIN-NL
Damir Juric (TU Delft)
Max Kemman (EUR)
Martijn Kleppe (EUR)
Johan Oomen (NISV)
Linking Politics to Media
eHumanities group - PoliMedia 3
The research questions
• How is a person, subject or process covered & visualised by the media?
• How do debates and arguments develop over a longer period of time?
• Analysing the changing ideas, arguments and presentation in different
media
eHumanities group - PoliMedia 4
eHumanities group - PoliMedia 5
Issues with current approach
eHumanities group - PoliMedia 6
Issues with current approach
Goal: explicit links to different media
types in one system
eHumanities group - PoliMedia 7
PoliMedia system
eHumanities group - PoliMedia 8
PoliMedia
Portal
- Browse:
debate and
date
- Search:
debate and
person
Newspapers
KB
Television
Sound and Vision
Radio
KB
Staten
Generaal
Digitaal
KB
Data-driven (Laura) & user-driven (Max)
Data
eHumanities group - PoliMedia 9
Debate data
Handelingen der Staten-General or Dutch Hansard
from 1945-1995
Some provenance:
1. Transcripts are made of the complete debates of the Dutch
parliament.
2. Published online by the government on
http://www.statengeneraaldigitaal.nl/ (1818 1995) and
http://officielebekendmakingen.nl/ (from 1995)
3. PoliticalMashup project has translated government pdf and
txt files into XML, incl URI’s as identifiers, see
http://politicalmashup.nl/
4. We build on that.
eHumanities group - PoliMedia 11
Debate
Metadata
Topic 1
Topic 2
Speaker 1 / Content
Speaker 2 / Content
Speaker 3 / Content
Speaker 1 / Content
Structureof the
debate data
Including:
• who, when, what
• identifiers for subparts
of the debate
• chronological order of
speakers
Media data
• Newspaper articles
– at the National Library of the
Netherlands
– Many newspapers 1950- 1995
– Text + images of newspaper
layout
• Radio bulletins
– Transcripts of ANP news
• Newscasts
– in the Academia collection of the
Netherlands institute for Sound
and Vision
Semantic model
nl.proc.sgd.d.
194519460000002
nl.proc.sgd.d.
194519460000002.1
PartOfDebateDebate
http://resolver.politicalmashup.nl/nl.proc.sgd.d.194519460000002
http://statengeneraaldigitaal.nl/
http://resolver.kb.nl/resolve?urn=sgd:mpeg21:19451946:0000002:pdf
nl.proc.sgd.d.19720000002
Handelingen Verenigde
Vergadering...
Dutch
1945-11-20
rdf:type
dc:id
dc:source
dc:source
dc:publisher
dc:language
dc:date
hasPart
rdf:type
nl.proc.sgd.d.
194519460000002.1.1
hasPart
DebateContext
rdf:type
nl.proc.sgd.d.
194519460000002.1.2
Speech
rdf:type
hasPart
nl.proc.sgd.d.
194519460000002.1.3
hasSubsequentSpeech
"Mijnheer de
Voorzitter, de
Commissie
van …"
hasSpokenText
sem:hasActor
"De voorzitter
opent de
vergadering…"
hasText
http://resolver.kb.nl/resolve?urn=ddd:011198136:mpeg21:a0525:ocr
coveredIn
nl.proc.sgd.d.
194519460000002.2
hasSubsequentPartOfDebate
Semantic model
sem:hasActor
Speaker_0006
4
Party_kvp
hasParty
hasSpeaker
member_of
_parliament
Party
KVP
Katholieke Volkspartij
rdf:type
hasAcronym
hasFullName
Joannes Antonius James
Bargefoaf:firstName
foaf:lastName
Barge
rdfs:label
http://resolver.politicalmashup.nl/nl.m.00064
dc:source
Politician
rdf:type
hasRole
Reuse of vocabularies:
Simple Event Model (SEM),
Dublin Core, FOAF, links to
ISOCAT data categories.
Linked Data
eHumanities group - PoliMedia 15
• Data openly accessible in a semantic Web standard
• Easy to combine with other semantic Web data
• E.g. DBpedia data on politicians and parties.
Linking Debates to Newspaper
articles that cover them
• Challenges:
– How to link documents that are so different in
nature?
– Can we use the structure of the debates: people,
chronologic order of speeches, introductions to
each new topic, etc.
– How can we do this efficiently, using the access
mechanisms of the archives?
eHumanities group - PoliMedia 16
Linking approach
eHumanities group - PoliMedia 17
Detect
topics in
speeches
Create
queries
Search
newspaper
archive
Topics
Named
Entities
Name of
speaker
Detect
Named
Entities in
speeches
Candidate
articles
Queries
Rank
candidate
articles
Links
between
speeches
and articles
Debates
Date of
debate
Detect topics
The MALLET topic model package
• Unsupervised analysis of text
• “a Topic consists of a cluster of words that frequently occur together”
• [see http://mallet.cs.umass.edu/topics.php]
• Input:
– Text
– Number of iterations
– Number of topics/clusters
• Output:
– Words that cluster around one topic.
• Example:
– Text: a speech in a debate from 1975
– number of iterations: 2000
– number of topics: 1
Create Queries
eHumanities group - PoliMedia 19
Named
Entities from
the speech
Named
Entities from
the debate
intro
Topics from
the speech
Topics from
the debate
intro
Name of
speaker Date of debate
Named
Entities from
the speech
Named
Entities from
the debate
intro
Topics from
the speech
Topics from
the debate
intro
Evaluation
• Experiment 1: NEs in speech
• Experiment 2: NEs + topics in speech
• Experiment 3: NEs + topics in speech and debate
eHumanities group - PoliMedia 20
Results
• A linked open data set of Dutch parliamentary
debates.
• With links to URL’s of news paper articles and
radio bulletins at the Royal Library.
• A system that supports researchers in finding
the data to answer their questions.
eHumanities group - PoliMedia 21
User-driven
What do scholars want?
• Why user research?
• Understanding the user [1, 2]
– Acceptance
– Performance
– Capabilities
– Weaknesses
• Goal
– Creating a system that is intuitive and helpful to the users
[1] Y. Liu, A. Osvalder, and M. Karlsson, “Considering the importance of user profiles in
interface design,” no. May, 2010
[2] J. Preece, Y. Rogers, and H. Sharp, “Interaction Design: Beyond Human-Computer
Interaction,” Design, vol. 18, no. 1, pp. 68-68, 2002
eHumanities group - PoliMedia 22
User research in the development
process
• Examine search behaviour of users
– Survey regarding search strategies
– Interviews
• User wishes → user requirements
• Wireframes → Prototype
• Evaluation →New prioritization of remaining
user requirements
• Final version
eHumanities group - PoliMedia 23
Survey
General search strategies
• N=294
• Popular search engines
Very often
Often
Regularly
Sometimes
Never
Don’t
know it
Google
GoogleImages
GoogleScholar
YouTube
JSTOR
KB
Flickr
EBSCO
NationaalArchief
WebofKnowledge
UitzendingGemist
Yahoo!
Bing
Academia.nl
Europeana
Scopus
MicrosoftAcademicSearch
EUscreen
Arkyves
24
Survey
General search strategies
1. Keywords 4,75
2. Advanced search 3,36
3. Related terms 2,52
4. Boolean 2,42
5. Browsing subject
categories 2,29
6. Filters 2,19
7. Thesaurus 1,87
8. Visualization 1,22
eHumanities group - PoliMedia 25
Survey
Conclusions
• Google is the dominant search engine
• This has two consequences
1. People compare other search systems to their
experience with Google
2. The search task is mainly performed by using
keywords
eHumanities group - PoliMedia 26
Interviews
• N=5
• Quantitative (n=2) as well as qualitative (n=4)
• Main themes
– How do people search currently?
– What could be improved about current search systems?
– What should PoliMedia offer, given its goals?
• Results
– 39 user wishes
– Prioritized internally
• 19 user wishes deemed out of scope
• 20 user requirements
eHumanities group - PoliMedia 27
Interviews
Findings
• Key issue is to provide a good overview of data
– Why are search results retrieved
– How are search results ranked
• Assumptions of relevance
– Higher frequency of keywords indicated higher relevancy to
query?
– Longer segments (speeches and articles) indicate higher
importance?
• Many more or less out-of-scope wishes to make current
research easier
– Sentiment-metadata
– Context metadata
– Ability to export to own software
eHumanities group - PoliMedia 28
• Clear and
immediate
keyword-search
• Support for
Booleans and
(some) Google-
search operators
• Separate
advanced-search
eHumanities group - PoliMedia 29
Wireframes
Search interface
Wireframes
Search results
• Keyword search
remains
prominent
• User chosen
ranking of results
• Keyword
highlighting
• Overview of
related media
• Support for
filtering
eHumanities group - PoliMedia 30
Wireframes
Debate page
• Keyword search
remains
prominent
• Overview of
people in debate
• Easy access to
related material
31eHumanities group - PoliMedia
Prototype v1.0
eHumanities group - PoliMedia 32
Evaluation
• Eye tracking evaluation of the search system
– Search system was still in development
• N=24
– History
– Political communication
• Goals
– Gain understanding of distribution of attention
– Collect general feedback on interface
eHumanities group - PoliMedia 33
Evaluation
Eye tracking
• Viewing Duration
• Search bar received little attention after
search results were displayed
• Facets received a lot of attention
• Page-search (CTRL+F) mainly received
attention on debate page view
eHumanities group - PoliMedia 34
Tasks Search bar Facets Search results Page-search
Known Item 17% 22% 60% 2%
Exploratory 6% 12% 80% 2%
Evaluation
Usability feedback
• The ranking of search results was an issue for
users
• The year-filter should be a slider
• The debate page should be greatly improved
– Better identification for speaker, party, topic,
relevance to query
– Provide filters on debate-page as well
eHumanities group - PoliMedia 35
Prototype v2.0
eHumanities group - PoliMedia 36
Prototype v2.0 - query
eHumanities group - PoliMedia 37
Prototype v2.0 – filter speaker
eHumanities group - PoliMedia 38
Prototype v2.0 - filter role
eHumanities group - PoliMedia 39
Prototype v2.0 - debate
eHumanities group - PoliMedia 40
Prototype v2.0 - highlight speech
eHumanities group - PoliMedia 41
Prototype v2.0 - link newspaper
eHumanities group - PoliMedia 42
Prototype v2.0 - newspaper
eHumanities group - PoliMedia 43
Prototype v2.0 - link radio
eHumanities group - PoliMedia 44
Conclusion
• PoliMedia; data- or user-driven?
• Continuous interplay
– Users gave input for usefulness of links
– Data limits what features we can offer to users
• Collection quality and usability are both critical to
users [3]
[3] Xie, I. (2006). Evaluation of digital libraries: Criteria and problems from users’
perspectives. Library & Information Science Research, 28(3), 433–452.
doi:10.1016/j.lisr.2006.06.002
eHumanities group - PoliMedia 45

Mais conteúdo relacionado

Destaque

PoliMedia presentation NOTaS meeting
PoliMedia presentation NOTaS meetingPoliMedia presentation NOTaS meeting
PoliMedia presentation NOTaS meetingMaxKemman
 
User research in the development of PoliMedia
User research in the development of PoliMediaUser research in the development of PoliMedia
User research in the development of PoliMediaMaxKemman
 
Tracking online user behaviour with a multimethod research design
Tracking online user behaviour with a multimethod research designTracking online user behaviour with a multimethod research design
Tracking online user behaviour with a multimethod research designMartijn Kleppe
 
User Required? On the Value of User Research in the Digital Humanities
User Required? On the Value of User Research in the Digital HumanitiesUser Required? On the Value of User Research in the Digital Humanities
User Required? On the Value of User Research in the Digital HumanitiesMaxKemman
 
Talking With Scholars - Developing a Research Environment for Oral History Co...
Talking With Scholars - Developing a Research Environment for Oral History Co...Talking With Scholars - Developing a Research Environment for Oral History Co...
Talking With Scholars - Developing a Research Environment for Oral History Co...MaxKemman
 
PoliMedia - Analysing Mediacoverage of political debates in newspapers, radio...
PoliMedia - Analysing Mediacoverage of political debates in newspapers, radio...PoliMedia - Analysing Mediacoverage of political debates in newspapers, radio...
PoliMedia - Analysing Mediacoverage of political debates in newspapers, radio...Martijn Kleppe
 
West Coast Forum 2010 Kick Off Presentation
West Coast Forum 2010 Kick Off PresentationWest Coast Forum 2010 Kick Off Presentation
West Coast Forum 2010 Kick Off PresentationRaghu Thricovil
 
Big Science isn't just for physics - PoliMedia - Automatically Linking Politi...
Big Science isn't just for physics - PoliMedia - Automatically Linking Politi...Big Science isn't just for physics - PoliMedia - Automatically Linking Politi...
Big Science isn't just for physics - PoliMedia - Automatically Linking Politi...Martijn Kleppe
 
Analyzing Published and Consumed Digital & Digitized News
Analyzing Published and Consumed Digital & Digitized NewsAnalyzing Published and Consumed Digital & Digitized News
Analyzing Published and Consumed Digital & Digitized NewsMartijn Kleppe
 
Polimedia kick-off presentation
Polimedia kick-off presentationPolimedia kick-off presentation
Polimedia kick-off presentationMaxKemman
 
Raleigh Budget Kick-Off Presentation
Raleigh Budget Kick-Off PresentationRaleigh Budget Kick-Off Presentation
Raleigh Budget Kick-Off PresentationPublicFinanceTV
 
TYPO3 4.5 Kick-Off Presentation #t3dd10
TYPO3 4.5 Kick-Off Presentation #t3dd10TYPO3 4.5 Kick-Off Presentation #t3dd10
TYPO3 4.5 Kick-Off Presentation #t3dd10Ernesto Baschny
 

Destaque (12)

PoliMedia presentation NOTaS meeting
PoliMedia presentation NOTaS meetingPoliMedia presentation NOTaS meeting
PoliMedia presentation NOTaS meeting
 
User research in the development of PoliMedia
User research in the development of PoliMediaUser research in the development of PoliMedia
User research in the development of PoliMedia
 
Tracking online user behaviour with a multimethod research design
Tracking online user behaviour with a multimethod research designTracking online user behaviour with a multimethod research design
Tracking online user behaviour with a multimethod research design
 
User Required? On the Value of User Research in the Digital Humanities
User Required? On the Value of User Research in the Digital HumanitiesUser Required? On the Value of User Research in the Digital Humanities
User Required? On the Value of User Research in the Digital Humanities
 
Talking With Scholars - Developing a Research Environment for Oral History Co...
Talking With Scholars - Developing a Research Environment for Oral History Co...Talking With Scholars - Developing a Research Environment for Oral History Co...
Talking With Scholars - Developing a Research Environment for Oral History Co...
 
PoliMedia - Analysing Mediacoverage of political debates in newspapers, radio...
PoliMedia - Analysing Mediacoverage of political debates in newspapers, radio...PoliMedia - Analysing Mediacoverage of political debates in newspapers, radio...
PoliMedia - Analysing Mediacoverage of political debates in newspapers, radio...
 
West Coast Forum 2010 Kick Off Presentation
West Coast Forum 2010 Kick Off PresentationWest Coast Forum 2010 Kick Off Presentation
West Coast Forum 2010 Kick Off Presentation
 
Big Science isn't just for physics - PoliMedia - Automatically Linking Politi...
Big Science isn't just for physics - PoliMedia - Automatically Linking Politi...Big Science isn't just for physics - PoliMedia - Automatically Linking Politi...
Big Science isn't just for physics - PoliMedia - Automatically Linking Politi...
 
Analyzing Published and Consumed Digital & Digitized News
Analyzing Published and Consumed Digital & Digitized NewsAnalyzing Published and Consumed Digital & Digitized News
Analyzing Published and Consumed Digital & Digitized News
 
Polimedia kick-off presentation
Polimedia kick-off presentationPolimedia kick-off presentation
Polimedia kick-off presentation
 
Raleigh Budget Kick-Off Presentation
Raleigh Budget Kick-Off PresentationRaleigh Budget Kick-Off Presentation
Raleigh Budget Kick-Off Presentation
 
TYPO3 4.5 Kick-Off Presentation #t3dd10
TYPO3 4.5 Kick-Off Presentation #t3dd10TYPO3 4.5 Kick-Off Presentation #t3dd10
TYPO3 4.5 Kick-Off Presentation #t3dd10
 

Semelhante a Building the PoliMedia search system; data- and user-driven

Sense4us PACITA event presentation
Sense4us PACITA event presentationSense4us PACITA event presentation
Sense4us PACITA event presentationSENSE4US project
 
Global Futures Intelligence System talk at WFSF 2013
Global Futures Intelligence System talk at WFSF 2013Global Futures Intelligence System talk at WFSF 2013
Global Futures Intelligence System talk at WFSF 2013Jerome Glenn
 
E-challenges11 WeGov Workshop
E-challenges11 WeGov WorkshopE-challenges11 WeGov Workshop
E-challenges11 WeGov WorkshopWeGov project
 
Methods and techniques-comm (3/6)
Methods and techniques-comm  (3/6)Methods and techniques-comm  (3/6)
Methods and techniques-comm (3/6)Roberta Cuel
 
Methods and techniques (2/6)
Methods and techniques (2/6)Methods and techniques (2/6)
Methods and techniques (2/6)Roberta Cuel
 
WeGov Analysis Tools to connect Policy Makers with Citizens Online
WeGov Analysis Tools to connect Policy Makers with Citizens OnlineWeGov Analysis Tools to connect Policy Makers with Citizens Online
WeGov Analysis Tools to connect Policy Makers with Citizens OnlineTimo Wandhoefer
 
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slidesMining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slidesMichael Mathioudakis
 
New Perspectives on Social Media: Putting Our ‘Known Unknowns’ on the Map
New Perspectives on Social Media: Putting Our ‘Known Unknowns’ on the MapNew Perspectives on Social Media: Putting Our ‘Known Unknowns’ on the Map
New Perspectives on Social Media: Putting Our ‘Known Unknowns’ on the MapAxel Bruns
 
Passive expert - sourcing, for policy making in the EU
Passive expert - sourcing,  for policy making in the EUPassive expert - sourcing,  for policy making in the EU
Passive expert - sourcing, for policy making in the EUYannis Charalabidis
 
Innovative approaches to analyses of online social networks
Innovative approaches to analyses of online social networksInnovative approaches to analyses of online social networks
Innovative approaches to analyses of online social networksJakob Jensen
 
Lessons Learned from a Digital Tool Criticism Workshop
Lessons Learned from a Digital Tool Criticism WorkshopLessons Learned from a Digital Tool Criticism Workshop
Lessons Learned from a Digital Tool Criticism WorkshopMarijn Koolen
 
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...Timo Wandhoefer
 
Classifying intangible social innovation concepts using machine learning and ...
Classifying intangible social innovation concepts using machine learning and ...Classifying intangible social innovation concepts using machine learning and ...
Classifying intangible social innovation concepts using machine learning and ...Nikola Milosevic
 
Research uptake and digital communications
Research uptake and digital communicationsResearch uptake and digital communications
Research uptake and digital communicationsresyst
 
Supporting user driven innovation activities in a crowdsourcing community
Supporting user driven innovation activities in a crowdsourcing communitySupporting user driven innovation activities in a crowdsourcing community
Supporting user driven innovation activities in a crowdsourcing communityMiia Kosonen
 
Using open datasets for research purposes
Using open datasets for research purposesUsing open datasets for research purposes
Using open datasets for research purposesMartijn Kleppe
 
Mediaspaces: Life After Convergence / Presentation at EBU Multimedia Forum 5....
Mediaspaces: Life After Convergence / Presentation at EBU Multimedia Forum 5....Mediaspaces: Life After Convergence / Presentation at EBU Multimedia Forum 5....
Mediaspaces: Life After Convergence / Presentation at EBU Multimedia Forum 5....Kari-Hans Kommonen
 
A hands-on approach to digital tool criticism: Tools for (self-)reflection
A hands-on approach to digital tool criticism: Tools for (self-)reflectionA hands-on approach to digital tool criticism: Tools for (self-)reflection
A hands-on approach to digital tool criticism: Tools for (self-)reflectionMarijn Koolen
 

Semelhante a Building the PoliMedia search system; data- and user-driven (20)

Sense4us PACITA event presentation
Sense4us PACITA event presentationSense4us PACITA event presentation
Sense4us PACITA event presentation
 
Global Futures Intelligence System talk at WFSF 2013
Global Futures Intelligence System talk at WFSF 2013Global Futures Intelligence System talk at WFSF 2013
Global Futures Intelligence System talk at WFSF 2013
 
E-challenges11 WeGov Workshop
E-challenges11 WeGov WorkshopE-challenges11 WeGov Workshop
E-challenges11 WeGov Workshop
 
Methods and techniques-comm (3/6)
Methods and techniques-comm  (3/6)Methods and techniques-comm  (3/6)
Methods and techniques-comm (3/6)
 
Methods and techniques (2/6)
Methods and techniques (2/6)Methods and techniques (2/6)
Methods and techniques (2/6)
 
WeGov Analysis Tools to connect Policy Makers with Citizens Online
WeGov Analysis Tools to connect Policy Makers with Citizens OnlineWeGov Analysis Tools to connect Policy Makers with Citizens Online
WeGov Analysis Tools to connect Policy Makers with Citizens Online
 
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slidesMining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
 
New Perspectives on Social Media: Putting Our ‘Known Unknowns’ on the Map
New Perspectives on Social Media: Putting Our ‘Known Unknowns’ on the MapNew Perspectives on Social Media: Putting Our ‘Known Unknowns’ on the Map
New Perspectives on Social Media: Putting Our ‘Known Unknowns’ on the Map
 
Passive expert - sourcing, for policy making in the EU
Passive expert - sourcing,  for policy making in the EUPassive expert - sourcing,  for policy making in the EU
Passive expert - sourcing, for policy making in the EU
 
Innovative approaches to analyses of online social networks
Innovative approaches to analyses of online social networksInnovative approaches to analyses of online social networks
Innovative approaches to analyses of online social networks
 
'Using digital technology to engage with the community', by Deb Rawlings and ...
'Using digital technology to engage with the community', by Deb Rawlings and ...'Using digital technology to engage with the community', by Deb Rawlings and ...
'Using digital technology to engage with the community', by Deb Rawlings and ...
 
Social Media for Research CAURA 2013
Social Media for Research CAURA 2013Social Media for Research CAURA 2013
Social Media for Research CAURA 2013
 
Lessons Learned from a Digital Tool Criticism Workshop
Lessons Learned from a Digital Tool Criticism WorkshopLessons Learned from a Digital Tool Criticism Workshop
Lessons Learned from a Digital Tool Criticism Workshop
 
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
Online Forums vs. Social Networks: Two Case Studies to support eGovernment wi...
 
Classifying intangible social innovation concepts using machine learning and ...
Classifying intangible social innovation concepts using machine learning and ...Classifying intangible social innovation concepts using machine learning and ...
Classifying intangible social innovation concepts using machine learning and ...
 
Research uptake and digital communications
Research uptake and digital communicationsResearch uptake and digital communications
Research uptake and digital communications
 
Supporting user driven innovation activities in a crowdsourcing community
Supporting user driven innovation activities in a crowdsourcing communitySupporting user driven innovation activities in a crowdsourcing community
Supporting user driven innovation activities in a crowdsourcing community
 
Using open datasets for research purposes
Using open datasets for research purposesUsing open datasets for research purposes
Using open datasets for research purposes
 
Mediaspaces: Life After Convergence / Presentation at EBU Multimedia Forum 5....
Mediaspaces: Life After Convergence / Presentation at EBU Multimedia Forum 5....Mediaspaces: Life After Convergence / Presentation at EBU Multimedia Forum 5....
Mediaspaces: Life After Convergence / Presentation at EBU Multimedia Forum 5....
 
A hands-on approach to digital tool criticism: Tools for (self-)reflection
A hands-on approach to digital tool criticism: Tools for (self-)reflectionA hands-on approach to digital tool criticism: Tools for (self-)reflection
A hands-on approach to digital tool criticism: Tools for (self-)reflection
 

Mais de MaxKemman

Boundary practices in digital humanities
Boundary practices in digital humanitiesBoundary practices in digital humanities
Boundary practices in digital humanitiesMaxKemman
 
Infrastructure As Afterthought
Infrastructure As AfterthoughtInfrastructure As Afterthought
Infrastructure As AfterthoughtMaxKemman
 
Interdisciplinary Ignorance
Interdisciplinary IgnoranceInterdisciplinary Ignorance
Interdisciplinary IgnoranceMaxKemman
 
Digital History Projects as Boundary Objects
Digital History Projects as Boundary ObjectsDigital History Projects as Boundary Objects
Digital History Projects as Boundary ObjectsMaxKemman
 
Digital History Projects as Boundary Objects
Digital History Projects as Boundary ObjectsDigital History Projects as Boundary Objects
Digital History Projects as Boundary ObjectsMaxKemman
 
Too Many Varied User Requirements for Digital Humanities Projects
Too Many Varied User Requirements for Digital Humanities ProjectsToo Many Varied User Requirements for Digital Humanities Projects
Too Many Varied User Requirements for Digital Humanities ProjectsMaxKemman
 
Oral History Today
Oral History TodayOral History Today
Oral History TodayMaxKemman
 
Dutch Journalism in the Digital Age
Dutch Journalism in the Digital AgeDutch Journalism in the Digital Age
Dutch Journalism in the Digital AgeMaxKemman
 
User research for the development of search systems
User research for the development of search systemsUser research for the development of search systems
User research for the development of search systemsMaxKemman
 
Mapping the use of digital sources amongst Humanities scholars in the Netherl...
Mapping the use of digital sources amongst Humanities scholars in the Netherl...Mapping the use of digital sources amongst Humanities scholars in the Netherl...
Mapping the use of digital sources amongst Humanities scholars in the Netherl...MaxKemman
 

Mais de MaxKemman (10)

Boundary practices in digital humanities
Boundary practices in digital humanitiesBoundary practices in digital humanities
Boundary practices in digital humanities
 
Infrastructure As Afterthought
Infrastructure As AfterthoughtInfrastructure As Afterthought
Infrastructure As Afterthought
 
Interdisciplinary Ignorance
Interdisciplinary IgnoranceInterdisciplinary Ignorance
Interdisciplinary Ignorance
 
Digital History Projects as Boundary Objects
Digital History Projects as Boundary ObjectsDigital History Projects as Boundary Objects
Digital History Projects as Boundary Objects
 
Digital History Projects as Boundary Objects
Digital History Projects as Boundary ObjectsDigital History Projects as Boundary Objects
Digital History Projects as Boundary Objects
 
Too Many Varied User Requirements for Digital Humanities Projects
Too Many Varied User Requirements for Digital Humanities ProjectsToo Many Varied User Requirements for Digital Humanities Projects
Too Many Varied User Requirements for Digital Humanities Projects
 
Oral History Today
Oral History TodayOral History Today
Oral History Today
 
Dutch Journalism in the Digital Age
Dutch Journalism in the Digital AgeDutch Journalism in the Digital Age
Dutch Journalism in the Digital Age
 
User research for the development of search systems
User research for the development of search systemsUser research for the development of search systems
User research for the development of search systems
 
Mapping the use of digital sources amongst Humanities scholars in the Netherl...
Mapping the use of digital sources amongst Humanities scholars in the Netherl...Mapping the use of digital sources amongst Humanities scholars in the Netherl...
Mapping the use of digital sources amongst Humanities scholars in the Netherl...
 

Último

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 

Último (20)

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Building the PoliMedia search system; data- and user-driven

  • 1. www.polimedia.nl Building the PoliMedia system; data- and user-driven
  • 2. Who are we? Laura Hollink • Assistant professor at VU • Modeling, linking and enrichment of data • Data-driven research • @laurahollink Max Kemman • Junior researcher at EUR • Human-Computer Interaction • User-driven research • @MaxJ_K eHumanities group - PoliMedia 2 PoliMedia team Henri Beunders (EUR) Jaap Blom (NISV) Laura Hollink (VU) Geert-Jan Houben (TU Delft) Funded by CLARIN-NL Damir Juric (TU Delft) Max Kemman (EUR) Martijn Kleppe (EUR) Johan Oomen (NISV)
  • 3. Linking Politics to Media eHumanities group - PoliMedia 3
  • 4. The research questions • How is a person, subject or process covered & visualised by the media? • How do debates and arguments develop over a longer period of time? • Analysing the changing ideas, arguments and presentation in different media eHumanities group - PoliMedia 4
  • 5. eHumanities group - PoliMedia 5 Issues with current approach
  • 6. eHumanities group - PoliMedia 6 Issues with current approach
  • 7. Goal: explicit links to different media types in one system eHumanities group - PoliMedia 7
  • 8. PoliMedia system eHumanities group - PoliMedia 8 PoliMedia Portal - Browse: debate and date - Search: debate and person Newspapers KB Television Sound and Vision Radio KB Staten Generaal Digitaal KB Data-driven (Laura) & user-driven (Max)
  • 10. Debate data Handelingen der Staten-General or Dutch Hansard from 1945-1995 Some provenance: 1. Transcripts are made of the complete debates of the Dutch parliament. 2. Published online by the government on http://www.statengeneraaldigitaal.nl/ (1818 1995) and http://officielebekendmakingen.nl/ (from 1995) 3. PoliticalMashup project has translated government pdf and txt files into XML, incl URI’s as identifiers, see http://politicalmashup.nl/ 4. We build on that.
  • 11. eHumanities group - PoliMedia 11 Debate Metadata Topic 1 Topic 2 Speaker 1 / Content Speaker 2 / Content Speaker 3 / Content Speaker 1 / Content Structureof the debate data Including: • who, when, what • identifiers for subparts of the debate • chronological order of speakers
  • 12. Media data • Newspaper articles – at the National Library of the Netherlands – Many newspapers 1950- 1995 – Text + images of newspaper layout • Radio bulletins – Transcripts of ANP news • Newscasts – in the Academia collection of the Netherlands institute for Sound and Vision
  • 13. Semantic model nl.proc.sgd.d. 194519460000002 nl.proc.sgd.d. 194519460000002.1 PartOfDebateDebate http://resolver.politicalmashup.nl/nl.proc.sgd.d.194519460000002 http://statengeneraaldigitaal.nl/ http://resolver.kb.nl/resolve?urn=sgd:mpeg21:19451946:0000002:pdf nl.proc.sgd.d.19720000002 Handelingen Verenigde Vergadering... Dutch 1945-11-20 rdf:type dc:id dc:source dc:source dc:publisher dc:language dc:date hasPart rdf:type nl.proc.sgd.d. 194519460000002.1.1 hasPart DebateContext rdf:type nl.proc.sgd.d. 194519460000002.1.2 Speech rdf:type hasPart nl.proc.sgd.d. 194519460000002.1.3 hasSubsequentSpeech "Mijnheer de Voorzitter, de Commissie van …" hasSpokenText sem:hasActor "De voorzitter opent de vergadering…" hasText http://resolver.kb.nl/resolve?urn=ddd:011198136:mpeg21:a0525:ocr coveredIn nl.proc.sgd.d. 194519460000002.2 hasSubsequentPartOfDebate
  • 14. Semantic model sem:hasActor Speaker_0006 4 Party_kvp hasParty hasSpeaker member_of _parliament Party KVP Katholieke Volkspartij rdf:type hasAcronym hasFullName Joannes Antonius James Bargefoaf:firstName foaf:lastName Barge rdfs:label http://resolver.politicalmashup.nl/nl.m.00064 dc:source Politician rdf:type hasRole Reuse of vocabularies: Simple Event Model (SEM), Dublin Core, FOAF, links to ISOCAT data categories.
  • 15. Linked Data eHumanities group - PoliMedia 15 • Data openly accessible in a semantic Web standard • Easy to combine with other semantic Web data • E.g. DBpedia data on politicians and parties.
  • 16. Linking Debates to Newspaper articles that cover them • Challenges: – How to link documents that are so different in nature? – Can we use the structure of the debates: people, chronologic order of speeches, introductions to each new topic, etc. – How can we do this efficiently, using the access mechanisms of the archives? eHumanities group - PoliMedia 16
  • 17. Linking approach eHumanities group - PoliMedia 17 Detect topics in speeches Create queries Search newspaper archive Topics Named Entities Name of speaker Detect Named Entities in speeches Candidate articles Queries Rank candidate articles Links between speeches and articles Debates Date of debate
  • 18. Detect topics The MALLET topic model package • Unsupervised analysis of text • “a Topic consists of a cluster of words that frequently occur together” • [see http://mallet.cs.umass.edu/topics.php] • Input: – Text – Number of iterations – Number of topics/clusters • Output: – Words that cluster around one topic. • Example: – Text: a speech in a debate from 1975 – number of iterations: 2000 – number of topics: 1
  • 19. Create Queries eHumanities group - PoliMedia 19 Named Entities from the speech Named Entities from the debate intro Topics from the speech Topics from the debate intro Name of speaker Date of debate Named Entities from the speech Named Entities from the debate intro Topics from the speech Topics from the debate intro
  • 20. Evaluation • Experiment 1: NEs in speech • Experiment 2: NEs + topics in speech • Experiment 3: NEs + topics in speech and debate eHumanities group - PoliMedia 20
  • 21. Results • A linked open data set of Dutch parliamentary debates. • With links to URL’s of news paper articles and radio bulletins at the Royal Library. • A system that supports researchers in finding the data to answer their questions. eHumanities group - PoliMedia 21
  • 22. User-driven What do scholars want? • Why user research? • Understanding the user [1, 2] – Acceptance – Performance – Capabilities – Weaknesses • Goal – Creating a system that is intuitive and helpful to the users [1] Y. Liu, A. Osvalder, and M. Karlsson, “Considering the importance of user profiles in interface design,” no. May, 2010 [2] J. Preece, Y. Rogers, and H. Sharp, “Interaction Design: Beyond Human-Computer Interaction,” Design, vol. 18, no. 1, pp. 68-68, 2002 eHumanities group - PoliMedia 22
  • 23. User research in the development process • Examine search behaviour of users – Survey regarding search strategies – Interviews • User wishes → user requirements • Wireframes → Prototype • Evaluation →New prioritization of remaining user requirements • Final version eHumanities group - PoliMedia 23
  • 24. Survey General search strategies • N=294 • Popular search engines Very often Often Regularly Sometimes Never Don’t know it Google GoogleImages GoogleScholar YouTube JSTOR KB Flickr EBSCO NationaalArchief WebofKnowledge UitzendingGemist Yahoo! Bing Academia.nl Europeana Scopus MicrosoftAcademicSearch EUscreen Arkyves 24
  • 25. Survey General search strategies 1. Keywords 4,75 2. Advanced search 3,36 3. Related terms 2,52 4. Boolean 2,42 5. Browsing subject categories 2,29 6. Filters 2,19 7. Thesaurus 1,87 8. Visualization 1,22 eHumanities group - PoliMedia 25
  • 26. Survey Conclusions • Google is the dominant search engine • This has two consequences 1. People compare other search systems to their experience with Google 2. The search task is mainly performed by using keywords eHumanities group - PoliMedia 26
  • 27. Interviews • N=5 • Quantitative (n=2) as well as qualitative (n=4) • Main themes – How do people search currently? – What could be improved about current search systems? – What should PoliMedia offer, given its goals? • Results – 39 user wishes – Prioritized internally • 19 user wishes deemed out of scope • 20 user requirements eHumanities group - PoliMedia 27
  • 28. Interviews Findings • Key issue is to provide a good overview of data – Why are search results retrieved – How are search results ranked • Assumptions of relevance – Higher frequency of keywords indicated higher relevancy to query? – Longer segments (speeches and articles) indicate higher importance? • Many more or less out-of-scope wishes to make current research easier – Sentiment-metadata – Context metadata – Ability to export to own software eHumanities group - PoliMedia 28
  • 29. • Clear and immediate keyword-search • Support for Booleans and (some) Google- search operators • Separate advanced-search eHumanities group - PoliMedia 29 Wireframes Search interface
  • 30. Wireframes Search results • Keyword search remains prominent • User chosen ranking of results • Keyword highlighting • Overview of related media • Support for filtering eHumanities group - PoliMedia 30
  • 31. Wireframes Debate page • Keyword search remains prominent • Overview of people in debate • Easy access to related material 31eHumanities group - PoliMedia
  • 33. Evaluation • Eye tracking evaluation of the search system – Search system was still in development • N=24 – History – Political communication • Goals – Gain understanding of distribution of attention – Collect general feedback on interface eHumanities group - PoliMedia 33
  • 34. Evaluation Eye tracking • Viewing Duration • Search bar received little attention after search results were displayed • Facets received a lot of attention • Page-search (CTRL+F) mainly received attention on debate page view eHumanities group - PoliMedia 34 Tasks Search bar Facets Search results Page-search Known Item 17% 22% 60% 2% Exploratory 6% 12% 80% 2%
  • 35. Evaluation Usability feedback • The ranking of search results was an issue for users • The year-filter should be a slider • The debate page should be greatly improved – Better identification for speaker, party, topic, relevance to query – Provide filters on debate-page as well eHumanities group - PoliMedia 35
  • 37. Prototype v2.0 - query eHumanities group - PoliMedia 37
  • 38. Prototype v2.0 – filter speaker eHumanities group - PoliMedia 38
  • 39. Prototype v2.0 - filter role eHumanities group - PoliMedia 39
  • 40. Prototype v2.0 - debate eHumanities group - PoliMedia 40
  • 41. Prototype v2.0 - highlight speech eHumanities group - PoliMedia 41
  • 42. Prototype v2.0 - link newspaper eHumanities group - PoliMedia 42
  • 43. Prototype v2.0 - newspaper eHumanities group - PoliMedia 43
  • 44. Prototype v2.0 - link radio eHumanities group - PoliMedia 44
  • 45. Conclusion • PoliMedia; data- or user-driven? • Continuous interplay – Users gave input for usefulness of links – Data limits what features we can offer to users • Collection quality and usability are both critical to users [3] [3] Xie, I. (2006). Evaluation of digital libraries: Criteria and problems from users’ perspectives. Library & Information Science Research, 28(3), 433–452. doi:10.1016/j.lisr.2006.06.002 eHumanities group - PoliMedia 45

Notas do Editor

  1. Create explicit links.
  2. Go to archives, look up original data, decide whether there is a link to a debate.
  3. Many systems, cross media analysis is difficult.
  4. Debates.
  5. used to check models, summarize the corpus, and guide exploration of its contents
  6. Manual evaluation of relevance media items to political speech? = unsure about relevance0 = not relevant1 = partially relevant2 = relevant
  7. Context metadata:Roles of peopleLinks toexternal databasesTypes of documentsTypes of presentation (dramatic, humoristic, etc.)