Crowdsourcing Knowledge-Intensive Tasks In Cultural Heritage: WebSci2014 Poster

•

2 likes•998 views

Lora Aroyo

Technology

45%$
12%$
13%$
3%$
27%$
Type%of%crowd%workers%annota1ons%
Genus$common$
Genus$botanical$
Species$common$
Species$botanical$
Non9ﬂower$names$
Crowdsourcing Knowledge-Intensive Tasks
In Cultural Heritage
Jasper Oosterman, Alessandro Bozzon, Geert-Jan Houben
Archana Nottamkandath, Chris Dijkshoorn, Lora Aroyo
Delft University of Technology
VU University Amsterdam
Cultural Heritage
Collections
Aspects of Knowledge
Intensive Tasks
Experiment Conclusions
Enrich data collections by tapping into the interest
and expertise of crowds to create knowledge;
Crowd Generated Knowledge.
ü  Data-intensive
• Rijksmuseum has 1M
art pieces requiring
annotation
ü  Knowledge-intensive
• Diverse and specific knowledge needed
ü  Goals
• Coverage: Enrich complete (sub)collection
• Quality: High quality annotations
A platform to support
crowd-enabled,
collaborative
annotation processes.
What is the relation
between entity
identification
difficulty and crowd
annotation behavior?
1
2
Challenge
Identification
•  Identify relevant entities
•  Prominence and amount
of entities
•  Artistic interpretation,
lack of detail, fantasy
Species
Rosa Californica
Genus
Rosa
Family
Rosaceae
Annotation
•  Tag identified entities
•  Specificity of tags
•  Domain and culture
specific knowledge
Setup
•  82 prints from the Rijksmuseum containing flowers
•  Tasks: annotate prints with specific flower names
•  Executed by experts and crowd workers via
crowdsourcing platforms
Experimental platform: Accurator
3
Insights
•  Domain specific tasks on CS platforms not popular, but
knowledge is present in some workers.
•  Flower prominence does not affect identification
•  Print difficulty only affects flower types identification
•  Low crowd annotator agreement à worker selection
and task orchestration are required
Links Demo Video
Experiment performed within the SEALINCMedia project.
Scan for a demo of Accurator or a video explaining our
research together with the Rijksmuseum.
%WrongAnswer
0
0.2
0.4
0.6
0.8
1.0
# of Flowers
NP P
Flower Types
NP P
# Workers opened task 732
# Workers passed test questions 84
# Selected workers 44
# Annotation tasks performed 488
# “Fantasy” task 58
# “Unable” task 70
# “Flowers” task 360
# Flower labels 465
Median
Median
ErrorRate
0
0.2
0.4
0.6
0.8
1.0
Flower # Id.
Easy Average Hard
Flower Type Id.
Easy Average Hard

Viewers also liked

Lecture 6: Human-Computer Interaction Course (2015) @VU University AmsterdamLora Aroyo

Lecture 5: Human-Computer Interaction Course (2015) @VU University AmsterdamLora Aroyo

User Centered Design 101Experience Dynamics

Introduction to HCI Deskala

Lecture 1: Human-Computer Interaction Introduction (2014)Lora Aroyo

SXSW2017 @NewDutchMedia Talk: Exploration is the New SearchLora Aroyo

How Google WorksEric Schmidt

Viewers also liked (7)

Lecture 6: Human-Computer Interaction Course (2015) @VU University Amsterdam

Lecture 5: Human-Computer Interaction Course (2015) @VU University Amsterdam

User Centered Design 101

Introduction to HCI

Lecture 1: Human-Computer Interaction Introduction (2014)

SXSW2017 @NewDutchMedia Talk: Exploration is the New Search

How Google Works

Similar to Crowdsourcing Knowledge-Intensive Tasks In Cultural Heritage: WebSci2014 Poster

Raymond E Zvavanyange_African Traditional Leadership Conference, November 14-...Young Professionals for Agricultural Development

Everyone's SmithsonianNina Simon

Researching Multilingually at the Borders of Language, the Body, Law and the ...researchingmultilingually

Genericity versus expressivity – reflections about the semantics of interoper...Andrea Scharnhorst

Ethno primer and observation skills workshop slidesLaura Yarrow

Indigenous Perspectives on Museum Diversity (Part 3/3) - Reclaiming our Place...West Muse

Ways of "researching multilingually" at the borders of language, the body, la...RMBorders

Malina lafayette 2019 colloquiumroger malina

Ethnography researchMAHARSHI DAYANAND UNIVERSITY

Antropologia/anthropologyWilmer Carrion

Researching multilingually exploring emerging linguistic practices in migrant...RMBorders

Finding the annotation needs of the botanical community in a digital libraryWilliam Ulate

Participatory [Citizen] ScienceMuki Haklay

Use Your Senses: Overcoming The Accessibility ParadoxMuseumNext

Non-traditional visitors' skills and expectations in museumsAniko Korenchy-Misz

Chapter 10 science for allKristin Eaquinto

RMTC Hub Presentation (P. Holmes)RMBorders

Research Data Management for the Humanities and Social SciencesMartin Donnelly

Towards Semantic Enrichment of Newspapers: a historical ecology use case Marieke van Erp

Similar to Crowdsourcing Knowledge-Intensive Tasks In Cultural Heritage: WebSci2014 Poster (20)

Raymond E Zvavanyange_African Traditional Leadership Conference, November 14-...

Everyone's Smithsonian

Researching Multilingually at the Borders of Language, the Body, Law and the ...

Genericity versus expressivity – reflections about the semantics of interoper...

Ethno primer and observation skills workshop slides

Indigenous Perspectives on Museum Diversity (Part 3/3) - Reclaiming our Place...

Ways of "researching multilingually" at the borders of language, the body, la...

Malina lafayette 2019 colloquium

Ethnography research

Antropologia/anthropology

Researching multilingually exploring emerging linguistic practices in migrant...

Finding the annotation needs of the botanical community in a digital library

Participatory [Citizen] Science

Use Your Senses: Overcoming The Accessibility Paradox

Non-traditional visitors' skills and expectations in museums

Chapter 10 science for all

RMTC Hub Presentation (P. Holmes)

Research Data Management for the Humanities and Social Sciences

Towards Semantic Enrichment of Newspapers: a historical ecology use case

Recently uploaded

[BuildWithAI] Introduction to Gemini.pdfSandro Moreira

AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin

Why Teams call analytics are critical to your entire businesspanagenda

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous

Corporate and higher education May webinar.pptxRustici Software

presentation ICT roal in 21st century educationjfdjdjcjdnsjd

AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer

Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya

MS Copilot expands with MS Graph connectorsNanddeep Nachan

CNIC Information System with Pakdata Cf In Pakistandanishmna97

Exploring Multimodal Embeddings with MilvusZilliz

TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub

Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez

Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services

DBX First Quarter 2024 Investor PresentationDropbox

Architecting Cloud Native ApplicationsWSO2

Recently uploaded (20)

[BuildWithAI] Introduction to Gemini.pdf

AWS Community Day CPH - Three problems of Terraform

Why Teams call analytics are critical to your entire business

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke

Corporate and higher education May webinar.pptx

presentation ICT roal in 21st century education

AXA XL - Insurer Innovation Award Americas 2024

Artificial Intelligence Chap.5 : Uncertainty

MS Copilot expands with MS Graph connectors

CNIC Information System with Pakdata Cf In Pakistan

Exploring Multimodal Embeddings with Milvus

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood

Strategies for Landing an Oracle DBA Job as a Fresher

DBX First Quarter 2024 Investor Presentation

Architecting Cloud Native Applications

Crowdsourcing Knowledge-Intensive Tasks In Cultural Heritage: WebSci2014 Poster

1. 45%$ 12%$ 13%$ 3%$ 27%$ Type%of%crowd%workers%annota1ons% Genus$common$ Genus$botanical$ Species$common$ Species$botanical$ Non9ﬂower$names$ Crowdsourcing Knowledge-Intensive Tasks In Cultural Heritage Jasper Oosterman, Alessandro Bozzon, Geert-Jan Houben Archana Nottamkandath, Chris Dijkshoorn, Lora Aroyo Delft University of Technology VU University Amsterdam Cultural Heritage Collections Aspects of Knowledge Intensive Tasks Experiment Conclusions Enrich data collections by tapping into the interest and expertise of crowds to create knowledge; Crowd Generated Knowledge. ü  Data-intensive • Rijksmuseum has 1M art pieces requiring annotation ü  Knowledge-intensive • Diverse and specific knowledge needed ü  Goals • Coverage: Enrich complete (sub)collection • Quality: High quality annotations A platform to support crowd-enabled, collaborative annotation processes. What is the relation between entity identification difficulty and crowd annotation behavior? 1 2 Challenge Identification •  Identify relevant entities •  Prominence and amount of entities •  Artistic interpretation, lack of detail, fantasy Species Rosa Californica Genus Rosa Family Rosaceae Annotation •  Tag identified entities •  Specificity of tags •  Domain and culture specific knowledge Setup •  82 prints from the Rijksmuseum containing flowers •  Tasks: annotate prints with specific flower names •  Executed by experts and crowd workers via crowdsourcing platforms Experimental platform: Accurator 3 Insights •  Domain specific tasks on CS platforms not popular, but knowledge is present in some workers. •  Flower prominence does not affect identification •  Print difficulty only affects flower types identification •  Low crowd annotator agreement à worker selection and task orchestration are required Links Demo Video Experiment performed within the SEALINCMedia project. Scan for a demo of Accurator or a video explaining our research together with the Rijksmuseum. %WrongAnswer 0 0.2 0.4 0.6 0.8 1.0 # of Flowers NP P Flower Types NP P # Workers opened task 732 # Workers passed test questions 84 # Selected workers 44 # Annotation tasks performed 488 # “Fantasy” task 58 # “Unable” task 70 # “Flowers” task 360 # Flower labels 465 Median Median ErrorRate 0 0.2 0.4 0.6 0.8 1.0 Flower # Id. Easy Average Hard Flower Type Id. Easy Average Hard

Crowdsourcing Knowledge-Intensive Tasks In Cultural Heritage: WebSci2014 Poster

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (7)

Similar to Crowdsourcing Knowledge-Intensive Tasks In Cultural Heritage: WebSci2014 Poster

Similar to Crowdsourcing Knowledge-Intensive Tasks In Cultural Heritage: WebSci2014 Poster (20)

More from Lora Aroyo

More from Lora Aroyo (20)

Recently uploaded

Recently uploaded (20)

Crowdsourcing Knowledge-Intensive Tasks In Cultural Heritage: WebSci2014 Poster