Estermann Wikidata and Heritage Data 20170914

Beat Estermann, Researcher at Bern University of Applied Sciences
Wikidata & Heritage Data
Where do we stand? What’s next?
Lausanne, 14 September 2017
Sijie Dai, Captain Alving – Prix de Lausanne 2010. Photo by Inisheer, CC BY-SA (Wikimedia Commons)
Unless otherwise noted, the content of this presentation is made available under the CC BY 4.0 license.
▶ The aim of this project is to coordinate, facilitate and promote
the ingestion of cultural heritage related data into
Wikidata, to facilitate the cleansing and enhancement of this
data and to promote its use across Wikipedia, its sister
projects and beyond.
▶ It is our vision to establish Wikidata as a central hub for data
integration, data enhancement, and data management in
the heritage domain.
Aim and Vision (WikiProject Cultural Heritage)
▶ Establish Wikidata as a database that covers the entire world’s
cultural heritage.
▶ Establish Wikidata as a central hub that interlinks GLAM collections
around the world; and provides links to bibliographic, genealogic,
scientific and other collections of information; create the ultimate
authority file.
▶ Foster truly multilingual and global collaboration among people
from various backgrounds.
▶ Leverage synergies between institutions, reduce duplicate work.
▶ Encourage debate in the community by highlighting and
interrogating differences in perspective.
▶ Provide a single source of data for some of the most popular web
sites and apps, including Wikipedia infoboxes and lists.
Vision (Blog posts: Stinson et al. 2016; Thornton / Cochrane 2016; Poulter 2017)
Thematic Projects
https://www.wikidata.org/wiki/Wikidata:WikiProject_Cultural_heritage [Example]
Current Challenges & Insights
[Diagram: Core Aspects of the Project]
▶ Core Processes: Data Provision, Ontology Development, Data Ingestion, Data Maintenance, Data Use
▶ Community & Collaboration
▶ Platforms & Tools
▶ Wikidata Within the Wider Data Landscape
▶ Wikidata needs to be explained to institutions in view of data
donations.
• Lack of awareness of the importance of open licenses in
databases
• Fears of loss of control related to publishing data under CC-0
• What can institutions gain from their involvement in Wikidata?
▶ Community members need assistance with scraping data from
websites.
▶ Present coverage is biased; it is highest for Western Europe and
North America; how to get access to data from other world regions?
How To Get Access to Freely Licensed Data?
▶ http://make.opendata.ch/wiki/data:glam_ch
• Personnalités Vaudoises (BCUL)
• Swiss Photography Metadata (Büro für Fotografiegeschichte)
• Artist data from the SIKART Lexicon on art in Switzerland (SIK-ISEA)
• Metadata of the Historical Dictionary of Switzerland (HLS)
• PCP Inventory (Federal Office for Civil Protection)
• Inventory of Historical Monuments (Canton of Zurich)
• Inventory of Historical Monuments (City of Zurich)
• Inventory of classified Gardens and Parks (City of Zurich)
• Art in the Urban Space (City of Zurich)
• Swiss GLAM Inventory (OpenGLAM)
• Inventory of Research Libraries in Switzerland (Swissbib)
• ISplus Swiss (G)LAM Inventory (Swiss National Library)
• Schauspielhaus Zürich Repertoire of Theatre and other Productions, 1938–1968
• Swiss Theatre Metadata (Swiss Theatre Collection)
• Plazi TreatmentBank (repository of the world's species) (Plazi.org)
• Historical Statistics of Switzerland (University of Zurich)
Data Provision – Which Datasets are Useful?
Challenges Related to Ontology Development (1/2)
All rights reserved.
▶ Coping with the Bazaar:
• Sometimes changes to property definitions are too easily made by
volunteers
• There is a rigorous process for creating new properties, but not for
changing definitions of properties or creating new classes
• No master language; how to keep translations of definitions in synch?
• Sometimes different approaches are used to model the same thing.
▶ What are good design principles?
• Re-usability of properties across various domains
• Select high priority areas first, do not try to solve everything overnight for
the entire cultural heritage domain
• …
▶ Finding a balance between:
• The expressive power of an ontology
• Its practicability when it comes to large scale use by many people
• Its queryability (usability from the perspective of data users)
Challenges Related to Ontology Development (2/2)
▶ Mapping Between Data Models
• Getting an overview of appropriate properties and classes can be a
time-consuming exercise.
• Creating new properties requires community agreement and may involve
lengthy discussions and compromises.
• There is still a lot of work to be done in the area of typologies and
thesauri [Example]
▶ Matching Items / Disambiguation
• There are tools like Mix’n’Match and OpenRefine to support this, but it
remains a major challenge, esp. with datasets which haven’t resolved this
issue internally.
▶ Incorrect / Incoherent Data on Wikidata
• Many data ingestion projects require cleaning up existing data first.
▶ Repeated Ingestion / Updates
• How to approach the historicization of data?
• How to set up processes to regularly update data?
Challenges Related to Data Ingestion
N.B.: We are not filling a void or starting from scratch, but contributing to an
existing ecosystem of data, data models, and community members!
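To illustrate the matching and disambiguation step, here is a minimal sketch in Python. It is not how Mix’n’Match or OpenRefine work internally. It assumes that candidate labels have already been exported from Wikidata, and the item IDs (`Q_A`, `Q_B`), the function names, and the threshold of 0.85 are hypothetical.

```python
import difflib
import unicodedata


def normalize(name: str) -> str:
    """Lowercase, strip accents and punctuation, and sort the name tokens."""
    decomposed = unicodedata.normalize("NFKD", name)
    without_accents = "".join(
        ch for ch in decomposed if not unicodedata.combining(ch)
    )
    cleaned = "".join(
        ch if ch.isalnum() else " " for ch in without_accents.lower()
    )
    return " ".join(sorted(cleaned.split()))


def match_candidates(record_name, wikidata_labels, threshold=0.85):
    """Return (item_id, score) pairs whose label resembles record_name.

    wikidata_labels maps item IDs to labels. The threshold is an
    arbitrary assumption and would need tuning on real data.
    """
    target = normalize(record_name)
    scored = []
    for item_id, label in wikidata_labels.items():
        score = difflib.SequenceMatcher(None, target, normalize(label)).ratio()
        if score >= threshold:
            scored.append((item_id, round(score, 2)))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical item IDs and labels, for illustration only.
labels = {
    "Q_A": "Zürich, Kunsthaus",
    "Q_B": "Kunsthalle Basel",
}
candidates = match_candidates("Kunsthaus Zürich", labels)
print(candidates)
```

A label match alone does not establish identity. In practice the candidates would still need manual review or additional evidence such as location or founding date.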
Example: Data Cleansing
▶ Establishing and Documenting Data Quality
• Getting rid of duplicates
• Dealing with incorrect and inconsistent data
• How to monitor data quality and data completeness?
▶ Building a Network of Trust
• Linking all statements to a reliable source
• In the future: “Signed Statements” 
▶ Data Exchange Between Wikidata and Primary Databases
• Data synchronization: How to keep data mutually up to date?
• How to make it easier for GLAM employees to follow
changes/improvements to their data on Wikidata?
Challenges Related to Data Maintenance
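One way to approach the synchronization question is to compare two snapshots of the same dataset, for example an institution's export and the corresponding data retrieved from Wikidata. The sketch below is a minimal Python illustration under that assumption. The record IDs, field names, and the function `diff_snapshots` are hypothetical and do not refer to an existing tool.

```python
def diff_snapshots(old, new):
    """Compare two snapshots keyed by record ID.

    Each snapshot maps a record ID to a dict of field values.
    Returns the added IDs, the removed IDs, and per-field changes
    as (old_value, new_value) pairs.
    """
    added = sorted(set(new) - set(old))
    removed = sorted(set(old) - set(new))
    changed = {}
    for record_id in sorted(set(old) & set(new)):
        fields = set(old[record_id]) | set(new[record_id])
        delta = {
            field: (old[record_id].get(field), new[record_id].get(field))
            for field in sorted(fields)
            if old[record_id].get(field) != new[record_id].get(field)
        }
        if delta:
            changed[record_id] = delta
    return {"added": added, "removed": removed, "changed": changed}


# Hypothetical snapshots, for illustration only.
primary_db = {
    "inst-1": {"name": "Stadtarchiv", "city": "Bern"},
    "inst-2": {"name": "Museum X", "city": "Basel"},
}
wikidata_copy = {
    "inst-1": {"name": "Stadtarchiv Bern", "city": "Bern"},
    "inst-3": {"name": "Bibliothek Y", "city": "Chur"},
}
report = diff_snapshots(primary_db, wikidata_copy)
print(report)
```

Such a report only shows where the two copies diverge. Deciding which side is correct remains an editorial task.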
▶ Chicken and Egg Problem:
• Data usage drives data quality & completeness
• Data quality & completeness are prerequisites of data use
Challenges Related to Data Use
[Example]
▶ Linking Wikidata with other databases
• Map existing standards from the GLAM sector to Wikidata
• Merge data imported from Wikipedia with data from reliable databases
▶ In what areas is Wikidata supposed to…
• serve as the master database (referencing sources other than databases)?
• hold data imported from reliable databases?
• link to authoritative databases (without holding the actual data)?
▶ How should GLAMs organize their relationship with Wikidata?
• Provide mutual links?
• Ingest part or all of their data into Wikidata?
• Synchronize part or all of their data with Wikidata?
• Use Wikidata as their main database?
Wikidata and the Wider Data Landscape
▶ How to improve guidelines, community structures, reporting etc. in
order to be able to involve more GLAM personnel in Wikidata?
▶ How best to foster a shared data modelling practice in various
areas? (Need for more modelling show cases, coordination, etc.)
▶ Need for training and tools (to facilitate the accomplishment of
certain tasks).
▶ The evolving tools landscape constitutes a challenge when
establishing processes and working with guidelines.
▶ https://www.wikidata.org/wiki/Wikidata:WikiProject_Cultural_heritage
▶ Wikidata + GLAM Facebook Group
Community & Collaboration
Useful Tools
▶ Example: Tools I used for the ingest of the Swiss GLAM
Inventory:
• Microsoft Excel / Open Office Calc
• Wikidata Query Service
• OpenRefine
• Reconcile-csv
• Listeria
• QuickStatements
• Microsoft Word / Excel (mail merge)
• Hatnote: «Listen to Wikipedia»
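As an illustration of how a spreadsheet can be turned into a batch for QuickStatements, the following Python sketch writes tab-separated commands in the tool's version 1 text syntax (`CREATE`, then `LAST` lines). It is not the workflow actually used for the Swiss GLAM Inventory. The row layout and the function name are assumptions, and the property and item IDs (P31, P17, Q33506, Q39) should be verified on Wikidata before any real upload.

```python
def to_quickstatements(rows):
    """Build QuickStatements (version 1 syntax) commands for new items.

    Each row is a dict with an English label, a class item for
    'instance of' (P31) and a country item (P17). Labels are assumed
    to contain no tabs or double quotes.
    """
    lines = []
    for row in rows:
        lines.append("CREATE")
        # 'Len' sets the English label of the item created just before.
        lines.append("\t".join(["LAST", "Len", '"' + row["label_en"] + '"']))
        lines.append("\t".join(["LAST", "P31", row["instance_of"]]))
        lines.append("\t".join(["LAST", "P17", row["country"]]))
    return "\n".join(lines)


# Hypothetical inventory rows; the IDs are assumptions to be checked.
rows = [
    {"label_en": "Example Museum", "instance_of": "Q33506", "country": "Q39"},
    {"label_en": "Sample Museum", "instance_of": "Q33506", "country": "Q39"},
]
commands = to_quickstatements(rows)
print(commands)
```

Before creating items this way, each row should first be checked against existing Wikidata items to avoid duplicates.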
▶ Diff tools to help tracking changes in datasets on Wikidata and to
synchronize with external databases
▶ Statistics tools (data completeness; data use)
▶ Data visualization tools (beyond what the Query service can already
do)
▶ Data tracking tools (data completeness; see how data evolves)
▶ Improved version of the QuickStatements tool (see feature
requests)
▶ Customizable forms for manual data entry
Tools – Wishlist
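A data completeness statistic of the kind wished for above could start from something as simple as the share of items that carry a given property. The Python sketch below assumes the items have already been retrieved and are represented as dicts mapping property IDs to lists of values. This structure, the function name, and the sample values are hypothetical, and the property IDs serve as examples only.

```python
def completeness(items, properties):
    """Return, per property, the share of items with at least one value.

    items is a list of dicts mapping property IDs to lists of values.
    A missing key or an empty list counts as 'no value'.
    """
    total = len(items)
    if total == 0:
        return {prop: 0.0 for prop in properties}
    return {
        prop: sum(1 for item in items if item.get(prop)) / total
        for prop in properties
    }


# Hypothetical items, for illustration only.
items = [
    {"P17": ["Q39"], "P625": ["46.95,7.45"], "P856": ["https://example.org"]},
    {"P17": ["Q39"], "P625": ["47.37,8.54"]},
    {"P17": ["Q39"]},
    {"P17": ["Q39"], "P625": []},
]
stats = completeness(items, ["P17", "P625", "P856"])
print(stats)
```

Tracking these shares over time would also cover the "see how data evolves" part of the wishlist.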
Thank You for Your Attention!
Contact
Beat Estermann
Bern University of Applied Sciences
beat.estermann@bfh.ch
+41 31 848 34 38
1 de 24

Recomendados

LinkedUp - European Data Forum por
LinkedUp - European Data ForumLinkedUp - European Data Forum
LinkedUp - European Data ForumMarieke Guy
12K visualizações26 slides
Artivity phase 3 pitch por
Artivity phase 3 pitchArtivity phase 3 pitch
Artivity phase 3 pitchAthanasios Velios
6.5K visualizações57 slides
Wikidata Introductory Workshop por
Wikidata Introductory WorkshopWikidata Introductory Workshop
Wikidata Introductory WorkshopBeat Estermann
221 visualizações36 slides
Wikidata Introduction, Linked Digital Future Initiative, August 2019 por
Wikidata Introduction, Linked Digital Future Initiative, August 2019Wikidata Introduction, Linked Digital Future Initiative, August 2019
Wikidata Introduction, Linked Digital Future Initiative, August 2019Beat Estermann
193 visualizações30 slides
The Open Education Working Group: Bringing people and projects together por
The Open Education Working Group: Bringing people and projects togetherThe Open Education Working Group: Bringing people and projects together
The Open Education Working Group: Bringing people and projects togetherMarieke Guy
2.3K visualizações19 slides
Zeng marcia ifla-subjectaccesssmartdatadh por
Zeng marcia ifla-subjectaccesssmartdatadhZeng marcia ifla-subjectaccesssmartdatadh
Zeng marcia ifla-subjectaccesssmartdatadhMarcia Zeng
3.1K visualizações51 slides

Mais conteúdo relacionado

Mais procurados

Europeana Research Panel DH Benelux 2017 por
Europeana Research Panel DH Benelux 2017Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017Europeana
203 visualizações46 slides
Working with other sectors por
Working with other sectorsWorking with other sectors
Working with other sectorsJisc
1.1K visualizações9 slides
LinkedUp at Mozilla Festival Science Fair por
LinkedUp at Mozilla Festival Science FairLinkedUp at Mozilla Festival Science Fair
LinkedUp at Mozilla Festival Science FairMarieke Guy
2.1K visualizações16 slides
Adoption and Integration of Persistent Identifiers in European Research Infor... por
Adoption and Integration of Persistent Identifiers in European Research Infor...Adoption and Integration of Persistent Identifiers in European Research Infor...
Adoption and Integration of Persistent Identifiers in European Research Infor...LIBER Europe
1.7K visualizações29 slides
Estermann Panel on Authority Files, 3 June 2020 por
Estermann Panel on Authority Files, 3 June 2020Estermann Panel on Authority Files, 3 June 2020
Estermann Panel on Authority Files, 3 June 2020Beat Estermann
347 visualizações21 slides
Wikidata and performing_arts_20170811 por
Wikidata and performing_arts_20170811Wikidata and performing_arts_20170811
Wikidata and performing_arts_20170811Beat Estermann
5.1K visualizações32 slides

Mais procurados(16)

Europeana Research Panel DH Benelux 2017 por Europeana
Europeana Research Panel DH Benelux 2017Europeana Research Panel DH Benelux 2017
Europeana Research Panel DH Benelux 2017
Europeana203 visualizações
Working with other sectors por Jisc
Working with other sectorsWorking with other sectors
Working with other sectors
Jisc1.1K visualizações
LinkedUp at Mozilla Festival Science Fair por Marieke Guy
LinkedUp at Mozilla Festival Science FairLinkedUp at Mozilla Festival Science Fair
LinkedUp at Mozilla Festival Science Fair
Marieke Guy2.1K visualizações
Adoption and Integration of Persistent Identifiers in European Research Infor... por LIBER Europe
Adoption and Integration of Persistent Identifiers in European Research Infor...Adoption and Integration of Persistent Identifiers in European Research Infor...
Adoption and Integration of Persistent Identifiers in European Research Infor...
LIBER Europe1.7K visualizações
Estermann Panel on Authority Files, 3 June 2020 por Beat Estermann
Estermann Panel on Authority Files, 3 June 2020Estermann Panel on Authority Files, 3 June 2020
Estermann Panel on Authority Files, 3 June 2020
Beat Estermann347 visualizações
Wikidata and performing_arts_20170811 por Beat Estermann
Wikidata and performing_arts_20170811Wikidata and performing_arts_20170811
Wikidata and performing_arts_20170811
Beat Estermann5.1K visualizações
Are we failing users? Can open approaches meet their needs? - Maura Marx por Jisc
Are we failing users? Can open approaches meet their needs? - Maura MarxAre we failing users? Can open approaches meet their needs? - Maura Marx
Are we failing users? Can open approaches meet their needs? - Maura Marx
Jisc1.3K visualizações
Wikidata and performing_arts_20180116 por Beat Estermann
Wikidata and performing_arts_20180116Wikidata and performing_arts_20180116
Wikidata and performing_arts_20180116
Beat Estermann255 visualizações
The GND initiative 2017-2021: Developing a Backbone for the Web of Cultural a... por LIBER Europe
The GND initiative 2017-2021: Developing a Backbone for the Web of Cultural a...The GND initiative 2017-2021: Developing a Backbone for the Web of Cultural a...
The GND initiative 2017-2021: Developing a Backbone for the Web of Cultural a...
LIBER Europe1.9K visualizações
Bridging the gap between researchers and research data management por Marieke Guy
Bridging the gap between researchers and research data management   Bridging the gap between researchers and research data management
Bridging the gap between researchers and research data management
Marieke Guy1.5K visualizações
Research data spring: clipper por Jisc RDM
Research data spring: clipperResearch data spring: clipper
Research data spring: clipper
Jisc RDM5.1K visualizações
Clare Lanigan - Presentation to IES Students por dri_ireland
Clare Lanigan - Presentation to IES StudentsClare Lanigan - Presentation to IES Students
Clare Lanigan - Presentation to IES Students
dri_ireland241 visualizações
Multimedia-2016_Brochure por Gracy Jones
Multimedia-2016_BrochureMultimedia-2016_Brochure
Multimedia-2016_Brochure
Gracy Jones259 visualizações

Similar a Estermann Wikidata and Heritage Data 20170914

Estermann wikidata introduction-sapa-20180630 por
Estermann wikidata introduction-sapa-20180630Estermann wikidata introduction-sapa-20180630
Estermann wikidata introduction-sapa-20180630Beat Estermann
165 visualizações47 slides
Seeing Connecticut Now and Then: Repository Services that Support Your Best M... por
Seeing Connecticut Now and Then: Repository Services that Support Your Best M...Seeing Connecticut Now and Then: Repository Services that Support Your Best M...
Seeing Connecticut Now and Then: Repository Services that Support Your Best M...University of Connecticut Libraries
248 visualizações29 slides
Estermann Wikidata GLAM Example Projects 20170914 por
Estermann Wikidata GLAM Example Projects 20170914Estermann Wikidata GLAM Example Projects 20170914
Estermann Wikidata GLAM Example Projects 20170914Beat Estermann
605 visualizações30 slides
A Manifesto for the Digital Shift in Research Libraries por
A Manifesto for the Digital Shift in Research LibrariesA Manifesto for the Digital Shift in Research Libraries
A Manifesto for the Digital Shift in Research LibrariesTorsten Reimer
1.4K visualizações18 slides
What is eScience, and where does it go from here? por
What is eScience, and where does it go from here?What is eScience, and where does it go from here?
What is eScience, and where does it go from here?Daniel S. Katz
307 visualizações18 slides
Sgci nsf-si2-2-21-17 por
Sgci nsf-si2-2-21-17Sgci nsf-si2-2-21-17
Sgci nsf-si2-2-21-17Nancy Wilkins-Diehr
168 visualizações22 slides

Similar a Estermann Wikidata and Heritage Data 20170914(20)

Estermann wikidata introduction-sapa-20180630 por Beat Estermann
Estermann wikidata introduction-sapa-20180630Estermann wikidata introduction-sapa-20180630
Estermann wikidata introduction-sapa-20180630
Beat Estermann165 visualizações
Estermann Wikidata GLAM Example Projects 20170914 por Beat Estermann
Estermann Wikidata GLAM Example Projects 20170914Estermann Wikidata GLAM Example Projects 20170914
Estermann Wikidata GLAM Example Projects 20170914
Beat Estermann605 visualizações
A Manifesto for the Digital Shift in Research Libraries por Torsten Reimer
A Manifesto for the Digital Shift in Research LibrariesA Manifesto for the Digital Shift in Research Libraries
A Manifesto for the Digital Shift in Research Libraries
Torsten Reimer1.4K visualizações
What is eScience, and where does it go from here? por Daniel S. Katz
What is eScience, and where does it go from here?What is eScience, and where does it go from here?
What is eScience, and where does it go from here?
Daniel S. Katz307 visualizações
Sgci nsf-si2-2-21-17 por Nancy Wilkins-Diehr
Sgci nsf-si2-2-21-17Sgci nsf-si2-2-21-17
Sgci nsf-si2-2-21-17
Nancy Wilkins-Diehr168 visualizações
The Biodiversity Information Standards (TDWG): Opportunities for Collaboratio... por Martin Kalfatovic
The Biodiversity Information Standards (TDWG): Opportunities for Collaboratio...The Biodiversity Information Standards (TDWG): Opportunities for Collaboratio...
The Biodiversity Information Standards (TDWG): Opportunities for Collaboratio...
Martin Kalfatovic439 visualizações
Aggregation of Linked Data A case study in the cultural heritage domain por Nuno Freire
Aggregation of Linked Data A case study in the cultural heritage domainAggregation of Linked Data A case study in the cultural heritage domain
Aggregation of Linked Data A case study in the cultural heritage domain
Nuno Freire144 visualizações
Research into Practice case study 2: Library linked data implementations an... por Hazel Hall
	Research into Practice case study 2:  Library linked data implementations an...	Research into Practice case study 2:  Library linked data implementations an...
Research into Practice case study 2: Library linked data implementations an...
Hazel Hall308 visualizações
How you and your gateway can benefit from the services of the Science Gateway... por Katherine Lawrence
How you and your gateway can benefit from the services of the Science Gateway...How you and your gateway can benefit from the services of the Science Gateway...
How you and your gateway can benefit from the services of the Science Gateway...
Katherine Lawrence127 visualizações
KEDL DBpedia 2019 por Sebastian Hellmann
KEDL DBpedia  2019KEDL DBpedia  2019
KEDL DBpedia 2019
Sebastian Hellmann392 visualizações
Operationalising AI at a national library por Mia
Operationalising AI at a national libraryOperationalising AI at a national library
Operationalising AI at a national library
Mia 889 visualizações
RDM LIASA webinar por Sarah Jones
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinar
Sarah Jones3.6K visualizações
Webinar: Decarboni.se – building a climate change solution web platform por Global CCS Institute
Webinar: Decarboni.se – building a climate change solution web platform Webinar: Decarboni.se – building a climate change solution web platform
Webinar: Decarboni.se – building a climate change solution web platform
Global CCS Institute569 visualizações
Using Open Data and Citizen Science to Promote Citizen Engagement with Green ... por Azavea
Using Open Data and Citizen Science to Promote Citizen Engagement with Green ...Using Open Data and Citizen Science to Promote Citizen Engagement with Green ...
Using Open Data and Citizen Science to Promote Citizen Engagement with Green ...
Azavea749 visualizações
Engaging with students and researchers: the case of the social sciences por Louise Corti
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
Louise Corti215 visualizações
2-21-12 Preservation Planning Success Stories Slides por DuraSpace
2-21-12 Preservation Planning Success Stories Slides2-21-12 Preservation Planning Success Stories Slides
2-21-12 Preservation Planning Success Stories Slides
DuraSpace440 visualizações

Mais de Beat Estermann

Linked Open Data for the Performing Arts: Latest Developments in Switzerland,... por
Linked Open Data for the Performing Arts: Latest Developments in Switzerland,...Linked Open Data for the Performing Arts: Latest Developments in Switzerland,...
Linked Open Data for the Performing Arts: Latest Developments in Switzerland,...Beat Estermann
16 visualizações12 slides
Presentation Opendata.ch Association / Open Event Data por
Presentation Opendata.ch Association / Open Event DataPresentation Opendata.ch Association / Open Event Data
Presentation Opendata.ch Association / Open Event DataBeat Estermann
4 visualizações18 slides
Digital Public Goods in the Service of Digital Self-Determination, Digital S... por
Digital Public Goods in the Service of Digital Self-Determination, Digital S...Digital Public Goods in the Service of Digital Self-Determination, Digital S...
Digital Public Goods in the Service of Digital Self-Determination, Digital S...Beat Estermann
12 visualizações26 slides
Datenraum für Kultur- und Kulturerbedaten, 15. Nov. 2022 por
Datenraum für Kultur- und Kulturerbedaten, 15. Nov. 2022Datenraum für Kultur- und Kulturerbedaten, 15. Nov. 2022
Datenraum für Kultur- und Kulturerbedaten, 15. Nov. 2022Beat Estermann
50 visualizações12 slides
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020 por
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020Beat Estermann
184 visualizações23 slides
Open Cultural Data in Switzerland por
Open Cultural Data in SwitzerlandOpen Cultural Data in Switzerland
Open Cultural Data in SwitzerlandBeat Estermann
113 visualizações15 slides

Mais de Beat Estermann(20)

Linked Open Data for the Performing Arts: Latest Developments in Switzerland,... por Beat Estermann
Linked Open Data for the Performing Arts: Latest Developments in Switzerland,...Linked Open Data for the Performing Arts: Latest Developments in Switzerland,...
Linked Open Data for the Performing Arts: Latest Developments in Switzerland,...
Beat Estermann16 visualizações
Presentation Opendata.ch Association / Open Event Data por Beat Estermann
Presentation Opendata.ch Association / Open Event DataPresentation Opendata.ch Association / Open Event Data
Presentation Opendata.ch Association / Open Event Data
Beat Estermann4 visualizações
Digital Public Goods in the Service of Digital Self-Determination, Digital S... por Beat Estermann
Digital Public Goods in the Service of Digital Self-Determination, Digital S...Digital Public Goods in the Service of Digital Self-Determination, Digital S...
Digital Public Goods in the Service of Digital Self-Determination, Digital S...
Beat Estermann12 visualizações
Datenraum für Kultur- und Kulturerbedaten, 15. Nov. 2022 por Beat Estermann
Datenraum für Kultur- und Kulturerbedaten, 15. Nov. 2022Datenraum für Kultur- und Kulturerbedaten, 15. Nov. 2022
Datenraum für Kultur- und Kulturerbedaten, 15. Nov. 2022
Beat Estermann50 visualizações
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020 por Beat Estermann
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020
Beat Estermann184 visualizações
Open Cultural Data in Switzerland por Beat Estermann
Open Cultural Data in SwitzerlandOpen Cultural Data in Switzerland
Open Cultural Data in Switzerland
Beat Estermann113 visualizações
BFH-Studie Digitalisierung und Umwelt - BAFU-Kaderklausur - 20191127 por Beat Estermann
BFH-Studie Digitalisierung und Umwelt - BAFU-Kaderklausur - 20191127BFH-Studie Digitalisierung und Umwelt - BAFU-Kaderklausur - 20191127
BFH-Studie Digitalisierung und Umwelt - BAFU-Kaderklausur - 20191127
Beat Estermann129 visualizações
Wikidata Conference 2019 GLAM Panel - 20191025 por Beat Estermann
Wikidata Conference 2019 GLAM Panel - 20191025Wikidata Conference 2019 GLAM Panel - 20191025
Wikidata Conference 2019 GLAM Panel - 20191025
Beat Estermann226 visualizações
Estermann ENICPA Wiki Loves Performing Arts 20191022 por Beat Estermann
Estermann ENICPA Wiki Loves Performing Arts 20191022Estermann ENICPA Wiki Loves Performing Arts 20191022
Estermann ENICPA Wiki Loves Performing Arts 20191022
Beat Estermann193 visualizações
Bootstrapping the International Knowledge Base for the Performing Arts por Beat Estermann
Bootstrapping the International Knowledge Base for the Performing ArtsBootstrapping the International Knowledge Base for the Performing Arts
Bootstrapping the International Knowledge Base for the Performing Arts
Beat Estermann143 visualizações
Estermann wd glam-intro_20181204 por Beat Estermann
Estermann wd glam-intro_20181204Estermann wd glam-intro_20181204
Estermann wd glam-intro_20181204
Beat Estermann143 visualizações
Workshop "Performing Arts Database based on Wikidata" por Beat Estermann
Workshop "Performing Arts Database based on Wikidata"Workshop "Performing Arts Database based on Wikidata"
Workshop "Performing Arts Database based on Wikidata"
Beat Estermann218 visualizações
Estermann wikidata performing-arts-20181109 por Beat Estermann
Estermann wikidata performing-arts-20181109Estermann wikidata performing-arts-20181109
Estermann wikidata performing-arts-20181109
Beat Estermann130 visualizações
Estermann performing arts_database_20180721 por Beat Estermann
Estermann performing arts_database_20180721Estermann performing arts_database_20180721
Estermann performing arts_database_20180721
Beat Estermann364 visualizações
Estermann spa platform-ontology_development_20180116 por Beat Estermann
Estermann spa platform-ontology_development_20180116Estermann spa platform-ontology_development_20180116
Estermann spa platform-ontology_development_20180116
Beat Estermann404 visualizações
OpenGLAM CH Hackathons por Beat Estermann
OpenGLAM CH HackathonsOpenGLAM CH Hackathons
OpenGLAM CH Hackathons
Beat Estermann209 visualizações
The Role of Heritage Institutions in the Context of a National Data Infrastru... por Beat Estermann
The Role of Heritage Institutions in the Context of a National Data Infrastru...The Role of Heritage Institutions in the Context of a National Data Infrastru...
The Role of Heritage Institutions in the Context of a National Data Infrastru...
Beat Estermann373 visualizações
Towards a National Data Infrastructure. First Insights Regarding Its Design a... por Beat Estermann
Towards a National Data Infrastructure. First Insights Regarding Its Design a...Towards a National Data Infrastructure. First Insights Regarding Its Design a...
Towards a National Data Infrastructure. First Insights Regarding Its Design a...
Beat Estermann1.6K visualizações
Estermann montreal symposium_2016_open_glam_benchmark_survey_20160509 por Beat Estermann
Estermann montreal symposium_2016_open_glam_benchmark_survey_20160509Estermann montreal symposium_2016_open_glam_benchmark_survey_20160509
Estermann montreal symposium_2016_open_glam_benchmark_survey_20160509
Beat Estermann773 visualizações
Estermann irspm2016 open_glam_in_practice_20160414 por Beat Estermann
Estermann irspm2016 open_glam_in_practice_20160414Estermann irspm2016 open_glam_in_practice_20160414
Estermann irspm2016 open_glam_in_practice_20160414
Beat Estermann580 visualizações

Último

Mapping location and co-location of industries at the neighborhood level - A... por
Mapping location and co-location of industries at the neighborhood level  - A...Mapping location and co-location of industries at the neighborhood level  - A...
Mapping location and co-location of industries at the neighborhood level - A...OECD CFE
6 visualizações19 slides
Financial sustainability of schemes managed by PHED in Punjab_Krishnakumar Th... por
Financial sustainability of schemes managed by PHED in Punjab_Krishnakumar Th...Financial sustainability of schemes managed by PHED in Punjab_Krishnakumar Th...
Financial sustainability of schemes managed by PHED in Punjab_Krishnakumar Th...India Water Portal
6 visualizações19 slides
Research - Asrayan Project of BD por
Research  - Asrayan Project of BDResearch  - Asrayan Project of BD
Research - Asrayan Project of BDMd. Masudur Rahman, PMP
14 visualizações25 slides
Contributi L. 3/2019 por
Contributi L. 3/2019Contributi L. 3/2019
Contributi L. 3/2019Partito democratico
37 visualizações256 slides
University of Cambridge: COP28 briefing por
University of Cambridge: COP28 briefingUniversity of Cambridge: COP28 briefing
University of Cambridge: COP28 briefingEnergy for One World
12 visualizações33 slides
Dr. Fleur Wouterse - 2023 ReSAKSS Conference.pptx por
Dr. Fleur Wouterse - 2023 ReSAKSS Conference.pptxDr. Fleur Wouterse - 2023 ReSAKSS Conference.pptx
Dr. Fleur Wouterse - 2023 ReSAKSS Conference.pptxAKADEMIYA2063
7 visualizações11 slides

Último(20)

Mapping location and co-location of industries at the neighborhood level - A... por OECD CFE
Mapping location and co-location of industries at the neighborhood level  - A...Mapping location and co-location of industries at the neighborhood level  - A...
Mapping location and co-location of industries at the neighborhood level - A...
OECD CFE6 visualizações
Financial sustainability of schemes managed by PHED in Punjab_Krishnakumar Th... por India Water Portal
Financial sustainability of schemes managed by PHED in Punjab_Krishnakumar Th...Financial sustainability of schemes managed by PHED in Punjab_Krishnakumar Th...
Financial sustainability of schemes managed by PHED in Punjab_Krishnakumar Th...
India Water Portal6 visualizações
University of Cambridge: COP28 briefing por Energy for One World

Estermann Wikidata and Heritage Data 20170914

  • 1. Wikidata & Heritage Data: Where do we stand? What’s next? Lausanne, 14 September 2017. Image: Sijie Dai, Captain Alving – Prix de Lausanne 2010. Photo by Inisheer, CC BY-SA (Wikimedia Commons). Unless otherwise noted, the content of this presentation is made available under the CC BY 4.0 license.
  • 2. Aim and Vision (WikiProject Cultural Heritage)
      ▶ The aim of this project is to coordinate, facilitate and promote the ingestion of cultural heritage related data into Wikidata, to facilitate the cleansing and enhancement of this data and to promote its use across Wikipedia, its sister projects and beyond.
      ▶ It is our vision to establish Wikidata as a central hub for data integration, data enhancement, and data management in the heritage domain.
  • 3. Vision (Blog posts: Stinson et al. 2016; Thornton / Cochrane 2016; Poulter 2017)
      ▶ Establish Wikidata as a database that covers the entire world’s cultural heritage.
      ▶ Establish Wikidata as a central hub that interlinks GLAM collections around the world and provides links to bibliographic, genealogical, scientific and other collections of information; create the ultimate authority file.
      ▶ Foster truly multilingual and global collaboration among people from various backgrounds.
      ▶ Leverage synergies between institutions, reduce duplicate work.
      ▶ Encourage debate in the community by highlighting and interrogating differences in perspective.
      ▶ Provide a single source of data for some of the most popular web sites and apps, including Wikipedia infoboxes and lists.
  • 6. Core Aspects of the Project (diagram). Labels: Data Ingestion; Data Provision; Ontology Development; Data Maintenance; Data Use; Community & Collaboration; Platforms & Tools; Wikidata Within the Wider Data Landscape; Core Processes.
  • 7. How To Get Access to Freely Licensed Data?
      ▶ Wikidata needs to be explained to institutions with a view to data donations.
          • Lack of awareness of the importance of open licenses for databases
          • Fears of loss of control related to publishing data under CC-0
          • What can institutions gain from their involvement in Wikidata?
      ▶ Community members need assistance with scraping data from websites.
      ▶ Present coverage is biased: it is highest for Western Europe and North America. How can data from other world regions be accessed?
  • 8. Data Provision – Which Datasets are Useful?
      ▶ http://make.opendata.ch/wiki/data:glam_ch
          • Personnalités Vaudoises (BCUL)
          • Swiss Photography Metadata (Büro für Fotografiegeschichte)
          • Artist data from the SIKART Lexicon on art in Switzerland (SIK-ISEA)
          • Metadata of the Historical Dictionary of Switzerland (HLS)
          • PCP Inventory (Federal Office for Civil Protection)
          • Inventory of Historical Monuments (Canton of Zurich)
          • Inventory of Historical Monuments (City of Zurich)
          • Inventory of classified Gardens and Parks (City of Zurich)
          • Art in the Urban Space (City of Zurich)
          • Swiss GLAM Inventory (OpenGLAM)
          • Inventory of Research Libraries in Switzerland (Swissbib)
          • ISplus Swiss (G)LAM Inventory (Swiss National Library)
          • Schauspielhaus Zürich Repertoire of Theatre and other Productions, 1938–1968
          • Swiss Theatre Metadata (Swiss Theatre Collection)
          • Plazi TreatmentBank (repository of the world's species) (Plazi.org)
          • Historical Statistics of Switzerland (University of Zurich)
  • 9. Challenges Related to Ontology Development (1/2) All rights reserved.
  • 13. Challenges Related to Ontology Development (2/2)
      ▶ Coping with the Bazaar:
          • Sometimes volunteers change property definitions too easily.
          • There is a rigorous process for creating new properties, but not for changing definitions of properties or creating new classes.
          • No master language; how to keep translations of definitions in sync?
          • Sometimes different approaches are used to model the same thing.
      ▶ What are good design principles?
          • Re-usability of properties across various domains
          • Select high-priority areas first; do not try to solve everything overnight for the entire cultural heritage domain
          • …
      ▶ Finding a balance between:
          • The expressive power of an ontology
          • Its practicability when it comes to large-scale use by many people
          • Its queryability (usability from the perspective of data users)
  • 14. Challenges Related to Data Ingestion
      ▶ Mapping Between Data Models
          • Getting an overview of appropriate properties and classes can be a time-consuming exercise.
          • Creating new properties requires community agreement and may involve lengthy discussions and compromises.
          • There is still a lot of work to be done in the area of typologies and thesauri [Example]
      ▶ Matching Items / Disambiguation
          • There are tools like Mix’n’Match and OpenRefine to support this, but it remains a major challenge, especially with datasets that have not resolved this issue internally.
      ▶ Incorrect / Incoherent Data on Wikidata
          • Many data ingestion projects require cleaning up existing data.
      ▶ Repeated Ingestion / Updates
          • How to approach the historicization of data?
          • How to set up processes to regularly update data?
      N.B.: We are not filling a void or starting from scratch, but contributing to an existing ecosystem of data, data models, and community members!
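The matching and disambiguation step can be illustrated with a minimal sketch. It does not reproduce how Mix’n’Match or OpenRefine work internally; it only ranks candidate labels by string similarity with Python's standard `difflib`. The function names, the threshold of 0.9, the institution labels and the `Q_PLACEHOLDER_*` identifiers are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def normalize(label):
    """Lower-case and collapse whitespace so trivial differences do not matter."""
    return " ".join(label.lower().split())


def similarity(a, b):
    """Similarity between two labels, from 0.0 (nothing shared) to 1.0 (identical)."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def match_candidates(name, candidates, threshold=0.9):
    """Return (item_id, label, score) tuples scoring at or above threshold, best first."""
    scored = [(qid, label, similarity(name, label)) for qid, label in candidates.items()]
    hits = [entry for entry in scored if entry[2] >= threshold]
    return sorted(hits, key=lambda entry: entry[2], reverse=True)


def classify(name, candidates, threshold=0.9):
    """Sort a local record into 'new', 'match' or 'ambiguous' (needs human review)."""
    hits = match_candidates(name, candidates, threshold)
    if not hits:
        return ("new", hits)
    if len(hits) == 1:
        return ("match", hits)
    return ("ambiguous", hits)


# Hypothetical candidate items; the identifiers are placeholders, not real Q-ids.
CANDIDATES = {
    "Q_PLACEHOLDER_1": "Kunstmuseum Bern",
    "Q_PLACEHOLDER_2": "Kunstmuseum Basel",
    "Q_PLACEHOLDER_3": "Historisches Museum Bern",
}

if __name__ == "__main__":
    for record in ["kunstmuseum  bern", "Zentrum Paul Klee"]:
        status, hits = classify(record, CANDIDATES)
        print(record, "->", status, [qid for qid, _, _ in hits])
```

Lowering the threshold produces more "ambiguous" results, which is the case where a dataset that has not resolved duplicates internally still requires manual review.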
  • 17. Challenges Related to Data Maintenance
      ▶ Establishing and Documenting Data Quality
          • Getting rid of duplicates
          • Dealing with incorrect and inconsistent data
          • How to monitor data quality and data completeness?
      ▶ Building a Network of Trust
          • Linking all statements to a reliable source
          • In the future: “Signed Statements”
      ▶ Data Exchange Between Wikidata and Primary Databases
      ▶ Data synchronization: How to keep data mutually up to date?
      ▶ How to make it easier for GLAM employees to follow changes/improvements to their data on Wikidata?
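One possible way to approach the monitoring of completeness and duplicates is sketched below. It assumes the items have already been exported into a plain dictionary; the keys `country`, `coordinates`, `website` and `inventory_id` are hypothetical stand-ins for Wikidata properties, and the item names and values are invented sample data.

```python
from collections import defaultdict


def completeness(items, expected_properties):
    """Share of items (0.0 to 1.0) that have at least one value per expected property."""
    total = len(items)
    report = {}
    for prop in expected_properties:
        filled = sum(1 for claims in items.values() if claims.get(prop))
        report[prop] = filled / total if total else 0.0
    return report


def possible_duplicates(items, id_property):
    """Group items that share a value of an identifier that should be unique."""
    by_value = defaultdict(list)
    for item_id, claims in items.items():
        for value in claims.get(id_property, []):
            by_value[value].append(item_id)
    return {value: sorted(ids) for value, ids in by_value.items() if len(ids) > 1}


# Invented sample data: item -> property -> list of values.
ITEMS = {
    "item_a": {"country": ["CH"], "coordinates": ["46.95,7.44"], "inventory_id": ["INV-1"]},
    "item_b": {"country": ["CH"], "coordinates": [], "inventory_id": ["INV-2"]},
    "item_c": {"country": ["CH"], "coordinates": ["47.37,8.54"], "inventory_id": ["INV-1"]},
}

if __name__ == "__main__":
    print(completeness(ITEMS, ["country", "coordinates", "website"]))
    print(possible_duplicates(ITEMS, "inventory_id"))
```

Tracking these figures over time would show whether an ingested dataset is becoming more complete or accumulating duplicates.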
  • 18. Challenges Related to Data Use
      ▶ Chicken and Egg Problem:
          • Data usage drives data quality & completeness
          • Data quality & completeness are prerequisites of data use
  • 20. Wikidata and the Wider Data Landscape
      ▶ Linking Wikidata with other databases
          • Map existing standards from the GLAM sector to Wikidata
          • Merge data imported from Wikipedia with data from reliable databases
      ▶ In what areas is Wikidata supposed to…
          • serve as the master database (referencing sources other than databases)?
          • hold data imported from reliable databases?
          • link to authoritative databases (without holding the actual data)?
      ▶ How should GLAMs organize their relationship with Wikidata?
          • Provide mutual links?
          • Ingest part or all of their data into Wikidata?
          • Synchronize part or all of their data with Wikidata?
          • Use Wikidata as their main database?
  • 21. Community & Collaboration
      ▶ How to improve guidelines, community structures, reporting etc. in order to be able to involve more GLAM personnel in Wikidata?
      ▶ How best to foster a shared data modelling practice in various areas? (Need for more modelling showcases, coordination, etc.)
      ▶ Need for training and tools (to facilitate the accomplishment of certain tasks).
      ▶ The evolving tools landscape constitutes a challenge when establishing processes and working with guidelines.
      ▶ https://www.wikidata.org/wiki/Wikidata:WikiProject_Cultural_heritage
      ▶ Wikidata + GLAM Facebook Group
  • 22. Useful Tools
      ▶ Example: Tools I used for the ingest of the Swiss GLAM Inventory:
          • Microsoft Excel / Open Office Calc
          • Wikidata Query Service
          • OpenRefine
          • Reconcile-csv
          • Listeria
          • QuickStatements
          • Microsoft Word / Excel (mail merge)
          • Hatnote: «Listen to Wikipedia»
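As an illustration of how spreadsheet rows might be turned into input for QuickStatements, here is a minimal sketch. It assumes the tab-separated version 1 command syntax (`CREATE`, then `LAST` lines for the newly created item); the tool's own help page remains the authoritative reference. The function names and sample rows are hypothetical, and the identifiers used (P31 "instance of", P17 "country", Q33506 "museum", Q39 "Switzerland") should be verified on Wikidata before any real batch is run.

```python
def quickstatements_for_new_item(label_en, instance_of, country):
    """Build V1-style commands that create one item with a label and two statements.

    Assumption: labels contain no double quotes or tabs; real data would need
    to be cleaned first.
    """
    lines = [
        "CREATE",
        f'LAST\tLen\t"{label_en}"',      # English label
        f"LAST\tP31\t{instance_of}",     # instance of
        f"LAST\tP17\t{country}",         # country
    ]
    return "\n".join(lines)


def quickstatements_batch(rows):
    """Concatenate the commands for several rows into one batch text."""
    return "\n".join(
        quickstatements_for_new_item(row["label"], row["type"], row["country"])
        for row in rows
    )


# Hypothetical rows, e.g. exported from a spreadsheet.
ROWS = [
    {"label": "Example Museum A", "type": "Q33506", "country": "Q39"},
    {"label": "Example Museum B", "type": "Q33506", "country": "Q39"},
]

if __name__ == "__main__":
    print(quickstatements_batch(ROWS))
```

Generating the commands from a script instead of a mail merge keeps the batch reproducible, but the output should still be reviewed manually before submission.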
  • 23. Tools – Wishlist
      ▶ Diff tools to help track changes in datasets on Wikidata and to synchronize with external databases
      ▶ Statistics tools (data completeness; data use)
      ▶ Data visualization tools (beyond what the Query service can already do)
      ▶ Data tracking tools (data completeness; see how data evolves)
      ▶ Improved version of the QuickStatements tool (see feature requests)
      ▶ Customizable forms for manual data entry
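The first wishlist item, a diff tool, can be sketched in a few lines. This is only an illustration of the idea, assuming both the external database and a Wikidata export have been reduced to dictionaries keyed by a shared identifier; the function name, the record fields and the sample values are hypothetical.

```python
def diff_snapshots(old, new):
    """Compare two snapshots keyed by a shared identifier.

    Returns records that were added, removed, or whose content changed,
    so that each side can decide which changes to take over.
    """
    added = {key: new[key] for key in new.keys() - old.keys()}
    removed = {key: old[key] for key in old.keys() - new.keys()}
    changed = {
        key: {"old": old[key], "new": new[key]}
        for key in old.keys() & new.keys()
        if old[key] != new[key]
    }
    return {"added": added, "removed": removed, "changed": changed}


# Invented sample data: identifier -> record.
SNAPSHOT_OLD = {
    "INV-1": {"name": "Example Archive", "city": "Bern"},
    "INV-2": {"name": "Example Library", "city": "Zurich"},
}
SNAPSHOT_NEW = {
    "INV-1": {"name": "Example Archive", "city": "Berne"},
    "INV-3": {"name": "Example Museum", "city": "Lausanne"},
}

if __name__ == "__main__":
    result = diff_snapshots(SNAPSHOT_OLD, SNAPSHOT_NEW)
    for kind, records in result.items():
        print(kind, sorted(records))
```

The same comparison can be run in both directions, which is one way to keep the data on Wikidata and in the primary database mutually up to date.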
  • 24. Thank You for Your Attention! Contact: Beat Estermann, Bern University of Applied Sciences, beat.estermann@bfh.ch, +41 31 848 34 38