SlideShare uma empresa Scribd logo
1 de 32
Baixar para ler offline
Wikidata as a hub to library
linked data re-use
Jim Hahn
Head of Metadata Research, University of Pennsylvania
Objectives
Attendees will ….
● gain an appreciation of data re-use as applied to linked data in order to
advance discovery enhancement projects at their institution;
● understand how Wikidata is utilized for enriching and distributing linked data
on the web in order to make use of Wikidata as a persistent structured data
source;
● become aware of a Penn pilot project for Penn Faculty visibility on Wikidata
and associated graph networks.
Defining Data Re-use
From Carlson & Anderson (2007, pp. 635) in the Journal of Computer Mediated
Communication:
Investments in e-science technologies are motivated by a multiplicity of factors.
First is the urgency to manage the increasingly large quantities of complex data
produced by digital technologies and digitally enabled science. ‘‘Deluge,’’ ‘‘waves,’’
and ‘‘knowledge overload’’ are some of the terms used to describe the situation
(Hey & Trefethen,2003). Another related factor is the concern of funding
bodies to ‘‘repurpose’’ their investments in data to avoid what is, in turn,
termed ‘‘data tombs in mono-disciplinary silos’’ and to see a maximum
return on their investments.
Data Re-use
In What are data? The many kinds of data and their implications for data re-use.
Samuelle Carlson and Ben Anderson (2007), compile several case studies to
illustrate the “life-stages” of data, including:
● Data Collection;
● Data Formatting;
● Data Release;
● Data Re-Use.
Significant findings for data re-use suggest challenges ahead -- namely that there
are a variety of data practices and assumptions across the disciplines studied..
Importance of Data Re-use
Wikidata practices and Library data practices are not necessarily incompatible, but
must be negotiated for understanding and importantly, context.
Wikidata itself is a more contemporary project with less legacy data problems than
libraries.
Library data are wonderfully expressive, unique, and complex. Library data were
created as strings, and only very recently has entity based classification and
cataloging processes started.
Wikidata
Why Wikidata
OCLC made use of the infrastructure (Wikibase) for Project Passage is continuing
to expand upon use Wikibase for the Mellon Funded Entity Management Project.
There is an opportunity to re-use the Wikidata properties for ongoing Linked Data
Research at the Libraries Wibase in particular offers several advantages, including
the following:
● local control over linked data from several disparate projects in linked data
● Creating links among UPenn Scholarship and the broader web of linked data.
Recent studies have pointed to the importance of Wikimedia content to
search engines/discovery on the web writ large (Vincent & Hecht, 2020).
Wikimedia content in recommenders...
● Recommender systems development to promote library content (Tsuji, 2019).
○ Linked data offers some advantages over big data for providing personalization/recommender
services (Campbell & Cowan, 2016); e.g. recommendations are based on structured
knowledge not based on personal data mining.
How Libraries make use of Wikidata
Libraries have made a series of contributions to Wikidata…
● LD4 (Linked Data 4 Production) in Particular has engaged a Wikidata Affinity
Group
● OCLC Entity Management Project
● Share-VDE
● Library of Congress NACO (Name Authority Cooperative Program)
Wikidata Examples in LD4
https://www.wikidata.org/wiki/Wikidata:WikiProject_Linked_Data_for_Production/Practical_Wikidata_for_Librarians
Wikidata Example in OCLC
https://www.oclc.org/en/worldcat/oclc-and-linked-data.html
Wikidata Example in Share-VDE
https://www.wikidata.org/wiki/Property:P6329
Share-VDE Wikidata Utilization (Possemato, 2018)
https://www.loc.gov/bibframe/news/pdf/share-vde-alaal2018.pdf
Wikidata Utilization at Library of Congress
https://id.loc.gov/search/?q=wikidata
Vanderbot
https://www.wikidata.org/wiki/User:VanderBot
Vanderbilt Faculty additions from Vanderbot
Penn case study: faculty visibility
Problem Statement
Penn Libraries would like to foster the discovery of Penn scholarship on the web.
Most search engines will crawl Wikidata for incorporation of structured data onto
their search results.
Most search engines will now create Knowledge Panels for authors, agents, and
works. Current inventory keeping tools are not crawled by search engines and this
is a problem area for supporting visibility of Penn faculty.
Knowledge Panel Example
Wikidata Processing at Penn
Begin by integrating school level structured data into Wikidata:
https://www.wikidata.org/wiki/Q7896091
Wikidata Processing at Penn
Add Department Level Structured Data using the “part of” property for associating
with school...
https://www.wikidata.org/wiki/Q89100047
Wikidata Processing at Penn
Add Wikidata for Faculty...
https://www.wikidata.org/wiki/Q6127558
Faculty IDs
https://www.wikidata.org/wiki/Q6127558
Faculty Works
https://www.wikidata.org/wiki/Q6127558
Faculty Data Re-use in Wikidata
For faculty pages we ….
● Add Faculty IDs if available: VIAF ID, ISNI, Library of Congress authority ID,
Share-VDE author ID, WorldCat Identities ID
● Associate Faculty with Department using "member of (P463)" property
● Associate Publications to faculty
For non-existing works, we created work pages and add the "author" property
linked with Q number for author.
Scholia page re-using structured data in Wikidata
https://scholia.toolforge.org/organization/Q89100047
Penn researcher profile re-using structured Wikidata
https://scholia.toolforge.org/author/Q6127558
Reasonator Panel
https://reasonator.toolforge.org/?q=Q6127558
Next Steps
Program for Cooperative Cataloging (PCC) Pilot
Charge: The Wikidata Working group will lead Penn participation in the PCC Pilot
Project for Identity Management in Wikidata. Penn's initial focus will be to leverage
Online Books/Back Files serials to the PCC Wikidata objectives.
Activities: For PCC Pilot - We are making sure that serial issues in Penn Libraries
Deep Backfiles have Wikidata entries that clearly identify them and distinguish
them from other serials.
Resources
Campbell, D. G., & Cowan, S. R. (2016). The Paradox of Privacy: Revisiting a Core Library Value in an Age of Big Data and
Linked Data. Library Trends, 64(3), 492–511. https://doi.org/10.1353/lib.2016.0006
Carlson, S. & Anderson, B. (2007). What are data? The many kinds of data and their implications for data re-use. Journal of
Computer-Mediated Communication, 12, 635-651.
DOI: 10.1111/j.1083-6101.2007.00342.x
Possemato, T. (2018). From MARC to BIBFRAME in the SHARE-VDE project. ALA Annual Meeting.
https://www.loc.gov/bibframe/news/pdf/share-vde-alaal2018.pdf
Tsuji, K. (2019). Book Recommender System for Wikipedia Article Readers in a University Library.
8th International Congress on Advanced Applied Informatics (IIAI-AAI), 121–126.
https://doi.org/10.1109/IIAI-AAI.2019.00034
Vincent, N., & Hecht, B. (2020). A Deeper Investigation of the Importance of Wikipedia Links to the Success of Search
Engines. https://arxiv.org/abs/2004.10265
Suggested Reading
Experimentations with Wikidata/Wikibase. Hanging Together: The OCLC Research Blog.
https://hangingtogether.org/?p=8002
Vanderbot: a python script for writing to Wikidata: https://baskauf.blogspot.com/2020/02/vanderbot-python-script-for-writing-to.html

Mais conteúdo relacionado

Mais procurados

Analysis of open health data quality using data object-driven approach to dat...
Analysis of open health data quality using data object-driven approach to dat...Analysis of open health data quality using data object-driven approach to dat...
Analysis of open health data quality using data object-driven approach to dat...
Anastasija Nikiforova
 

Mais procurados (20)

Open Data and Library Services
Open Data and Library Services  Open Data and Library Services
Open Data and Library Services
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate Data
 
Developing and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesDeveloping and assessing FAIR digital resources
Developing and assessing FAIR digital resources
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR Metrics
 
Analysis of open health data quality using data object-driven approach to dat...
Analysis of open health data quality using data object-driven approach to dat...Analysis of open health data quality using data object-driven approach to dat...
Analysis of open health data quality using data object-driven approach to dat...
 
Evaluating FAIRness
Evaluating FAIRnessEvaluating FAIRness
Evaluating FAIRness
 
IoTSE-based Open Database Vulnerability inspection in three Baltic Countries:...
IoTSE-based Open Database Vulnerability inspection in three Baltic Countries:...IoTSE-based Open Database Vulnerability inspection in three Baltic Countries:...
IoTSE-based Open Database Vulnerability inspection in three Baltic Countries:...
 
Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH Perspective
 
A SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIHA SWOT Analysis of Data Science @ NIH
A SWOT Analysis of Data Science @ NIH
 
Are we FAIR yet?
Are we FAIR yet?Are we FAIR yet?
Are we FAIR yet?
 
dkNET ESP Meeting - February 2016
dkNET ESP Meeting - February 2016dkNET ESP Meeting - February 2016
dkNET ESP Meeting - February 2016
 
Open Data in a Global Ecosystem
Open Data in a Global EcosystemOpen Data in a Global Ecosystem
Open Data in a Global Ecosystem
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?
 
Ziegler Open Data in Special Collections Libraries
Ziegler Open Data in Special Collections LibrariesZiegler Open Data in Special Collections Libraries
Ziegler Open Data in Special Collections Libraries
 
Data Analytics
Data AnalyticsData Analytics
Data Analytics
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?
 
Knowledge Graph Maintenance
Knowledge Graph MaintenanceKnowledge Graph Maintenance
Knowledge Graph Maintenance
 
Brief on Linked Data at U.S. EPA to Chief Data Scientist
Brief on Linked Data at U.S. EPA to Chief Data ScientistBrief on Linked Data at U.S. EPA to Chief Data Scientist
Brief on Linked Data at U.S. EPA to Chief Data Scientist
 
US EPA Resource Conservation and Recovery Act published as Linked Open Data
US EPA Resource Conservation and Recovery Act published as Linked Open DataUS EPA Resource Conservation and Recovery Act published as Linked Open Data
US EPA Resource Conservation and Recovery Act published as Linked Open Data
 

Semelhante a Hahn "Wikidata as a hub to library linked data re-use"

Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
University of California Curation Center
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
Lucy McKenna
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
OCLC
 

Semelhante a Hahn "Wikidata as a hub to library linked data re-use" (20)

Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless Opportunity
 
Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...
Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...
Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
 
Researcher Reliance on Digital Libraries: A Descriptive Analysis
Researcher Reliance on Digital Libraries: A Descriptive AnalysisResearcher Reliance on Digital Libraries: A Descriptive Analysis
Researcher Reliance on Digital Libraries: A Descriptive Analysis
 
Linked Data: Why Bother?
Linked Data:  Why Bother?Linked Data:  Why Bother?
Linked Data: Why Bother?
 
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
 
Wikidata Conference 2019 GLAM Panel - 20191025
Wikidata Conference 2019 GLAM Panel - 20191025Wikidata Conference 2019 GLAM Panel - 20191025
Wikidata Conference 2019 GLAM Panel - 20191025
 
Linked data and semantic wikis
Linked data and semantic wikisLinked data and semantic wikis
Linked data and semantic wikis
 
An introduction to the Wikidata Thesis Toolkit / Helen Williams (London Schoo...
An introduction to the Wikidata Thesis Toolkit / Helen Williams (London Schoo...An introduction to the Wikidata Thesis Toolkit / Helen Williams (London Schoo...
An introduction to the Wikidata Thesis Toolkit / Helen Williams (London Schoo...
 
Wusteman Ticer09
Wusteman Ticer09Wusteman Ticer09
Wusteman Ticer09
 
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
 
Data Library Services In The Data Stewardship Lifecycle
Data Library Services In The Data Stewardship LifecycleData Library Services In The Data Stewardship Lifecycle
Data Library Services In The Data Stewardship Lifecycle
 
Collaborative Data Management at the University of California
Collaborative Data Management at the University of CaliforniaCollaborative Data Management at the University of California
Collaborative Data Management at the University of California
 
Edinburgh DataShare: Tackling research data in a DSpace institutional repository
Edinburgh DataShare: Tackling research data in a DSpace institutional repositoryEdinburgh DataShare: Tackling research data in a DSpace institutional repository
Edinburgh DataShare: Tackling research data in a DSpace institutional repository
 
dkNET Annual Meeting - June 2017
dkNET Annual Meeting - June 2017dkNET Annual Meeting - June 2017
dkNET Annual Meeting - June 2017
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
 
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021
dkNET Webinar: dkNET Hypothesis Center Live Demo 09/24/2021
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
 
Metadata as Linked Data for Research Data Repositories
Metadata as Linked Data for Research Data RepositoriesMetadata as Linked Data for Research Data Repositories
Metadata as Linked Data for Research Data Repositories
 

Mais de National Information Standards Organization (NISO)

Mais de National Information Standards Organization (NISO) (20)

Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
 
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
 
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
 
Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"
 
Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"
 
Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"
 

Último

Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 

Último (20)

Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 

Hahn "Wikidata as a hub to library linked data re-use"

  • 1. Wikidata as a hub to library linked data re-use Jim Hahn Head of Metadata Research, University of Pennsylvania
  • 2. Objectives Attendees will …. ● gain an appreciation of data re-use as applied to linked data in order to advance discovery enhancement projects at their institution; ● understand how Wikidata is utilized for enriching and distributing linked data on the web in order to make use of Wikidata as a persistent structured data source; ● become aware of a Penn pilot project for Penn Faculty visibility on Wikidata and associated graph networks.
  • 3. Defining Data Re-use From Carlson & Anderson (2007, pp. 635) in the Journal of Computer Mediated Communication: Investments in e-science technologies are motivated by a multiplicity of factors. First is the urgency to manage the increasingly large quantities of complex data produced by digital technologies and digitally enabled science. ‘‘Deluge,’’ ‘‘waves,’’ and ‘‘knowledge overload’’ are some of the terms used to describe the situation (Hey & Trefethen,2003). Another related factor is the concern of funding bodies to ‘‘repurpose’’ their investments in data to avoid what is, in turn, termed ‘‘data tombs in mono-disciplinary silos’’ and to see a maximum return on their investments.
  • 4. Data Re-use In What are data? The many kinds of data and their implications for data re-use. Samuelle Carlson and Ben Anderson (2007), compile several case studies to illustrate the “life-stages” of data, including: ● Data Collection; ● Data Formatting; ● Data Release; ● Data Re-Use. Significant findings for data re-use suggest challenges ahead -- namely that there are a variety of data practices and assumptions across the disciplines studied..
  • 5. Importance of Data Re-use Wikidata practices and Library data practices are not necessarily incompatible, but must be negotiated for understanding and importantly, context. Wikidata itself is a more contemporary project with less legacy data problems than libraries. Library data are wonderfully expressive, unique, and complex. Library data were created as strings, and only very recently has entity based classification and cataloging processes started.
  • 7. Why Wikidata OCLC made use of the infrastructure (Wikibase) for Project Passage is continuing to expand upon use Wikibase for the Mellon Funded Entity Management Project. There is an opportunity to re-use the Wikidata properties for ongoing Linked Data Research at the Libraries Wibase in particular offers several advantages, including the following: ● local control over linked data from several disparate projects in linked data ● Creating links among UPenn Scholarship and the broader web of linked data. Recent studies have pointed to the importance of Wikimedia content to search engines/discovery on the web writ large (Vincent & Hecht, 2020).
  • 8. Wikimedia content in recommenders... ● Recommender systems development to promote library content (Tsuji, 2019). ○ Linked data offers some advantages over big data for providing personalization/recommender services (Campbell & Cowan, 2016); e.g. recommendations are based on structured knowledge not based on personal data mining.
  • 9. How Libraries make use of Wikidata Libraries have made a series of contributions to Wikidata… ● LD4 (Linked Data 4 Production) in Particular has engaged a Wikidata Affinity Group ● OCLC Entity Management Project ● Share-VDE ● Library of Congress NACO (Name Authority Cooperative Program)
  • 10. Wikidata Examples in LD4 https://www.wikidata.org/wiki/Wikidata:WikiProject_Linked_Data_for_Production/Practical_Wikidata_for_Librarians
  • 11. Wikidata Example in OCLC https://www.oclc.org/en/worldcat/oclc-and-linked-data.html
  • 12. Wikidata Example in Share-VDE https://www.wikidata.org/wiki/Property:P6329
  • 13. Share-VDE Wikidata Utilization (Possemato, 2018) https://www.loc.gov/bibframe/news/pdf/share-vde-alaal2018.pdf
  • 14. Wikidata Utilization at Library of Congress https://id.loc.gov/search/?q=wikidata
  • 17. Penn case study: faculty visibility
  • 18. Problem Statement Penn Libraries would like to foster the discovery of Penn scholarship on the web. Most search engines will crawl Wikidata for incorporation of structured data onto their search results. Most search engines will now create Knowledge Panels for authors, agents, and works. Current inventory keeping tools are not crawled by search engines and this is a problem area for supporting visibility of Penn faculty.
  • 20. Wikidata Processing at Penn Begin by integrating school level structured data into Wikidata: https://www.wikidata.org/wiki/Q7896091
  • 21. Wikidata Processing at Penn Add Department Level Structured Data using the “part of” property for associating with school... https://www.wikidata.org/wiki/Q89100047
  • 22. Wikidata Processing at Penn Add Wikidata for Faculty... https://www.wikidata.org/wiki/Q6127558
  • 25. Faculty Data Re-use in Wikidata For faculty pages we …. ● Add Faculty IDs if available: VIAF ID, ISNI, Library of Congress authority ID, Share-VDE author ID, WorldCat Identities ID ● Associate Faculty with Department using "member of (P463)" property ● Associate Publications to faculty For non-existing works, we created work pages and add the "author" property linked with Q number for author.
  • 26. Scholia page re-using structured data in Wikidata https://scholia.toolforge.org/organization/Q89100047
  • 27. Penn researcher profile re-using structured Wikidata https://scholia.toolforge.org/author/Q6127558
  • 30. Program for Cooperative Cataloging (PCC) Pilot Charge: The Wikidata Working group will lead Penn participation in the PCC Pilot Project for Identity Management in Wikidata. Penn's initial focus will be to leverage Online Books/Back Files serials to the PCC Wikidata objectives. Activities: For PCC Pilot - We are making sure that serial issues in Penn Libraries Deep Backfiles have Wikidata entries that clearly identify them and distinguish them from other serials.
  • 31. Resources Campbell, D. G., & Cowan, S. R. (2016). The Paradox of Privacy: Revisiting a Core Library Value in an Age of Big Data and Linked Data. Library Trends, 64(3), 492–511. https://doi.org/10.1353/lib.2016.0006 Carlson, S. & Anderson, B. (2007). What are data? The many kinds of data and their implications for data re-use. Journal of Computer-Mediated Communication, 12, 635-651. DOI: 10.1111/j.1083-6101.2007.00342.x Possemato, T. (2018). From MARC to BIBFRAME in the SHARE-VDE project. ALA Annual Meeting. https://www.loc.gov/bibframe/news/pdf/share-vde-alaal2018.pdf Tsuji, K. (2019). Book Recommender System for Wikipedia Article Readers in a University Library. 8th International Congress on Advanced Applied Informatics (IIAI-AAI), 121–126. https://doi.org/10.1109/IIAI-AAI.2019.00034 Vincent, N., & Hecht, B. (2020). A Deeper Investigation of the Importance of Wikipedia Links to the Success of Search Engines. https://arxiv.org/abs/2004.10265
  • 32. Suggested Reading Experimentations with Wikidata/Wikibase. Hanging Together: The OCLC Research Blog. https://hangingtogether.org/?p=8002 Vanderbot: a python script for writing to Wikidata: https://baskauf.blogspot.com/2020/02/vanderbot-python-script-for-writing-to.html