SlideShare a Scribd company logo
1 of 45
The Future of Metadata
Management &
Making Library Collections
Discoverable on the Web
Ted Fons, OCLC
The National Library Descriptors Conference
Proposed Changes in the Development of Library Collections in the Era of the Semantic Web
Warsaw - 21-22 April, 2015
Cataloging
Harvesting
Datamining
The Future
Our Goal
The Goal
S.R. Ranganathan
1. Books are for use.
2. Every reader his [or her] book.
3. Every book its reader.
4. Save the time of the reader.
5. The library is a growing
organism.
Image credit: http://static.guim.co.uk/sys-
images/Guardian/Pix/pictures/2009/3/23/1237806064989/Young-man-
Connect the reader to content.
Cataloging
How We Work Today
Local
Group
Global
We catalog:
• Books
• Music
• Journal titles
• Authorities
What is in a Global Discovery System
Readers want:
• eBooks
• Articles
• Unique content
We catalog:
• Books
• Music
• Journal titles
• Authorities
• Calhoun: “Metadata has changed as collections have
changed. It remains important, but it comes in many
forms and from many sources. The centrality of
bibliographic control has been disrupted.” P. 15.
• And: “There is less need and place for traditional
bibliographic control as a set of methods for providing
[metadata] for discovery, access and management of
the content of mainstream books and serials. “p. 24.
Catalogue 2.0 by Karen Calhoun
“Ken Chad examines the distinction betweeen
redundant cataloging (re-editing records to suit local
practices) and redundant catalogs [in the UK]. He
enumerates the benefits of moving from … 160
standalone catalogues to a single shared catalogue at
the network level for all of these libraries”
Karen Calhoun in Catalogue 2.0
Duplicating records for local purposes
The world’s libraries. Connected.
What is in a Global Discovery System
Readers want:
• eBooks
• Articles
• Unique content
We catalog:
• Books
• Music
• Journal titles
• Authorities
The world’s libraries. Connected.
The value of authorities FRAD Tasks
 Find
 Identify
 Clarify
 Contextualize
http://www.ifla.org/publications/functional-requirements-for-authority-data
What is in a Global Discovery System
So, what should we do?
1. Catalog unique materials
2. Create authorities
3. Use harvesting and data mining for everything else
Where?
Local
Group
Global
Catalog
• Unique Materials
• Create Authorities
Data Harvesting
What is in a Global Discovery System
Readers want:
• eBooks
• Articles
• Unique content
We catalog:
• Books
• Music
• Journal titles
• Authorities
Local
Group
Global
Data Mining & The Web
Local
Group
Global
The Web of …
Documents
Active Documents
Discovery
Data
Knowledge
☌☌☌
Libraries can
connect to the web
of knowledge
The Knowledge Graph
☌
Libraries can
connect to the web
of knowledge
Libraries can create
a knowledge graph
Documents
Entities
Establishing Semantic Identity
For Accurate Representation
on the Web
12/09/2014
Kenning Arlitsch
Dean of the Library
Kenning Arlitsch, Dean of the Library
Patrick OBrien, Semantic Web Research Director
The Point
Libraries are poorly defined and represented on
the Semantic Web…
…but we know how to fix that problem…
…mostly
Google’s Perception of MSU Lib - 2012
MSU Library - 2014
DBPedia entry - 2012
2014 DBpedia entry
2014 Dbpedia
entry
DBPedia entry - 2012
Summary
• Define library organization in Wikipedia
– Beware of *pedia culture and process
• Engage with other trusted data sources
– FreeBase
– Google Places/Google My Business
– Google+
• Mark-up metadata with Schema.org
The Knowledge Graph
☌
Libraries can
connect to the web
of knowledge
Libraries can create
a knowledge graph
Documents
Entities
person place
object concept
organization work
author
subjectitem
availability
The solution
starts here.
Thelibraryknowledgegraph
person place
object concept
organization work
Thelibraryknowledgegraph
http://www.ifla.org/publications/functional-requirements-for-bibliographic-records
FRBR Entities
 Work
 Expression
 Manifestation
 Item
Exampleofbenefits…
Discovery
The Name of the Rose
Summary: The year is 1327. Franciscans in a wealthy
Italian abbey are suspected of heresy, and Brother
William of Baskerville arrives to investigate. His
delicate mission is suddenly overshadowed by seven
bizarre deaths that take place in seven days and
nights of apocalyptic terror.
Subjects
Borrowing Options
eBooks | Printed Books | Audio Books
Other Languages
Monastic libraries -- Italy – Fiction | Semiotics -- Fiction
Example of Benefits: Web Exposure
data.BnF.fr
Number of Visits
-
1,000,000
2,000,000
3,000,000
4,000,000
5,000,000
6,000,000
7,000,000
8,000,000
January February March April May June July August September October
Visits to WorldCat
2012 2013 2014
Photo credit: http://media02.hongkiat.com/freebies-for-web-designers-2011/progress-bar.jpg
What has OCLC done?
How does data mining work?
The Data Strategy: WorldCat Entities
Work and Person Creation Process Flow
Extractors
Enhanced
WC
Records
Harvested
Triples
Refined
Triples
CreateWorkReducer
1. Harvest
3. Reduce
There are three components to the pipeline for creating
Work and Person entities. The harvest component
extracts the data from the different sources. The map
component identifies the objects and combines the triples
through name recognition and authority linkages. The
reduce component pulls together the entity descriptions
and writes them out to HBase.
VIAF
LCNAF
DBPedi
a
CreatePersonReduc
2. Map
ObjectMappe
r
PersonCombi
ne
WorkCombin
e
Datamining
• 197+ million Work descriptions and URIs
• Schema.org + BiblioGraph.net
• RDF Data formats
• RDF/XML, Turtle, Triples, JSON-LD
• Links to WorldCat manifestations
• Links to Dewey, LCSH, LCNAF, VIAF, FAST
• Open Data license via Linked Data Explorer
• 2015: Discovery API, Metadata API
• Released April 2014
http://www.oclc.org/dataThe Work Entity
• 98+ million Person descriptions and URIs
• Person entities with authority: 20.2 million
• Person entities without authority: 78.3 million
• Schema.org + BiblioGraph.net
• Harvested from WorldCat data and enriched from other hubs
RDF Data formats
• RDF/XML, Turtle, Triples, JSON-LD
• Links to WorldCat Works. Added links from WC Works.
• Open Data license via Linked Data Explorer
• 2015: Linked Data Explorer, Discovery API
http://www.oclc.org/dataThe Person Entity
person place
object concept
organization work
Thelibraryknowledgegraph
Local
Group
Global
Datamining
Harvesting
Cataloging
The Future
So, what should we do?
1.Catalog unique materials
2.Create authorities
3.Use harvesting and data mining for
everything else
Cataloging
Harvesting
Datamining
The Future
Discussion
Ted Fons
Executive Director, Data Services &
WorldCat Quality
fonst@oclc.org

More Related Content

What's hot

Environmental trends and OCLC Research, a presentation at the University of N...
Environmental trends and OCLC Research, a presentation at the University of N...Environmental trends and OCLC Research, a presentation at the University of N...
Environmental trends and OCLC Research, a presentation at the University of N...lisld
 
Rightscaling, engagement, learning: reconfiguring the library for a network e...
Rightscaling, engagement, learning: reconfiguring the library for a network e...Rightscaling, engagement, learning: reconfiguring the library for a network e...
Rightscaling, engagement, learning: reconfiguring the library for a network e...lisld
 
New Directions in Information Organization: A Linked Data Model with BIBFRAME
New Directions in Information Organization: A Linked Data Model with BIBFRAMENew Directions in Information Organization: A Linked Data Model with BIBFRAME
New Directions in Information Organization: A Linked Data Model with BIBFRAMESharonYang
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for DiscoveryOCLC
 
Linked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerLinked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerWiLS
 
Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...
Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...
Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...Allison Jai O'Dell
 
Life after MARC: Cataloging Tools of the Future
Life after MARC: Cataloging Tools of the FutureLife after MARC: Cataloging Tools of the Future
Life after MARC: Cataloging Tools of the FutureEmily Nimsakont
 
Virtual World, Virtual Reference
Virtual World, Virtual ReferenceVirtual World, Virtual Reference
Virtual World, Virtual ReferenceLorin Flores
 
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...Charleston Conference
 
Linked Data and Libraries: What? Why? How?
Linked Data and Libraries: What? Why? How?Linked Data and Libraries: What? Why? How?
Linked Data and Libraries: What? Why? How?Emily Nimsakont
 
What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? Debra Shapiro
 
Trends in Cataloging & Metadata
Trends in Cataloging & MetadataTrends in Cataloging & Metadata
Trends in Cataloging & MetadataDebra Shapiro
 
Rdf and open linked data a first approach
Rdf and open linked data a first approach Rdf and open linked data a first approach
Rdf and open linked data a first approach @CULT Srl
 
Forging New Links: Libraries in the Semantic Web
Forging New Links: Libraries in the Semantic WebForging New Links: Libraries in the Semantic Web
Forging New Links: Libraries in the Semantic WebGillian Byrne
 
Li 804 alphild dick
Li 804 alphild dickLi 804 alphild dick
Li 804 alphild dickAlphild Dick
 
INFO 653 Posters, Fall 2019
INFO 653 Posters, Fall 2019INFO 653 Posters, Fall 2019
INFO 653 Posters, Fall 2019PrattSILS
 
The facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environmentThe facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environmentlisld
 

What's hot (20)

Environmental trends and OCLC Research, a presentation at the University of N...
Environmental trends and OCLC Research, a presentation at the University of N...Environmental trends and OCLC Research, a presentation at the University of N...
Environmental trends and OCLC Research, a presentation at the University of N...
 
Rightscaling, engagement, learning: reconfiguring the library for a network e...
Rightscaling, engagement, learning: reconfiguring the library for a network e...Rightscaling, engagement, learning: reconfiguring the library for a network e...
Rightscaling, engagement, learning: reconfiguring the library for a network e...
 
New Directions in Information Organization: A Linked Data Model with BIBFRAME
New Directions in Information Organization: A Linked Data Model with BIBFRAMENew Directions in Information Organization: A Linked Data Model with BIBFRAME
New Directions in Information Organization: A Linked Data Model with BIBFRAME
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for Discovery
 
Linked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerLinked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve Meyer
 
Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...
Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...
Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...
 
Life after MARC: Cataloging Tools of the Future
Life after MARC: Cataloging Tools of the FutureLife after MARC: Cataloging Tools of the Future
Life after MARC: Cataloging Tools of the Future
 
Virtual World, Virtual Reference
Virtual World, Virtual ReferenceVirtual World, Virtual Reference
Virtual World, Virtual Reference
 
Stahmer-9-Jun15-final
Stahmer-9-Jun15-finalStahmer-9-Jun15-final
Stahmer-9-Jun15-final
 
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
 
Assessing the performance of RDF Engines: Discussing RDF Benchmarks
Assessing the performance of RDF Engines: Discussing RDF Benchmarks Assessing the performance of RDF Engines: Discussing RDF Benchmarks
Assessing the performance of RDF Engines: Discussing RDF Benchmarks
 
Linked Data and Libraries: What? Why? How?
Linked Data and Libraries: What? Why? How?Linked Data and Libraries: What? Why? How?
Linked Data and Libraries: What? Why? How?
 
What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection?
 
Trends in Cataloging & Metadata
Trends in Cataloging & MetadataTrends in Cataloging & Metadata
Trends in Cataloging & Metadata
 
Gonzalez-8-jun15
Gonzalez-8-jun15Gonzalez-8-jun15
Gonzalez-8-jun15
 
Rdf and open linked data a first approach
Rdf and open linked data a first approach Rdf and open linked data a first approach
Rdf and open linked data a first approach
 
Forging New Links: Libraries in the Semantic Web
Forging New Links: Libraries in the Semantic WebForging New Links: Libraries in the Semantic Web
Forging New Links: Libraries in the Semantic Web
 
Li 804 alphild dick
Li 804 alphild dickLi 804 alphild dick
Li 804 alphild dick
 
INFO 653 Posters, Fall 2019
INFO 653 Posters, Fall 2019INFO 653 Posters, Fall 2019
INFO 653 Posters, Fall 2019
 
The facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environmentThe facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environment
 

Similar to The Future of Metadata Management & Making Library Collections Discoverable on the Web

The Power of Sharing Linked Data - ELAG 2014 Workshop
The Power of Sharing Linked Data - ELAG 2014 WorkshopThe Power of Sharing Linked Data - ELAG 2014 Workshop
The Power of Sharing Linked Data - ELAG 2014 WorkshopRichard Wallis
 
Fuller Disclosure: Getting More Collections into the Network Flow
Fuller Disclosure: Getting More Collections into the Network FlowFuller Disclosure: Getting More Collections into the Network Flow
Fuller Disclosure: Getting More Collections into the Network Flowkramsey
 
Calhoun Rbms Rev June 2008
Calhoun Rbms Rev June 2008Calhoun Rbms Rev June 2008
Calhoun Rbms Rev June 2008Karen S Calhoun
 
Building the Bridge. Librarian as an Information Architect
Building the Bridge. Librarian as an Information ArchitectBuilding the Bridge. Librarian as an Information Architect
Building the Bridge. Librarian as an Information ArchitectStanislaw Skorka
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsJon Voss
 
Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008PrattSILS
 
Next Generation Technical Services May 2009 Calhoun
Next Generation Technical Services May 2009 CalhounNext Generation Technical Services May 2009 Calhoun
Next Generation Technical Services May 2009 CalhounKaren S Calhoun
 
Library as Place, Place as Library: Duality and the Power of Cooperation
Library as Place, Place as Library: Duality and the Power of CooperationLibrary as Place, Place as Library: Duality and the Power of Cooperation
Library as Place, Place as Library: Duality and the Power of CooperationKaren S Calhoun
 
Traveling Through Transitions Slovenia
Traveling Through Transitions SloveniaTraveling Through Transitions Slovenia
Traveling Through Transitions SloveniaKaren S Calhoun
 
Integrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery NetworkIntegrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery NetworkOCLC Research
 
OUR space: the new world of metadata
OUR space: the new world of metadataOUR space: the new world of metadata
OUR space: the new world of metadataKaren S Calhoun
 
Open Source ILS Add-Ons
Open Source ILS Add-OnsOpen Source ILS Add-Ons
Open Source ILS Add-Onsloriayre
 
Cultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data CollectionsCultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data Collectionslljohnston
 
WorldCat Local Illinois
WorldCat Local IllinoisWorldCat Local Illinois
WorldCat Local Illinoisltls
 
The Power of Sharing Linked Data: Giving the Web What It Wants
The Power of Sharing Linked Data: Giving the Web What It WantsThe Power of Sharing Linked Data: Giving the Web What It Wants
The Power of Sharing Linked Data: Giving the Web What It WantsNASIG
 
The Power of Sharing Linked Data (NASIG)
The Power of Sharing Linked Data (NASIG)The Power of Sharing Linked Data (NASIG)
The Power of Sharing Linked Data (NASIG)Richard Wallis
 
The Role of Discovery and its Relationship with the ILS
The Role of Discovery and its Relationship with the ILSThe Role of Discovery and its Relationship with the ILS
The Role of Discovery and its Relationship with the ILSCharleston Conference
 

Similar to The Future of Metadata Management & Making Library Collections Discoverable on the Web (20)

The Power of Sharing Linked Data - ELAG 2014 Workshop
The Power of Sharing Linked Data - ELAG 2014 WorkshopThe Power of Sharing Linked Data - ELAG 2014 Workshop
The Power of Sharing Linked Data - ELAG 2014 Workshop
 
Cataloging Presentation
Cataloging PresentationCataloging Presentation
Cataloging Presentation
 
Fuller Disclosure: Getting More Collections into the Network Flow
Fuller Disclosure: Getting More Collections into the Network FlowFuller Disclosure: Getting More Collections into the Network Flow
Fuller Disclosure: Getting More Collections into the Network Flow
 
Calhoun Rbms Rev June 2008
Calhoun Rbms Rev June 2008Calhoun Rbms Rev June 2008
Calhoun Rbms Rev June 2008
 
Building the Bridge. Librarian as an Information Architect
Building the Bridge. Librarian as an Information ArchitectBuilding the Bridge. Librarian as an Information Architect
Building the Bridge. Librarian as an Information Architect
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008
 
Next Generation Technical Services May 2009 Calhoun
Next Generation Technical Services May 2009 CalhounNext Generation Technical Services May 2009 Calhoun
Next Generation Technical Services May 2009 Calhoun
 
Library as Place, Place as Library: Duality and the Power of Cooperation
Library as Place, Place as Library: Duality and the Power of CooperationLibrary as Place, Place as Library: Duality and the Power of Cooperation
Library as Place, Place as Library: Duality and the Power of Cooperation
 
Traveling Through Transitions Slovenia
Traveling Through Transitions SloveniaTraveling Through Transitions Slovenia
Traveling Through Transitions Slovenia
 
Integrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery NetworkIntegrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery Network
 
OUR space: the new world of metadata
OUR space: the new world of metadataOUR space: the new world of metadata
OUR space: the new world of metadata
 
Sistema Compartit a l'ICOLC
Sistema Compartit a l'ICOLCSistema Compartit a l'ICOLC
Sistema Compartit a l'ICOLC
 
Open Source ILS Add-Ons
Open Source ILS Add-OnsOpen Source ILS Add-Ons
Open Source ILS Add-Ons
 
Cultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data CollectionsCultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data Collections
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
WorldCat Local Illinois
WorldCat Local IllinoisWorldCat Local Illinois
WorldCat Local Illinois
 
The Power of Sharing Linked Data: Giving the Web What It Wants
The Power of Sharing Linked Data: Giving the Web What It WantsThe Power of Sharing Linked Data: Giving the Web What It Wants
The Power of Sharing Linked Data: Giving the Web What It Wants
 
The Power of Sharing Linked Data (NASIG)
The Power of Sharing Linked Data (NASIG)The Power of Sharing Linked Data (NASIG)
The Power of Sharing Linked Data (NASIG)
 
The Role of Discovery and its Relationship with the ILS
The Role of Discovery and its Relationship with the ILSThe Role of Discovery and its Relationship with the ILS
The Role of Discovery and its Relationship with the ILS
 

Recently uploaded

Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlkumarajju5765
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 

Recently uploaded (20)

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 

The Future of Metadata Management & Making Library Collections Discoverable on the Web

  • 1. The Future of Metadata Management & Making Library Collections Discoverable on the Web Ted Fons, OCLC The National Library Descriptors Conference Proposed Changes in the Development of Library Collections in the Era of the Semantic Web Warsaw - 21-22 April, 2015
  • 4. The Goal S.R. Ranganathan 1. Books are for use. 2. Every reader his [or her] book. 3. Every book its reader. 4. Save the time of the reader. 5. The library is a growing organism. Image credit: http://static.guim.co.uk/sys- images/Guardian/Pix/pictures/2009/3/23/1237806064989/Young-man- Connect the reader to content.
  • 6. How We Work Today Local Group Global We catalog: • Books • Music • Journal titles • Authorities
  • 7. What is in a Global Discovery System Readers want: • eBooks • Articles • Unique content We catalog: • Books • Music • Journal titles • Authorities
  • 8. • Calhoun: “Metadata has changed as collections have changed. It remains important, but it comes in many forms and from many sources. The centrality of bibliographic control has been disrupted.” P. 15. • And: “There is less need and place for traditional bibliographic control as a set of methods for providing [metadata] for discovery, access and management of the content of mainstream books and serials. “p. 24. Catalogue 2.0 by Karen Calhoun
  • 9. “Ken Chad examines the distinction betweeen redundant cataloging (re-editing records to suit local practices) and redundant catalogs [in the UK]. He enumerates the benefits of moving from … 160 standalone catalogues to a single shared catalogue at the network level for all of these libraries” Karen Calhoun in Catalogue 2.0 Duplicating records for local purposes
  • 10. The world’s libraries. Connected. What is in a Global Discovery System Readers want: • eBooks • Articles • Unique content We catalog: • Books • Music • Journal titles • Authorities
  • 11. The world’s libraries. Connected. The value of authorities FRAD Tasks  Find  Identify  Clarify  Contextualize http://www.ifla.org/publications/functional-requirements-for-authority-data
  • 12. What is in a Global Discovery System So, what should we do? 1. Catalog unique materials 2. Create authorities 3. Use harvesting and data mining for everything else
  • 15. What is in a Global Discovery System Readers want: • eBooks • Articles • Unique content We catalog: • Books • Music • Journal titles • Authorities
  • 17. Data Mining & The Web
  • 19. The Web of … Documents Active Documents Discovery Data Knowledge ☌☌☌ Libraries can connect to the web of knowledge
  • 20. The Knowledge Graph ☌ Libraries can connect to the web of knowledge Libraries can create a knowledge graph Documents Entities
  • 21.
  • 22.
  • 23. Establishing Semantic Identity For Accurate Representation on the Web 12/09/2014 Kenning Arlitsch Dean of the Library Kenning Arlitsch, Dean of the Library Patrick OBrien, Semantic Web Research Director
  • 24. The Point Libraries are poorly defined and represented on the Semantic Web… …but we know how to fix that problem… …mostly
  • 25. Google’s Perception of MSU Lib - 2012
  • 30. Summary • Define library organization in Wikipedia – Beware of *pedia culture and process • Engage with other trusted data sources – FreeBase – Google Places/Google My Business – Google+ • Mark-up metadata with Schema.org
  • 31. The Knowledge Graph ☌ Libraries can connect to the web of knowledge Libraries can create a knowledge graph Documents Entities
  • 32. person place object concept organization work author subjectitem availability The solution starts here. Thelibraryknowledgegraph
  • 33. person place object concept organization work Thelibraryknowledgegraph http://www.ifla.org/publications/functional-requirements-for-bibliographic-records FRBR Entities  Work  Expression  Manifestation  Item
  • 34. Exampleofbenefits… Discovery The Name of the Rose Summary: The year is 1327. Franciscans in a wealthy Italian abbey are suspected of heresy, and Brother William of Baskerville arrives to investigate. His delicate mission is suddenly overshadowed by seven bizarre deaths that take place in seven days and nights of apocalyptic terror. Subjects Borrowing Options eBooks | Printed Books | Audio Books Other Languages Monastic libraries -- Italy – Fiction | Semiotics -- Fiction
  • 35. Example of Benefits: Web Exposure data.BnF.fr Number of Visits - 1,000,000 2,000,000 3,000,000 4,000,000 5,000,000 6,000,000 7,000,000 8,000,000 January February March April May June July August September October Visits to WorldCat 2012 2013 2014
  • 37. The Data Strategy: WorldCat Entities Work and Person Creation Process Flow Extractors Enhanced WC Records Harvested Triples Refined Triples CreateWorkReducer 1. Harvest 3. Reduce There are three components to the pipeline for creating Work and Person entities. The harvest component extracts the data from the different sources. The map component identifies the objects and combines the triples through name recognition and authority linkages. The reduce component pulls together the entity descriptions and writes them out to HBase. VIAF LCNAF DBPedi a CreatePersonReduc 2. Map ObjectMappe r PersonCombi ne WorkCombin e Datamining
  • 38. • 197+ million Work descriptions and URIs • Schema.org + BiblioGraph.net • RDF Data formats • RDF/XML, Turtle, Triples, JSON-LD • Links to WorldCat manifestations • Links to Dewey, LCSH, LCNAF, VIAF, FAST • Open Data license via Linked Data Explorer • 2015: Discovery API, Metadata API • Released April 2014 http://www.oclc.org/dataThe Work Entity
  • 39. • 98+ million Person descriptions and URIs • Person entities with authority: 20.2 million • Person entities without authority: 78.3 million • Schema.org + BiblioGraph.net • Harvested from WorldCat data and enriched from other hubs RDF Data formats • RDF/XML, Turtle, Triples, JSON-LD • Links to WorldCat Works. Added links from WC Works. • Open Data license via Linked Data Explorer • 2015: Linked Data Explorer, Discovery API http://www.oclc.org/dataThe Person Entity
  • 40. person place object concept organization work Thelibraryknowledgegraph
  • 43. So, what should we do? 1.Catalog unique materials 2.Create authorities 3.Use harvesting and data mining for everything else
  • 45. Discussion Ted Fons Executive Director, Data Services & WorldCat Quality fonst@oclc.org

Editor's Notes

  1. The Web has and continues to evolve: Linked Documents – documents built on the fly from databases – search engines analyzing the links to create discovery – sites starting to publish the [linked] data behind the documents. How have libraries engaged with the web: Enthusiastic & leading for documents – actively disengaged with the search engines (technology issues and commercial concerns) – partial engagement with the web of data. A Web of knowledge is forming as the search engines analyze the relationships in the data – how will libraries participate?
  2. The Web has and continues to evolve: Linked Documents – documents built on the fly from databases – search engines analyzing the links to create discovery – sites starting to publish the [linked] data behind the documents. How have libraries engaged with the web: Enthusiastic & leading for documents – actively disengaged with the search engines (technology issues and commercial concerns) – partial engagement with the web of data. A Web of knowledge is forming as the search engines analyze the relationships in the data – how will libraries participate?
  3. Google's knowledge Graph navigating between entity descriptions….
  4. 0 - Search for MSU in 2012 1 - On the left traditional Organic search results 2 – notice poor description of the MSU Library 3 – Google’s Knowledge Card of the “Thing” they believe to be Montana State University Library 4 – However, it’s the wrong phone number, wrong city, wrong address, wrong map
  5. 0 – After doing research in this area we have concluded a Library must establish and maintain its semantic identity on the Web. This is the same search for Montana state university library in 2014. Lets walk though the changes on this slide and then talk about how they came about in the rest of the presentation 1 – Improved description of the library 2 - Correct address, phone number and a Google map link 3 - more links to key areas of our web site as determined by Google’s algorithms 4 - link to more results from Montana.edu 5 - link to our G+ page w/ a picture of our building and the number of followers 6 – the correct MSU Library Logo 7 – link to a robust Wikipedia description of the MSU library
  6. The Web has and continues to evolve: Linked Documents – documents built on the fly from databases – search engines analyzing the links to create discovery – sites starting to publish the [linked] data behind the documents. How have libraries engaged with the web: Enthusiastic & leading for documents – actively disengaged with the search engines (technology issues and commercial concerns) – partial engagement with the web of data. A Web of knowledge is forming as the search engines analyze the relationships in the data – how will libraries participate?
  7. We, at OCLC, with our major data ingest and processing techniques – Big Data tech Matching incoming data with what we have Identifying the entities and associating their role attributes Works – not so far very visible in libraries – important on the web Building a graph of relationships
  8. Data to underpin innovation! - A person knowledge card in a prototype WorldCat Discovery interface
  9. Refined from 320M harvested entities. f there is a 100 or 700 field for a Person entity, then there will be a BY relationship (creator, contributor, author, illustrator, etc) in the WC Work description that includes a WC Person URI. If there is a 600 field for a Person entity, then there will be an ABOUT relationship (subject, etc) in the WC Work description that includes a WC Person URI. Other sources: After creating the set of Person entities, we started the process of enriching the entities with data harvested from other sources - images and other information from DBPedia, preferred names from LC, see also links from VIAF, and profile information (subjects, genres, and roles most known for) from WC Identities.