International Conference on Theory and Practice of Digital Libraries
September 2017
CC BY-SA
Outline
● Motivation for rethinking metadata aggregation approaches
• Focus: technology adoption in Cultural Heritage/Europeana
● Investigated technologies: IIIF and Sitemaps
● Case studies
● Application of the results in aggregation at Europeana
● Ongoing and future work
Metadata aggregation of IIIF Resources at Europeana
Czech Republic, PD
1887, Uměleckoprůmyslové museum v Praze
Preissig, Vojtech
Coloured etchings
Motivation in the context
of Cultural Heritage
Europeana
The Platform for Europe’s Digital Cultural Heritage
● Europeana aggregates (and makes available) metadata:
• From all EU countries
• From ~3,500 galleries,
libraries, archives and museums
• Under a CC0 licence
• More than 54M objects
• In about 50 languages
“We transform the world with culture! We
want to build on Europe’s rich heritage and
make it easier for people to use, whether
for work, for learning or just for fun.”
What kinds of technologies are we
considering?
● Focus on technology adoption:
• Technologies that present low barriers for adoption by data providers
● Technologies used by Cultural Heritage institutions for other purposes
• Search engine optimization
• Linked data
• Social web technologies
• IIIF
● What are the successors of OAI-PMH?
Cristallisation ou Mouvement du
temps, René Bord
1987, Bibliothèque Municipale De Lyon,
public domain
Investigated technologies:
IIIF
Brief introduction to the IIIF APIs
Europeana & IIIF
How can IIIF be used for metadata aggregation?
Ben Albritton Mike Appleby Tom Cramer Jon Stroop Rob Sanderson Stu Snydman Simeon Warner IIIF.io
@bla222 @mikeapps @tcramer @jpstroop @azaroth42 @stusnydman @zimeon @iiif_io
Object = Image + Presentation
Presentation API
• Descriptive: label, description
• Rights: license, attribution
(continued below)
Image API
● Image Data
Presentation API (continued)
• Structure
• Collections of objects
• Manifests organizing Items, Sequences, Parts together with their
metadata
• Linking
• service: additional service endpoint
• related: resource to display to the user
• seeAlso: semantic metadata resource
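As a rough illustration of how a crawler might use the Structure elements above, the sketch below walks a IIIF Collection and yields the manifests it reaches. It assumes Presentation API 2.x JSON (`sc:Collection`, `sc:Manifest`, and the `collections`, `manifests` and `members` properties). The names `iter_manifest_uris` and `fetch` are hypothetical; `fetch` stands in for whatever HTTP client dereferences a URI and returns parsed JSON, and the `example.org` URIs are made up.

```python
from typing import Callable, Iterator, List, Set

# Properties that may list children of a collection in Presentation API 2.x.
CHILD_KEYS = ("collections", "manifests", "members")


def iter_manifest_uris(root_uri: str, fetch: Callable[[str], dict]) -> Iterator[str]:
    """Yield each manifest URI reachable from the collection at root_uri once."""
    seen: Set[str] = set()
    pending: List[str] = [root_uri]
    while pending:
        uri = pending.pop()
        if uri in seen:
            continue  # guards against collections that reference each other
        seen.add(uri)
        collection = fetch(uri)
        for key in CHILD_KEYS:
            for child in collection.get(key, []):
                child_uri = child.get("@id")
                if child_uri is None:
                    continue
                if child.get("@type") == "sc:Manifest" and child_uri not in seen:
                    seen.add(child_uri)
                    yield child_uri
                elif child.get("@type") == "sc:Collection":
                    pending.append(child_uri)


# In-memory stand-in for a provider's IIIF endpoint (hypothetical data).
SAMPLE = {
    "https://example.org/top": {
        "@id": "https://example.org/top",
        "@type": "sc:Collection",
        "collections": [{"@id": "https://example.org/sub", "@type": "sc:Collection"}],
        "manifests": [{"@id": "https://example.org/m1", "@type": "sc:Manifest"}],
    },
    "https://example.org/sub": {
        "@id": "https://example.org/sub",
        "@type": "sc:Collection",
        "collections": [{"@id": "https://example.org/top", "@type": "sc:Collection"}],
        "manifests": [
            {"@id": "https://example.org/m2", "@type": "sc:Manifest"},
            {"@id": "https://example.org/m1", "@type": "sc:Manifest"},
        ],
    },
}

manifests = list(iter_manifest_uris("https://example.org/top", SAMPLE.__getitem__))
```

In a real harvest, `fetch` would issue HTTP requests; keeping it as a parameter lets the traversal be exercised without network access.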
Investigated technologies:
Sitemaps
● Sitemaps allow webmasters to inform search engines about pages on their
sites that are available for crawling
● Sitemaps are supported/used by:
• all major search engines
• many content management systems
• many Europeana data providers
● Sitemaps provide a simple technological solution with a very low
implementation barrier
● Sitemaps can support a large range of resource types
• The Sitemaps protocol has extensions for images and videos (defined by Google)
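To make the low implementation barrier concrete, here is a minimal sketch of reading a sitemap with the Python standard library. It assumes the standard Sitemaps 0.9 namespace and Google's image extension namespace; `SitemapEntry`, `parse_sitemap` and the sample URLs are hypothetical names chosen for illustration, and sitemap index files are not handled.

```python
import xml.etree.ElementTree as ET
from typing import List, NamedTuple, Optional

NS = {
    "sm": "http://www.sitemaps.org/schemas/sitemap/0.9",
    "image": "http://www.google.com/schemas/sitemap-image/1.1",
}


class SitemapEntry(NamedTuple):
    loc: str                # URL of the page
    lastmod: Optional[str]  # last modification date, if the provider supplies one
    images: List[str]       # image URLs from the image extension, if any


def parse_sitemap(xml_text: str) -> List[SitemapEntry]:
    """Extract page URLs, modification dates and image URLs from a <urlset>."""
    root = ET.fromstring(xml_text)
    entries = []
    for url in root.findall("sm:url", NS):
        loc = (url.findtext("sm:loc", default="", namespaces=NS) or "").strip()
        if not loc:
            continue  # skip malformed entries without a location
        lastmod = url.findtext("sm:lastmod", default=None, namespaces=NS)
        images = [
            el.text.strip()
            for el in url.findall("image:image/image:loc", NS)
            if el.text
        ]
        entries.append(SitemapEntry(loc, lastmod.strip() if lastmod else None, images))
    return entries


SAMPLE_XML = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
        xmlns:image="http://www.google.com/schemas/sitemap-image/1.1">
  <url>
    <loc>https://example.org/object/1</loc>
    <lastmod>2017-09-01</lastmod>
    <image:image><image:loc>https://example.org/object/1.jpg</image:loc></image:image>
  </url>
  <url>
    <loc>https://example.org/object/2</loc>
  </url>
</urlset>"""

entries = parse_sitemap(SAMPLE_XML)
```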
Case studies
Netherlands, Public Domain
1910-1925, Rijksmuseum
Anonymous
Tak met vier magnolia’s
First case study:
Crawling services across the IIIF universe
Questions addressed:
• Can Europeana find the available IIIF services through IIIF Service
Registries?
• Is the output of IIIF crawlable? Can robots follow links in IIIF output and
reach all resources?
• How mature and uniform are existing IIIF implementations?
• Is metadata available?
• Are machine readable licenses available?
Main conclusions:
• Registries are available and are machine readable, but coverage was only
partial
• IIIF provides all that is necessary, but some features are optional (e.g. IIIF
Collections)
• Only minor compliance problems were found, due to the immaturity of the
implementations
• IIIF provides a way to link to metadata, but it is optional (and often not
used, misused, or not fully informative)
• IIIF provides licensing information, but it is optional (and often not used)
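The kind of check behind these conclusions can be sketched as follows: given a parsed manifest, report which of the optional descriptive, rights and linking properties are actually present. This is an illustrative sketch, not the tooling used in the study. It assumes Presentation API 2.x property names (`label`, `description`, `metadata`, `license`, `attribution`, `seeAlso`), and `see_also_uris` and `audit_manifest` are hypothetical helper names.

```python
from typing import Any, Dict, List


def see_also_uris(resource: Dict[str, Any]) -> List[str]:
    """Return seeAlso targets, whether given as a string, an object or a list."""
    value = resource.get("seeAlso", [])
    items = value if isinstance(value, list) else [value]
    uris = []
    for item in items:
        if isinstance(item, str):
            uri = item
        elif isinstance(item, dict):
            uri = item.get("@id")
        else:
            uri = None
        if uri:
            uris.append(uri)
    return uris


def audit_manifest(manifest: Dict[str, Any]) -> Dict[str, bool]:
    """Flag which properties useful for aggregation the manifest provides."""
    return {
        "label": bool(manifest.get("label")),
        "description": bool(manifest.get("description")),
        "metadata": bool(manifest.get("metadata")),
        "license": bool(manifest.get("license")),
        "attribution": bool(manifest.get("attribution")),
        "seeAlso": bool(see_also_uris(manifest)),
    }


# Hypothetical manifest with a label and a metadata link, but no licence.
SAMPLE_MANIFEST = {
    "@id": "https://example.org/m1",
    "@type": "sc:Manifest",
    "label": "Coloured etchings",
    "seeAlso": {"@id": "https://example.org/m1.rdf", "format": "application/rdf+xml"},
}

report = audit_manifest(SAMPLE_MANIFEST)
```

A crawler could aggregate such reports per provider to see how often licensing and `seeAlso` links are missing.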
Case studies with
Europeana Partners
Case studies with partners
To study the feasibility of metadata aggregation via IIIF and Sitemaps,
we undertook case studies with two data providers from the Europeana Network:
• National Library of Wales
• Very active in the IIIF community
• Very advanced in IIIF implementation
• Expertise in full-text content (over IIIF)
• University College Dublin
• Very advanced in IIIF implementation
• Expertise in internet search engine optimization (Sitemaps and its media-specific
extensions)
Case studies with National Library of Wales
and University College Dublin
• Crawling IIIF services via IIIF Collections
• Crawling IIIF services via Sitemaps
• Standard Sitemaps
• Sitemaps extended with elements used in IIIF specifications
• Sitemaps extended with elements from the ResourceSync namespace
• Crawling IIIF services via IIIF Collections and HTTP cache headers
• HTTP cache headers allow crawlers to use resource modification
timestamps
• Timestamps are essential for aggregating large collections
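One way a crawler might use these timestamps is a conditional request: send `If-Modified-Since` with the time of the previous harvest, and re-process a resource only if the server does not answer `304 Not Modified` and its `Last-Modified` header is newer. The sketch below covers only the header logic, using the standard `email.utils` date helpers; `conditional_headers` and `is_modified` are hypothetical names, and the HTTP call itself is left out.

```python
from datetime import datetime, timezone
from email.utils import format_datetime, parsedate_to_datetime
from typing import Dict, Optional


def conditional_headers(last_harvest: datetime) -> Dict[str, str]:
    """Build the request header asking only for changes since last_harvest.

    last_harvest must be a timezone-aware datetime.
    """
    stamp = format_datetime(last_harvest.astimezone(timezone.utc), usegmt=True)
    return {"If-Modified-Since": stamp}


def is_modified(status: int, last_modified: Optional[str], last_harvest: datetime) -> bool:
    """Decide whether a resource must be re-harvested, given the response."""
    if status == 304:
        return False  # server confirms nothing changed
    if last_modified is None:
        return True  # no timestamp available: harvest to be safe
    try:
        modified = parsedate_to_datetime(last_modified)
    except (TypeError, ValueError):
        return True  # unparseable header: harvest to be safe
    if modified.tzinfo is None:
        modified = modified.replace(tzinfo=timezone.utc)
    return modified > last_harvest


last = datetime(2017, 9, 4, 10, 0, 0, tzinfo=timezone.utc)
headers = conditional_headers(last)
```

Treating missing or malformed timestamps as "modified" trades extra downloads for completeness, which seems the safer default for an aggregator.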
Main conclusions from the case studies
• Applying these technologies was straightforward for providers
• When providers have in-house knowledge of a technology, its adoption/adaptation is
simplified
• None of the case studies presented serious technological obstacles
• Very simple technological solutions are available
• Only very large collections may require additional complexity
• The main challenge is choosing among the several possibilities and
establishing a standard (or best practice) within the relevant communities:
• Europeana is working with the IIIF community in the context of the IIIF Discovery Technical
Specification group
• Europeana will prepare recommendations targeted at its own partner network.
Application of the results
Operational IIIF/Sitemaps harvests so far at Europeana
The outcomes of the case studies have led to operational
IIIF/Sitemaps-based aggregation into Europeana:
• National Library of Wales
• Sitemaps + IIIF
• University College Dublin
• Sitemaps + IIIF + Sitemaps Video Extension
• Wellcome Library
• IIIF Collection + IIIF
Future work
France, Public Domain
Agence Rol. Agence photographique,
Bibliothèque nationale de France
Chat "regardant" à travers une longue-vue et
autre chat perché dessus
Ongoing R&D work
Crawling websites/LOD/IIIF in search of
resources represented with Schema.org
• Research Question:
• Can metadata represented with Schema.org still comply with the requirements of
Europeana/EDM? If so, with what level of quality?
• One IIIF case study is in progress at this time
• IIIF provider: North Carolina State University Libraries
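For context, Schema.org descriptions are commonly embedded in web pages as JSON-LD inside `<script type="application/ld+json">` elements. The sketch below shows one way a crawler could pull such blocks out of a page using only the Python standard library. It is an assumption about how such crawling might work, not the method used in the case study; `JsonLdExtractor`, `extract_jsonld` and the sample page are hypothetical.

```python
import json
from html.parser import HTMLParser
from typing import Any, List


class JsonLdExtractor(HTMLParser):
    """Collect the parsed contents of application/ld+json script elements."""

    def __init__(self) -> None:
        super().__init__()
        self._in_jsonld = False
        self._chunks: List[str] = []
        self.documents: List[Any] = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._chunks = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._chunks.append(data)  # script content may arrive in pieces

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.documents.append(json.loads("".join(self._chunks)))
            except json.JSONDecodeError:
                pass  # ignore blocks that are not valid JSON


def extract_jsonld(html: str) -> List[Any]:
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    return parser.documents


SAMPLE_HTML = """<html><head>
<script type="text/javascript">var x = 1;</script>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "VisualArtwork", "name": "Coloured etchings"}
</script>
</head><body></body></html>"""

documents = extract_jsonld(SAMPLE_HTML)
```

Whether the extracted properties are sufficient for EDM is exactly the open research question stated above; this sketch only covers retrieval.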
Future work
• Research the implications of IIIF and Sitemaps harvesting for the internal
workflows of aggregators
• ResourceSync: one case study in preparation with a collection of more than
600,000 resources
• Continue monitoring and investigating technology trends in our domain:
• Follow the outcomes from the IIIF Discovery Technical Specification Group [1]
• The Linked Data Platform [2]
• Use of notification frameworks for metadata aggregation:
WebSub [3], Linked Data Notifications [4]
Thank you for your attention
nuno.freire@tecnico.ulisboa.pt
Netherlands, Public Domain
1600 - 1625, Rijksmuseum
Anonymous
Arrival of a Portuguese ship
Acknowledgments
Valentine Charles, Europeana Foundation
Fundação para a Ciência e a Tecnologia (FCT): UID/CEC/50021/2013
European Commission: grant agreement number CEF-TC-2015-1-01.

More Related Content

What's hot

The IIIF Image API
The IIIF Image APIThe IIIF Image API
The IIIF Image APIIIIF_io
 
Sharing 3D Cultural Heritage: Standards and metadata
Sharing 3D Cultural Heritage: Standards and metadataSharing 3D Cultural Heritage: Standards and metadata
Sharing 3D Cultural Heritage: Standards and metadataCARARE
 
Fcv acad ind_szeliski
Fcv acad ind_szeliskiFcv acad ind_szeliski
Fcv acad ind_szeliskizukun
 
Europeana Archaeology
Europeana ArchaeologyEuropeana Archaeology
Europeana ArchaeologyCARARE
 
Improving Access and Exploitation of 3D Cultural Heritage Data | Anthony Corns
Improving Access and Exploitation of 3D Cultural Heritage Data | Anthony CornsImproving Access and Exploitation of 3D Cultural Heritage Data | Anthony Corns
Improving Access and Exploitation of 3D Cultural Heritage Data | Anthony CornsFARO
 
Connecting European Archaeology datasets: prospects and challenges
Connecting European Archaeology datasets: prospects and challengesConnecting European Archaeology datasets: prospects and challenges
Connecting European Archaeology datasets: prospects and challengesCARARE
 
IIIF for Index of Christian Art
IIIF for Index of Christian ArtIIIF for Index of Christian Art
IIIF for Index of Christian ArtJon Stroop
 
CARARE: Connecting Archaeology and Architecture in Europeana
CARARE: Connecting Archaeology and Architecture in EuropeanaCARARE: Connecting Archaeology and Architecture in Europeana
CARARE: Connecting Archaeology and Architecture in EuropeanaCARARE
 
Digital Tools for Manuscript Study IIIF
Digital Tools for Manuscript Study IIIFDigital Tools for Manuscript Study IIIF
Digital Tools for Manuscript Study IIIFRachel Di Cresce
 
Achieving interoperability between the CARARE schema for monuments and sites ...
Achieving interoperability between the CARARE schema for monuments and sites ...Achieving interoperability between the CARARE schema for monuments and sites ...
Achieving interoperability between the CARARE schema for monuments and sites ...CARARE
 
3D reconstructions for story telling and understanding
3D reconstructions for story telling and understanding3D reconstructions for story telling and understanding
3D reconstructions for story telling and understandingCARARE
 
Beyond Built Heritage Documentation: digital applications needs for research ...
Beyond Built Heritage Documentation: digital applications needs for research ...Beyond Built Heritage Documentation: digital applications needs for research ...
Beyond Built Heritage Documentation: digital applications needs for research ...Ruggero Lancia
 
Early Chinese Periodicals Online (ECPO): From Digitization Towards Open Data....
Early Chinese Periodicals Online (ECPO): From Digitization Towards Open Data....Early Chinese Periodicals Online (ECPO): From Digitization Towards Open Data....
Early Chinese Periodicals Online (ECPO): From Digitization Towards Open Data....Matthias Arnold
 
Carare 2.0: Developing a metadata schema
Carare 2.0: Developing a metadata schema Carare 2.0: Developing a metadata schema
Carare 2.0: Developing a metadata schema CARARE
 
A Cultural Heritage Repository as Source for Learning Materials
A Cultural Heritage Repository as Source for Learning MaterialsA Cultural Heritage Repository as Source for Learning Materials
A Cultural Heritage Repository as Source for Learning MaterialsManjulaPatel
 
Europeana Music Channel, wireframes
Europeana Music Channel, wireframesEuropeana Music Channel, wireframes
Europeana Music Channel, wireframesDavid Haskiya
 
European databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsEuropean databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsCARARE
 
Geographic Information in the Carare and Athena Projects
Geographic Information in the Carare and Athena ProjectsGeographic Information in the Carare and Athena Projects
Geographic Information in the Carare and Athena ProjectsCARARE
 

What's hot (20)

The IIIF Image API
The IIIF Image APIThe IIIF Image API
The IIIF Image API
 
Sharing 3D Cultural Heritage: Standards and metadata
Sharing 3D Cultural Heritage: Standards and metadataSharing 3D Cultural Heritage: Standards and metadata
Sharing 3D Cultural Heritage: Standards and metadata
 
Fcv acad ind_szeliski
Fcv acad ind_szeliskiFcv acad ind_szeliski
Fcv acad ind_szeliski
 
Europeana Archaeology
Europeana ArchaeologyEuropeana Archaeology
Europeana Archaeology
 
Improving Access and Exploitation of 3D Cultural Heritage Data | Anthony Corns
Improving Access and Exploitation of 3D Cultural Heritage Data | Anthony CornsImproving Access and Exploitation of 3D Cultural Heritage Data | Anthony Corns
Improving Access and Exploitation of 3D Cultural Heritage Data | Anthony Corns
 
Connecting European Archaeology datasets: prospects and challenges
Connecting European Archaeology datasets: prospects and challengesConnecting European Archaeology datasets: prospects and challenges
Connecting European Archaeology datasets: prospects and challenges
 
IIIF for Index of Christian Art
IIIF for Index of Christian ArtIIIF for Index of Christian Art
IIIF for Index of Christian Art
 
CARARE: Connecting Archaeology and Architecture in Europeana
CARARE: Connecting Archaeology and Architecture in EuropeanaCARARE: Connecting Archaeology and Architecture in Europeana
CARARE: Connecting Archaeology and Architecture in Europeana
 
Digital Tools for Manuscript Study IIIF
Digital Tools for Manuscript Study IIIFDigital Tools for Manuscript Study IIIF
Digital Tools for Manuscript Study IIIF
 
Achieving interoperability between the CARARE schema for monuments and sites ...
Achieving interoperability between the CARARE schema for monuments and sites ...Achieving interoperability between the CARARE schema for monuments and sites ...
Achieving interoperability between the CARARE schema for monuments and sites ...
 
3D reconstructions for story telling and understanding
3D reconstructions for story telling and understanding3D reconstructions for story telling and understanding
3D reconstructions for story telling and understanding
 
Beyond Built Heritage Documentation: digital applications needs for research ...
Beyond Built Heritage Documentation: digital applications needs for research ...Beyond Built Heritage Documentation: digital applications needs for research ...
Beyond Built Heritage Documentation: digital applications needs for research ...
 
Early Chinese Periodicals Online (ECPO): From Digitization Towards Open Data....
Early Chinese Periodicals Online (ECPO): From Digitization Towards Open Data....Early Chinese Periodicals Online (ECPO): From Digitization Towards Open Data....
Early Chinese Periodicals Online (ECPO): From Digitization Towards Open Data....
 
Carare 2.0: Developing a metadata schema
Carare 2.0: Developing a metadata schema Carare 2.0: Developing a metadata schema
Carare 2.0: Developing a metadata schema
 
A Cultural Heritage Repository as Source for Learning Materials
A Cultural Heritage Repository as Source for Learning MaterialsA Cultural Heritage Repository as Source for Learning Materials
A Cultural Heritage Repository as Source for Learning Materials
 
Ariadne Services
Ariadne ServicesAriadne Services
Ariadne Services
 
Europeana Music Channel, wireframes
Europeana Music Channel, wireframesEuropeana Music Channel, wireframes
Europeana Music Channel, wireframes
 
Brazil eu collaboration
Brazil eu collaborationBrazil eu collaboration
Brazil eu collaboration
 
European databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsEuropean databases in cultural heritage: making connections
European databases in cultural heritage: making connections
 
Geographic Information in the Carare and Athena Projects
Geographic Information in the Carare and Athena ProjectsGeographic Information in the Carare and Athena Projects
Geographic Information in the Carare and Athena Projects
 

Similar to Digital Libraries Conference Focuses on Metadata Aggregation

IIIF at europeana, IIIF conference, Vatican, 2017
IIIF at europeana, IIIF conference, Vatican, 2017IIIF at europeana, IIIF conference, Vatican, 2017
IIIF at europeana, IIIF conference, Vatican, 2017Nuno Freire
 
New approaches for data acquisition at europeana iiif, sitemaps and schema.o...
New approaches for data acquisition at europeana  iiif, sitemaps and schema.o...New approaches for data acquisition at europeana  iiif, sitemaps and schema.o...
New approaches for data acquisition at europeana iiif, sitemaps and schema.o...Nuno Freire
 
Europeana and IIIF
Europeana and IIIFEuropeana and IIIF
Europeana and IIIFIIIF_io
 
NISO REST Training IIIF
NISO REST Training IIIF NISO REST Training IIIF
NISO REST Training IIIF Glen Robson
 
IIIF Introduction given in South Africa - 2019
IIIF Introduction given in South Africa - 2019IIIF Introduction given in South Africa - 2019
IIIF Introduction given in South Africa - 2019Glen Robson
 
IIIF for CNI Spring 2014 Membership Meeting
IIIF for CNI Spring 2014 Membership MeetingIIIF for CNI Spring 2014 Membership Meeting
IIIF for CNI Spring 2014 Membership MeetingTom-Cramer
 
The Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataThe Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataNuno Freire
 
Europeana & IIIF - what we have been doing with IIIF and why
Europeana & IIIF - what we have been doing with IIIF and whyEuropeana & IIIF - what we have been doing with IIIF and why
Europeana & IIIF - what we have been doing with IIIF and whyDavid Haskiya
 
International Image Interoperability Framework (IIIF). Sharing high resolutio...
International Image Interoperability Framework (IIIF). Sharing high resolutio...International Image Interoperability Framework (IIIF). Sharing high resolutio...
International Image Interoperability Framework (IIIF). Sharing high resolutio...LIBIS
 
IIIF Introduction and Opportunities at Cornell
IIIF Introduction and Opportunities at CornellIIIF Introduction and Opportunities at Cornell
IIIF Introduction and Opportunities at CornellSimeon Warner
 
3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing access3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing accessCARARE
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaAntoine Isaac
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseAntoine Isaac
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosOCLC
 
The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
The Hellenic Aggregator - Overview, procedures & the cooperation with EuropeanaThe Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
The Hellenic Aggregator - Overview, procedures & the cooperation with EuropeanaVangelis Banos
 
Evaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Evaluation of Schema.org for Aggregation of Cultural Heritage MetadataEvaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Evaluation of Schema.org for Aggregation of Cultural Heritage MetadataNuno Freire
 

Similar to Digital Libraries Conference Focuses on Metadata Aggregation (20)

IIIF at europeana, IIIF conference, Vatican, 2017
IIIF at europeana, IIIF conference, Vatican, 2017IIIF at europeana, IIIF conference, Vatican, 2017
IIIF at europeana, IIIF conference, Vatican, 2017
 
New approaches for data acquisition at europeana iiif, sitemaps and schema.o...
New approaches for data acquisition at europeana  iiif, sitemaps and schema.o...New approaches for data acquisition at europeana  iiif, sitemaps and schema.o...
New approaches for data acquisition at europeana iiif, sitemaps and schema.o...
 
Europeana and IIIF
Europeana and IIIFEuropeana and IIIF
Europeana and IIIF
 
NISO REST Training IIIF
NISO REST Training IIIF NISO REST Training IIIF
NISO REST Training IIIF
 
IIIF Introduction given in South Africa - 2019
IIIF Introduction given in South Africa - 2019IIIF Introduction given in South Africa - 2019
IIIF Introduction given in South Africa - 2019
 
DLCS
DLCSDLCS
DLCS
 
IIIF for CNI Spring 2014 Membership Meeting
IIIF for CNI Spring 2014 Membership MeetingIIIF for CNI Spring 2014 Membership Meeting
IIIF for CNI Spring 2014 Membership Meeting
 
The Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage DataThe Europeana Community: Semantics and Cultural Heritage Data
The Europeana Community: Semantics and Cultural Heritage Data
 
Europeana & IIIF - what we have been doing with IIIF and why
Europeana & IIIF - what we have been doing with IIIF and whyEuropeana & IIIF - what we have been doing with IIIF and why
Europeana & IIIF - what we have been doing with IIIF and why
 
International Image Interoperability Framework (IIIF). Sharing high resolutio...
International Image Interoperability Framework (IIIF). Sharing high resolutio...International Image Interoperability Framework (IIIF). Sharing high resolutio...
International Image Interoperability Framework (IIIF). Sharing high resolutio...
 
IIIF Introduction and Opportunities at Cornell
IIIF Introduction and Opportunities at CornellIIIF Introduction and Opportunities at Cornell
IIIF Introduction and Opportunities at Cornell
 
3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing access3D content in Europeana: the challenges of providing access
3D content in Europeana: the challenges of providing access
 
Europeana datainaction nov2012
Europeana datainaction nov2012Europeana datainaction nov2012
Europeana datainaction nov2012
 
International Image Interoperability Framework (IIIF)
International Image Interoperability Framework (IIIF)International Image Interoperability Framework (IIIF)
International Image Interoperability Framework (IIIF)
 
Multilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at EuropeanaMultilingual challenges and ongoing work to tackle them at Europeana
Multilingual challenges and ongoing work to tackle them at Europeana
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) case
 
International Image Interoperability Framework (IIIF)
International Image Interoperability Framework (IIIF)International Image Interoperability Framework (IIIF)
International Image Interoperability Framework (IIIF)
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
 
The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
The Hellenic Aggregator - Overview, procedures & the cooperation with EuropeanaThe Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
 
Evaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Evaluation of Schema.org for Aggregation of Cultural Heritage MetadataEvaluation of Schema.org for Aggregation of Cultural Heritage Metadata
Evaluation of Schema.org for Aggregation of Cultural Heritage Metadata
 

More from Nuno Freire

Aggregation of Schema.org Linked Data for the Europeana Common Culture project
Aggregation of Schema.org Linked Data for the Europeana Common Culture projectAggregation of Schema.org Linked Data for the Europeana Common Culture project
Aggregation of Schema.org Linked Data for the Europeana Common Culture projectNuno Freire
 
Connecting Europe Facility - The eArchiving Building Block
Connecting Europe Facility - The eArchiving Building BlockConnecting Europe Facility - The eArchiving Building Block
Connecting Europe Facility - The eArchiving Building BlockNuno Freire
 
Automated interpretability of linked data ontologies: an evaluation within th...
Automated interpretability of linked data ontologies: an evaluation within th...Automated interpretability of linked data ontologies: an evaluation within th...
Automated interpretability of linked data ontologies: an evaluation within th...Nuno Freire
 
Next Generation Research with Europeana: the Humanities and Cultural Heritage...
Next Generation Research with Europeana: the Humanities and Cultural Heritage...Next Generation Research with Europeana: the Humanities and Cultural Heritage...
Next Generation Research with Europeana: the Humanities and Cultural Heritage...Nuno Freire
 
Demo of the Data Aggregation Lab - June 2018
Demo of the Data Aggregation Lab - June 2018Demo of the Data Aggregation Lab - June 2018
Demo of the Data Aggregation Lab - June 2018Nuno Freire
 
Demo of the Data Aggregation Lab - October 2018
Demo of the Data Aggregation Lab - October 2018Demo of the Data Aggregation Lab - October 2018
Demo of the Data Aggregation Lab - October 2018Nuno Freire
 
Opening Digitized Newspapers Corpora: Europeana’s Full-text Data Interoperabi...
Opening Digitized Newspapers Corpora: Europeana’s Full-text Data Interoperabi...Opening Digitized Newspapers Corpora: Europeana’s Full-text Data Interoperabi...
Opening Digitized Newspapers Corpora: Europeana’s Full-text Data Interoperabi...Nuno Freire
 
Aggregation of Linked Data A case study in the cultural heritage domain
Aggregation of Linked Data A case study in the cultural heritage domainAggregation of Linked Data A case study in the cultural heritage domain
Aggregation of Linked Data A case study in the cultural heritage domainNuno Freire
 
Aggregation of cultural heritage datasets through the Web of Data
Aggregation of cultural heritage datasets through the Web of DataAggregation of cultural heritage datasets through the Web of Data
Aggregation of cultural heritage datasets through the Web of DataNuno Freire
 
Building new knowledge from distributed scientific corpus: HERBADROP & EUROPE...
Building new knowledge from distributed scientific corpus: HERBADROP & EUROPE...Building new knowledge from distributed scientific corpus: HERBADROP & EUROPE...
Building new knowledge from distributed scientific corpus: HERBADROP & EUROPE...Nuno Freire
 
Use Cases From Digital Humanities for Library Linked Data
Use Cases From Digital Humanities for Library Linked DataUse Cases From Digital Humanities for Library Linked Data
Use Cases From Digital Humanities for Library Linked DataNuno Freire
 

More from Nuno Freire (11)

Aggregation of Schema.org Linked Data for the Europeana Common Culture project
Aggregation of Schema.org Linked Data for the Europeana Common Culture projectAggregation of Schema.org Linked Data for the Europeana Common Culture project
Aggregation of Schema.org Linked Data for the Europeana Common Culture project
 
Connecting Europe Facility - The eArchiving Building Block
Connecting Europe Facility - The eArchiving Building BlockConnecting Europe Facility - The eArchiving Building Block
Connecting Europe Facility - The eArchiving Building Block
 
Automated interpretability of linked data ontologies: an evaluation within th...
Automated interpretability of linked data ontologies: an evaluation within th...Automated interpretability of linked data ontologies: an evaluation within th...
Automated interpretability of linked data ontologies: an evaluation within th...
 
Next Generation Research with Europeana: the Humanities and Cultural Heritage...
Next Generation Research with Europeana: the Humanities and Cultural Heritage...Next Generation Research with Europeana: the Humanities and Cultural Heritage...
Next Generation Research with Europeana: the Humanities and Cultural Heritage...
 
Demo of the Data Aggregation Lab - June 2018
Demo of the Data Aggregation Lab - June 2018Demo of the Data Aggregation Lab - June 2018
Demo of the Data Aggregation Lab - June 2018
 
Demo of the Data Aggregation Lab - October 2018
Demo of the Data Aggregation Lab - October 2018Demo of the Data Aggregation Lab - October 2018
Demo of the Data Aggregation Lab - October 2018
 
Opening Digitized Newspapers Corpora: Europeana’s Full-text Data Interoperabi...
Opening Digitized Newspapers Corpora: Europeana’s Full-text Data Interoperabi...Opening Digitized Newspapers Corpora: Europeana’s Full-text Data Interoperabi...
Opening Digitized Newspapers Corpora: Europeana’s Full-text Data Interoperabi...
 
Aggregation of Linked Data A case study in the cultural heritage domain
Aggregation of Linked Data A case study in the cultural heritage domainAggregation of Linked Data A case study in the cultural heritage domain
Aggregation of Linked Data A case study in the cultural heritage domain
 
Aggregation of cultural heritage datasets through the Web of Data
Aggregation of cultural heritage datasets through the Web of DataAggregation of cultural heritage datasets through the Web of Data
Aggregation of cultural heritage datasets through the Web of Data
 
Building new knowledge from distributed scientific corpus: HERBADROP & EUROPE...
Building new knowledge from distributed scientific corpus: HERBADROP & EUROPE...Building new knowledge from distributed scientific corpus: HERBADROP & EUROPE...
Building new knowledge from distributed scientific corpus: HERBADROP & EUROPE...
 
Use Cases From Digital Humanities for Library Linked Data
Use Cases From Digital Humanities for Library Linked DataUse Cases From Digital Humanities for Library Linked Data
Use Cases From Digital Humanities for Library Linked Data
 

Digital Libraries Conference Focuses on Metadata Aggregation

  • 1. International Conference on Theory and Practice of Digital Libraries, September 2017
  • 2. Outline (Metadata aggregation of IIIF Resources at Europeana, CC BY-SA)
      ● Motivation for rethinking metadata aggregation approaches
          • Focus: technology adoption in Cultural Heritage/Europeana
      ● Investigated technologies: IIIF and Sitemaps
      ● Case studies
      ● Application of the results in aggregation at Europeana
      ● Ongoing and future work
  • 3. Motivation in the context of Cultural Heritage (image: Coloured etchings, Preissig, Vojtech, 1887, Uměleckoprůmyslové museum v Praze, Czech Republic, PD)
  • 4. Europeana: The Platform for Europe’s Digital Cultural Heritage
      ● Europeana aggregates (and makes available) metadata:
          • From all EU countries
          • From ~3,500 galleries, libraries, archives and museums
          • Under a CC0 licence
          • More than 54M objects
          • In about 50 languages
      “We transform the world with culture! We want to build on Europe’s rich heritage and make it easier for people to use, whether for work, for learning or just for fun.”
  • 5. What kinds of technologies are we considering?
      ● Focus on technology adoption:
          • Technologies that present low barriers for adoption by data providers
      ● Technologies used by Cultural Heritage institutions for other purposes
          • Search engine optimization
          • Linked data
          • Social web technologies
          • IIIF
      ● What are the successors of OAI-PMH?
  • 6. Investigated technologies: IIIF (image: Cristallisation ou Mouvement du temps, René Bord, 1987, Bibliothèque Municipale De Lyon, public domain)
  • 7. Europeana & IIIF
      • Brief introduction to the IIIF APIs
      • How can IIIF be used for metadata aggregation?
  • 8. Object = Image + Presentation (slide credits: Ben Albritton @bla222, Mike Appleby @mikeapps, Tom Cramer @tcramer, Jon Stroop @jpstroop, Rob Sanderson @azaroth42, Stu Snydman @stusnydman, Simeon Warner @zimeon, IIIF.io @iiif_io)
  • 9. Object = Image + Presentation
      ● Presentation API (to be continued)
          • Descriptive: label, description
          • Rights: license, attribution
      ● Image API
          • Image data
  • 10. Presentation API (continued)
      ● Structure
          • Collections of objects
          • Manifests organizing Items, Sequences, Parts together with their metadata
      ● Linking
          • service: additional service endpoint
          • related: resource to display to the user
          • seeAlso: semantic metadata resource
  • 11. Investigated technologies: Sitemaps (image: Cristallisation ou Mouvement du temps, René Bord, 1987, Bibliothèque Municipale De Lyon, public domain)
  • 12. Sitemaps
      ● Sitemaps allow webmasters to inform search engines about pages on their sites that are available for crawling
      ● Sitemaps are supported/used by:
          • all major search engines
          • many content management systems
          • many Europeana data providers
      ● Sitemaps provide a simple technological solution with a very low implementation barrier
      ● Sitemaps can support a large range of resource types
          • Sitemaps have extensions for images and videos (defined by Google)
  • 13. Case studies (image: Tak met vier magnolia’s, Anonymous, 1910-1925, Rijksmuseum, Netherlands, Public Domain)
  • 14. First case study: Crawling services across the IIIF universe
      Questions addressed:
          • Can Europeana find the available IIIF services through IIIF Service Registries?
          • Is the output of IIIF crawlable? Can robots follow links in IIIF output and reach all resources?
          • How mature and uniform are existing IIIF implementations?
          • Is metadata available?
          • Are machine-readable licenses available?
  • 15. First case study: Crawling services across the IIIF universe
      Main conclusions:
          • Registries are available and are machine readable, but coverage was only partial
          • IIIF provides all that is necessary, but some features are optional (e.g. IIIF Collections)
          • Only minor compliance problems were found, due to the immaturity of the implementations
          • IIIF provides a way to link to metadata, but it is optional (and often not used, misused, or not fully informative)
          • IIIF provides licensing information, but it is optional (and often not used)
  • 16. Case studies with Europeana Partners (image: Tak met vier magnolia’s, Anonymous, 1910-1925, Rijksmuseum, Netherlands, Public Domain)
  • 17. Case studies with partners (Europeana & IIIF)
      To study the feasibility of performing metadata aggregation via IIIF/Sitemaps, we have undertaken case studies with providers of the Europeana Network:
      ● National Library of Wales
          • Very active in the IIIF community
          • Very advanced in IIIF implementation
          • Expertise in full-text content (over IIIF)
      ● University College Dublin
          • Very advanced in IIIF implementation
          • Expertise in internet search engine optimization (Sitemaps and its media-specific extensions)
  • 18. Case studies with National Library of Wales and University College Dublin
      ● Crawling IIIF services via IIIF Collections
      ● Crawling IIIF services via Sitemaps
          • Standard Sitemaps
          • Sitemaps extended with elements used in IIIF specifications
          • Sitemaps extended with elements from the ResourceSync namespace
      ● Crawling IIIF services via IIIF Collections and HTTP cache headers
          • HTTP cache headers allow crawlers to use resource modification timestamps
          • Timestamps are essential for aggregating large collections
  • 19. Main conclusions from the case studies
      ● Applying these technologies was straightforward for providers
      ● When providers have in-house knowledge of a technology, its adoption/adaptation is simplified
      ● None of the case studies presented serious technological obstacles
      ● Very simple technological solutions are available
      ● Only very large collections may require additional complexity
      ● The main challenge is to choose among the several possibilities and establish a standard (or best practice) within the community(ies):
          • Europeana is working with the IIIF community in the context of the IIIF Discovery Technical Specification group
          • Europeana will prepare recommendations targeted at its own partner network
  • 20. Application of the results (image: Cristallisation ou Mouvement du temps, René Bord, 1987, Bibliothèque Municipale De Lyon, public domain)
  • 21. Operational IIIF/Sitemaps harvests so far @Europeana
      The outcomes of the case studies have resulted in real cases of IIIF/Sitemaps-based aggregation into Europeana:
      ● National Library of Wales
          • Sitemaps + IIIF
      ● University College Dublin
          • Sitemaps + IIIF + Sitemaps Video Extension
      ● Wellcome Library
          • IIIF Collection + IIIF
  • 22. Future work (image: Chat "regardant" à travers une longue-vue et autre chat perché dessus, Agence Rol. Agence photographique, Bibliothèque nationale de France, France, Public Domain)
  • 23. R&D ongoing work
      Crawling websites/LOD/IIIF in search of resources represented with Schema.org
      ● Research question:
          • Can metadata still comply with the requirements of Europeana/EDM when represented with Schema.org? If so, with what level of quality?
      ● One IIIF case study is in progress at this time
          • IIIF provider: North Carolina State University Libraries
  • 24. Future work
      ● Research the implications of IIIF and Sitemaps harvesting for the internal workflows of aggregators
      ● ResourceSync: one case study in preparation with a collection of more than 600,000 resources
      ● Continue monitoring and investigating technology trends in our domain:
          • Follow the outcomes from the IIIF Discovery Technical Specification Group [1]
          • The Linked Data Platform [2]
          • Use of notification frameworks for metadata aggregation: WebSub [3], Linked Data Notifications [4]
  • 25. Thank you for your attention
      nuno.freire@tecnico.ulisboa.pt
      (image: Arrival of a Portuguese ship, Anonymous, 1660 - 1625, Rijksmuseum, Netherlands, Public Domain)
      Acknowledgments
          • Valentine Charles, Europeana Foundation
          • Fundação para a Ciência e a Tecnologia (FCT): UID/CEC/50021/2013
          • European Commission: grant agreement number CEF-TC-2015-1-01
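
Slides 10 and 18 describe crawling IIIF services via IIIF Collections and reaching metadata through the seeAlso link. The following is a minimal sketch of that idea, not the crawler used at Europeana. It assumes the property names of the IIIF Presentation API 2.x (`collections`, `manifests`, `@id`, `seeAlso`); the `example.org` URLs, the `FAKE_WEB` dictionary and the function names are hypothetical, and `fetch` stands in for an HTTP GET that returns parsed JSON.

```python
from typing import Callable, Iterator, List, Optional, Set


def as_list(value) -> list:
    """IIIF properties may hold a single value or a list; normalise to a list."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]


def walk_collection(url: str, fetch: Callable[[str], dict],
                    seen: Optional[Set[str]] = None) -> Iterator[dict]:
    """Yield every manifest reachable from a collection, visiting each URL once."""
    seen = set() if seen is None else seen
    if url in seen:
        return
    seen.add(url)
    collection = fetch(url)
    for ref in as_list(collection.get("manifests")):
        if ref["@id"] not in seen:
            seen.add(ref["@id"])
            yield fetch(ref["@id"])
    for ref in as_list(collection.get("collections")):
        yield from walk_collection(ref["@id"], fetch, seen)


def metadata_links(manifest: dict) -> List[str]:
    """Return the seeAlso URLs of a manifest (seeAlso is optional in IIIF)."""
    links = []
    for entry in as_list(manifest.get("seeAlso")):
        if isinstance(entry, str):
            links.append(entry)
        elif isinstance(entry, dict) and "@id" in entry:
            links.append(entry["@id"])
    return links


# Hypothetical in-memory "web" so the sketch runs without network access.
FAKE_WEB = {
    "https://example.org/top": {
        "collections": [{"@id": "https://example.org/sub"}],
        "manifests": [{"@id": "https://example.org/m1"}],
    },
    "https://example.org/sub": {
        "manifests": [{"@id": "https://example.org/m2"}],
    },
    "https://example.org/m1": {
        "@id": "https://example.org/m1",
        "seeAlso": {"@id": "https://example.org/m1.rdf"},
    },
    "https://example.org/m2": {"@id": "https://example.org/m2"},  # no seeAlso
}

manifests = list(walk_collection("https://example.org/top", FAKE_WEB.__getitem__))
links = {m["@id"]: metadata_links(m) for m in manifests}
```

The second manifest yields an empty list of links, which mirrors the finding on slide 15 that the metadata link is optional and often not used.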
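
Slides 12 and 18 present standard Sitemaps as one way to discover IIIF resources. As a rough illustration, the sketch below parses a standard sitemap and keeps only the URLs modified after the previous harvest. It assumes the `urlset`/`url`/`loc`/`lastmod` elements of the Sitemaps protocol; the sample document, the URLs and the function name `changed_since` are hypothetical, and entries without `lastmod` are treated as changed, which is a design choice made here rather than something stated in the slides.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone
from typing import List

# Namespace of the standard Sitemaps protocol.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def changed_since(sitemap_xml: str, last_harvest: datetime) -> List[str]:
    """Return the <loc> values whose <lastmod> is newer than last_harvest."""
    root = ET.fromstring(sitemap_xml.strip())
    urls = []
    for entry in root.findall("sm:url", NS):
        loc = entry.findtext("sm:loc", namespaces=NS)
        if loc is None:
            continue
        lastmod = entry.findtext("sm:lastmod", namespaces=NS)
        if lastmod is None:
            # No timestamp: we cannot tell, so harvest it again.
            urls.append(loc.strip())
            continue
        # Older Python versions do not accept a trailing "Z" in fromisoformat.
        modified = datetime.fromisoformat(lastmod.strip().replace("Z", "+00:00"))
        if modified.tzinfo is None:
            modified = modified.replace(tzinfo=timezone.utc)
        if modified > last_harvest:
            urls.append(loc.strip())
    return urls


# Hypothetical sitemap listing three IIIF manifests.
SAMPLE_SITEMAP = """
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.org/iiif/1/manifest</loc><lastmod>2017-05-01</lastmod></url>
  <url><loc>https://example.org/iiif/2/manifest</loc><lastmod>2017-09-10T08:00:00Z</lastmod></url>
  <url><loc>https://example.org/iiif/3/manifest</loc></url>
</urlset>
"""

to_harvest = changed_since(SAMPLE_SITEMAP, datetime(2017, 9, 1, tzinfo=timezone.utc))
```

With the sample above, the first manifest is skipped because it predates the harvest date, while the second (newer) and third (no timestamp) are selected.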
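
Slide 18 notes that HTTP cache headers let crawlers use resource modification timestamps. One possible way to apply this, sketched here under assumptions, is to send an `If-Modified-Since` request header and to compare a `Last-Modified` response header with the time of the previous harvest. The function names are hypothetical and no HTTP request is made; only the standard-library date helpers are used.

```python
from datetime import datetime, timezone
from email.utils import format_datetime, parsedate_to_datetime
from typing import Dict, Mapping


def conditional_headers(last_harvest: datetime) -> Dict[str, str]:
    """Build the request header asking the server to skip unchanged resources."""
    as_utc = last_harvest.astimezone(timezone.utc)
    return {"If-Modified-Since": format_datetime(as_utc, usegmt=True)}


def needs_reharvest(response_headers: Mapping[str, str],
                    last_harvest: datetime) -> bool:
    """Decide from Last-Modified whether a resource changed since last_harvest."""
    value = response_headers.get("Last-Modified")
    if value is None:
        # The server gave no timestamp, so assume the resource may have changed.
        return True
    return parsedate_to_datetime(value) > last_harvest


last_harvest = datetime(2017, 9, 1, tzinfo=timezone.utc)
request_headers = conditional_headers(last_harvest)
changed = needs_reharvest({"Last-Modified": "Sun, 10 Sep 2017 08:00:00 GMT"}, last_harvest)
unchanged = needs_reharvest({"Last-Modified": "Mon, 01 May 2017 00:00:00 GMT"}, last_harvest)
```

A server that supports conditional requests would typically answer `304 Not Modified` to such a request when nothing changed, which is what keeps re-crawling a large collection cheap.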