SlideShare uma empresa Scribd logo
1 de 43
Baixar para ler offline
graphthinking
a Repository for Electronic
Documents and Bibliographic Data
Felix Ostrowski (graphthinking, @literarymachine)
Jan Schnasse (hbz, @InspektorHicks)
ELAG, June 11th 2014, University of Bath
graphthinking
Rationale: A new foundation for
Edoweb
● A system to gather, describe and archive
deposit copies of electronic publications
and websites on behalf of the State Library
Center of Rhineland-Palatinate (LBZ)
● Operated by the North Rhine-Westphalian
Library Service Center (hbz) since 2002
● Technical evolution: OPUS – Digitool – regal
graphthinking
The current system and its
shortcomings: Digitool
● Digitool end-of-life is coming
● Unwanted/unexpected dependencies to other projects
hosted on the same Digitool instance
● Performance issues (we have millions of objects in
Digitool)
● No easily configurable search indexes or OAI-PMH
interfaces for single collections
● No out-of-the-box support of regional requirements (e.g.
metadata delivery to German National Library), extra
money/developer hours needed
graphthinking
The current system and its
shortcomings: Homemade
● Mix of self developed and Ex Libris components
● Vicious circle
– introduction of workarounds
– unpredictable migration costs
– decision to stay on obsolete version
– running out of support
– introduction of workarounds
● Administrative responsibilities in different hbz
working groups
graphthinking
Altogether, this leads to a expensive, hard to
maintain and outdated system that doesn't
satisfy our and ours clients needs.
graphthinking
The following aspects are
mandatory to achieve our goals
● Increase the overall performance
● Provide an up-to-date, modern user interface
● Use open source software (Fedora, Elasticsearch, Drupal)
●
Seamlessly import (meta-)data from Digitool and potentially other
(repository) systems
● Integrate the system with the emerging Linked-Open-Data
ecosystem, especially authority data
● Loosen the tight integration with Ex Libris Aleph
● Expose (meta-)data for easy discovery & re-use by others.
graphthinking
Overview of the new architecture
regal (backend)
Fedora Elasticsearch
regal-drupal (frontend)
Ex Libris
Aleph
lobid API
graphthinking
Data model
● Simple hierarchical data model consists of nodes
associated via hasPart and partOf relations
● Each node is identified by a namespace
combined with a Universally Unique Identifier
(UUID)
● Each node can have a bit and a metadata
stream
● Metadata canonically stored as RDF N-triples
● Bitstream can contain arbitrary data
graphthinking
graphthinking
Fedora (3.7.1)
● mainly used to organize and associate
multiple datastreams and their versions
● provides a long term accessible data storage
● usage of Proai as OAI-PMH solution
graphthinking
Elasticsearch (1.1.0)
● Used to provide performant lookup (for
metadata and full-text)
● Stores compacted JSON-LD
● Faceting can be used to browse the collection
graphthinking
Backend / API
● Java Web API (RESTful) implemented with
Jersey
● Abstracts access to storage & indexing,
transparently updates Fedora and different
Elasticsearch indexes
● Provides resources as OAI-ORE aggregations
graphthinking
Drupal Frontend
● Re-use of common features
– User management
– Template-system
– Field API
– RDF Mappings
– HTML-Form API
● Extended with custom modules for
– Storage Backend
– Linked Data Fields
– JavaScript UI enhancements
graphthinking
No big surprises for plain
text input...
graphthinking
Catalinking
graphthinking
Simple lookup widget with
configurable data sources
(currently only lobid-API
is implemented)
graphthinking
graphthinking
Additional linked data is
integrated on-the-fly
graphthinking
graphthinking
Client-side sorting (and
soon also searching) of
linked data
graphthinking
Exposing data
graphthinking
graphthinking
graphthinking
Importing data
graphthinking
This is simply a shortcut,
any linked data URI can
be used.
graphthinking
Tada!
graphthinking
graphthinking
Managing structure
graphthinking
Possible child nodes, in case
of a monograph these are
only files. Journals provide more
complex structures (volumes,
issues, articles).
graphthinking
graphthinking
Basic technical metadata
added by the backend.
graphthinking
Move object by settings
its new parent.
graphthinking
Faceted search, brought to us by Elasticsearch
graphthinking
Facets can be added and removed individually.
graphthinking
graphthinking
Anybody can say anything about anything...
graphthinking
Local views on remote resources,
e.g. authors and classifications.
graphthinking
Obstacles encountered / lessons
learned: Drupal
● is designed to be standalone, so we basically
have two backends
● its HTML Form API can be awkward to work
with if you don't want to do things the
"Drupal-way"
● a pure JavaScript / HTML5 frontend might
replace Drupal in upcoming versions
graphthinking
Obstacles encountered / lessons
learned: Fedora
● is more of an infrastructure than a storage
system
● because of its complexity, we consider
authorization via XACML a big disadvantage
● OAI-PMH is also not supported very well
● we are still looking for a more lightweight
solution
● perhaps as lightweight as simply using the file
system for both bitstreams and metadata
graphthinking
Obstacles encountered / lessons
learned: Elasticsearch
● Works very well with JSON-LD in general
● but needs some care to create proper
mappings
● and could use a more generic notion of
relations than only parent/child.
graphthinking
Further regal applications
● Migrate further
Digitool and
non-Digitool
repositories
● Frontend: Prototype
of an OER World Map
graphthinking
Good news: Linked Data Works!
● regal / Edoweb is not a research project,
● it is integrated into the hbz IT landscape,
● it is on the web,
● it does not require expertise in Linked Data,
● and real librarians will use it to create real
catalog entries.
graphthinking
Thank you!
Questions? Now or later to
felix.ostrowski@gmail.com
schnasse@hbz-nrw.de

Mais conteúdo relacionado

Mais procurados

Why is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz IncWhy is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz Inc
Franz Inc. - AllegroGraph
 

Mais procurados (20)

HypergraphDB
HypergraphDBHypergraphDB
HypergraphDB
 
GraphQL & DGraph with Go
GraphQL & DGraph with GoGraphQL & DGraph with Go
GraphQL & DGraph with Go
 
DGraph: Introduction To Basics & Quick Start W/Ratel
DGraph: Introduction To Basics & Quick Start W/RatelDGraph: Introduction To Basics & Quick Start W/Ratel
DGraph: Introduction To Basics & Quick Start W/Ratel
 
Linked data-tooling-xml
Linked data-tooling-xmlLinked data-tooling-xml
Linked data-tooling-xml
 
Lightweight Collection and Storage of Software Repository Data with DataRover
Lightweight Collection and Storage of  Software Repository Data with DataRoverLightweight Collection and Storage of  Software Repository Data with DataRover
Lightweight Collection and Storage of Software Repository Data with DataRover
 
Publishing RDF SKOS with microservices
Publishing RDF SKOS with microservicesPublishing RDF SKOS with microservices
Publishing RDF SKOS with microservices
 
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryVisual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
 
Legislation.gov.uk
Legislation.gov.ukLegislation.gov.uk
Legislation.gov.uk
 
DBpedia Japanese
DBpedia JapaneseDBpedia Japanese
DBpedia Japanese
 
Querying GrAF data in linguistic analysis
Querying GrAF data in linguistic analysisQuerying GrAF data in linguistic analysis
Querying GrAF data in linguistic analysis
 
LD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and toolsLD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and tools
 
HyperGraphQL
HyperGraphQLHyperGraphQL
HyperGraphQL
 
Sasaki practical-linked-data
Sasaki practical-linked-dataSasaki practical-linked-data
Sasaki practical-linked-data
 
DBpedia Viewer - LDOW 2014
DBpedia Viewer - LDOW 2014DBpedia Viewer - LDOW 2014
DBpedia Viewer - LDOW 2014
 
20181215 introduction to graph databases
20181215   introduction to graph databases20181215   introduction to graph databases
20181215 introduction to graph databases
 
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
 
Tutorial4
Tutorial4Tutorial4
Tutorial4
 
Why is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz IncWhy is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz Inc
 
Semantika Introduction
Semantika IntroductionSemantika Introduction
Semantika Introduction
 
A primer on building real time data-driven products
A primer on building real time data-driven productsA primer on building real time data-driven products
A primer on building real time data-driven products
 

Destaque (8)

Current LOD Trends and Developments in the German Library Ecosystem
Current LOD Trends and Developments in the German Library EcosystemCurrent LOD Trends and Developments in the German Library Ecosystem
Current LOD Trends and Developments in the German Library Ecosystem
 
A Linked Data based index of library institutions
A Linked Data based index of library institutionsA Linked Data based index of library institutions
A Linked Data based index of library institutions
 
Report on m library activity August 2012
Report on m library activity August 2012Report on m library activity August 2012
Report on m library activity August 2012
 
RDF Transformations
RDF TransformationsRDF Transformations
RDF Transformations
 
Building an editable, versionized LOD service for library data
Building an editable, versionized LOD service for library dataBuilding an editable, versionized LOD service for library data
Building an editable, versionized LOD service for library data
 
Indexing
IndexingIndexing
Indexing
 
Indexing
IndexingIndexing
Indexing
 
5013 Indexing Presentation
5013 Indexing Presentation5013 Indexing Presentation
5013 Indexing Presentation
 

Semelhante a Regal - a Repository for Electronic Documents and Bibliographic Data

Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012
scorlosquet
 
Find your way in Graph labyrinths
Find your way in Graph labyrinthsFind your way in Graph labyrinths
Find your way in Graph labyrinths
Daniel Camarda
 

Semelhante a Regal - a Repository for Electronic Documents and Bibliographic Data (20)

Understanding Hadoop
Understanding HadoopUnderstanding Hadoop
Understanding Hadoop
 
Apache Spark 101 - Demi Ben-Ari
Apache Spark 101 - Demi Ben-AriApache Spark 101 - Demi Ben-Ari
Apache Spark 101 - Demi Ben-Ari
 
Apache Hive for modern DBAs
Apache Hive for modern DBAsApache Hive for modern DBAs
Apache Hive for modern DBAs
 
Prashant_Agrawal_CV
Prashant_Agrawal_CVPrashant_Agrawal_CV
Prashant_Agrawal_CV
 
Oslo bekk2014
Oslo bekk2014Oslo bekk2014
Oslo bekk2014
 
Ontology Access Kit_ Workshop Intro Slides.pptx
Ontology Access Kit_ Workshop Intro Slides.pptxOntology Access Kit_ Workshop Intro Slides.pptx
Ontology Access Kit_ Workshop Intro Slides.pptx
 
The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013
 
Apache Flink: Past, Present and Future
Apache Flink: Past, Present and FutureApache Flink: Past, Present and Future
Apache Flink: Past, Present and Future
 
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open DataMuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
 
Intro to Apache Hadoop
Intro to Apache HadoopIntro to Apache Hadoop
Intro to Apache Hadoop
 
Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012
 
Top 10 Data analytics tools to look for in 2021
Top 10 Data analytics tools to look for in 2021Top 10 Data analytics tools to look for in 2021
Top 10 Data analytics tools to look for in 2021
 
Presto
PrestoPresto
Presto
 
AirBNB's ML platform - BigHead
AirBNB's ML platform - BigHeadAirBNB's ML platform - BigHead
AirBNB's ML platform - BigHead
 
Bighead: Airbnb’s End-to-End Machine Learning Platform with Krishna Puttaswa...
 Bighead: Airbnb’s End-to-End Machine Learning Platform with Krishna Puttaswa... Bighead: Airbnb’s End-to-End Machine Learning Platform with Krishna Puttaswa...
Bighead: Airbnb’s End-to-End Machine Learning Platform with Krishna Puttaswa...
 
Apache Marmotta - Introduction
Apache Marmotta - IntroductionApache Marmotta - Introduction
Apache Marmotta - Introduction
 
Integrating Flink with Hive, Seattle Flink Meetup, Feb 2019
Integrating Flink with Hive, Seattle Flink Meetup, Feb 2019Integrating Flink with Hive, Seattle Flink Meetup, Feb 2019
Integrating Flink with Hive, Seattle Flink Meetup, Feb 2019
 
An Introduction to Pentaho Kettle
An Introduction to Pentaho KettleAn Introduction to Pentaho Kettle
An Introduction to Pentaho Kettle
 
Find your way in Graph labyrinths
Find your way in Graph labyrinthsFind your way in Graph labyrinths
Find your way in Graph labyrinths
 
Analyzing Data at Scale with Apache Spark
Analyzing Data at Scale with Apache SparkAnalyzing Data at Scale with Apache Spark
Analyzing Data at Scale with Apache Spark
 

Último

Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
ydyuyu
 
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu DhabiAbu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Monica Sydney
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
JOHNBEBONYAP1
 
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
ayvbos
 
一比一原版田纳西大学毕业证如何办理
一比一原版田纳西大学毕业证如何办理一比一原版田纳西大学毕业证如何办理
一比一原版田纳西大学毕业证如何办理
F
 
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsIndian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Monica Sydney
 

Último (20)

Real Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtReal Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirt
 
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime NagercoilNagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
 
20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf
 
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
 
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
 
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu DhabiAbu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
 
Call girls Service in Ajman 0505086370 Ajman call girls
Call girls Service in Ajman 0505086370 Ajman call girlsCall girls Service in Ajman 0505086370 Ajman call girls
Call girls Service in Ajman 0505086370 Ajman call girls
 
Story Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr
Story Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrStory Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr
Story Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr
 
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
 
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
 
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53
 
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
 
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
 
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency""Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
 
一比一原版田纳西大学毕业证如何办理
一比一原版田纳西大学毕业证如何办理一比一原版田纳西大学毕业证如何办理
一比一原版田纳西大学毕业证如何办理
 
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsIndian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
 
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
 

Regal - a Repository for Electronic Documents and Bibliographic Data