5. Metadata, Vocabularies, Ontologies
Data about data.
Metadata is structured information that
makes it easier to retrieve, use, or manage
a heritage information resource.
Metadata
Metadata Mapping
Meta‐vocabularies
SKOS
Dublin Core
FOAF
Domain vocabularies
www.vestforsk.no
6. Why open data?
Data has more value than applications
Data ages like …
Software applications age like …
Data gets used more when it is easier to use
7. Open Data
“A piece of content or data is open if
anyone is free to use, reuse, and
redistribute it — subject only, at most, to
the requirement to attribute and share-alike.”
http://opendefinition.org
15. Current Web = internet + links + documents
The current Web represents information
using:
Natural language (e.g., English, Norwegian, etc.)
Graphics, multimedia
Page layout
Okay for humans
Difficult for machine processing
16. What is the problem?
The Web has problems
People aren’t interested in documents
• They are interested in things
People can parse documents and
extract meaning
• Web pages are written in HTML
• HTML describes how information is displayed, not what it means
• Computers can’t extract that meaning!
17. What do we need to do?
We need to help machines to understand
the Web so machines can help us
understand things
They can learn what we are interested in
They can help us better find what we want
18. How can we do that?
Besides publishing documents on the
Web
which computers can’t understand easily
Let’s publish something that computers
can understand
19. Current Data on the Web
Relational Databases
APIs
XML
XLS
CSV
…
Can’t machines and applications already consume that
data on the Web?
20. Sure!
However, it is available in distinct formats
and data models
24. Yes,
We have a standardized way of publishing documents on the Web:
HTML
25. Then why can’t we have a standard procedure for publishing data on the Web?
26. Resource Description Framework (RDF)
A data model
A way to model data
e.g., relational databases use the relational data model
RDF is a triple data model
Labeled graph
Subject, Predicate, Object
<Hans> <lives in> <Sogndal>
<Oslo> <is capital of> <Norway>
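The triple model above can be sketched in a few lines of Python. This is only an illustration, assuming plain strings stand in for RDF terms (real RDF identifies things with URIs); the `match` helper is a hypothetical name, not part of any RDF library.

```python
# Each statement is a (subject, predicate, object) triple, as on the slide.
triples = {
    ("Hans", "lives in", "Sogndal"),
    ("Oslo", "is capital of", "Norway"),
}

def match(graph, s=None, p=None, o=None):
    """Return the triples that fit a pattern; None acts as a wildcard."""
    return sorted(
        t for t in graph
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )

# Who lives where?
print(match(triples, p="lives in"))
```

Because every statement has the same three-part shape, one generic query function covers any kind of data, which is the point of a shared data model.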
27. So does that mean that everyone must publish their data in RDF?
28. It is not mandatory…
…however, we would like everyone to publish data in RDF…
31. Databases store documents
THINGS have PROPERTIES:
A Photo has a Title, a Photographer, …
This is a THING:
A photograph of Anna Rogstad by L. Szacinski, …
ID              Name          Photographer  PublisherID  ReleasedDate
80685-1-nor-NO  Anna Rogstad  L. Szacinski  1            1914
…               …             …             …            …
PublisherID  PublisherName
1            Riksarkivet
…            …
32. Representing the data in RDF
photo --name--> Anna Rogstad
photo --photographer--> L. Szacinski
photo --ID--> 80685-1-nor-NO
photo --publisher--> Publisher
Publisher --name--> Riksarkivet
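As a rough sketch of how the two database tables from slide 31 turn into this graph, the Python below converts rows into triples. The identifiers `photo/…` and `publisher/…` and the function name `rows_to_triples` are assumptions made for the illustration; the slides do not prescribe a conversion procedure.

```python
# Rows from the photo table and the publisher table, as plain Python data.
photos = [
    {"ID": "80685-1-nor-NO", "Name": "Anna Rogstad",
     "Photographer": "L. Szacinski", "PublisherID": 1},
]
publishers = {1: "Riksarkivet"}

def rows_to_triples(photos, publishers):
    """Turn each column value into one (subject, predicate, object) statement."""
    triples = []
    for row in photos:
        photo = "photo/" + row["ID"]                         # hypothetical identifier
        publisher = "publisher/" + str(row["PublisherID"])   # hypothetical identifier
        triples.append((photo, "name", row["Name"]))
        triples.append((photo, "photographer", row["Photographer"]))
        triples.append((photo, "ID", row["ID"]))
        # The foreign key becomes a link between two things.
        triples.append((photo, "publisher", publisher))
        triples.append((publisher, "name", publishers[row["PublisherID"]]))
    # Drop duplicates (a publisher shared by many photos) but keep the order.
    return list(dict.fromkeys(triples))

for triple in rows_to_triples(photos, publishers):
    print(triple)
```

The foreign key `PublisherID` is the interesting case: in the table it is just a number, while in the graph it becomes an explicit, labeled link.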
33. We are on the Web
Everything on the Web is identified by a
URI
34. Uniform Resource Identifier (URI)
URIs are the basis for providing useful Linked Open Data,
so think carefully about the URI scheme you will follow for your entities.
It is usually a good idea to separate the ontology from the actual data
instances;
for example, geo.linkeddata.no follows this scheme:
• http://geo.linkeddata.no/ontology/ClassName (for Concepts)
• http://geo.linkeddata.no/ontology/property (for Properties)
• http://geo.linkeddata.no/resource/InstanceName (for data instances)
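The scheme above could be captured in a few helper functions, sketched here in Python. The function names and the example names (`Municipality`, `locatedIn`, `Sogndal`) are hypothetical and only illustrate the pattern; they are not taken from geo.linkeddata.no itself.

```python
from urllib.parse import quote

BASE = "http://geo.linkeddata.no"

def concept_uri(class_name):
    """URI for a concept (class) in the ontology."""
    return f"{BASE}/ontology/{quote(class_name)}"

def property_uri(property_name):
    """URI for a property in the ontology."""
    return f"{BASE}/ontology/{quote(property_name)}"

def instance_uri(instance_name):
    """URI for a data instance, kept apart from the ontology."""
    return f"{BASE}/resource/{quote(instance_name)}"

print(concept_uri("Municipality"))   # hypothetical class name
print(property_uri("locatedIn"))     # hypothetical property name
print(instance_uri("Sogndal"))       # hypothetical instance name
```

Generating every URI through one small module keeps the scheme consistent, and `quote` percent-encodes characters such as spaces that may not appear literally in a URI.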
Also decide on the dereferencing method,
i.e. how to serve resources once a consumer of the information has
requested them via HTTP:
303 redirection or hash URIs
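The two dereferencing options can be contrasted with a small Python sketch. It does no real networking; the `/data/` and `/page/` paths, the `example.org` URI, and both function names are assumptions chosen for the illustration.

```python
from urllib.parse import urldefrag, urlsplit

RDF_TYPES = ("text/turtle", "application/rdf+xml")

def dereference_hash(uri):
    """Hash URI: the client drops the #fragment and fetches the document."""
    document_url, _fragment = urldefrag(uri)
    return document_url

def dereference_303(uri, accept):
    """303 URI: the server redirects from the thing to a document about it."""
    path = urlsplit(uri).path
    if not path.startswith("/resource/"):
        return 404, None
    name = path[len("/resource/"):]
    # Hypothetical layout: /data/ serves RDF, /page/ serves HTML.
    wants_rdf = any(media_type in accept for media_type in RDF_TYPES)
    return 303, ("/data/" if wants_rdf else "/page/") + name

print(dereference_hash("http://example.org/ontology#ClassName"))
print(dereference_303("http://geo.linkeddata.no/resource/Sogndal", "text/turtle"))
print(dereference_303("http://geo.linkeddata.no/resource/Sogndal", "text/html"))
```

Hash URIs need no server logic, since the fragment never reaches the server, while 303 redirection costs an extra round trip but lets each resource have its own documents.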
35. So, link the data to other data
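http://www.arkivverket.no/var/... --name--> Anna Rogstad
http://www.arkivverket.no/var/... --photographer--> L. Szacinski
http://www.arkivverket.no/var/... --ID--> 80685-1-nor-NO
http://www.arkivverket.no/var/... --publisher--> http://www.arkivverket.no
http://www.arkivverket.no --name--> Riksarkivet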
36. Now consider the data from http://kvinnesak.no/
http://www.arkivverket.no/../Takkebrevet --Author--> http://www.arkivverket.no/var/...
http://www.arkivverket.no/../Takkebrevet --description--> Thanking letter
http://www.arkivverket.no/../Takkebrevet --archiver--> http://…/archiver
http://…/archiver --name--> Randi Blehr
The letter by Anna Rogstad in which she appreciates the appointment
37. Link data further
[Figure: the letter data and the photo data joined into one graph. Resources: http://www.arkivverket.no/../Takkebrevet, http://www.arkivverket.no/var/... (shown twice), http://…/archiver, http://www.arkivverket.no. Values: Thanking letter, Randi Blehr, Anna Rogstad, L. Szacinski, 80685-1-nor-NO, Riksarkivet. Link labels: Author, description, archiver, name, sameAs, photographer, ID, publisher.]
39. Link more data
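http://www.arkivverket.no/../Takkebrevet --hasAuthor--> http://…/anna-rogstad
http://www.arkivverket.no/../Takkebrevet --description--> Thanking letter
http://www.arkivverket.no/../Takkebrevet --archiver--> http://…/archiver
http://…/archiver --name--> Randi Blehr
http://…/archiver --sameAs--> http://www.nrk.no/sf/...Blehr
http://www.nrk.no/sf/...Blehr --livedIn--> http://dbpedia.org/Bergen
http://www.nrk.no/sf/...Blehr --name--> Randi Blehr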
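What a sameAs link buys can be shown with a short Python sketch: statements published by different sources about the same person can be read together. It assumes, as the slide layout suggests, that sameAs connects the archiver resource to the NRK resource; the truncated URIs are kept exactly as they appear on the slide, and `describe` is a hypothetical helper name.

```python
# Statements from two sources, with URIs copied as shown on the slide.
triples = [
    ("http://…/archiver", "name", "Randi Blehr"),
    ("http://…/archiver", "sameAs", "http://www.nrk.no/sf/...Blehr"),
    ("http://www.nrk.no/sf/...Blehr", "livedIn", "http://dbpedia.org/Bergen"),
    ("http://www.nrk.no/sf/...Blehr", "name", "Randi Blehr"),
]

def describe(resource, graph):
    """Collect statements about a resource and everything it is sameAs."""
    seen, todo = set(), [resource]
    while todo:
        node = todo.pop()
        if node in seen:
            continue
        seen.add(node)
        for s, p, o in graph:
            # sameAs is symmetric, so follow it in both directions.
            if p == "sameAs" and node in (s, o):
                todo.append(o if s == node else s)
    return sorted({(p, o) for s, p, o in graph if s in seen and p != "sameAs"})

print(describe("http://…/archiver", triples))
```

Starting from the archive's own URI, the result also contains the `livedIn` statement that only the other source published.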
40. Now link some more data
[Figure: the linked graph extended with further resources. Resources: http://www.arkivverket.no/../Takkebrevet, http://…/anna-rogstad, http://www.arkivverket.no/var/..., http://…/archiver, http://www.arkivverket.no, http://www.nrk.no/sf/...Blehr, http://dbpedia.org/Bergen, http://www.stortinget.no. Values: Thanking letter, Anna Rogstad, Randi Blehr, L. Szacinski, 80685-1-nor-NO, Riksarkivet. Link labels: hasAuthor, description, archiver, name, sameAs, livedIn, photographer, ID, publisher.]
41. Data on the Web that
is in RDF and is linked
to other RDF data is
LINKED DATA
45. [Figure: the linked graph again. Resources: http://www.arkivverket.no/../Takkebrevet, http://…/anna-rogstad, http://www.arkivverket.no/var/..., http://…/archiver, http://www.arkivverket.no, http://www.nrk.no/sf/...Blehr, http://dbpedia.org/Bergen, http://www.stortinget.no. Values: Thanking letter, Anna Rogstad, Randi Blehr, L. Szacinski, 80685-1-nor-NO, Riksarkivet. Link labels: hasAuthor, description, archiver, title, name, sameAs, livedIn, photographer, ID, publisher.]
47. Link data from more than 40 datasets
Make use of more than 2 billion triples!
http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData
49. The Linking Open Data cloud diagram
Link data from more
than 295 datasets
Last updated: 2011‐09‐19
http://richard.cyganiak.de/2007/10/lod/
50. Linked data ...
publishing data on the Web ...
... to enable integration, linking and reuse
across silos
51. Six Steps to Publishing Linked Data
1. Understand the Principles
2. Model Your Data
3. Choose URIs for Things in Your Data
4. Set Up Your Infrastructure
5. Link to Other Data Sets
6. Describe and Publicise Your Data
52. Linked data
Apply the principles of the Web to publication of data
The Web:
is a global network of pages
each identified by a URL
fetching a URL gives a document
pages connected by links
open, anyone can say anything about anything else
53. Linked data
Apply the principles of the Web to publication of data
The linked data web:
is a global network of things
each identified by a URI
fetching a URI gives a set of statements
things connected by typed links
open, anyone can say anything about anything else
Linked data is “data you can click on”
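The idea of "clicking" through data can be sketched in Python. This is a toy model with no real HTTP: the `WEB` dictionary stands in for the Web, `fetch` stands in for an HTTP GET, and the `example.org` URIs are hypothetical.

```python
# Stand-in for the Web: each URI maps to the statements returned when fetched.
WEB = {
    "http://example.org/resource/letter": [
        ("hasAuthor", "http://example.org/resource/anna-rogstad"),
        ("description", "Thanking letter"),
    ],
    "http://example.org/resource/anna-rogstad": [
        ("name", "Anna Rogstad"),
    ],
}

def fetch(uri):
    """Fetching a URI gives a set of statements (empty if nothing is published)."""
    return WEB.get(uri, [])

def follow(start, max_hops=2):
    """Collect statements by following typed links outward from one URI."""
    found, visited, frontier = [], set(), [start]
    for _ in range(max_hops + 1):
        next_frontier = []
        for uri in frontier:
            if uri in visited:
                continue
            visited.add(uri)
            for predicate, obj in fetch(uri):
                found.append((uri, predicate, obj))
                # An object that is itself a URI is a link we can "click on".
                if obj.startswith("http://"):
                    next_frontier.append(obj)
        frontier = next_frontier
    return found

for statement in follow("http://example.org/resource/letter"):
    print(statement)
```

Each hop is an ordinary fetch of a URI, so a client needs no prior knowledge of who published what; it only needs a starting point and the links.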
54. LOD Benefits
Other humans and applications can
easily access your data using Web technologies
follow the links to obtain further contextual information
Links to your data and search engine indices
can increase the visibility of your data
55. The road to open knowledge begins here!
Thank you!