SlideShare uma empresa Scribd logo
1 de 68
Baixar para ler offline
Providing 
Linked 
Data 
Presented 
by: 
Maribel 
Acosta
LINKED 
DATA 
LIFECYCLE 
EUCLID 
-­‐ 
Querying 
Linked 
Data 
2
Linked 
Data 
Principles 
1. Use 
URIs 
as 
names 
for 
things. 
2. Use 
HTTP 
URIs 
so 
that 
users 
can 
look 
up 
those 
names. 
3. When 
someone 
looks 
up 
a 
URI, 
provide 
useful 
informa6on, 
using 
the 
standards 
(RDF*, 
SPARQL). 
4. Include 
links 
to 
other 
URIs, 
so 
that 
users 
can 
discover 
more 
things. 
CH 
1 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
3
Linked 
Data 
Lifecycle 
Source: 
Sören 
Auer. 
“The 
SemanSc 
Data 
Web” 
(slides) 
Linked 
Data 
Lifecycle 
Source: 
José 
M. 
Alvarez. 
“My 
Linked 
Data 
Lifecycle” 
Source: 
Michael 
Hausenblas. 
“Linked 
Data 
lifeyclcle” 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
4
Core 
Tasks 
for 
Providing 
Linked 
Data 
Based 
on 
the 
proposed 
LD 
lifecycles 
and 
the 
LD 
principles, 
we 
can 
idenSfy 
3 
main 
tasks 
for 
providing 
LD: 
① Crea6ng: 
includes 
data 
extracSon, 
creaSon 
of 
HTTP 
URIs, 
and 
vocabulary 
selecSon. 
(LD 
principles 
1 
& 
2) 
② Interlinking: 
involves 
the 
creaSon 
of 
(RDF) 
links 
to 
external 
data 
sets. 
(LD 
principle 
4) 
③ Publishing: 
consists 
of 
creaSng 
the 
metadata 
and 
making 
the 
data 
set 
accessible. 
(LD 
principle 
3) 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
5
Agenda 
1. Crea6ng 
Linked 
Data 
2. Interlinking 
Linked 
Data 
3. Publishing 
Linked 
Data 
4. Linked 
Data 
publishing 
checklist 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
6
CREATING 
LINKED 
DATA 
Linked 
Data 
principles 
1 
& 
2 
EUCLID 
-­‐ 
Querying 
Linked 
Data 
7
Extracting 
the 
Data 
• The 
data 
of 
interest 
may 
be 
stored 
in 
a 
wide 
range 
or 
formats: 
• Several 
tools 
support 
the 
process 
of 
mining 
data 
from 
different 
repositories, 
for 
example: 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
8 
Spreadsheets 
or 
tabular 
data 
Databases 
Text 
R2RML
Naming 
Things: 
URIs 
• All 
the 
things 
or 
disSnct 
enSSes 
within 
the 
data 
must 
be 
named 
• According 
to 
the 
Linked 
Data 
principles, 
the 
standard 
mechanism 
to 
name 
enSSes 
is 
the 
URI 
• Designing 
Cool 
URIs: 
– Leave 
out 
informaSon 
about 
the 
data 
regarding 
to: 
author, 
technologies, 
status, 
access 
mechanisms, 
… 
– Simplicity: 
short, 
mnemonic 
URIs 
– Stability: 
maintain 
the 
URIs 
as 
long 
as 
possible 
– Manageability: 
issue 
the 
URIs 
in 
a 
way 
that 
you 
can 
manage 
Source: 
hdp://www.w3.org/TR/cooluris/ 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
9
Selecting 
Vocabularies 
• Vocabularies 
model 
the 
concepts 
and 
the 
rela6onship 
between 
them 
in 
a 
knowledge 
domain 
• Terms 
from 
well-­‐known 
vocabularies 
should 
be 
reused 
wherever 
possible 
• New 
terms 
should 
be 
define 
only 
if 
you 
can 
not 
find 
required 
terms 
in 
exisSng 
vocabularies 
• A 
large 
number 
of 
vocabularies 
in 
RDF 
are 
openly 
available, 
e.g., 
Linked 
Open 
Vocabularies 
(LOV) 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
10
Selecting 
Vocabularies 
(2) 
Linked 
Open 
Vocabularies 
322 
vocabularies 
(2013) 
Source: 
hdp://lov.okfn.org/dataset/lov/ 
458 
vocabularies 
(2014) 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
11
Selecting 
Vocabularies 
(3) 
Linked 
Open 
Vocabularies: 
Analyzing 
MusicOntology 
Source:hdp://lov.okfn.org/dataset/lov/details/vocabulary_mo.html 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
12
Selecting 
Vocabularies 
(4) 
Other 
lists 
of 
well-­‐known 
vocabularies 
are 
maintained 
by: 
• W3C 
SWEO 
Linking 
Open 
Data 
community 
project 
hdp://www.w3.org/wiki/TaskForces/CommunityProjects/LinkingOpenData/ 
CommonVocabularies 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
13 
• Library 
Linked 
Data 
Incubator 
Group: 
Vocabularies 
in 
the 
library 
domain 
hdp://www.w3.org/2005/Incubator/lld/XGR-­‐lld-­‐vocabdataset-­‐20111025
Creating 
Linked 
Data 
(Quick 
Recipe) 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
14 
• Source 
data 
(database, 
text, 
…) 
• Names 
for 
things 
(instances): 
URIs 
• Vocabularies 
– A 
new 
concept 
should 
be 
created 
only 
when 
it 
can 
not 
be 
found 
in 
exisSng 
vocabularies 
RDF 
dataset 
Steps:
INTERLINKING 
LINKED 
DATA 
Linked 
Data 
principle 
4 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
15
Interlinking 
Data 
Sets 
• It’s 
one 
of 
the 
Linked 
Data 
principles! 
4. 
Include 
links 
to 
other 
URIs, 
so 
that 
users 
can 
• Involves 
discover 
the 
creaSon 
of 
RDF 
links 
between 
two 
different 
RDF 
data 
sets: 
– Links 
at 
instance 
level 
(rdfs:seeAlso, 
owl:sameAs) 
– Links 
at 
schema 
level 
(RDFS 
subclass/subproperty, 
OWL 
equivalent 
class/property, 
SKOS 
mapping 
proper6es) 
• Appropriate 
links 
are 
detected 
via 
link 
discovery 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
16 
more 
things.
Interlinking 
Data 
Sets 
(2) 
Challenges 
for 
link 
discovery 
• Linked 
Data 
sets 
are 
heterogeneous 
in 
terms 
of 
vocabularies, 
formats 
and 
data 
representaSon 
• Large 
range 
of 
knowledge 
domains 
• Scalability: 
LD 
is 
composed 
of 
a 
large 
number 
of 
data 
sets 
and 
RDF 
triples, 
hence 
it 
is 
not 
possible 
to 
compare 
every 
possible 
enSty 
pair 
Source: 
Robert 
Isele. 
“LOD2 
Webinar 
Series: 
Silk” 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
17
Interlinking 
Data 
Sets 
(3) 
Challenges 
for 
link 
discovery 
• An 
instance 
of 
link 
discovery 
is 
the 
en6ty 
resolu6on 
problem: 
deciding 
whether 
two 
en..es 
correspond 
to 
same 
object 
in 
the 
real 
world 
• Name 
ambigui6es: 
typos, 
misspellings, 
different 
languages, 
homonyms 
• Structural 
ambigui6es: 
same 
concepts/enSSes 
with 
different 
structures. 
Requires 
the 
applicaSon 
of 
ontology 
and 
schema 
matching 
techniques 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
18
Interlinking 
Data 
Sets 
(4) 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
19 
RDF 
data 
sets 
can 
be 
interlinked: 
Manually 
• Involves 
the 
manual 
exploraSon 
of 
LD 
data 
sets 
and 
their 
RDF 
resources 
to 
idenSfy 
linking 
targets 
• May 
not 
be 
feasible 
when 
the 
number 
of 
enSSes 
within 
the 
data 
set 
is 
very 
large 
(Semi-­‐)Automatically 
• Using 
tools 
that 
perform 
link 
discovery 
based 
on 
linkage 
rules, 
for 
example: 
Silk, 
Limes 
and 
xCurator
owl:sameAs 
& 
rdfs:seeAlso 
(1) 
• owl:sameAs 
• Creates 
links 
between 
individuals 
• States 
that 
two 
URIs 
refer 
to 
the 
same 
individuals 
• rdfs:seeAlso 
• States 
that 
a 
resource 
may 
provide 
addiSonal 
informaSon 
about 
the 
subject 
resource 
• Links 
in 
MusicBrainz: 
– owl:sameAs 
is 
used 
for 
music 
arSsts 
– rdfs:seeAlso 
is 
used 
for 
albums 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
20
owl:sameAs 
& 
rdfs:seeAlso 
(2) 
Examples 
of 
owl:sameAs 
and 
rdfs:seeAlso 
@prefix 
owl: 
<http://www.w3.org/2002/07/owl#> 
@prefix 
rdfs: 
<http://www.w3.org/2000/01/rdf-­‐schema#> 
@prefix 
dbpedia: 
<http://dbpedia.org/resource/> 
<http://musicbrainz.org/artist/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d> 
owl:sameAs 
dbpedia:The_Beatles. 
<http://musicbrainz.org/release/f8bf9e45-­‐a1d1-­‐42ab-­‐8f3c-­‐637b1762e5f7> 
rdfs:seeAlso 
dbpedia:All_you_need_is_love_(album). 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
21
SKOS 
• Simple 
Knowledge 
OrganizaSon 
System 
– hdp://www.w3.org/TR/skos-­‐reference/ 
• Data 
model 
for 
knowledge 
organizaSon 
systems 
(thesauri, 
classificaSon 
scheme, 
taxonomies) 
• SKOS 
data 
is 
expressed 
as 
RDF 
triples 
• Allows 
the 
creaSon 
of 
RDF 
links 
between 
different 
data 
sets 
with 
the 
usage 
of 
mapping 
proper6es 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
22
SKOS: 
Mapping 
Properties 
These 
properSes 
are 
used 
to 
link 
SKOS 
concepts 
(parScularly 
instances) 
in 
different 
schemas: 
• skos:closeMatch: 
links 
two 
concepts 
that 
are 
sufficiently 
similar 
(someSmes 
can 
be 
used 
interchangeably) 
• skos:exactMatch: 
indicates 
that 
the 
two 
concepts 
can 
be 
used 
interchangeably. 
• Axiom: 
It 
is 
a 
transi6ve 
property 
• skos:relatedMatch: 
states 
an 
associaSve 
mapping 
link 
between 
two 
concepts 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
23
SKOS: 
Mapping 
Properties 
Example 
of 
SKOS 
exact 
match 
(2) 
@prefix 
skos: 
<http://www.w3.org/2004/02/skos/core#> 
@prefix 
mo: 
<http://purl.org/ontology/mo/> 
@prefix 
dbpedia-­‐ont: 
<http://dbpedia.org/ontology/> 
@prefix 
schema: 
<http://schema.org/> 
mo:MusicArtist 
skos:exactMatch 
dbpedia-­‐ont:MusicalArtist. 
mo:MusicGroup 
skos:exactMatch 
dbpedia-­‐ont:Band. 
mo:MusicGroup 
skos:exactMatch 
schema:MusicGroup. 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
24
SKOS: 
Mapping 
Properties 
Example 
of 
SKOS 
close 
match 
(3) 
@prefix 
skos: 
<http://www.w3.org/2004/02/skos/core#> 
@prefix 
mo: 
<http://purl.org/ontology/mo/> 
@prefix 
dbpedia-­‐ont: 
<http://dbpedia.org/ontology/> 
@prefix 
schema: 
<http://schema.org/> 
mo:SignalGroup 
skos:closeMatch 
schema:MusicAlbum. 
mo:SignalGroup 
skos:closeMatch 
dbpedia-­‐ont:Album. 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
25
SKOS: 
Mapping 
Properties 
Integrity 
conditions 
• Guarantee 
(4) 
consistency 
and 
avoid 
contradicSons 
in 
the 
relaSonships 
between 
SKOS 
concepts 
ParSal 
Mapping 
RelaSon 
diagram 
with 
integrity 
condiSons 
skos:Mapping 
Relation 
skos:close 
Match 
skos:related 
skos:exact 
Match 
Match 
Symmetric 
& 
Transi6ve 
Disjoint 
with 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
26 
Symmetric
Interlinking 
Linked 
Data 
(Quick 
Recipe) 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
27 
• Create 
links 
at 
instance 
level 
• (Create 
links 
at 
schema 
level) 
Interlinked 
RDF 
dataset 
Steps:
PUBLISHING 
LINKED 
DATA 
Linked 
Data 
principle 
3 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
28
Publishing 
Linked 
Data 
Once 
the 
RDF 
data 
set 
has 
been 
created 
and 
interlinked, 
the 
publishing 
process 
involves 
the 
following 
tasks: 
1. Metadata 
creaSon 
for 
describing 
the 
data 
set 
2. Making 
the 
data 
set 
accessible 
3. Exposing 
the 
data 
set 
in 
Linked 
Data 
repositories 
4. Valida6ng 
the 
data 
set 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
29
Describing 
RDF 
Data 
Sets 
• Consists 
of 
providing 
(machine-­‐readable) 
metadata 
of 
RDF 
data 
sets 
which 
can 
be 
processed 
by 
engines 
• This 
informaSon 
allows 
for: 
– Efficient 
and 
effecSve 
search 
of 
data 
sets 
– SelecSon 
of 
appropriate 
data 
sets 
(for 
consumpSon 
or 
interlinking) 
– Get 
general 
staSsScs 
of 
the 
data 
sets 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
30
Describing 
RDF 
Data 
Sets 
(2) 
• The 
common 
language 
for 
describing 
RDF 
data 
sets 
is 
VoID 
(Vocabulary 
of 
Interlinked 
Data 
sets) 
• Defines 
an 
RDF 
data 
set 
with 
the 
predicate 
void:Dataset 
• Covers 
4 
types 
of 
metadata: 
• General 
metadata 
• Structural 
metadata 
• DescripSons 
of 
linksets 
• Access 
metadata 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
31
VoID: 
General 
Metadata 
• General 
metadata 
is 
used 
by 
users 
to 
idenSfy 
appropriate 
data 
sets. 
• Specifies 
informaSon 
about 
descripSon 
of 
the 
data 
set, 
contact 
person/organizaSon, 
the 
license 
of 
the 
data 
set, 
data 
subject 
and 
some 
technical 
features. 
• VoID 
(re)uses 
predicates 
from 
the 
Dublin 
Core 
Metadata1 
and 
FOAF2 
vocabularies. 
1 
hdp://dublincore.org/documents/2010/10/11/dcmi-­‐terms/ 
2 
hdp://xmlns.com/foaf/spec/ 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
32
VoID: 
General 
Metadata 
(2) 
Source: 
hdp://www.General 
Information 
Contains 
informaSon 
about 
the 
creaSon 
of 
the 
data 
set 
Predicate 
Range 
Descrip6on 
dcterms:title 
Literal 
Name 
of 
the 
data 
set. 
dcterms:description 
Literal 
DescripSon 
of 
the 
data 
set. 
dcterms:source 
RDF 
resource 
w3.org/TR/void/#metadata 
Source 
from 
which 
the 
data 
set 
was 
derived. 
dcterms:creator 
RDF 
resource 
Primarily 
responsible 
of 
creaSng 
the 
data 
set. 
dcterms:date 
xsd:date 
Time 
associated 
with 
an 
event 
in 
the 
life-­‐cycle 
of 
the 
resource. 
dcterms:created 
xsd:date 
Date 
of 
creaSon 
of 
the 
data 
set. 
dcterms:issued 
xsd:date 
Date 
of 
publicaSon 
of 
the 
data 
set. 
dcterms:modified 
xsd:date 
Date 
on 
which 
the 
data 
set 
was 
changed. 
foaf:homepage 
Literal 
Name 
of 
the 
data 
set. 
dcterms:publisher 
RDF 
resource 
EnSty 
responsible 
for 
making 
the 
data 
set 
available. 
dcterms:contributor 
RDF 
resource 
EnSty 
responsible 
for 
making 
contribuSons 
to 
the 
data 
set. 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
33
VoID: 
General 
Metadata 
(3) 
Other 
Information 
• License 
of 
the 
data 
set: 
specifies 
the 
usage 
condiSons 
of 
the 
data. 
The 
license 
can 
be 
pointed 
with 
the 
property 
dcterms:license 
• Category 
of 
the 
data 
set: 
to 
specify 
the 
topics 
or 
domains 
covered 
by 
the 
data 
set, 
the 
property 
dcterms:subject 
can 
be 
used 
• Technical 
features: 
the 
property 
void:feature 
can 
be 
used 
to 
express 
technical 
properSes 
of 
the 
data 
(e.g. 
RDF 
serializaSon 
formats) 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
34
VoID: 
Structural 
Metadata 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
35 
• Provides 
high-­‐level 
informaSon 
about 
the 
internal 
structure 
of 
the 
data 
set 
• This 
metadata 
is 
useful 
when 
exploring 
or 
querying 
the 
data 
set 
• Includes 
informaSon 
about 
resources, 
vocabularies 
used 
in 
the 
data 
set, 
staSsScs 
and 
examples 
of 
resources 
in 
the 
data 
set
VoID: 
Structural 
Metadata 
(2) 
Information 
about 
resources 
• Example 
resources: 
allow 
users 
to 
get 
an 
impression 
of 
the 
kind 
of 
resources 
included 
in 
the 
data 
set. 
Examples 
can 
be 
shown 
with 
the 
property 
void:exampleResource 
:MusicBrainz 
a 
void:Dataset; 
void:exampleResource 
<http://musicbrainz.org/artist/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d> 
. 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
36 
• Padern 
for 
resource 
URIs: 
the 
void:uriSpace 
property 
can 
be 
used 
to 
state 
that 
all 
the 
enSty 
URIs 
in 
a 
data 
set 
start 
with 
a 
given 
string 
:MusicBrainz 
a 
void:Dataset; 
void:uriSpace 
"http://musicbrainz.org/" 
.
VoID: 
Structural 
Metadata 
(3) 
Vocabularies 
used 
in 
the 
data 
set 
• The 
void:vocabulary 
property 
idenSfies 
the 
vocabulary 
or 
ontology 
that 
is 
used 
in 
a 
data 
set 
:MusicBrainz 
a 
void:Dataset; 
void:vocabulary 
<http://purl.org/ontology/mo/> 
. 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
37 
• Typically, 
only 
the 
most 
relevant 
vocabularies 
are 
listed 
• This 
property 
can 
only 
be 
used 
for 
enSre 
vocabularies. 
It 
cannot 
be 
used 
to 
express 
that 
a 
subset 
of 
the 
vocabulary 
occurs 
in 
the 
data 
set.
VoID: 
Structural 
Metadata 
(4) 
Statistics 
about 
a 
data 
set 
Express 
numeric 
staSsScs 
about 
a 
data 
set: 
Predicate 
Range 
Descrip6on 
void:triples 
Number 
Source: 
hdp://www.w3.org/TR/void/#metadata 
Total 
number 
of 
triples 
contained 
in 
the 
data 
set. 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
38 
void:entities 
Number 
Total 
number 
of 
enSSes 
that 
are 
described 
in 
the 
data 
set. 
An 
enSty 
must 
have 
a 
URI, 
and 
match 
the 
void:uriRegexPadern 
void:classes 
Number 
Total 
number 
of 
disSnct 
classes 
in 
the 
data 
set. 
void:properties 
Number 
Total 
number 
of 
disSnct 
properSes 
in 
the 
data 
set. 
void:distinctSubjects 
Number 
Total 
number 
of 
disSnct 
subjects 
in 
the 
data 
set. 
void:distinctObjects 
Number 
Total 
number 
of 
disSnct 
objects 
in 
the 
data 
set. 
void:documents 
Number 
Total 
number 
of 
documents, 
in 
case 
that 
the 
data 
set 
is 
published 
as 
a 
set 
of 
individual 
documents.
VoID: 
Structural 
Metadata 
(5) 
Partitioned 
data 
sets 
• The 
:MusicBrainz 
a 
void:Dataset; 
void:subset 
:MusicBrainzGroupMembers 
. 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
39 
void:subset 
property 
provides 
descripSon 
of 
parts 
of 
a 
data 
set 
• Data 
sets 
can 
be 
parSSoned 
based 
on 
classes 
or 
proper6es: 
• void:classPartition 
contains 
only 
instances 
of 
a 
parScular 
class 
• void:propertyPartition 
contains 
only 
triples 
with 
a 
parScular 
predicate 
:MusicBrainzGroupMembers 
a 
void:Dataset; 
void:classPartition 
[ 
void:class 
mo:MusicArtist 
.] 
; 
void:propertyPartition 
[ 
void:property 
mo:member 
.] 
.
VoID: 
Describing 
Linksets 
@PREFIX 
void:<http://rdfs.org/ns/void#> 
@PREFIX 
owl:<http://www.w3.org/2002/07/owl#> 
:DS1 
a 
void:Dataset 
. 
:DS2 
a 
void:Dataset 
. 
:DS1 
void:subset 
:LS1 
. 
:LS1 
a 
void:Linkset; 
owl:sameAs 
void:linkPredicate 
owl:sameAs; 
void:target 
:DS1, 
:DS2 
. 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
40 
• Linkset: 
collecSon 
of 
RDF 
links 
between 
two 
RDF 
data 
sets 
:DS1 
:DS2 
:LS1 
:LS2 
Image 
based 
on 
hdp://semanScweb.org/wiki/File:Void-­‐linkset-­‐conceptual.png
VoID: 
Describing 
Linksets 
(2) 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
41 
Example 
@PREFIX 
void:<http://rdfs.org/ns/void#> 
@PREFIX 
owl:<http://www.w3.org/2002/07/owl#> 
:MusicBrainz 
a 
void:Dataset 
. 
:DBpedia 
a 
void:Dataset 
. 
:MusicBrainz 
void:classPartition 
:MBArtists 
. 
:MBArtists 
void:class 
mo:MusicArtist 
. 
:MBArtists 
a 
void:Linkset; 
void:linkPredicate 
owl:sameAs; 
void:target 
:MusicBrainz, 
:DBpedia 
.
VoID: 
Access 
Metadata 
The 
access 
metadata 
describes 
the 
methods 
of 
accessing 
the 
actual 
RDF 
data 
set 
Method 
Predicate 
Descrip6on 
* 
This 
assumes 
that 
the 
default 
graph 
of 
the 
SPARQL 
endpoint 
contains 
the 
data 
set. 
VoID 
cannot 
express 
that 
a 
data 
set 
is 
contained 
in 
a 
specific 
named 
graph. 
This 
can 
be 
specified 
with 
SPARQL 
1.1. 
Service 
DescripSon 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
42 
URI 
look 
up 
endpoint 
void:uriLookupEndpoint 
Specifies 
the 
URI 
of 
a 
service 
for 
accessing 
the 
data 
set 
(different 
from 
the 
SPARQL 
protocol) 
Root 
resource 
void:rootResource 
URI 
of 
the 
top 
concepts 
(only 
for 
data 
sets 
structured 
as 
trees) 
SPARQL 
endpoint 
void:sparqlEndpoint 
Provides 
access 
to 
the 
data 
set 
via 
the 
SPARQL 
protocol.* 
RDF 
data 
dumps 
void:dataDump 
Specifies 
the 
locaSon 
of 
the 
dump 
file. 
If 
the 
data 
set 
is 
split 
into 
mulSple 
files, 
then 
several 
values 
of 
this 
property 
are 
provided. 
CH 
5
Providing 
Access 
to 
the 
Data 
Set 
The 
data 
set 
can 
be 
accessed 
via 
different 
mechanisms: 
RDFa 
RDF 
dump 
SPARQL 
endpoint 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
43 
Dereferencing 
HTTP 
URIs
Dereferencing 
HTTP 
URIs 
• Allows 
for 
easily 
exploring 
certain 
resources 
contained 
in 
the 
data 
set 
• 
What 
to 
return 
for 
a 
URI? 
• Immediate 
descrip6on: 
CH 
1 
triples 
where 
the 
URI 
is 
the 
subject. 
• Backlinks: 
triples 
where 
the 
URI 
is 
the 
object. 
• Related 
descrip6ons: 
informaSon 
of 
interest 
in 
typical 
usage 
scenarios. 
• Metadata: 
informaSon 
as 
author 
and 
licensing 
informaSon. 
• Syntax: 
RDF 
descripSons 
as 
RDF/XML 
and 
human-­‐readable 
formats. 
• ApplicaSons 
(e.g. 
LD 
browsers) 
render 
the 
retrieved 
informaSon 
so 
it 
can 
be 
perceived 
by 
a 
user. 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
44 
Source: 
How 
to 
Publish 
Linked 
Data 
on 
The 
Web 
-­‐ 
Chris 
Bizer, 
Richard 
Cyganiak, 
Tom 
Heath.
Dereferencing 
HTTP 
URIs 
(2) 
Example: 
Dereferencing 
http://dbpedia.org/resource/The_Beatles 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
45
RDFa 
• RDFa 
= 
“RDF 
in 
adributes” 
• Extension 
to 
HTML5 
for 
embedding 
RDF 
within 
HTML 
pages: 
– The 
HTML 
is 
processed 
by 
the 
browser, 
the 
(human) 
consumer 
doesn’t 
see 
the 
RDF 
data 
– The 
RDF 
triples 
within 
the 
page 
are 
consumed 
by 
APIs 
to 
extract 
the 
(semi-­‐)structured 
data 
• It 
is 
considered 
as 
the 
bridge 
between 
the 
Web 
of 
Data 
and 
the 
Web 
of 
Documents 
• It 
is 
a 
complete 
serializa6on 
of 
RDF 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
46
RDFa: 
Attributes 
A_ribute 
role 
A_ribute 
Descrip6on 
prefix 
Syntax 
List 
of 
prefix-­‐name 
IRIs 
pairs 
vocab 
IRI 
that 
specifies 
the 
vocabulary 
where 
the 
concept 
is 
defined 
Subject 
about 
Specifies 
the 
subject 
of 
the 
relaSonship 
Predicate 
property 
Express 
the 
relaSonship 
between 
the 
subject 
and 
the 
value 
rel 
Defines 
a 
relaSon 
between 
the 
subject 
and 
a 
URL 
rev 
Express 
reverse 
relaSonships 
between 
two 
resources 
Resource 
href 
Specifies 
an 
object 
URI 
for 
the 
rel 
and 
rev 
adributes 
resource 
Same 
as 
href 
(used 
when 
href 
is 
not 
present) 
src 
Specifies 
the 
subject 
of 
a 
relaSonship 
Literal 
datatype 
Express 
the 
datatype 
of 
the 
object 
of 
the 
property 
adribute 
content 
Supply 
machine-­‐readable 
content 
for 
a 
literal 
xml:lang, 
lang 
Specifies 
the 
language 
of 
the 
literal 
Macro 
typeof 
Indicate 
the 
RDF 
type(s) 
to 
associate 
with 
a 
subject 
inlist 
An 
object 
is 
added 
to 
the 
list 
of 
a 
predicate. 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
47
RDFa: 
Example 
Extracting 
RDF 
from 
HTML 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
48 
HTML 
(+RDFa): 
<div 
class="arSstheader" 
about="hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d#_" 
typeof="hdp://purl.org./ontology/mo/MusicGroup"> 
… 
</div> 
RDF: 
<hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d#_>
RDFa: 
Example 
Extracting 
RDF 
from 
HTML 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
49 
HTML 
(+RDFa): 
<div 
class="arSstheader" 
about="hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d#_" 
typeof="hdp://purl.org./ontology/mo/MusicGroup"> 
… 
</div> 
RDF: 
<hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d#_> 
<hdp://www.w3.org/1999/02/22-­‐rdf-­‐syntax-­‐ns#type>
RDFa: 
Example 
Extracting 
RDF 
from 
HTML 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
50 
HTML 
(+RDFa): 
<div 
class="arSstheader" 
about="hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d#_" 
typeof="hdp://purl.org./ontology/mo/MusicGroup"> 
… 
</div> 
RDF: 
<hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d#_> 
<hdp://www.w3.org/1999/02/22-­‐rdf-­‐syntax-­‐ns#type> 
<hdp://purl.org./ontology/mo/MusicGroup>.
RDFa: 
Example 
(2) 
Extracting 
RDF 
from 
MusicBrainz.org 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
51 
hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d
RDFa: 
Example 
(2) 
Extracting 
RDF 
from 
MusicBrainz.org 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
52 
Source: 
hdp://www.w3.org/2007/08/pyRdfa/
RDFa: 
Example 
(2) 
Extracting 
RDF 
from 
MusicBrainz.org 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
53 
hdp://www.w3.org/2007/08/pyRdfa/extract?uri=hdp%3A%2F%2Fmusicbrainz.org 
%2FarSst%2Fb10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d&format=nt 
Watch 
the 
EUCLID 
screencast: 
http://vimeo.com/euclidproject
RDF 
Dump 
• An 
RDF 
dump 
refers 
to 
a 
file 
which 
contains 
(part 
of) 
a 
data 
set 
specified 
in 
an 
RDF 
serializaSon 
• The 
data 
set 
can 
be 
split 
into 
several 
RDF 
dumps 
• A 
list 
of 
available 
data 
sets 
available 
as 
RDF 
dumps 
can 
be 
found 
at: 
– hdp://www.w3.org/wiki/DataSetRDFDumps 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
54
SPARQL 
Endpoint 
• The 
SPARQL 
endpoint 
refers 
to 
the 
URI 
of 
the 
listener 
of 
the 
SPARQL 
protocol 
service, 
which 
handles 
requests 
for 
SPARQL 
protocol 
operaSons 
• The 
user 
submits 
SPARQL 
queries 
to 
the 
SPARQL 
endpoint 
in 
order 
to 
retrieve 
only 
a 
desired 
subset 
of 
the 
RDF 
data 
set 
• List 
CH 
2 
of 
available 
SPARQL 
endpoints: 
• hdp://www.w3.org/wiki/SparqlEndpoints 
• hdp://sparqles.okfn.org/ 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
55
SPARQLES 
hdp://sparqles.okfn.org/ 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
56
Using 
Linked 
Data 
Catalogs 
• Data 
catalogs, 
markets 
or 
repositories 
are 
pla{orms 
dedicated 
to 
provide 
visibility 
to 
a 
wide 
range 
of 
data 
sets 
from 
different 
domains 
• Allow 
data 
consumers 
to 
easily 
find 
and 
use 
the 
data 
• Usually 
the 
catalogs 
offer 
relevant 
metadata 
about 
the 
creaSon 
of 
the 
data 
set 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
57
Using 
Linked 
Data 
Catalogs 
(2) 
How 
to 
publish 
an 
RDF 
data 
set 
into 
a 
catalog? 
Create 
your 
own 
data 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
58 
catalog 
Recommended 
for 
big 
organizaSons/insStuSons 
aiming 
at 
providing 
a 
large 
number 
of 
data 
sets 
Use 
a 
data 
management 
system, 
for 
example: 
Upload 
your 
data 
set 
into 
an 
exisSng 
catalog 
Allows 
data 
consumers 
to 
easily 
find 
new 
data 
sets 
Common 
LD 
catalogs 
are: 
-­‐ 
-­‐ 
The 
Linking 
Open 
Data 
Cloud
Validating 
Data 
Sets 
There 
are 
different 
ways 
to 
validate 
the 
published 
RDF 
data 
set: 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
59 
Parsing 
& 
Syntax 
General 
validators 
• Vapour 
-­‐ 
Performs 
two 
types 
of 
tests: 
without 
content 
negoSaSon 
and 
requesSng 
RDF/XML 
content 
hdp://validator.linkeddata.org/vapour 
• URI 
Debugger 
-­‐ 
Retreieves 
the 
HTTP 
responses 
of 
accessing 
a 
URI 
hdp://linkeddata.informaSk.hu-­‐berlin.de/uridbg/ 
• RDF 
Triple-­‐Checker 
– 
Dereferences 
namespaces 
associated 
with 
the 
resources 
used 
in 
the 
document 
hdp://graphite.ecs.soton.ac.uk/checker/ 
• W3C 
RDF/XML 
Valida6on 
Service 
– 
Evaluates 
the 
syntax 
of 
RDF/ 
XML 
documents 
and 
displays 
the 
RDF 
triples 
in 
it 
hdp://validator.linkeddata.org/vapour 
• W3C 
Markup 
Valida6on 
Service 
– 
Checks 
syntacSc 
correctness 
for 
web 
documents 
with 
RDFa 
markup 
hdp://validator.w3.org/ 
• RDF:ALERTS 
– 
Validates 
syntax, 
undefined 
resources, 
datatype 
and 
other 
types 
of 
errors 
hdp://swse.deri.org/RDFAlerts/ 
Accessibility
Validating 
Data 
Sets 
(2) 
Example: 
Validating 
URIs 
with 
Vapour 
Source: 
hdp://idi.fundacioncSc.org/vapour 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
60
Validating 
Data 
Sets 
(3) 
Example: 
Validating 
URIs 
with 
Vapour 
Source: 
hdp://idi.fundacioncSc.org/vapour 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
61
Publishing 
Linked 
Data 
(Quick 
Recipe) 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
62 
• Create 
dataset 
metadata 
• Provide 
access 
mechanisms: 
– RDF 
dump 
– SPARQL 
endpoint 
• (Expose 
the 
dataset 
in 
a 
catalog) 
• Validate 
everything! 
Published 
RDF 
dataset 
Steps:
PROVIDING 
LINKED 
DATA: 
CHECKLIST 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
63
Providing 
Linked 
Data: 
Checklist 
(1) 
Creating 
Linked 
Data 
o All 
the 
relevant 
enSSes/concepts 
were 
effecSvely 
extracted 
from 
the 
raw 
data 
? 
o Are 
all 
the 
created 
URIs 
dereferenceable? 
o Are 
you 
reusing 
terms 
from 
widely 
accepted 
vocabularies? 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
64
Providing 
Linked 
Data: 
Checklist 
(2) 
Interlinking 
Linked 
Data 
o Is 
the 
data 
set 
linked 
to 
other 
RDF 
data 
sets? 
o Are 
the 
created 
vocabulary 
terms 
linked 
to 
other 
vocabularies? 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
65
Providing 
Linked 
Data: 
Checklist 
(3) 
Publishing 
Linked 
Data 
o Do 
you 
provide 
data 
set 
metadata? 
o Do 
you 
provide 
informaSon 
about 
licensing? 
o Do 
you 
provide 
addiSonal 
access 
methods? 
o Is 
the 
data 
set 
available 
in 
LD 
catalogs? 
o Did 
the 
data 
set 
pass 
the 
validaSon 
tests? 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
66
Summary 
In 
this 
chapter 
we 
studied: 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
67 
• The 
Linked 
Data 
lifecycle: 
• 3 
core 
tasks: 
creaSng, 
interlinking 
and 
publishing 
• CreaSon 
of 
Linked 
Data: 
• ExtracSng 
relevant 
data, 
using 
URIs 
to 
name 
enSSes 
and 
selecSng 
vocabularies 
and 
expressing 
the 
data 
using 
the 
RDF 
data 
model 
• Interlinking 
Linked 
Data: 
• Challenges 
of 
link 
discovery, 
using 
Silk 
to 
create 
links 
between 
two 
data 
sets 
and 
using 
SKOS 
links 
• Publishing 
Linked 
Data: 
• CreaSon 
of 
data 
set 
metadata; 
publishing 
the 
data 
set 
via 
RDF 
dumps, 
SPARQL 
endpoints 
or 
RDFa; 
uploading 
the 
data 
set 
to 
a 
LD 
catalog
For 
exercises, 
quiz 
and 
further 
material 
visit 
our 
website: 
http://www.euclid-­‐project.eu 
EUCLID 
-­‐ 
Providing 
Linked 
Data 
68 
eBook 
@euclid_project 
euclidproject 
euclidproject 
Other 
channels: 
Course

Mais conteúdo relacionado

Mais procurados

Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked DataEUCLID project
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosEUCLID project
 
SPARQL in the Semantic Web
SPARQL in the Semantic WebSPARQL in the Semantic Web
SPARQL in the Semantic WebJan Beeck
 
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...Armin Haller
 
euclid_linkedup WWW tutorial (Besnik Fetahu)
euclid_linkedup WWW tutorial (Besnik Fetahu)euclid_linkedup WWW tutorial (Besnik Fetahu)
euclid_linkedup WWW tutorial (Besnik Fetahu)Besnik Fetahu
 
Microtask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked DataMicrotask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked DataEUCLID project
 
Querying Linked Data on Android
Querying Linked Data on AndroidQuerying Linked Data on Android
Querying Linked Data on AndroidEUCLID project
 
Hack U Barcelona 2011
Hack U Barcelona 2011Hack U Barcelona 2011
Hack U Barcelona 2011Peter Mika
 
Lecture linked data cloud & sparql
Lecture linked data cloud & sparqlLecture linked data cloud & sparql
Lecture linked data cloud & sparqlDhavalkumar Thakker
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic WebIvan Herman
 
RDF Graph Data Management in Oracle Database and NoSQL Platforms
RDF Graph Data Management in Oracle Database and NoSQL PlatformsRDF Graph Data Management in Oracle Database and NoSQL Platforms
RDF Graph Data Management in Oracle Database and NoSQL PlatformsGraph-TA
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the webChiara Del Vescovo
 
GDG Meets U event - Big data & Wikidata - no lies codelab
GDG Meets U event - Big data & Wikidata -  no lies codelabGDG Meets U event - Big data & Wikidata -  no lies codelab
GDG Meets U event - Big data & Wikidata - no lies codelabCAMELIA BOBAN
 
Deriving an Emergent Relational Schema from RDF Data
Deriving an Emergent Relational Schema from RDF DataDeriving an Emergent Relational Schema from RDF Data
Deriving an Emergent Relational Schema from RDF DataGraph-TA
 
Re-using Media on the Web: Media fragment re-mixing and playout
Re-using Media on the Web: Media fragment re-mixing and playoutRe-using Media on the Web: Media fragment re-mixing and playout
Re-using Media on the Web: Media fragment re-mixing and playoutMediaMixerCommunity
 
Development of Semantic Web based Disaster Management System
Development of Semantic Web based Disaster Management SystemDevelopment of Semantic Web based Disaster Management System
Development of Semantic Web based Disaster Management SystemNIT Durgapur
 

Mais procurados (20)

Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
 
SPARQL in the Semantic Web
SPARQL in the Semantic WebSPARQL in the Semantic Web
SPARQL in the Semantic Web
 
Semantic Web and Linked Open Data
Semantic Web and Linked Open DataSemantic Web and Linked Open Data
Semantic Web and Linked Open Data
 
Technical Background
Technical BackgroundTechnical Background
Technical Background
 
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
 
euclid_linkedup WWW tutorial (Besnik Fetahu)
euclid_linkedup WWW tutorial (Besnik Fetahu)euclid_linkedup WWW tutorial (Besnik Fetahu)
euclid_linkedup WWW tutorial (Besnik Fetahu)
 
Microtask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked DataMicrotask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked Data
 
Querying Linked Data on Android
Querying Linked Data on AndroidQuerying Linked Data on Android
Querying Linked Data on Android
 
Hack U Barcelona 2011
Hack U Barcelona 2011Hack U Barcelona 2011
Hack U Barcelona 2011
 
Lecture linked data cloud & sparql
Lecture linked data cloud & sparqlLecture linked data cloud & sparql
Lecture linked data cloud & sparql
 
What is Linked Data?
What is Linked Data?What is Linked Data?
What is Linked Data?
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
RDF Graph Data Management in Oracle Database and NoSQL Platforms
RDF Graph Data Management in Oracle Database and NoSQL PlatformsRDF Graph Data Management in Oracle Database and NoSQL Platforms
RDF Graph Data Management in Oracle Database and NoSQL Platforms
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the web
 
GDG Meets U event - Big data & Wikidata - no lies codelab
GDG Meets U event - Big data & Wikidata -  no lies codelabGDG Meets U event - Big data & Wikidata -  no lies codelab
GDG Meets U event - Big data & Wikidata - no lies codelab
 
Deriving an Emergent Relational Schema from RDF Data
Deriving an Emergent Relational Schema from RDF DataDeriving an Emergent Relational Schema from RDF Data
Deriving an Emergent Relational Schema from RDF Data
 
Web of Data Usage Mining
Web of Data Usage MiningWeb of Data Usage Mining
Web of Data Usage Mining
 
Re-using Media on the Web: Media fragment re-mixing and playout
Re-using Media on the Web: Media fragment re-mixing and playoutRe-using Media on the Web: Media fragment re-mixing and playout
Re-using Media on the Web: Media fragment re-mixing and playout
 
Development of Semantic Web based Disaster Management System
Development of Semantic Web based Disaster Management SystemDevelopment of Semantic Web based Disaster Management System
Development of Semantic Web based Disaster Management System
 

Destaque

Mon fundulaki tut_querying linked data
Mon fundulaki tut_querying linked dataMon fundulaki tut_querying linked data
Mon fundulaki tut_querying linked dataeswcsummerschool
 
Presentation media 1 1
Presentation media 1 1Presentation media 1 1
Presentation media 1 1es-mef
 
RESEARCH: Video Codes & Conventions
RESEARCH: Video Codes & Conventions RESEARCH: Video Codes & Conventions
RESEARCH: Video Codes & Conventions es-mef
 
Attracting and adressing audiences 5.5
Attracting and adressing audiences 5.5Attracting and adressing audiences 5.5
Attracting and adressing audiences 5.5es-mef
 
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...eswcsummerschool
 
01 dasar teori manajemen
01 dasar teori manajemen01 dasar teori manajemen
01 dasar teori manajemenMahrus Ali
 
Saddu al dzari'ah
Saddu al dzari'ahSaddu al dzari'ah
Saddu al dzari'ahMahrus Ali
 
Filsafat kontemporer
Filsafat kontemporerFilsafat kontemporer
Filsafat kontemporerMahrus Ali
 
2. pengertian dan ruang lingkup ipi
2. pengertian dan ruang lingkup ipi2. pengertian dan ruang lingkup ipi
2. pengertian dan ruang lingkup ipiMahrus Ali
 
Engro Corporation LCM Leadership change management
Engro Corporation LCM  Leadership change managementEngro Corporation LCM  Leadership change management
Engro Corporation LCM Leadership change managementAzaan Karim
 
Asad umer presentation
Asad umer presentationAsad umer presentation
Asad umer presentationAzaan Karim
 
Nadra Pakistan doc
Nadra Pakistan docNadra Pakistan doc
Nadra Pakistan docAzaan Karim
 
De arkade II Tussentijdse evaluatie
De arkade II Tussentijdse evaluatieDe arkade II Tussentijdse evaluatie
De arkade II Tussentijdse evaluatieBertel Van Gansbeke
 
Air blue presentation
Air blue presentationAir blue presentation
Air blue presentationAzaan Karim
 

Destaque (20)

Mon fundulaki tut_querying linked data
Mon fundulaki tut_querying linked dataMon fundulaki tut_querying linked data
Mon fundulaki tut_querying linked data
 
Presentation media 1 1
Presentation media 1 1Presentation media 1 1
Presentation media 1 1
 
RESEARCH: Video Codes & Conventions
RESEARCH: Video Codes & Conventions RESEARCH: Video Codes & Conventions
RESEARCH: Video Codes & Conventions
 
Attracting and adressing audiences 5.5
Attracting and adressing audiences 5.5Attracting and adressing audiences 5.5
Attracting and adressing audiences 5.5
 
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
Tutorial: Querying a Marine Data Warehouse Using SPARQL - I. Fundulaki - ESWC...
 
G e n d e r
G e n d e rG e n d e r
G e n d e r
 
01 dasar teori manajemen
01 dasar teori manajemen01 dasar teori manajemen
01 dasar teori manajemen
 
Pengantar tqm
Pengantar tqmPengantar tqm
Pengantar tqm
 
Bulbul Jan
Bulbul Jan Bulbul Jan
Bulbul Jan
 
Saddu al dzari'ah
Saddu al dzari'ahSaddu al dzari'ah
Saddu al dzari'ah
 
Filsafat kontemporer
Filsafat kontemporerFilsafat kontemporer
Filsafat kontemporer
 
2. pengertian dan ruang lingkup ipi
2. pengertian dan ruang lingkup ipi2. pengertian dan ruang lingkup ipi
2. pengertian dan ruang lingkup ipi
 
Engro Corporation LCM Leadership change management
Engro Corporation LCM  Leadership change managementEngro Corporation LCM  Leadership change management
Engro Corporation LCM Leadership change management
 
Asad umer presentation
Asad umer presentationAsad umer presentation
Asad umer presentation
 
Godaan setan
Godaan  setanGodaan  setan
Godaan setan
 
Nadra Pakistan doc
Nadra Pakistan docNadra Pakistan doc
Nadra Pakistan doc
 
De arkade II Tussentijdse evaluatie
De arkade II Tussentijdse evaluatieDe arkade II Tussentijdse evaluatie
De arkade II Tussentijdse evaluatie
 
Air blue presentation
Air blue presentationAir blue presentation
Air blue presentation
 
NADRA pakistan
NADRA pakistanNADRA pakistan
NADRA pakistan
 
Testchinois
TestchinoisTestchinois
Testchinois
 

Semelhante a Tue acosta tut_providing_linkeddata

Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked dataLaura Po
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked DataMarin Dimitrov
 
One day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic WebOne day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic WebVictor de Boer
 
SemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in PracticeSemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in PracticeDan Brickley
 
Linked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and MuseumsLinked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and Museumstrevorthornton
 
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Cory Lampert
 
Linked Open Data in the World of Patents
Linked Open Data in the World of Patents Linked Open Data in the World of Patents
Linked Open Data in the World of Patents Dr. Haxel Consult
 
A candidate dataset_discovery_and_linkage_recommendation_system_for_linked_data
A candidate dataset_discovery_and_linkage_recommendation_system_for_linked_dataA candidate dataset_discovery_and_linkage_recommendation_system_for_linked_data
A candidate dataset_discovery_and_linkage_recommendation_system_for_linked_dataSTIinnsbruck
 
Linked Data, Ontologies and Inference
Linked Data, Ontologies and InferenceLinked Data, Ontologies and Inference
Linked Data, Ontologies and InferenceBarry Norton
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Richard Urban
 
Linked data 101: Getting Caught in the Semantic Web
Linked data 101: Getting Caught in the Semantic Web Linked data 101: Getting Caught in the Semantic Web
Linked data 101: Getting Caught in the Semantic Web Morgan Briles
 

Semelhante a Tue acosta tut_providing_linkeddata (20)

Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
Open data and linked data
Open data and linked dataOpen data and linked data
Open data and linked data
 
Introduction of Linked Data for Science
Introduction of Linked Data for ScienceIntroduction of Linked Data for Science
Introduction of Linked Data for Science
 
Linking up your data
Linking up your dataLinking up your data
Linking up your data
 
Linked Data to Improve the OER Experience
Linked Data to Improve the OER ExperienceLinked Data to Improve the OER Experience
Linked Data to Improve the OER Experience
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
One day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic WebOne day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic Web
 
SemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in PracticeSemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in Practice
 
Linked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and MuseumsLinked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and Museums
 
Linked Open Data
Linked Open DataLinked Open Data
Linked Open Data
 
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
 
Linked Open Data in the World of Patents
Linked Open Data in the World of Patents Linked Open Data in the World of Patents
Linked Open Data in the World of Patents
 
Linked Data
Linked DataLinked Data
Linked Data
 
Danbri Drupalcon Export
Danbri Drupalcon ExportDanbri Drupalcon Export
Danbri Drupalcon Export
 
A candidate dataset_discovery_and_linkage_recommendation_system_for_linked_data
A candidate dataset_discovery_and_linkage_recommendation_system_for_linked_dataA candidate dataset_discovery_and_linkage_recommendation_system_for_linked_data
A candidate dataset_discovery_and_linkage_recommendation_system_for_linked_data
 
Linked Data, Ontologies and Inference
Linked Data, Ontologies and InferenceLinked Data, Ontologies and Inference
Linked Data, Ontologies and Inference
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1
 
Linked Data
Linked DataLinked Data
Linked Data
 
Linked data 101: Getting Caught in the Semantic Web
Linked data 101: Getting Caught in the Semantic Web Linked data 101: Getting Caught in the Semantic Web
Linked data 101: Getting Caught in the Semantic Web
 
Finding Data Sets
Finding Data SetsFinding Data Sets
Finding Data Sets
 

Mais de eswcsummerschool

Semantic Aquarium - ESWC SSchool 14 - Student project
Semantic Aquarium - ESWC SSchool 14 - Student projectSemantic Aquarium - ESWC SSchool 14 - Student project
Semantic Aquarium - ESWC SSchool 14 - Student projecteswcsummerschool
 
Syrtaki - ESWC SSchool 14 - Student project
Syrtaki  - ESWC SSchool 14 - Student projectSyrtaki  - ESWC SSchool 14 - Student project
Syrtaki - ESWC SSchool 14 - Student projecteswcsummerschool
 
Keep fit (a bit) - ESWC SSchool 14 - Student project
Keep fit (a bit)  - ESWC SSchool 14 - Student projectKeep fit (a bit)  - ESWC SSchool 14 - Student project
Keep fit (a bit) - ESWC SSchool 14 - Student projecteswcsummerschool
 
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student project
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student projectArabic Sentiment Lexicon - ESWC SSchool 14 - Student project
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student projecteswcsummerschool
 
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student projectFIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student projecteswcsummerschool
 
Personal Tours at the British Museum - ESWC SSchool 14 - Student project
Personal Tours at the British Museum  - ESWC SSchool 14 - Student projectPersonal Tours at the British Museum  - ESWC SSchool 14 - Student project
Personal Tours at the British Museum - ESWC SSchool 14 - Student projecteswcsummerschool
 
Exhibition recommendation using British Museum data and Event Registry - ESWC...
Exhibition recommendation using British Museum data and Event Registry - ESWC...Exhibition recommendation using British Museum data and Event Registry - ESWC...
Exhibition recommendation using British Museum data and Event Registry - ESWC...eswcsummerschool
 
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...eswcsummerschool
 
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014 Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014 eswcsummerschool
 
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014eswcsummerschool
 
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014 Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014 eswcsummerschool
 
Mon norton tut_publishing01
Mon norton tut_publishing01Mon norton tut_publishing01
Mon norton tut_publishing01eswcsummerschool
 
Mon domingue introduction to the school
Mon domingue introduction to the schoolMon domingue introduction to the school
Mon domingue introduction to the schooleswcsummerschool
 
Mon norton tut_querying cultural heritage data
Mon norton tut_querying cultural heritage dataMon norton tut_querying cultural heritage data
Mon norton tut_querying cultural heritage dataeswcsummerschool
 
Tue acosta hands_on_providinglinkeddata
Tue acosta hands_on_providinglinkeddataTue acosta hands_on_providinglinkeddata
Tue acosta hands_on_providinglinkeddataeswcsummerschool
 
Thu bernstein key_warp_speed
Thu bernstein key_warp_speedThu bernstein key_warp_speed
Thu bernstein key_warp_speedeswcsummerschool
 
Fri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringFri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringeswcsummerschool
 
Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02eswcsummerschool
 
Mon domingue key_introduction to semantic
Mon domingue key_introduction to semanticMon domingue key_introduction to semantic
Mon domingue key_introduction to semanticeswcsummerschool
 
Wed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationsWed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationseswcsummerschool
 

Mais de eswcsummerschool (20)

Semantic Aquarium - ESWC SSchool 14 - Student project
Semantic Aquarium - ESWC SSchool 14 - Student projectSemantic Aquarium - ESWC SSchool 14 - Student project
Semantic Aquarium - ESWC SSchool 14 - Student project
 
Syrtaki - ESWC SSchool 14 - Student project
Syrtaki  - ESWC SSchool 14 - Student projectSyrtaki  - ESWC SSchool 14 - Student project
Syrtaki - ESWC SSchool 14 - Student project
 
Keep fit (a bit) - ESWC SSchool 14 - Student project
Keep fit (a bit)  - ESWC SSchool 14 - Student projectKeep fit (a bit)  - ESWC SSchool 14 - Student project
Keep fit (a bit) - ESWC SSchool 14 - Student project
 
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student project
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student projectArabic Sentiment Lexicon - ESWC SSchool 14 - Student project
Arabic Sentiment Lexicon - ESWC SSchool 14 - Student project
 
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student projectFIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
FIT-8BIT An activity music assistant - ESWC SSchool 14 - Student project
 
Personal Tours at the British Museum - ESWC SSchool 14 - Student project
Personal Tours at the British Museum  - ESWC SSchool 14 - Student projectPersonal Tours at the British Museum  - ESWC SSchool 14 - Student project
Personal Tours at the British Museum - ESWC SSchool 14 - Student project
 
Exhibition recommendation using British Museum data and Event Registry - ESWC...
Exhibition recommendation using British Museum data and Event Registry - ESWC...Exhibition recommendation using British Museum data and Event Registry - ESWC...
Exhibition recommendation using British Museum data and Event Registry - ESWC...
 
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
Empowering fishing business using Linked Data - ESWC SSchool 14 - Student pro...
 
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014 Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
 
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
 
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014 Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
Hands On: Amazon Mechanical Turk - M. Acosta - ESWC SS 2014
 
Mon norton tut_publishing01
Mon norton tut_publishing01Mon norton tut_publishing01
Mon norton tut_publishing01
 
Mon domingue introduction to the school
Mon domingue introduction to the schoolMon domingue introduction to the school
Mon domingue introduction to the school
 
Mon norton tut_querying cultural heritage data
Mon norton tut_querying cultural heritage dataMon norton tut_querying cultural heritage data
Mon norton tut_querying cultural heritage data
 
Tue acosta hands_on_providinglinkeddata
Tue acosta hands_on_providinglinkeddataTue acosta hands_on_providinglinkeddata
Tue acosta hands_on_providinglinkeddata
 
Thu bernstein key_warp_speed
Thu bernstein key_warp_speedThu bernstein key_warp_speed
Thu bernstein key_warp_speed
 
Fri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringFri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineering
 
Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02
 
Mon domingue key_introduction to semantic
Mon domingue key_introduction to semanticMon domingue key_introduction to semantic
Mon domingue key_introduction to semantic
 
Wed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationsWed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservations
 

Tue acosta tut_providing_linkeddata

  • 1. Providing Linked Data Presented by: Maribel Acosta
  • 2. LINKED DATA LIFECYCLE EUCLID -­‐ Querying Linked Data 2
  • 3. Linked Data Principles 1. Use URIs as names for things. 2. Use HTTP URIs so that users can look up those names. 3. When someone looks up a URI, provide useful informa6on, using the standards (RDF*, SPARQL). 4. Include links to other URIs, so that users can discover more things. CH 1 EUCLID -­‐ Providing Linked Data 3
  • 4. Linked Data Lifecycle Source: Sören Auer. “The SemanSc Data Web” (slides) Linked Data Lifecycle Source: José M. Alvarez. “My Linked Data Lifecycle” Source: Michael Hausenblas. “Linked Data lifeyclcle” EUCLID -­‐ Providing Linked Data 4
  • 5. Core Tasks for Providing Linked Data Based on the proposed LD lifecycles and the LD principles, we can idenSfy 3 main tasks for providing LD: ① Crea6ng: includes data extracSon, creaSon of HTTP URIs, and vocabulary selecSon. (LD principles 1 & 2) ② Interlinking: involves the creaSon of (RDF) links to external data sets. (LD principle 4) ③ Publishing: consists of creaSng the metadata and making the data set accessible. (LD principle 3) EUCLID -­‐ Providing Linked Data 5
  • 6. Agenda 1. Crea6ng Linked Data 2. Interlinking Linked Data 3. Publishing Linked Data 4. Linked Data publishing checklist EUCLID -­‐ Providing Linked Data 6
  • 7. CREATING LINKED DATA Linked Data principles 1 & 2 EUCLID -­‐ Querying Linked Data 7
  • 8. Extracting the Data • The data of interest may be stored in a wide range or formats: • Several tools support the process of mining data from different repositories, for example: EUCLID -­‐ Providing Linked Data 8 Spreadsheets or tabular data Databases Text R2RML
  • 9. Naming Things: URIs • All the things or disSnct enSSes within the data must be named • According to the Linked Data principles, the standard mechanism to name enSSes is the URI • Designing Cool URIs: – Leave out informaSon about the data regarding to: author, technologies, status, access mechanisms, … – Simplicity: short, mnemonic URIs – Stability: maintain the URIs as long as possible – Manageability: issue the URIs in a way that you can manage Source: hdp://www.w3.org/TR/cooluris/ EUCLID -­‐ Providing Linked Data 9
  • 10. Selecting Vocabularies • Vocabularies model the concepts and the rela6onship between them in a knowledge domain • Terms from well-­‐known vocabularies should be reused wherever possible • New terms should be define only if you can not find required terms in exisSng vocabularies • A large number of vocabularies in RDF are openly available, e.g., Linked Open Vocabularies (LOV) EUCLID -­‐ Providing Linked Data 10
  • 11. Selecting Vocabularies (2) Linked Open Vocabularies 322 vocabularies (2013) Source: hdp://lov.okfn.org/dataset/lov/ 458 vocabularies (2014) EUCLID -­‐ Providing Linked Data 11
  • 12. Selecting Vocabularies (3) Linked Open Vocabularies: Analyzing MusicOntology Source:hdp://lov.okfn.org/dataset/lov/details/vocabulary_mo.html EUCLID -­‐ Providing Linked Data 12
  • 13. Selecting Vocabularies (4) Other lists of well-­‐known vocabularies are maintained by: • W3C SWEO Linking Open Data community project hdp://www.w3.org/wiki/TaskForces/CommunityProjects/LinkingOpenData/ CommonVocabularies EUCLID -­‐ Providing Linked Data 13 • Library Linked Data Incubator Group: Vocabularies in the library domain hdp://www.w3.org/2005/Incubator/lld/XGR-­‐lld-­‐vocabdataset-­‐20111025
  • 14. Creating Linked Data (Quick Recipe) EUCLID -­‐ Providing Linked Data 14 • Source data (database, text, …) • Names for things (instances): URIs • Vocabularies – A new concept should be created only when it can not be found in exisSng vocabularies RDF dataset Steps:
  • 15. INTERLINKING LINKED DATA Linked Data principle 4 EUCLID -­‐ Providing Linked Data 15
  • 16. Interlinking Data Sets • It’s one of the Linked Data principles! 4. Include links to other URIs, so that users can • Involves discover the creaSon of RDF links between two different RDF data sets: – Links at instance level (rdfs:seeAlso, owl:sameAs) – Links at schema level (RDFS subclass/subproperty, OWL equivalent class/property, SKOS mapping proper6es) • Appropriate links are detected via link discovery EUCLID -­‐ Providing Linked Data 16 more things.
  • 17. Interlinking Data Sets (2) Challenges for link discovery • Linked Data sets are heterogeneous in terms of vocabularies, formats and data representaSon • Large range of knowledge domains • Scalability: LD is composed of a large number of data sets and RDF triples, hence it is not possible to compare every possible enSty pair Source: Robert Isele. “LOD2 Webinar Series: Silk” EUCLID -­‐ Providing Linked Data 17
  • 18. Interlinking Data Sets (3) Challenges for link discovery • An instance of link discovery is the en6ty resolu6on problem: deciding whether two en..es correspond to same object in the real world • Name ambigui6es: typos, misspellings, different languages, homonyms • Structural ambigui6es: same concepts/enSSes with different structures. Requires the applicaSon of ontology and schema matching techniques EUCLID -­‐ Providing Linked Data 18
  • 19. Interlinking Data Sets (4) EUCLID -­‐ Providing Linked Data 19 RDF data sets can be interlinked: Manually • Involves the manual exploraSon of LD data sets and their RDF resources to idenSfy linking targets • May not be feasible when the number of enSSes within the data set is very large (Semi-­‐)Automatically • Using tools that perform link discovery based on linkage rules, for example: Silk, Limes and xCurator
  • 20. owl:sameAs & rdfs:seeAlso (1) • owl:sameAs • Creates links between individuals • States that two URIs refer to the same individuals • rdfs:seeAlso • States that a resource may provide addiSonal informaSon about the subject resource • Links in MusicBrainz: – owl:sameAs is used for music arSsts – rdfs:seeAlso is used for albums EUCLID -­‐ Providing Linked Data 20
  • 21. owl:sameAs & rdfs:seeAlso (2) Examples of owl:sameAs and rdfs:seeAlso @prefix owl: <http://www.w3.org/2002/07/owl#> @prefix rdfs: <http://www.w3.org/2000/01/rdf-­‐schema#> @prefix dbpedia: <http://dbpedia.org/resource/> <http://musicbrainz.org/artist/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d> owl:sameAs dbpedia:The_Beatles. <http://musicbrainz.org/release/f8bf9e45-­‐a1d1-­‐42ab-­‐8f3c-­‐637b1762e5f7> rdfs:seeAlso dbpedia:All_you_need_is_love_(album). EUCLID -­‐ Providing Linked Data 21
  • 22. SKOS • Simple Knowledge OrganizaSon System – hdp://www.w3.org/TR/skos-­‐reference/ • Data model for knowledge organizaSon systems (thesauri, classificaSon scheme, taxonomies) • SKOS data is expressed as RDF triples • Allows the creaSon of RDF links between different data sets with the usage of mapping proper6es EUCLID -­‐ Providing Linked Data 22
  • 23. SKOS: Mapping Properties These properSes are used to link SKOS concepts (parScularly instances) in different schemas: • skos:closeMatch: links two concepts that are sufficiently similar (someSmes can be used interchangeably) • skos:exactMatch: indicates that the two concepts can be used interchangeably. • Axiom: It is a transi6ve property • skos:relatedMatch: states an associaSve mapping link between two concepts EUCLID -­‐ Providing Linked Data 23
  • 24. SKOS: Mapping Properties Example of SKOS exact match (2) @prefix skos: <http://www.w3.org/2004/02/skos/core#> @prefix mo: <http://purl.org/ontology/mo/> @prefix dbpedia-­‐ont: <http://dbpedia.org/ontology/> @prefix schema: <http://schema.org/> mo:MusicArtist skos:exactMatch dbpedia-­‐ont:MusicalArtist. mo:MusicGroup skos:exactMatch dbpedia-­‐ont:Band. mo:MusicGroup skos:exactMatch schema:MusicGroup. EUCLID -­‐ Providing Linked Data 24
  • 25. SKOS: Mapping Properties Example of SKOS close match (3) @prefix skos: <http://www.w3.org/2004/02/skos/core#> @prefix mo: <http://purl.org/ontology/mo/> @prefix dbpedia-­‐ont: <http://dbpedia.org/ontology/> @prefix schema: <http://schema.org/> mo:SignalGroup skos:closeMatch schema:MusicAlbum. mo:SignalGroup skos:closeMatch dbpedia-­‐ont:Album. EUCLID -­‐ Providing Linked Data 25
  • 26. SKOS: Mapping Properties Integrity conditions • Guarantee (4) consistency and avoid contradicSons in the relaSonships between SKOS concepts ParSal Mapping RelaSon diagram with integrity condiSons skos:Mapping Relation skos:close Match skos:related skos:exact Match Match Symmetric & Transi6ve Disjoint with EUCLID -­‐ Providing Linked Data 26 Symmetric
  • 27. Interlinking Linked Data (Quick Recipe) EUCLID -­‐ Providing Linked Data 27 • Create links at instance level • (Create links at schema level) Interlinked RDF dataset Steps:
  • 28. PUBLISHING LINKED DATA Linked Data principle 3 EUCLID -­‐ Providing Linked Data 28
  • 29. Publishing Linked Data Once the RDF data set has been created and interlinked, the publishing process involves the following tasks: 1. Metadata creaSon for describing the data set 2. Making the data set accessible 3. Exposing the data set in Linked Data repositories 4. Valida6ng the data set EUCLID -­‐ Providing Linked Data 29
  • 30. Describing RDF Data Sets • Consists of providing (machine-­‐readable) metadata of RDF data sets which can be processed by engines • This informaSon allows for: – Efficient and effecSve search of data sets – SelecSon of appropriate data sets (for consumpSon or interlinking) – Get general staSsScs of the data sets EUCLID -­‐ Providing Linked Data 30
  • 31. Describing RDF Data Sets (2) • The common language for describing RDF data sets is VoID (Vocabulary of Interlinked Data sets) • Defines an RDF data set with the predicate void:Dataset • Covers 4 types of metadata: • General metadata • Structural metadata • DescripSons of linksets • Access metadata EUCLID -­‐ Providing Linked Data 31
  • 32. VoID: General Metadata • General metadata is used by users to idenSfy appropriate data sets. • Specifies informaSon about descripSon of the data set, contact person/organizaSon, the license of the data set, data subject and some technical features. • VoID (re)uses predicates from the Dublin Core Metadata1 and FOAF2 vocabularies. 1 hdp://dublincore.org/documents/2010/10/11/dcmi-­‐terms/ 2 hdp://xmlns.com/foaf/spec/ EUCLID -­‐ Providing Linked Data 32
  • 33. VoID: General Metadata (2) Source: hdp://www.General Information Contains informaSon about the creaSon of the data set Predicate Range Descrip6on dcterms:title Literal Name of the data set. dcterms:description Literal DescripSon of the data set. dcterms:source RDF resource w3.org/TR/void/#metadata Source from which the data set was derived. dcterms:creator RDF resource Primarily responsible of creaSng the data set. dcterms:date xsd:date Time associated with an event in the life-­‐cycle of the resource. dcterms:created xsd:date Date of creaSon of the data set. dcterms:issued xsd:date Date of publicaSon of the data set. dcterms:modified xsd:date Date on which the data set was changed. foaf:homepage Literal Name of the data set. dcterms:publisher RDF resource EnSty responsible for making the data set available. dcterms:contributor RDF resource EnSty responsible for making contribuSons to the data set. EUCLID -­‐ Providing Linked Data 33
  • 34. VoID: General Metadata (3) Other Information • License of the data set: specifies the usage condiSons of the data. The license can be pointed with the property dcterms:license • Category of the data set: to specify the topics or domains covered by the data set, the property dcterms:subject can be used • Technical features: the property void:feature can be used to express technical properSes of the data (e.g. RDF serializaSon formats) EUCLID -­‐ Providing Linked Data 34
  • 35. VoID: Structural Metadata EUCLID -­‐ Providing Linked Data 35 • Provides high-­‐level informaSon about the internal structure of the data set • This metadata is useful when exploring or querying the data set • Includes informaSon about resources, vocabularies used in the data set, staSsScs and examples of resources in the data set
  • 36. VoID: Structural Metadata (2) Information about resources • Example resources: allow users to get an impression of the kind of resources included in the data set. Examples can be shown with the property void:exampleResource :MusicBrainz a void:Dataset; void:exampleResource <http://musicbrainz.org/artist/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d> . EUCLID -­‐ Providing Linked Data 36 • Padern for resource URIs: the void:uriSpace property can be used to state that all the enSty URIs in a data set start with a given string :MusicBrainz a void:Dataset; void:uriSpace "http://musicbrainz.org/" .
  • 37. VoID: Structural Metadata (3) Vocabularies used in the data set • The void:vocabulary property idenSfies the vocabulary or ontology that is used in a data set :MusicBrainz a void:Dataset; void:vocabulary <http://purl.org/ontology/mo/> . EUCLID -­‐ Providing Linked Data 37 • Typically, only the most relevant vocabularies are listed • This property can only be used for enSre vocabularies. It cannot be used to express that a subset of the vocabulary occurs in the data set.
  • 38. VoID: Structural Metadata (4) Statistics about a data set Express numeric staSsScs about a data set: Predicate Range Descrip6on void:triples Number Source: hdp://www.w3.org/TR/void/#metadata Total number of triples contained in the data set. EUCLID -­‐ Providing Linked Data 38 void:entities Number Total number of enSSes that are described in the data set. An enSty must have a URI, and match the void:uriRegexPadern void:classes Number Total number of disSnct classes in the data set. void:properties Number Total number of disSnct properSes in the data set. void:distinctSubjects Number Total number of disSnct subjects in the data set. void:distinctObjects Number Total number of disSnct objects in the data set. void:documents Number Total number of documents, in case that the data set is published as a set of individual documents.
  • 39. VoID: Structural Metadata (5) Partitioned data sets • The :MusicBrainz a void:Dataset; void:subset :MusicBrainzGroupMembers . EUCLID -­‐ Providing Linked Data 39 void:subset property provides descripSon of parts of a data set • Data sets can be parSSoned based on classes or proper6es: • void:classPartition contains only instances of a parScular class • void:propertyPartition contains only triples with a parScular predicate :MusicBrainzGroupMembers a void:Dataset; void:classPartition [ void:class mo:MusicArtist .] ; void:propertyPartition [ void:property mo:member .] .
  • 40. VoID: Describing Linksets @PREFIX void:<http://rdfs.org/ns/void#> @PREFIX owl:<http://www.w3.org/2002/07/owl#> :DS1 a void:Dataset . :DS2 a void:Dataset . :DS1 void:subset :LS1 . :LS1 a void:Linkset; owl:sameAs void:linkPredicate owl:sameAs; void:target :DS1, :DS2 . EUCLID -­‐ Providing Linked Data 40 • Linkset: collecSon of RDF links between two RDF data sets :DS1 :DS2 :LS1 :LS2 Image based on hdp://semanScweb.org/wiki/File:Void-­‐linkset-­‐conceptual.png
  • 41. VoID: Describing Linksets (2) EUCLID -­‐ Providing Linked Data 41 Example @PREFIX void:<http://rdfs.org/ns/void#> @PREFIX owl:<http://www.w3.org/2002/07/owl#> :MusicBrainz a void:Dataset . :DBpedia a void:Dataset . :MusicBrainz void:classPartition :MBArtists . :MBArtists void:class mo:MusicArtist . :MBArtists a void:Linkset; void:linkPredicate owl:sameAs; void:target :MusicBrainz, :DBpedia .
  • 42. VoID: Access Metadata The access metadata describes the methods of accessing the actual RDF data set Method Predicate Descrip6on * This assumes that the default graph of the SPARQL endpoint contains the data set. VoID cannot express that a data set is contained in a specific named graph. This can be specified with SPARQL 1.1. Service DescripSon EUCLID -­‐ Providing Linked Data 42 URI look up endpoint void:uriLookupEndpoint Specifies the URI of a service for accessing the data set (different from the SPARQL protocol) Root resource void:rootResource URI of the top concepts (only for data sets structured as trees) SPARQL endpoint void:sparqlEndpoint Provides access to the data set via the SPARQL protocol.* RDF data dumps void:dataDump Specifies the locaSon of the dump file. If the data set is split into mulSple files, then several values of this property are provided. CH 5
  • 43. Providing Access to the Data Set The data set can be accessed via different mechanisms: RDFa RDF dump SPARQL endpoint EUCLID -­‐ Providing Linked Data 43 Dereferencing HTTP URIs
  • 44. Dereferencing HTTP URIs • Allows for easily exploring certain resources contained in the data set • What to return for a URI? • Immediate descrip6on: CH 1 triples where the URI is the subject. • Backlinks: triples where the URI is the object. • Related descrip6ons: informaSon of interest in typical usage scenarios. • Metadata: informaSon as author and licensing informaSon. • Syntax: RDF descripSons as RDF/XML and human-­‐readable formats. • ApplicaSons (e.g. LD browsers) render the retrieved informaSon so it can be perceived by a user. EUCLID -­‐ Providing Linked Data 44 Source: How to Publish Linked Data on The Web -­‐ Chris Bizer, Richard Cyganiak, Tom Heath.
  • 45. Dereferencing HTTP URIs (2) Example: Dereferencing http://dbpedia.org/resource/The_Beatles EUCLID -­‐ Providing Linked Data 45
  • 46. RDFa • RDFa = “RDF in adributes” • Extension to HTML5 for embedding RDF within HTML pages: – The HTML is processed by the browser, the (human) consumer doesn’t see the RDF data – The RDF triples within the page are consumed by APIs to extract the (semi-­‐)structured data • It is considered as the bridge between the Web of Data and the Web of Documents • It is a complete serializa6on of RDF EUCLID -­‐ Providing Linked Data 46
  • 47. RDFa: Attributes A_ribute role A_ribute Descrip6on prefix Syntax List of prefix-­‐name IRIs pairs vocab IRI that specifies the vocabulary where the concept is defined Subject about Specifies the subject of the relaSonship Predicate property Express the relaSonship between the subject and the value rel Defines a relaSon between the subject and a URL rev Express reverse relaSonships between two resources Resource href Specifies an object URI for the rel and rev adributes resource Same as href (used when href is not present) src Specifies the subject of a relaSonship Literal datatype Express the datatype of the object of the property adribute content Supply machine-­‐readable content for a literal xml:lang, lang Specifies the language of the literal Macro typeof Indicate the RDF type(s) to associate with a subject inlist An object is added to the list of a predicate. EUCLID -­‐ Providing Linked Data 47
  • 48. RDFa: Example Extracting RDF from HTML EUCLID -­‐ Providing Linked Data 48 HTML (+RDFa): <div class="arSstheader" about="hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d#_" typeof="hdp://purl.org./ontology/mo/MusicGroup"> … </div> RDF: <hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d#_>
  • 49. RDFa: Example Extracting RDF from HTML EUCLID -­‐ Providing Linked Data 49 HTML (+RDFa): <div class="arSstheader" about="hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d#_" typeof="hdp://purl.org./ontology/mo/MusicGroup"> … </div> RDF: <hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d#_> <hdp://www.w3.org/1999/02/22-­‐rdf-­‐syntax-­‐ns#type>
  • 50. RDFa: Example Extracting RDF from HTML EUCLID -­‐ Providing Linked Data 50 HTML (+RDFa): <div class="arSstheader" about="hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d#_" typeof="hdp://purl.org./ontology/mo/MusicGroup"> … </div> RDF: <hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d#_> <hdp://www.w3.org/1999/02/22-­‐rdf-­‐syntax-­‐ns#type> <hdp://purl.org./ontology/mo/MusicGroup>.
  • 51. RDFa: Example (2) Extracting RDF from MusicBrainz.org EUCLID -­‐ Providing Linked Data 51 hdp://musicbrainz.org/arSst/b10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d
  • 52. RDFa: Example (2) Extracting RDF from MusicBrainz.org EUCLID -­‐ Providing Linked Data 52 Source: hdp://www.w3.org/2007/08/pyRdfa/
  • 53. RDFa: Example (2) Extracting RDF from MusicBrainz.org EUCLID -­‐ Providing Linked Data 53 hdp://www.w3.org/2007/08/pyRdfa/extract?uri=hdp%3A%2F%2Fmusicbrainz.org %2FarSst%2Fb10bbbfc-­‐cf9e-­‐42e0-­‐be17-­‐e2c3e1d2600d&format=nt Watch the EUCLID screencast: http://vimeo.com/euclidproject
  • 54. RDF Dump • An RDF dump refers to a file which contains (part of) a data set specified in an RDF serializaSon • The data set can be split into several RDF dumps • A list of available data sets available as RDF dumps can be found at: – hdp://www.w3.org/wiki/DataSetRDFDumps EUCLID -­‐ Providing Linked Data 54
  • 55. SPARQL Endpoint • The SPARQL endpoint refers to the URI of the listener of the SPARQL protocol service, which handles requests for SPARQL protocol operaSons • The user submits SPARQL queries to the SPARQL endpoint in order to retrieve only a desired subset of the RDF data set • List CH 2 of available SPARQL endpoints: • hdp://www.w3.org/wiki/SparqlEndpoints • hdp://sparqles.okfn.org/ EUCLID -­‐ Providing Linked Data 55
  • 56. SPARQLES hdp://sparqles.okfn.org/ EUCLID -­‐ Providing Linked Data 56
  • 57. Using Linked Data Catalogs • Data catalogs, markets or repositories are pla{orms dedicated to provide visibility to a wide range of data sets from different domains • Allow data consumers to easily find and use the data • Usually the catalogs offer relevant metadata about the creaSon of the data set EUCLID -­‐ Providing Linked Data 57
  • 58. Using Linked Data Catalogs (2) How to publish an RDF data set into a catalog? Create your own data EUCLID -­‐ Providing Linked Data 58 catalog Recommended for big organizaSons/insStuSons aiming at providing a large number of data sets Use a data management system, for example: Upload your data set into an exisSng catalog Allows data consumers to easily find new data sets Common LD catalogs are: -­‐ -­‐ The Linking Open Data Cloud
  • 59. Validating Data Sets There are different ways to validate the published RDF data set: EUCLID -­‐ Providing Linked Data 59 Parsing & Syntax General validators • Vapour -­‐ Performs two types of tests: without content negoSaSon and requesSng RDF/XML content hdp://validator.linkeddata.org/vapour • URI Debugger -­‐ Retreieves the HTTP responses of accessing a URI hdp://linkeddata.informaSk.hu-­‐berlin.de/uridbg/ • RDF Triple-­‐Checker – Dereferences namespaces associated with the resources used in the document hdp://graphite.ecs.soton.ac.uk/checker/ • W3C RDF/XML Valida6on Service – Evaluates the syntax of RDF/ XML documents and displays the RDF triples in it hdp://validator.linkeddata.org/vapour • W3C Markup Valida6on Service – Checks syntacSc correctness for web documents with RDFa markup hdp://validator.w3.org/ • RDF:ALERTS – Validates syntax, undefined resources, datatype and other types of errors hdp://swse.deri.org/RDFAlerts/ Accessibility
  • 60. Validating Data Sets (2) Example: Validating URIs with Vapour Source: hdp://idi.fundacioncSc.org/vapour EUCLID -­‐ Providing Linked Data 60
  • 61. Validating Data Sets (3) Example: Validating URIs with Vapour Source: hdp://idi.fundacioncSc.org/vapour EUCLID -­‐ Providing Linked Data 61
  • 62. Publishing Linked Data (Quick Recipe) EUCLID -­‐ Providing Linked Data 62 • Create dataset metadata • Provide access mechanisms: – RDF dump – SPARQL endpoint • (Expose the dataset in a catalog) • Validate everything! Published RDF dataset Steps:
  • 63. PROVIDING LINKED DATA: CHECKLIST EUCLID -­‐ Providing Linked Data 63
  • 64. Providing Linked Data: Checklist (1) Creating Linked Data o All the relevant enSSes/concepts were effecSvely extracted from the raw data ? o Are all the created URIs dereferenceable? o Are you reusing terms from widely accepted vocabularies? EUCLID -­‐ Providing Linked Data 64
  • 65. Providing Linked Data: Checklist (2) Interlinking Linked Data o Is the data set linked to other RDF data sets? o Are the created vocabulary terms linked to other vocabularies? EUCLID -­‐ Providing Linked Data 65
  • 66. Providing Linked Data: Checklist (3) Publishing Linked Data o Do you provide data set metadata? o Do you provide informaSon about licensing? o Do you provide addiSonal access methods? o Is the data set available in LD catalogs? o Did the data set pass the validaSon tests? EUCLID -­‐ Providing Linked Data 66
  • 67. Summary In this chapter we studied: EUCLID -­‐ Providing Linked Data 67 • The Linked Data lifecycle: • 3 core tasks: creaSng, interlinking and publishing • CreaSon of Linked Data: • ExtracSng relevant data, using URIs to name enSSes and selecSng vocabularies and expressing the data using the RDF data model • Interlinking Linked Data: • Challenges of link discovery, using Silk to create links between two data sets and using SKOS links • Publishing Linked Data: • CreaSon of data set metadata; publishing the data set via RDF dumps, SPARQL endpoints or RDFa; uploading the data set to a LD catalog
  • 68. For exercises, quiz and further material visit our website: http://www.euclid-­‐project.eu EUCLID -­‐ Providing Linked Data 68 eBook @euclid_project euclidproject euclidproject Other channels: Course