9. How to connect data?
Google Refne
Energy
consumption
Crime
data Unifying Linking to Publishing
Converting
and other data on the
cleaning
to RDF
sources Web WWW
Census
data
Storing
10. What for?
Clean up raw data Link data to registry
• Identify duplicates • e.g. Freebase
• Discover patterns
Make data consistent
• Spot inconsistencies
11.
12. Formats supported
TSV, CSV or values separated by a custom separator you specify
Excel
Google spreadsheets
XML, RDF as XML
RDF N3 files
JSON
13. Google Refine
Energy
consumption
Crime
data Unifying Linking to Publishing
Converting
and other data on the
cleaning
to RDF
sources Web WWW
Borough
profiles
Storing
Reconciliation
Building a
skeleton
14.
15. Google Refine
Energy
consumption
Crime
data Unifying Linking to Publishing
Converting
and other data on the
cleaning
to RDF
sources Web WWW
Census
data
Storing
Reconciliation
Building a
skeleton
16. London boroughs
has label
Dataset
has structure
components
Specified by
Defined by
properties
Defined by columns
in the table
have label
represent type
Header of
the column
e.g. measure
17. London boroughs
has label
Dataset
has structure
components
Specified by
Defined by
properties
Defined by columns
in the table
have label
represent type
Header of
the column
e.g. measure
19. London boroughs
has label
Dataset
has structure
components
Specified by
Defined by
properties
Defined by columns
in the table
Cube vocabulary represent type
have label
Header of
the column
e.g. measure
20. London boroughs
rdfs:label
qb:dataSet
qb:structure
qb:component
qb:ComponentSp
qb:DataStructu
ecification
reDefinition
e.g. qb:measure
qb:ComponentProperty
rdfs:label
e.g. qb: MeasureProperty
“Area name”
e.g. measure
21. Google Refine
Energy
consumption
Crime
data Unifying Linking to Publishing
Converting
and other data on the
cleaning
to RDF
sources Web WWW
Census
data
Storing
Reconciliation
Defining structure
Building a
skeleton
Filling with values
22. Pubby
Silk
Google Refine
Energy
consumption
Crime
data Unifying Linking to Publishing
Converting
and other data on the
cleaning
to RDF
sources Web WWW
Census
data
Storing
Fuseki
23. London boroughs
rdfs:label
qb:structure qb:DataStruc qb:component qb:Compon
qb:dataSe
tureDefinitio entSpecific
t
n ation
qb:Compo … e.g. qb:measure
nentSpecif
ication
qb:Compo … qb:ComponentPro
nentSpecifi
cation
qb:Compo … perty
nentSpecifi
cation
qb:Compo … rdfs:label
nentSpecifi
Energy consumption cation
e.g. qb: MeasureProperty
rdfs:label
qb:DataStruc qb:component qb:Compon “Area name”
qb:dataSe qb:structure
tureDefinitio entSpecific
t
n ation
qb:Compo … e.g. qb:measure e.g. measure
nentSpecif
ication
qb:Compo … qb:ComponentPro
nentSpecifi
cation
qb:Compo … perty
nentSpecifi
cation Area name URI
qb:Compo … rdfs:label
nentSpecifi
cation
Crime rates in boroughs e.g. qb: MeasureProperty
rdfs:label
“LAU1 Area”
qb:structure qb:DataStruc qb:component qb:Compon
qb:dataSe
tureDefinitio entSpecific
t
n ation
e.g. measure
qb:Compo … e.g. qb:measure
nentSpecif
ication
qb:Compo … qb:ComponentPro
nentSpecifi
cation
qb:Compo … perty
nentSpecifi Area name URI
cation
qb:Compo … rdfs:label
nentSpecifi
cation
e.g. qb: MeasureProperty
“Borough”
e.g. measure
Area name URI
24. London boroughs
rdfs:label
qb:structure qb:DataStruc qb:component qb:Compon
qb:dataSe
tureDefinitio entSpecific
t
n ation
qb:Compo … e.g. qb:measure
nentSpecif
ication
qb:Compo … qb:ComponentPro
nentSpecifi
cation
qb:Compo … perty
nentSpecifi
cation
qb:Compo … rdfs:label
nentSpecifi
cation
e.g. qb: MeasureProperty
qb:Observation
qb:dataSet
Crime rates in boroughs “Area name”
rdfs:label
qb:structure qb:DataStruc qb:component qb:Compon
qb:dataSe
tureDefinitio entSpecific e.g. measure
t
n ation
qb:Compo … e.g. qb:measure
nentSpecif
ication
qb:Compo … qb:ComponentPro Area name URI
nentSpecifi
cation
qb:Compo … perty http://example.org/def/statistical-dimension/area-name
nentSpecifi
cation
qb:Compo … rdfs:label
nentSpecifi
cation
e.g. qb: MeasureProperty
qb:Observation
qb:dataSet
“Borough”
e.g. measure
Area name URI
http://example.org/def/statistical-dimension/area-name