Linked Data: some social challenges
1. Linked Data: some tech & social challenges
michele barbera
<barbera@netseven.it> @barbz79it
92. some tech issues
107. relationships
Class Level
http://test01.sindice.net/szydan/dataset-view/dataset/default/www.bbc.co.uk
by Giovanni Tummarello
113. relationships
http://test01.sindice.net/szydan/dataset-view/dataset/default/www.bbc.co.uk
by Giovanni Tummarello
127. SIREn
Data Collection: 500M web data documents (RDF, RDFa, Microformat, etc.); 200K datasets; 50B triples
Settings: Cluster of 4 nodes; 2 nodes for indexing; 2 nodes for querying; Replication
Indexing Performance: Full index construction takes approx 24 hours; 436K triples / second
Services: Keyword and structured queries; Dataset search; 99% uptime
131. SIREn
Data Collection: 500M web data documents (RDF, RDFa, Microformat, etc.); 200K datasets; 50B triples
Settings: Cluster of 4 nodes; 2 nodes for indexing; 2 nodes for querying; Replication
Indexing Performance: Full index construction takes approx 24 hours; 436K triples / second
Services: Keyword and structured queries; Dataset search; 99% uptime
spaziodati.3scale.net
189. tables

u_id  f_id
1     2
1     3
3     4
4     3

id  name     age  affiliation
1   Michele  33   net7
2   Mario    32   unipi
3   Silvia   28   unifi
4   Irene    27   unitn

Institution  City
net7         pisa
unipi        pisa
unifi        firenze
unitn        trento
192. graphs?
[Graph diagram. Nodes: michele, net7, unipi, unifi, unitn, pisa, Firenze, Trento. Edge labels: friend, works, place.]
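The step from the tables slide to the graph slide can be sketched in a few lines of Python. This is only an illustration, not the author's method: the rows are copied from the three tables, the edge labels `friend`, `works` and `place` are the ones visible on the graph slide, and the function name `tables_to_edges`, the lowercase node names and the choice to leave out the `age` column are assumptions made for the example.

```python
# Rows copied from the three tables on the "tables" slide.
people = [
    (1, "Michele", 33, "net7"),
    (2, "Mario", 32, "unipi"),
    (3, "Silvia", 28, "unifi"),
    (4, "Irene", 27, "unitn"),
]
friends = [(1, 2), (1, 3), (3, 4), (4, 3)]  # (u_id, f_id)
institutions = [
    ("net7", "pisa"),
    ("unipi", "pisa"),
    ("unifi", "firenze"),
    ("unitn", "trento"),
]


def tables_to_edges(people, friends, institutions):
    """Turn the relational rows into (subject, label, object) edges.

    Hypothetical helper: foreign keys become edges, and numeric ids are
    replaced by lowercase names so the nodes read like the graph slide.
    The age column is not mapped (an assumption of this sketch).
    """
    names = {pid: name.lower() for pid, name, _age, _aff in people}
    edges = []
    for pid, _name, _age, affiliation in people:
        edges.append((names[pid], "works", affiliation))
    for u_id, f_id in friends:
        edges.append((names[u_id], "friend", names[f_id]))
    for institution, city in institutions:
        edges.append((institution, "place", city))
    return edges


edges = tables_to_edges(people, friends, institutions)

# Following edges replaces the joins: who are michele's friends?
michele_friends = [o for s, label, o in edges if s == "michele" and label == "friend"]
```

Each foreign key in the tables becomes one labelled edge, so a question that needs a join across tables turns into a walk along edges.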
231. AAA
library wikidb
scholarly
276. caution!
http://universities.org/italy#cnr
ns:president a_person
ns:department some_department
ns:department some_department
owl:sameAs
http://www.example.com/cnr
ns:creator jonnhy
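A small sketch may help show why the slide says caution. It assumes a reading of the slide layout in which the `ns:president` and `ns:department` statements belong to `http://universities.org/italy#cnr` and the `ns:creator` statement belongs to `http://www.example.com/cnr`. The helper `apply_same_as` is hypothetical and deliberately simplified: it copies statements across each `owl:sameAs` pair in one pass, in subject position only, without the transitive closure a real reasoner would compute.

```python
SAME_AS = "owl:sameAs"
CNR_ORG = "http://universities.org/italy#cnr"
CNR_PAGE = "http://www.example.com/cnr"

# Statements from the slide; which URI each one attaches to is an assumption.
triples = {
    (CNR_ORG, "ns:president", "a_person"),
    (CNR_ORG, "ns:department", "some_department"),
    (CNR_ORG, SAME_AS, CNR_PAGE),
    (CNR_PAGE, "ns:creator", "jonnhy"),
}


def apply_same_as(triples):
    """Copy every statement across each owl:sameAs pair (single pass)."""
    inferred = set(triples)
    pairs = [(s, o) for s, p, o in triples if p == SAME_AS]
    for a, b in pairs:
        for s, p, o in triples:
            if p == SAME_AS:
                continue
            if s == a:
                inferred.add((b, p, o))
            if s == b:
                inferred.add((a, p, o))
    return inferred


merged = apply_same_as(triples)

# After merging, the first URI also carries the ns:creator statement,
# and the second URI also carries the ns:president statement.
org_has_creator = (CNR_ORG, "ns:creator", "jonnhy") in merged
page_has_president = (CNR_PAGE, "ns:president", "a_person") in merged
```

Because `owl:sameAs` asserts that both URIs name the same thing, every statement about one becomes a statement about the other, whether or not the publisher of either dataset intended that.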
322. D a ta
d i e my?
k o
n n
a
368. BIG DATA AND INFO OVERLOAD
5 billion mobile phones in use in 2010
30 billion pieces of content shared on facebook every month
40% projected growth in global data generated per year vs 5%
235 terabytes data collected by US Library of Congress in April 2011
15 out of 17 sectors in US have more data stored per company than the US Library of Congress
$300 billion potential annual value to US health care (more than double the total annual health care spending in Spain)
$250 billion potential annual value to Europe's public sector administration - more than GDP of Greece
$600 billion potential annual consumer surplus from using personal location data globally
60% potential increase in retailers' operating margins possible with big data
140,000-190,000 more deep analytical talent positions and 1.5 million more data-savvy managers needed to take full advantage of big data in the United States alone
396. unprecedented