SlideShare uma empresa Scribd logo
1 de 14
Baixar para ler offline
1-5 stars: Metadata on the Openness
Level of Open Data Sets in Europe
Sébastien Martin, Muriel Foulonneau, Slim Turki
Context & Objectives
•
•
•
•

Level of reuse of open data is still disappointing.
Development of open data requires a better reusability of data.
Degree of openness is a key success factor.
Catalogs listing data have a crucial role.

Analyse PublicData.eu catalogue
(i) identify the quality of a sample of metadata properties, which
are critical to enable data reuse
(ii) study the stated level of data openness.

21/11/2013

1-5 stars: Metadata on the Openness Level of
Open Data Sets in Europe

2
PublicData.eu
•

•

Many local and national portals to provide access to public sector open
datasets - 114 EU catalogues on datacatalogs.org
Gather datasets across geographic and institutional boundaries

PublicData.eu
•
•
•
•
•
•

pan-European catalogue launched under the FP7 LOD2 project.
aggregates data from CKAN open data catalogues all over Europe.
collects data from 26 sources
1st to be published in Europe in 2011
data beyond the European Union, e.g., Serbian datasets.
not exhaustive, it represents a unique aggregation of European datasets.

•
•

17.027 datasets
UK: largest provider

21/11/2013

3
Methodology
Descriptions of datasets collected in May 2013
236 distinct dataset properties identified, partially due to
•
•

linguistic diversity; some providers adapt property names in their language
problems of consistency in naming (upper / lower case, spaces /
underscore for a single field).

Major challenge to understand the content of the PublicData.eu
Data collected and analysed to identify information made available
on data openness and reusability in particular the licensing
conditions and the data formats.

21/11/2013

1-5 stars: Metadata on the Openness Level of
Open Data Sets in Europe

4
Tim Berners-Lee’s evaluation scale

★

Available on the web (whatever format) but with an
open license, to be Open Data

★★ Available as machine-readable structured data
★★★ 2 + non-proprietary format

★★★★
★★★★★

21/11/2013

3 + Use open standards from W3C (RDF and SPARQL)
to identify things
4 + Link your data to other people’s data to provide
context

1-5 stars: Metadata on the Openness Level of
Open Data Sets in Europe

5
★ Data Licences
13.535 / 17.027 datasets have at least 1 license indication
12.470 datasets can be considered having some form of open
license  73,24%
769 datasets have a Creative Commons license
Significant number of datasets have a national license:
•
•
•

apie v2 to publish information created by French public authorities
UK-crown which “covers material created by civil servants, ministers and
government departments and agencies” in the UK,
UK Open Government License

128 datasets with an explicitly closed license

21/11/2013

1-5 stars: Metadata on the Openness Level of
Open Data Sets in Europe

6
★★ Machine readable format
• Facilitates data reusability
• 4.051 / 17.027 with
content_TYPE
• 11.285 with at least one
indication about format
• 56 datasets in RDF
• Dominant proportion of
spreadsheets type’s formats
Distribution of formats

40% not a machine readable format
34% of datasets available in a machine readable format
 machine readability cond. for openness levels of 2★ and >
21/11/2013

1-5 stars: Metadata on the Openness Level of
Open Data Sets in Europe

7
★★★ Use of non-proprietary formats
Creates ambiguities as the openness nature of formats can be
debated in some cases:
•
•

Certain formats are proprietary but their specifications are open.
Some formats have been open at a certain point of time but additions and
further evolutions remain proprietary

In many cases, value of property was too vague to determine
whether the format was or not proprietary.
It was possible to identify:
•
•

For 49% of the datasets, a non-proprietary format
For 21% a proprietary format.

Use of proprietary formats is a critical issue for improving the
level of openness of datasets.

21/11/2013

1-5 stars: Metadata on the Openness Level of
Open Data Sets in Europe

8
★★★★ Use of open standards from
W3C
Including HTML, XML, and RDF in particular.
•

XML-based formats may be entirely independent from W3C (e.g. KML)

Availability in W3C standards: 9,5% of datasets
Availability in XML based formats: 10%

Information remains unknown in most cases

21/11/2013

1-5 stars: Metadata on the Openness Level of
Open Data Sets in Europe

9
★★★★★ Linked data
Linked data are only mentioned in the description of a single
dataset (Brandweer Amsterdam-Amstelland Uitrukberichten)
for which the format is described as “linked data api, rdf json”.
58 datasets mention RDF (or RDFa) as a format or content type,
i.e., 0,34%.

21/11/2013

1-5 stars: Metadata on the Openness Level of
Open Data Sets in Europe

10
Level of openness (1/2)
6.891 / 17.027 datasets show at least one information about their
degree of openness.
All come from Data.gov.uk (8 689 datasets)
For a majority of datasets, the level of openness is unknown.
•

21/11/2013

Coherent with lack of licensing information without which it is impossible
to conclude on even ★ openness level.

1-5 stars: Metadata on the Openness Level of
Open Data Sets in Europe

Distribution of openness levels in UK datasets

11
Level of openness (2/2)
Approximate level of openness derived from licensing and format
properties
•
•

73,24% of the datasets should have ★ or above.
Reference to 5★ should take into consideration linkages, cannot be
inferred from dataset metadata.

Level of openness according
to Format and License
related properties

Data openness mainly related to 1st level of compliance: licensing
issue.
•
21/11/2013

Data providers have clearly not focused on publication of data in reusable
formats.
1-5 stars: Metadata on the Openness Level of
12
Open Data Sets in Europe
Conclusion
• Limited openness of datasets advertised as open data
• Heterogeneity of associated metadata
 Difficulty for reusers to (i) discover datasets, despite the
creation of large catalogues of datasets, and to (ii) effectively
reuse machine readable and contextualized data.
★ may be sufficient to ensure transparency of gov. action,
facilitating reuse of data through services is not served below 2★
Confirmed risks regarding major challenges that data providers
have to face: (i) language barrier and (ii) lack of consistency of
metadata.
Harmonization of practices, training and tools necessary to
ensure that datasets are available in relevant formats.

21/11/2013

1-5 stars: Metadata on the Openness Level of
Open Data Sets in Europe

13
1-5 stars: Metadata on the Openness
Level of Open Data Sets in Europe
Sébastien Martin, Muriel Foulonneau, Slim Turki

Contact:

muriel.foulonneau@tudor.lu

Mais conteúdo relacionado

Mais procurados

On chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurementsOn chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurementsNina Jeliazkova
 
OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...
OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...
OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...Pedro Príncipe
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the webChiara Del Vescovo
 
Information Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative CampInformation Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative CampWim Peters
 
Linked Data Notifications Distributed Update Notification and Propagation on ...
Linked Data Notifications Distributed Update Notification and Propagation on ...Linked Data Notifications Distributed Update Notification and Propagation on ...
Linked Data Notifications Distributed Update Notification and Propagation on ...Aksw Group
 
Tonex's Link 16 Operational Overview Training
Tonex's Link 16 Operational Overview TrainingTonex's Link 16 Operational Overview Training
Tonex's Link 16 Operational Overview TrainingTonex
 
Automatics and Remote Control
Automatics and Remote ControlAutomatics and Remote Control
Automatics and Remote ControlVisionary_
 
Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items
 Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items
Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata ItemsLviv Data Science Summer School
 

Mais procurados (10)

On chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurementsOn chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurements
 
OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...
OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...
OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...
 
Euro lipids 2014_graz
Euro lipids 2014_grazEuro lipids 2014_graz
Euro lipids 2014_graz
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the web
 
Information Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative CampInformation Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative Camp
 
Linked Data Notifications Distributed Update Notification and Propagation on ...
Linked Data Notifications Distributed Update Notification and Propagation on ...Linked Data Notifications Distributed Update Notification and Propagation on ...
Linked Data Notifications Distributed Update Notification and Propagation on ...
 
Krakow2010
Krakow2010Krakow2010
Krakow2010
 
Tonex's Link 16 Operational Overview Training
Tonex's Link 16 Operational Overview TrainingTonex's Link 16 Operational Overview Training
Tonex's Link 16 Operational Overview Training
 
Automatics and Remote Control
Automatics and Remote ControlAutomatics and Remote Control
Automatics and Remote Control
 
Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items
 Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items
Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items
 

Destaque

How to resize facebook photos using pic monkey
How to resize facebook photos using pic monkeyHow to resize facebook photos using pic monkey
How to resize facebook photos using pic monkeysweetaunzo
 
COURRIER CAB 31 MD
COURRIER CAB 31 MDCOURRIER CAB 31 MD
COURRIER CAB 31 MDComPol
 
Google glass
Google glassGoogle glass
Google glasscolegioyo
 
Геомаркетинг Геомаркетинговые исследования
Геомаркетинг Геомаркетинговые исследованияГеомаркетинг Геомаркетинговые исследования
Геомаркетинг Геомаркетинговые исследованияgeo-marketing
 
Asat book0-fresh blood
Asat book0-fresh bloodAsat book0-fresh blood
Asat book0-fresh bloodAshraf Ali
 
Carta de Oneida Pinto A El Espectador
Carta de Oneida Pinto A El EspectadorCarta de Oneida Pinto A El Espectador
Carta de Oneida Pinto A El EspectadorPrensaOneidaPinto
 
Aulbrey Meade - Surgical Tech RESUME
Aulbrey Meade - Surgical Tech RESUMEAulbrey Meade - Surgical Tech RESUME
Aulbrey Meade - Surgical Tech RESUMEAulbrey Meade
 
Buruketak 3.1.
Buruketak 3.1.Buruketak 3.1.
Buruketak 3.1.auldreikie
 
New Barco ClickShare CMS-1
New Barco ClickShare CMS-1New Barco ClickShare CMS-1
New Barco ClickShare CMS-1Paul Richards
 
Making the cut - Roberta Lucca, Bossa
Making the cut - Roberta Lucca, BossaMaking the cut - Roberta Lucca, Bossa
Making the cut - Roberta Lucca, BossaLondonGamesConference
 
Sistemas de equações de 1º grau - Como fazer + exercicios
Sistemas de equações de 1º grau - Como fazer + exerciciosSistemas de equações de 1º grau - Como fazer + exercicios
Sistemas de equações de 1º grau - Como fazer + exerciciosAna Tapadinhas
 
Гаражи, Чернигов , ул. Пушкина
Гаражи, Чернигов , ул. ПушкинаГаражи, Чернигов , ул. Пушкина
Гаражи, Чернигов , ул. ПушкинаAlexander Gashpar
 

Destaque (20)

The star system
The star systemThe star system
The star system
 
How to resize facebook photos using pic monkey
How to resize facebook photos using pic monkeyHow to resize facebook photos using pic monkey
How to resize facebook photos using pic monkey
 
COURRIER CAB 31 MD
COURRIER CAB 31 MDCOURRIER CAB 31 MD
COURRIER CAB 31 MD
 
Google glass
Google glassGoogle glass
Google glass
 
VTSP 5.5
VTSP 5.5VTSP 5.5
VTSP 5.5
 
Геомаркетинг Геомаркетинговые исследования
Геомаркетинг Геомаркетинговые исследованияГеомаркетинг Геомаркетинговые исследования
Геомаркетинг Геомаркетинговые исследования
 
Asat book0-fresh blood
Asat book0-fresh bloodAsat book0-fresh blood
Asat book0-fresh blood
 
Less is More
Less is MoreLess is More
Less is More
 
Carta de Oneida Pinto A El Espectador
Carta de Oneida Pinto A El EspectadorCarta de Oneida Pinto A El Espectador
Carta de Oneida Pinto A El Espectador
 
Presentación1
Presentación1Presentación1
Presentación1
 
Calendario escolar
Calendario escolarCalendario escolar
Calendario escolar
 
Aulbrey Meade - Surgical Tech RESUME
Aulbrey Meade - Surgical Tech RESUMEAulbrey Meade - Surgical Tech RESUME
Aulbrey Meade - Surgical Tech RESUME
 
Buruketak 3.1.
Buruketak 3.1.Buruketak 3.1.
Buruketak 3.1.
 
New Barco ClickShare CMS-1
New Barco ClickShare CMS-1New Barco ClickShare CMS-1
New Barco ClickShare CMS-1
 
Making the cut - Roberta Lucca, Bossa
Making the cut - Roberta Lucca, BossaMaking the cut - Roberta Lucca, Bossa
Making the cut - Roberta Lucca, Bossa
 
Sistemas de equações de 1º grau - Como fazer + exercicios
Sistemas de equações de 1º grau - Como fazer + exerciciosSistemas de equações de 1º grau - Como fazer + exercicios
Sistemas de equações de 1º grau - Como fazer + exercicios
 
Гаражи, Чернигов , ул. Пушкина
Гаражи, Чернигов , ул. ПушкинаГаражи, Чернигов , ул. Пушкина
Гаражи, Чернигов , ул. Пушкина
 
Gravitation
GravitationGravitation
Gravitation
 
Mechanics 2
Mechanics 2Mechanics 2
Mechanics 2
 
Megacoderit
MegacoderitMegacoderit
Megacoderit
 

Semelhante a 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe

OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...Open Science Fair
 
Llinked open data training for EU institutions
Llinked open data training for EU institutionsLlinked open data training for EU institutions
Llinked open data training for EU institutionsOpen Data Support
 
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataOpen City Foundation
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareIMC Technologies
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE
 
Industry@RuleML2015 DataGraft
Industry@RuleML2015 DataGraftIndustry@RuleML2015 DataGraft
Industry@RuleML2015 DataGraftRuleML
 
How we can understand the world through open data
How we can understand the world through open dataHow we can understand the world through open data
How we can understand the world through open dataMarie Gustafsson Friberger
 
CARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practiceCARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practiceCARARE
 
OSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data setsOSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data setsOpen Science Fair
 
Data sharing in the Netherlands
Data sharing in the NetherlandsData sharing in the Netherlands
Data sharing in the NetherlandsJisc RDM
 
Can new technologies and digitalization improve infrastructure governance? - ...
Can new technologies and digitalization improve infrastructure governance? - ...Can new technologies and digitalization improve infrastructure governance? - ...
Can new technologies and digitalization improve infrastructure governance? - ...OECD Governance
 
Exposing EO Linked (meta-)Data from OpenSearch Catalogue
Exposing EO Linked (meta-)Data from OpenSearch CatalogueExposing EO Linked (meta-)Data from OpenSearch Catalogue
Exposing EO Linked (meta-)Data from OpenSearch CatalogueRaul Palma
 
Exploration, visualization and querying of linked open data sources
Exploration, visualization and querying of linked open data sourcesExploration, visualization and querying of linked open data sources
Exploration, visualization and querying of linked open data sourcesLaura Po
 
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...OpenAIRE
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation Research Data Alliance
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation Research Data Alliance
 
Data-as-a-Service: DataGraft
Data-as-a-Service: DataGraftData-as-a-Service: DataGraft
Data-as-a-Service: DataGraftdapaasproject
 

Semelhante a 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe (20)

OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
 
Llinked open data training for EU institutions
Llinked open data training for EU institutionsLlinked open data training for EU institutions
Llinked open data training for EU institutions
 
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
 
Webinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management PlanningWebinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management Planning
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
 
Industry@RuleML2015 DataGraft
Industry@RuleML2015 DataGraftIndustry@RuleML2015 DataGraft
Industry@RuleML2015 DataGraft
 
How we can understand the world through open data
How we can understand the world through open dataHow we can understand the world through open data
How we can understand the world through open data
 
DatalEt-Ecosystem Provider - The DEEP project
DatalEt-Ecosystem Provider - The DEEP projectDatalEt-Ecosystem Provider - The DEEP project
DatalEt-Ecosystem Provider - The DEEP project
 
Fair data vs 5 star open data final
Fair data vs 5 star open data finalFair data vs 5 star open data final
Fair data vs 5 star open data final
 
CARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practiceCARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practice
 
OSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data setsOSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data sets
 
Data sharing in the Netherlands
Data sharing in the NetherlandsData sharing in the Netherlands
Data sharing in the Netherlands
 
Can new technologies and digitalization improve infrastructure governance? - ...
Can new technologies and digitalization improve infrastructure governance? - ...Can new technologies and digitalization improve infrastructure governance? - ...
Can new technologies and digitalization improve infrastructure governance? - ...
 
Exposing EO Linked (meta-)Data from OpenSearch Catalogue
Exposing EO Linked (meta-)Data from OpenSearch CatalogueExposing EO Linked (meta-)Data from OpenSearch Catalogue
Exposing EO Linked (meta-)Data from OpenSearch Catalogue
 
Exploration, visualization and querying of linked open data sources
Exploration, visualization and querying of linked open data sourcesExploration, visualization and querying of linked open data sources
Exploration, visualization and querying of linked open data sources
 
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
Data-as-a-Service: DataGraft
Data-as-a-Service: DataGraftData-as-a-Service: DataGraft
Data-as-a-Service: DataGraft
 

Mais de Slim Turki, Dr.

Local Digital Twins Conversations: Framing the Green + Digital Transition
Local Digital Twins Conversations:  Framing the Green + Digital TransitionLocal Digital Twins Conversations:  Framing the Green + Digital Transition
Local Digital Twins Conversations: Framing the Green + Digital TransitionSlim Turki, Dr.
 
Data ecosystems: turning data into public value
Data ecosystems:  turning data into public valueData ecosystems:  turning data into public value
Data ecosystems: turning data into public valueSlim Turki, Dr.
 
#opendata Back to the future
#opendata Back to the future#opendata Back to the future
#opendata Back to the futureSlim Turki, Dr.
 
Data Ecosystems for Geospatial Data
Data Ecosystems for Geospatial DataData Ecosystems for Geospatial Data
Data Ecosystems for Geospatial DataSlim Turki, Dr.
 
Open Data in Disaster Management
Open Data in Disaster ManagementOpen Data in Disaster Management
Open Data in Disaster ManagementSlim Turki, Dr.
 
BE-GOOD: Building an Ecosystem to Generate Opportunities in Open Data
BE-GOOD: Building an Ecosystem to Generate Opportunities in Open DataBE-GOOD: Building an Ecosystem to Generate Opportunities in Open Data
BE-GOOD: Building an Ecosystem to Generate Opportunities in Open DataSlim Turki, Dr.
 
How open data ecosystems are stimulated?
How open data ecosystems are stimulated?How open data ecosystems are stimulated?
How open data ecosystems are stimulated?Slim Turki, Dr.
 
BE-GOOD Challenges - factsheet 2017-06
BE-GOOD Challenges - factsheet 2017-06BE-GOOD Challenges - factsheet 2017-06
BE-GOOD Challenges - factsheet 2017-06Slim Turki, Dr.
 
Service innovation: the hidden value of open data
Service innovation: the hidden value of open dataService innovation: the hidden value of open data
Service innovation: the hidden value of open dataSlim Turki, Dr.
 
From open data to data-driven services
From open data to data-driven servicesFrom open data to data-driven services
From open data to data-driven servicesSlim Turki, Dr.
 
How open data are turned into services?
How open data are turned into services?How open data are turned into services?
How open data are turned into services?Slim Turki, Dr.
 
SPOCS: A semantic interoperability layer to support the implementation of the...
SPOCS: A semantic interoperability layer to support the implementation of the...SPOCS: A semantic interoperability layer to support the implementation of the...
SPOCS: A semantic interoperability layer to support the implementation of the...Slim Turki, Dr.
 
Open Data: Barriers, Risks, and Opportunities
Open Data: Barriers, Risks, and OpportunitiesOpen Data: Barriers, Risks, and Opportunities
Open Data: Barriers, Risks, and OpportunitiesSlim Turki, Dr.
 
Luxembourg Service Jam 2013 - Guide book
Luxembourg Service Jam 2013 - Guide bookLuxembourg Service Jam 2013 - Guide book
Luxembourg Service Jam 2013 - Guide bookSlim Turki, Dr.
 
Luxembourg Service Jam 2012 - Guide book
Luxembourg Service Jam 2012 - Guide bookLuxembourg Service Jam 2012 - Guide book
Luxembourg Service Jam 2012 - Guide bookSlim Turki, Dr.
 
Global Service Jam - Luxembourg spot
Global Service Jam - Luxembourg spotGlobal Service Jam - Luxembourg spot
Global Service Jam - Luxembourg spotSlim Turki, Dr.
 
Compliance In e-government Service Engineering
Compliance In e-government Service EngineeringCompliance In e-government Service Engineering
Compliance In e-government Service EngineeringSlim Turki, Dr.
 

Mais de Slim Turki, Dr. (18)

Local Digital Twins Conversations: Framing the Green + Digital Transition
Local Digital Twins Conversations:  Framing the Green + Digital TransitionLocal Digital Twins Conversations:  Framing the Green + Digital Transition
Local Digital Twins Conversations: Framing the Green + Digital Transition
 
Data ecosystems: turning data into public value
Data ecosystems:  turning data into public valueData ecosystems:  turning data into public value
Data ecosystems: turning data into public value
 
#opendata Back to the future
#opendata Back to the future#opendata Back to the future
#opendata Back to the future
 
Data Ecosystems for Geospatial Data
Data Ecosystems for Geospatial DataData Ecosystems for Geospatial Data
Data Ecosystems for Geospatial Data
 
Open Data in Disaster Management
Open Data in Disaster ManagementOpen Data in Disaster Management
Open Data in Disaster Management
 
BE-GOOD: Building an Ecosystem to Generate Opportunities in Open Data
BE-GOOD: Building an Ecosystem to Generate Opportunities in Open DataBE-GOOD: Building an Ecosystem to Generate Opportunities in Open Data
BE-GOOD: Building an Ecosystem to Generate Opportunities in Open Data
 
How open data ecosystems are stimulated?
How open data ecosystems are stimulated?How open data ecosystems are stimulated?
How open data ecosystems are stimulated?
 
BE-GOOD Challenges - factsheet 2017-06
BE-GOOD Challenges - factsheet 2017-06BE-GOOD Challenges - factsheet 2017-06
BE-GOOD Challenges - factsheet 2017-06
 
Service innovation: the hidden value of open data
Service innovation: the hidden value of open dataService innovation: the hidden value of open data
Service innovation: the hidden value of open data
 
From open data to data-driven services
From open data to data-driven servicesFrom open data to data-driven services
From open data to data-driven services
 
How open data are turned into services?
How open data are turned into services?How open data are turned into services?
How open data are turned into services?
 
SPOCS: A semantic interoperability layer to support the implementation of the...
SPOCS: A semantic interoperability layer to support the implementation of the...SPOCS: A semantic interoperability layer to support the implementation of the...
SPOCS: A semantic interoperability layer to support the implementation of the...
 
Open Data: Barriers, Risks, and Opportunities
Open Data: Barriers, Risks, and OpportunitiesOpen Data: Barriers, Risks, and Opportunities
Open Data: Barriers, Risks, and Opportunities
 
Luxembourg Service Jam 2013 - Guide book
Luxembourg Service Jam 2013 - Guide bookLuxembourg Service Jam 2013 - Guide book
Luxembourg Service Jam 2013 - Guide book
 
Luxembourg Service Jam 2012 - Guide book
Luxembourg Service Jam 2012 - Guide bookLuxembourg Service Jam 2012 - Guide book
Luxembourg Service Jam 2012 - Guide book
 
Global Service Jam - Luxembourg spot
Global Service Jam - Luxembourg spotGlobal Service Jam - Luxembourg spot
Global Service Jam - Luxembourg spot
 
Legora@IESS1.0
Legora@IESS1.0Legora@IESS1.0
Legora@IESS1.0
 
Compliance In e-government Service Engineering
Compliance In e-government Service EngineeringCompliance In e-government Service Engineering
Compliance In e-government Service Engineering
 

Último

Comparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioComparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioChristian Posta
 
9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding TeamAdam Moalla
 
UiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation DevelopersUiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation DevelopersUiPathCommunity
 
OpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability AdventureOpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability AdventureEric D. Schabell
 
Linked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond OntologiesLinked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond OntologiesDavid Newbury
 
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdfIaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdfDaniel Santiago Silva Capera
 
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1DianaGray10
 
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019IES VE
 
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostKubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostMatt Ray
 
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online CollaborationCOMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online Collaborationbruanjhuli
 
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopNIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopBachir Benyammi
 
Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024SkyPlanner
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Will Schroeder
 
UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8DianaGray10
 
AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarPrecisely
 
VoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXVoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXTarek Kalaji
 
20230202 - Introduction to tis-py
20230202 - Introduction to tis-py20230202 - Introduction to tis-py
20230202 - Introduction to tis-pyJamie (Taka) Wang
 

Último (20)

Comparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioComparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and Istio
 
20150722 - AGV
20150722 - AGV20150722 - AGV
20150722 - AGV
 
9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team
 
UiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation DevelopersUiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation Developers
 
OpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability AdventureOpenShift Commons Paris - Choose Your Own Observability Adventure
OpenShift Commons Paris - Choose Your Own Observability Adventure
 
Linked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond OntologiesLinked Data in Production: Moving Beyond Ontologies
Linked Data in Production: Moving Beyond Ontologies
 
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdfIaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
IaC & GitOps in a Nutshell - a FridayInANuthshell Episode.pdf
 
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
 
201610817 - edge part1
201610817 - edge part1201610817 - edge part1
201610817 - edge part1
 
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
 
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostKubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
 
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online CollaborationCOMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
 
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopNIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 Workshop
 
Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024
 
20230104 - machine vision
20230104 - machine vision20230104 - machine vision
20230104 - machine vision
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
 
UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8
 
AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity Webinar
 
VoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBXVoIP Service and Marketing using Odoo and Asterisk PBX
VoIP Service and Marketing using Odoo and Asterisk PBX
 
20230202 - Introduction to tis-py
20230202 - Introduction to tis-py20230202 - Introduction to tis-py
20230202 - Introduction to tis-py
 

1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe

  • 1. 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe Sébastien Martin, Muriel Foulonneau, Slim Turki
  • 2. Context & Objectives • • • • Level of reuse of open data is still disappointing. Development of open data requires a better reusability of data. Degree of openness is a key success factor. Catalogs listing data have a crucial role. Analyse PublicData.eu catalogue (i) identify the quality of a sample of metadata properties, which are critical to enable data reuse (ii) study the stated level of data openness. 21/11/2013 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe 2
  • 3. PublicData.eu • • Many local and national portals to provide access to public sector open datasets - 114 EU catalogues on datacatalogs.org Gather datasets across geographic and institutional boundaries PublicData.eu • • • • • • pan-European catalogue launched under the FP7 LOD2 project. aggregates data from CKAN open data catalogues all over Europe. collects data from 26 sources 1st to be published in Europe in 2011 data beyond the European Union, e.g., Serbian datasets. not exhaustive, it represents a unique aggregation of European datasets. • • 17.027 datasets UK: largest provider 21/11/2013 3
  • 4. Methodology Descriptions of datasets collected in May 2013 236 distinct dataset properties identified, partially due to • • linguistic diversity; some providers adapt property names in their language problems of consistency in naming (upper / lower case, spaces / underscore for a single field). Major challenge to understand the content of the PublicData.eu Data collected and analysed to identify information made available on data openness and reusability in particular the licensing conditions and the data formats. 21/11/2013 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe 4
  • 5. Tim Berners-Lee’s evaluation scale ★ Available on the web (whatever format) but with an open license, to be Open Data ★★ Available as machine-readable structured data ★★★ 2 + non-proprietary format ★★★★ ★★★★★ 21/11/2013 3 + Use open standards from W3C (RDF and SPARQL) to identify things 4 + Link your data to other people’s data to provide context 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe 5
  • 6. ★ Data Licences 13.535 / 17.027 datasets have at least 1 license indication 12.470 datasets can be considered having some form of open license  73,24% 769 datasets have a Creative Commons license Significant number of datasets have a national license: • • • apie v2 to publish information created by French public authorities UK-crown which “covers material created by civil servants, ministers and government departments and agencies” in the UK, UK Open Government License 128 datasets with an explicitly closed license 21/11/2013 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe 6
  • 7. ★★ Machine readable format • Facilitates data reusability • 4.051 / 17.027 with content_TYPE • 11.285 with at least one indication about format • 56 datasets in RDF • Dominant proportion of spreadsheets type’s formats Distribution of formats 40% not a machine readable format 34% of datasets available in a machine readable format  machine readability cond. for openness levels of 2★ and > 21/11/2013 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe 7
  • 8. ★★★ Use of non-proprietary formats Creates ambiguities as the openness nature of formats can be debated in some cases: • • Certain formats are proprietary but their specifications are open. Some formats have been open at a certain point of time but additions and further evolutions remain proprietary In many cases, value of property was too vague to determine whether the format was or not proprietary. It was possible to identify: • • For 49% of the datasets, a non-proprietary format For 21% a proprietary format. Use of proprietary formats is a critical issue for improving the level of openness of datasets. 21/11/2013 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe 8
  • 9. ★★★★ Use of open standards from W3C Including HTML, XML, and RDF in particular. • XML-based formats may be entirely independent from W3C (e.g. KML) Availability in W3C standards: 9,5% of datasets Availability in XML based formats: 10% Information remains unknown in most cases 21/11/2013 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe 9
  • 10. ★★★★★ Linked data Linked data are only mentioned in the description of a single dataset (Brandweer Amsterdam-Amstelland Uitrukberichten) for which the format is described as “linked data api, rdf json”. 58 datasets mention RDF (or RDFa) as a format or content type, i.e., 0,34%. 21/11/2013 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe 10
  • 11. Level of openness (1/2) 6.891 / 17.027 datasets show at least one information about their degree of openness. All come from Data.gov.uk (8 689 datasets) For a majority of datasets, the level of openness is unknown. • 21/11/2013 Coherent with lack of licensing information without which it is impossible to conclude on even ★ openness level. 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe Distribution of openness levels in UK datasets 11
  • 12. Level of openness (2/2) Approximate level of openness derived from licensing and format properties • • 73,24% of the datasets should have ★ or above. Reference to 5★ should take into consideration linkages, cannot be inferred from dataset metadata. Level of openness according to Format and License related properties Data openness mainly related to 1st level of compliance: licensing issue. • 21/11/2013 Data providers have clearly not focused on publication of data in reusable formats. 1-5 stars: Metadata on the Openness Level of 12 Open Data Sets in Europe
  • 13. Conclusion • Limited openness of datasets advertised as open data • Heterogeneity of associated metadata  Difficulty for reusers to (i) discover datasets, despite the creation of large catalogues of datasets, and to (ii) effectively reuse machine readable and contextualized data. ★ may be sufficient to ensure transparency of gov. action, facilitating reuse of data through services is not served below 2★ Confirmed risks regarding major challenges that data providers have to face: (i) language barrier and (ii) lack of consistency of metadata. Harmonization of practices, training and tools necessary to ensure that datasets are available in relevant formats. 21/11/2013 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe 13
  • 14. 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe Sébastien Martin, Muriel Foulonneau, Slim Turki Contact: muriel.foulonneau@tudor.lu

Notas do Editor

  1. The study uses the Tim Berners-Lee’s five star evaluation scale.
  2. The one star openness level depends upon data licenses. Licensing information can be found in 10 distinct metadata properties, i.e., licence, License, licence_url, License_details, License_ID, License_summary, License_title, License_uri, License_url, and mandate.
  3. The two star level depends upon the format in which the data is made available.