Soil Research Data Policies, Data availability and Access, and the Interoperability challenge for CREA Soil Open Data, Italy
1. RDA Firenze Italy 14-15 November 2016
Research Data Alliance. FAIR data management: best practices and open issues. G. L’Abate
Giovanni L’Abate
In this talk
1. CREA Soil Research Data
   1. Data Type & Formats
   2. Standards
   3. Capture Methods
   4. Privacy Statement
   5. Ethics and Intellectual Property
   6. Access
   7. Data Sharing and Reuse
   8. The SISI webGIS application
   9. Short-Term Storage and Data Management
   10. Deposit and Long-Term Preservation
   11. Resourcing
2. The Interoperability challenge
   1. OGC - Soil Interoperability Experiment
   2. Catalogue services to search for data
   3. Vocabularies used in those catalogue services
Soil Data Type & Formats
Soil Observations (Observations & Measurements)
  Spatial Feature (GIS point data), e.g. Soil Profiles, Soil Samples, Soil Sample derivatives
Soil Maps
  Spatial Feature (GIS polygonal/gridded data), e.g. Soil Body, Soil Parameter
  Printed Maps (PDF, JPG, GIF, TIF, PNG)
  WebGIS tools (WMS, WFS, WCS)
Formats of Spatial Feature
  Download at http://soilmaps.entecra.it: SHP, KML, MDB
  Web Feature Service (WFS): CSV, SHP, GML, GeoJSON
  Web Map Service (WMS): AtomPub, GIF, GeoRSS, JPEG, KML, OpenLayers, PNG, SVG, TIFF
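As an illustration of how a client might ask a WFS for one of the formats listed above, the following minimal sketch builds a WFS 1.1.0 GetFeature request URL. The endpoint `http://example.org/geoserver/wfs` and the layer name `sisi:soil_profiles` are hypothetical placeholders, not the actual CREA service, and the `application/json` format name is an assumption based on how GeoServer usually labels GeoJSON output.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def wfs_getfeature_url(base_url, type_name,
                       output_format="application/json", max_features=10):
    """Build a WFS 1.1.0 GetFeature request URL from key-value parameters."""
    params = {
        "service": "WFS",
        "version": "1.1.0",
        "request": "GetFeature",
        "typeName": type_name,          # hypothetical layer name
        "outputFormat": output_format,  # assumed GeoServer-style format name
        "maxFeatures": max_features,    # keep the response small while testing
    }
    return f"{base_url}?{urlencode(params)}"


# Hypothetical endpoint and layer, for illustration only.
url = wfs_getfeature_url("http://example.org/geoserver/wfs", "sisi:soil_profiles")
query = parse_qs(urlsplit(url).query)
print(url)
```

Changing `output_format` is how the same layer would be requested in another encoding; the exact format names accepted depend on the server and should be checked in its GetCapabilities response.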
Standards
ISO SoilML, WFS, WMS, and WCS.
Observations and Measurements - XML Implementation
Implementation Standard for Geographic information - Simple feature access
World Reference Base, 2nd edition (2006).
ftp://ftp.fao.org/agl/agll/docs/wsrr103e.pdf
USDA Soil Taxonomy, 10th edition (2006).
www.nrcs.usda.gov/Internet/FSE_DOCUMENTS/nrcs142p2_052172.pdf
The European directive on “Infrastructure for Spatial Information in the European Community (INSPIRE)” has brought together most of the existing national European standards and ISO standards into a well-defined model.
Capture Methods
Privacy Statement
The Data Protection Directive (Directive 95/46/EC on the
protection of individuals with regard to the processing of
personal data and on the free movement of such data) is a
European Union directive adopted in 1995 which regulates the
processing of personal data within the European Union. It is an
important component of EU privacy and human rights law.
Personal data are defined as “any information concerning a
natural person, legal person, entity or association, identified or
identifiable even indirectly, including a personal identification
number”.
Under the EU Directive on the freedom of access to information on
the environment (90/313/EEC), environmental data are subject to a
special legal regime (guarantees of access and dissemination),
distinct from that of privacy, because of their public interest for
the entire community.
Ethics and Intellectual Property
A specific European database right law, the Database Directive
(96/9/EC), protects the producer of a database who has invested the
necessary effort to constitute it. This means that database rights
in the EU are created automatically, are vested in the employers of
creators, and do not have to be registered to have effect.
How to cite data sources:
Creator (PublicationYear): Title. Version. Publisher. ResourceType. Approver (Year). Identifier [1]
Example: Edoardo A.C. Costantini, Giovanni L'Abate, Giovanni Allegri (2012): Soil Information System of Italy (SISI). Version 1.0. Consiglio per la ricerca in agricoltura e l'analisi dell'economia agraria (CREA). WebGIS. 2015. http://aginfra-sg.ct.infn.it/sisi
[1] https://www.popdata.bc.ca/publishing_research_materials/citingsources
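The citation pattern above can be filled in programmatically. The sketch below is only one possible way to do it: the class name `DataCitation` and its field names are illustrative choices, not part of any standard, and the `approved` field simply holds whatever goes in the "Approver (Year)" slot of the pattern.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DataCitation:
    """Fields of the citation pattern (names are illustrative assumptions)."""
    creators: List[str]
    publication_year: int
    title: str
    version: str
    publisher: str
    resource_type: str
    approved: str      # the "Approver (Year)" slot of the pattern
    identifier: str

    def format(self) -> str:
        """Render: Creator (Year): Title. Version. Publisher. Type. Approved. Identifier"""
        return (
            f"{', '.join(self.creators)} ({self.publication_year}): "
            f"{self.title}. Version {self.version}. {self.publisher}. "
            f"{self.resource_type}. {self.approved}. {self.identifier}"
        )


# Values taken from the SISI example in the slide.
sisi = DataCitation(
    creators=["Edoardo A.C. Costantini", "Giovanni L'Abate", "Giovanni Allegri"],
    publication_year=2012,
    title="Soil Information System of Italy (SISI)",
    version="1.0",
    publisher=("Consiglio per la ricerca in agricoltura e "
               "l'analisi dell'economia agraria (CREA)"),
    resource_type="WebGIS",
    approved="2015",
    identifier="http://aginfra-sg.ct.infn.it/sisi",
)
citation = sisi.format()
print(citation)
```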
Data Access
Soil Samples, Soil Profiles, Soil Maps
http://www.soilmaps.it/ita/cartadeisuoli3.html
http://soilmaps.entecra.it/webgis/sisi/map.html
http://soilmaps.entecra.it/webgis/geosample/map.html
Data Sharing and Reuse
According to the CREA “Strategia per la valorizzazione del patrimonio informativo del CRA” (Strategy for enhancing the information assets of CRA, 2013):
Italian Open Data License (IODL v2.0) and the principle of Open Data: freely available to all, free of copyright, except for the obligation to cite the source.
Compliance with:
  Open Data Commons - Open Database License (ODbL)
  Creative Commons License version 3.0 (CC-BY)
How to cite data license:
This {DATABASE-NAME} is made available under the [LicenseName] [LicenseLogo] [LicenseURL] [2]
Example: The SISI WebGIS (2015) is made available under the Italian Open Data License (IODL v2.0) [LicenseLogo] http://www.dati.gov.it/iodl/2.0/
[2] http://www.dati.gov.it/content/italian-open-data-license-domande-e-risposte
The SISI webGIS application
Soil Bodies
1) Soil regions (1:5,000,000): 34 regions described by 20 attributes
2) Soil systems (1:500,000): 3,358 polygons described by 40 attributes and related to Derived soil profiles by 9,727 relations
Soil Profiles
3) Derived soil profiles (1,412), composed of 4,263 Derived Profile Elements described by statistics (mean value, standard deviation, frequency) of 16 soil parameters, for 82,791 measured or modeled data
4) Observed soil profiles (1,412), composed of 4,284 analyzed Soil Profile Elements described by 46 soil parameters, for 81,366 measured or modeled data; 1,747 photos of 834 Observed soil profiles
https://aginfra-sg.ct.infn.it/sisi
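The derived profile elements above carry a mean value, a standard deviation and a frequency for each soil parameter. A minimal sketch of how such a summary could be computed from measured values is shown below. It assumes that "frequency" means the number of contributing values and that the sample standard deviation is intended; neither is stated in the slide, and the class and function names are hypothetical.

```python
from dataclasses import dataclass
from statistics import mean, stdev
from typing import Sequence


@dataclass(frozen=True)
class ParameterStatistic:
    """Summary of one soil parameter for one derived profile element."""
    parameter: str
    mean_value: float
    standard_deviation: float
    frequency: int  # assumed here to be the number of contributing values


def summarize(parameter: str, values: Sequence[float]) -> ParameterStatistic:
    """Summarize measured or modeled values of a single soil parameter."""
    if not values:
        raise ValueError("at least one value is required")
    # The sample standard deviation is undefined for a single value; use 0.0.
    sd = stdev(values) if len(values) > 1 else 0.0
    return ParameterStatistic(parameter, mean(values), sd, len(values))


# Made-up clay contents (%) for illustration, not SISI data.
clay = summarize("clay", [20.0, 24.0, 28.0])
print(clay)
```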
Short-Term Storage and Data Management
On local server: ISIS1.4.mdb (MS Access) database
Online database: PostgreSQL 8.2
Web server: GeoServer 2.1-beta3
Deposit and Long-Term Preservation
Select data of long-term value: Data sharing and
preservation may not be applicable in every case. The
SISI WebGIS contains selected maps and observations out
of the ISIS 1.4 national database (55,000 observations).
The selection is based on representativeness at national
scale.
Safeguard the data behind the graph: the data which
underpin publications should be extracted, captured in
machine-readable form and deposited somewhere so
they remain accessible.
Assure that your data will remain accessible: focus on
making a convincing case that your data will remain
accessible.
Resourcing
Outline and justify costs: If you need to purchase storage,
outsource services such as back-up and preservation, or plan
to pay for data management support, these costs should be
outlined and justified in your proposal.
Don’t underestimate the human effort required: Creating
documentation and making your data understandable to
others is very time-consuming, so be realistic about how much
effort is needed to prepare your data for sharing and
preservation.
Show efficient use of public funds: it is appropriate to use
public funds to support the management and sharing of
publicly-funded research data, but this is expected to be
efficient and cost-effective.
The Interoperability challenge
Semantic interoperability facilitates the building of data services that reuse and combine data from different sources.
GIS servers commonly use the same protocols (WMS, WFS, WCS), but databases are still highly customized.
OGC - Soil Interoperability Experiment
Big or small, data need to be explicitly defined for others to reuse them, and linked to other pieces of data to be placed in a data ecosystem.
Standard vocabularies boost soil data interoperability and sharing.
Local data managers would get references to describe, publish and share their data.
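One way to picture the role of a standard vocabulary is as a lookup from locally chosen column names to shared concept identifiers. The sketch below is purely illustrative: the local field names and the `http://example.org/soil-vocab/...` URIs are hypothetical, and a real mapping would point to terms published in an actual vocabulary service.

```python
from typing import Dict, List, Mapping, Tuple

# Hypothetical mapping from local database columns to shared concept URIs.
FIELD_TO_CONCEPT: Dict[str, str] = {
    "ARGILLA": "http://example.org/soil-vocab/clayContent",
    "SABBIA": "http://example.org/soil-vocab/sandContent",
    "PH_H2O": "http://example.org/soil-vocab/pHWater",
}


def annotate(record: Mapping[str, object],
             mapping: Mapping[str, str]) -> Tuple[Dict[str, object], List[str]]:
    """Re-key a local record by shared concept URIs.

    Returns the annotated record and the local fields that have no mapping
    yet, so a data manager can see what still needs to be described.
    """
    annotated: Dict[str, object] = {}
    unmapped: List[str] = []
    for field, value in record.items():
        concept = mapping.get(field)
        if concept is None:
            unmapped.append(field)
        else:
            annotated[concept] = value
    return annotated, unmapped


# Made-up local record for illustration.
annotated, unmapped = annotate({"ARGILLA": 24.0, "NOTE": "field survey"},
                               FIELD_TO_CONCEPT)
print(annotated, unmapped)
```

Two databases with different column names become comparable once both are re-keyed to the same concept identifiers, which is the practical meaning of semantic interoperability in this context.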
Catalogue services to search for data
INSPIRE geoportal
http://inspire-geoportal.ec.europa.eu/
CIARD RING directory
http://ring.ciard.info/
CSIRO Data Access Portal
https://data.csiro.au/dap
FAO AGRIS
agris.fao.org
Vocabularies used in those catalogue services
INSPIRE registry
http://inspire.ec.europa.eu/registry
agINFRA vocabularies
http://vocabularies.aginfra.eu
SISS Voc
http://www.sissvoc.info/
Agrovoc
http://aims.fao.org/vest-registry/vocabularies/agrovoc-multilingual-agricultural-thesaurus
The agINFRA soil vocabulary
http://vocabularies.aginfra.eu/soil.html
Our Vision
Applications using CREA Open Data? Giving timely responses to
users with data produced in CREA is currently just a dream…
On the other hand, some work has been done towards soil data
interoperability. Standards are there, and web services too, along
with many worldwide initiatives: Global Soil Partnership, Global soil
map of the world, Soil Interoperability Experiment (Soil IE)…
Let’s work together to find out how to share our data!
Thanks for your attention!
http://abp.entecra.it/
http://soilmaps.entecra.it/
Acknowledgments to EU 7FP agINFRA project