SlideShare uma empresa Scribd logo
1 de 27
Case study 2:
(Plazi) Treatment Repository
Donat Agosti & Willi Egloff (Plazi, Bern)
March 27, 2014
Dublin, RDA Third Plenary Meeting,
RDA/CODATA Legal Interoperability IG
Overview
Who are we?
The issue
The Plazi workflow
The legal aspects
Synopsis
Extensive decentralized biodiversity infrastructure
Plants
3,400 Herbaria worldwide
10,000 Associate curators and specialists
350,000,000 specimens in collections
180,000,000 specimens digitized
2,000,000,000 specimens including animals
Source: gbif.org; http://sciweb.nybg.org/science2/IndexHerbariorum.asp
200,000,000+ printed pages
1,900,000 species described
20,000,000+ species treatments
17,000 new species per year
Biodiversity libraries
BUT: The data are hidden
Incomplete digitization
Publications are
unstructured
Collections are incomplete
Data are not linked
Most data are not open
Names as information tags in life sciences
Names
Characteristics
Publications
GenesCollections
Specimens
Distribution
A global reference system for spatial data
60 48'9.75"N
50 50'1.23"E
A global reference system for species related data
(http://www.yourwildlife.org/wp-content/uploads/2013/02/Common-ant-collage.jpg)
2D78C98D-
0B15-4362-
8DD8-
185983C468FE
A global reference system for species related data
Spatial data Taxonomic data
Entity Location Species
Entity name Location name Scientific Name
Reference Geo-Coordinate UUID
Reference System Coordinate System Hierarchical System
Reference Data Global Map / Global
Satellite coverage
Global Names
Archictecture
Needed:
Global Names Architecture
http://globalnames.org
(Reference system for all names)
SEE also: RDA Biodiversity Data Integration IG;
RDA Data publishing IG
A global reference system for species related data
Formica obsoleta Linnaeus 1758, 580
zoobank.org:act:2D78C98D-0B15-4362-8DD8-185983C468FE
Taxonomic name usage defined by a treatment
A global reference system for species related data
A global reference system for species related data
Treatment: sections of publications
documenting the features or distribution
of a related group of organisms (called a
“taxon”, plural “taxa”) in ways adhering
to highly formalized conventions.
(Catapano, 2010)
Formica obsoleta, Linnaeus 1758: 580
Formica obsoleta Linnaeus 1758, 580
zoobank.org:act:2D78C98D-0B15-4362-8DD8-185983C468FE
Taxonomic name usage defined by a treatment
treatment.plazi.org/id/2D78C98D-0B15-4362-8DD8-185983
C468FE
A global reference system for species related data
Text
<tax:treatment>
<tax:nomenclature>
<tax:name>
<tax:xid source="HNS" identifier="193329"/>
<tax:xmldata>
<dc:Genus>Mystrium</dc:Genus>
<dc:Species>leonie</dc:Species>
</tax:xmldata>
Mystrium leonie
</tax:name> Bohn & Verhaagh
<tax:status>n. sp.</tax:status>
Fig 1 D - F
</tax:nomenclature>
<tax:div type="description">
<tax:p>HOLOTYPE WORKER: TL 3.95, HL 1.02, HW 0.
1.30, SI 137, PW 0.73, ML 0.38. Mandible oute
to a sharp apical tooth, the apex parallel to
(Holotype with material in mandibles, so mand
$ described below from paratypes.) Median cly
....
</treatment>
Enhanced and linked text
Formalization of taxonomic publications
Links
Conversionn
The way forward or prospective publishing
Fresh of press: fully automated
distribution of data from publications
From discovery to publcation in three weeks …
What does this mean?
Linked Open Data Cloud
http://www.w3.org/wiki/SweoIG/TaskForces/CommunityProjects/LinkingOpenData
Plazi workflow: overview
1 Million Treatment Goal
to complement Global Names Architecture
name usages with the respective treatments
Semantic enhanced linked publishing
Funding
The real issue
Access to ant taxonomic publications through antbase.org /Smithsonian Institution, including
currently the entire body of non-copyrighted publications since 1758 (>4,000 publications or
The real issue: copyright
Restrictions to information exchange:
- National security / data protection (n/a)
- Copyright (only "works")
- Database protection (only private commercial databases)
- Data use agreements
Copyright issues
Obstacles to Plazi workflow:
- Scanning / reproduction of works
- Scanning / reproduction of databases
- Making available of works
Copyright issues
Legal base for actual workflow
- Legal license for internal use in organizations /
institutions (Art. 19 CH-Copyright Act)
- No database protection in CH
- Legal license overrules data use agreements
Copyright issues
Making available:
- Only non copyrighted data (names, treatments,
references, ... See http://plazi.org/?q=blue_list)
- Works (original publications) restricted to internal use
Copyright issues
Removing further hurdles to information exchange:
- Suggest mandatory legal licenses for research purposes at
EU-level
- Explore application of extended collective licenses
(Scandinavian countries)
- Introduce extended collective licenses into CH-copyright
law
Copyright issues
For further reading:
http://plazi.org/?q=plazi_publications
http://plazi.org
Thank you very much!
Donat Agosti & Willi Egloff
agosti@plazi.org

Mais conteúdo relacionado

Mais procurados

From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...
Catherine Canevet
 
Data editors meeting at SEFS
Data editors meeting at SEFSData editors meeting at SEFS
Data editors meeting at SEFS
Aaike De Wever
 

Mais procurados (20)

Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
 
Open scholarship [a FOSTER open science talk]
Open scholarship [a FOSTER open science talk]Open scholarship [a FOSTER open science talk]
Open scholarship [a FOSTER open science talk]
 
Science Seminar Series 4 Norman Johnson
Science Seminar Series 4 Norman JohnsonScience Seminar Series 4 Norman Johnson
Science Seminar Series 4 Norman Johnson
 
VIVO Mini-Grant: Integrating the UMLS Ontology into VIVO for Linking Biomedic...
VIVO Mini-Grant: Integrating the UMLS Ontology into VIVO for Linking Biomedic...VIVO Mini-Grant: Integrating the UMLS Ontology into VIVO for Linking Biomedic...
VIVO Mini-Grant: Integrating the UMLS Ontology into VIVO for Linking Biomedic...
 
Workshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data miningWorkshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data mining
 
Met soc15 roccaserra-biocrates-datasharing
Met soc15 roccaserra-biocrates-datasharingMet soc15 roccaserra-biocrates-datasharing
Met soc15 roccaserra-biocrates-datasharing
 
Berlin 6 Open Access Conference: Theodore Papazoglou
Berlin 6 Open Access Conference: Theodore PapazoglouBerlin 6 Open Access Conference: Theodore Papazoglou
Berlin 6 Open Access Conference: Theodore Papazoglou
 
OpenAIRE at Workshop on CRIS and OAR, May 2010
OpenAIRE at Workshop on CRIS and OAR, May 2010OpenAIRE at Workshop on CRIS and OAR, May 2010
OpenAIRE at Workshop on CRIS and OAR, May 2010
 
Use and integration of controlled vocabularies (AGROVOC) in DSpace Repositories
Use and integration of controlled vocabularies (AGROVOC) in DSpace RepositoriesUse and integration of controlled vocabularies (AGROVOC) in DSpace Repositories
Use and integration of controlled vocabularies (AGROVOC) in DSpace Repositories
 
From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...
 
PhyloTastic: names-based phyloinformatic data integration
PhyloTastic: names-based phyloinformatic data integrationPhyloTastic: names-based phyloinformatic data integration
PhyloTastic: names-based phyloinformatic data integration
 
Ondex: Data integration and visualisation
Ondex: Data integration and visualisationOndex: Data integration and visualisation
Ondex: Data integration and visualisation
 
Data editors meeting at SEFS
Data editors meeting at SEFSData editors meeting at SEFS
Data editors meeting at SEFS
 
Text Mining from Three Perspectives - Publisher
Text Mining from Three Perspectives - PublisherText Mining from Three Perspectives - Publisher
Text Mining from Three Perspectives - Publisher
 
Building a Model Organism Metabolome Database
Building a  Model Organism Metabolome DatabaseBuilding a  Model Organism Metabolome Database
Building a Model Organism Metabolome Database
 
Bioschemas at bio hackathon 2017
Bioschemas at bio hackathon 2017Bioschemas at bio hackathon 2017
Bioschemas at bio hackathon 2017
 
High throughput mining of the scholarly literature
High throughput mining of the scholarly literatureHigh throughput mining of the scholarly literature
High throughput mining of the scholarly literature
 
Developing an Efficient Infrastruture, Standards and Data-Flow for Metabolomics
Developing an Efficient Infrastruture, Standards and Data-Flow for MetabolomicsDeveloping an Efficient Infrastruture, Standards and Data-Flow for Metabolomics
Developing an Efficient Infrastruture, Standards and Data-Flow for Metabolomics
 
Amanuens.is HUmans and machines annotating scholarly literature
Amanuens.is HUmans and machines annotating scholarly literatureAmanuens.is HUmans and machines annotating scholarly literature
Amanuens.is HUmans and machines annotating scholarly literature
 
Literature-data integration in the life sciences – Jo McEntyre, EMBL-EBI
Literature-data integration in the life sciences – Jo McEntyre, EMBL-EBILiterature-data integration in the life sciences – Jo McEntyre, EMBL-EBI
Literature-data integration in the life sciences – Jo McEntyre, EMBL-EBI
 

Semelhante a 20140327 rda plazi_final

Setting the Scene for ViBRANT – Strategy, Philosophy and Communication
Setting the Scene for ViBRANT – Strategy, Philosophy and CommunicationSetting the Scene for ViBRANT – Strategy, Philosophy and Communication
Setting the Scene for ViBRANT – Strategy, Philosophy and Communication
vbrant
 
Donat Agosti - Copyright, Biopiracy and the Taxonomic Impediment
Donat Agosti - Copyright, Biopiracy and the Taxonomic Impediment Donat Agosti - Copyright, Biopiracy and the Taxonomic Impediment
Donat Agosti - Copyright, Biopiracy and the Taxonomic Impediment
ICZN
 
Special Libraries Associatin
Special Libraries AssociatinSpecial Libraries Associatin
Special Libraries Associatin
drielinger
 

Semelhante a 20140327 rda plazi_final (20)

20140317 pi b_nmbe_journal_club
20140317 pi b_nmbe_journal_club20140317 pi b_nmbe_journal_club
20140317 pi b_nmbe_journal_club
 
Setting the Scene for ViBRANT – Strategy, Philosophy and Communication
Setting the Scene for ViBRANT – Strategy, Philosophy and CommunicationSetting the Scene for ViBRANT – Strategy, Philosophy and Communication
Setting the Scene for ViBRANT – Strategy, Philosophy and Communication
 
20110122 vibrant final
20110122 vibrant final20110122 vibrant final
20110122 vibrant final
 
Donat Agosti - Copyright, Biopiracy and the Taxonomic Impediment
Donat Agosti - Copyright, Biopiracy and the Taxonomic Impediment Donat Agosti - Copyright, Biopiracy and the Taxonomic Impediment
Donat Agosti - Copyright, Biopiracy and the Taxonomic Impediment
 
20140623 swets agosti_final
20140623 swets agosti_final20140623 swets agosti_final
20140623 swets agosti_final
 
Biodiversity Heritage Library : Development and Partnerhips
Biodiversity Heritage Library : Development and PartnerhipsBiodiversity Heritage Library : Development and Partnerhips
Biodiversity Heritage Library : Development and Partnerhips
 
Nothing in taxonomy makes sense except in the light of Open Access
Nothing in taxonomy makes sense except in the light of Open Access Nothing in taxonomy makes sense except in the light of Open Access
Nothing in taxonomy makes sense except in the light of Open Access
 
Mla May 7
Mla May 7Mla May 7
Mla May 7
 
An International Cooperative Digital Library for Taxonomic Literature: The Bi...
An International Cooperative Digital Library for Taxonomic Literature: The Bi...An International Cooperative Digital Library for Taxonomic Literature: The Bi...
An International Cooperative Digital Library for Taxonomic Literature: The Bi...
 
Special Libraries Associatin
Special Libraries AssociatinSpecial Libraries Associatin
Special Libraries Associatin
 
Agosti 20140813 icd8_agosti_global_dipterology-2
Agosti 20140813 icd8_agosti_global_dipterology-2Agosti 20140813 icd8_agosti_global_dipterology-2
Agosti 20140813 icd8_agosti_global_dipterology-2
 
ContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UK
 
Biodiversity Heritage Library: A Conversation About A Collaborative Digitizin...
Biodiversity Heritage Library: A Conversation About A Collaborative Digitizin...Biodiversity Heritage Library: A Conversation About A Collaborative Digitizin...
Biodiversity Heritage Library: A Conversation About A Collaborative Digitizin...
 
ContentMine: Mining the Scientific Literature
ContentMine: Mining the Scientific LiteratureContentMine: Mining the Scientific Literature
ContentMine: Mining the Scientific Literature
 
A Step Towards (From) Read to Write Access to Taxonomic Publications
A Step Towards  (From) Read to Write Access to Taxonomic PublicationsA Step Towards  (From) Read to Write Access to Taxonomic Publications
A Step Towards (From) Read to Write Access to Taxonomic Publications
 
Ifla Bhl080208cr
Ifla Bhl080208crIfla Bhl080208cr
Ifla Bhl080208cr
 
Eol fellow-march2010
Eol fellow-march2010Eol fellow-march2010
Eol fellow-march2010
 
Can machines understand the scientific literature
Can machines understand the scientific literatureCan machines understand the scientific literature
Can machines understand the scientific literature
 
Biodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary ChallengeBiodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary Challenge
 
The Encyclopedia of Life, Biodiversity Heritage Library, Biodiversity Informa...
The Encyclopedia of Life, Biodiversity Heritage Library, Biodiversity Informa...The Encyclopedia of Life, Biodiversity Heritage Library, Biodiversity Informa...
The Encyclopedia of Life, Biodiversity Heritage Library, Biodiversity Informa...
 

Mais de agosti

Mais de agosti (17)

DOI and the Mitteilungen: communicating scientific results in the future
DOI and the Mitteilungen: communicating scientific results in the futureDOI and the Mitteilungen: communicating scientific results in the future
DOI and the Mitteilungen: communicating scientific results in the future
 
Data Sharing Principles and Legal Interoperability for Essential Biodiversity...
Data Sharing Principles and Legal Interoperability for Essential Biodiversity...Data Sharing Principles and Legal Interoperability for Essential Biodiversity...
Data Sharing Principles and Legal Interoperability for Essential Biodiversity...
 
BioDIP - a proposed infrastructure to link the taxonomic to the genomic and o...
BioDIP - a proposed infrastructure to link the taxonomic to the genomic and o...BioDIP - a proposed infrastructure to link the taxonomic to the genomic and o...
BioDIP - a proposed infrastructure to link the taxonomic to the genomic and o...
 
Revolutionizing the Research on Ants through new Methods and Technologies: th...
Revolutionizing the Research on Ants through new Methods and Technologies: th...Revolutionizing the Research on Ants through new Methods and Technologies: th...
Revolutionizing the Research on Ants through new Methods and Technologies: th...
 
Open Research Data: Taxonomy
Open Research Data: TaxonomyOpen Research Data: Taxonomy
Open Research Data: Taxonomy
 
20150701 opendata bern_agosti_2
20150701 opendata bern_agosti_220150701 opendata bern_agosti_2
20150701 opendata bern_agosti_2
 
Plazi or the challenge to free biodiversity data caught in hundreds of millio...
Plazi or the challenge to free biodiversity data caught in hundreds of millio...Plazi or the challenge to free biodiversity data caught in hundreds of millio...
Plazi or the challenge to free biodiversity data caught in hundreds of millio...
 
20141027 bouchout declaration
20141027 bouchout declaration20141027 bouchout declaration
20141027 bouchout declaration
 
20140924 rda _bouchout
20140924 rda _bouchout20140924 rda _bouchout
20140924 rda _bouchout
 
20140922 rda codata_legal_ig_plazi_final
20140922 rda codata_legal_ig_plazi_final20140922 rda codata_legal_ig_plazi_final
20140922 rda codata_legal_ig_plazi_final
 
2 donat agosti-1
2 donat agosti-12 donat agosti-1
2 donat agosti-1
 
Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...
Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...
Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...
 
Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...
Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...
Bouchout Declaration on Open Biodiversity Knowledge Management, Montpellier J...
 
20140523 swiss curators_bouchout_2
20140523 swiss curators_bouchout_220140523 swiss curators_bouchout_2
20140523 swiss curators_bouchout_2
 
20110725 ibc xml
20110725 ibc xml20110725 ibc xml
20110725 ibc xml
 
20110222 behesty monitoring and measuring biodiversity
20110222 behesty monitoring and measuring biodiversity20110222 behesty monitoring and measuring biodiversity
20110222 behesty monitoring and measuring biodiversity
 
20090921 Art Databanken Agosti Final
20090921 Art Databanken Agosti Final20090921 Art Databanken Agosti Final
20090921 Art Databanken Agosti Final
 

20140327 rda plazi_final

  • 1. Case study 2: (Plazi) Treatment Repository Donat Agosti & Willi Egloff (Plazi, Bern) March 27, 2014 Dublin, RDA Third Plenary Meeting, RDA/CODATA Legal Interoperability IG
  • 2. Overview Who are we? The issue The Plazi workflow The legal aspects Synopsis
  • 3. Extensive decentralized biodiversity infrastructure Plants 3,400 Herbaria worldwide 10,000 Associate curators and specialists 350,000,000 specimens in collections 180,000,000 specimens digitized 2,000,000,000 specimens including animals Source: gbif.org; http://sciweb.nybg.org/science2/IndexHerbariorum.asp
  • 4. 200,000,000+ printed pages 1,900,000 species described 20,000,000+ species treatments 17,000 new species per year Biodiversity libraries BUT: The data are hidden Incomplete digitization Publications are unstructured Collections are incomplete Data are not linked Most data are not open
  • 5. Names as information tags in life sciences Names Characteristics Publications GenesCollections Specimens Distribution
  • 6. A global reference system for spatial data 60 48'9.75"N 50 50'1.23"E
  • 7. A global reference system for species related data (http://www.yourwildlife.org/wp-content/uploads/2013/02/Common-ant-collage.jpg) 2D78C98D- 0B15-4362- 8DD8- 185983C468FE
  • 8. A global reference system for species related data Spatial data Taxonomic data Entity Location Species Entity name Location name Scientific Name Reference Geo-Coordinate UUID Reference System Coordinate System Hierarchical System Reference Data Global Map / Global Satellite coverage Global Names Archictecture
  • 9. Needed: Global Names Architecture http://globalnames.org (Reference system for all names) SEE also: RDA Biodiversity Data Integration IG; RDA Data publishing IG A global reference system for species related data
  • 10. Formica obsoleta Linnaeus 1758, 580 zoobank.org:act:2D78C98D-0B15-4362-8DD8-185983C468FE Taxonomic name usage defined by a treatment A global reference system for species related data
  • 11. A global reference system for species related data Treatment: sections of publications documenting the features or distribution of a related group of organisms (called a “taxon”, plural “taxa”) in ways adhering to highly formalized conventions. (Catapano, 2010) Formica obsoleta, Linnaeus 1758: 580
  • 12. Formica obsoleta Linnaeus 1758, 580 zoobank.org:act:2D78C98D-0B15-4362-8DD8-185983C468FE Taxonomic name usage defined by a treatment treatment.plazi.org/id/2D78C98D-0B15-4362-8DD8-185983 C468FE A global reference system for species related data
  • 13. Text <tax:treatment> <tax:nomenclature> <tax:name> <tax:xid source="HNS" identifier="193329"/> <tax:xmldata> <dc:Genus>Mystrium</dc:Genus> <dc:Species>leonie</dc:Species> </tax:xmldata> Mystrium leonie </tax:name> Bohn & Verhaagh <tax:status>n. sp.</tax:status> Fig 1 D - F </tax:nomenclature> <tax:div type="description"> <tax:p>HOLOTYPE WORKER: TL 3.95, HL 1.02, HW 0. 1.30, SI 137, PW 0.73, ML 0.38. Mandible oute to a sharp apical tooth, the apex parallel to (Holotype with material in mandibles, so mand $ described below from paratypes.) Median cly .... </treatment> Enhanced and linked text Formalization of taxonomic publications
  • 15. The way forward or prospective publishing Fresh of press: fully automated distribution of data from publications From discovery to publcation in three weeks …
  • 16. What does this mean? Linked Open Data Cloud http://www.w3.org/wiki/SweoIG/TaskForces/CommunityProjects/LinkingOpenData
  • 18. 1 Million Treatment Goal to complement Global Names Architecture name usages with the respective treatments Semantic enhanced linked publishing
  • 21. Access to ant taxonomic publications through antbase.org /Smithsonian Institution, including currently the entire body of non-copyrighted publications since 1758 (>4,000 publications or The real issue: copyright
  • 22. Restrictions to information exchange: - National security / data protection (n/a) - Copyright (only "works") - Database protection (only private commercial databases) - Data use agreements Copyright issues
  • 23. Obstacles to Plazi workflow: - Scanning / reproduction of works - Scanning / reproduction of databases - Making available of works Copyright issues
  • 24. Legal base for actual workflow - Legal license for internal use in organizations / institutions (Art. 19 CH-Copyright Act) - No database protection in CH - Legal license overrules data use agreements Copyright issues
  • 25. Making available: - Only non copyrighted data (names, treatments, references, ... See http://plazi.org/?q=blue_list) - Works (original publications) restricted to internal use Copyright issues
  • 26. Removing further hurdles to information exchange: - Suggest mandatory legal licenses for research purposes at EU-level - Explore application of extended collective licenses (Scandinavian countries) - Introduce extended collective licenses into CH-copyright law Copyright issues
  • 27. For further reading: http://plazi.org/?q=plazi_publications http://plazi.org Thank you very much! Donat Agosti & Willi Egloff agosti@plazi.org