SlideShare uma empresa Scribd logo
1 de 22
Preparing a data paper for
Scott Edmunds
scott@gigasciencejournal.com
25th January 2021
0000-0001-6444-1436
https://www.nytimes.com/2019/02/28/learning/teach-about-climate-change-with-these-24-new-york-times-graphs.html
https://www.nytimes.com/interactive/2020/03/21/upshot/coronavirus-deaths-by-country.html
Why do we need data papers?
Why we need to fill biodiversity data gaps
Expert predictions
of species richness
https://www.nature.com/articles/ncomms9221
Completeness of
biodiversity records
Past and current
malaria prevalence
https://en.wikipedia.org/wiki/Malaria#/media/File:World-map-of-past-and-current-malaria-
prevalence-world-development-report-2009.png
Completeness of
biodiversity records
Why we need to fill biodiversity data gaps
Why we need to go beyond open: FAIR
https://doi.org/10.1038/sdata.2016.18
https://www.ands.org.au/working-with-data/fairdata/training
Research data (+ software and underlying
methods) need to be shared for scrutiny and
re-use
Buckheit & Donoho: Scholarly articles
are merely advertisement of scholarship.
The actual scholarly artifacts, i.e. the
data and computational methods, which
support the scholarship, remain largely
inaccessible.
Researcher incentive systems have
not been aligned to this
Also need to be credited/tracked and treated as
first class research objects
…to 2009
From 1665…
How ‘data papers’ enhance FAIRness, visibility, accessibility
and provide credit.
Source: Dimitrova et al. https://doi.org/10.1093/gigascience/giab034
Journal policy & practice slowly catching up
https://f1000research.com/data-policies
In 2013
Rewarding open data: GigaScience
http://gigasciencejournal.com/
Launched July 2012, now partnering with OUP. Publishes “Data Notes” for CC0 data.
Published by:
2011-2016
2016-date
APC covers curation and 1TB of storage in our GigaDB repository
http://gigadb.org/
Rewarding open data: GigaScience
Since 2011,
and working
with
Rewarding open data, pt2:
Launched September 2020 to break barriers of speed, interactivity and cost
Published by:
https://gigabytejournal.com/
Data Publishing: nothing new…
Data & Metadata Collection/Experiments
Analysis/Hypothesis/Analysis
Conclusions
+ Area of Interest/Question
1839
1859
20 Yrs.
Technical features of
Main advantage of workflow is XML from start to end
https://gigabytejournal.com/
Several modules acting as one platform: no
import/export of files, so fast and accurate
Cutting out production allows huge time & cost
saving (currently 4-8 hours per paper)
Any number of versions can be published instantly,
including typographic quality PDF
Allows instantaneous switch of views
Leverage embeddable dynamic content/widgets
Initial focus on forkable products: data + software +
updates
Advantages of GigaByte: interactive features
What does focusing on Data + software + XML allow us to do?
https://youtu.be/TVdKLtRGSYs
Thinking about users: authors, reviewers, readers
https://gigabytejournal.com/
Streamlined questionnaire-based review
Reconfigured for short, easy to write & review data & software papers
Export as PDF, XML, HTML… “on the fly”
Links between
preprints and papers
(inc open review)
Thinking about users: authors, reviewers, readers
https://gigabytejournal.com/data-release-description
Data Release: a short, updatable, description of a research dataset
What does a GigaByte data paper look like?
Discoverability & credit: Highlights and help to
contextualize openly available datasets to
encourage reuse.
Sharing: All data can be linked to the Data Release
via GBIF, GigaDB or other data DOIs or accessions.
Data, not analysis: Incentivizes and allows more
rapid releases of data before subsequent detailed
analysis has been carried out. Or in coordination
with publication of an analysis paper.
Simple: Structure = Context, Methods, Data
Validation and QC, Reuse Potential, Data
Availability
Submit via:
Key to a data paper: Data Availability
Summary of where to find/access all the supporting data
Follows the Data Citation Principles (#CiteTheDOI)
Also collects together other accession numbers and
reporting checklists
https://www.force11.org/datacitationprinciples
The data papers submitted should describe
datasets with the following criteria:
• Data has clear relevance for research on vectors of
human vector-borne diseases
• Dataset contains more than 5,000 records that are
new to GBIF.org in 2021/22 with high-quality data
and metadata
• Data is dedicated to the public domain under an
open CC0 designation
Notes on the series
Data deposition is key, and supported by
GBIF helpdesk and GigaDB curators
• Authors should start by preparing the dataset and
publishing it through GBIF.org before writing
• Support from health@gbif.org for questions on
publishing data through GBIF, data standards, etc.
• GigaDB team (database@gigasciencejournal.com) on
hand to help with additional supporting data
• GigaDB curators will also help review process by
providing a data audit for each submission
Notes on the series
Thanks to TDR/WHO for support of this
datasets on vectors of human diseases series
Due to this very generous sponsorship the
article processing fee (normally $350 USD)
will be waived for the first 15 papers that are
accepted and meet the series criteria.
All authors will be part of a collaborative
follow-up commentary in GigaScience
https://doi.org/10.1186/s13742-016-0121-x
Many thanks to our partners
For further questions contact: editorial@gigabytejournal.com
https://gigabytejournal.com/
Submit now:
Questions?

Mais conteúdo relacionado

Mais procurados

Linked data presentation for who umc 21 jan 2015
Linked data presentation for who umc 21 jan 2015Linked data presentation for who umc 21 jan 2015
Linked data presentation for who umc 21 jan 2015
Kerstin Forsberg
 
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
Dr. Haxel Consult
 
Best Practice on Clean Energy: Claudio Baldassarre
Best Practice on Clean Energy: Claudio BaldassarreBest Practice on Clean Energy: Claudio Baldassarre
Best Practice on Clean Energy: Claudio Baldassarre
Semantic Web Company
 

Mais procurados (20)

Linked data presentation for who umc 21 jan 2015
Linked data presentation for who umc 21 jan 2015Linked data presentation for who umc 21 jan 2015
Linked data presentation for who umc 21 jan 2015
 
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
 
Data management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK StoryData management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK Story
 
7th Content Providers Community Call
7th Content Providers Community Call7th Content Providers Community Call
7th Content Providers Community Call
 
2019-10-11 The value of FAIR data in health data networks - The Hyve - ELIXIR...
2019-10-11 The value of FAIR data in health data networks - The Hyve - ELIXIR...2019-10-11 The value of FAIR data in health data networks - The Hyve - ELIXIR...
2019-10-11 The value of FAIR data in health data networks - The Hyve - ELIXIR...
 
Complying with the EC Open Data Directive
Complying with the EC Open Data DirectiveComplying with the EC Open Data Directive
Complying with the EC Open Data Directive
 
Business context of FAIR health data networks - The Hyve - MEDINFO Lyon 2019
Business context of FAIR health data networks - The Hyve - MEDINFO Lyon 2019Business context of FAIR health data networks - The Hyve - MEDINFO Lyon 2019
Business context of FAIR health data networks - The Hyve - MEDINFO Lyon 2019
 
ICIC 2014 New Product Introduction InfoChem
ICIC 2014 New Product Introduction InfoChemICIC 2014 New Product Introduction InfoChem
ICIC 2014 New Product Introduction InfoChem
 
How 2019 became the year FAIR landed in biopharmaceutical R&D
How 2019 became the year FAIR landed in biopharmaceutical R&DHow 2019 became the year FAIR landed in biopharmaceutical R&D
How 2019 became the year FAIR landed in biopharmaceutical R&D
 
Best Practice on Clean Energy: Claudio Baldassarre
Best Practice on Clean Energy: Claudio BaldassarreBest Practice on Clean Energy: Claudio Baldassarre
Best Practice on Clean Energy: Claudio Baldassarre
 
THOR Ambassador Webinar
THOR Ambassador WebinarTHOR Ambassador Webinar
THOR Ambassador Webinar
 
Supporting Open Data Publishers
Supporting Open Data PublishersSupporting Open Data Publishers
Supporting Open Data Publishers
 
Linked Data efforts for data standards in biopharma and healthcare
Linked Data efforts for data standards in biopharma and healthcareLinked Data efforts for data standards in biopharma and healthcare
Linked Data efforts for data standards in biopharma and healthcare
 
ODIN: Connecting research and researchers
ODIN: Connecting research and researchersODIN: Connecting research and researchers
ODIN: Connecting research and researchers
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly Publishing
 
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
New ways to communicate in science: perspectives from biodiversity research
New ways to communicate in science: perspectives from biodiversity researchNew ways to communicate in science: perspectives from biodiversity research
New ways to communicate in science: perspectives from biodiversity research
 
Practical Guide to Publishing Open Data
Practical Guide to Publishing Open DataPractical Guide to Publishing Open Data
Practical Guide to Publishing Open Data
 
Building A Community Resource For The Life Sciences
Building A Community Resource For The Life SciencesBuilding A Community Resource For The Life Sciences
Building A Community Resource For The Life Sciences
 
WEBINAR: "How to manage your data to make them open and fair"
WEBINAR:  "How to manage your data to make them open and fair"  WEBINAR:  "How to manage your data to make them open and fair"
WEBINAR: "How to manage your data to make them open and fair"
 

Semelhante a Scott Edmunds: Preparing a data paper for GigaByte

CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECAProject
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
Carole Goble
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
Sanjay Padhi, Ph.D
 
Sundmaeker-FGS-Wien-V04.pptx
Sundmaeker-FGS-Wien-V04.pptxSundmaeker-FGS-Wien-V04.pptx
Sundmaeker-FGS-Wien-V04.pptx
FIWARE
 

Semelhante a Scott Edmunds: Preparing a data paper for GigaByte (20)

CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
 
FAIR Cookbook
FAIR Cookbook FAIR Cookbook
FAIR Cookbook
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
 
Data Strategies: Metadata, Open Data, Linked Data
Data Strategies: Metadata, Open Data, Linked DataData Strategies: Metadata, Open Data, Linked Data
Data Strategies: Metadata, Open Data, Linked Data
 
CGSpace and PRMS Information Session
CGSpace and PRMS Information SessionCGSpace and PRMS Information Session
CGSpace and PRMS Information Session
 
HealthData.gov Challenge Webinar
HealthData.gov Challenge WebinarHealthData.gov Challenge Webinar
HealthData.gov Challenge Webinar
 
Making agricultural knowledge globally discoverable: are we there yet?
Making agricultural knowledge globally discoverable: are we there yet?Making agricultural knowledge globally discoverable: are we there yet?
Making agricultural knowledge globally discoverable: are we there yet?
 
Don't think DevOps think Compliant Database DevOps
Don't think DevOps think Compliant Database DevOpsDon't think DevOps think Compliant Database DevOps
Don't think DevOps think Compliant Database DevOps
 
Jesse Xiao at CODATA2017: Updates to the GigaDB open access data publishing p...
Jesse Xiao at CODATA2017: Updates to the GigaDB open access data publishing p...Jesse Xiao at CODATA2017: Updates to the GigaDB open access data publishing p...
Jesse Xiao at CODATA2017: Updates to the GigaDB open access data publishing p...
 
IDW2022: A decades experiences in transparent and interactive publication of ...
IDW2022: A decades experiences in transparent and interactive publication of ...IDW2022: A decades experiences in transparent and interactive publication of ...
IDW2022: A decades experiences in transparent and interactive publication of ...
 
Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...
 
Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2
 
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
 
HKU Data Curation MLIM7350 Class 10
HKU Data Curation MLIM7350 Class 10HKU Data Curation MLIM7350 Class 10
HKU Data Curation MLIM7350 Class 10
 
BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
Sundmaeker-FGS-Wien-V04.pptx
Sundmaeker-FGS-Wien-V04.pptxSundmaeker-FGS-Wien-V04.pptx
Sundmaeker-FGS-Wien-V04.pptx
 
The Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataThe Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big Data
 
dkNET Annual Meeting - June 2017
dkNET Annual Meeting - June 2017dkNET Annual Meeting - June 2017
dkNET Annual Meeting - June 2017
 
Rising tide of data update 20171024
Rising tide of data update 20171024Rising tide of data update 20171024
Rising tide of data update 20171024
 

Mais de GigaScience, BGI Hong Kong

Mais de GigaScience, BGI Hong Kong (20)

Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
 
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
 
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
 
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
 
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project: Production...
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project:  Production...PAGAsia19 - The Digitalization of Ruili Botanical Garden Project:  Production...
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project: Production...
 
Hong Kong Open Access & GigaScience: CCHK@10
Hong Kong Open Access & GigaScience: CCHK@10Hong Kong Open Access & GigaScience: CCHK@10
Hong Kong Open Access & GigaScience: CCHK@10
 
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU GuixRicardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
 
Anil Thanki at #ICG13: Aequatus: An open-source homology browser
Anil Thanki at #ICG13: Aequatus: An open-source homology browserAnil Thanki at #ICG13: Aequatus: An open-source homology browser
Anil Thanki at #ICG13: Aequatus: An open-source homology browser
 
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
 
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant scienceVenice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
 
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
 
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
 
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global PerspectiveChris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
 
Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...
 
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
 
Laurie Goodman: Sharing and Reusing Cell Image Data, ASCB/EMBO 2017 Subgroup ...
Laurie Goodman: Sharing and Reusing Cell Image Data, ASCB/EMBO 2017 Subgroup ...Laurie Goodman: Sharing and Reusing Cell Image Data, ASCB/EMBO 2017 Subgroup ...
Laurie Goodman: Sharing and Reusing Cell Image Data, ASCB/EMBO 2017 Subgroup ...
 
Susanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"event
Susanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"eventSusanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"event
Susanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"event
 
Jie Zheng at #ICG12: PhenoSpD: an atlas of phenotypic correlations and a mult...
Jie Zheng at #ICG12: PhenoSpD: an atlas of phenotypic correlations and a mult...Jie Zheng at #ICG12: PhenoSpD: an atlas of phenotypic correlations and a mult...
Jie Zheng at #ICG12: PhenoSpD: an atlas of phenotypic correlations and a mult...
 
Valerie de Anda at #ICG12: A new multi-genomic approach for the study of biog...
Valerie de Anda at #ICG12: A new multi-genomic approach for the study of biog...Valerie de Anda at #ICG12: A new multi-genomic approach for the study of biog...
Valerie de Anda at #ICG12: A new multi-genomic approach for the study of biog...
 
Zhipeng Li at #ICG12: Draft Genome of the Reindeer (Rangifer tarandus)
Zhipeng Li at #ICG12: Draft Genome of the Reindeer (Rangifer tarandus)Zhipeng Li at #ICG12: Draft Genome of the Reindeer (Rangifer tarandus)
Zhipeng Li at #ICG12: Draft Genome of the Reindeer (Rangifer tarandus)
 

Último

SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
RizalinePalanog2
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Sérgio Sacani
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
ssuser79fe74
 

Último (20)

SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 

Scott Edmunds: Preparing a data paper for GigaByte

  • 1. Preparing a data paper for Scott Edmunds scott@gigasciencejournal.com 25th January 2021 0000-0001-6444-1436
  • 3. Why we need to fill biodiversity data gaps Expert predictions of species richness https://www.nature.com/articles/ncomms9221 Completeness of biodiversity records
  • 4. Past and current malaria prevalence https://en.wikipedia.org/wiki/Malaria#/media/File:World-map-of-past-and-current-malaria- prevalence-world-development-report-2009.png Completeness of biodiversity records Why we need to fill biodiversity data gaps
  • 5. Why we need to go beyond open: FAIR https://doi.org/10.1038/sdata.2016.18 https://www.ands.org.au/working-with-data/fairdata/training
  • 6. Research data (+ software and underlying methods) need to be shared for scrutiny and re-use Buckheit & Donoho: Scholarly articles are merely advertisement of scholarship. The actual scholarly artifacts, i.e. the data and computational methods, which support the scholarship, remain largely inaccessible. Researcher incentive systems have not been aligned to this Also need to be credited/tracked and treated as first class research objects …to 2009 From 1665…
  • 7. How ‘data papers’ enhance FAIRness, visibility, accessibility and provide credit. Source: Dimitrova et al. https://doi.org/10.1093/gigascience/giab034
  • 8. Journal policy & practice slowly catching up https://f1000research.com/data-policies In 2013
  • 9. Rewarding open data: GigaScience http://gigasciencejournal.com/ Launched July 2012, now partnering with OUP. Publishes “Data Notes” for CC0 data. Published by: 2011-2016 2016-date
  • 10. APC covers curation and 1TB of storage in our GigaDB repository http://gigadb.org/ Rewarding open data: GigaScience Since 2011, and working with
  • 11. Rewarding open data, pt2: Launched September 2020 to break barriers of speed, interactivity and cost Published by: https://gigabytejournal.com/
  • 12. Data Publishing: nothing new… Data & Metadata Collection/Experiments Analysis/Hypothesis/Analysis Conclusions + Area of Interest/Question 1839 1859 20 Yrs.
  • 13. Technical features of Main advantage of workflow is XML from start to end https://gigabytejournal.com/ Several modules acting as one platform: no import/export of files, so fast and accurate Cutting out production allows huge time & cost saving (currently 4-8 hours per paper) Any number of versions can be published instantly, including typographic quality PDF Allows instantaneous switch of views Leverage embeddable dynamic content/widgets Initial focus on forkable products: data + software + updates
  • 14. Advantages of GigaByte: interactive features What does focusing on Data + software + XML allow us to do? https://youtu.be/TVdKLtRGSYs
  • 15. Thinking about users: authors, reviewers, readers https://gigabytejournal.com/ Streamlined questionnaire-based review Reconfigured for short, easy to write & review data & software papers Export as PDF, XML, HTML… “on the fly” Links between preprints and papers (inc open review)
  • 16. Thinking about users: authors, reviewers, readers https://gigabytejournal.com/data-release-description Data Release: a short, updatable, description of a research dataset What does a GigaByte data paper look like? Discoverability & credit: Highlights and help to contextualize openly available datasets to encourage reuse. Sharing: All data can be linked to the Data Release via GBIF, GigaDB or other data DOIs or accessions. Data, not analysis: Incentivizes and allows more rapid releases of data before subsequent detailed analysis has been carried out. Or in coordination with publication of an analysis paper. Simple: Structure = Context, Methods, Data Validation and QC, Reuse Potential, Data Availability Submit via:
  • 17. Key to a data paper: Data Availability Summary of where to find/access all the supporting data Follows the Data Citation Principles (#CiteTheDOI) Also collects together other accession numbers and reporting checklists https://www.force11.org/datacitationprinciples
  • 18. The data papers submitted should describe datasets with the following criteria: • Data has clear relevance for research on vectors of human vector-borne diseases • Dataset contains more than 5,000 records that are new to GBIF.org in 2021/22 with high-quality data and metadata • Data is dedicated to the public domain under an open CC0 designation Notes on the series
  • 19. Data deposition is key, and supported by GBIF helpdesk and GigaDB curators • Authors should start by preparing the dataset and publishing it through GBIF.org before writing • Support from health@gbif.org for questions on publishing data through GBIF, data standards, etc. • GigaDB team (database@gigasciencejournal.com) on hand to help with additional supporting data • GigaDB curators will also help review process by providing a data audit for each submission Notes on the series
  • 20. Thanks to TDR/WHO for support of this datasets on vectors of human diseases series Due to this very generous sponsorship the article processing fee (normally $350 USD) will be waived for the first 15 papers that are accepted and meet the series criteria.
  • 21. All authors will be part of a collaborative follow-up commentary in GigaScience https://doi.org/10.1186/s13742-016-0121-x
  • 22. Many thanks to our partners For further questions contact: editorial@gigabytejournal.com https://gigabytejournal.com/ Submit now: Questions?