This document discusses accessible occurrence data and checklists in Canada and what they can tell us about species diversity. It notes that over 11 million occurrence records are available on GBIF from Canada. While these data are accessible, many records have outdated taxonomy or lack location coordinates. The document encourages making checklist data openly available but notes it requires curation. Opportunities for innovative research using these biodiversity data are mentioned.
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
What do accessible occurrence data and checklists tell us about species diversity in Canada?
1. What do accessible occurrence
data and checklists tell us about
species diversity in Canada?
David P. Shorthouse
Université de Montréal / Canadensys
@dpsSpiders
9. Open
• Legally unrestrictive
• Openly licensed
• Can be reused, reconstituted, recombined
republished
FINE PRINT
The Canadian Museum of Nature owns the copyright
of almost all information (data and images) accessed
through its databases.
Data may not be transferred to another database for
distribution to others without prior written permission
from the Canadian Museum of Nature.
10. The Goal
• Establish provenance
• Track citations & metrics of use
• Build new networks of collaboration
• Play a significant role in « Big Data »
17. Species at Risk on Canadensys
24 specimens
Cypripedium candidum Muhl. ex Willd.
Small White Lady’s-slipper
ENDANGERED
5 specimens
Antennaria flagellaris (A. Gray) A. Gray
Stoloniferous Pussytoes
ENDANGERED
23. Utility of « Accessible » Occurrence Data
Hjarding, A., Tolley, K. A., & Burgess, N. D. 2014. Red List assessments of East
African chameleons: a case study of why we need experts. Oryx
DOI 10.1017/s0030605313001427
Photographer: H. Vannoy Davis
« 99.9%of GBIF records
used outdated taxonomy
and 20% had no locality
coordinates »
26. What Can We Mine Now?
http://canent.shorthouse.net
https://github.com/dshorthouse/article_semanticizer
27. Upcoming Events
Canadensys new research initiatives workshop
Anne Bruneau, anne.bruneau@umontreal.ca
http://www.canadensys.net/2014/workshop-proposal
Data Publication Workshop K.W. Neatby Building:
January 13, 14, 2015; co-located in Tallahassee, FL with
iDigBio
James Macklin, james.macklin@agr.gc.ca
David Shorthouse, david.shorthouse@umontreal.ca
28. Take Home
• « Accessible » means computable & open
• Occurrence and checklist data can be
published, but they require curation
– Seek assistance from Canadensys & elsewhere
• There are significant challenges, but we can be
creative
• What new, innovative research can we do with
these data?
29. What do accessible
occurrence data and checklists
say about us?
@dpsSpiders
Editor's Notes
Loaded term
What does this mean?
« Able to be reached or entered »
Homonyms
same name for many taxa
Synonyms
different names for same taxa
Variant representations
orthography, spelling,
differences in authority
Requires infrastructure or the capacity to reconcile scientific names and their surrogates (eg taxon concepts)
TODO:
- Data for red-listed species in Canada (SARA)
The project is an international collaboration between the National Centre for Text Mining (UK), Missouri Botanical Garden (US), Dalhousie University’s Big Data Analytics Institute (Canada) and Ryerson University’s Social Media Lab (Canada). NaCTeM was also a recipient of the 2011 Digging into Data call with the Integrated Social History Environment for Research (ISHER) project.
Started March 2014