SlideShare uma empresa Scribd logo
1 de 23
Taxonomic Databases Working Group Annual Meeting 2011 GBIF:  Issues in providing federated access to digital information related to biological specimens. David Remsen Senior Programme Officer Global Biodiversity Information Facility (GBIF) TDWG 2011
Issue #1:  The consequences of scale ,[object Object]
About GBIF ,[object Object],[object Object],[object Object],The mission of the Global Biodiversity Information Facility (GBIF) is to facilitate free and open access to biodiversity data worldwide via the Internet to underpin sustainable development. ,[object Object],[object Object],Primary biodiversity data
“ Wrapper ”  Software PyWrapper  (Python) TAPIR Link (PHP) DiGIR  (PHP) Your database Insect Collection Install one of these  ‘ wrappers ’ ABCD Bird Observations Herbarium Data DarwinCore DarwinCore
The promise of federation Insect Collection Herbarium Bird Observations Herbarium Any specimens from Thailand? GBIF Data Portal I will ask! I do! I do! I do! Nope! GBIF Data Portal as a  Gateway
The challenge of federation Insect Collection Herbarium Bird Observations Herbarium Hello? Server Not Available GBIF Data Portal Hi!
The rise of Indexing Insect Collection Herbarium Bird Observations Herbarium Any data records from Thailand? Send me an index of all of your data GBIF Data Portal  (now with Data!) GBIF Data Portal as a  Data Index
The wrong tools for the job Insect Collection Herbarium Bird Observations Herbarium Any data records from Thailand? Send me an index of your data once per month Here is page one. If I go offline, s tart again Not too fast! You ask the same questions every time GBIF Data Portal  (now with Data!)
Darwin Core Archives A text-based solution to publishing biodiversity data
A Refined Approach Insect Collection Herbarium Bird Observations Herbarium Any data records from Thailand? This is fast! GBIF Data Portal  (now with Data!) URL URL URL URL This is easy
2007 Today 70 million 2010 2008 2009 147 million 180 million 201 million 302 million Growth Need  for a new standard identified
Issue #2:  Geospatial Integration ,[object Object],[object Object]
Geo-referenced USA data Verbatim data as shared on the network
Issue #2:  Geospatial Integration ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Geo-referenced USA data Data following interpretation ,[object Object],[object Object]
Issue #3:  Taxonomic Integration ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Enabled taxonomic data to be published through GBIF
Trochilidae  (Hummingbirds)   (today) Misinterpretations (Hummingbirds are only found in western hemisphere)
Trochilidae  (Hummingbirds)   (next month) Improved interpretation
Search for  Oenanthe ( water dropwort plant   or   wheatear bird ) Difficult for user to interpret Accurate search results Today Next month
Improved the means to match names
In summary ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Thank you

Mais conteúdo relacionado

Mais procurados

Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...
Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...
Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...Kyle Copas
 
GBIF BIFA mentoring, Day 4b Event core, July 2016
GBIF BIFA mentoring, Day 4b Event core, July 2016GBIF BIFA mentoring, Day 4b Event core, July 2016
GBIF BIFA mentoring, Day 4b Event core, July 2016Dag Endresen
 
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016Dag Endresen
 
GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)Dag Endresen
 
Exploring the future of scholarly publishing of biodiversity data
Exploring the future of scholarly publishing of biodiversity dataExploring the future of scholarly publishing of biodiversity data
Exploring the future of scholarly publishing of biodiversity dataVishwas Chavan
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)Dag Endresen
 
GBIF and Open Science
GBIF and Open ScienceGBIF and Open Science
GBIF and Open ScienceDag Endresen
 
GBIF towards 2030 (November 2018)
GBIF towards 2030 (November 2018)GBIF towards 2030 (November 2018)
GBIF towards 2030 (November 2018)Dag Endresen
 
Intro to GBIF: NBN Crowdsourcing Data Capture Summit
Intro to GBIF: NBN Crowdsourcing Data Capture SummitIntro to GBIF: NBN Crowdsourcing Data Capture Summit
Intro to GBIF: NBN Crowdsourcing Data Capture SummitKyle Copas
 
The Global Biodiversity Information Facility and Africa Rising
The Global Biodiversity Information Facility and Africa RisingThe Global Biodiversity Information Facility and Africa Rising
The Global Biodiversity Information Facility and Africa RisingFatima Parker-Allie
 
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...Dag Endresen
 
Nigel Robinson - ZooBank and Zoological Record: a partnership for success
Nigel Robinson - ZooBank and Zoological Record: a partnership for successNigel Robinson - ZooBank and Zoological Record: a partnership for success
Nigel Robinson - ZooBank and Zoological Record: a partnership for successICZN
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18Dag Endresen
 
FAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementFAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementDag Endresen
 
Germplasm data exchange, CGIAR SINGER (2009)
Germplasm data exchange, CGIAR SINGER (2009)Germplasm data exchange, CGIAR SINGER (2009)
Germplasm data exchange, CGIAR SINGER (2009)Dag Endresen
 
Global Biodiversity Information Facility - 2013
Global Biodiversity Information Facility - 2013Global Biodiversity Information Facility - 2013
Global Biodiversity Information Facility - 2013Dag Endresen
 
GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021Dag Endresen
 
Museum collections as research data - October 2019
Museum collections as research data - October 2019Museum collections as research data - October 2019
Museum collections as research data - October 2019Dag Endresen
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeVince Smith
 
COBWEB: Citizen Observatories Web Ecology meets the crowd - Crona Hodges
COBWEB: Citizen Observatories Web Ecology meets the crowd - Crona Hodges COBWEB: Citizen Observatories Web Ecology meets the crowd - Crona Hodges
COBWEB: Citizen Observatories Web Ecology meets the crowd - Crona Hodges COBWEB Project
 

Mais procurados (20)

Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...
Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...
Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...
 
GBIF BIFA mentoring, Day 4b Event core, July 2016
GBIF BIFA mentoring, Day 4b Event core, July 2016GBIF BIFA mentoring, Day 4b Event core, July 2016
GBIF BIFA mentoring, Day 4b Event core, July 2016
 
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
 
GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)
 
Exploring the future of scholarly publishing of biodiversity data
Exploring the future of scholarly publishing of biodiversity dataExploring the future of scholarly publishing of biodiversity data
Exploring the future of scholarly publishing of biodiversity data
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)
 
GBIF and Open Science
GBIF and Open ScienceGBIF and Open Science
GBIF and Open Science
 
GBIF towards 2030 (November 2018)
GBIF towards 2030 (November 2018)GBIF towards 2030 (November 2018)
GBIF towards 2030 (November 2018)
 
Intro to GBIF: NBN Crowdsourcing Data Capture Summit
Intro to GBIF: NBN Crowdsourcing Data Capture SummitIntro to GBIF: NBN Crowdsourcing Data Capture Summit
Intro to GBIF: NBN Crowdsourcing Data Capture Summit
 
The Global Biodiversity Information Facility and Africa Rising
The Global Biodiversity Information Facility and Africa RisingThe Global Biodiversity Information Facility and Africa Rising
The Global Biodiversity Information Facility and Africa Rising
 
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
European agrobiodioversity, ECPGR network meeting on EURISCO, Central Crop Da...
 
Nigel Robinson - ZooBank and Zoological Record: a partnership for success
Nigel Robinson - ZooBank and Zoological Record: a partnership for successNigel Robinson - ZooBank and Zoological Record: a partnership for success
Nigel Robinson - ZooBank and Zoological Record: a partnership for success
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18
 
FAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementFAIR and open biodiversity collection data management
FAIR and open biodiversity collection data management
 
Germplasm data exchange, CGIAR SINGER (2009)
Germplasm data exchange, CGIAR SINGER (2009)Germplasm data exchange, CGIAR SINGER (2009)
Germplasm data exchange, CGIAR SINGER (2009)
 
Global Biodiversity Information Facility - 2013
Global Biodiversity Information Facility - 2013Global Biodiversity Information Facility - 2013
Global Biodiversity Information Facility - 2013
 
GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021
 
Museum collections as research data - October 2019
Museum collections as research data - October 2019Museum collections as research data - October 2019
Museum collections as research data - October 2019
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics Landscape
 
COBWEB: Citizen Observatories Web Ecology meets the crowd - Crona Hodges
COBWEB: Citizen Observatories Web Ecology meets the crowd - Crona Hodges COBWEB: Citizen Observatories Web Ecology meets the crowd - Crona Hodges
COBWEB: Citizen Observatories Web Ecology meets the crowd - Crona Hodges
 

Destaque

Biodiversity capecod short
Biodiversity capecod shortBiodiversity capecod short
Biodiversity capecod shortDavid Remsen
 
Remsen celebration of discovery
Remsen celebration of discoveryRemsen celebration of discovery
Remsen celebration of discoveryDavid Remsen
 
Collaboration Forum Keynote
Collaboration Forum KeynoteCollaboration Forum Keynote
Collaboration Forum KeynoteDavid Remsen
 
Nodes Portal Toolkit Primer
Nodes Portal Toolkit PrimerNodes Portal Toolkit Primer
Nodes Portal Toolkit PrimerDavid Remsen
 
Emergent interdisciplinary research opportunity for the MBL
Emergent interdisciplinary research opportunity for the MBLEmergent interdisciplinary research opportunity for the MBL
Emergent interdisciplinary research opportunity for the MBLDavid Remsen
 
ASP.Net MVC ile Web Uygulamaları - 1(Giriş)
ASP.Net MVC ile Web Uygulamaları - 1(Giriş)ASP.Net MVC ile Web Uygulamaları - 1(Giriş)
ASP.Net MVC ile Web Uygulamaları - 1(Giriş)İbrahim ATAY
 
ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)
ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)
ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)İbrahim ATAY
 

Destaque (9)

Biodiversity capecod short
Biodiversity capecod shortBiodiversity capecod short
Biodiversity capecod short
 
Remsen sherborne
Remsen sherborneRemsen sherborne
Remsen sherborne
 
Remsen celebration of discovery
Remsen celebration of discoveryRemsen celebration of discovery
Remsen celebration of discovery
 
Tdwg 1-remsen
Tdwg 1-remsenTdwg 1-remsen
Tdwg 1-remsen
 
Collaboration Forum Keynote
Collaboration Forum KeynoteCollaboration Forum Keynote
Collaboration Forum Keynote
 
Nodes Portal Toolkit Primer
Nodes Portal Toolkit PrimerNodes Portal Toolkit Primer
Nodes Portal Toolkit Primer
 
Emergent interdisciplinary research opportunity for the MBL
Emergent interdisciplinary research opportunity for the MBLEmergent interdisciplinary research opportunity for the MBL
Emergent interdisciplinary research opportunity for the MBL
 
ASP.Net MVC ile Web Uygulamaları - 1(Giriş)
ASP.Net MVC ile Web Uygulamaları - 1(Giriş)ASP.Net MVC ile Web Uygulamaları - 1(Giriş)
ASP.Net MVC ile Web Uygulamaları - 1(Giriş)
 
ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)
ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)
ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)
 

Semelhante a GBIF Annual Meeting 2011: Issues in Providing Federated Access to Digital Biodiversity Data

Data editors meeting at SEFS
Data editors meeting at SEFSData editors meeting at SEFS
Data editors meeting at SEFSAaike De Wever
 
GBIF-Norway at NMBU, January 2015
GBIF-Norway at NMBU, January 2015GBIF-Norway at NMBU, January 2015
GBIF-Norway at NMBU, January 2015Dag Endresen
 
Chavan Finland 13082009
Chavan Finland 13082009Chavan Finland 13082009
Chavan Finland 13082009Vishwas Chavan
 
TDWG at the University of Tasmania
TDWG at the University of TasmaniaTDWG at the University of Tasmania
TDWG at the University of Tasmanialeebel
 
Remsen EOL Content Summit
Remsen EOL Content SummitRemsen EOL Content Summit
Remsen EOL Content SummitDavid Remsen
 
Cross-Community User Requirements and the Biodiversity Heritage Library
Cross-Community User Requirements and the Biodiversity Heritage LibraryCross-Community User Requirements and the Biodiversity Heritage Library
Cross-Community User Requirements and the Biodiversity Heritage LibraryChris Freeland
 
Ecological Society of America
Ecological Society of America Ecological Society of America
Ecological Society of America Vishwas Chavan
 
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...Dag Endresen
 
GBIF (Global Biodiversity Information Facility) Position Paper: Data Hosting ...
GBIF (Global Biodiversity Information Facility) Position Paper: Data Hosting ...GBIF (Global Biodiversity Information Facility) Position Paper: Data Hosting ...
GBIF (Global Biodiversity Information Facility) Position Paper: Data Hosting ...Phil Cryer
 
2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC Rome2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC RomeDag Endresen
 
PhD defense Julien Troudet (29/11/2017)
PhD defense Julien Troudet (29/11/2017)PhD defense Julien Troudet (29/11/2017)
PhD defense Julien Troudet (29/11/2017)Julien Troudet
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...GigaScience, BGI Hong Kong
 
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3Gianpaolo Coro
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeEdward Baker
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeVince Smith
 
National Biodiversity Informatics Goals
National Biodiversity Informatics GoalsNational Biodiversity Informatics Goals
National Biodiversity Informatics GoalsDavid Remsen
 
D paul ecn2013
D paul ecn2013D paul ecn2013
D paul ecn2013ECNOfficer
 

Semelhante a GBIF Annual Meeting 2011: Issues in Providing Federated Access to Digital Biodiversity Data (20)

Data editors meeting at SEFS
Data editors meeting at SEFSData editors meeting at SEFS
Data editors meeting at SEFS
 
GBIF-Norway at NMBU, January 2015
GBIF-Norway at NMBU, January 2015GBIF-Norway at NMBU, January 2015
GBIF-Norway at NMBU, January 2015
 
Chavan Finland 13082009
Chavan Finland 13082009Chavan Finland 13082009
Chavan Finland 13082009
 
TDWG at the University of Tasmania
TDWG at the University of TasmaniaTDWG at the University of Tasmania
TDWG at the University of Tasmania
 
Remsen EOL Content Summit
Remsen EOL Content SummitRemsen EOL Content Summit
Remsen EOL Content Summit
 
Implementation of Semantic Network Dictionary System for Global Observation ...
Implementation of Semantic Network Dictionary System for Global Observation ...Implementation of Semantic Network Dictionary System for Global Observation ...
Implementation of Semantic Network Dictionary System for Global Observation ...
 
Implementation of semantic network dictionary system
Implementation of semantic network dictionary system Implementation of semantic network dictionary system
Implementation of semantic network dictionary system
 
Cross-Community User Requirements and the Biodiversity Heritage Library
Cross-Community User Requirements and the Biodiversity Heritage LibraryCross-Community User Requirements and the Biodiversity Heritage Library
Cross-Community User Requirements and the Biodiversity Heritage Library
 
Ecological Society of America
Ecological Society of America Ecological Society of America
Ecological Society of America
 
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
 
GBIF (Global Biodiversity Information Facility) Position Paper: Data Hosting ...
GBIF (Global Biodiversity Information Facility) Position Paper: Data Hosting ...GBIF (Global Biodiversity Information Facility) Position Paper: Data Hosting ...
GBIF (Global Biodiversity Information Facility) Position Paper: Data Hosting ...
 
Gbrd Sworkshop Sept09
Gbrd Sworkshop Sept09Gbrd Sworkshop Sept09
Gbrd Sworkshop Sept09
 
2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC Rome2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC Rome
 
PhD defense Julien Troudet (29/11/2017)
PhD defense Julien Troudet (29/11/2017)PhD defense Julien Troudet (29/11/2017)
PhD defense Julien Troudet (29/11/2017)
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
 
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-Life
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-Life
 
National Biodiversity Informatics Goals
National Biodiversity Informatics GoalsNational Biodiversity Informatics Goals
National Biodiversity Informatics Goals
 
D paul ecn2013
D paul ecn2013D paul ecn2013
D paul ecn2013
 

Mais de David Remsen

Use and Limits of Scientific Names in Biological Informatics
Use and Limits of Scientific Names in Biological InformaticsUse and Limits of Scientific Names in Biological Informatics
Use and Limits of Scientific Names in Biological InformaticsDavid Remsen
 
uBio presentation to UMLS group of NLM / NIH
uBio presentation to UMLS group of NLM / NIHuBio presentation to UMLS group of NLM / NIH
uBio presentation to UMLS group of NLM / NIHDavid Remsen
 
uBio presentation to Jim Edwards 2006
uBio presentation to Jim Edwards 2006uBio presentation to Jim Edwards 2006
uBio presentation to Jim Edwards 2006David Remsen
 
uBio presentation to Species 2000 May 2004
uBio presentation to Species 2000 May 2004uBio presentation to Species 2000 May 2004
uBio presentation to Species 2000 May 2004David Remsen
 
Nodes Portal Toolkit primer
Nodes Portal Toolkit primerNodes Portal Toolkit primer
Nodes Portal Toolkit primerDavid Remsen
 
Global Names Architecture - Remsen
Global Names Architecture - RemsenGlobal Names Architecture - Remsen
Global Names Architecture - RemsenDavid Remsen
 
D3 02 Vernacular Names
D3 02 Vernacular NamesD3 02 Vernacular Names
D3 02 Vernacular NamesDavid Remsen
 
D3 02 National Checklists
D3 02 National ChecklistsD3 02 National Checklists
D3 02 National ChecklistsDavid Remsen
 
Cataloging Taxonomic Data
Cataloging Taxonomic DataCataloging Taxonomic Data
Cataloging Taxonomic DataDavid Remsen
 
Digitisation of Taxonomic Data: Current Approaches
Digitisation of Taxonomic Data: Current ApproachesDigitisation of Taxonomic Data: Current Approaches
Digitisation of Taxonomic Data: Current ApproachesDavid Remsen
 

Mais de David Remsen (12)

Use and Limits of Scientific Names in Biological Informatics
Use and Limits of Scientific Names in Biological InformaticsUse and Limits of Scientific Names in Biological Informatics
Use and Limits of Scientific Names in Biological Informatics
 
uBio presentation to UMLS group of NLM / NIH
uBio presentation to UMLS group of NLM / NIHuBio presentation to UMLS group of NLM / NIH
uBio presentation to UMLS group of NLM / NIH
 
uBio presentation to Jim Edwards 2006
uBio presentation to Jim Edwards 2006uBio presentation to Jim Edwards 2006
uBio presentation to Jim Edwards 2006
 
Thomson Reuters
Thomson ReutersThomson Reuters
Thomson Reuters
 
uBio presentation to Species 2000 May 2004
uBio presentation to Species 2000 May 2004uBio presentation to Species 2000 May 2004
uBio presentation to Species 2000 May 2004
 
Nodes Portal Toolkit primer
Nodes Portal Toolkit primerNodes Portal Toolkit primer
Nodes Portal Toolkit primer
 
Remsen sherborne
Remsen sherborneRemsen sherborne
Remsen sherborne
 
Global Names Architecture - Remsen
Global Names Architecture - RemsenGlobal Names Architecture - Remsen
Global Names Architecture - Remsen
 
D3 02 Vernacular Names
D3 02 Vernacular NamesD3 02 Vernacular Names
D3 02 Vernacular Names
 
D3 02 National Checklists
D3 02 National ChecklistsD3 02 National Checklists
D3 02 National Checklists
 
Cataloging Taxonomic Data
Cataloging Taxonomic DataCataloging Taxonomic Data
Cataloging Taxonomic Data
 
Digitisation of Taxonomic Data: Current Approaches
Digitisation of Taxonomic Data: Current ApproachesDigitisation of Taxonomic Data: Current Approaches
Digitisation of Taxonomic Data: Current Approaches
 

Último

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 

Último (20)

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 

GBIF Annual Meeting 2011: Issues in Providing Federated Access to Digital Biodiversity Data

  • 1. Taxonomic Databases Working Group Annual Meeting 2011 GBIF: Issues in providing federated access to digital information related to biological specimens. David Remsen Senior Programme Officer Global Biodiversity Information Facility (GBIF) TDWG 2011
  • 2.
  • 3.
  • 4. “ Wrapper ” Software PyWrapper (Python) TAPIR Link (PHP) DiGIR (PHP) Your database Insect Collection Install one of these ‘ wrappers ’ ABCD Bird Observations Herbarium Data DarwinCore DarwinCore
  • 5. The promise of federation Insect Collection Herbarium Bird Observations Herbarium Any specimens from Thailand? GBIF Data Portal I will ask! I do! I do! I do! Nope! GBIF Data Portal as a Gateway
  • 6. The challenge of federation Insect Collection Herbarium Bird Observations Herbarium Hello? Server Not Available GBIF Data Portal Hi!
  • 7. The rise of Indexing Insect Collection Herbarium Bird Observations Herbarium Any data records from Thailand? Send me an index of all of your data GBIF Data Portal (now with Data!) GBIF Data Portal as a Data Index
  • 8. The wrong tools for the job Insect Collection Herbarium Bird Observations Herbarium Any data records from Thailand? Send me an index of your data once per month Here is page one. If I go offline, s tart again Not too fast! You ask the same questions every time GBIF Data Portal (now with Data!)
  • 9. Darwin Core Archives A text-based solution to publishing biodiversity data
  • 10. A Refined Approach Insect Collection Herbarium Bird Observations Herbarium Any data records from Thailand? This is fast! GBIF Data Portal (now with Data!) URL URL URL URL This is easy
  • 11. 2007 Today 70 million 2010 2008 2009 147 million 180 million 201 million 302 million Growth Need for a new standard identified
  • 12.
  • 13. Geo-referenced USA data Verbatim data as shared on the network
  • 14.
  • 15.
  • 16.
  • 17. Enabled taxonomic data to be published through GBIF
  • 18. Trochilidae (Hummingbirds) (today) Misinterpretations (Hummingbirds are only found in western hemisphere)
  • 19. Trochilidae (Hummingbirds) (next month) Improved interpretation
  • 20. Search for Oenanthe ( water dropwort plant or wheatear bird ) Difficult for user to interpret Accurate search results Today Next month
  • 21. Improved the means to match names
  • 22.

Notas do Editor

  1. To start with, GBIF strives to create a global biodiversity data network that facilitates free and open access to primary biodiversity data worldwide. Currently, the network includes over 9200 datasets from over 340 data publishers representing over 100 countries and international organisations. Collectively the network provides access to over 300 million data records.
  2. The foundation of the GBIF data network has historically been based on access to biodiversity databases mediated through one of the TDWG protocols listed above. These different protocols support the means to query databases in a standard manner and receive data results formatted according to Darwin Core or ABCD XML specifications.
  3. These protocols were designed to support a fully federated network where a user could query the network through a gateway, which would propagate the query to all the members of the network and assemble the resultant responses to the user.
  4. The GBIF network, however, was never able to function in this federated role. Real-time querying of databases was hampered by many factors not the least of which was that at any given time up to ¼ of the data servers were offline.
  5. As a result the GBIF data portals provide discovery of data through a central index. This index consists of a subset of all the data served through the network that can be used to answer the key questions related to the data store – what species are included, where were they found and when were they collected.
  6. DIGIR, TAPIR and BIOCASE are not well suited for building indexes of databases. They require long iterations of queries to harvest an entire dataset. A dataset of 260,000 specimens, served via TAPIR allows 200 records to be retrieved per request. This requires 1300 request/response pairs and takes over 9 hours to compete. During this time 500 MB of XML data is transferred. This is transformed into a 32MB text file once the data are processed in the GBIF server which could have been further compressed to a 3MB zip file. Producing such a data export and zipping it would take under a minute if produced by the database itself. Thus in 2009, GBIF began to promote the use of a new indexing data format.
  7. Darwin Core Archives provide Darwin Core-based occurrence and taxonomic data in a simple, text-based format. It simplifies the exchange of indexes by eliminating the use of federated transfer protocols. Data is accessed via a simple URL using HTTP.
  8. Darwin Core Archives provide GBIF with the means to 1) reduce what is currently more than a months (or more) time between when a data publisher registers data and its subsequent appearance in the data portal. We anticipate that with increased uptake of Darwin Core Archive and improvements in our data integration processes, we can reduce the latency from approx. a month down to a week or less. In addition, Darwin Core Archive has enabled us to index very large datasets that simply could not be harvested using the federated protocols.
  9. Thus, since the Darwin Core Archive standard has been adopted, GBIF has seen a significant increase in the numbers of data records published through the network with a 50% increase in 2011 alone.
  10. A second significant issue that challenges effective delivery of biodiversity data in a federated network is due to issues of quality relating to geospatial properties of records.
  11. This map shows raw data as harvested from data providers that is asserted to originate in the United States. Note the mirror image of the United States over India and China. This is due to a missing negative symbol in the longitude data value.
  12. This is how the data looks like after improved interpretation methods have been applied. We can now recognise international waters and offshore islands.
  13. Providing taxonomic access to biodiversity data is a key requirement for many users. Both DarwinCore and ABCD provide the means for data publishers to include the Linnean classification of the referenced species within the data record. In a federated network, the result is that the same taxon may be classified in different ways. Not only does this complicate assembling a common taxonomic backbone for organising indexed data, it also complicates distinguishing actual homonyms – cases where the same name has been applied to two different taxa. In addition scientific names are often misspelled and even a correctly spelled name may exist as many different orthographies.
  14. GBIF assembles a taxonomic backbone from taxonomic sources that are more authoritative than the classifications included with collections data. These sources are derived from new capacities within the GBIF network that enable species information to be published through the GBIF network in the same manner as collections (species occurrence) data. The GBIF taxonomic backbone, once assembled from a mix of both authoritative and collections-based classifications, is now composed entirely from published taxonomic catalogue data.
  15. An example of how this impacts data organisation and delivery is illustrated in the map above. A european bird species with a name not occurring in the Catalogue of Life was mistakenly placed within the hummingbirds (a new world group) based on classification information tied to some of the specimens. This resulted in the map above where one erroneous species grouping impacts the map for the entire family.
  16. With access to a wider array of authoritative taxonomic sources, we are able to match more taxa using more reliable sources and improve the taxonomic backbone used to organise all species data records.
  17. This improved taxonomic reconciliation extends to the resolution of homonyms – names for different taxa that are spelled alike. Relying solely on taxonomic information within occurrence data sources provides a confusing array of possible homonyms. Relying on taxonomic authority files reveals there are exactly two genera with this name and includes a common name to help distinguish them.
  18. Lastly, informatics improvements complement the addition of authoritative taxonomic sources in providing better methods for matching names to authority files. GBIFs name parsing service parses names into recognised component parts and builds canonical representations of names that allow different forms of the same name to be matched to authority file information.