SlideShare uma empresa Scribd logo
1 de 34
Advancing the International Plant Names Index (IPNI) Nicky Nicolson, Alan Paton, Jim Croft, James Macklin, Paul Morris, Greg Whitbread, Kanchi Gandhi
Advancing IPNI ,[object Object],[object Object],[object Object]
What data? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
 
 
 
 
 
 
 
 
 
How is data entered? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
How is data managed? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Standardisation – author and title
Standardisation – epithet updates
Standardisation of epithets ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Rhus kea mc yi  was an OCR error for  Rhus kea rne yi  but the incorrect value persists in datasets derived from IPNI
Statistics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
http://www.ipni.org/stats.html
As well as the data… ,[object Object],[object Object],[object Object]
Why should anyone care? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Future ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data in - contributor services ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Pre-publication data entry ,[object Object],[object Object],[object Object],[object Object]
Electronic Publication Example - Phytokeys ,[object Object],[object Object],[object Object],[object Object],PhytoKeys 4: 67–94 (2011) doi: 10.3897/phytokeys.4.1581 www.phytokeys.com
Pre-publication issues ,[object Object],[object Object],[object Object],[object Object]
Where IPNI data are placed Any name occurrence: e.g.  specimens, reports, literature citation concepts Standard form of name
Data out - links ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Links to concept layer ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Links to the Concept Layer Example The Plant List
Link to name occurrence layer ,[object Object],[object Object],[object Object],[object Object],[object Object]
Conclusion ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
 

Mais conteúdo relacionado

Mais procurados

Management of Data Collections
Management of Data CollectionsManagement of Data Collections
Management of Data Collections
abedejesus
 
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information SystemsBibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
GESIS
 
From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...
Catherine Canevet
 
FundRef, or Name That Funder!
FundRef, or Name That Funder!FundRef, or Name That Funder!
FundRef, or Name That Funder!
Crossref
 

Mais procurados (20)

Management of Data Collections
Management of Data CollectionsManagement of Data Collections
Management of Data Collections
 
Bourne RDAP11 Data Publication Repositories
Bourne RDAP11 Data Publication RepositoriesBourne RDAP11 Data Publication Repositories
Bourne RDAP11 Data Publication Repositories
 
Data publication: Discover, Explore, Visualise
Data publication: Discover, Explore, VisualiseData publication: Discover, Explore, Visualise
Data publication: Discover, Explore, Visualise
 
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATOMetadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
 
data citation
data citationdata citation
data citation
 
Data Quality and the FAIR principles
Data Quality and the FAIR principlesData Quality and the FAIR principles
Data Quality and the FAIR principles
 
Workshop on Data Quality Management in Wikidata
Workshop on Data Quality Management in WikidataWorkshop on Data Quality Management in Wikidata
Workshop on Data Quality Management in Wikidata
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysis
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystem
 
Workflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopterWorkflows for Publishing Data; Scientific Data's experience as an early adopter
Workflows for Publishing Data; Scientific Data's experience as an early adopter
 
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information SystemsBibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
 
From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
 
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
 
Sharing IR metadata with SHARE
Sharing IR metadata with SHARESharing IR metadata with SHARE
Sharing IR metadata with SHARE
 
National Data Archive (NADA) 3.0
National Data Archive (NADA) 3.0National Data Archive (NADA) 3.0
National Data Archive (NADA) 3.0
 
UKON 2014
UKON 2014UKON 2014
UKON 2014
 
FundRef, or Name That Funder!
FundRef, or Name That Funder!FundRef, or Name That Funder!
FundRef, or Name That Funder!
 
We've Got the Data - Now What Do We Do About It? Applying Quality Standard to...
We've Got the Data - Now What Do We Do About It? Applying Quality Standard to...We've Got the Data - Now What Do We Do About It? Applying Quality Standard to...
We've Got the Data - Now What Do We Do About It? Applying Quality Standard to...
 
FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIR data and model management for systems biology.
FAIR data and model management for systems biology.
 

Destaque

ASSESSMENTS-Taxonomic-Assessments-Javier
ASSESSMENTS-Taxonomic-Assessments-JavierASSESSMENTS-Taxonomic-Assessments-Javier
ASSESSMENTS-Taxonomic-Assessments-Javier
Javier Otegui
 
History of taxonomyyyyy
History of taxonomyyyyyHistory of taxonomyyyyy
History of taxonomyyyyy
Raccoon30
 
Plant Name Services Using Tropicos
Plant Name Services Using TropicosPlant Name Services Using Tropicos
Plant Name Services Using Tropicos
Chris Freeland
 
IPNI PhytoKeys integration
IPNI PhytoKeys integrationIPNI PhytoKeys integration
IPNI PhytoKeys integration
nickyn
 
Segers Introduction To Scientific Nomenclature
Segers Introduction To Scientific NomenclatureSegers Introduction To Scientific Nomenclature
Segers Introduction To Scientific Nomenclature
ICZN
 
History of taxonomy
History of taxonomyHistory of taxonomy
History of taxonomy
Ospina19
 
Angiosperm phylogeny grouping I (APG I)
Angiosperm phylogeny grouping I (APG I)Angiosperm phylogeny grouping I (APG I)
Angiosperm phylogeny grouping I (APG I)
Pabasara Gunawardane
 

Destaque (18)

ASSESSMENTS-Taxonomic-Assessments-Javier
ASSESSMENTS-Taxonomic-Assessments-JavierASSESSMENTS-Taxonomic-Assessments-Javier
ASSESSMENTS-Taxonomic-Assessments-Javier
 
Building a Global Library of Taxonomic Literature
Building a Global Library of Taxonomic LiteratureBuilding a Global Library of Taxonomic Literature
Building a Global Library of Taxonomic Literature
 
Review of new features at www.tropicos.org
Review of new features at www.tropicos.orgReview of new features at www.tropicos.org
Review of new features at www.tropicos.org
 
History of taxonomyyyyy
History of taxonomyyyyyHistory of taxonomyyyyy
History of taxonomyyyyy
 
Plant Name Services Using Tropicos
Plant Name Services Using TropicosPlant Name Services Using Tropicos
Plant Name Services Using Tropicos
 
IPNI PhytoKeys integration
IPNI PhytoKeys integrationIPNI PhytoKeys integration
IPNI PhytoKeys integration
 
Segers Introduction To Scientific Nomenclature
Segers Introduction To Scientific NomenclatureSegers Introduction To Scientific Nomenclature
Segers Introduction To Scientific Nomenclature
 
Dye and Yielding Plants M.P. Dr. Azra khan PH.D. Research Paper
Dye and Yielding Plants M.P. Dr. Azra khan PH.D. Research  Paper Dye and Yielding Plants M.P. Dr. Azra khan PH.D. Research  Paper
Dye and Yielding Plants M.P. Dr. Azra khan PH.D. Research Paper
 
Removing Barriers For Disabled Students - Session Five
Removing Barriers For Disabled Students - Session FiveRemoving Barriers For Disabled Students - Session Five
Removing Barriers For Disabled Students - Session Five
 
Classification
ClassificationClassification
Classification
 
History of taxonomy
History of taxonomyHistory of taxonomy
History of taxonomy
 
Biogeography
BiogeographyBiogeography
Biogeography
 
Tannin yielding plants
Tannin yielding plantsTannin yielding plants
Tannin yielding plants
 
History of international code of botanical nomenclature 1
History of international  code of botanical nomenclature 1History of international  code of botanical nomenclature 1
History of international code of botanical nomenclature 1
 
evidences of anatomy, cytology and chemistry to plant taxonomy
evidences of anatomy, cytology and chemistry to plant taxonomyevidences of anatomy, cytology and chemistry to plant taxonomy
evidences of anatomy, cytology and chemistry to plant taxonomy
 
Angiosperm phylogeny grouping I (APG I)
Angiosperm phylogeny grouping I (APG I)Angiosperm phylogeny grouping I (APG I)
Angiosperm phylogeny grouping I (APG I)
 
Botanical nomenclature
Botanical nomenclatureBotanical nomenclature
Botanical nomenclature
 
Plant taxonomy
Plant taxonomyPlant taxonomy
Plant taxonomy
 

Semelhante a Advancing the International Plant Names Index (IPNI)

2010 nasig integrating_usage_statistics
2010 nasig integrating_usage_statistics2010 nasig integrating_usage_statistics
2010 nasig integrating_usage_statistics
showslidedump
 
Taxonomies for Publishing: Enhancing the User Experience
Taxonomies for Publishing: Enhancing the User ExperienceTaxonomies for Publishing: Enhancing the User Experience
Taxonomies for Publishing: Enhancing the User Experience
TSoholt
 
Getting the Most Out of Your E-Resources: Measuring Success
Getting the Most Out of Your E-Resources: Measuring SuccessGetting the Most Out of Your E-Resources: Measuring Success
Getting the Most Out of Your E-Resources: Measuring Success
kramsey
 
II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...
II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...
II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...
Dr. Haxel Consult
 

Semelhante a Advancing the International Plant Names Index (IPNI) (20)

Dynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & StatisticsDynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & Statistics
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
Author's workflow and the role of open access
Author's workflow and the role of open accessAuthor's workflow and the role of open access
Author's workflow and the role of open access
 
2010 nasig integrating_usage_statistics
2010 nasig integrating_usage_statistics2010 nasig integrating_usage_statistics
2010 nasig integrating_usage_statistics
 
Jonathan Breeze, Symplectic
Jonathan Breeze, SymplecticJonathan Breeze, Symplectic
Jonathan Breeze, Symplectic
 
BLC & Digital Science: Jonathan Breeze, Symplectic
BLC & Digital Science: Jonathan Breeze, SymplecticBLC & Digital Science: Jonathan Breeze, Symplectic
BLC & Digital Science: Jonathan Breeze, Symplectic
 
Sansone mibbi-intro
Sansone mibbi-introSansone mibbi-intro
Sansone mibbi-intro
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
 
Taxonomies for Publishing: Enhancing the User Experience
Taxonomies for Publishing: Enhancing the User ExperienceTaxonomies for Publishing: Enhancing the User Experience
Taxonomies for Publishing: Enhancing the User Experience
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchange
 
Getting the Most Out of Your E-Resources: Measuring Success
Getting the Most Out of Your E-Resources: Measuring SuccessGetting the Most Out of Your E-Resources: Measuring Success
Getting the Most Out of Your E-Resources: Measuring Success
 
Elsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing IndustryElsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing Industry
 
Institutional Identifiers internally and throughout the supply chain
Institutional Identifiers internally and throughout the supply chainInstitutional Identifiers internally and throughout the supply chain
Institutional Identifiers internally and throughout the supply chain
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 
II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...
II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...
II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...
 
British Library
British LibraryBritish Library
British Library
 
GARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant ScienceGARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant Science
 
The Pistoia Alliance Biology Domain Strategy April 2011
The Pistoia Alliance Biology Domain Strategy April 2011The Pistoia Alliance Biology Domain Strategy April 2011
The Pistoia Alliance Biology Domain Strategy April 2011
 
Presentation from Code Camp 2017
Presentation from Code Camp 2017Presentation from Code Camp 2017
Presentation from Code Camp 2017
 
NZ eResearch Symposium 2013 - Capturing the Flux in Scientific Knowledge
NZ eResearch Symposium 2013 - Capturing the Flux in Scientific KnowledgeNZ eResearch Symposium 2013 - Capturing the Flux in Scientific Knowledge
NZ eResearch Symposium 2013 - Capturing the Flux in Scientific Knowledge
 

Mais de nickyn (8)

829 tdwg-2015-nicolson-kew-strings-to-things
829 tdwg-2015-nicolson-kew-strings-to-things829 tdwg-2015-nicolson-kew-strings-to-things
829 tdwg-2015-nicolson-kew-strings-to-things
 
Rda p5-env-plenary-nn
Rda p5-env-plenary-nnRda p5-env-plenary-nn
Rda p5-env-plenary-nn
 
Challenges in developing names services - RDA
Challenges in developing names services - RDAChallenges in developing names services - RDA
Challenges in developing names services - RDA
 
Kew at the pro-iBiosphere data hackathon
Kew at the pro-iBiosphere data hackathonKew at the pro-iBiosphere data hackathon
Kew at the pro-iBiosphere data hackathon
 
names-backbone-graph-TDWG
names-backbone-graph-TDWGnames-backbone-graph-TDWG
names-backbone-graph-TDWG
 
A names backbone - a graph of taxonomy
A names backbone - a graph of taxonomyA names backbone - a graph of taxonomy
A names backbone - a graph of taxonomy
 
Services and Kew's (names) data
Services and Kew's (names) dataServices and Kew's (names) data
Services and Kew's (names) data
 
Building a names backbone
Building a names backboneBuilding a names backbone
Building a names backbone
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 

Advancing the International Plant Names Index (IPNI)

Notas do Editor

  1. Elecronic publication Lisd plant list example
  2. Sample IPNI record
  3. Standardised author
  4. Standardised publication title and collation
  5. Distribution
  6. Type specimen information
  7. Links to other IPNI records
  8. Code annotation (on linked record)
  9. Full record history
  10. Resolvable persistent identifier (LSID), returns structured data in a standard format.
  11. Could mention issues with this here – names aren’t entered until the hard copy arrives at K / HUH library – we estimate 2 year time lag between publication data and entry to IPNI. Stats derived from 2004 onwards. IK editors discussion: Could do some analysis on this with sandwich student
  12. Spelling correction Endings Connecting vowels OCR error fixes
  13. Orange: authors (88.6%) Green: publication titles Author standardisation: 1% rise requires creation of over 25,000 links Checking intensive - often ambiguity in the non-standard, unlinked abbreviations, e.g. un-standardised string ' Henr. ' was found to be: Henrickson  in this string:  ( Henr . ) S.L.Welsh & Crompton Henrard  in this string:  ( Henr . ) Clayton
  14. These shown as number of epithets modified per month July 2010 is when we did a big OCR fix
  15. Screenshot showing propagation of errors on next slide
  16. OCR error translated mc -> rne – dates from IK digitisation T he old version persists in many datasets that have been derived from IPNI. Linking (via persistent identifier – as described in later slide) would ensure that derived datasets benefit from this kind of curation.
  17. Can mention the GTI work here
  18. Stats page on the site at http://www.ipni.org/stats.html contains these tables from 2004 onwards This is data for most recent full year (2010)
  19. BUT the response to user queries has very little visibility - point to point email, only visible to participants, even though the issues discussed may be of wider relevance
  20. NN/AP: Perhaps we should add the average number of searches per day to the stats page.
  21. Division of labour btw nomenclature and taxonomy: IPNI handles citation of name, reference and authorship and objective links such as combination – basionym. Checklists handle taxonomic synonymy and references supporting the assertion of concepts Referencing datasets benefit from ongoing curation of IPNI data.
  22. This string translation higher value than purely lexical approach as an editor has checked it. Edit distance – number of single character transpositions required to modify one string into another NN 2011-07-14: here is another example which might better explain what I am on about: Plectranthus macrophyl i us -> Plectranthus macrophyl l us and Plectranthus m i crophyllus -> Plectranthus m a crophyllus same edit distance (1 character) BUT: former is high value – checked by editor, latter programmatically derived and a much more dangerous assumption to make
  23. Data structure 10 years old – needs re-engineering to deal with requirements. NN: Moved data structure point to the notes as the crux is not just the data structure but the idea that we have a single data structure – technically I’d like us to split between top copy version for editing and multiple (dumber, flatter) slaves to service API calls etc – these can be hit as hard as we like without impacting on the editors workflow. Faceting – different routes to the data.