SlideShare uma empresa Scribd logo
1 de 31
Bibliographic references in BHL
Coordination and routes for
cooperation across organizations,
projects and e-infrastructures
23rd of May 2013
William Ulate R., Missouri Botanical Garden
Questions to Answer
1. Type of content we discuss (e.g., occurrences, genes, behaviour,
morphology, etc.)
2. Sources of content (from where)
3. Formats of content (formats, standards)
4. Methods of gathering information (e.g., harvesting, ftp uploads,
protocols)
5. Methods of delivery of information (e,g., free searches, API, web
services, automated exports, linking mechanisms, etc.; provide links
to API and web services documentation)
6. Identifiers used (type, persistence, dereferencing, resolvability)
7. Present or forthcoming interoperability features with other
platforms
8. Constraints, needs and expectations to:
a) Suppliers of content, and
b) Users of content
9. What is needed for Bibliographic References?
A brief history…
The Biodiversity Heritage Library
www.biodiversitylibrary.org
Book Viewer
Sharing
BHL shares data through:
APIs
Data Export
OpenURL
OAI-PMH
Open Data
• Downloads
– Simple tab-delimited exports of core data
– http://www.biodiversitylibrary.org/data/BHLExportSchema.pdf
• Data model
– DB schema as ERD
– http://bhl-bits.googlecode.com/files/20090930_BHLDataModel.pdf
Services
• Names Service
– Return all occurrences of a name throughout BHL digitized corpus
• Documentation: http://bit.ly/2e6sg9
– Access to 100+ million name strings using TaxonFinder & NetiNeti
• 1.5 million unique names
– Algorithm to detect nomenclatural & taxonomic acts
• OpenURL
– Facilitate links to citations: protologues, articles, references
• Documentation: http://www.biodiversitylibrary.org/openurlhelp.aspx
– Useful to Nomenclators, Reference Systems
• IPNI
• Tropicos
Services: OpenURL
http://www.biodiversitylibrary.org/openurl?
pid=title:3934&volume=14&issue=&spage=301&date=1879
http://www.tropicos.org/Name/1200408
DOIs
DOIs for Legacy Literature
• BHL member of CrossRef through Smithsonian
• Started assigning DOIs to BHL monographs
– Low hanging fruit: Easy, non-controversial
– 54,856 DOIs Approved to date
• Next, other publication types / articles?
– Process of automatically assigning CrossRef DOIs
to articles has a higher potential for collisions.
Article-level metadata
• Disambiguating and locating structural components
in the corpus
• Done by automated and crowdsourced means
– Thanks Rod Page! Welcome others!
• Greatly increases semantic value of the dataset
• Makes data addressable and thus linkable
Chapter-level metadataTreatment-level metadataPart-level metadata
Genesis: “BHL Article Repository”
• Idea first introduced at TDWG 2008, Fremantle
(by BHL, many have discussed for years)
• YouTube for biodiversity articles
• Needed (need) a way to access articles in BHL
– “BHL has no articles.”
– BHL has hundreds of thousands of articles but you
can’t search for them via author, article title search
– Can find via “article coordinates” using BHL’s UI &
OpenURL resolver: Journal / Volume / Start Page / Year
CiteBank
• Objectives
– Create a repository for community-vetted
taxonomic bibliographies.
– Ability to ingest, display, download, and index
articles so that the BHL can operate as an article
repository.
– Provide links to content published online through
other repositories.
• Launched on December 6th 2010
• 185609 bibliographic records to date
Citations today: http://citebank.org
Citations Providers
Specimen
Databases
Commercial
Aggregators
Software Tools
Open Access
Digital Libraries
Indices
Nomenclators
Specimen
Databases
Commercial
Aggregators
Software Tools
Open Access
Digital Libraries
Indices
Nomenclators
Open Access
Publishers
International Collaborative Projects
Lessons Learned
• Biblio/Drupal data model insufficient for mass of data
envisioned for all biodiversity, too flat and difficult to
expand in collaboration with Biblio development
community
• Data providers want their content findable and
managed in the Biodiversity Heritage Library, not a
system alongside BHL
• Maintaining two platforms for biodiversity literature
threatens sustainability of the literature resources over
the longer term
Global Names Architecture
What have we done?
• Articles
– Extended BHL data model to store article metadata
– Built process to harvest data from BioStor
• Created user interfaces for adding article metadata
and associated files
– Defined functional requirements as improvements to
Drupal-based Citebank
– Defined process flow for adding article metadata and
associated files
– Implemented UI changes
• Changed BHL UI to accommodate article search
• Changed BHL UI to accommodate article display (TOC)
Articles in the BHL UI
Articles
Articles
Articles
Requirements for a citation repository?
Admin. Interface
– IMPORT AND MAPPING TOOL
• Preview/Accept/Reject/Undo/Report on Import
• No standard schema, MODS or Bibtex
• Drag & drop GUI or mapped source and target field config.
– USER MANAGEMENT
• Self-Registration
• Admin. Approval & Deletion
• User Roles Assignment
– GLOBAL UPDATES
Requirements for a citation repository?
General User Interface
– IMPORT
• Upload/Preview/Accept/Reject/Undo/Report on Import
– CREATE CITATION
• By filling a Form, via BibTex
– BROWSE
• Faceted: title,author,subject, year, contributor, my citations
Requirements for a citation repository?
• CITATION TYPES
– Journal Article, Book Chapter, Conference Proceedings,
Conference Paper, Thesis, Government Report, Note, etc.
• OAI HARVESTING
– Harvest and serve data through OAI-PMH
• SPECIFICATIONS FOR DATA PROVIDERS PAGE
• CONTRIBUTORS PAGE
– Recognize ALL contributions
• REPORTING
– Statistics Page by Citation and Publication type
– Recent/Latest Uploads
What are we doing?
• Integrate BHL’s Services with ZooBank, IPNI & IF
• Authoritative list of titles in common use for
nomenclatural acts (“TL3”)
• Harvest relevant content from Mendeley
• Integrate services and interfaces with the GNUB
data model
• Interoperate with citation parsing tools & services
Support citation reconciliation
.
.
.
.
.
.
.
L. Sp. Pl. 2: 971. 1753
Linneaus, C. Species Plantarum, vol. 2 p. 971. 1753
Linné, Carl von. Sp. Pl. Vol. 2 Page 971. 1753
Caroli Linnaei, Species Plantarum exhibentes plantas rite cognitas, ad genera
relatas, cum Differentis Specificis, Nominibus Trivialibus, Synonymis Selectis,
Locis Natalibus, secundum SYSTEMA SEXUALE digestas.. 2:971. 1753
Zea mays
Questions to Answer
1. Type of content - Literature, Images, OCR Text
and Bibliographic Citations
2. Sources of content - BHL, CB & other Repositories
3. Formats of content - BibTex, MODS, DC
4. Methods of gathering info - Harvesting, FTP Uploads
5. Methods of delivery of info - Free Searches, API, web
services, exports, linking
mechanisms
6. Identifiers used - CrossRef DOIs for Monographs
7. Interoperability with
other platforms - Zoobank, IPNI, IF
8. Constraints, needs and expectations to suppliers of content
and users of content
Thank you
pro-iBiosphere Meeting 3
Coordination and routes for cooperation across organizations, projects and e-infrastructures
Berlin, Germany
May 23rd, 2013
William.Ulate@mobot.org
Global BHL Project Manager
BHL Technical Director
Senior Project Manager
Missouri Botanical Garden

Mais conteúdo relacionado

Mais procurados

-Open Archives Initiatives(final)
-Open Archives Initiatives(final)-Open Archives Initiatives(final)
-Open Archives Initiatives(final)
floyd taag
 
Next generation online catalogs
Next generation online catalogsNext generation online catalogs
Next generation online catalogs
afraser246
 

Mais procurados (18)

Describing Theses and Dissertations Using Schema.org
Describing Theses and Dissertations Using Schema.orgDescribing Theses and Dissertations Using Schema.org
Describing Theses and Dissertations Using Schema.org
 
WorldCat Local: Global Network, Local Results
WorldCat Local: Global Network, Local ResultsWorldCat Local: Global Network, Local Results
WorldCat Local: Global Network, Local Results
 
Dulin PermaCC Talk for MIT PIS
Dulin PermaCC Talk for MIT PISDulin PermaCC Talk for MIT PIS
Dulin PermaCC Talk for MIT PIS
 
-Open Archives Initiatives(final)
-Open Archives Initiatives(final)-Open Archives Initiatives(final)
-Open Archives Initiatives(final)
 
NISO Webinar: The Future of Integrated Library Systems PART 2: User Interaction
NISO Webinar: The Future of Integrated Library Systems PART 2: User InteractionNISO Webinar: The Future of Integrated Library Systems PART 2: User Interaction
NISO Webinar: The Future of Integrated Library Systems PART 2: User Interaction
 
Open archives initiatives(final)
 Open archives initiatives(final) Open archives initiatives(final)
Open archives initiatives(final)
 
Role of Cataloger in the 21st Century Academic Library
Role of Cataloger in the 21st Century Academic LibraryRole of Cataloger in the 21st Century Academic Library
Role of Cataloger in the 21st Century Academic Library
 
Metadata in the age of data curation and linked data
Metadata in the age of data curation and linked dataMetadata in the age of data curation and linked data
Metadata in the age of data curation and linked data
 
Gary Price, MIT Program on Information Science
Gary Price, MIT Program on Information ScienceGary Price, MIT Program on Information Science
Gary Price, MIT Program on Information Science
 
Best Practices for Descriptive Metadata
Best Practices for Descriptive MetadataBest Practices for Descriptive Metadata
Best Practices for Descriptive Metadata
 
OER for repository managers
OER for repository managersOER for repository managers
OER for repository managers
 
Open Metrics for Open Repositories at OR2012
Open Metrics for Open Repositories at OR2012Open Metrics for Open Repositories at OR2012
Open Metrics for Open Repositories at OR2012
 
Impact of the evergreen library automation system on public library users
Impact of the evergreen library automation system on public library usersImpact of the evergreen library automation system on public library users
Impact of the evergreen library automation system on public library users
 
Building the new open linked library: Theory and Practice
Building the new open linked library: Theory and PracticeBuilding the new open linked library: Theory and Practice
Building the new open linked library: Theory and Practice
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadata
 
Role of Libraries in the Google Age
Role of Libraries in the Google AgeRole of Libraries in the Google Age
Role of Libraries in the Google Age
 
Next generation online catalogs
Next generation online catalogsNext generation online catalogs
Next generation online catalogs
 
Open source software for implementation of union catalogue
Open source software for implementation of union catalogueOpen source software for implementation of union catalogue
Open source software for implementation of union catalogue
 

Destaque

/Volumes/rooster/media quest
/Volumes/rooster/media quest/Volumes/rooster/media quest
/Volumes/rooster/media quest
gueste64d9cca
 
Lane score powerpoint
Lane score powerpointLane score powerpoint
Lane score powerpoint
guest8af9bb
 

Destaque (9)

/Volumes/rooster/media quest
/Volumes/rooster/media quest/Volumes/rooster/media quest
/Volumes/rooster/media quest
 
Fourth Global BHL Meeting - Technical Update
Fourth Global BHL Meeting - Technical UpdateFourth Global BHL Meeting - Technical Update
Fourth Global BHL Meeting - Technical Update
 
Lane score powerpoint
Lane score powerpointLane score powerpoint
Lane score powerpoint
 
Online ungdomskultur - nye sociale muligheder for Folkekirken
Online ungdomskultur - nye sociale muligheder for FolkekirkenOnline ungdomskultur - nye sociale muligheder for Folkekirken
Online ungdomskultur - nye sociale muligheder for Folkekirken
 
Global BHL Update May 2013
Global BHL Update May 2013Global BHL Update May 2013
Global BHL Update May 2013
 
Ikke om 10 minutter eller lige om lidt...
Ikke om 10 minutter eller lige om lidt...Ikke om 10 minutter eller lige om lidt...
Ikke om 10 minutter eller lige om lidt...
 
Positions Currently Covered Under Special Employee Referral Scheme
Positions Currently Covered Under Special Employee Referral SchemePositions Currently Covered Under Special Employee Referral Scheme
Positions Currently Covered Under Special Employee Referral Scheme
 
5 grundpræmisser for digital deltagelse
5 grundpræmisser for digital deltagelse5 grundpræmisser for digital deltagelse
5 grundpræmisser for digital deltagelse
 
Our digital lives. Participation. Friends. NOW!
Our digital lives. Participation. Friends. NOW!Our digital lives. Participation. Friends. NOW!
Our digital lives. Participation. Friends. NOW!
 

Semelhante a Bibliographic References in BHL

BHL Developments - Prague
BHL Developments - PragueBHL Developments - Prague
BHL Developments - Prague
Chris Freeland
 
Cross-Community User Requirements and the Biodiversity Heritage Library
Cross-Community User Requirements and the Biodiversity Heritage LibraryCross-Community User Requirements and the Biodiversity Heritage Library
Cross-Community User Requirements and the Biodiversity Heritage Library
Chris Freeland
 
Ontology-based Tools to Enhance the Curation Workflow
Ontology-based Tools to Enhance the Curation WorkflowOntology-based Tools to Enhance the Curation Workflow
Ontology-based Tools to Enhance the Curation Workflow
Trish Whetzel
 
Revolutionary and Evolutionary Innovation - Marshall Breeding
Revolutionary and Evolutionary Innovation - Marshall Breeding Revolutionary and Evolutionary Innovation - Marshall Breeding
Revolutionary and Evolutionary Innovation - Marshall Breeding
CONUL Conference
 

Semelhante a Bibliographic References in BHL (20)

BHL Technical Update (May 2013)
BHL Technical Update (May 2013)BHL Technical Update (May 2013)
BHL Technical Update (May 2013)
 
BHL @ #TDWG09 - with discussion
BHL @ #TDWG09 - with discussionBHL @ #TDWG09 - with discussion
BHL @ #TDWG09 - with discussion
 
A Lined Data Approach to Interoperability between Biomedical Resource Invento...
A Lined Data Approach to Interoperability between Biomedical Resource Invento...A Lined Data Approach to Interoperability between Biomedical Resource Invento...
A Lined Data Approach to Interoperability between Biomedical Resource Invento...
 
Recommendation and the Library
Recommendation and the LibraryRecommendation and the Library
Recommendation and the Library
 
BHL Developments - Prague
BHL Developments - PragueBHL Developments - Prague
BHL Developments - Prague
 
Biodiversity Heritiage Library: progress and process
Biodiversity Heritiage Library: progress and processBiodiversity Heritiage Library: progress and process
Biodiversity Heritiage Library: progress and process
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
Enabling Semantically Aware Software Applications
Enabling Semantically Aware Software Applications Enabling Semantically Aware Software Applications
Enabling Semantically Aware Software Applications
 
Cross-Community User Requirements and the Biodiversity Heritage Library
Cross-Community User Requirements and the Biodiversity Heritage LibraryCross-Community User Requirements and the Biodiversity Heritage Library
Cross-Community User Requirements and the Biodiversity Heritage Library
 
Global Library of Life: The Biodiversity Heritage Library
Global Library of Life: The Biodiversity Heritage LibraryGlobal Library of Life: The Biodiversity Heritage Library
Global Library of Life: The Biodiversity Heritage Library
 
Ontology-based Tools to Enhance the Curation Workflow
Ontology-based Tools to Enhance the Curation WorkflowOntology-based Tools to Enhance the Curation Workflow
Ontology-based Tools to Enhance the Curation Workflow
 
Global BHL Activities
Global BHL ActivitiesGlobal BHL Activities
Global BHL Activities
 
Next Generation Repositories
Next Generation RepositoriesNext Generation Repositories
Next Generation Repositories
 
NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)
 
Beyond the catalogue : BibFrame, Linked Data and Ending the Invisible Library
Beyond the catalogue : BibFrame, Linked Data and Ending the 	Invisible LibraryBeyond the catalogue : BibFrame, Linked Data and Ending the 	Invisible Library
Beyond the catalogue : BibFrame, Linked Data and Ending the Invisible Library
 
Implementing web scale discovery services: special reference to Indian Librar...
Implementing web scale discovery services: special reference to Indian Librar...Implementing web scale discovery services: special reference to Indian Librar...
Implementing web scale discovery services: special reference to Indian Librar...
 
Libraries, OA research and OER: towards symbiosis?
Libraries, OA research and OER: towards symbiosis?Libraries, OA research and OER: towards symbiosis?
Libraries, OA research and OER: towards symbiosis?
 
Digital Library Infrastructure for a Million Books
Digital Library Infrastructure for a Million BooksDigital Library Infrastructure for a Million Books
Digital Library Infrastructure for a Million Books
 
Revolutionary and Evolutionary Innovation - Marshall Breeding
Revolutionary and Evolutionary Innovation - Marshall Breeding Revolutionary and Evolutionary Innovation - Marshall Breeding
Revolutionary and Evolutionary Innovation - Marshall Breeding
 
Limitreal
LimitrealLimitreal
Limitreal
 

Mais de William Ulate

Finding the annotation needs of the botanical community in a digital library
Finding the annotation needs of the botanical community in a digital libraryFinding the annotation needs of the botanical community in a digital library
Finding the annotation needs of the botanical community in a digital library
William Ulate
 
Unlocking knowledge in biodiversity legacy literature through automatic seman...
Unlocking knowledge in biodiversity legacy literature through automatic seman...Unlocking knowledge in biodiversity legacy literature through automatic seman...
Unlocking knowledge in biodiversity legacy literature through automatic seman...
William Ulate
 
Purposeful Gaming and BHL
Purposeful Gaming and BHLPurposeful Gaming and BHL
Purposeful Gaming and BHL
William Ulate
 
The Biodiversity Heritage Library: an Open Global Resource of Literature for ...
The Biodiversity Heritage Library: an Open Global Resource of Literature for ...The Biodiversity Heritage Library: an Open Global Resource of Literature for ...
The Biodiversity Heritage Library: an Open Global Resource of Literature for ...
William Ulate
 

Mais de William Ulate (18)

Enhancing the WFO in support of GSPC.pptx
Enhancing the WFO in support of GSPC.pptxEnhancing the WFO in support of GSPC.pptx
Enhancing the WFO in support of GSPC.pptx
 
Finding the annotation needs of the botanical community in a digital library
Finding the annotation needs of the botanical community in a digital libraryFinding the annotation needs of the botanical community in a digital library
Finding the annotation needs of the botanical community in a digital library
 
Botanists and annotations printer friendly
Botanists and annotations   printer friendlyBotanists and annotations   printer friendly
Botanists and annotations printer friendly
 
Expanding Access to Biodiversity Literature. Mining Biodiversity.
Expanding Access to Biodiversity Literature. Mining Biodiversity.Expanding Access to Biodiversity Literature. Mining Biodiversity.
Expanding Access to Biodiversity Literature. Mining Biodiversity.
 
Text Mining Biodiversity 20160127
Text Mining Biodiversity 20160127Text Mining Biodiversity 20160127
Text Mining Biodiversity 20160127
 
BHL Tech Status Update Tech Director W.Ulate 2015.12.11
BHL Tech Status Update Tech Director W.Ulate 2015.12.11BHL Tech Status Update Tech Director W.Ulate 2015.12.11
BHL Tech Status Update Tech Director W.Ulate 2015.12.11
 
Unlocking knowledge in biodiversity legacy literature through automatic seman...
Unlocking knowledge in biodiversity legacy literature through automatic seman...Unlocking knowledge in biodiversity legacy literature through automatic seman...
Unlocking knowledge in biodiversity legacy literature through automatic seman...
 
Engaging the Citizen Scientist in Content Enhancement for BHL
Engaging the Citizen Scientist in Content Enhancement for BHLEngaging the Citizen Scientist in Content Enhancement for BHL
Engaging the Citizen Scientist in Content Enhancement for BHL
 
Digitalización de Literatura de Biodiversidad: an overview of the BHL for CON...
Digitalización de Literatura de Biodiversidad: an overview of the BHL for CON...Digitalización de Literatura de Biodiversidad: an overview of the BHL for CON...
Digitalización de Literatura de Biodiversidad: an overview of the BHL for CON...
 
BHL Technical Director's Report, Mar. 2014
BHL Technical Director's Report, Mar. 2014BHL Technical Director's Report, Mar. 2014
BHL Technical Director's Report, Mar. 2014
 
BHL Markup Efforts and Plans
BHL Markup Efforts and PlansBHL Markup Efforts and Plans
BHL Markup Efforts and Plans
 
Purposeful Gaming and BHL
Purposeful Gaming and BHLPurposeful Gaming and BHL
Purposeful Gaming and BHL
 
A new flora fauna mycota should...
A new flora fauna mycota should...A new flora fauna mycota should...
A new flora fauna mycota should...
 
The BHL way to content
The BHL way to contentThe BHL way to content
The BHL way to content
 
TDWG 2012 Poster for Art of Life project
TDWG 2012 Poster for Art of Life projectTDWG 2012 Poster for Art of Life project
TDWG 2012 Poster for Art of Life project
 
The Biodiversity Heritage Library: an Open Global Resource of Literature for ...
The Biodiversity Heritage Library: an Open Global Resource of Literature for ...The Biodiversity Heritage Library: an Open Global Resource of Literature for ...
The Biodiversity Heritage Library: an Open Global Resource of Literature for ...
 
BHL: Toward a Global, Sustainable Resource
BHL: Toward a Global, Sustainable ResourceBHL: Toward a Global, Sustainable Resource
BHL: Toward a Global, Sustainable Resource
 
Global BHL Meeting Action Items
Global BHL Meeting Action ItemsGlobal BHL Meeting Action Items
Global BHL Meeting Action Items
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 

Bibliographic References in BHL

  • 1. Bibliographic references in BHL Coordination and routes for cooperation across organizations, projects and e-infrastructures 23rd of May 2013 William Ulate R., Missouri Botanical Garden
  • 2. Questions to Answer 1. Type of content we discuss (e.g., occurrences, genes, behaviour, morphology, etc.) 2. Sources of content (from where) 3. Formats of content (formats, standards) 4. Methods of gathering information (e.g., harvesting, ftp uploads, protocols) 5. Methods of delivery of information (e,g., free searches, API, web services, automated exports, linking mechanisms, etc.; provide links to API and web services documentation) 6. Identifiers used (type, persistence, dereferencing, resolvability) 7. Present or forthcoming interoperability features with other platforms 8. Constraints, needs and expectations to: a) Suppliers of content, and b) Users of content 9. What is needed for Bibliographic References?
  • 4. The Biodiversity Heritage Library www.biodiversitylibrary.org
  • 6. Sharing BHL shares data through: APIs Data Export OpenURL OAI-PMH
  • 7. Open Data • Downloads – Simple tab-delimited exports of core data – http://www.biodiversitylibrary.org/data/BHLExportSchema.pdf • Data model – DB schema as ERD – http://bhl-bits.googlecode.com/files/20090930_BHLDataModel.pdf
  • 8. Services • Names Service – Return all occurrences of a name throughout BHL digitized corpus • Documentation: http://bit.ly/2e6sg9 – Access to 100+ million name strings using TaxonFinder & NetiNeti • 1.5 million unique names – Algorithm to detect nomenclatural & taxonomic acts • OpenURL – Facilitate links to citations: protologues, articles, references • Documentation: http://www.biodiversitylibrary.org/openurlhelp.aspx – Useful to Nomenclators, Reference Systems • IPNI • Tropicos
  • 10. DOIs
  • 11. DOIs for Legacy Literature • BHL member of CrossRef through Smithsonian • Started assigning DOIs to BHL monographs – Low hanging fruit: Easy, non-controversial – 54,856 DOIs Approved to date • Next, other publication types / articles? – Process of automatically assigning CrossRef DOIs to articles has a higher potential for collisions.
  • 12. Article-level metadata • Disambiguating and locating structural components in the corpus • Done by automated and crowdsourced means – Thanks Rod Page! Welcome others! • Greatly increases semantic value of the dataset • Makes data addressable and thus linkable Chapter-level metadataTreatment-level metadataPart-level metadata
  • 13. Genesis: “BHL Article Repository” • Idea first introduced at TDWG 2008, Fremantle (by BHL, many have discussed for years) • YouTube for biodiversity articles • Needed (need) a way to access articles in BHL – “BHL has no articles.” – BHL has hundreds of thousands of articles but you can’t search for them via author, article title search – Can find via “article coordinates” using BHL’s UI & OpenURL resolver: Journal / Volume / Start Page / Year
  • 14. CiteBank • Objectives – Create a repository for community-vetted taxonomic bibliographies. – Ability to ingest, display, download, and index articles so that the BHL can operate as an article repository. – Provide links to content published online through other repositories. • Launched on December 6th 2010 • 185609 bibliographic records to date
  • 17. Specimen Databases Commercial Aggregators Software Tools Open Access Digital Libraries Indices Nomenclators Specimen Databases Commercial Aggregators Software Tools Open Access Digital Libraries Indices Nomenclators Open Access Publishers International Collaborative Projects
  • 18. Lessons Learned • Biblio/Drupal data model insufficient for mass of data envisioned for all biodiversity, too flat and difficult to expand in collaboration with Biblio development community • Data providers want their content findable and managed in the Biodiversity Heritage Library, not a system alongside BHL • Maintaining two platforms for biodiversity literature threatens sustainability of the literature resources over the longer term
  • 20. What have we done? • Articles – Extended BHL data model to store article metadata – Built process to harvest data from BioStor • Created user interfaces for adding article metadata and associated files – Defined functional requirements as improvements to Drupal-based Citebank – Defined process flow for adding article metadata and associated files – Implemented UI changes • Changed BHL UI to accommodate article search • Changed BHL UI to accommodate article display (TOC)
  • 21. Articles in the BHL UI
  • 25. Requirements for a citation repository? Admin. Interface – IMPORT AND MAPPING TOOL • Preview/Accept/Reject/Undo/Report on Import • No standard schema, MODS or Bibtex • Drag & drop GUI or mapped source and target field config. – USER MANAGEMENT • Self-Registration • Admin. Approval & Deletion • User Roles Assignment – GLOBAL UPDATES
  • 26. Requirements for a citation repository? General User Interface – IMPORT • Upload/Preview/Accept/Reject/Undo/Report on Import – CREATE CITATION • By filling a Form, via BibTex – BROWSE • Faceted: title,author,subject, year, contributor, my citations
  • 27. Requirements for a citation repository? • CITATION TYPES – Journal Article, Book Chapter, Conference Proceedings, Conference Paper, Thesis, Government Report, Note, etc. • OAI HARVESTING – Harvest and serve data through OAI-PMH • SPECIFICATIONS FOR DATA PROVIDERS PAGE • CONTRIBUTORS PAGE – Recognize ALL contributions • REPORTING – Statistics Page by Citation and Publication type – Recent/Latest Uploads
  • 28. What are we doing? • Integrate BHL’s Services with ZooBank, IPNI & IF • Authoritative list of titles in common use for nomenclatural acts (“TL3”) • Harvest relevant content from Mendeley • Integrate services and interfaces with the GNUB data model • Interoperate with citation parsing tools & services
  • 29. Support citation reconciliation . . . . . . . L. Sp. Pl. 2: 971. 1753 Linneaus, C. Species Plantarum, vol. 2 p. 971. 1753 Linné, Carl von. Sp. Pl. Vol. 2 Page 971. 1753 Caroli Linnaei, Species Plantarum exhibentes plantas rite cognitas, ad genera relatas, cum Differentis Specificis, Nominibus Trivialibus, Synonymis Selectis, Locis Natalibus, secundum SYSTEMA SEXUALE digestas.. 2:971. 1753 Zea mays
  • 30. Questions to Answer 1. Type of content - Literature, Images, OCR Text and Bibliographic Citations 2. Sources of content - BHL, CB & other Repositories 3. Formats of content - BibTex, MODS, DC 4. Methods of gathering info - Harvesting, FTP Uploads 5. Methods of delivery of info - Free Searches, API, web services, exports, linking mechanisms 6. Identifiers used - CrossRef DOIs for Monographs 7. Interoperability with other platforms - Zoobank, IPNI, IF 8. Constraints, needs and expectations to suppliers of content and users of content
  • 31. Thank you pro-iBiosphere Meeting 3 Coordination and routes for cooperation across organizations, projects and e-infrastructures Berlin, Germany May 23rd, 2013 William.Ulate@mobot.org Global BHL Project Manager BHL Technical Director Senior Project Manager Missouri Botanical Garden

Notas do Editor

  1. Guidelines for speakers giving presentationsPresentation are limited to 15 minutes for each speaker plus 5 minutes for discussion.Presentations should clearly answer the following questions (7-8 slides), definitely focusing on the interoperability problem:Type of content we discuss (e.g., occurrences, genes, behaviour, morphology, etc.)Sources of content (from where)Formats of content (formats, standards)Methods of gathering information (e.g., harvesting, ftp uploads, protocols)Methods of delivery of information (e,g., free searches, API, web services, automated exports, linking mechanisms, etc.; provide links to API and web services documentation)Identifiers used (type, persistence, dereferencing, resolvability)’Present or forthcoming interoperability features with other platformsConstraints, needs and expectations to: a) Suppliers of content, and b) Users of contentOverall picture of what is needed within a certain domain (e.g., for names, references, genes, images, etc.) (2-3-slides)The final outputs of presentations and discussions should be two-fold:Summary table encompassing the answers to the above questions, that will be a basis for the whitepaper and future workMoU draft discussedProposing an Advisory Board of key stakeholders that will form the ground for a consortium to develop and launch the future BKMSTasks involved:Task 2.1. Coordination and routes for cooperation across organizations, projects and e-infrastructures (lead: Plazi). Encompassing the information gathered at Workshop 1 (Leiden, February 2013) and through the online questionnaire.Task 4.1 Improve technical cooperation and interoperability at the e-infrastructure level (lead: FUB-BGBM).Task 4.2 Promote and monitor the development and adoption of common mark-up standards and interoperability between schemas by identifying technical and societal constraints and needs to increase collaboration and interoperability between e-platforms and projects, and by envisioning practical solutions towards the Biodiversity Knowledge Management System (lead: Plazi).=============Concrete examples of ideas for potential points in a draft MoUA primary purpose of the “Routes towards cooperation” meeting is to increase our reciprocal understanding and progress towards a multi-institutional Memorandum of Understanding(MoU). The following points are potential points in a draft MoU. It is welcome to comment them here on the wiki before the meeting takes place, or to add further points. The results would then have to be further discussed by the appropriate levels.Establishment of a multi-institutional focus group to coordinate software development to improve the efficiency of resource use by means of common Open Source based development projects using Open Source methodology.Agreements on specialization, e.g., one institution specializes in geographical analysis and visualization, providing services to other institutions or projectsAgreement on long-term management procedures to provide stable identifiers. This agreement may be technology neutral (except that some way to use the identifiers in the human readable as well as semantic web should be specified). Both stable http-URIs (preferred in semantic web) and DOI technology (publishing industry) are possible implementations.Agreement on following the Linked Open Data example. (Note: Edinburgh may be a best practices example?)Agreement to communicate the data policies according to the Linked Open Data five star scoringPolicy agreements on Open AccessAgreement to register all services that are provided to other Biodiversity institutions in the Biodiversity Catalogue (Univ. Manchester, myExperiment).Agreement to communicate the expected and planned stability of services by means of a standard vocabulary (e.g.: undecided, experimental, long-term service without fixed API, long-term service with stable and versioned API)Agreement to collaborate on the development of shared term definitions (glossary-style) with the understanding that new terms can be freely added, but an effort will be made to re-use or improve existing term definitions.Agreement on crowdsourcing activities to clean up data, e.g. bibliographic references, or markup content in legacy literature, e.g. scientific names, treatments, material citations.Paul Kirk: Centrally 'cached' data should have a clear mechanism for providing usage statistics back to sources.
  2. Type of content we discuss (e.g., occurrences, genes, behaviour, morphology, etc.)Sources of content (from where)Formats of content (formats, standards)Methods of gathering information (e.g., harvesting, ftp uploads, protocols)Methods of delivery of information (e,g., free searches, API, web services, automated exports, linking mechanisms, etc.; provide links to API and web services documentation)Identifiers used (type, persistence, dereferencing, resolvability)Present or forthcoming interoperability features with other platformsConstraints, needs and expectations to: a) Suppliers of content, and b) Users of content
  3. [PortalUser Interface]
  4. [Book Viewer Interface]
  5. We ask the user to provide metadata if they’re generating a chapter or book title
  6. On legacy literature, what your plans are with BHL, and especially your move into content?GrowthMore Global ContentTaxon NamesArticle MetadataMicrocitations and COiNSAPIZoobankOCR improvements through GamingCrowdsource MarkupWFO?
  7. [Citebank homepage]
  8. [Citebank homepage]
  9. [Citebank stats]
  10. [World in which CiteBank lives]
  11. [Citations in BHL and Sustainability Considerations]
  12. [Citebank homepage]
  13. [GNA Diagram]
  14. [Define functional requirements]
  15. We ask the user to provide metadata if they’re generating a chapter or book title
  16. We ask the user to provide metadata if they’re generating a chapter or book title
  17. [Where are we going?]
  18. [Diagram of citations reconciliation]
  19. Type of content we discuss (e.g., occurrences, genes, behaviour, morphology, etc.)Sources of content (from where)Formats of content (formats, standards)Methods of gathering information (e.g., harvesting, ftp uploads, protocols)Methods of delivery of information (e,g., free searches, API, web services, automated exports, linking mechanisms, etc.; provide links to API and web services documentation)Identifiers used (type, persistence, dereferencing, resolvability)Present or forthcoming interoperability features with other platformsConstraints, needs and expectations to: a) Suppliers of content, and b) Users of content