SlideShare uma empresa Scribd logo
1 de 23
Challenges for semantics in EOL Phenotype Ontology RCN NESCent 25 February 2011 Cynthia Parr National Museum of Natural History Smithsonian Institution
http://www.eol.org ,[object Object]
Summary descriptions across biology domains
Freely accessible
Available from a single portal in a common format
Quality
Always growing,[object Object]
Typical species page
http://www.eol.org/content_partner Objects can come from many partners Objects are sorted by topic and by taxon Each partner gets credit
Curation, Comments, Tags
Not
Statistics 2.8 million pages – one (or more) per taxon 2 million data objects 500 thousand pages with objects 100+ partner databases 700 curators/1000s contributors/~46,000 members
http://NodeXL.codeplex.com
Schema Very coarsely structured 33 subjects (TDWG Species Profile Model) No numeric data Minimal controlled vocabularies API
Corvidae
We have an infrastructure . . . Aggregation mechanisms Names resolution Curation mechanisms Public and machine interfaces Version 2 (August) vastly improved support for community interaction Version 3 (???)
Rich page calculations
Possible path to semantics
What could we do?
Organize info on EOL pages Index by taxon Sort into one of the 33 SPM subjects Improve discoverability
Serve data by API or query interface “Give me all the information you have about the elbow joint and life histories in rodents”

Mais conteúdo relacionado

Mais procurados

Pub medpresentation
Pub medpresentationPub medpresentation
Pub medpresentation
purplemooki
 

Mais procurados (7)

Impact of the evergreen library automation system on public library users
Impact of the evergreen library automation system on public library usersImpact of the evergreen library automation system on public library users
Impact of the evergreen library automation system on public library users
 
The Research Data Life Cycle for Biology - A Researcher Perspective
The Research Data Life Cycle for Biology - A Researcher PerspectiveThe Research Data Life Cycle for Biology - A Researcher Perspective
The Research Data Life Cycle for Biology - A Researcher Perspective
 
Pub medpresentation
Pub medpresentationPub medpresentation
Pub medpresentation
 
Accessing The Materials You Need
Accessing The Materials You NeedAccessing The Materials You Need
Accessing The Materials You Need
 
Webs of Life and Data: Impacts of open and networked data on scientific pract...
Webs of Life and Data: Impacts of open and networked data on scientific pract...Webs of Life and Data: Impacts of open and networked data on scientific pract...
Webs of Life and Data: Impacts of open and networked data on scientific pract...
 
Opportunities in chemical structure standardization
Opportunities in chemical structure standardizationOpportunities in chemical structure standardization
Opportunities in chemical structure standardization
 
Biodiversity Heritage Library
Biodiversity Heritage LibraryBiodiversity Heritage Library
Biodiversity Heritage Library
 

Destaque (6)

Word meaning, sentence meaning, and syntactic meaning
Word meaning, sentence meaning, and syntactic  meaningWord meaning, sentence meaning, and syntactic  meaning
Word meaning, sentence meaning, and syntactic meaning
 
Phrase and sentence meaning
Phrase and sentence meaningPhrase and sentence meaning
Phrase and sentence meaning
 
Unit 3 - Reference and Sense
Unit 3 -  Reference and SenseUnit 3 -  Reference and Sense
Unit 3 - Reference and Sense
 
The Actionable Guide to Doing Better Semantic Keyword Research #BrightonSEO (...
The Actionable Guide to Doing Better Semantic Keyword Research #BrightonSEO (...The Actionable Guide to Doing Better Semantic Keyword Research #BrightonSEO (...
The Actionable Guide to Doing Better Semantic Keyword Research #BrightonSEO (...
 
SEMANTICS
SEMANTICS SEMANTICS
SEMANTICS
 
Challenges and patterns for semantics at scale
Challenges and patterns for semantics at scaleChallenges and patterns for semantics at scale
Challenges and patterns for semantics at scale
 

Semelhante a Challenge of Semantics for the Encyclopedia of Life

BioOne Keynote
BioOne KeynoteBioOne Keynote
BioOne Keynote
drielinger
 
Special Libraries Associatin
Special Libraries AssociatinSpecial Libraries Associatin
Special Libraries Associatin
drielinger
 
Digital Libraries for Science: Botanicus and the Biodiversity Heritage Library
Digital Libraries for Science: Botanicus and the Biodiversity Heritage LibraryDigital Libraries for Science: Botanicus and the Biodiversity Heritage Library
Digital Libraries for Science: Botanicus and the Biodiversity Heritage Library
Chris Freeland
 

Semelhante a Challenge of Semantics for the Encyclopedia of Life (20)

An International Cooperative Digital Library for Taxonomic Literature: The Bi...
An International Cooperative Digital Library for Taxonomic Literature: The Bi...An International Cooperative Digital Library for Taxonomic Literature: The Bi...
An International Cooperative Digital Library for Taxonomic Literature: The Bi...
 
Mla May 7
Mla May 7Mla May 7
Mla May 7
 
BioOne Keynote
BioOne KeynoteBioOne Keynote
BioOne Keynote
 
An International Cooperative Digital Library for Taxonomic Literature: The Bi...
An International Cooperative Digital Library for Taxonomic Literature: The Bi...An International Cooperative Digital Library for Taxonomic Literature: The Bi...
An International Cooperative Digital Library for Taxonomic Literature: The Bi...
 
Biodiversity Heritage Library : Development and Partnerhips
Biodiversity Heritage Library : Development and PartnerhipsBiodiversity Heritage Library : Development and Partnerhips
Biodiversity Heritage Library : Development and Partnerhips
 
Eol fellow-march2010
Eol fellow-march2010Eol fellow-march2010
Eol fellow-march2010
 
A Global Library of Life: The Biodiversity Heritage Library
A Global Library of Life: The Biodiversity Heritage LibraryA Global Library of Life: The Biodiversity Heritage Library
A Global Library of Life: The Biodiversity Heritage Library
 
Ifla Bhl080208cr
Ifla Bhl080208crIfla Bhl080208cr
Ifla Bhl080208cr
 
The Encyclopedia of Life, Biodiversity Heritage Library, Biodiversity Informa...
The Encyclopedia of Life, Biodiversity Heritage Library, Biodiversity Informa...The Encyclopedia of Life, Biodiversity Heritage Library, Biodiversity Informa...
The Encyclopedia of Life, Biodiversity Heritage Library, Biodiversity Informa...
 
Global Library of Life: The Biodiversity Heritage Library
Global Library of Life: The Biodiversity Heritage LibraryGlobal Library of Life: The Biodiversity Heritage Library
Global Library of Life: The Biodiversity Heritage Library
 
Biodiversity Heritage Library: A Conversation About A Collaborative Digitizin...
Biodiversity Heritage Library: A Conversation About A Collaborative Digitizin...Biodiversity Heritage Library: A Conversation About A Collaborative Digitizin...
Biodiversity Heritage Library: A Conversation About A Collaborative Digitizin...
 
EOL biotracker briefing
EOL biotracker briefingEOL biotracker briefing
EOL biotracker briefing
 
Special Libraries Associatin
Special Libraries AssociatinSpecial Libraries Associatin
Special Libraries Associatin
 
The Biodiversity Heritage Library
The Biodiversity Heritage LibraryThe Biodiversity Heritage Library
The Biodiversity Heritage Library
 
Next Generation Catalogs: Extensible Catalog, David Lindahl
Next Generation Catalogs: Extensible Catalog, David LindahlNext Generation Catalogs: Extensible Catalog, David Lindahl
Next Generation Catalogs: Extensible Catalog, David Lindahl
 
Biodiversity Heritiage Library: progress and process
Biodiversity Heritiage Library: progress and processBiodiversity Heritiage Library: progress and process
Biodiversity Heritiage Library: progress and process
 
Digital Libraries for Science: Botanicus and the Biodiversity Heritage Library
Digital Libraries for Science: Botanicus and the Biodiversity Heritage LibraryDigital Libraries for Science: Botanicus and the Biodiversity Heritage Library
Digital Libraries for Science: Botanicus and the Biodiversity Heritage Library
 
BHL Tech Report
BHL Tech ReportBHL Tech Report
BHL Tech Report
 
Smithsonian Libraries 2.0 and the Biodiversity Heritage Library Project
Smithsonian Libraries 2.0 and the Biodiversity Heritage Library ProjectSmithsonian Libraries 2.0 and the Biodiversity Heritage Library Project
Smithsonian Libraries 2.0 and the Biodiversity Heritage Library Project
 
BHL Technology Overview
BHL Technology OverviewBHL Technology Overview
BHL Technology Overview
 

Mais de Cyndy Parr

Parr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbagParr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbag
Cyndy Parr
 
Encyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypes
Cyndy Parr
 

Mais de Cyndy Parr (20)

Open data and the ag data commons
Open data and the ag data commonsOpen data and the ag data commons
Open data and the ag data commons
 
Ag Data Commons for AgBioData
Ag Data Commons for AgBioDataAg Data Commons for AgBioData
Ag Data Commons for AgBioData
 
Biodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscapeBiodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscape
 
Public access to research results at USDA
Public access to research results at USDAPublic access to research results at USDA
Public access to research results at USDA
 
Ag Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataAg Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and data
 
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
 
Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.
 
Parr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbagParr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbag
 
Ag Data Commons: Adding Value to open agricultural research data
Ag Data Commons: Adding Value to open agricultural research dataAg Data Commons: Adding Value to open agricultural research data
Ag Data Commons: Adding Value to open agricultural research data
 
Big Data Initiatives for Agroecosystems
Big Data Initiatives for AgroecosystemsBig Data Initiatives for Agroecosystems
Big Data Initiatives for Agroecosystems
 
TDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's WelcomeTDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's Welcome
 
Behavior ontology workshop princeton
Behavior ontology workshop princetonBehavior ontology workshop princeton
Behavior ontology workshop princeton
 
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
 
Frontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of LifeFrontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of Life
 
Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...
 
Using and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute dataUsing and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute data
 
How the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute dataHow the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute data
 
The Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of LifeThe Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of Life
 
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
 
Encyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypes
 

Último

Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
FIDO Alliance
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc
 

Último (20)

Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptx
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governance
 
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 

Challenge of Semantics for the Encyclopedia of Life

Notas do Editor

  1. So, the approach of EOL is rather different than many other sites. EOL is a giant mashup that creates pages, that are then available for curators to assess and rate, or for anybody to provide comments or tags.
  2. Objects such as these are essentially chunks of text sorted by topic.Each of these credits the source, and can receive comments or ratings, or can be trusted or untrusted by curators.
  3. From this page from LepTree:EpipyropidaePlanthopperparasites
  4. Given this scale, I think ths was the ONLY way we could start.Imagine how large an ontology we’d have to have to fully describe organisms ranging from this tiny Pelagic diatom, 50 microns longWhales, in this case a humpback, many orders of magnitude larger, also pelagic, but physiologically and morphologically quite differentPixie's Parasol, saprophytic organism with complex life cycles (note the collembola on it)An animal like a humpback is characterized in Animal Diversity Web by an ontology with about 400 concepts, just scratches the surface, similarly this Saturnid moth we characterized in the LepTree project with a few more hundred concepts, some of which overlap with the whale but most don’t. The size of ontologies spoken about here is on the order of 5 to 70K conceptsThink about what kind of characters you’d need to characterize this halobacteria – an archaean!!But a scientist studying food webs might want to know characteristics across a wide swath of life.
  5. Represents about 2200 projects, and 1000 instances of data flow or hyperlinks between them. Hundreds of partners, each with their own ontology (in many cases for good reason!) and you can see that the ontology space itself, much less the way you Most of these are NOT using ontologies
  6. One of the things that may be valuable about EOL is the ability to assess the amount of information available for a group of taxaFamily Corvidae, showing the hooded crow here, is where I curate. It has reasonably rich content with 74% of pages having some text though only 27% have images. There are also a large number of unreviewed images (from Wikipedia and Flickr) and text (mostly from Wikipedia) I am working through.This could be expanded to highlight gaps in what we know about organisms – what areas of biology, for example, lack information. Could be used by funding agencies to prioritize grants, by students deciding what needs to be studied.Might show how to find content summaries on current pages
  7. Not biologically relevant concepts but it is a start
  8. Hand wavy, we aren’t actually doing this just yet but we could….Note that by referring to the URIs for the concepts can take advantage of the relationship assertions among the terms, but we don’t need to manage them ourselves, so this might be pointers to the EQ statements described earlier, with enough information here that we can display to humans, but enough info so scientists and ontologists can have the formalisms needed for reasoning
  9. Let’s say we figure out HOW to do it, should we do it?
  10. Good for general public, to the extent that the concepts have understandable labelsThese are from the Animal Diversity Web, put these in the reproduction part of the pageAlong with any other reproduction data we get from other sourcesSome problems – some of our audiences aren’t interested in the fine detail but you never know…how do you decide what to hide?
  11. For scientists, let them download or access the data, providing not only the source of where the info came from but machine-readable URIs that define the concepts, so that they can integrate and perform analyses on the dataDownload data like this, combine it with a phylogeny of rodents and you might be able to test evolutionary hypothesesmiddleman
  12. If querying interfaces or APIs are not your thing, we could easily make the whole web page browsable by semantic web browsers You could do whatever you want with that….
  13. Most ambitious, pie in the sky
  14. Informtics for evolution, systematics, and biodiversity