Challenge of Semantics for the Encyclopedia of Life

An introduction to EOL (http://www.eol.org) and some of the challenges and possible applications for structured, semantic information about biological organisms. Presented at the kick-off meeting of the NSF-funded Phenotype Ontology Research Coordination Network.

Challenges for semantics in EOL. Phenotype Ontology RCN, NESCent, 25 February 2011. Cynthia Parr, National Museum of Natural History, Smithsonian Institution.

1. Challenges for semantics in EOL. Phenotype Ontology RCN, NESCent, 25 February 2011. Cynthia Parr, National Museum of Natural History, Smithsonian Institution.

3. Summary descriptions across biology domains

4. Freely accessible

5. Available from a single portal in a common format

6. Quality

8. Typical species page

9. http://www.eol.org/content_partner: Objects can come from many partners. Objects are sorted by topic and by taxon. Each partner gets credit.

10.

11. Curation, Comments, Tags

12. Not

13. Statistics: 2.8 million pages, one (or more) per taxon; 2 million data objects; 500 thousand pages with objects; 100+ partner databases; 700 curators / 1000s of contributors / ~46,000 members.

14.

15. http://NodeXL.codeplex.com

16. Schema: very coarsely structured; 33 subjects (TDWG Species Profile Model); no numeric data; minimal controlled vocabularies; API.

17. Corvidae

18. We have an infrastructure . . . Aggregation mechanisms; names resolution; curation mechanisms; public and machine interfaces. Version 2 (August): vastly improved support for community interaction. Version 3 (???).

19. Rich page calculations

20. Possible path to semantics

21. What could we do?

22. Organize info on EOL pages: index by taxon; sort into one of the 33 SPM subjects; improve discoverability.

23. Serve data by API or query interface: “Give me all the information you have about the elbow joint and life histories in rodents”

24. Make the whole page semantically browsable (LOD: linked open data): taxon; text blobs; character data; metadata.

25. Consistency checks: curators; crowd-sourcing; reasoning… inferring summaries… mining for patterns? … hypothesis testing?

26. ievobio.org

27. Image credits: Michal Koupý, Lorraine Phelan, David J Patterson, Dmitry Mozzherin.

Editor's Notes

So, the approach of EOL is rather different from that of many other sites. EOL is a giant mashup that creates pages that are then available for curators to assess and rate, or for anybody to provide comments or tags.
Objects such as these are essentially chunks of text sorted by topic. Each of these credits the source, and can receive comments or ratings, or can be trusted or untrusted by curators.
From this page from LepTree: Epipyropidae, planthopper parasites.
Given this scale, I think this was the ONLY way we could start. Imagine how large an ontology we'd have to have to fully describe organisms ranging from this tiny pelagic diatom, 50 microns long, to whales (in this case a humpback), many orders of magnitude larger and also pelagic, but physiologically and morphologically quite different, to Pixie's Parasol, a saprophytic organism with complex life cycles (note the collembola on it). An animal like a humpback is characterized in Animal Diversity Web by an ontology with about 400 concepts, which just scratches the surface. Similarly, this saturniid moth we characterized in the LepTree project with a few hundred more concepts, some of which overlap with the whale but most don't. The size of ontologies spoken about here is on the order of 5 to 70K concepts. Think about what kind of characters you'd need to characterize this halobacteria, an archaean!! But a scientist studying food webs might want to know characteristics across a wide swath of life.
Represents about 2200 projects, and 1000 instances of data flow or hyperlinks between them. Hundreds of partners, each with their own ontology (in many cases for good reason!) and you can see that the ontology space itself, much less the way you… Most of these are NOT using ontologies.
One of the things that may be valuable about EOL is the ability to assess the amount of information available for a group of taxa. Family Corvidae, showing the hooded crow here, is where I curate. It has reasonably rich content, with 74% of pages having some text though only 27% have images. There are also a large number of unreviewed images (from Wikipedia and Flickr) and text (mostly from Wikipedia) that I am working through. This could be expanded to highlight gaps in what we know about organisms: what areas of biology, for example, lack information. It could be used by funding agencies to prioritize grants, or by students deciding what needs to be studied. Might show how to find content summaries on current pages.
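The kind of per-group tally described in this note can be sketched in a few lines. This is only an illustration under stated assumptions: the `TaxonPage` record and `coverage` function are hypothetical names, not part of EOL, and the sample flags are made up rather than taken from real Corvidae pages.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TaxonPage:
    """Hypothetical stand-in for one taxon page; not an EOL data structure."""
    name: str
    has_text: bool
    has_images: bool


def coverage(pages):
    """Return the fraction of pages that have text and that have images."""
    total = len(pages)
    if total == 0:
        return {"text": 0.0, "images": 0.0}
    return {
        "text": sum(p.has_text for p in pages) / total,
        "images": sum(p.has_images for p in pages) / total,
    }


# Made-up content flags for four corvid pages, for illustration only.
pages = [
    TaxonPage("Corvus cornix", has_text=True, has_images=True),
    TaxonPage("Corvus corax", has_text=True, has_images=False),
    TaxonPage("Pica pica", has_text=True, has_images=False),
    TaxonPage("Garrulus glandarius", has_text=False, has_images=False),
]
summary = coverage(pages)
print(f"text: {summary['text']:.0%}, images: {summary['images']:.0%}")
```

Running the same tally per subject rather than per media type would be one way to surface the gaps mentioned above.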
Not biologically relevant concepts but it is a start
Hand wavy, we aren't actually doing this just yet but we could…. Note that by referring to the URIs for the concepts we can take advantage of the relationship assertions among the terms, but we don't need to manage them ourselves. So this might be pointers to the EQ statements described earlier, with enough information here that we can display to humans, but also enough info that scientists and ontologists can have the formalisms needed for reasoning.
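One way to picture "pointers plus enough to display" is a record that stores only concept URIs and labels, leaving the term relationships to the external ontology. This is a minimal sketch, not something EOL implements: the class names, the `example.org` URIs, the labels, and the source name are all hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Concept:
    """A pointer to an externally managed ontology term."""
    uri: str    # machine-readable identifier (placeholder URIs below)
    label: str  # human-readable label for display


@dataclass(frozen=True)
class EQAnnotation:
    """An entity-quality statement attached to a taxon (hypothetical shape)."""
    taxon: str
    entity: Concept
    quality: Concept
    source: str

    def display(self):
        """Text a page could show to a general reader."""
        return (f"{self.taxon}: {self.entity.label} is "
                f"{self.quality.label} (source: {self.source})")

    def machine_form(self):
        """URIs only, so a reasoner can look the terms up elsewhere."""
        return {
            "taxon": self.taxon,
            "entity": self.entity.uri,
            "quality": self.quality.uri,
            "source": self.source,
        }


# Illustrative values only; none of these come from a real ontology.
note = EQAnnotation(
    taxon="Example taxon",
    entity=Concept("http://example.org/entity/wing", "wing"),
    quality=Concept("http://example.org/quality/elongated", "elongated"),
    source="Example partner",
)
print(note.display())
print(note.machine_form())
```

The record never copies the ontology's relationship assertions, which is the point made above: the URIs are enough for someone else's tools to find them.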
Let’s say we figure out HOW to do it, should we do it?
Good for general public, to the extent that the concepts have understandable labels. These are from the Animal Diversity Web; put these in the reproduction part of the page, along with any other reproduction data we get from other sources. Some problems: some of our audiences aren't interested in the fine detail, but you never know… how do you decide what to hide?
For scientists, let them download or access the data, providing not only the source of the info but machine-readable URIs that define the concepts, so that they can integrate and perform analyses on the data. Download data like this, combine it with a phylogeny of rodents, and you might be able to test evolutionary hypotheses. Middleman.
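A download of this kind could be as simple as filtering records by taxon group and concept URI and writing them out with their sources. The sketch below assumes a flat list of dictionaries; the field names, the `example.org` URIs, the partner names, and the values are hypothetical placeholders, not EOL's actual API or data.

```python
import csv
import io

FIELDS = ["taxon", "group", "concept_uri", "value", "source"]

# Placeholder records; the values carry no biological meaning.
RECORDS = [
    {"taxon": "Mus musculus", "group": "Rodentia",
     "concept_uri": "http://example.org/concept/life_history",
     "value": "example value 1", "source": "Partner A"},
    {"taxon": "Rattus norvegicus", "group": "Rodentia",
     "concept_uri": "http://example.org/concept/elbow_joint",
     "value": "example value 2", "source": "Partner B"},
    {"taxon": "Corvus cornix", "group": "Corvidae",
     "concept_uri": "http://example.org/concept/life_history",
     "value": "example value 3", "source": "Partner A"},
]


def select(records, group, concept_uris):
    """Keep records for one taxon group that match any requested concept."""
    return [r for r in records
            if r["group"] == group and r["concept_uri"] in concept_uris]


def to_csv(records):
    """Serialize records to CSV text, keeping source and concept URI columns."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


wanted = {
    "http://example.org/concept/elbow_joint",
    "http://example.org/concept/life_history",
}
rodent_rows = select(RECORDS, "Rodentia", wanted)
print(to_csv(rodent_rows))
```

Because each row carries the concept URI, a scientist could join the output to a phylogeny by taxon name without guessing what a column means.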
If querying interfaces or APIs are not your thing, we could easily make the whole web page browsable by semantic web browsers. You could do whatever you want with that….
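For a page to be browsable as linked open data, its contents have to be expressible as subject-predicate-object statements. Here is a minimal sketch that writes two such statements in N-Triples syntax. The page and object URIs and the `hasTextObject` predicate are hypothetical placeholders; only the `rdfs:label` URI is a real, standard term.

```python
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"  # standard RDFS term

# Hypothetical identifiers, used only for illustration.
PAGE = "http://example.org/pages/hooded-crow"
HAS_TEXT = "http://example.org/terms/hasTextObject"


def literal(text):
    """Quote a plain string as an N-Triples literal, escaping special characters."""
    escaped = (text.replace("\\", "\\\\")
                   .replace('"', '\\"')
                   .replace("\n", "\\n"))
    return f'"{escaped}"'


def triple(subject, predicate, obj, obj_is_uri):
    """Format one statement; the object is either a URI or a plain literal."""
    rendered = f"<{obj}>" if obj_is_uri else literal(obj)
    return f"<{subject}> <{predicate}> {rendered} ."


lines = [
    triple(PAGE, RDFS_LABEL, "Hooded crow", obj_is_uri=False),
    triple(PAGE, HAS_TEXT, "http://example.org/objects/42", obj_is_uri=True),
]
print("\n".join(lines))
```

A real deployment would more likely use an RDF library and shared vocabularies; plain string formatting is used here only to keep the example self-contained.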
Most ambitious, pie in the sky
Informatics for evolution, systematics, and biodiversity
