eol.org
@eol
@cydparr
How the Encyclopedia of Life is
wrangling organismal attribute data
How EOL works
EOL
Crowds
Harvest
Third party applications
EOL Today
Key Milestones in 2013
1.1 million species pages
240+ content providers
3.3 million unique annual visitors from 235 countries
[Figure: bar chart of the number of text objects (axis from 0 to 800,000) by subject of text object. Subjects shown: Distribution, MolecularBiology, Multiple topics, TypeInformation, Habitat, ConservationStatus, Threats, Morphology, Conservation, Management, Trends, Size, Associations, Uses, TrophicStrategy, Cyclicity & Life Cycle, PopulationBiology, Reproduction, Migration, Taxonomy, LifeExpectancy, Identification, Behaviour, Ecology, Diseases]
Text mining, crowdsourcing, standardizing
see http://eol.org/info/fellows
Co-occurrence, term extraction & linked data: Thessen & Devries
EnvO habitat terms: Pafilis et al.
Altitude Specificity of Flower Coloration: Wright
Morphological impacts of extinction risk in fish: Chang
Butterfly-hostplant associations: Ferrer-Paris et al.
Species Interactions: Poelen & Mungall et al.
14 datasets containing 25k taxa, 422k interactions, for 3k locations
alpha version of ingestion, normalization, aggregation
alpha version of web API
alpha version of data exports
Dr. Katy Börner led Information Visualization MOOC
GLoBI http://globalbioticinteractions.wordpress.com/
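The ingestion, normalization, and aggregation steps named above might look roughly like the following sketch. It is only an illustration: the record fields, the synonym table, and the sample records are hypothetical and do not reflect GLoBI's actual schema, code, or data.

```python
from collections import Counter

# Hypothetical raw records as they might arrive from different datasets.
# Field names and values are illustrative only, not GLoBI's actual schema.
RAW_RECORDS = [
    {"source": " Ariopsis felis", "type": "EATS", "target": "Callinectes sapidus "},
    {"source": "ariopsis felis", "type": "eats", "target": "Callinectes sapidus"},
    {"source": "Danaus plexippus", "type": "Feeds on", "target": "Asclepias syriaca"},
]

# Assumed mapping from free-text interaction labels to one controlled term.
INTERACTION_SYNONYMS = {"eats": "eats", "feeds on": "eats", "preys on": "eats"}


def normalize_name(name):
    """Trim whitespace and capitalize only the first word, as in a binomial."""
    parts = name.strip().lower().split()
    return " ".join([parts[0].capitalize()] + parts[1:])


def normalize(record):
    """Return a (source taxon, interaction, target taxon) tuple."""
    label = record["type"].strip().lower()
    return (
        normalize_name(record["source"]),
        INTERACTION_SYNONYMS.get(label, label),
        normalize_name(record["target"]),
    )


def aggregate(records):
    """Count how many raw records support each normalized interaction."""
    return Counter(normalize(r) for r in records)


if __name__ == "__main__":
    for (src, kind, dst), n in aggregate(RAW_RECORDS).items():
        print(f"{src} {kind} {dst} ({n} record(s))")
```

Counting supporting records per normalized interaction is one simple way to aggregate; a real pipeline would also need taxon name resolution and location handling, which this sketch leaves out.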
EOL TraitBank
Funded: Marine focus
Virtuoso triple store, re-using URIs where possible
5 datasets 128,050 data points for 20,896 taxa
Harvest and display on data tab
Downloads, fancy searching
Machine access
Uploads & harvests will be by spreadsheet and Darwin Core Archive
Support for annotation and curation
Please contact me to be part of the private beta
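As a rough illustration of a spreadsheet-style upload, the sketch below writes trait rows to CSV using column names taken from Darwin Core (taxon and MeasurementOrFact terms). Whether TraitBank expects exactly this layout is an assumption, and the taxa, values, and example.org URI are invented placeholders.

```python
import csv
import io

# Column names follow Darwin Core terms (taxon plus MeasurementOrFact).
# That TraitBank accepts exactly this layout is an assumption.
COLUMNS = [
    "taxonID",
    "scientificName",
    "measurementType",
    "measurementValue",
    "measurementUnit",
]

# Placeholder rows: taxa, values, and the example.org URI are invented.
ROWS = [
    {
        "taxonID": "taxon-1",
        "scientificName": "Examplea prima",
        "measurementType": "http://example.org/terms/bodyLength",
        "measurementValue": "12.5",
        "measurementUnit": "mm",
    },
    {
        "taxonID": "taxon-2",
        "scientificName": "Examplea secunda",
        "measurementType": "http://example.org/terms/bodyLength",
        "measurementValue": "48",
        "measurementUnit": "mm",
    },
]


def find_problems(rows):
    """List rows that lack a required column or have a non-numeric value.

    The numeric check suits this length example only; categorical traits
    would need a different rule.
    """
    problems = []
    for i, row in enumerate(rows, start=1):
        missing = [c for c in COLUMNS if not row.get(c)]
        if missing:
            problems.append(f"row {i}: missing {', '.join(missing)}")
            continue
        try:
            float(row["measurementValue"])
        except ValueError:
            problems.append(f"row {i}: measurementValue is not a number")
    return problems


def to_csv(rows):
    """Serialize rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(find_problems(ROWS) or "no problems found")
    print(to_csv(ROWS))
```

Putting a URI in `measurementType` rather than a free-text label mirrors the idea of re-using URIs where possible, so that the same trait from different providers can be matched.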
Easy access to analyzable trait data
“Are blue organisms more common in high altitudes?”
“Does the evolution of mammalian bacula appear to be related to the pattern of promiscuous mating?”
“What organisms should I collect to fill in gaps in genome quality tissue collections?”
• Look for trait, download for all taxa
• Create a collection of taxa, download all data
• Use Reol: an R interface to EOL (Banbury, O’Meara), http://reolblog.wordpress.com/
• Find more specialized data repositories
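Once a trait has been downloaded for many taxa, a question such as the one about blue organisms and altitude could be explored along these lines. This is a minimal sketch: the table stands in for a download, and its column names, taxa, colors, altitudes, and the 2,000 m threshold are all invented for illustration.

```python
import csv
import io

# Stand-in for a downloaded trait table. Taxa, colors, and altitudes are
# invented placeholders; the column names are hypothetical as well.
SAMPLE_CSV = """taxon,color,altitude_m
Taxon A,blue,2400
Taxon B,red,300
Taxon C,blue,2900
Taxon D,yellow,2100
Taxon E,blue,150
Taxon F,white,500
"""


def load_rows(text):
    """Parse CSV text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def color_share(rows, color):
    """Fraction of rows with the given color, or None for an empty group."""
    if not rows:
        return None
    return sum(1 for r in rows if r["color"] == color) / len(rows)


def compare_by_altitude(rows, color, threshold_m):
    """Return (share at or above the threshold, share below it)."""
    high = [r for r in rows if float(r["altitude_m"]) >= threshold_m]
    low = [r for r in rows if float(r["altitude_m"]) < threshold_m]
    return color_share(high, color), color_share(low, color)


if __name__ == "__main__":
    high, low = compare_by_altitude(load_rows(SAMPLE_CSV), "blue", 2000)
    print(f"share of blue taxa at or above 2000 m: {high:.2f}")
    print(f"share of blue taxa below 2000 m: {low:.2f}")
```

Comparing two proportions is only a first look; a real analysis would need many more taxa and some control for shared ancestry among them.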
But also . . .
Thanks
Funding & other contributions
Sloan Foundation
Smithsonian Institution
David Rubenstein
Marine Biological Laboratory
Harvard University
Our content partners
Thousands of individual contributors, and hundreds of volunteer curators
Image credits
Jenny from Taipei
Cynthia Parr
Chief Scientist @eol
@cydparr parrc@si.edu
Alexandria Archive: Sarah Kansa, Eric Kansa, 34 other zooarchaeologists
GLoBI: Jorrit Poelen (lead/software), Chris Mungall (ontologies), James Simons (biologist) and Robert Reiz (software). Datasets shared by: Peter D. Roopnarine, Rachel Hertog, Carlos García-Robledo, James Simons, Jenny L. Wrast, C. Barnes, International Council for the Exploration of the Sea (ICES), Jose R. Ferrer Paris, Senol Akin, Malcolm Storey (BioInfo.org.uk), Ivy E. Baremore, Joel Sachs (SPIRE), Colt W. Cook, David A. Blewett
Quick math
In Phenoscape
57 publications had 565,158 anatomical trait descriptions for 2,527 kinds of organisms
≈ 224 traits/organism
In ZFIN
38,189 trait descriptions for 4,727 genes for zebrafish
1.9 million species on the planet
= LOTS OF TRAITS
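The arithmetic behind these figures can be checked with a few lines of Python. The counts are the ones quoted above; the projection to all species assumes, purely for illustration, that every species would be described at the Phenoscape rate, which the slide does not claim.

```python
# Counts quoted on the slide.
PHENOSCAPE_DESCRIPTIONS = 565_158
PHENOSCAPE_ORGANISMS = 2_527
ZFIN_DESCRIPTIONS = 38_189
ZFIN_GENES = 4_727
SPECIES_ON_PLANET = 1_900_000

# Average number of trait descriptions per kind of organism in Phenoscape.
traits_per_organism = PHENOSCAPE_DESCRIPTIONS / PHENOSCAPE_ORGANISMS

# Average number of trait descriptions per zebrafish gene in ZFIN.
descriptions_per_gene = ZFIN_DESCRIPTIONS / ZFIN_GENES

# Assumption (not from the slide): all species are described at the
# Phenoscape rate. This only gives a sense of scale.
projected_descriptions = traits_per_organism * SPECIES_ON_PLANET

if __name__ == "__main__":
    print(f"Phenoscape: {traits_per_organism:.1f} descriptions per organism")
    print(f"ZFIN: {descriptions_per_gene:.2f} descriptions per gene")
    print(f"Projected: {projected_descriptions / 1e6:.0f} million descriptions")
```

The Phenoscape ratio comes to about 223.6 descriptions per organism, so the projection lands at roughly 425 million descriptions.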
Anatolia Zooarchaeology Case Study led by Alexandria Archive Institute
1. 14 different sites
2. 34+ zooarchaeologists
3. Decoding, cleanup, metadata documentation
4. 220,000+ specimens
5. 450 entities linked to 143 EOL taxon concepts
6. Anatomical entities linked to Uberon.org
7. Biometrics linked to measurement ontology
8. Collaborative analysis
http://opencontext.org/

Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

How the Encyclopedia of Life is wrangling organismal attribute data

  • 1. eol.org @eol @cydparr How the Encyclopedia of Life is wrangling organismal attribute data
  • 3. EOL Today. Key milestones in 2013: 1.1 million species pages; 240+ content providers; 3.3 million unique annual visitors from 235 countries.
  • 4. [Bar chart: number of text objects (axis from 0 to 800,000) by subject of text object. Subjects: Distribution, Molecular Biology, Multiple topics, Type Information, Habitat, Conservation Status, Threats, Morphology, Conservation, Management, Trends, Size, Associations, Uses, Trophic Strategy, Cyclicity & Life Cycle, Population Biology, Reproduction, Migration, Taxonomy, Life Expectancy, Identification, Behaviour, Ecology, Diseases.]
  • 5. Text mining, crowdsourcing, standardizing (see http://eol.org/info/fellows). Co-occurrence, term extraction & linked data (Thessen & Devries); EnvO habitat terms (Pafilis et al.); Altitude Specificity of Flower Coloration (Wright); Morphological impacts of extinction risk in fish (Chang); Butterfly-hostplant associations (Ferrer-Paris et al.); Species Interactions (Poelen & Mungall et al.).
  • 6. GloBI (http://globalbioticinteractions.wordpress.com/): 14 datasets containing 25k taxa and 422k interactions for 3k locations; alpha version of ingestion, normalization, aggregation; alpha version of web API; alpha version of data exports; Dr. Katy Börner led Information Visualization MOOC.
  • 7. EOL TraitBank. Funded: marine focus. Virtuoso triple store, re-using URIs where possible. 5 datasets: 128,050 data points for 20,896 taxa. Harvest and display on data tab. Downloads, fancy searching. Machine access.
  • 8.
  • 9.
  • 10. Uploads & harvests will be by spreadsheet and Darwin Core Archive. Support for annotation and curation. Please contact me to be part of the private beta.
  • 11. Easy access to analyzable trait data. “Are blue organisms more common at high altitudes?” “Does the evolution of mammalian bacula appear to be related to the pattern of promiscuous mating?” “What organisms should I collect to fill in gaps in genome-quality tissue collections?” • Look for a trait, download for all taxa • Create a collection of taxa, download all data • Use Reol: an R interface to EOL (Banbury, O’Meara) http://reolblog.wordpress.com/ • Find more specialized data repositories
  • 12. But also . . .
  • 13. Thanks. Funding & other contributions: Sloan Foundation, Smithsonian Institution, David Rubenstein, Marine Biological Laboratory, Harvard University. Our content partners. Thousands of individual contributors, and hundreds of volunteer curators. Image credits: Jenny from Taipei. Cynthia Parr, Chief Scientist, @eol @cydparr, parrc@si.edu. Alexandria Archive: Sarah Kansa, Eric Kansa, 34 other zooarchaeologists. GloBI: Jorrit Poelen (lead/software), Chris Mungall (ontologies), James Simons (biologist) and Robert Reiz (software). Datasets shared by: Peter D. Roopnarine, Rachel Hertog, Carlos García-Robledo, James Simons, Jenny L. Wrast, C. Barnes, International Council for the Exploration of the Sea (ICES), Jose R. Ferrer Paris, Senol Akin, Malcolm Storey (BioInfo.org.uk), Ivy E. Baremore, Joel Sachs (SPIRE), Colt W. Cook, David A. Blewett.
  • 14. Quick math. In Phenoscape, 57 publications had 565,158 anatomical trait descriptions for 2,527 kinds of organisms ≈ 223.6 traits/organism. In ZFIN, 38,189 trait descriptions for 4,727 genes for zebrafish. 1.9 million species on the planet = LOTS OF TRAITS.
  • 15. Anatolia Zooarchaeology Case Study led by Alexandria Archive Institute 1. 14 different sites 2. 34+ zooarchaeologists 3. Decoding, cleanup, metadata documentation 4. 220,000+ specimens 5. 450 entities linked to 143 EOL taxon concepts 6. Anatomical entities linked to Uberon.org 7. Biometrics linked to measurement ontology 8. Collaborative analysis http://opencontext.org/
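
The arithmetic on the "Quick math" slide can be reproduced in a few lines of Python. This is a minimal sketch: the counts come from the slide, while the final extrapolation to 1.9 million species assumes, purely for illustration, that every species would be described at the Phenoscape rate, which the slide itself does not claim. The helper name `per_unit` is hypothetical.

```python
# Counts quoted on the "Quick math" slide.
PHENOSCAPE_DESCRIPTIONS = 565_158
PHENOSCAPE_TAXA = 2_527
ZFIN_DESCRIPTIONS = 38_189
ZFIN_GENES = 4_727
SPECIES_ON_PLANET = 1_900_000


def per_unit(total: int, units: int) -> float:
    """Average number of trait descriptions per unit (taxon or gene)."""
    return total / units


traits_per_taxon = per_unit(PHENOSCAPE_DESCRIPTIONS, PHENOSCAPE_TAXA)
traits_per_gene = per_unit(ZFIN_DESCRIPTIONS, ZFIN_GENES)

# Illustrative assumption only: apply the Phenoscape rate to every species.
extrapolated = traits_per_taxon * SPECIES_ON_PLANET

print(f"Phenoscape: {traits_per_taxon:.1f} descriptions per kind of organism")
print(f"ZFIN: {traits_per_gene:.1f} descriptions per gene")
print(f"Extrapolated: about {extrapolated / 1e6:.0f} million descriptions")
```

The extrapolated figure is only an order-of-magnitude prompt for the slide's "LOTS OF TRAITS" point, not an estimate from the talk.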

Editor's Notes

  1. We have a working infrastructure as well as more than 200 partners. We harvest and sort text and multimedia by topic and by species and put it on our pages. Curation and user-added content from the crowds are added to the mix. This is fed back to providers, giving them traffic, quality control on their own content, and new content for them to use. And we are already seeing spinoff products. We make it easy for developers, and everything is either public domain or CC-licensed so it can be re-used.
  2. We now have over a million pages with content; some of it is even in other languages such as Arabic, Spanish, and Chinese. And we are getting traffic, mostly from the general public, from all over the world.
  3. Most of our 5.4 million content objects are text blobs, and here are the subjects of that text. Most often, our text objects are about distribution. But there are many other subjects, including essays that cover multiple subjects.
  4. Except for the first, links for that one on request
  5. In the Information Visualization MOOC (Massive Open Online Course) led by Dr. Katy Börner of Indiana University, students Twy Bethard (United States), Andrew Miles (United Kingdom), Edward Kok (Netherlands) and Mattia Della Libera (Italy) used GloBI data to create an insightful visualization of spatial marine food webs in the Gulf of Mexico.
  6. Starting with marine data. In the most simplistic view, we’ll be storing triples. This data will be organized on a data tab, sorting the data into the 35 or so “topics” that we currently have text chapters for, and we will also allow powerful downloading and searching capability. Finally, we’ll be setting up ways for other applications to grab the data and do interesting things with it. We already have a tool for making field guides. The approach here builds on our innovations for EOL and adds some proven technology called the “semantic web” to our domain. The next step takes this chain of innovation even further.
  7. Drawing data from the literature, from online databases, and from published datasets as in Dryad, summarizing collections databases
  8. Everyone wants to know the attributes of organisms. People exploring the world find something and want to be able to search on characteristics they can see. Teachers want their students to become adept at analyzing data, and how better than to work with real numerical information about the size of organisms, their behavior, or their sensitivity to temperature and what might happen in the face of climate change? So while scientists were saying they needed us to provide data they could analyze, we heard the same thing from our educators, too.
  9. Phenoscape is a database of anatomical traits in fishes. From just 57 publications they have more than 500K descriptions for about 2,500 kinds of organisms. ZFIN is a model organism database for zebrafish, a common model organism for developmental biologists. In just this one species they have captured nearly 40,000 traits, just for ONE very well-studied SPECIES.
  10. .
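
Note 6 describes the data as triples that are sorted into topics on a data tab. A minimal Python sketch of that idea follows. The `Triple` type, the `group_by_predicate` helper, and every URI and value in it are hypothetical placeholders chosen for illustration; none of them is taken from TraitBank or its Virtuoso store.

```python
from collections import defaultdict
from typing import NamedTuple


class Triple(NamedTuple):
    """One subject-predicate-object statement about a taxon."""
    subject: str    # URI identifying the taxon
    predicate: str  # URI identifying the attribute being described
    obj: str        # the value, kept as a plain string in this sketch


def group_by_predicate(triples):
    """Group triples by attribute, roughly as a data tab might sort them."""
    grouped = defaultdict(list)
    for t in triples:
        grouped[t.predicate].append((t.subject, t.obj))
    return dict(grouped)


# Hypothetical example data; the example.org URIs are placeholders.
triples = [
    Triple("http://example.org/taxon/1", "http://example.org/trait/body_length", "12 cm"),
    Triple("http://example.org/taxon/2", "http://example.org/trait/body_length", "30 cm"),
    Triple("http://example.org/taxon/1", "http://example.org/trait/habitat", "marine"),
]

for predicate, rows in group_by_predicate(triples).items():
    print(predicate)
    for subject, value in rows:
        print(f"  {subject}: {value}")
```

Keeping the predicate as a URI rather than a free-text label is what would let a store re-use existing vocabulary terms where possible, as the TraitBank slide suggests.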