SlideShare uma empresa Scribd logo
1 de 22
What should a flora/fauna/mycota
of the future be able to do for me?
William Ulate
BHL Technical Director
Global BHL Coordinator
Berlin, Germany
May 21, 2013
Global…
New Partners and Geographies
Dear Sir / Madam Can i just
congratulate you on an
absolutely brilliant online
resource. I am compiling a
report on an invasive
hydromedusae and could not
believe the ease and efficiency
of this web page which
genuinely saved me weeks of
my life
Research that previously
took months now takes
only a few hours
La plus grande
#bibliotheque #botanique &
#zoologique online The
largest online botanical &
zoological #library #BHL
The freeing of knowledge
may lead to new
discoveries and changes
in the way the natural
world is perceived
22.00
40.00
84.86
94.6
105.85
9.2
16.4
31.8
35.4
38.9
-
20
40
60
80
100
120
Oct-08 Oct-09 Oct-10 Oct-11 Oct-12
Pages (Millions) and Volumes (in Thousands)
included in BHL
Volumes (K)
Pages (M)
More Online Content
Global Replication & Serving
Replicated Data Center Portal Application
For me a future Flora/Fauna/Mycota should…
learn from (the errors of) the past...
I may want to find new things later…
Taxon Names
BEFORE
Name Instances 101,591,803 101,288,804
Unique Names 7,498,554 7,464,924
Verified Names 1,905,507 1,902,803
EOL Names 63,130,350 62,963,582
EOL Pages 13,579,868 13,532,684
AFTER
Name Instances 151,222,182 150,066,425
Unique Names 29,246,382 29,091,767
Verified Names 10,153,165 10,109,540
EOL Names 87,791,695 87,135,089
EOL Pages 15,466,713 15,342,867
Scientific Name Extraction
• TaxonFinder algorithm in production since
2008
– More than 100 million candidate name strings
– More than 1.5 million unique, verified names
– Available through UI, APIs, Data Exports & Internet
Archive
• New collaboration with Global Names
– Improved algorithm, better precision & recall
– More data with TaxonFinder and Neti Neti!
For me a future Flora/Fauna/Mycota should…
allow me to provide and harvest
marked up content with names
of people & organizations, places, taxa,
specimens, illustrations, coordinates,
citations, tables of context and indexes.
*E.xvi�c�piteI von c. cXx.WptdvonfnrWmn
bu�fbe;bcn.5 am cix bIa � S &3rn~ 41X
a�m cv(f b1air�'o�et ert oiensr �; �',
:�hlrfc�c wa ff�4am.diug bist a
6aiw~s ff oJrJtwt nof bL4ecImt& blfafra mem
b t wag `wr 4 cn wiu 4 e8t5m.ed bvUratflb ck
wuo, ma144'*4I bttE5rmbebt =rt3'kn am4ra
tif vrmr Waff C * t6rmnli an `tn�ciblatGteaM
w ?ffoaifrn w4wmeu nu weib e , wpiteI
voE5teiri ct c ober gtUcr cit cm` 91 cLi biar J '
>bSciatl�Oiff ;Bruet wacfttc n qmcx b1a bl:
bt5c lttmtt bb9 lkr w.llr#e iti ncn xoa ff cu :r
trtuft *e t � B Rn "� trv W1Rt' ?Cm c blas
waIwutr Ober �ci ti 1V Ces ' wt
gbtiemwwajfu tpctt, afferain 9 c: b�titbfof
�r f eran m rs bra wlg auig4;f aer�m *mc vrt
blatcabtfm wfru an'deg~m rt blas Iaum
bwWt� run f ncmai b14ianf tJobrrfan
ebrut4net vnber Brwt Ober awawi*m.crriii
btafwfm uww c on$ 'it ttu wttkc 5,10 $ m~C
fca trc* cx u W�e�&mcyfbq4 Mabtt mmw
rc a iiu bc Jcn ncI.end.*, blat s. a u:�rprd3
rw4ftf wm c ii,+ ttCC tn wa frr9fr orfab fcfbt
enb c optiti bt -r9 ceDa ttDcn i34M sn Sem i
Images
Crowdsource Markup
Display text Species Profile Model category
General/summary TaxonBiology
Geographic range Distribution
Habitat Habitat
Food sources and feeding behavior TrophicStrategy
Physical description (general) Description
Physical description (detailed morphology) DiagnosticDescription
For me a future Flora/Fauna/Mycota should…
be digital, openly and freely accessible,
marked up and mark-able by users,
linked and registered in
a Common Framework that allows
for gradual crowd-sourced incremental
semantic enrichment with proper attribution
> 390,000 views
in 10 months
> 1200 sets
> 60,000+ images
For me a future Flora/Fauna/Mycota should…
be easy to integrate with other knowledge,
allow versioning and track changes,
hold and show conflicting opinions
Independent of format, mobile enabled &
be continually growing.
Thank you
William Ulate
Global BHL Project Manager / Technical Director
Missouri Botanical Garden
william.ulate@mobot.org
Skype: william_ulate_r

Mais conteúdo relacionado

Mais de William Ulate

Unlocking knowledge in biodiversity legacy literature through automatic seman...
Unlocking knowledge in biodiversity legacy literature through automatic seman...Unlocking knowledge in biodiversity legacy literature through automatic seman...
Unlocking knowledge in biodiversity legacy literature through automatic seman...
William Ulate
 
Purposeful Gaming and BHL
Purposeful Gaming and BHLPurposeful Gaming and BHL
Purposeful Gaming and BHL
William Ulate
 
Bibliographic References in BHL
Bibliographic References in BHLBibliographic References in BHL
Bibliographic References in BHL
William Ulate
 
The Biodiversity Heritage Library: an Open Global Resource of Literature for ...
The Biodiversity Heritage Library: an Open Global Resource of Literature for ...The Biodiversity Heritage Library: an Open Global Resource of Literature for ...
The Biodiversity Heritage Library: an Open Global Resource of Literature for ...
William Ulate
 

Mais de William Ulate (18)

Text Mining Biodiversity 20160127
Text Mining Biodiversity 20160127Text Mining Biodiversity 20160127
Text Mining Biodiversity 20160127
 
BHL Tech Status Update Tech Director W.Ulate 2015.12.11
BHL Tech Status Update Tech Director W.Ulate 2015.12.11BHL Tech Status Update Tech Director W.Ulate 2015.12.11
BHL Tech Status Update Tech Director W.Ulate 2015.12.11
 
Unlocking knowledge in biodiversity legacy literature through automatic seman...
Unlocking knowledge in biodiversity legacy literature through automatic seman...Unlocking knowledge in biodiversity legacy literature through automatic seman...
Unlocking knowledge in biodiversity legacy literature through automatic seman...
 
Engaging the Citizen Scientist in Content Enhancement for BHL
Engaging the Citizen Scientist in Content Enhancement for BHLEngaging the Citizen Scientist in Content Enhancement for BHL
Engaging the Citizen Scientist in Content Enhancement for BHL
 
Digitalización de Literatura de Biodiversidad: an overview of the BHL for CON...
Digitalización de Literatura de Biodiversidad: an overview of the BHL for CON...Digitalización de Literatura de Biodiversidad: an overview of the BHL for CON...
Digitalización de Literatura de Biodiversidad: an overview of the BHL for CON...
 
BHL Technical Director's Report, Mar. 2014
BHL Technical Director's Report, Mar. 2014BHL Technical Director's Report, Mar. 2014
BHL Technical Director's Report, Mar. 2014
 
BHL Markup Efforts and Plans
BHL Markup Efforts and PlansBHL Markup Efforts and Plans
BHL Markup Efforts and Plans
 
Purposeful Gaming and BHL
Purposeful Gaming and BHLPurposeful Gaming and BHL
Purposeful Gaming and BHL
 
Fourth Global BHL Meeting - Technical Update
Fourth Global BHL Meeting - Technical UpdateFourth Global BHL Meeting - Technical Update
Fourth Global BHL Meeting - Technical Update
 
Bibliographic References in BHL
Bibliographic References in BHLBibliographic References in BHL
Bibliographic References in BHL
 
BHL Technical Update (May 2013)
BHL Technical Update (May 2013)BHL Technical Update (May 2013)
BHL Technical Update (May 2013)
 
Global BHL Update May 2013
Global BHL Update May 2013Global BHL Update May 2013
Global BHL Update May 2013
 
The BHL way to content
The BHL way to contentThe BHL way to content
The BHL way to content
 
TDWG 2012 Poster for Art of Life project
TDWG 2012 Poster for Art of Life projectTDWG 2012 Poster for Art of Life project
TDWG 2012 Poster for Art of Life project
 
BHL Technical Projects Updates
BHL Technical Projects UpdatesBHL Technical Projects Updates
BHL Technical Projects Updates
 
The Biodiversity Heritage Library: an Open Global Resource of Literature for ...
The Biodiversity Heritage Library: an Open Global Resource of Literature for ...The Biodiversity Heritage Library: an Open Global Resource of Literature for ...
The Biodiversity Heritage Library: an Open Global Resource of Literature for ...
 
BHL: Toward a Global, Sustainable Resource
BHL: Toward a Global, Sustainable ResourceBHL: Toward a Global, Sustainable Resource
BHL: Toward a Global, Sustainable Resource
 
Global BHL Meeting Action Items
Global BHL Meeting Action ItemsGlobal BHL Meeting Action Items
Global BHL Meeting Action Items
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 

A new flora fauna mycota should...

  • 1. What should a flora/fauna/mycota of the future be able to do for me? William Ulate BHL Technical Director Global BHL Coordinator Berlin, Germany May 21, 2013
  • 3. New Partners and Geographies
  • 4. Dear Sir / Madam Can i just congratulate you on an absolutely brilliant online resource. I am compiling a report on an invasive hydromedusae and could not believe the ease and efficiency of this web page which genuinely saved me weeks of my life Research that previously took months now takes only a few hours La plus grande #bibliotheque #botanique & #zoologique online The largest online botanical & zoological #library #BHL The freeing of knowledge may lead to new discoveries and changes in the way the natural world is perceived
  • 5. 22.00 40.00 84.86 94.6 105.85 9.2 16.4 31.8 35.4 38.9 - 20 40 60 80 100 120 Oct-08 Oct-09 Oct-10 Oct-11 Oct-12 Pages (Millions) and Volumes (in Thousands) included in BHL Volumes (K) Pages (M) More Online Content
  • 6. Global Replication & Serving Replicated Data Center Portal Application
  • 7. For me a future Flora/Fauna/Mycota should… learn from (the errors of) the past...
  • 8. I may want to find new things later…
  • 9. Taxon Names BEFORE Name Instances 101,591,803 101,288,804 Unique Names 7,498,554 7,464,924 Verified Names 1,905,507 1,902,803 EOL Names 63,130,350 62,963,582 EOL Pages 13,579,868 13,532,684 AFTER Name Instances 151,222,182 150,066,425 Unique Names 29,246,382 29,091,767 Verified Names 10,153,165 10,109,540 EOL Names 87,791,695 87,135,089 EOL Pages 15,466,713 15,342,867
  • 10. Scientific Name Extraction • TaxonFinder algorithm in production since 2008 – More than 100 million candidate name strings – More than 1.5 million unique, verified names – Available through UI, APIs, Data Exports & Internet Archive • New collaboration with Global Names – Improved algorithm, better precision & recall – More data with TaxonFinder and Neti Neti!
  • 11. For me a future Flora/Fauna/Mycota should… allow me to provide and harvest marked up content with names of people & organizations, places, taxa, specimens, illustrations, coordinates, citations, tables of context and indexes.
  • 12. *E.xvi�c�piteI von c. cXx.WptdvonfnrWmn bu�fbe;bcn.5 am cix bIa � S &3rn~ 41X a�m cv(f b1air�'o�et ert oiensr �; �', :�hlrfc�c wa ff�4am.diug bist a 6aiw~s ff oJrJtwt nof bL4ecImt& blfafra mem b t wag `wr 4 cn wiu 4 e8t5m.ed bvUratflb ck wuo, ma144'*4I bttE5rmbebt =rt3'kn am4ra tif vrmr Waff C * t6rmnli an `tn�ciblatGteaM w ?ffoaifrn w4wmeu nu weib e , wpiteI voE5teiri ct c ober gtUcr cit cm` 91 cLi biar J ' >bSciatl�Oiff ;Bruet wacfttc n qmcx b1a bl: bt5c lttmtt bb9 lkr w.llr#e iti ncn xoa ff cu :r trtuft *e t � B Rn "� trv W1Rt' ?Cm c blas waIwutr Ober �ci ti 1V Ces ' wt gbtiemwwajfu tpctt, afferain 9 c: b�titbfof �r f eran m rs bra wlg auig4;f aer�m *mc vrt blatcabtfm wfru an'deg~m rt blas Iaum bwWt� run f ncmai b14ianf tJobrrfan ebrut4net vnber Brwt Ober awawi*m.crriii btafwfm uww c on$ 'it ttu wttkc 5,10 $ m~C fca trc* cx u W�e�&mcyfbq4 Mabtt mmw rc a iiu bc Jcn ncI.end.*, blat s. a u:�rprd3 rw4ftf wm c ii,+ ttCC tn wa frr9fr orfab fcfbt enb c optiti bt -r9 ceDa ttDcn i34M sn Sem i
  • 14. Crowdsource Markup Display text Species Profile Model category General/summary TaxonBiology Geographic range Distribution Habitat Habitat Food sources and feeding behavior TrophicStrategy Physical description (general) Description Physical description (detailed morphology) DiagnosticDescription
  • 15. For me a future Flora/Fauna/Mycota should… be digital, openly and freely accessible, marked up and mark-able by users, linked and registered in a Common Framework that allows for gradual crowd-sourced incremental semantic enrichment with proper attribution
  • 16.
  • 17. > 390,000 views in 10 months > 1200 sets > 60,000+ images
  • 18.
  • 19.
  • 20.
  • 21. For me a future Flora/Fauna/Mycota should… be easy to integrate with other knowledge, allow versioning and track changes, hold and show conflicting opinions Independent of format, mobile enabled & be continually growing.
  • 22. Thank you William Ulate Global BHL Project Manager / Technical Director Missouri Botanical Garden william.ulate@mobot.org Skype: william_ulate_r

Notas do Editor

  1. For the meeting on Wednesday on legacy literature, we would like to ask you to give a brief (5-10min) outline of what your plans are with BHL, and especially your move into content. This would be helpful for a more informed following discussion.
  2. ExtensiveAiming for a critical mass of biodiversity literatureGlobalOriginating in the US and UK, BHL now has nodes in Europe, China, Australia, Brazil, Egypt, and AfricaOpen Data is freely available for viewing, downloading, and re-use
  3. On legacy literature, what your plans are with BHL, and especially your move into content?
  4. Mention Neti Neti
  5. You can see from this slide that accuracy goes way down when processing older blackletter-type typefaces.
  6. On legacy literature, what your plans are with BHL, and especially your move into content?GrowthMore Global ContentTaxon NamesArticle MetadataMicrocitations and COiNSAPIZoobankOCR improvements through GamingCrowdsource MarkupWFO?
  7. Natural history illustrations from the Biodiversity Heritage Library seem to leap across boundaries while being catalogued, emerging simultaneously as history, science and art. As historic documents, they paint a vibrant picture of the first time European scientists and explorers encountered exotic plants and animals in the 17th and 18th centuries, drawn by some of the finest illustrators of the world.   Also, as biodiversity records, they provide valuable documentation of when, where, and who first observed a species, and some of them are our only surviving representations of extinct species.  Finally, as aesthetic elements, they communicate human emotions and other values toward nature by exemplifying the mimesis in art and providing a vivid expression of human creativity and imagination.This year, the Missouri Botanical Garden received a grant from the National Endowment for the Humanities (NEH) to support a project called The Art of Life: Data Mining and Crowdsourcing the Identification and Description of Natural History Illustrations from the Biodiversity Heritage Library (BHL).
  8. The authors have worked on the development of an effective metadata schema for such natural history illustrations, but instead of developing yet another schema from scratch, they have identified existing schemas that meet the needs of the project and integrated a solution that combines the best in biodiversity informatics and image curation standards and best practices. This schema needs to support three main objectives:  (1) to enable the discovery, description and use of the identified images by artists, biologists, humanities scholars, and educators;  (2) to make BHL’s metadata and images available to other platforms; and  (3) to import crowdsourced metadata generated in other platforms back into BHL..A preliminary schema version will be presented to the TDWG community, explaining how we addressed metadata challenges specific to biodiversity data, in order to obtain feedback on the final version.