Mobile phone apps monitoring biodiversity/Biodiversity indices
Semantic data mining of literature
1. Semantic data mining of literature David (Dauvit) King The Open University [email_address] Workpackage 7 Biodiversity literature access and data mining ViBRANT Virtual Biodiversity
2.
3.
4.
5.
6.
Editor's Notes
Leading the way 40 years ago -now 200,000+ students many mature, also CPD NLP in our own group, Also experts in semantic web, ie KMi And through the BBC close involvement with popular science on radio and TV, most relevant to this audience is another OU + NHM collaboration: iSpot
We process text Extract key words and concepts Format into XML for export Not scanning service Not an OCR service
Data mining to look for patterns Patterns might be patterns of erros, eg BCA ae ligature Context resolve problems like Homo -> Homa Validate and populate with existing resources, so our approach is sustainable after ViBRANT completes
Scratchpads in the first instance But because we are using a modular approach and delivering the tools as web services they could be accessed from any other biodiversity resource
As you can see our work package is BLAND not ViBRANT So back to David Morse for the discussion and your questions