Semantic data mining of literature
David (Dauvit) King, The Open University [email_address]
Workpackage 7: Biodiversity literature access and data mining
ViBRANT Virtual Biodiversity
Who we are & what we do
What we will do in ViBRANT
How we are doing it
Who are our users & how will they engage?

Editor's Notes

  1. Leading the way 40 years ago; now 200,000+ students, many of them mature, plus CPD. NLP in our own group; also experts in the semantic web, i.e. KMi. Through the BBC, close involvement with popular science on radio and TV. Most relevant to this audience is another OU + NHM collaboration: iSpot.
  2. We process text: extract key words and concepts, then format them into XML for export. We are not a scanning service and not an OCR service.
  3. Data mining to look for patterns. These might be patterns of errors, e.g. the BCA "ae" ligature. Context helps resolve problems like Homo -> Homa. We validate and populate with existing resources, so our approach is sustainable after ViBRANT completes.
  4. Scratchpads in the first instance. But because we are using a modular approach and delivering the tools as web services, they could be accessed from any other biodiversity resource.
  5. As you can see, our work package is BLAND, not ViBRANT. So back to David Morse for the discussion and your questions.
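
As a rough illustration of notes 2 and 3, the sketch below finds candidate species names in text, corrects a misread genus such as "Homa" against a reference list, and exports the result as XML. It is only a sketch under stated assumptions: the reference list `KNOWN_GENERA`, the regular expression, the XML element names, and the use of Python's `difflib` for fuzzy matching are all hypothetical stand-ins, not the tools or schema used in the work package.

```python
import difflib
import re
import xml.etree.ElementTree as ET

# Hypothetical reference list; a real system would validate against
# an existing name resource rather than a hard-coded list.
KNOWN_GENERA = ["Homo", "Pan", "Gorilla"]

# Naive candidate pattern: a capitalised word followed by a lowercase word.
BINOMIAL = re.compile(r"\b([A-Z][a-z]+) ([a-z]{3,})\b")


def correct_genus(genus, known=KNOWN_GENERA, cutoff=0.7):
    """Return the closest known genus, or None if nothing is close enough."""
    matches = difflib.get_close_matches(genus, known, n=1, cutoff=cutoff)
    return matches[0] if matches else None


def extract_names(text):
    """Return (verbatim, validated) pairs for candidates whose genus validates."""
    found = []
    for match in BINOMIAL.finditer(text):
        genus, epithet = match.groups()
        corrected = correct_genus(genus)
        if corrected is not None:
            found.append((match.group(0), f"{corrected} {epithet}"))
    return found


def to_xml(names):
    """Serialise the pairs as XML, keeping the verbatim text as an attribute."""
    root = ET.Element("names")
    for verbatim, validated in names:
        element = ET.SubElement(root, "name", verbatim=verbatim)
        element.text = validated
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    sample = "The skull of Homa sapiens was found near Pan troglodytes remains."
    print(to_xml(extract_names(sample)))
```

Keeping the verbatim string alongside the validated name means a correction can be audited later. A pattern this simple will also produce false matches, so any real pipeline would need the contextual checks the notes mention.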