Triples for the People (Scientists): Liberating biological knowledge with the Semantic Web
Ottawa/Chicago Semantic Web Meetup : 23-11-09
Michel Dumontier, Ph.D.
Associate Professor of Bioinformatics, Carleton University
Department of Biology, School of Computer Science, Institute of Biochemistry, Ottawa Institute of Systems Biology, Ottawa-Carleton Institute of Biomedical Engineering
Web-based Knowledge Discovery: a very painful process. Carole Goble (ISWC 2005)
With current web search engines… it takes a lot of digging to get answers
Portals provide structured information and give better results
We need to expose the deep web. Surface web: 167 terabytes. Deep web: 91,000 terabytes (545-to-one).
Data silos – not made for sharing
We want to simultaneously query the 1000+ biological databases
How do we integrate these resources?
The Semantic Web is a web of knowledge. It is about standards for publishing, sharing and querying knowledge drawn from diverse sources. It enables the answering of sophisticated questions.
A growing web of linked data
Bio2RDF provides a framework to link data networks together
Resource Description Framework (RDF)
Allows one to talk about anything. Uniform Resource Identifiers (URIs) can be used as entity names:
http://bio2rdf.org/uniprot:P05067 is a name for Amyloid precursor protein
http://bio2rdf.org/omim:104300 is a name for Alzheimer disease
Short forms: uniprot:P05067, omim:104300
Resource Description Framework (RDF)
Allows one to express statements. An RDF statement consists of:
Subject: resource identified by a URI
Predicate: resource identified by a URI
Object: resource or literal
Example: uniprot:P05067 is a Protein
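A minimal Python sketch of the statement structure described above. The `Triple` type and the class URI `http://example.org/Protein` are illustrative assumptions, not part of Bio2RDF; only the subject URI and the standard `rdf:type` predicate are taken as given.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One RDF statement: subject and predicate are URIs; the object is a URI or a literal."""
    subject: str
    predicate: str
    obj: str


# rdf:type is the standard predicate behind the informal "is a".
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# "uniprot:P05067 is a Protein". The class URI is a hypothetical placeholder.
statement = Triple(
    subject="http://bio2rdf.org/uniprot:P05067",
    predicate=RDF_TYPE,
    obj="http://example.org/Protein",
)

print(statement.subject, statement.predicate, statement.obj)
```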
Multi-Source Data Integration depends on consistent naming
[Figure: statements about uniprot:P05067 from UniProt (is a Protein, has name), Gene Ontology (located in Membrane) and iRefIndex (interacts with) combine into a unified view]
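Why consistent naming matters can be sketched in a few lines of Python. Each source is modeled here as a set of (subject, predicate, object) tuples, which is an assumption made for illustration; the interaction partner `ex:some_partner` is a hypothetical placeholder.

```python
# Each source is a set of (subject, predicate, object) tuples.
APP = "uniprot:P05067"

uniprot = {
    (APP, "is a", "Protein"),
    (APP, "has name", "Amyloid precursor protein"),
}
gene_ontology = {(APP, "located in", "Membrane")}
irefindex = {(APP, "interacts with", "ex:some_partner")}  # hypothetical partner

# Because all three sources use the same name for the protein,
# integration reduces to a plain set union.
unified = uniprot | gene_ontology | irefindex


def describe(graph, subject):
    """Return every (predicate, object) pair known about a subject."""
    return sorted((p, o) for s, p, o in graph if s == subject)


for predicate, obj in describe(unified, APP):
    print(APP, predicate, obj)
```

If one source had used a different identifier for the same protein, the union would still succeed but `describe` would silently miss that source's statements.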
Building statements creates knowledge
[Figure: uniprot:P05067 (label: Amyloid precursor protein) is a Protein; omim:104300 (label: Alzheimer Disease) is a Disease; uniprot:P05067 is involved in omim:104300]
RDF has multiple representations
RDF/XML
<?xml version="1.0"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:u="http://purl.uniprot.org/uniprot/">
    <rdf:Description rdf:about="http://purl.uniprot.org/uniprot/Q16665">
        <rdf:type rdf:resource="http://purl.uniprot.org/uniprot/Protein"/>
    </rdf:Description>
</rdf:RDF>
RDF/N3
@prefix u: <http://purl.uniprot.org/uniprot/> .
u:Q16665 a u:Protein .
Bio2RDF’s RDFized data fits together
Bio2RDF serves up over 4 billion triples of linked biological data
Something you can look up or search for, with rich descriptions
Bio2RDF: Raw Data!
SPARQL is the new cool kid on the block
SQL SPARQL
Bio2RDF’s describe service uses SPARQL
http://bio2rdf.org/ns:id
CONSTRUCT {
    ?s ?p ?o .
}
WHERE {
    ?s ?p ?o .
    FILTER(?s = <http://bio2rdf.org/ns:id>) .
}
Sent to http://ns.bio2rdf.org/sparql?query=...
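A hedged sketch of how a client might assemble such a request in Python. The helper names `describe_query` and `describe_url` are hypothetical, and reading `ns` in the slide's URL pattern as the dataset namespace is an assumption; the code only builds the URL and does not contact any server.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def describe_query(namespace: str, identifier: str) -> str:
    """Build the CONSTRUCT query shown on the slide for one ns:id resource."""
    uri = f"http://bio2rdf.org/{namespace}:{identifier}"
    return (
        "CONSTRUCT { ?s ?p ?o . } "
        "WHERE { ?s ?p ?o . "
        f"FILTER(?s = <{uri}>) . }}"
    )


def describe_url(namespace: str, identifier: str) -> str:
    """URL-encode the query for the namespace's endpoint (assumed host pattern)."""
    endpoint = f"http://{namespace}.bio2rdf.org/sparql"
    return endpoint + "?" + urlencode({"query": describe_query(namespace, identifier)})


url = describe_url("uniprot", "P05067")
print(url)

# Round-trip check: decoding the URL yields the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]
print(decoded)
```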
Bio2RDF’s search service uses SPARQL
http://bio2rdf.org/search/hexokinase
[Figure labels: bio2rdf.org, kegg, gene, uniprot]
Yay for data! But how do we discover more than what was in the data?
Ontology as Strategy
Reasoning and Inference through Semantics
[Figure: a knowledge base combines a fact (uniprot:P05067 is a Protein) with an ontology (Protein is a Molecule), so that uniprot:P05067 is a Molecule can be inferred]
Logic Based Ontologies Are Conceptual Lego
A simple ontology: Animals
[Figure: class diagram with the classes Living Thing, Plant, Grass, Tree, Animal, Herbivore, Carnivore, Person, Cow, Body Part, Arm and Leg, linked by "eats" and "has part" relations]
The Web Ontology Language (OWL) has explicit semantics. It can therefore be used to capture knowledge in a machine-understandable way.
Key idea: Subsumption
Subsumption is the primary axis (relationship) in OWL
Superclass/subclass relationship, “is a”
All members of a subclass must be members of its superclasses
owl:Thing is the superclass of all classes
Example with Molecule and Protein:
Protein is a subclass of Molecule
Molecule is a superclass of Protein
Molecule subsumes Protein
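The subsumption rule above can be illustrated with a small Python sketch. It assumes a single-parent hierarchy stored in a dictionary, which is a simplification (OWL classes may have several superclasses), and the function names are hypothetical.

```python
# Direct subclass axioms, child -> parent.
# Protein, Molecule and owl:Thing come from the slide; the dict layout is an assumption.
SUBCLASS_OF = {"Protein": "Molecule", "Molecule": "owl:Thing"}


def superclasses(cls):
    """Walk up the hierarchy and return every class that subsumes cls."""
    found = []
    while cls in SUBCLASS_OF:
        cls = SUBCLASS_OF[cls]
        found.append(cls)
    return found


def inferred_types(asserted_type):
    """A member of a subclass is also a member of all its superclasses."""
    return [asserted_type] + superclasses(asserted_type)


# uniprot:P05067 is asserted to be a Protein; the rest is inferred.
print(inferred_types("Protein"))
```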
Key Idea: Disjointness
Stating that two classes are disjoint means that something cannot be both a Protein and DNA. This can help us find errors.
[Figure: non-overlapping classes DNA and Protein; each dot is an individual]
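A minimal sketch of how a disjointness axiom helps find errors. The data layout and the record `ex:suspect_record` are hypothetical; a real OWL reasoner would report such a case as an inconsistency rather than return a list.

```python
from itertools import combinations

# Pairs of classes declared disjoint (Protein and DNA, as on the slide).
DISJOINT = {frozenset({"Protein", "DNA"})}


def find_conflicts(types_by_individual):
    """Return (individual, class_a, class_b) for every violated disjointness axiom."""
    conflicts = []
    for individual, types in sorted(types_by_individual.items()):
        for a, b in combinations(sorted(types), 2):
            if frozenset({a, b}) in DISJOINT:
                conflicts.append((individual, a, b))
    return conflicts


data = {
    "uniprot:P05067": {"Protein"},
    "ex:suspect_record": {"Protein", "DNA"},  # hypothetical curation error
}

print(find_conflicts(data))
```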
Key Idea: Class equivalence
By stating the necessary and sufficient conditions we discover new knowledge
Transcription Factor: “A protein that binds to DNA and regulates gene expression.”
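The following Python sketch shows the idea of classifying individuals by necessary and sufficient conditions. All identifiers (`ex:p1`, `ex:p2`, the predicate strings) are hypothetical, and the matching is a closed-world simplification: an OWL reasoner works under the open-world assumption and uses existential restrictions rather than literal tuple matching.

```python
# Facts about individuals as (subject, predicate, object) tuples (hypothetical data).
facts = {
    ("ex:p1", "is a", "Protein"),
    ("ex:p1", "binds to", "DNA"),
    ("ex:p1", "regulates", "gene expression"),
    ("ex:p2", "is a", "Protein"),
    ("ex:p2", "binds to", "DNA"),
}

# Necessary and sufficient conditions for Transcription Factor,
# following the definition quoted above.
TF_CONDITIONS = {
    ("is a", "Protein"),
    ("binds to", "DNA"),
    ("regulates", "gene expression"),
}


def transcription_factors(triples):
    """Return individuals whose known properties satisfy every condition."""
    by_subject = {}
    for s, p, o in triples:
        by_subject.setdefault(s, set()).add((p, o))
    return sorted(s for s, props in by_subject.items() if TF_CONDITIONS <= props)


# ex:p1 was never asserted to be a transcription factor; it is classified as one.
print(transcription_factors(facts))
```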
Many ontologies required (Barry Smith)
Over 170 bio-ontologies
We’re interested in Personalized Medicine
The ability to offer:
The Right Drug
To The Right Patient
For The Right Disease
At The Right Time
With The Right Dosage
Genetic and metabolic data will allow drugs to be tailored to patient subgroups
PharmGKB is an emerging resource for pharmacogenomics
+ Role of genes, gene variants, drugs
+ Pharmacokinetics
+ Pharmacodynamics
+ Clinical outcomes
+ Links to publications
- Natural language descriptions
- Variant details in publications
Pharmacogenomics of Depression Knowledge Base (PDKB)
Contains statements from 11/40 relevant publications involving 45 genes / gene variants, 57 drugs annotated with 19 classes of antidepressants, 45 drug treatments, 47 drug-gene interactions, 29 clinical outcomes, 10 drug-induced side-effects, and 8 gene-disease interactions.
Querying the PDKB
Tools: Protégé 4, FaCT++, DL Query Tab
Query: nortriptyline-induced side effects for ABCB1 gene variants
'side effect' that
    'is realized by' some
        ('drug treatment' that
            'involves' some 'nortriptyline' and
            'involves' some ('variant of' some 'ABCB1'))
Result: postural hypotension is a side effect of nortriptyline treatment of depression for individuals presenting the 3435C>T genotype
Web-based Knowledge Discovery
Some of our queries need services
The Holy Grail: Align the promoters of all serine/threonine kinases involved exclusively in the regulation of cell sorting during wound healing in blood vessels.
Retrieve and align 2000 nt 5' from every serine/threonine kinase in Mus musculus expressed exclusively in the tunica [I | M | A] whose expression increases 5X or more within 5 hours of wounding but is not activated during the normal development of blood vessels, and is <40% homologous in the active site to kinases known to be involved in cell-cycle regulation in any other species.
Semantic Automated Discovery and Integration (SADI)
http://sadiframework.org
Mark Wilkinson, UBC
Michel Dumontier, Carleton University
Christopher Baker, UNB
As OWL Axioms
HomologousGeneImage is owl:equivalentTo {
    Gene Q hasImage Image P
    Gene Q hasSequence Sequence Q
    Gene R hasSequence Sequence R
    Sequence Q similarTo Sequence R
    Gene R = “my gene of interest”
}
Build a knowledge base from a series of questions
You want to join the knowledge web
Share your data
Bridge your data with others in semantic communities
Time-sensitive or frequently updated data is one way to encourage more visits.

💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
 
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
 

Triples for the People (Scientists):  Liberating biological knowledge with the Semantic Web

  • 1. Triples for the People (Scientists): Liberating biological knowledge with the Semantic Web. Ottawa/Chicago Semantic Web Meetup : 23-11-09. Michel Dumontier, Ph.D., Associate Professor of Bioinformatics, Carleton University: Department of Biology, School of Computer Science, Institute of Biochemistry, Ottawa Institute of Systems Biology, Ottawa-Carleton Institute of Biomedical Engineering
  • 2. Web-based Knowledge Discovery: a very painful process (Carole Goble, ISWC 2005)
  • 3. With current web search engines… it takes a lot of digging to get answers
  • 4. Portals provide structured information and give better results
  • 5. We need to expose the deep web. Surface web: 167 terabytes. Deep web: 91,000 terabytes (545-to-one).
  • 6. Data silos – not made for sharing
  • 7. We want to simultaneously query the 1000+ biological databases
  • 8. How do we integrate these resources?
  • 9. The Semantic Web is a web of knowledge. It is about standards for publishing, sharing and querying knowledge drawn from diverse sources. It enables the answering of sophisticated questions.
  • 10. A growing web of linked data
  • 11. Bio2RDF provides a framework to link data networks together
  • 12. Resource Description Framework (RDF): allows one to talk about anything. Uniform Resource Identifiers (URIs) can be used as entity names: http://bio2rdf.org/uniprot:P05067 is a name for Amyloid precursor protein; http://bio2rdf.org/omim:104300 is a name for Alzheimer disease. Short forms: uniprot:P05067, omim:104300.
  • 13. Resource Description Framework (RDF): allows one to express statements. An RDF statement consists of:
  • 15. Object: resource or literal. Example: uniprot:P05067 is a Protein
  • 16. Multi-Source Data Integration depends on consistent naming. [Figure: statements about uniprot:P05067 from UniProt (is a Protein, has name), Gene Ontology (located in Membrane) and iRefIndex (interacts with) combine into a unified view.]
  • 17. Building statements creates knowledge. [Figure: uniprot:P05067 (label: Amyloid precursor protein) is a Protein; omim:104300 (label: Alzheimer Disease) is a Disease; uniprot:P05067 is involved in omim:104300.]
  • 18. RDF has multiple representations. RDF/XML: <?xml version="1.0"?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:u="http://purl.uniprot.org/uniprot/"> <rdf:Description rdf:about="http://purl.uniprot.org/uniprot/Q16665"> <rdf:type rdf:resource="http://purl.uniprot.org/uniprot/Protein"/> </rdf:Description> </rdf:RDF> RDF/N3: @prefix u: <http://purl.uniprot.org/uniprot/> . u:Q16665 a u:Protein .
  • 19. Bio2RDF’s RDFized data fits together
  • 20. Bio2RDF serves up over 4 billion triples of linked biological data
  • 21. Something you can look up or search for, with rich descriptions
  • 22. Bio2RDF: Raw Data!
  • 23. SPARQL is the new cool kid on the block (SQL, SPARQL)
  • 24. Bio2RDF’s describe service uses SPARQL: CONSTRUCT { ?s ?p ?o . } WHERE { ?s ?p ?o . FILTER(?s = <http://bio2rdf.org/ns:id>). } Sent to http://ns.bio2rdf.org/sparql?query=... (for the resource http://bio2rdf.org/ns:id)
  • 25. Bio2RDF’s search service uses SPARQL: http://bio2rdf.org/search/hexokinase [Figure labels: bio2rdf.org, kegg, gene, uniprot]
  • 26. Yay for data! But how do we discover more than what was in the data?
  • 27. Ontology as Strategy
  • 28. Reasoning and Inference through Semantics. [Figure: a knowledge base combining the fact "uniprot:P05067 is a Protein" with the ontology statement "Protein is a Molecule", so that "uniprot:P05067 is a Molecule" follows.]
  • 29. Logic-Based Ontologies Are Conceptual Lego
  • 30. A simple ontology: Animals. [Figure: classes Living Thing, Body Part, Plant, Animal, Grass, Tree, Arm, Leg, Herbivore, Carnivore, Person, Cow; relations "eats" and "has part".]
  • 31. The Web Ontology Language (OWL) has explicit semantics. It can therefore be used to capture knowledge in a machine-understandable way.
  • 32.
  • 33. Protein is a subclass of Molecule
  • 34. Molecule is a superclass of Protein
  • 35. Molecule subsumes Protein
  • 36. Key Idea: Disjointness. Stating that two classes (here DNA and Protein) are disjoint means that something cannot be both a Protein and DNA. This can help us find errors. [Figure legend: marker = individual.]
  • 37. Key Idea: Class equivalence. By stating the necessary and sufficient conditions we discover new knowledge. Transcription Factor: “A protein that binds to DNA and regulates gene expression.”
  • 38. Many ontologies required (Barry Smith)
  • 39. Over 170 bio-ontologies
  • 40. We’re interested in Personalized Medicine: the ability to offer The Right Drug To The Right Patient For The Right Disease At The Right Time With The Right Dosage. Genetic and metabolic data will allow drugs to be tailored to patient subgroups.
  • 41. PharmGKB is an emerging resource for pharmacogenomics. (+) Role of genes, gene variants, drugs; (+) pharmacokinetics; (+) pharmacodynamics; (+) clinical outcomes; (+) links to publications. (-) Natural language descriptions; (-) variant details in publications.
  • 42. The Pharmacogenomics of Depression knowledge base contains statements from 11 of 40 relevant publications involving 45 genes / gene variants, 57 drugs annotated with 19 classes of antidepressants, 45 drug treatments, 47 drug-gene interactions, 29 clinical outcomes, 10 drug-induced side-effects, and 8 gene-disease interactions.
  • 43. Querying the PDKB (Protégé 4, FaCT++, DL Query Tab). Query, nortriptyline-induced side effects for ABCB1 gene variants: ‘side effect’ that ‘is realized by’ some (‘drug treatment’ that ‘involves’ some ‘nortriptyline’ and ‘involves’ some (‘variant of’ some ‘ABCB1’)). Result: postural hypotension is a side effect of nortriptyline treatment of depression for individuals presenting the 3435C>T genotype.
  • 44. Web-based Knowledge Discovery: some of our queries need services
  • 45. The Holy Grail: Align the promoters of all serine threonine kinases involved exclusively in the regulation of cell sorting during wound healing in blood vessels. Retrieve and align 2000 nt 5' from every serine/threonine kinase in Mus musculus expressed exclusively in the tunica [I | M | A] whose expression increases 5X or more within 5 hours of wounding but is not activated during the normal development of blood vessels, and is <40% homologous in the active site to kinases known to be involved in cell-cycle regulation in any other species.
  • 46.
  • 47. Semantic Automated Discovery and Integration (SADI): http://sadiframework.org. Mark Wilkinson, UBC; Michel Dumontier, Carleton University; Christopher Baker, UNB
  • 48. As OWL Axioms: HomologousGeneImage is owl:equivalentTo { Gene Q hasImage image P; Gene Q hasSequence Sequence Q; Gene R hasSequence Sequence R; Sequence Q similarTo Sequence R; Gene R = “my gene of interest” }
  • 49. Build a knowledge base from a series of questions
  • 50. You want to join the knowledge web
  • 51. Share your data
  • 52. Bridge your data with others in semantic communities
  • 53. Time-sensitive or frequently updated data is one way to encourage more visits.
  • 54.
  • 55.
  • 56. The Knowledge Web: merging data & services; reasoning & question answering; persistent (RESTful); trust & security. Data consumers must be able to rely upon your data to use it as a foundation for their own applications.
  • 57. Join the knowledge web.
  • 58. dumontierlab.com, michel_dumontier@carleton.ca
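
The ideas in the transcript (statements as triples, describing a resource by its subject, inferring new facts from subclass axioms, and using disjoint classes to find errors) can be illustrated with a small sketch. This is not how Bio2RDF or an OWL reasoner is implemented: it assumes triples are plain Python tuples, and the function names `describe`, `infer_types` and `disjoint_violations` are hypothetical helpers introduced only for illustration. The facts are the ones shown on the slides.

```python
# Minimal sketch: triples as (subject, predicate, object) tuples.
# Assumption: plain strings stand in for URIs; no RDF library is used.
IS_A = "is a"
SUBCLASS_OF = "subclass of"

# Facts from the slides, plus the axiom "Protein is a subclass of Molecule".
triples = {
    ("uniprot:P05067", "label", "Amyloid precursor protein"),
    ("uniprot:P05067", IS_A, "Protein"),
    ("uniprot:P05067", "is involved in", "omim:104300"),
    ("omim:104300", "label", "Alzheimer Disease"),
    ("omim:104300", IS_A, "Disease"),
    ("Protein", SUBCLASS_OF, "Molecule"),
}


def describe(graph, subject):
    """Return every triple with the given subject (the same idea as the
    CONSTRUCT query with FILTER(?s = <...>) in the transcript)."""
    return {t for t in graph if t[0] == subject}


def infer_types(graph):
    """Add 'is a' triples implied by subclass axioms until nothing new appears."""
    inferred = set(graph)
    changed = True
    while changed:
        changed = False
        for s, p, o in list(inferred):
            if p != IS_A:
                continue
            for child, q, parent in list(inferred):
                if q == SUBCLASS_OF and child == o:
                    new = (s, IS_A, parent)
                    if new not in inferred:
                        inferred.add(new)
                        changed = True
    return inferred


def disjoint_violations(graph, class_a, class_b):
    """Individuals typed as both of two classes declared disjoint (candidate errors)."""
    a = {s for s, p, o in graph if p == IS_A and o == class_a}
    b = {s for s, p, o in graph if p == IS_A and o == class_b}
    return a & b


kb = infer_types(triples)
print(("uniprot:P05067", IS_A, "Molecule") in kb)
print(disjoint_violations(kb, "Protein", "DNA"))
```

In this sketch the inferred statement that uniprot:P05067 is a Molecule appears only after `infer_types` runs, which mirrors the point that an ontology lets us discover more than what was stated in the data. A real system would use URIs, a triple store with a SPARQL endpoint, and an OWL reasoner rather than nested loops.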