SlideShare uma empresa Scribd logo
1 de 36
The Past, Present and Future of
Knowledge in Biology
Robert Stevens
BioHealth Informatics Group
The University of Manchester
Manchester
United Kingdom
Robert.Stevens@manchester.ac.uk
Overview
• A look at the state of play
• For what are we using ontologies?
• What do we count as knowledge?
• Doing so much more with knowledge
• Stopping text being a dead end
Text and Ontologies: The Terrible
Twins of Knowledge in Biology
Robert Stevens
BioHealth Informatics Group
The University of Manchester
Manchester
United Kingdom
Robert.Stevens@manchester.ac.uk
Biology now has lots of facts
Genome
Proteome
Transcriptome
Interactome
Metabolome
PHENOME
Lots of catalogues
Data are only as Good as their
Metadata
• There is a lot of biology out there…
• How these entities are described in our data varies
• We don’t even agree on what entities there are to
describe in our data
• This makes analysing data hard: You have to know
what your data represent
• …, but also how the entities described in your data
relate to each other
• We need to describe our data – their metadata
Creating Woods, not Trees
Genes
Proteins
Pathways
Interactions
Literature
Complex
Machines
Virtual
Organism
…. from biological facts, we make a system that is some model of a real organism
Timeline
There’s a Lot of it About
Searching for “ontology” in five
year chunks on the ACM digital
portal
Searching for “ontology” in five
year chunks on the ACM digital
portal
Searching for “ontology” in five
year chunks on PubMed
Searching for “ontology” in five
year chunks on PubMed
It’s all Gruber’s Fault
• “In the context of knowledge sharing, the term ontology means a
specification of a conceptualisation. That is, an ontology is a description
(like a formal specification of a program) of the concepts and relationships
that can exist for an agent or a community of agents. This definition is
consistent with the usage of ontology as set-of-concept-definitions, but
more general. And it is certainly a different sense of the word than its use
in philosophy.” DOI:10.1006/knac.1993.1008 DOI:10.1006/ijhc.1995.1081
Angels on the head of a pin
Everything with a Blob and Line is
called an Ontology
• Wide acceptance criteria
• Narrow evaluation criteria
• Different sort of knowledge for different
situations
• Different styles of representation; some
scruffy and some formal
• Representing knowledge in biology is more
than ontologies
• We could stop calling them ontologies
RDF
graph
RDF
graph
Database
schema
Database
schema
ThesaurusThesaurus
OWL
Ontology
OWL
Ontology
Formal
ontology
Formal
ontology
SKOS
vocabulary
SKOS
vocabulary
Uses of Ontologies
Knowing What We’ve got is so
Useful
• We could computationally handle lots of data,
but we couldn’t do so with what we know
about those data
• Ontologies so far mainly used for a common
tongue so that we can compare
• … and it works!
• Still getting lots of mileage from ontology
annotation
• …, But there is so much more
GENERIC GENE ONTOLOGY (GO)
TERM FINDERS000003093
MXR1
YPL250C
S000004294
SAM3
YIR017C
S000003152
MMP1
MET1
Expressed
Genes
P-value score
http://go.princeton.edu/cgi-bin/GOTermFinder
Classifying a Mouse
Individual Description:
Stops wriggling after 3 sec
Has 3 cm tail
Mass 10g
10 days old (since birth)
Strain C57Bl/6
Class Description:
Class:DepressedMouse
EquivalentTo:Mouse that
(wriggles For <=30 OR swims for <=45)
DataTransformation
Short tailed mouse
Class:ShortTailedMouse
EquivalentTo:Mouse that
hasPart EXACTLY 1 (Tail that hasAssay SOME
(LengthAssay that hasValue SOME int[<= 20) and hasUnit
SOME Millimetre))
SubClassOf: Mouse that
hasPart some (Tail that hasQuality SOME Short)
• We can recognise an instance of short-
tailed mouse, but we also know that it has
the quality “short”
• Even when the fact isn’t asserted
•First bullet
Classifying Proteins
>uniprot|Q15262|PTPK_HUMAN Receptor-type protein-tyrosine phosphatase kappa precursor
(EC 3.1.3.48) (R-PTP-kappa).
MDTTAAAALPAFVALLLLSPWPLLGSAQGQFSAGGCTFDDGPGACDYHQDLYDDFEWVHV
SAQEPHYLPPEMPQGSYMIVDSSDHDPGEKARLQLPTMKENDTHCIDFSYLLYSQKGLNP
GTLNILVRVNKGPLANPIWNVTGFTGRDWLRAELAVSSFWPNEYQVIFEAEVSGGRSGYI
AIDDIQVLSYPCDKSPHFLRLGDVEVNAGQNATFQCIATGRDAVHNKLWLQRRNGEDIPV………..
InterPro
Instance Store
Reasoner
Translate
Codify
OWL’s Automated Reasoners
• Demonstrably useful in:
– Building ontologies
– Querying ontologies
– Can automatically annotate
– Have made “discoveries”
But there is more than OWL’s reasoning
Separation of Knowledge and
Software
• We realised a long time ago that we needed
to separate
• We only recently called this knowledge
component ontology
• We don’t really need to see the ontology
• We certainly shouldn’t show people OWL; it
“scares the horses”
• Ontology for software not humans (L. Hunter)
The Ontology cottage Industry
• We’ve industrialised data production
• We’ve (to some extent) industrialised data
analysis
• We’ve not really moved away from hand-
crafted, “whittled” ontologies
Can we have Mass Editing of
Ontologies?
• Probably not;
• Computer scientists in love with synchronous
editing
• …, but not really necessary (see CSCW)
• Mass gathering of Knowledge
Mass Gathering of Knowledge and the
Application of Patterns or a
metamodel
http://rightfield.org.uk http://www.e-lico.eu/populous
There’s so much more to Ontology
Building than editing Axioms
• Gathering knowledge
• Adding labels
• Adding other human orientated content
• Reviewing, checking suggesting
• Deploying, using, creating “views”
• Ontology comprehension
There’s More to KR than OWL
• OWL and its automated reasoners are useful
• But there is so much more to KR than
ontologies and OWL
• Higher order reasoning
• Rules
• Other sorts of reasoning
Generating natural language
Class: HeLa
SubClassOf: Cell,
bearer_of some 'cervical carcinoma’,
derives_from some 'Homo sapiens’,
derives_from some cervix,
derives_from some 'epithelial cell'
OWL
HeLa is a cell line. A hela is all of the
following: something that is bearer of
a cervical carcinoma, something that
derives from a homo sapiens,
something that derives from an
epithelial cell, and something that
derives from a cervix.
Generated natural language
Experimental Factor Ontology (EFO)
http://www.ebi.ac.uk/efo
Ontology as book
Title: Experimental Factor Ontology
Table of Contents
Chapter 1. Cell line
Chapter 2. Cell type
Chapter 3. Chemical Compound
Chapter 4. Organism
HeLa is a cell line. A hela is all of the
following: something that is bearer of
a cervical carcinoma, something
that derives from a homo sapiens,
something that derives from an
epithelial cell, and something that
derives from a cervix.
entry
DataData
Types of Knowledge
Biologist’s headBiologist’s head
PapersPapers
DatabasesDatabases
OntologiesOntologies
??????
It’s not Just “Things”
• Experiments produce data about things
• Proteins, genes, chemicals, reactions,
diseases, size, shape, speed, ….
• As well as this knowledge we have knowledge
of how it was done
• OBI is still the “things” to do with production
• We still need the methods of by which these
“things” were deployed
• The protocol
Knowledge about an
experiment
Workflow
Run
Workflow
Run
Workflow
ProvenanceProvenance
Organisationa
l
Organisationa
l
Results and
Interpretation
Results and
Interpretation
Workflows are knowledge about
methods
Get genes in region
Get pathways that
contain genes
Merge data into single files
Get gene descriptions
Get pathway descriptions
Cross-reference ids
Methods:
1. A QTL (region of chromosome) is entered into the
workflow, specified as base pairs. These base pairs
are subsequently used to identify, in the Ensembl
database, any genes that lie within this region.
2. Any genes found within this region are subsequently
annotated with Entrez and UniProt identifiers.
3. The Entrez and UniProt identifiers are then passed
to a KEGG id conversion Web Service, to cross-
reference the input ids to KEGG gene identifiers.
This enables gene descriptions and biological
pathway data to be returned from KEGG.
4. Each KEGG gene id is then used in a search for
KEGG pathways. Any pathways found to contain the
gene are returned as KEGG pathway ids.
5. Both KEGG gene and pathway ids are then sent to
individual services, provided by KEGG, which
provide a description of the gene and pathway.
6. The outputs of the workflow are then combined into
single flat files, which can be saved locally and used
to identify novel pathways and genes within the QTL
region.
myExperiment
http://www.myexperiment.org
Research Objects
MethodMethod
DataData
IntroductionIntroduction
ConclusionsConclusions
ResultsResults
Human Written
WorkflowWorkflow
Generated Text
Semantically
annotated
Model, View, Controller
Annotated
Data
Annotated
Data
ControllerController
ProjectionProjection
Text
Tables Graphs
Steve Pettifer
http://utopia.cs.man.ac.uk/
What Next?
• Ontologies are not the only fruit
• We could stop calling them ontologies
• We need to produce “ontologies” faster
• We need to do more interesting things with our knowledge
• We need to make them pervade our tools
• We need then to be “agile”
• Open to other forms of KR and other forms of reasoning
• Adding to data automatically
• Generating our descriptions of data
Acknowledgements
• Simon Jupp for the slides
• Alan rector and Carole goble
• sysMoDB for rightField (Katy Wolstencroft, Stuart Owen, Matt
Horridge)
• Populous – Simon Jupp
• SWAT – richard Power, Sandra Williams and Allan third at the
OU
• EFO – James Malone and Helen Parkinson
• Steve Pettifer for the Utopia and MVC
• Paul Fisher and the Taverna team
• The myExperiment team at Southampton and Manchester

Mais conteúdo relacionado

Mais procurados

Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...Chris Mungall
 
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...geraintduck
 
Representation of kidney structures in Uberon
Representation of kidney structures in UberonRepresentation of kidney structures in Uberon
Representation of kidney structures in UberonChris Mungall
 
Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biologyrobertstevens65
 
The Language of the Gene Ontology
The Language of the Gene OntologyThe Language of the Gene Ontology
The Language of the Gene Ontologyrobertstevens65
 
Gene Ontology WormBase Workshop International Worm Meeting 2015
Gene Ontology WormBase Workshop International Worm Meeting 2015Gene Ontology WormBase Workshop International Worm Meeting 2015
Gene Ontology WormBase Workshop International Worm Meeting 2015raymond91105
 
Biomedical literature mining
Biomedical literature miningBiomedical literature mining
Biomedical literature miningLars Juhl Jensen
 
The Neuroscience Information Framework:The present and future of neuroscience...
The Neuroscience Information Framework:The present and future of neuroscience...The Neuroscience Information Framework:The present and future of neuroscience...
The Neuroscience Information Framework:The present and future of neuroscience...Neuroscience Information Framework
 
Cartic Ramakrishnan's dissertation defense
Cartic Ramakrishnan's dissertation defenseCartic Ramakrishnan's dissertation defense
Cartic Ramakrishnan's dissertation defenseCartic Ramakrishnan
 
Mungall keynote-biocurator-2017
Mungall keynote-biocurator-2017Mungall keynote-biocurator-2017
Mungall keynote-biocurator-2017Chris Mungall
 
ContentMine Presentation for WHO Health Data Seminar
ContentMine Presentation for WHO Health Data SeminarContentMine Presentation for WHO Health Data Seminar
ContentMine Presentation for WHO Health Data SeminarJenny Molloy
 
Representing and reasoning with biological knowledge
Representing and reasoning with biological knowledgeRepresenting and reasoning with biological knowledge
Representing and reasoning with biological knowledgeBenjamin Good
 
Experiences with logic programming in bioinformatics
Experiences with logic programming in bioinformaticsExperiences with logic programming in bioinformatics
Experiences with logic programming in bioinformaticsChris Mungall
 
Bioinformatics MiRON
Bioinformatics MiRONBioinformatics MiRON
Bioinformatics MiRONPrabin Shakya
 
2015 aem-grs-keynote
2015 aem-grs-keynote2015 aem-grs-keynote
2015 aem-grs-keynotec.titus.brown
 

Mais procurados (20)

Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...
 
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
 
Representation of kidney structures in Uberon
Representation of kidney structures in UberonRepresentation of kidney structures in Uberon
Representation of kidney structures in Uberon
 
Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biology
 
The Language of the Gene Ontology
The Language of the Gene OntologyThe Language of the Gene Ontology
The Language of the Gene Ontology
 
Genome scale-data as networks
Genome scale-data as networksGenome scale-data as networks
Genome scale-data as networks
 
Gene Ontology WormBase Workshop International Worm Meeting 2015
Gene Ontology WormBase Workshop International Worm Meeting 2015Gene Ontology WormBase Workshop International Worm Meeting 2015
Gene Ontology WormBase Workshop International Worm Meeting 2015
 
Biomedical literature mining
Biomedical literature miningBiomedical literature mining
Biomedical literature mining
 
The Neuroscience Information Framework:The present and future of neuroscience...
The Neuroscience Information Framework:The present and future of neuroscience...The Neuroscience Information Framework:The present and future of neuroscience...
The Neuroscience Information Framework:The present and future of neuroscience...
 
AS application
AS applicationAS application
AS application
 
Biological networks
Biological networksBiological networks
Biological networks
 
Cartic Ramakrishnan's dissertation defense
Cartic Ramakrishnan's dissertation defenseCartic Ramakrishnan's dissertation defense
Cartic Ramakrishnan's dissertation defense
 
Mungall keynote-biocurator-2017
Mungall keynote-biocurator-2017Mungall keynote-biocurator-2017
Mungall keynote-biocurator-2017
 
Chibucos annot go_final
Chibucos annot go_finalChibucos annot go_final
Chibucos annot go_final
 
ContentMine Presentation for WHO Health Data Seminar
ContentMine Presentation for WHO Health Data SeminarContentMine Presentation for WHO Health Data Seminar
ContentMine Presentation for WHO Health Data Seminar
 
Representing and reasoning with biological knowledge
Representing and reasoning with biological knowledgeRepresenting and reasoning with biological knowledge
Representing and reasoning with biological knowledge
 
Experiences with logic programming in bioinformatics
Experiences with logic programming in bioinformaticsExperiences with logic programming in bioinformatics
Experiences with logic programming in bioinformatics
 
Bioinformatics MiRON
Bioinformatics MiRONBioinformatics MiRON
Bioinformatics MiRON
 
Protein-protein interaction networks
Protein-protein interaction networksProtein-protein interaction networks
Protein-protein interaction networks
 
2015 aem-grs-keynote
2015 aem-grs-keynote2015 aem-grs-keynote
2015 aem-grs-keynote
 

Destaque

Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003robertstevens65
 
Anti Ageing Treatment with stem cells
Anti Ageing Treatment with stem cellsAnti Ageing Treatment with stem cells
Anti Ageing Treatment with stem cellsPrabhu Mishra
 
plant stem cells
 plant stem cells plant stem cells
plant stem cellsGowthami R
 
Stem Cells and Tissue Engineering: past, present and future
Stem Cells and Tissue Engineering: past, present and futureStem Cells and Tissue Engineering: past, present and future
Stem Cells and Tissue Engineering: past, present and futureAna Rita Ramos
 
Basics of Tissue engineering
Basics of Tissue engineeringBasics of Tissue engineering
Basics of Tissue engineeringMahmoud Hamda
 
Biological Presentation On Stem Cells
Biological Presentation On Stem CellsBiological Presentation On Stem Cells
Biological Presentation On Stem Cellsguestce7e377
 

Destaque (11)

Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003
 
Anti Ageing Treatment with stem cells
Anti Ageing Treatment with stem cellsAnti Ageing Treatment with stem cells
Anti Ageing Treatment with stem cells
 
plant stem cells
 plant stem cells plant stem cells
plant stem cells
 
Tissue Engineering
Tissue EngineeringTissue Engineering
Tissue Engineering
 
TISSUE ENGINEERING
TISSUE ENGINEERINGTISSUE ENGINEERING
TISSUE ENGINEERING
 
Stem Cells and Tissue Engineering: past, present and future
Stem Cells and Tissue Engineering: past, present and futureStem Cells and Tissue Engineering: past, present and future
Stem Cells and Tissue Engineering: past, present and future
 
Basics of Tissue engineering
Basics of Tissue engineeringBasics of Tissue engineering
Basics of Tissue engineering
 
Biological Presentation On Stem Cells
Biological Presentation On Stem CellsBiological Presentation On Stem Cells
Biological Presentation On Stem Cells
 
Stem cell
Stem cellStem cell
Stem cell
 
Stem cells
Stem cellsStem cells
Stem cells
 
Stem cell therapy
Stem cell therapyStem cell therapy
Stem cell therapy
 

Semelhante a The Past, Present and Future of Knowledge in Biology

Ontology - and Reloaded and Revolutions
Ontology - and Reloaded and RevolutionsOntology - and Reloaded and Revolutions
Ontology - and Reloaded and RevolutionsJie Bao
 
Connecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnected Data World
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesConnected Data World
 
Plant Pathogen Genome Data: My Life In Sequences
Plant Pathogen Genome Data: My Life In SequencesPlant Pathogen Genome Data: My Life In Sequences
Plant Pathogen Genome Data: My Life In SequencesLeighton Pritchard
 
Drug-discovery knowledge integration and analysis using OWL and reasoners
Drug-discovery knowledge integration and analysis using OWL and reasonersDrug-discovery knowledge integration and analysis using OWL and reasoners
Drug-discovery knowledge integration and analysis using OWL and reasonersSamuel Croset
 
Collaborative Ontology building: So much more than authoring an Ontology
Collaborative Ontology building: So much more than authoring an Ontology Collaborative Ontology building: So much more than authoring an Ontology
Collaborative Ontology building: So much more than authoring an Ontology robertstevens65
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Iddo
 
The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960mare34
 
The Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsThe Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsDuncan Hull
 
Synthetic Biology and Data-Driven Synthetic Biology for Personalized Medicine...
Synthetic Biology and Data-Driven Synthetic Biology for Personalized Medicine...Synthetic Biology and Data-Driven Synthetic Biology for Personalized Medicine...
Synthetic Biology and Data-Driven Synthetic Biology for Personalized Medicine...RussellHanson
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchEuropean Bioinformatics Institute
 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyMelanie Courtot
 
Apollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 IntroductionApollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 IntroductionMonica Munoz-Torres
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIPistoia Alliance
 
Building a repository of biomedical ontologies with Neo4j
Building a repository of biomedical ontologies with Neo4jBuilding a repository of biomedical ontologies with Neo4j
Building a repository of biomedical ontologies with Neo4jSimon Jupp
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsmikaelhuss
 
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge GraphsCombining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge GraphsPaul Groth
 

Semelhante a The Past, Present and Future of Knowledge in Biology (20)

Ontology - and Reloaded and Revolutions
Ontology - and Reloaded and RevolutionsOntology - and Reloaded and Revolutions
Ontology - and Reloaded and Revolutions
 
Connecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics Institute
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
 
Plant Pathogen Genome Data: My Life In Sequences
Plant Pathogen Genome Data: My Life In SequencesPlant Pathogen Genome Data: My Life In Sequences
Plant Pathogen Genome Data: My Life In Sequences
 
Drug-discovery knowledge integration and analysis using OWL and reasoners
Drug-discovery knowledge integration and analysis using OWL and reasonersDrug-discovery knowledge integration and analysis using OWL and reasoners
Drug-discovery knowledge integration and analysis using OWL and reasoners
 
Collaborative Ontology building: So much more than authoring an Ontology
Collaborative Ontology building: So much more than authoring an Ontology Collaborative Ontology building: So much more than authoring an Ontology
Collaborative Ontology building: So much more than authoring an Ontology
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013
 
Ontology at Manchester
Ontology at ManchesterOntology at Manchester
Ontology at Manchester
 
The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960
 
The Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsThe Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of Bioinformatics
 
Synthetic Biology and Data-Driven Synthetic Biology for Personalized Medicine...
Synthetic Biology and Data-Driven Synthetic Biology for Personalized Medicine...Synthetic Biology and Data-Driven Synthetic Biology for Personalized Medicine...
Synthetic Biology and Data-Driven Synthetic Biology for Personalized Medicine...
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontology
 
Apollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 IntroductionApollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 Introduction
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBI
 
Building a repository of biomedical ontologies with Neo4j
Building a repository of biomedical ontologies with Neo4jBuilding a repository of biomedical ontologies with Neo4j
Building a repository of biomedical ontologies with Neo4j
 
2014 bangkok-talk
2014 bangkok-talk2014 bangkok-talk
2014 bangkok-talk
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
 
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge GraphsCombining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
 
Use of data
Use of dataUse of data
Use of data
 

Mais de robertstevens65

Ontologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientOntologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientrobertstevens65
 
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016robertstevens65
 
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...robertstevens65
 
The Quality of Method Reporting in
The Quality of Method Reporting in The Quality of Method Reporting in
The Quality of Method Reporting in robertstevens65
 
The Semantics of Genomic Analysis
The Semantics of  Genomic AnalysisThe Semantics of  Genomic Analysis
The Semantics of Genomic Analysisrobertstevens65
 
Issues and activities in authoring ontologies
Issues and activities in authoring ontologiesIssues and activities in authoring ontologies
Issues and activities in authoring ontologiesrobertstevens65
 
The state of the nation for ontology development
The state of the nation for ontology developmentThe state of the nation for ontology development
The state of the nation for ontology developmentrobertstevens65
 
Properties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family HistoryProperties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family Historyrobertstevens65
 
Choosing and Building Knowledge Artefacts
Choosing and Building Knowledge ArtefactsChoosing and Building Knowledge Artefacts
Choosing and Building Knowledge Artefactsrobertstevens65
 
Populous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from TemplatesPopulous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from Templatesrobertstevens65
 
Keeping ontology development Agile
Keeping ontology development AgileKeeping ontology development Agile
Keeping ontology development Agilerobertstevens65
 
Lessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesLessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesrobertstevens65
 
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)robertstevens65
 
A Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a RoseA Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a Roserobertstevens65
 
Working with big biomedical ontologies
Working with big biomedical ontologiesWorking with big biomedical ontologies
Working with big biomedical ontologiesrobertstevens65
 
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...robertstevens65
 
Ontology learning from text
Ontology learning from textOntology learning from text
Ontology learning from textrobertstevens65
 
Knowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based DisciplineKnowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based Disciplinerobertstevens65
 
A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2robertstevens65
 

Mais de robertstevens65 (20)

Ontologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientOntologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficient
 
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
 
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
 
The Quality of Method Reporting in
The Quality of Method Reporting in The Quality of Method Reporting in
The Quality of Method Reporting in
 
The Semantics of Genomic Analysis
The Semantics of  Genomic AnalysisThe Semantics of  Genomic Analysis
The Semantics of Genomic Analysis
 
Issues and activities in authoring ontologies
Issues and activities in authoring ontologiesIssues and activities in authoring ontologies
Issues and activities in authoring ontologies
 
The state of the nation for ontology development
The state of the nation for ontology developmentThe state of the nation for ontology development
The state of the nation for ontology development
 
Properties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family HistoryProperties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family History
 
Choosing and Building Knowledge Artefacts
Choosing and Building Knowledge ArtefactsChoosing and Building Knowledge Artefacts
Choosing and Building Knowledge Artefacts
 
Populous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from TemplatesPopulous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from Templates
 
Keeping ontology development Agile
Keeping ontology development AgileKeeping ontology development Agile
Keeping ontology development Agile
 
Spreadsheets to OWL
Spreadsheets to OWLSpreadsheets to OWL
Spreadsheets to OWL
 
Lessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesLessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologies
 
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
 
A Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a RoseA Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a Rose
 
Working with big biomedical ontologies
Working with big biomedical ontologiesWorking with big biomedical ontologies
Working with big biomedical ontologies
 
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
 
Ontology learning from text
Ontology learning from textOntology learning from text
Ontology learning from text
 
Knowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based DisciplineKnowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based Discipline
 
A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2
 

Último

Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 

Último (20)

Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 

The Past, Present and Future of Knowledge in Biology

  • 1. The Past, Present and Future of Knowledge in Biology Robert Stevens BioHealth Informatics Group The University of Manchester Manchester United Kingdom Robert.Stevens@manchester.ac.uk
  • 2. Overview • A look at the state of play • For what are we using ontologies? • What do we count as knowledge? • Doing so much more with knowledge • Stopping text being a dead end
  • 3. Text and Ontologies: The Terrible Twins of Knowledge in Biology Robert Stevens BioHealth Informatics Group The University of Manchester Manchester United Kingdom Robert.Stevens@manchester.ac.uk
  • 4. Biology now has lots of facts
  • 6. Data are only as Good as their Metadata • There is a lot of biology out there… • How these entities are described in our data varies • We don’t even agree on what entities there are to describe in our data • This makes analysing data hard: You have to know what your data represent • …, but also how the entities described in your data relate to each other • We need to describe our data – their metadata
  • 7. Creating Woods, not Trees Genes Proteins Pathways Interactions Literature Complex Machines Virtual Organism …. from biological facts, we make a system that is some model of a real organism
  • 9. There’s a Lot of it About Searching for “ontology” in five year chunks on the ACM digital portal Searching for “ontology” in five year chunks on the ACM digital portal Searching for “ontology” in five year chunks on PubMed Searching for “ontology” in five year chunks on PubMed
  • 10. It’s all Gruber’s Fault • “In the context of knowledge sharing, the term ontology means a specification of a conceptualisation. That is, an ontology is a description (like a formal specification of a program) of the concepts and relationships that can exist for an agent or a community of agents. This definition is consistent with the usage of ontology as set-of-concept-definitions, but more general. And it is certainly a different sense of the word than its use in philosophy.” DOI:10.1006/knac.1993.1008 DOI:10.1006/ijhc.1995.1081
  • 11. Angels on the head of a pin
  • 12. Everything with a Blob and Line is called an Ontology • Wide acceptance criteria • Narrow evaluation criteria • Different sort of knowledge for different situations • Different styles of representation; some scruffy and some formal • Representing knowledge in biology is more than ontologies • We could stop calling them ontologies RDF graph RDF graph Database schema Database schema ThesaurusThesaurus OWL Ontology OWL Ontology Formal ontology Formal ontology SKOS vocabulary SKOS vocabulary
  • 14. Knowing What We’ve got is so Useful • We could computationally handle lots of data, but we couldn’t do so with what we know about those data • Ontologies so far mainly used for a common tongue so that we can compare • … and it works! • Still getting lots of mileage from ontology annotation • …, But there is so much more
  • 15. GENERIC GENE ONTOLOGY (GO) TERM FINDERS000003093 MXR1 YPL250C S000004294 SAM3 YIR017C S000003152 MMP1 MET1 Expressed Genes P-value score http://go.princeton.edu/cgi-bin/GOTermFinder
  • 16. Classifying a Mouse Individual Description: Stops wriggling after 3 sec Has 3 cm tail Mass 10g 10 days old (since birth) Strain C57Bl/6 Class Description: Class:DepressedMouse EquivalentTo:Mouse that (wriggles For <=30 OR swims for <=45) DataTransformation
  • 17. Short tailed mouse Class:ShortTailedMouse EquivalentTo:Mouse that hasPart EXACTLY 1 (Tail that hasAssay SOME (LengthAssay that hasValue SOME int[<= 20) and hasUnit SOME Millimetre)) SubClassOf: Mouse that hasPart some (Tail that hasQuality SOME Short) • We can recognise an instance of short- tailed mouse, but we also know that it has the quality “short” • Even when the fact isn’t asserted •First bullet
  • 18. Classifying Proteins >uniprot|Q15262|PTPK_HUMAN Receptor-type protein-tyrosine phosphatase kappa precursor (EC 3.1.3.48) (R-PTP-kappa). MDTTAAAALPAFVALLLLSPWPLLGSAQGQFSAGGCTFDDGPGACDYHQDLYDDFEWVHV SAQEPHYLPPEMPQGSYMIVDSSDHDPGEKARLQLPTMKENDTHCIDFSYLLYSQKGLNP GTLNILVRVNKGPLANPIWNVTGFTGRDWLRAELAVSSFWPNEYQVIFEAEVSGGRSGYI AIDDIQVLSYPCDKSPHFLRLGDVEVNAGQNATFQCIATGRDAVHNKLWLQRRNGEDIPV……….. InterPro Instance Store Reasoner Translate Codify
  • 19. OWL’s Automated Reasoners • Demonstrably useful in: – Building ontologies – Querying ontologies – Can automatically annotate – Have made “discoveries” But there is more than OWL’s reasoning
  • 20. Separation of Knowledge and Software • We realised a long time ago that we needed to separate • We only recently called this knowledge component ontology • We don’t really need to see the ontology • We certainly shouldn’t show people OWL; it “scares the horses” • Ontology for software not humans (L. Hunter)
  • 21. The Ontology cottage Industry • We’ve industrialised data production • We’ve (to some extent) industrialised data analysis • We’ve not really moved away from hand- crafted, “whittled” ontologies
  • 22. Can we have Mass Editing of Ontologies? • Probably not; • Computer scientists in love with synchronous editing • …, but not really necessary (see CSCW) • Mass gathering of Knowledge
  • 23. Mass Gathering of Knowledge and the Application of Patterns or a metamodel http://rightfield.org.uk http://www.e-lico.eu/populous
  • 24. There’s so much more to Ontology Building than editing Axioms • Gathering knowledge • Adding labels • Adding other human orientated content • Reviewing, checking suggesting • Deploying, using, creating “views” • Ontology comprehension
  • 25. There’s More to KR than OWL • OWL and its automated reasoners are useful • But there is so much more to KR than ontologies and OWL • Higher order reasoning • Rules • Other sorts of reasoning
  • 26. Generating natural language Class: HeLa SubClassOf: Cell, bearer_of some 'cervical carcinoma’, derives_from some 'Homo sapiens’, derives_from some cervix, derives_from some 'epithelial cell' OWL HeLa is a cell line. A hela is all of the following: something that is bearer of a cervical carcinoma, something that derives from a homo sapiens, something that derives from an epithelial cell, and something that derives from a cervix. Generated natural language Experimental Factor Ontology (EFO) http://www.ebi.ac.uk/efo
  • 27. Ontology as book Title: Experimental Factor Ontology Table of Contents Chapter 1. Cell line Chapter 2. Cell type Chapter 3. Chemical Compound Chapter 4. Organism HeLa is a cell line. A hela is all of the following: something that is bearer of a cervical carcinoma, something that derives from a homo sapiens, something that derives from an epithelial cell, and something that derives from a cervix. entry
  • 28. DataData Types of Knowledge Biologist’s headBiologist’s head PapersPapers DatabasesDatabases OntologiesOntologies ??????
  • 29. It’s not Just “Things” • Experiments produce data about things • Proteins, genes, chemicals, reactions, diseases, size, shape, speed, …. • As well as this knowledge we have knowledge of how it was done • OBI is still the “things” to do with production • We still need the methods of by which these “things” were deployed • The protocol
  • 31. Workflows are knowledge about methods Get genes in region Get pathways that contain genes Merge data into single files Get gene descriptions Get pathway descriptions Cross-reference ids Methods: 1. A QTL (region of chromosome) is entered into the workflow, specified as base pairs. These base pairs are subsequently used to identify, in the Ensembl database, any genes that lie within this region. 2. Any genes found within this region are subsequently annotated with Entrez and UniProt identifiers. 3. The Entrez and UniProt identifiers are then passed to a KEGG id conversion Web Service, to cross- reference the input ids to KEGG gene identifiers. This enables gene descriptions and biological pathway data to be returned from KEGG. 4. Each KEGG gene id is then used in a search for KEGG pathways. Any pathways found to contain the gene are returned as KEGG pathway ids. 5. Both KEGG gene and pathway ids are then sent to individual services, provided by KEGG, which provide a description of the gene and pathway. 6. The outputs of the workflow are then combined into single flat files, which can be saved locally and used to identify novel pathways and genes within the QTL region.
  • 35. What Next? • Ontologies are not the only fruit • We could stop calling them ontologies • We need to produce “ontologies” faster • We need to do more interesting things with our knowledge • We need to make them pervade our tools • We need then to be “agile” • Open to other forms of KR and other forms of reasoning • Adding to data automatically • Generating our descriptions of data
  • 36. Acknowledgements • Simon Jupp for the slides • Alan rector and Carole goble • sysMoDB for rightField (Katy Wolstencroft, Stuart Owen, Matt Horridge) • Populous – Simon Jupp • SWAT – richard Power, Sandra Williams and Allan third at the OU • EFO – James Malone and Helen Parkinson • Steve Pettifer for the Utopia and MVC • Paul Fisher and the Taverna team • The myExperiment team at Southampton and Manchester

Notas do Editor

  1. Slide Title: Literature Lots of books in a library
  2. Slide Title: Catalogues Stack of books listing: Genome Transcriptome Proteome Interactome Metabolome Phenome
  3. Slide Title Slide contains: Book on the left with a plus sign Black and white image, man sat at an old valve-style computer (i.e. manchester baby) Text saying: genes, proteins, interactions, pathways Mouse on the right Text below images says: (left) Literature (middle) complex machines (right) Organism (bottom) “…. from biological facts, we make a system that is some model of a real thing” - Robert Stevens – 2008
  4. All of which helps build better ontologies. But can we actually apply this computational amenability more Directly to biological knowledge. In this example, which is work by Katy Wolstencroft, we have codified Community knowledge about protein domains in phosphatases in OWL. We then take unknown protein sequences, Pass then through interpro and stick them into the instance store, which is basically a database and reasoner tied together Qualified Cardiniality!!!
  5. Slide Title: Literature Lots of books in a library