SlideShare uma empresa Scribd logo
1 de 34
Knowing what we’re
talking about
Robert Stevens
Bio-health Informatics Group
School of Computer Science
University of manchester
Oxford Road
Manchester
United Kingdom
M13 9PL
Robert.Stevens@manchester.ac.uk
We have an item of data
• 27
• 27 what?
• Units, with what is 27
associated?
• Even if I told you, would
we interpret what I said
in the same way?
27
• text
27mm
• text
tail of
27mm
Mouse tail of 27 mm
• … and we can carry on:
Mouse strain, where was
it raised, on what was it
fed, times, dates, etc.
etc.
• All this data is necessary
to interpret my original
number
• Even if that metadata
exists, we have to agree
on the things the
numbers describe
mouse
tail of
27mm
What is knowledge?
Heterogeneity is rife
• We agree on units (more or less)…
• We don’t agree on much else when it comes to
labels for the entities in our domain
• If we don’t know what we’re talking about….
• It’s difficult to interpret and exchange data and the
results from data
Categories and Category Labels
GO:0000368
U2-type nuclear mRNA 5' splice site recognition
spliceosomal E complex formation
spliceosomal E complex biosynthesis
spliceosomal CC complex formation
U2-type nuclear mRNA 5'-splice site recognition
The Ogden Triangle
“Roast Beef“
Concept
[Ogden, Richards, 1923]
• Humans require words (or at least symbols) to communicate efficiently. The
mapping of words to things is only indirectly possible. We do it by creating
concepts that refer to things.
• The relation between symbols and things has been described in the form of the
meaning triangle:
We need to know what we’re talking about…
• … if we don’t, our data are useless
• Ifg we are to interpret our data then we need to
know what entities it describes
• We need to share data and re-use it
• We need to find data; compare data; analyse data
• We need to know what we know….
Manchester Mercury
January 1st 1754 Executed 18
Found Dead 34
Frighted 2
Kill'd by falls and other accidents 55
Kill'd themselves 36
Murdered 3
Overlaid 40
Poisoned 1
Scalded 5
Smothered 1
Stabbed 1
Starved 7
Suffocated 5
Aged 1456
Consumption 3915
Convulsion 5977
Dropsy 794
Fevers 2292
Smallpox 774
Teeth 961
Bit by mad dogs 3
Broken Limbs 5
Bruised 5
Burnt 9
Drowned 86
Excessive Drinking 15
List of diseases &
casualties this year
19276 burials
15444 christenings
Deaths by centile
A World of Instances
• The world (of information) is made up of things and lots of them
• Instances, individuals, objects, tokens, particulars.
• The Earth is a kind of Planet
• Robert Stevens (NE 67 41 58 A) is a Person
• All the individual Alpha Haemoglobins in my many Instances of Red Blood Cell
• Each cell instance in my Body has copies of some 30,000 Genes
• A Word, language, idea, etc.
• This Table, those Chairs,
• Any Thing with “A”, “The”, “That”, etc. before it….
We Put things into
Categories
• All these instances hang about making our world
• Putting these things into categories is a fundamental part of
human cognition
• Psychologists study this as concept formation
• The same instances are put into a category
We have Labels for the
Categories and their
Instances
• We label categories with symbols: Words
• “Lion” is a category of big cat with big teeth
• Gene, Protein, Cell, Person, Hydrolase Activity, etc.
• …and, as we’ve already seen, each category can have many labels and any
particular label can refer to more than one category
• Semantic Heterogeneity
• “A lion” is an instance in that category
• Does the category “Lion” exist?
• Lions exist, but the category could just be a human way of talking about
lions
• … we like putting things into categories
A Controlled Vocabulary• A specified set of words and phrases for the categories
in which we place instances
• Natural language definitions for those words and
phrases
• A glossary defines, but doesn’t control
• The Uniprot keywords define and control
• Control is placed upon which labels are used to
represent the categories (concepts) we’ve used to
describe the instances in the world
• …, but there is nothing about how things in these
categories are related
Biopolymer
DNA
Enzyme
Nucleic acid
mRNA
Polypeptide
snRNA
tRNA
We also like to Relate Things
Together
• Categories have subcategories
• Instances in one category can be related
in some way to instances in another
• Can relate instances to each other in
many different ways
• Is-a, part-of, develops-from, etc.axes
• We can use these relationships to classify
categories
• Things in category A are part is
• If all instances in category A are also in
category B then As are kinds of Bs
Biopolymer
Nucleic Acid Polypeptide
Enzym
e
DNA RNA
tRNA mRNA smRNA
Categories and sub-
categories
biopolymer
polypeptide Nucleic acid
enzyme
DNA
RNA
Describing Category
Membership
• We can make conditions that any instance must fulfil in order to be a
member of a particular category
• A Phosphatase must have a phosphatase catalytic domain
• A Receptor must have a transmembrane domain
• A codon has three nucleotide residues
• A limb has part that is a joint
• A man has a Y chromosome and an X chromosome
• A woman has only an X chromosome
Relationships
• These conditions made from a property and a
successor relationship
• isPartOf, hasPart
• isDerivedFrom
• DevelopsFrom
• isHomologousTo
• …and many, many more
A Structured Controlled
Vocabulary
• Not only can we agree on the
labels we give categories
• Can also agree on how the
instances of categories are
related
• And agree on the labels we give
he relations
• Structure aids querying and
captures knowledge with greater
fidelity
Biopolymer
Nucleic Acid Polypeptide
Enzym
e
DNA RNA
tRNA mRNA smRNA
Gene
transcribedFrom
A Stronger Definition
• a set of logical axioms designed to account for the intended meaning
of a formal vocabulary used to describe a certain (conceptualisation of)
reality [described in an information system) [Guarino 1998]
• “conceptualisation of” inserted by me
• “Logical axioms” means a formal definition of meaning of terms in a
formal language
• Formal language—something a computer an reason with
• Use symbols to make inferences
• Symbols represent things and their relationships
• Making inferences about things computationally
So what is an ontology?
Catalog/
ID
Thesauri
Terms/
glossary
Informal
Is-a
Formal
Is-a
Formal
instance
Frames
(properties)
General
Logical
constraints
Value
restrictions
Disjointness,
Inverse, partof
Gene Ontology
Mouse Anatomy
EcoCyc
PharmGKB
TAMBIS
Arom
After Chris Welty et al
What does it all mean
anyway
• To interpret our data we need to know what it is we’re talking
about
• We need to decide the things that we’re talking about and
agree upon them
• We need to agree on how to recognise those entities
• We need to know how they are related to one another
• Ontologies are a mechanism for describing those entities
and their definitions
• There’s more to knowledge representation than ontologies…
All this knowledge needs
representing
• We want this knowledge in a computational form
• To make the knowledge available for software (and
humans)
• To help us develop and manage the (often) complex
artefacts
Building ontologies is hard (getting all those relationships in
the right place)
The Web Ontology Language (OWL) is a W3C
recommendation for ontologies on the Semantic Web and in
semantically enabled applications
A knowledge representation language with a strict semantics
that is amenable to autoamted reasoning
Web Ontology Language
(OWL)
• W3C recommendation for ontologies for the Semantic
Web
• OWL-DL mapped to a decidable fragment of first order
logic
• Classes, properties and instances
• Boolean operators, plus existential and universal
quantification
• Rich class expressions used in restriction on
properties – hasDomain some (ImnunoGlobinDomain
or FibronectinDomain)
What are we saying?
Person
WomanMan
is-ais-a
• Are all instances of Man instances of Person?
• Can an instance of Person be both a Man
and an instance of Woman?
• Can there be any more kinds of Person?
What are we saying?
• What kinds of class can fill “has chromosome”?
• How many “Y chromosome” are present?
• Does their have to be a “Y chromosome”?
• What properties are sufficient to be a Man and which are
simply necessary?
Y chromosomeMan has-chromosome
Y chromosomeMan
has-chromosome
X chromosomehas-chromosome
autosomehas-chromosome
1
1
44
OWL represents
classes of
instances
A
B
C
Necessity and Sufficiency
• An R2A phosphatase must have a fibronectin domain
• Having a fibronectin domain does not a phosphatase make
• Necessity -- what must a class instance have?
• Any protein that has a phosphatase catalytic domain is a
phosphatase enzyme
• All phosphatase enzymes have a catalytic domain
• Sufficiency – how is an instance recognised to be a member
of a class?
Uses of ontologies
Ontologies in software
Problems Ontologies in
Biology Try To Solve
• Provenance – where did it come from, who did it?
• Reproducibility – can I repeat and find results
reported?
• Sharing – can others understand your data?
• Integration – can I readily take multiple (thousands
of) data sets and use them without preparation?
• New knowledge – can we infer new knowledge as
a sum of current knowledge (computationally)?
The rise and rise of
ontologies
What are the prospects for
ontologies

Mais conteúdo relacionado

Semelhante a Knowing what we’re talking about

Working with big biomedical ontologies
Working with big biomedical ontologiesWorking with big biomedical ontologies
Working with big biomedical ontologiesrobertstevens65
 
CoE-WEBINAR-2_042117v3.pptx
CoE-WEBINAR-2_042117v3.pptxCoE-WEBINAR-2_042117v3.pptx
CoE-WEBINAR-2_042117v3.pptxVandana472475
 
How to Create a Golden Ontology
How to Create a Golden OntologyHow to Create a Golden Ontology
How to Create a Golden OntologyMike Bennett
 
Sp616 adult lexical processing for students
Sp616 adult lexical processing for studentsSp616 adult lexical processing for students
Sp616 adult lexical processing for studentsLynette Chan
 
Katrin Erk - 2017 - What do you know about an alligator when you know the com...
Katrin Erk - 2017 - What do you know about an alligator when you know the com...Katrin Erk - 2017 - What do you know about an alligator when you know the com...
Katrin Erk - 2017 - What do you know about an alligator when you know the com...Association for Computational Linguistics
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Iddo
 
chapter2 Know.representation.pptx
chapter2 Know.representation.pptxchapter2 Know.representation.pptx
chapter2 Know.representation.pptxwendifrawtadesse1
 
Classification/Science
Classification/ScienceClassification/Science
Classification/Sciencecbush4
 
Variation in speech tempo: Capt. Kirk, Mr. Spock, and all of us in between
Variation in speech tempo: Capt. Kirk, Mr. Spock, and all of us in betweenVariation in speech tempo: Capt. Kirk, Mr. Spock, and all of us in between
Variation in speech tempo: Capt. Kirk, Mr. Spock, and all of us in betweenTyler Schnoebelen
 
Drug-discovery knowledge integration and analysis using OWL and reasoners
Drug-discovery knowledge integration and analysis using OWL and reasonersDrug-discovery knowledge integration and analysis using OWL and reasoners
Drug-discovery knowledge integration and analysis using OWL and reasonersSamuel Croset
 
Ontology learning from text
Ontology learning from textOntology learning from text
Ontology learning from textrobertstevens65
 
ANSC3305 Fall 2012
ANSC3305 Fall 2012ANSC3305 Fall 2012
ANSC3305 Fall 2012Megan Kocher
 
Issues in Learning an Ontology from Text
Issues in Learning an Ontology from Text Issues in Learning an Ontology from Text
Issues in Learning an Ontology from Text robertstevens65
 
Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biologyrobertstevens65
 
NLP pipeline in machine translation
NLP pipeline in machine translationNLP pipeline in machine translation
NLP pipeline in machine translationMarcis Pinnis
 

Semelhante a Knowing what we’re talking about (20)

Working with big biomedical ontologies
Working with big biomedical ontologiesWorking with big biomedical ontologies
Working with big biomedical ontologies
 
1406.ch1.si.sp20
1406.ch1.si.sp201406.ch1.si.sp20
1406.ch1.si.sp20
 
CoE-WEBINAR-2_042117v3.pptx
CoE-WEBINAR-2_042117v3.pptxCoE-WEBINAR-2_042117v3.pptx
CoE-WEBINAR-2_042117v3.pptx
 
frames.pptx
frames.pptxframes.pptx
frames.pptx
 
Data Mining Dissertations and Adventures and Experiences in the World of Chem...
Data Mining Dissertations and Adventures and Experiences in the World of Chem...Data Mining Dissertations and Adventures and Experiences in the World of Chem...
Data Mining Dissertations and Adventures and Experiences in the World of Chem...
 
How to Create a Golden Ontology
How to Create a Golden OntologyHow to Create a Golden Ontology
How to Create a Golden Ontology
 
Sp616 adult lexical processing for students
Sp616 adult lexical processing for studentsSp616 adult lexical processing for students
Sp616 adult lexical processing for students
 
Katrin Erk - 2017 - What do you know about an alligator when you know the com...
Katrin Erk - 2017 - What do you know about an alligator when you know the com...Katrin Erk - 2017 - What do you know about an alligator when you know the com...
Katrin Erk - 2017 - What do you know about an alligator when you know the com...
 
Ontology at Manchester
Ontology at ManchesterOntology at Manchester
Ontology at Manchester
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013
 
chapter2 Know.representation.pptx
chapter2 Know.representation.pptxchapter2 Know.representation.pptx
chapter2 Know.representation.pptx
 
Classification/Science
Classification/ScienceClassification/Science
Classification/Science
 
Variation in speech tempo: Capt. Kirk, Mr. Spock, and all of us in between
Variation in speech tempo: Capt. Kirk, Mr. Spock, and all of us in betweenVariation in speech tempo: Capt. Kirk, Mr. Spock, and all of us in between
Variation in speech tempo: Capt. Kirk, Mr. Spock, and all of us in between
 
Drug-discovery knowledge integration and analysis using OWL and reasoners
Drug-discovery knowledge integration and analysis using OWL and reasonersDrug-discovery knowledge integration and analysis using OWL and reasoners
Drug-discovery knowledge integration and analysis using OWL and reasoners
 
Lecture 2 2nd sem PHILOSOPHY
Lecture 2 2nd sem PHILOSOPHYLecture 2 2nd sem PHILOSOPHY
Lecture 2 2nd sem PHILOSOPHY
 
Ontology learning from text
Ontology learning from textOntology learning from text
Ontology learning from text
 
ANSC3305 Fall 2012
ANSC3305 Fall 2012ANSC3305 Fall 2012
ANSC3305 Fall 2012
 
Issues in Learning an Ontology from Text
Issues in Learning an Ontology from Text Issues in Learning an Ontology from Text
Issues in Learning an Ontology from Text
 
Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biology
 
NLP pipeline in machine translation
NLP pipeline in machine translationNLP pipeline in machine translation
NLP pipeline in machine translation
 

Mais de robertstevens65

Ontologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientOntologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientrobertstevens65
 
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016robertstevens65
 
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...robertstevens65
 
The Quality of Method Reporting in
The Quality of Method Reporting in The Quality of Method Reporting in
The Quality of Method Reporting in robertstevens65
 
The Semantics of Genomic Analysis
The Semantics of  Genomic AnalysisThe Semantics of  Genomic Analysis
The Semantics of Genomic Analysisrobertstevens65
 
Issues and activities in authoring ontologies
Issues and activities in authoring ontologiesIssues and activities in authoring ontologies
Issues and activities in authoring ontologiesrobertstevens65
 
The state of the nation for ontology development
The state of the nation for ontology developmentThe state of the nation for ontology development
The state of the nation for ontology developmentrobertstevens65
 
Properties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family HistoryProperties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family Historyrobertstevens65
 
Choosing and Building Knowledge Artefacts
Choosing and Building Knowledge ArtefactsChoosing and Building Knowledge Artefacts
Choosing and Building Knowledge Artefactsrobertstevens65
 
Populous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from TemplatesPopulous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from Templatesrobertstevens65
 
Keeping ontology development Agile
Keeping ontology development AgileKeeping ontology development Agile
Keeping ontology development Agilerobertstevens65
 
Lessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesLessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesrobertstevens65
 
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)robertstevens65
 
A Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a RoseA Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a Roserobertstevens65
 
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...robertstevens65
 
Knowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based DisciplineKnowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based Disciplinerobertstevens65
 
A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2robertstevens65
 
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4 RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4 robertstevens65
 
Communities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and RealityCommunities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and Realityrobertstevens65
 

Mais de robertstevens65 (20)

Ontologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientOntologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficient
 
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
 
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
 
The Quality of Method Reporting in
The Quality of Method Reporting in The Quality of Method Reporting in
The Quality of Method Reporting in
 
The Semantics of Genomic Analysis
The Semantics of  Genomic AnalysisThe Semantics of  Genomic Analysis
The Semantics of Genomic Analysis
 
Issues and activities in authoring ontologies
Issues and activities in authoring ontologiesIssues and activities in authoring ontologies
Issues and activities in authoring ontologies
 
The state of the nation for ontology development
The state of the nation for ontology developmentThe state of the nation for ontology development
The state of the nation for ontology development
 
Properties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family HistoryProperties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family History
 
Choosing and Building Knowledge Artefacts
Choosing and Building Knowledge ArtefactsChoosing and Building Knowledge Artefacts
Choosing and Building Knowledge Artefacts
 
Populous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from TemplatesPopulous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from Templates
 
Keeping ontology development Agile
Keeping ontology development AgileKeeping ontology development Agile
Keeping ontology development Agile
 
Spreadsheets to OWL
Spreadsheets to OWLSpreadsheets to OWL
Spreadsheets to OWL
 
Lessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesLessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologies
 
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
 
A Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a RoseA Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a Rose
 
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
 
Knowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based DisciplineKnowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based Discipline
 
A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2
 
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4 RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
 
Communities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and RealityCommunities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and Reality
 

Último

ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXDole Philippines School
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxEran Akiva Sinbar
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingNetHelix
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Forensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxForensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxkumarsanjai28051
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 
Pests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPirithiRaju
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxMurugaveni B
 
Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Tamer Koksalan, PhD
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx023NiWayanAnggiSriWa
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 

Último (20)

ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptx
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
Forensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxForensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptx
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 
Pests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdf
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
 
Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 

Knowing what we’re talking about

  • 1. Knowing what we’re talking about Robert Stevens Bio-health Informatics Group School of Computer Science University of manchester Oxford Road Manchester United Kingdom M13 9PL Robert.Stevens@manchester.ac.uk
  • 2. We have an item of data • 27 • 27 what? • Units, with what is 27 associated? • Even if I told you, would we interpret what I said in the same way? 27
  • 5. Mouse tail of 27 mm • … and we can carry on: Mouse strain, where was it raised, on what was it fed, times, dates, etc. etc. • All this data is necessary to interpret my original number • Even if that metadata exists, we have to agree on the things the numbers describe mouse tail of 27mm
  • 7. Heterogeneity is rife • We agree on units (more or less)… • We don’t agree on much else when it comes to labels for the entities in our domain • If we don’t know what we’re talking about…. • It’s difficult to interpret and exchange data and the results from data
  • 8. Categories and Category Labels GO:0000368 U2-type nuclear mRNA 5' splice site recognition spliceosomal E complex formation spliceosomal E complex biosynthesis spliceosomal CC complex formation U2-type nuclear mRNA 5'-splice site recognition
  • 9. The Ogden Triangle “Roast Beef“ Concept [Ogden, Richards, 1923] • Humans require words (or at least symbols) to communicate efficiently. The mapping of words to things is only indirectly possible. We do it by creating concepts that refer to things. • The relation between symbols and things has been described in the form of the meaning triangle:
  • 10. We need to know what we’re talking about… • … if we don’t, our data are useless • Ifg we are to interpret our data then we need to know what entities it describes • We need to share data and re-use it • We need to find data; compare data; analyse data • We need to know what we know….
  • 11. Manchester Mercury January 1st 1754 Executed 18 Found Dead 34 Frighted 2 Kill'd by falls and other accidents 55 Kill'd themselves 36 Murdered 3 Overlaid 40 Poisoned 1 Scalded 5 Smothered 1 Stabbed 1 Starved 7 Suffocated 5 Aged 1456 Consumption 3915 Convulsion 5977 Dropsy 794 Fevers 2292 Smallpox 774 Teeth 961 Bit by mad dogs 3 Broken Limbs 5 Bruised 5 Burnt 9 Drowned 86 Excessive Drinking 15 List of diseases & casualties this year 19276 burials 15444 christenings Deaths by centile
  • 12. A World of Instances • The world (of information) is made up of things and lots of them • Instances, individuals, objects, tokens, particulars. • The Earth is a kind of Planet • Robert Stevens (NE 67 41 58 A) is a Person • All the individual Alpha Haemoglobins in my many Instances of Red Blood Cell • Each cell instance in my Body has copies of some 30,000 Genes • A Word, language, idea, etc. • This Table, those Chairs, • Any Thing with “A”, “The”, “That”, etc. before it….
  • 13. We Put things into Categories • All these instances hang about making our world • Putting these things into categories is a fundamental part of human cognition • Psychologists study this as concept formation • The same instances are put into a category
  • 14. We have Labels for the Categories and their Instances • We label categories with symbols: Words • “Lion” is a category of big cat with big teeth • Gene, Protein, Cell, Person, Hydrolase Activity, etc. • …and, as we’ve already seen, each category can have many labels and any particular label can refer to more than one category • Semantic Heterogeneity • “A lion” is an instance in that category • Does the category “Lion” exist? • Lions exist, but the category could just be a human way of talking about lions • … we like putting things into categories
  • 15. A Controlled Vocabulary• A specified set of words and phrases for the categories in which we place instances • Natural language definitions for those words and phrases • A glossary defines, but doesn’t control • The Uniprot keywords define and control • Control is placed upon which labels are used to represent the categories (concepts) we’ve used to describe the instances in the world • …, but there is nothing about how things in these categories are related Biopolymer DNA Enzyme Nucleic acid mRNA Polypeptide snRNA tRNA
  • 16. We also like to Relate Things Together • Categories have subcategories • Instances in one category can be related in some way to instances in another • Can relate instances to each other in many different ways • Is-a, part-of, develops-from, etc.axes • We can use these relationships to classify categories • Things in category A are part is • If all instances in category A are also in category B then As are kinds of Bs Biopolymer Nucleic Acid Polypeptide Enzym e DNA RNA tRNA mRNA smRNA
  • 18. Describing Category Membership • We can make conditions that any instance must fulfil in order to be a member of a particular category • A Phosphatase must have a phosphatase catalytic domain • A Receptor must have a transmembrane domain • A codon has three nucleotide residues • A limb has part that is a joint • A man has a Y chromosome and an X chromosome • A woman has only an X chromosome
  • 19. Relationships • These conditions made from a property and a successor relationship • isPartOf, hasPart • isDerivedFrom • DevelopsFrom • isHomologousTo • …and many, many more
  • 20. A Structured Controlled Vocabulary • Not only can we agree on the labels we give categories • Can also agree on how the instances of categories are related • And agree on the labels we give he relations • Structure aids querying and captures knowledge with greater fidelity Biopolymer Nucleic Acid Polypeptide Enzym e DNA RNA tRNA mRNA smRNA Gene transcribedFrom
  • 21. A Stronger Definition • a set of logical axioms designed to account for the intended meaning of a formal vocabulary used to describe a certain (conceptualisation of) reality [described in an information system) [Guarino 1998] • “conceptualisation of” inserted by me • “Logical axioms” means a formal definition of meaning of terms in a formal language • Formal language—something a computer an reason with • Use symbols to make inferences • Symbols represent things and their relationships • Making inferences about things computationally
  • 22. So what is an ontology? Catalog/ ID Thesauri Terms/ glossary Informal Is-a Formal Is-a Formal instance Frames (properties) General Logical constraints Value restrictions Disjointness, Inverse, partof Gene Ontology Mouse Anatomy EcoCyc PharmGKB TAMBIS Arom After Chris Welty et al
  • 23. What does it all mean anyway • To interpret our data we need to know what it is we’re talking about • We need to decide the things that we’re talking about and agree upon them • We need to agree on how to recognise those entities • We need to know how they are related to one another • Ontologies are a mechanism for describing those entities and their definitions • There’s more to knowledge representation than ontologies…
  • 24. All this knowledge needs representing • We want this knowledge in a computational form • To make the knowledge available for software (and humans) • To help us develop and manage the (often) complex artefacts Building ontologies is hard (getting all those relationships in the right place) The Web Ontology Language (OWL) is a W3C recommendation for ontologies on the Semantic Web and in semantically enabled applications A knowledge representation language with a strict semantics that is amenable to autoamted reasoning
  • 25. Web Ontology Language (OWL) • W3C recommendation for ontologies for the Semantic Web • OWL-DL mapped to a decidable fragment of first order logic • Classes, properties and instances • Boolean operators, plus existential and universal quantification • Rich class expressions used in restriction on properties – hasDomain some (ImnunoGlobinDomain or FibronectinDomain)
  • 26. What are we saying? Person WomanMan is-ais-a • Are all instances of Man instances of Person? • Can an instance of Person be both a Man and an instance of Woman? • Can there be any more kinds of Person?
  • 27. What are we saying? • What kinds of class can fill “has chromosome”? • How many “Y chromosome” are present? • Does their have to be a “Y chromosome”? • What properties are sufficient to be a Man and which are simply necessary? Y chromosomeMan has-chromosome Y chromosomeMan has-chromosome X chromosomehas-chromosome autosomehas-chromosome 1 1 44
  • 29. Necessity and Sufficiency • An R2A phosphatase must have a fibronectin domain • Having a fibronectin domain does not a phosphatase make • Necessity -- what must a class instance have? • Any protein that has a phosphatase catalytic domain is a phosphatase enzyme • All phosphatase enzymes have a catalytic domain • Sufficiency – how is an instance recognised to be a member of a class?
  • 32. Problems Ontologies in Biology Try To Solve • Provenance – where did it come from, who did it? • Reproducibility – can I repeat and find results reported? • Sharing – can others understand your data? • Integration – can I readily take multiple (thousands of) data sets and use them without preparation? • New knowledge – can we infer new knowledge as a sum of current knowledge (computationally)?
  • 33. The rise and rise of ontologies
  • 34. What are the prospects for ontologies