SlideShare a Scribd company logo
1 of 52
Download to read offline
Functional Characterisation of
Metabolic Networks
Carlos Manuel Estévez-Bretón MSc
Doctorate in Systems Engineering and Computer Sciences
Advisors: Luis Fernando Niño PhD
Liliana Lopez Kleine PhD
Intelligent Systems Research Laboratory - LISI
Bioinformatics and Computational Biology research line “BioLisi”
Examining Committee:
Dr. Jason Papin, -U. ofVirginia, Bioengineering.
Dr.Andres Gonzalez, - U. de los Andes, Chemical Engineering.
Dr. Fabio Gonzalez, U. Nacional, Systems Engineering.
What...
Why...
Research Question
How...
Progress ...
Agenda
Goals
Evaluation
Deliverables
What?
http://www.impactcommunicationsinc.com/wp-content/uploads/2011/10/11-11_speak_up.jpg
Metabolism are the
complete set of
metabolic
networks and
physical processes
that determine the
physiological and
biochemical properties
of a cell.
With the sequencing of complete
genomes, it is now possible to
reconstruct the network of biochemical
reactions in many organisms, from
bacteria to humans...
PMC 2011 August 17.
Wiley Interdiscip Rev Syst Biol Med. 2010 Jul-Aug; 2(4): 438–459.
doi: 10.1002/wsbm.75
Ecological Scale
Lucas B. Edelman, James A. Eddy, and Nathan D. Price
Systems BiologyIntroduction
PMC 2011 August 17.
Wiley Interdiscip Rev Syst Biol Med. 2010 Jul-Aug; 2(4): 438–459.
doi: 10.1002/wsbm.75
Ecological Scale
Lucas B. Edelman, James A. Eddy, and Nathan D. Price
Systems BiologyIntroduction
PMC 2011 August 17.
Wiley Interdiscip Rev Syst Biol Med. 2010 Jul-Aug; 2(4): 438–459.
doi: 10.1002/wsbm.75
Ecological Scale
Lucas B. Edelman, James A. Eddy, and Nathan D. Price
Systems BiologyIntroduction
PMC 2011 August 17.
Wiley Interdiscip Rev Syst Biol Med. 2010 Jul-Aug; 2(4): 438–459.
doi: 10.1002/wsbm.75
Ecological Scale
Lucas B. Edelman, James A. Eddy, and Nathan D. Price
Systems BiologyIntroduction
PMC 2011 August 17.
Wiley Interdiscip Rev Syst Biol Med. 2010 Jul-Aug; 2(4): 438–459.
doi: 10.1002/wsbm.75
Ecological Scale
Lucas B. Edelman, James A. Eddy, and Nathan D. Price
Multilevelfield
Systems BiologyIntroduction
PMC 2011 August 17.
Wiley Interdiscip Rev Syst Biol Med. 2010 Jul-Aug; 2(4): 438–459.
doi: 10.1002/wsbm.75
Ecological Scale
Lucas B. Edelman, James A. Eddy, and Nathan D. Price
Multilevelfield
Studied
Interdisciplinary
Systems BiologyIntroduction
IntroductionBetter and cheaper processing power
Multilevel Information
IntroductionBetter and cheaper processing power
Introduction
Regulatory Networks
Protein Protein Interaction
Networks
Metabolic Networks
Ecological Networks
Introduction
Regulatory Networks
Protein Protein Interaction
Networks
Metabolic Networks
Ecological Networks
Main Data Sources
“Techniques such as high-trougput (HT)
sequencing and gene/protein profiling have
transformed biological Research” (Khatri et al,2012)
“In this way,the advent of HT profiling technologies
presents a new challenge,that of extracting meaning from
a long list of differentially expressed genes and proteins”.
(Khatri et al,2012)
“Techniques such as high-trougput (HT)
sequencing and gene/protein profiling have
transformed biological Research” (Khatri et al,2012)
“In this way,the advent of HT profiling technologies
presents a new challenge,that of extracting meaning from
a long list of differentially expressed genes and proteins”.
(Khatri et al,2012)
These biological techniques changes the way we study
biological science.
Interdisciplinary effort to extract meaning, analyze, and
obtain information with high levels of confidence and
quality.
[14:56 18/11/2011 Bioinformatics-btr585.tex] Page: 3331 3331–3332
commonly used in bioinformatics and their common synonyms,
plural forms and abbreviations. We then searched this list against
the PubMed titles and abstracts to identify the number of papers
published per year for each machine learning technique. To match as
many papers as possible, searches were case insensitive and allowed
for variation in hyphenation.
Fig. 1. The growth of supervised machine learning methods in PubMed.
∗To whom correspondence should be addressed
perhaps going out of fashion. The results show that none of the
major league methods has gone out of fashion, but we do see
moderate decreases in the use of both ANNs and Markov models in
the literature.
We were also curious to find out if certain machine learning
techniques were used in combination with each other. To investigate
this, we looked at what machine learning methods are co-mentioned
in articles (See Fig. 2). For all pairs of methods from the Supervised
Fig. 2. Heatmap showing the co-occurrence of machine learning techniques
within articles.
© The Author(s) 2011. Published by Oxford University Press.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/
by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
byguestonDecember7,2011ormatics.oxfordjournals.org/
“Hot techniques”: ANN,
Markov Models,and“new ones”
SVM and Random Forests.
(Jensen & Bateman in 2011)
IntelligentSystems
Latent Topic Analysisis not in the list of methods.
“In particular,supervised machine learning has been
used to great effect in numerous bioinformatics
prediction methods”.
(Jensen & Bateman,2011)
Machine learning is of immense importance in
bioinformatics and more generally for biomedical
sciences (Larrañaga et al.,2006;Tarca et al.,2007).
Because in metabolic systems analysis,is not common,
I think that is important to emphasise that:
There are no references in the literature for
analysis of metabolic pathways from a
functional approach,or using proposed
machine learning methods.
IntelligentSystems
Larrañaga et al. bib.oxfordjournals.org at The Reference Shelf on May 30, 2011
achineLearning
Larrañaga et al. bib.oxfordjournals.org at The Reference Shelf on May 30, 2011
Bayesian classifiers, Feature subset
selection
SVM,ANN, classification trees,
Evolutionary algorithms
tabu search
nearest neighbour, SVM, Bayesian
classifier, fuzzy k-NN
Bayesiangeneralizationofthe
SVM,ANN,lineardiscriminant
analysis,classificationtrees,ANN
SVMandHMM,
linear discriminant analysis,
quadratic discriminant
analysis, k-NN classifier,
bagging and boosting
classification trees, SVM and
random forest
achineLearning
Larrañaga et al. bib.oxfordjournals.org at The Reference Shelf on May 30, 2011
Bayesian classifiers, Feature subset
selection
SVM,ANN, classification trees,
Evolutionary algorithms
tabu search
nearest neighbour, SVM, Bayesian
classifier, fuzzy k-NN
Bayesiangeneralizationofthe
SVM,ANN,lineardiscriminant
analysis,classificationtrees,ANN
probabilistic graphical
models, classification
trees, boosting with
classification trees
SVMandHMM,
linear discriminant analysis,
quadratic discriminant
analysis, k-NN classifier,
bagging and boosting
classification trees, SVM and
random forest
achineLearning
Why?
http://www.perftrends.com/images/why.jpg
... or Methods are
not applied to
Metabolic Pathways...
...or are based onTopological (Graph Based)
network representations
• It should be possible to make some
advances in understanding the
underlying functional conformation
of metabolic pathways.
Statem
ent
http://www.scriptmag.com/wp-content/uploads/BrainStorm-NewColor-12-22_32-1280x980at86.jpg
http://www.scriptmag.com/wp-content/uploads/BrainStorm-NewColor-12-22_32-1280x980at86.jpg
• Supervised Clustering - useful to test the
given representation - by classifying the biochemical
reactions.
http://www.ee.ryerson.ca/~courses/ele888/ele_888_pat_class.gif
Statem
ent
http://diversity-mining-lab.wikispaces.com/
Statem
ent
• Information Retrieval algebraic models, like
vector space based ones, should “reveal” topics
that occurs in document collections.
• Is it possible to generate new - “really new” pathways?
• ...I’m talking about synthetic biology.
http://diversity-mining-lab.wikispaces.com/
Statem
ent
Research Question
Is it possible to
classify metabolic
networks only
using functional
features?
How?
http://www.wired.com/images_blogs/threatlevel/2012/10/harris002.jpg
Goals
• To Classify functionally, (without considering the
topological structure) metabolic pathways
based on machine learning methods.
Goals
• To Classify functionally, (without considering the
topological structure) metabolic pathways
based on machine learning methods.
• To Build or adapt a system of functional representation
for metabolic networks.
Goals
• To Classify functionally, (without considering the
topological structure) metabolic pathways
based on machine learning methods.
• To Build or adapt a system of functional representation
for metabolic networks.
• To Classify metabolic networks using machine learning
methods.
Goals
• To Classify functionally, (without considering the
topological structure) metabolic pathways
based on machine learning methods.
• To Build or adapt a system of functional representation
for metabolic networks.
• To Classify metabolic networks using machine learning
methods.
• To Apply (in new ways) machine learning methods in
the study of systems biology.
Methodology
S1 + S2 + … Sn P1 + P2 + … Pn
Enzime
CoFactor CoEnzime
General Metabolic Reaction Model - GMRM
Vectorization of GMRM
S1 S2 S3 Enzime CoF CoE P1 P2 P3
MetaCyc
KEGG1
2
RepresentationClassification
CarlosManuelEstévez-BretónR.2012
DataSourceEvaluation
Method 2Method 1
ROC
Confusion
matrix
Entropy
purity
adjusted
Rand Index
Accuracy
Pipeline
paper paper
paper
DataSources MetaCyc
KEGG1
2
DataRepresentation
S1 + S2 + … Sn P1 + P2 + … Pn
Enzime
CoFactor CoEnzime
General Metabolic Reaction Model - GMRM
Vectorization of GMRM
S1 S2 S3 Enzime CoF CoE P1 P2 P3
Classification
Supervised Classification
Method 1
•Let’s think about clustering without any
prior knowledge...
• Applying Information Retrieval methods to
Metabolic Pathways data.
Method 2
Evaluation ROC
Confusion
matrix
Entropy
purity
adjusted
Rand Index
Accuracy
http://www.intechopen.com/source/html/38584/media/image56.jpeg
Classified as:
Really is:
Positive Negative
Positive
Negative
False Negative
True NegativeFalse Positive
True Positive
Evaluation ROC
Confusion
matrix
Entropy
purity
adjusted
Rand Index
Accuracy
http://www.intechopen.com/source/html/38584/media/image56.jpeg
Classified as:
Really is:
Positive Negative
Positive
Negative
False Negative
True NegativeFalse Positive
True Positive
Error Rate
Recall/sensitivity
Specificity/True Negative Rate
Precision
1-Specificity/False Alarm Rate
Evaluation
ROC
Confusion
matrix
Entropy
purity
adjusted
Rand Index
Accuracy
http://www.intechopen.com/source/html/38584/media/image56.jpeg
http://wwww.cbgstat.com/v2/method_ROC_curve_MedCalc/images/ROC_curve_MedCalc_Snap17.gif
Deliverables
A computational metabolic
representation proposal
A computational metabolic
classification method
A generative metabolic
pathways model
A pipeline for metabolic
pathways analysis
Progress ...
http://desktop.freewallpaper4.me/view/original/3714/the-lonely-man.jpg
PreliminaryResults
S1 + S2 + … Sn P1 + P2 + … Pn
Enzime
CoFactor CoEnzime
General Metabolic Reaction Model - GMRM
Vectorization of GMRM
S1 S2 S3 Enzime CoF CoE P1 P2 P3
MetaCyc
KEGG1
2
RepresentationClassification
CarlosManuelEstévez-BretónR.2012
DataSourceEvaluation
Method 2Method 1
ROC
Confusion
matrix
Entropy
purity
adjusted
Rand Index
Accuracy
Pipeline
paper paper
paper
Complexity
Metabolic Pathway
Reaction
Metabolites/ome
Metabolic Switch
Glucose
Glucose 6P ATP
Hidrolase
Pyrophosphate
Vocabulary
Words Molecules
the
Murder for a jar of red rum
frog
soap
Document
Phrase
Paragraph
rum Murder for
jar
a
ofred
rum Murder for
jar
a
ofred
Glucose Glucose 6PATP
Hidrolase
ADP+ +
ADP
LinguisticAnalogy
S1 + S2 + … Sn P1 + P2 + … Pn
Enzime
CoFactor CoEnzime
General Metabolic Reaction Model - GMRM
Vectorization of GMRM
S1 S2 S3 Enzime CoF CoE P1 P2 P3
Representation
S1 + S2 + … Sn P1 + P2 + … Pn
Enzime
CoFactor CoEnzime
General Metabolic Reaction Model - GMRM
Vectorization of GMRM
S1 S2 S3 Enzime CoF CoE P1 P2 P3
Classification Supervised
4Pathways
2carbohydrate metabolism
1lipid metabolism
1from nucleotide metabolism
Support Vector Machines
Classification Tree
K Nearest Neighbour
CN2Naive Bayes
24
organisms
Method 1
Pipeline
Review
- Proposing a vector representation of biochemical
reactions, based in a linguistic analogy.
I´m going to classify metabolic networks
only using functional features...
To find patterns that suggests constitution
rules on metabolic pathways.
- Searching patterns by clustering.
Thanks
@karelman

More Related Content

What's hot

Computational Approaches to Systems Biology
Computational Approaches to Systems BiologyComputational Approaches to Systems Biology
Computational Approaches to Systems BiologyMike Hucka
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuKAUSHAL SAHU
 
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0Fokhruz Zaman
 
Bioinformatics-General_Intro
Bioinformatics-General_IntroBioinformatics-General_Intro
Bioinformatics-General_IntroAbhiroop Ghatak
 
Bioinformatics for beginners (exam point of view)
Bioinformatics for beginners (exam point of view)Bioinformatics for beginners (exam point of view)
Bioinformatics for beginners (exam point of view)Sijo A
 
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusA Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological Corpusijcsit
 
introduction of Bioinformatics
introduction of Bioinformaticsintroduction of Bioinformatics
introduction of BioinformaticsVinaKhan1
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsElena Sügis
 
Bibliography (Microsoft Word, 61k)
Bibliography (Microsoft Word, 61k)Bibliography (Microsoft Word, 61k)
Bibliography (Microsoft Word, 61k)butest
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...Alejandra Gonzalez-Beltran
 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformaticscontactsoorya
 
TWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLS
TWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLSTWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLS
TWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLSIJDKP
 
Cheminformatics: An overview
Cheminformatics: An overviewCheminformatics: An overview
Cheminformatics: An overviewsubhasis banerjee
 

What's hot (19)

Computational Approaches to Systems Biology
Computational Approaches to Systems BiologyComputational Approaches to Systems Biology
Computational Approaches to Systems Biology
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahu
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
 
Bioinformatics-General_Intro
Bioinformatics-General_IntroBioinformatics-General_Intro
Bioinformatics-General_Intro
 
Data mining ppt
Data mining pptData mining ppt
Data mining ppt
 
Bioinformatics for beginners (exam point of view)
Bioinformatics for beginners (exam point of view)Bioinformatics for beginners (exam point of view)
Bioinformatics for beginners (exam point of view)
 
Drug Discovery- ELRIG -2012
Drug Discovery- ELRIG -2012Drug Discovery- ELRIG -2012
Drug Discovery- ELRIG -2012
 
Ibn Sina
Ibn SinaIbn Sina
Ibn Sina
 
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusA Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
 
introduction of Bioinformatics
introduction of Bioinformaticsintroduction of Bioinformatics
introduction of Bioinformatics
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in Bioinformatics
 
Bibliography (Microsoft Word, 61k)
Bibliography (Microsoft Word, 61k)Bibliography (Microsoft Word, 61k)
Bibliography (Microsoft Word, 61k)
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...
 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformatics
 
TWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLS
TWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLSTWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLS
TWO LEVEL SELF-SUPERVISED RELATION EXTRACTION FROM MEDLINE USING UMLS
 
Bioinformatics ppt
Bioinformatics pptBioinformatics ppt
Bioinformatics ppt
 
Cheminformatics: An overview
Cheminformatics: An overviewCheminformatics: An overview
Cheminformatics: An overview
 
Resume
ResumeResume
Resume
 

Viewers also liked

Viewers also liked (8)

ILUSIONS
ILUSIONSILUSIONS
ILUSIONS
 
ILUSIONS
ILUSIONSILUSIONS
ILUSIONS
 
Malaga presentation
Malaga presentationMalaga presentation
Malaga presentation
 
No Leche De Vaca
No Leche De VacaNo Leche De Vaca
No Leche De Vaca
 
'Vlaamse' menukaart in Dinant
'Vlaamse' menukaart in Dinant'Vlaamse' menukaart in Dinant
'Vlaamse' menukaart in Dinant
 
Dani
DaniDani
Dani
 
Barcelona
BarcelonaBarcelona
Barcelona
 
Goya Wiyp Presentation
Goya Wiyp PresentationGoya Wiyp Presentation
Goya Wiyp Presentation
 

Similar to PhDc exam presentation

2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europeopen_phacts
 
Presentation.pptx
Presentation.pptxPresentation.pptx
Presentation.pptxAshuAsh15
 
2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAGopen_phacts
 
BIOINFO unit 1.pptx
BIOINFO unit 1.pptxBIOINFO unit 1.pptx
BIOINFO unit 1.pptxrnath286
 
Tales from BioLand - Engineering Challenges in the World of Life Sciences
Tales from BioLand - Engineering Challenges in the World of Life SciencesTales from BioLand - Engineering Challenges in the World of Life Sciences
Tales from BioLand - Engineering Challenges in the World of Life SciencesStefano Di Carlo
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsmikaelhuss
 
NatashaBME1450.doc
NatashaBME1450.docNatashaBME1450.doc
NatashaBME1450.docbutest
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyBarry Smith
 
Introduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfIntroduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfkigaruantony
 
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...Amit Sheth
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook OntologyStuart Chalk
 
Protein sequence classification in data mining– a study
Protein sequence classification in data mining– a studyProtein sequence classification in data mining– a study
Protein sequence classification in data mining– a studyZac Darcy
 
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDYPROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDYZac Darcy
 
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS AIRCC Publishing Corporation
 
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusA Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusAIRCC Publishing Corporation
 

Similar to PhDc exam presentation (20)

2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Presentation.pptx
Presentation.pptxPresentation.pptx
Presentation.pptx
 
2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG
 
A biologist in e-Science
A biologist in e-ScienceA biologist in e-Science
A biologist in e-Science
 
BIOINFO unit 1.pptx
BIOINFO unit 1.pptxBIOINFO unit 1.pptx
BIOINFO unit 1.pptx
 
Tales from BioLand - Engineering Challenges in the World of Life Sciences
Tales from BioLand - Engineering Challenges in the World of Life SciencesTales from BioLand - Engineering Challenges in the World of Life Sciences
Tales from BioLand - Engineering Challenges in the World of Life Sciences
 
Sabina Leonelli
Sabina LeonelliSabina Leonelli
Sabina Leonelli
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
 
Bio ontology drtc-seminar_anwesha
Bio ontology drtc-seminar_anweshaBio ontology drtc-seminar_anwesha
Bio ontology drtc-seminar_anwesha
 
NatashaBME1450.doc
NatashaBME1450.docNatashaBME1450.doc
NatashaBME1450.doc
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental Biology
 
Introduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfIntroduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdf
 
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook Ontology
 
BioPortal: ontologies and integrated data resources at the click of a mouse
BioPortal: ontologies and integrated data resourcesat the click of a mouseBioPortal: ontologies and integrated data resourcesat the click of a mouse
BioPortal: ontologies and integrated data resources at the click of a mouse
 
ISA - a short overview - Dec 2013
ISA - a short overview - Dec 2013ISA - a short overview - Dec 2013
ISA - a short overview - Dec 2013
 
Protein sequence classification in data mining– a study
Protein sequence classification in data mining– a studyProtein sequence classification in data mining– a study
Protein sequence classification in data mining– a study
 
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDYPROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
 
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
A SEMANTIC RETRIEVAL SYSTEM FOR EXTRACTING RELATIONSHIPS FROM BIOLOGICAL CORPUS
 
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusA Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
 

More from Carlos Manuel Estévez-Bretón Riveros

Aspectos Básicos de la Propiedad Intelectual y su Relación con las Ciencias M...
Aspectos Básicos de la Propiedad Intelectual y su Relación con las Ciencias M...Aspectos Básicos de la Propiedad Intelectual y su Relación con las Ciencias M...
Aspectos Básicos de la Propiedad Intelectual y su Relación con las Ciencias M...Carlos Manuel Estévez-Bretón Riveros
 

More from Carlos Manuel Estévez-Bretón Riveros (14)

Derechos de autor y de copia en el desarrollo de software
Derechos de autor y de copia en el desarrollo de softwareDerechos de autor y de copia en el desarrollo de software
Derechos de autor y de copia en el desarrollo de software
 
Biological technologies
Biological technologiesBiological technologies
Biological technologies
 
Datamining y Machine Learning para Ciencias Biológicas
Datamining y Machine Learning para Ciencias BiológicasDatamining y Machine Learning para Ciencias Biológicas
Datamining y Machine Learning para Ciencias Biológicas
 
Aspectos Básicos de la Propiedad Intelectual y su Relación con las Ciencias M...
Aspectos Básicos de la Propiedad Intelectual y su Relación con las Ciencias M...Aspectos Básicos de la Propiedad Intelectual y su Relación con las Ciencias M...
Aspectos Básicos de la Propiedad Intelectual y su Relación con las Ciencias M...
 
Ejemplo beneficios tributarios para Inversión Colciencias
Ejemplo beneficios tributarios para Inversión ColcienciasEjemplo beneficios tributarios para Inversión Colciencias
Ejemplo beneficios tributarios para Inversión Colciencias
 
Una mirada al futuro de la Biotecnología
Una mirada al futuro de la BiotecnologíaUna mirada al futuro de la Biotecnología
Una mirada al futuro de la Biotecnología
 
Contagion Mobile App
Contagion Mobile AppContagion Mobile App
Contagion Mobile App
 
Tv & inetrnet
Tv & inetrnetTv & inetrnet
Tv & inetrnet
 
TICs & Medio Ambiente
TICs & Medio AmbienteTICs & Medio Ambiente
TICs & Medio Ambiente
 
Boinc
BoincBoinc
Boinc
 
Campus Party MEN
Campus Party MENCampus Party MEN
Campus Party MEN
 
Biologia de Sistemas y Biologia Sintetica
Biologia de Sistemas y Biologia SinteticaBiologia de Sistemas y Biologia Sintetica
Biologia de Sistemas y Biologia Sintetica
 
Bioinformatica GeBiX
Bioinformatica GeBiXBioinformatica GeBiX
Bioinformatica GeBiX
 
GeBiX Muestras Codebar
GeBiX Muestras CodebarGeBiX Muestras Codebar
GeBiX Muestras Codebar
 

Recently uploaded

Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Presentation Activity 2. Unit 3 transv.pptx
Presentation Activity 2. Unit 3 transv.pptxPresentation Activity 2. Unit 3 transv.pptx
Presentation Activity 2. Unit 3 transv.pptxRosabel UA
 
TEACHER REFLECTION FORM (NEW SET........).docx
TEACHER REFLECTION FORM (NEW SET........).docxTEACHER REFLECTION FORM (NEW SET........).docx
TEACHER REFLECTION FORM (NEW SET........).docxruthvilladarez
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...JojoEDelaCruz
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 

Recently uploaded (20)

FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Presentation Activity 2. Unit 3 transv.pptx
Presentation Activity 2. Unit 3 transv.pptxPresentation Activity 2. Unit 3 transv.pptx
Presentation Activity 2. Unit 3 transv.pptx
 
TEACHER REFLECTION FORM (NEW SET........).docx
TEACHER REFLECTION FORM (NEW SET........).docxTEACHER REFLECTION FORM (NEW SET........).docx
TEACHER REFLECTION FORM (NEW SET........).docx
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 

PhDc exam presentation

  • 1. Functional Characterisation of Metabolic Networks Carlos Manuel Estévez-Bretón MSc Doctorate in Systems Engineering and Computer Sciences Advisors: Luis Fernando Niño PhD Liliana Lopez Kleine PhD Intelligent Systems Research Laboratory - LISI Bioinformatics and Computational Biology research line “BioLisi” Examining Committee: Dr. Jason Papin, -U. ofVirginia, Bioengineering. Dr.Andres Gonzalez, - U. de los Andes, Chemical Engineering. Dr. Fabio Gonzalez, U. Nacional, Systems Engineering.
  • 4. Metabolism are the complete set of metabolic networks and physical processes that determine the physiological and biochemical properties of a cell. With the sequencing of complete genomes, it is now possible to reconstruct the network of biochemical reactions in many organisms, from bacteria to humans...
  • 5. PMC 2011 August 17. Wiley Interdiscip Rev Syst Biol Med. 2010 Jul-Aug; 2(4): 438–459. doi: 10.1002/wsbm.75 Ecological Scale Lucas B. Edelman, James A. Eddy, and Nathan D. Price Systems BiologyIntroduction
  • 6. PMC 2011 August 17. Wiley Interdiscip Rev Syst Biol Med. 2010 Jul-Aug; 2(4): 438–459. doi: 10.1002/wsbm.75 Ecological Scale Lucas B. Edelman, James A. Eddy, and Nathan D. Price Systems BiologyIntroduction
  • 7. PMC 2011 August 17. Wiley Interdiscip Rev Syst Biol Med. 2010 Jul-Aug; 2(4): 438–459. doi: 10.1002/wsbm.75 Ecological Scale Lucas B. Edelman, James A. Eddy, and Nathan D. Price Systems BiologyIntroduction
  • 8. PMC 2011 August 17. Wiley Interdiscip Rev Syst Biol Med. 2010 Jul-Aug; 2(4): 438–459. doi: 10.1002/wsbm.75 Ecological Scale Lucas B. Edelman, James A. Eddy, and Nathan D. Price Systems BiologyIntroduction
  • 9. PMC 2011 August 17. Wiley Interdiscip Rev Syst Biol Med. 2010 Jul-Aug; 2(4): 438–459. doi: 10.1002/wsbm.75 Ecological Scale Lucas B. Edelman, James A. Eddy, and Nathan D. Price Multilevelfield Systems BiologyIntroduction
  • 10. PMC 2011 August 17. Wiley Interdiscip Rev Syst Biol Med. 2010 Jul-Aug; 2(4): 438–459. doi: 10.1002/wsbm.75 Ecological Scale Lucas B. Edelman, James A. Eddy, and Nathan D. Price Multilevelfield Studied Interdisciplinary Systems BiologyIntroduction
  • 11. IntroductionBetter and cheaper processing power
  • 13. Introduction Regulatory Networks Protein Protein Interaction Networks Metabolic Networks Ecological Networks
  • 14. Introduction Regulatory Networks Protein Protein Interaction Networks Metabolic Networks Ecological Networks Main Data Sources
  • 15. “Techniques such as high-trougput (HT) sequencing and gene/protein profiling have transformed biological Research” (Khatri et al,2012) “In this way,the advent of HT profiling technologies presents a new challenge,that of extracting meaning from a long list of differentially expressed genes and proteins”. (Khatri et al,2012)
  • 16. “Techniques such as high-trougput (HT) sequencing and gene/protein profiling have transformed biological Research” (Khatri et al,2012) “In this way,the advent of HT profiling technologies presents a new challenge,that of extracting meaning from a long list of differentially expressed genes and proteins”. (Khatri et al,2012) These biological techniques changes the way we study biological science. Interdisciplinary effort to extract meaning, analyze, and obtain information with high levels of confidence and quality.
  • 17. [14:56 18/11/2011 Bioinformatics-btr585.tex] Page: 3331 3331–3332 commonly used in bioinformatics and their common synonyms, plural forms and abbreviations. We then searched this list against the PubMed titles and abstracts to identify the number of papers published per year for each machine learning technique. To match as many papers as possible, searches were case insensitive and allowed for variation in hyphenation. Fig. 1. The growth of supervised machine learning methods in PubMed. ∗To whom correspondence should be addressed perhaps going out of fashion. The results show that none of the major league methods has gone out of fashion, but we do see moderate decreases in the use of both ANNs and Markov models in the literature. We were also curious to find out if certain machine learning techniques were used in combination with each other. To investigate this, we looked at what machine learning methods are co-mentioned in articles (See Fig. 2). For all pairs of methods from the Supervised Fig. 2. Heatmap showing the co-occurrence of machine learning techniques within articles. © The Author(s) 2011. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/ by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. byguestonDecember7,2011ormatics.oxfordjournals.org/ “Hot techniques”: ANN, Markov Models,and“new ones” SVM and Random Forests. (Jensen & Bateman in 2011) IntelligentSystems Latent Topic Analysisis not in the list of methods.
  • 18. “In particular,supervised machine learning has been used to great effect in numerous bioinformatics prediction methods”. (Jensen & Bateman,2011) Machine learning is of immense importance in bioinformatics and more generally for biomedical sciences (Larrañaga et al.,2006;Tarca et al.,2007). Because in metabolic systems analysis,is not common, I think that is important to emphasise that:
  • 19. There are no references in the literature for analysis of metabolic pathways from a functional approach,or using proposed machine learning methods. IntelligentSystems
  • 20. Larrañaga et al. bib.oxfordjournals.org at The Reference Shelf on May 30, 2011 achineLearning
  • 21. Larrañaga et al. bib.oxfordjournals.org at The Reference Shelf on May 30, 2011 Bayesian classifiers, Feature subset selection SVM,ANN, classification trees, Evolutionary algorithms tabu search nearest neighbour, SVM, Bayesian classifier, fuzzy k-NN Bayesiangeneralizationofthe SVM,ANN,lineardiscriminant analysis,classificationtrees,ANN SVMandHMM, linear discriminant analysis, quadratic discriminant analysis, k-NN classifier, bagging and boosting classification trees, SVM and random forest achineLearning
  • 22. Larrañaga et al. bib.oxfordjournals.org at The Reference Shelf on May 30, 2011 Bayesian classifiers, Feature subset selection SVM,ANN, classification trees, Evolutionary algorithms tabu search nearest neighbour, SVM, Bayesian classifier, fuzzy k-NN Bayesiangeneralizationofthe SVM,ANN,lineardiscriminant analysis,classificationtrees,ANN probabilistic graphical models, classification trees, boosting with classification trees SVMandHMM, linear discriminant analysis, quadratic discriminant analysis, k-NN classifier, bagging and boosting classification trees, SVM and random forest achineLearning
  • 24.
  • 25. ... or Methods are not applied to Metabolic Pathways... ...or are based onTopological (Graph Based) network representations
  • 26. • It should be possible to make some advances in understanding the underlying functional conformation of metabolic pathways. Statem ent http://www.scriptmag.com/wp-content/uploads/BrainStorm-NewColor-12-22_32-1280x980at86.jpg
  • 27. http://www.scriptmag.com/wp-content/uploads/BrainStorm-NewColor-12-22_32-1280x980at86.jpg • Supervised Clustering - useful to test the given representation - by classifying the biochemical reactions. http://www.ee.ryerson.ca/~courses/ele888/ele_888_pat_class.gif Statem ent
  • 29. • Information Retrieval algebraic models, like vector space based ones, should “reveal” topics that occurs in document collections. • Is it possible to generate new - “really new” pathways? • ...I’m talking about synthetic biology. http://diversity-mining-lab.wikispaces.com/ Statem ent
  • 30. Research Question Is it possible to classify metabolic networks only using functional features?
  • 32. Goals • To Classify functionally, (without considering the topological structure) metabolic pathways based on machine learning methods.
  • 33. Goals • To Classify functionally, (without considering the topological structure) metabolic pathways based on machine learning methods. • To Build or adapt a system of functional representation for metabolic networks.
  • 34. Goals • To Classify functionally, (without considering the topological structure) metabolic pathways based on machine learning methods. • To Build or adapt a system of functional representation for metabolic networks. • To Classify metabolic networks using machine learning methods.
  • 35. Goals • To Classify functionally, (without considering the topological structure) metabolic pathways based on machine learning methods. • To Build or adapt a system of functional representation for metabolic networks. • To Classify metabolic networks using machine learning methods. • To Apply (in new ways) machine learning methods in the study of systems biology.
  • 36. Methodology S1 + S2 + … Sn P1 + P2 + … Pn Enzime CoFactor CoEnzime General Metabolic Reaction Model - GMRM Vectorization of GMRM S1 S2 S3 Enzime CoF CoE P1 P2 P3 MetaCyc KEGG1 2 RepresentationClassification CarlosManuelEstévez-BretónR.2012 DataSourceEvaluation Method 2Method 1 ROC Confusion matrix Entropy purity adjusted Rand Index Accuracy Pipeline paper paper paper
  • 38. DataRepresentation S1 + S2 + … Sn P1 + P2 + … Pn Enzime CoFactor CoEnzime General Metabolic Reaction Model - GMRM Vectorization of GMRM S1 S2 S3 Enzime CoF CoE P1 P2 P3
  • 40. •Let’s think about clustering without any prior knowledge... • Applying Information Retrieval methods to Metabolic Pathways data. Method 2
  • 41. Evaluation ROC Confusion matrix Entropy purity adjusted Rand Index Accuracy http://www.intechopen.com/source/html/38584/media/image56.jpeg Classified as: Really is: Positive Negative Positive Negative False Negative True NegativeFalse Positive True Positive
  • 42. Evaluation ROC Confusion matrix Entropy purity adjusted Rand Index Accuracy http://www.intechopen.com/source/html/38584/media/image56.jpeg Classified as: Really is: Positive Negative Positive Negative False Negative True NegativeFalse Positive True Positive Error Rate Recall/sensitivity Specificity/True Negative Rate Precision 1-Specificity/False Alarm Rate
  • 44. Deliverables A computational metabolic representation proposal A computational metabolic classification method A generative metabolic pathways model A pipeline for metabolic pathways analysis
  • 46. PreliminaryResults S1 + S2 + … Sn P1 + P2 + … Pn Enzime CoFactor CoEnzime General Metabolic Reaction Model - GMRM Vectorization of GMRM S1 S2 S3 Enzime CoF CoE P1 P2 P3 MetaCyc KEGG1 2 RepresentationClassification CarlosManuelEstévez-BretónR.2012 DataSourceEvaluation Method 2Method 1 ROC Confusion matrix Entropy purity adjusted Rand Index Accuracy Pipeline paper paper paper
  • 47. Complexity Metabolic Pathway Reaction Metabolites/ome Metabolic Switch Glucose Glucose 6P ATP Hidrolase Pyrophosphate Vocabulary Words Molecules the Murder for a jar of red rum frog soap Document Phrase Paragraph rum Murder for jar a ofred rum Murder for jar a ofred Glucose Glucose 6PATP Hidrolase ADP+ + ADP LinguisticAnalogy S1 + S2 + … Sn P1 + P2 + … Pn Enzime CoFactor CoEnzime General Metabolic Reaction Model - GMRM Vectorization of GMRM S1 S2 S3 Enzime CoF CoE P1 P2 P3
  • 48. Representation S1 + S2 + … Sn P1 + P2 + … Pn Enzime CoFactor CoEnzime General Metabolic Reaction Model - GMRM Vectorization of GMRM S1 S2 S3 Enzime CoF CoE P1 P2 P3
  • 49. Classification Supervised 4Pathways 2carbohydrate metabolism 1lipid metabolism 1from nucleotide metabolism Support Vector Machines Classification Tree K Nearest Neighbour CN2Naive Bayes 24 organisms Method 1
  • 51. Review - Proposing a vector representation of biochemical reactions, based in a linguistic analogy. I´m going to classify metabolic networks only using functional features... To find patterns that suggests constitution rules on metabolic pathways. - Searching patterns by clustering.