SlideShare a Scribd company logo
1 of 71
Download to read offline
Applications of network theory to human
population genetics:
from pathways to genotype networks

Giovanni Marco Dall'Olio
Pompeu Fabra University, Barcelona
Advisors: Jaume Bertranpetit
and Hafid Laayouni
Acknowledgments
●

I would like to thank:
–

My PhD supervisors, Jaume Bertranpetit and Hafid
Laayouni

–

My committee: Dr. Mauro Santos, Dr. Ricard Solé,
Prof. Guido Barbujani, Dr. Ferran Casals, Dra.
Yolanda Espinosa

–

The Evolutionary Systems Biology group at UPF

–

The Institut of Biologia Evolutiva

2
Topics
●

Context and motivations

●

My research:
–
–

Pathway approach on the N-Glycosylation pathway

–

The Genotype Network Approach

–
●

Annotating the N-Glycosylation pathway

The Human Selection Browser and Biostar

Conclusions

3
Context of the thesis
●

●

The first anatomically modern humans
appeared about 200,000 years ago
How can we understand the signals of genetic
adaptation in our genome, since then?

4
Factors that influenced recent
human evolution

New climates

Diseases

Agriculture

5
The opportunity
●

●

We have access to large datasets of human
sequences
Better annotations on gene function and role

6
Contributions
●

Find applications of network theory to
understand genetic adaptation in the human
species

7
Applications of network theory

●

●

The Pathway approach

The Genotype Network
approach
8
Topics
●

Context and motivations

●

My research:
–
–

Pathway approach on the N-Glycosylation pathway

–

The Genotype Network Approach

–
●

Annotating the N-Glycosylation pathway

The Human Selection Browser and Biostar

Conclusions

9
The Pathway approach
●

●

Genes are organized
in pathways
Any eventual selection
constraint will be
distributed among all
the genes of a
pathway

10
Distribution of Selection forces
in a pathway
●

Some positions of the
pathway will be more
likely to have stronger
signals of selection

11
Pathway Approach - outline
●

●

●

Build a Network
representation of a
pathway
Execute a test for
positive selection on
each gene
Determine how the
signals of selection
are distributed on the
network
12
Pathway approach on the
N-Glycosylation pathway
●

●

Asparagine
N-Glycosylation is a
metabolic pathway for
a type of protein
modification
The structure of this
pathway is easy to
represent as a
network
13
N-glycosylation - upstream part
●

●

Produces a single sugar called “N-Glycan precursor”
This sugar is required for the proper folding of most
membrane proteins

14
Adapted from Stanley, P., Schachter, H., & Taniguchi, N. (2009).
N-Glycans. Essentials of Glycobiology.
N-Glycosylation and protein folding
●

The product of the upstream part of N-glycosylation
is used as a signal to distinguish folded and unfolded
proteins

Folded protein

Un-Folded protein
15
N-glycosylation - downstream part
●

●

Complex pathway
composed by
thousands of reactions
Produces multiple
glycans, important for
cell-to-cell interactions

16
Hossler, P., Mulukutla, B. C., & Hu, W.-S. (2007). Systems analysis of
N-glycan processing in mammalian cells.
PloS one, 2(1), e713. doi:10.1371/journal.pone.0000713
Glycans on the cell surface
●

●

The surface of a cell is similar to a forest of
glycosylated proteins
Each organism and cell has a specific repertoire
of glycans

17
A. Doeer, Glycoproteomics. Nature Methods, 2011. doi:10.1038/nmeth.1821
Annotating the
N-Glycosylation pathway
●

In order to build a correct network model for the
N-Glycosylation pathway, we annotated it first in
the Reactome database

18
The N-Glycosylation pathway
in Reactome

19
The KEGG entry for N-Glycosylation
is incomplete
Downstream
N-Glycosylation
in KEGG

Real representation
of downstream
N-Glycosylation
20
Another error for N-Glycosylation
in KEGG

21
Erroneous annotation in String
●

There are two genes
with the symbol ALG2:
–

–

●

ALG2 (Asparagine
Linked Glycosylation 2)
ALG-2 (Apoptosis
Linked Gene – 2)

In String, these two were
confused

22
Ambigous interpretation of the term
N-Glycosylation in GO

N-Glycosylated pathway

Merged

N-Glycosylated protein
23
Annotating the
N-Glycosylation pathway
●

Annotated ~100 reactions in Reactome

●

Fixed ~50 Gene Ontology terms

●

Fixed key errors in String and KEGG

24
Network structure of
N-Glycosylation pathway

25
Dataset used
●

The CEPH-HGDP 650,000 Illumina chip dataset

●

940 individuals, from 50 human populations

26
Methods used
●

●

The FST index → measure of population
differentiation
The iHS test → identification of signals of
recent positive selection

27
FST – Population differentiation
●

●

FST is a measure of
population
differentiation
If the FST between two
population is 1, it
means that the two
populations are fixed
for different alleles
28
Signatures of population differentiation
in the N-Glycosylation pathway

FST signals are concentrated
in the downstream part, and
in the substrates biosynthesis

29
Population Differentiation
and network position
●

●

Node degree correlates
with the distribution of
FST signals
Genes with high FST are
generally more
connected

30
IHS and Long range haplotypes
●

●

A selective sweep may
cause the appearance of
long homozygous
haplotypes at a high
frequency
Example: a long
homozygous haplotype
present in the LCT gene
in North-European
populations
Vitti et al, Trends in genetics, 2012

31
IHS and Long range haplotypes:

iHS: Compares
the Extended
Haplotype
Homozygosity
decay (EHH
decay) between
ancestral and
derived allele

Voight et al., PLoS Genetics 2006

32
Signatures of selection in the
N-Glycosylation pathway

No difference in the distribution of
iHS signals between upstream
and downstream
33
Signatures of selection in the
N-Glycosylation pathway

GCS1: redirects to
protein folding
quality control

MGAT3:
redirects to
Hybrid Glycans

MAN2A1: redirects
to Complex Glycans
34
Pathway approach on N-Glycosylation
●

There is a difference in the patterns of population differentiation between the
two parts of the N-Glycosylation pathway

●

Signals of positive selection are more likely on key genes

●

One of the few works applying the pathway approach on human genetics

35
Topics
●

Context and motivations

●

My research:
–
–

Pathway approach on the N-Glycosylation pathway

–

The Genotype Network Approach

–
●

Annotating the N-Glycosylation pathway

The Human Selection Browser and Biostar

Conclusions

36
The Genotype Network approach
●

Genotype Networks
have been used to
study the “innovability”
and evolvability of a
genetic system

37
The Genotype Network approach
●

●

Genotype Networks
have been used to
study the “innovability”
and evolvability of a
genetic system
Never applied to
population genetics
data, because they
require too much data!

38
Genotype Networks - theory
●

John Maynard-Smith:
the concept of a Protein
Space, which is explored
by populations

39
Genotype Networks - theory
●

John Maynard-Smith:
the concept of a Protein
Space, which is explored
by populations

“if evolution by natural selection is
to occur, functional proteins [or
DNA sequences] must form a
continuous network which can be
traversed by unit mutational steps
without passing through nonfunctional intermediates”
40
Neutralism and Selectionism
●

●

Neutralism: most mutations are
neutral or deleterious
Selectionism: positive
mutations drive evolution

41
Genotype Networks help recoincile
Neutralism and Selectionism
●

●

Cycles of Neutral
evolution, alterned by
cycles of Selection
Even neutral or
negative mutations
can beneficial on the
long run, because
they allow to explore
the genotype space
42
The Genotype Network - definitions
●

●

The Genotype
Space of a region of
5 SNPs can be
represented as a
network
Each node is a
possible genotype,
and edge connect
nodes with only one
difference
43
The Genotype Network - definitions
●

●

Green nodes are
sequences observed
in a population
This is the Genotype
Network of a
population

44
Average Path Length of a Genotype
Network
●

●

This figure represents
two populations
The yellow one has
an higher Average
Path Length than the
blue one

45
Average Degree
●

●

●

●

This population has an
high Average Degree
It is more robust to
mutations

This population has a
low Average Degree
Mutations are more likely
to fall outside the
Genotype Network
46
Dataset analyzed
●
●

1000genomes data, phase 1
850 individuals genotyped, grouped into three
continental groups (AFR, EUR and ASN)

47
The VCF2Space library
●

●

●

Suite of Python
scripts to calculate
Genotype Networks
from a VCF file
~400,000 lines of
code
~350 unit tests

48
Splitting the genome into windows
of 11 SNPs
●

●

Less than 11 SNPs -> networks are too small and
condensed
More than 11 SNPs -> networks are too large and
sparse

Small network

Large network

49
Why windows of 11 SNPs?

50
Genotype Network properties of the
human genome

http://genome.ucsc.edu/cgi-bin/hgTracks?
db=hg19&hubUrl=http://bioevo.upf.edu/~gdallolio/genotype_space/hub.txt

51
Coding & Non-Coding regions
●

Coding regions have higher average path
length and degree than non coding regions

52
Genotype Networks and Selection
(simulated data)

Selection
Neutral

53
●

●

●

Coding networks:
high average path
lenght and degree

Non coding networks:
low average path lenght
and degree

Recent selection: lower
average path lenght and
degree

54
Genotype Network:
currently under review..

55
Topics
●

Context and motivations

●

My research:
–
–

Pathway approach on the N-Glycosylation pathway

–

The Genotype Network Approach

–
●

Annotating the N-Glycosylation pathway

The Human Selection Browser and Biostar

Conclusions

56
Other works: The Human Selection
Browser
●

We applied 21 tests for
positive selection to the
1,000 Genomes dataset
–

●

FST, CLR, iHS, etc...

This dataset will be
published and made freely
available as a genome
browser

57
Other works: Biostar
●

An online forum for bioinformatics

●

About 150,000 visits per month

●

Helped thousands of bioinformaticians!

58
Topics
●

Context and motivations

●

My research:
–
–

Pathway approach on the N-Glycosylation pathway

–

The Genotype Network Approach

–
●

Annotating the N-Glycosylation pathway

The Human Selection Browser and Biostar

Conclusions

59
Conclusions (I)
●

●

●

●

We developed two applications of network theory to the study
of human population genetics.
We produced a network model of the N-Glycosylation
pathway, contributing it to the Reactome database and
improving the annotations in other databases.
We showed that the downstream part of the N-Glycosylation
pathway shows more signatures of genetic differentiation than
the upstream part. This is compatible with the role and
structure of this part of the pathway.
We showed that key genes of the N-Glycosylation pathway,
such as GCS1, MGAT3 and MAN2A1, show signatures of
recent positive selection in human populations.
60
Conclusions (II)
●

●

●

We produced a suite of Python scripts, called
VCF2Space, to apply the concept of Genotype
Networks to Single Nucleotide Polimorphism data
Our genome-wide application of Genotype Networks
showed that coding regions tend to have networks
with higher average degree and path length than
non-coding regions
We contributed positively to the bioinformatics
community, providing resources such as the 1000
Genomes Selection Browser and Biostar
61
63
Figures credits
●

●

●

Slide 5:
humans: http://blogs.ancestry.com/ancestry/
star trek: http://en.wikipedia.org/wiki/Star_Trek:_The_Original_Series
Slide 6:
Malaria: http://science.psu.edu/news-and-events/2012-news/Read7-2012
Climates: http://www.ancienteco.com/2012/03/climate-change-drives-human-evolution.html
Agriculture: http://en.wikipedia.org/wiki/History_of_agriculture
Slide 7:
–

●

Slide 14:
–

●

Cover of Science, 23 March 2001

Slide 15:
–

●

1000 Genomes, CEPH-HGDP panel, UK10K, Hapmap websites

Adapted from Stanley, P., Schachter, H., & Taniguchi, N. (2009).
N-Glycans. Essentials of Glycobiology.

Slide 17:
–

Glycosylation, downstream: Hossler, P., Mulukutla, B. C., & Hu, W.-S. (2007). Systems analysis of
N-glycan processing in mammalian cells. PloS one, 2(1), e713. doi:10.1371/journal.pone.0000713
64
Figures credits
●

●

●

●

Slide 27:
http://www.cephb.fr/en/hgdp/diversity.php/
Slide 29:
http://www.rationalskepticism.org
Slide 32
Adapted from Vitti et al, 2012
Slide 42:
–

wikipedia

65
The Pathway approach
Stronger Selection on
Genes with high
connectivity or
upstream of a
pathway

66
N-glycosylation – how does it work
●

All the N-glycans are generated from a single
sugar with a very conserved structure, called
N-glycan precursor

N-glycan
precursor

Signal for
folded
proteins

Millions of
different
67

glycans
The FST test

Almost all the highest
signals of FST are in
genes of the
downstream part

68
The iHS test

GCS1 in
EUR

MAN2A1 in
SSAFR and
EASIA

MGAT3 in
EASIA

69
Combining p-values
●

●

●

From Peng et al, Eur J Hum Genet. 2010

Fisher's combination test
ZF follows a χ2(2K)
distribution
SNPs from the same
gene may violate the
assumption of
independency, but still the
method is robust to errors

70
Comparing upstream and
downstream N-Glycosylation
●

χ2 test comparing the
number of events
observed in the each
part of the pathway,
against what is the
number expected if
there were no
pathway structure

71
How to convert genotypes to
networks
●

Two haplotypes per individual

●

Reference allele → 0; Alternative allele → 1
Individual 1

AC AC AA GG TT TG CA TG

Ancestral alleles:

A A A G T T C T

haplotype a

00000000

haplotype b

11000111

72

More Related Content

What's hot

ASCB2014Phoebe_small
ASCB2014Phoebe_smallASCB2014Phoebe_small
ASCB2014Phoebe_smallPhoebe He
 
2016_Scientific Reports_Article
2016_Scientific Reports_Article2016_Scientific Reports_Article
2016_Scientific Reports_ArticleMauricio Rosenfeld
 
Development and motility of Dicty actin mutants
Development and motility of Dicty actin mutantsDevelopment and motility of Dicty actin mutants
Development and motility of Dicty actin mutantsAndrey Dementyev
 
Endocytosis and cytoskeleton dynamic
Endocytosis and cytoskeleton dynamicEndocytosis and cytoskeleton dynamic
Endocytosis and cytoskeleton dynamicRaul D-v
 
Somatic cell genetics by kk sahu
Somatic cell genetics by kk sahuSomatic cell genetics by kk sahu
Somatic cell genetics by kk sahuKAUSHAL SAHU
 
20081217 05邵彥春 與紅麴菌菌絲發育相關基因的克隆及序列分析
20081217 05邵彥春 與紅麴菌菌絲發育相關基因的克隆及序列分析20081217 05邵彥春 與紅麴菌菌絲發育相關基因的克隆及序列分析
20081217 05邵彥春 與紅麴菌菌絲發育相關基因的克隆及序列分析Monascus2008
 
Avila et al 2010 wnt 3
Avila et al 2010 wnt 3Avila et al 2010 wnt 3
Avila et al 2010 wnt 3Jorge Parodi
 
Evolution theory and science eng.12.1.2021
Evolution theory and science   eng.12.1.2021Evolution theory and science   eng.12.1.2021
Evolution theory and science eng.12.1.2021Heinonen Matti
 

What's hot (20)

ASCB2014Phoebe_small
ASCB2014Phoebe_smallASCB2014Phoebe_small
ASCB2014Phoebe_small
 
Protocols for genomics and proteomics
Protocols for genomics and proteomics Protocols for genomics and proteomics
Protocols for genomics and proteomics
 
Tutorial mitosis
Tutorial mitosisTutorial mitosis
Tutorial mitosis
 
Biology Finals Study Guide
Biology Finals Study GuideBiology Finals Study Guide
Biology Finals Study Guide
 
FinalLabReport
FinalLabReportFinalLabReport
FinalLabReport
 
2016_Scientific Reports_Article
2016_Scientific Reports_Article2016_Scientific Reports_Article
2016_Scientific Reports_Article
 
JustinDEvans
JustinDEvansJustinDEvans
JustinDEvans
 
Development and motility of Dicty actin mutants
Development and motility of Dicty actin mutantsDevelopment and motility of Dicty actin mutants
Development and motility of Dicty actin mutants
 
Grindberg - PNAS
Grindberg - PNASGrindberg - PNAS
Grindberg - PNAS
 
Meiosis Reduction Division
Meiosis Reduction Division Meiosis Reduction Division
Meiosis Reduction Division
 
Endocytosis and cytoskeleton dynamic
Endocytosis and cytoskeleton dynamicEndocytosis and cytoskeleton dynamic
Endocytosis and cytoskeleton dynamic
 
Mitochondrial and chloroplast DNA
Mitochondrial and chloroplast DNAMitochondrial and chloroplast DNA
Mitochondrial and chloroplast DNA
 
Somatic cell genetics by kk sahu
Somatic cell genetics by kk sahuSomatic cell genetics by kk sahu
Somatic cell genetics by kk sahu
 
20081217 05邵彥春 與紅麴菌菌絲發育相關基因的克隆及序列分析
20081217 05邵彥春 與紅麴菌菌絲發育相關基因的克隆及序列分析20081217 05邵彥春 與紅麴菌菌絲發育相關基因的克隆及序列分析
20081217 05邵彥春 與紅麴菌菌絲發育相關基因的克隆及序列分析
 
Cell division best
Cell division bestCell division best
Cell division best
 
Avila et al 2010 wnt 3
Avila et al 2010 wnt 3Avila et al 2010 wnt 3
Avila et al 2010 wnt 3
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Khyber medical university bio informatics
Khyber medical university bio informaticsKhyber medical university bio informatics
Khyber medical university bio informatics
 
Biology english
Biology englishBiology english
Biology english
 
Evolution theory and science eng.12.1.2021
Evolution theory and science   eng.12.1.2021Evolution theory and science   eng.12.1.2021
Evolution theory and science eng.12.1.2021
 

Viewers also liked

Dissertation oral defense presentation
Dissertation   oral defense presentationDissertation   oral defense presentation
Dissertation oral defense presentationDr. Naomi Mangatu
 
Prepare your Ph.D. Defense Presentation
Prepare your Ph.D. Defense PresentationPrepare your Ph.D. Defense Presentation
Prepare your Ph.D. Defense PresentationChristian Glahn
 
Powerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis DefencePowerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis DefenceCatie Chase
 
Thesis Powerpoint
Thesis PowerpointThesis Powerpoint
Thesis Powerpointneha47
 
Thesis Power Point Presentation
Thesis Power Point PresentationThesis Power Point Presentation
Thesis Power Point Presentationriddhikapandya1985
 
Ph D Thesis Defense Presentation
Ph D Thesis Defense PresentationPh D Thesis Defense Presentation
Ph D Thesis Defense PresentationDiaa ElKott
 
PhD defence presentation
PhD defence presentationPhD defence presentation
PhD defence presentationcsteinmann
 
How to Defend your Thesis Proposal like a Professional
How to Defend your Thesis Proposal like a ProfessionalHow to Defend your Thesis Proposal like a Professional
How to Defend your Thesis Proposal like a ProfessionalMiriam College
 
Powerpoint Presentation of PhD Viva
Powerpoint Presentation of PhD VivaPowerpoint Presentation of PhD Viva
Powerpoint Presentation of PhD VivaDr Mohan Savade
 
Selection index population_genetics
Selection index population_geneticsSelection index population_genetics
Selection index population_geneticsJinseob Kim
 
04_ETH Zurich Pavilion booklet_sm
04_ETH Zurich Pavilion booklet_sm04_ETH Zurich Pavilion booklet_sm
04_ETH Zurich Pavilion booklet_smLukas Fitze
 
Social Sharing In a Web of Things
Social Sharing In a Web of ThingsSocial Sharing In a Web of Things
Social Sharing In a Web of ThingsDominique Guinard
 
Perspectives of identifying Korean genetic variations
Perspectives of identifying Korean genetic variationsPerspectives of identifying Korean genetic variations
Perspectives of identifying Korean genetic variationsHong ChangBum
 
Human-Computer Interaction in Complex Artefact Ecologies
Human-Computer Interaction in Complex Artefact EcologiesHuman-Computer Interaction in Complex Artefact Ecologies
Human-Computer Interaction in Complex Artefact Ecologiesclemensklokmose
 
Service Integration in the Web of Things
Service Integration in the Web of ThingsService Integration in the Web of Things
Service Integration in the Web of ThingsSimon Mayer
 
Vlad Trifa - Final PhD Thesis Defense at ETH Zurich
Vlad Trifa - Final PhD Thesis Defense at ETH ZurichVlad Trifa - Final PhD Thesis Defense at ETH Zurich
Vlad Trifa - Final PhD Thesis Defense at ETH ZurichVlad Trifa
 

Viewers also liked (20)

Dissertation oral defense presentation
Dissertation   oral defense presentationDissertation   oral defense presentation
Dissertation oral defense presentation
 
Prepare your Ph.D. Defense Presentation
Prepare your Ph.D. Defense PresentationPrepare your Ph.D. Defense Presentation
Prepare your Ph.D. Defense Presentation
 
Powerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis DefencePowerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis Defence
 
Linux intro 1 definitions
Linux intro 1  definitionsLinux intro 1  definitions
Linux intro 1 definitions
 
Agile bioinf
Agile bioinfAgile bioinf
Agile bioinf
 
Thesis Powerpoint
Thesis PowerpointThesis Powerpoint
Thesis Powerpoint
 
Version control
Version controlVersion control
Version control
 
Thesis Power Point Presentation
Thesis Power Point PresentationThesis Power Point Presentation
Thesis Power Point Presentation
 
Ph D Thesis Defense Presentation
Ph D Thesis Defense PresentationPh D Thesis Defense Presentation
Ph D Thesis Defense Presentation
 
PhD defence presentation
PhD defence presentationPhD defence presentation
PhD defence presentation
 
How to Defend your Thesis Proposal like a Professional
How to Defend your Thesis Proposal like a ProfessionalHow to Defend your Thesis Proposal like a Professional
How to Defend your Thesis Proposal like a Professional
 
Powerpoint Presentation of PhD Viva
Powerpoint Presentation of PhD VivaPowerpoint Presentation of PhD Viva
Powerpoint Presentation of PhD Viva
 
Selection index population_genetics
Selection index population_geneticsSelection index population_genetics
Selection index population_genetics
 
04_ETH Zurich Pavilion booklet_sm
04_ETH Zurich Pavilion booklet_sm04_ETH Zurich Pavilion booklet_sm
04_ETH Zurich Pavilion booklet_sm
 
Social Sharing In a Web of Things
Social Sharing In a Web of ThingsSocial Sharing In a Web of Things
Social Sharing In a Web of Things
 
Perspectives of identifying Korean genetic variations
Perspectives of identifying Korean genetic variationsPerspectives of identifying Korean genetic variations
Perspectives of identifying Korean genetic variations
 
UTSpeaks: Raising babies (1 - Professor Maralyn Foureur)
UTSpeaks: Raising babies (1 - Professor Maralyn Foureur)UTSpeaks: Raising babies (1 - Professor Maralyn Foureur)
UTSpeaks: Raising babies (1 - Professor Maralyn Foureur)
 
Human-Computer Interaction in Complex Artefact Ecologies
Human-Computer Interaction in Complex Artefact EcologiesHuman-Computer Interaction in Complex Artefact Ecologies
Human-Computer Interaction in Complex Artefact Ecologies
 
Service Integration in the Web of Things
Service Integration in the Web of ThingsService Integration in the Web of Things
Service Integration in the Web of Things
 
Vlad Trifa - Final PhD Thesis Defense at ETH Zurich
Vlad Trifa - Final PhD Thesis Defense at ETH ZurichVlad Trifa - Final PhD Thesis Defense at ETH Zurich
Vlad Trifa - Final PhD Thesis Defense at ETH Zurich
 

Similar to Thesis defence of Dall'Olio Giovanni Marco. Applications of network theory to human population genetics: from pathways to genotype networks

A Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchA Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchHuda Nazeer
 
Whole Genome Sequencing .pptx
Whole Genome Sequencing .pptxWhole Genome Sequencing .pptx
Whole Genome Sequencing .pptxGyanchandSaini1
 
Human genome project
Human genome projectHuman genome project
Human genome projectJayaBellad
 
Molecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdfMolecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdfsabyabby
 
Amia tb-review-12
Amia tb-review-12Amia tb-review-12
Amia tb-review-12Russ Altman
 
PadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxPadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxDESMONDEZIEKE1
 
Sophie F. summer Poster Final
Sophie F. summer Poster FinalSophie F. summer Poster Final
Sophie F. summer Poster FinalSophie Friedheim
 
Genomics and proteomics by shreeman
Genomics and proteomics by shreemanGenomics and proteomics by shreeman
Genomics and proteomics by shreemanshreeman cs
 
Evotec - How can Knowledge Graphs support Druh Discovery
Evotec - How can Knowledge Graphs support Druh DiscoveryEvotec - How can Knowledge Graphs support Druh Discovery
Evotec - How can Knowledge Graphs support Druh DiscoveryNeo4j
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomicsNikhil Aggarwal
 
Epigenetic Analysis Sequencing
Epigenetic Analysis SequencingEpigenetic Analysis Sequencing
Epigenetic Analysis SequencingLisa Martinez
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationJoaquin Dopazo
 
Genetics in psychobiology
Genetics in psychobiologyGenetics in psychobiology
Genetics in psychobiologyjasleenbrar03
 

Similar to Thesis defence of Dall'Olio Giovanni Marco. Applications of network theory to human population genetics: from pathways to genotype networks (20)

Systems biology
Systems biologySystems biology
Systems biology
 
A Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchA Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products Research
 
Human genome project by M.Sohail Riaz Hashmi
Human genome project by M.Sohail Riaz HashmiHuman genome project by M.Sohail Riaz Hashmi
Human genome project by M.Sohail Riaz Hashmi
 
Human genome project ()
Human genome project ()Human genome project ()
Human genome project ()
 
Whole Genome Sequencing .pptx
Whole Genome Sequencing .pptxWhole Genome Sequencing .pptx
Whole Genome Sequencing .pptx
 
Human genome project
Human genome projectHuman genome project
Human genome project
 
Molecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdfMolecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdf
 
Pharmacogenomics
PharmacogenomicsPharmacogenomics
Pharmacogenomics
 
Amia tb-review-12
Amia tb-review-12Amia tb-review-12
Amia tb-review-12
 
PadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxPadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptx
 
Human genome project
Human genome projectHuman genome project
Human genome project
 
NGS and the molecular basis of disease: a practical view
NGS and the molecular basis of disease: a practical viewNGS and the molecular basis of disease: a practical view
NGS and the molecular basis of disease: a practical view
 
Sophie F. summer Poster Final
Sophie F. summer Poster FinalSophie F. summer Poster Final
Sophie F. summer Poster Final
 
Genomics and proteomics by shreeman
Genomics and proteomics by shreemanGenomics and proteomics by shreeman
Genomics and proteomics by shreeman
 
Evotec - How can Knowledge Graphs support Druh Discovery
Evotec - How can Knowledge Graphs support Druh DiscoveryEvotec - How can Knowledge Graphs support Druh Discovery
Evotec - How can Knowledge Graphs support Druh Discovery
 
Sprig16 d leronni
Sprig16 d leronni Sprig16 d leronni
Sprig16 d leronni
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
 
Epigenetic Analysis Sequencing
Epigenetic Analysis SequencingEpigenetic Analysis Sequencing
Epigenetic Analysis Sequencing
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical information
 
Genetics in psychobiology
Genetics in psychobiologyGenetics in psychobiology
Genetics in psychobiology
 

More from Giovanni Marco Dall'Olio (16)

Fehrman Nat Gen 2014 - Journal Club
Fehrman Nat Gen 2014 - Journal ClubFehrman Nat Gen 2014 - Journal Club
Fehrman Nat Gen 2014 - Journal Club
 
Linux intro 5 extra: awk
Linux intro 5 extra: awkLinux intro 5 extra: awk
Linux intro 5 extra: awk
 
Linux intro 5 extra: makefiles
Linux intro 5 extra: makefilesLinux intro 5 extra: makefiles
Linux intro 5 extra: makefiles
 
Linux intro 4 awk + makefile
Linux intro 4  awk + makefileLinux intro 4  awk + makefile
Linux intro 4 awk + makefile
 
Linux intro 3 grep + Unix piping
Linux intro 3 grep + Unix pipingLinux intro 3 grep + Unix piping
Linux intro 3 grep + Unix piping
 
Linux intro 2 basic terminal
Linux intro 2   basic terminalLinux intro 2   basic terminal
Linux intro 2 basic terminal
 
Hg for bioinformatics, second part
Hg for bioinformatics, second partHg for bioinformatics, second part
Hg for bioinformatics, second part
 
Hg version control bioinformaticians
Hg version control bioinformaticiansHg version control bioinformaticians
Hg version control bioinformaticians
 
The true story behind the annotation of a pathway
The true story behind the annotation of a pathwayThe true story behind the annotation of a pathway
The true story behind the annotation of a pathway
 
Plotting data with python and pylab
Plotting data with python and pylabPlotting data with python and pylab
Plotting data with python and pylab
 
Pycon
PyconPycon
Pycon
 
Makefiles Bioinfo
Makefiles BioinfoMakefiles Bioinfo
Makefiles Bioinfo
 
biopython, doctest and makefiles
biopython, doctest and makefilesbiopython, doctest and makefiles
biopython, doctest and makefiles
 
Web 2.0 e ricerca scientifica - Web 2.0 and scientific research
Web 2.0 e ricerca scientifica - Web 2.0 and scientific researchWeb 2.0 e ricerca scientifica - Web 2.0 and scientific research
Web 2.0 e ricerca scientifica - Web 2.0 and scientific research
 
Perl Bioinfo
Perl BioinfoPerl Bioinfo
Perl Bioinfo
 
(draft) perl e bioinformatica - presentazione per ipw2008
(draft) perl e bioinformatica - presentazione per ipw2008(draft) perl e bioinformatica - presentazione per ipw2008
(draft) perl e bioinformatica - presentazione per ipw2008
 

Recently uploaded

Lucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomLucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomdiscovermytutordmt
 
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Dipal Arora
 
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...chandars293
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort ServicePremium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Servicevidya singh
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...aartirawatdelhi
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...Garima Khatri
 
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...narwatsonia7
 
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls JaipurCall Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipurparulsinha
 
Chandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD availableChandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD availableDipal Arora
 
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...hotbabesbook
 
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...astropune
 
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...narwatsonia7
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableNehru place Escorts
 
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 

Recently uploaded (20)

Lucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomLucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
 
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
 
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
 
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort ServicePremium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
 
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
 
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
 
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls JaipurCall Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
 
Chandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD availableChandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD available
 
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
 
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
 
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
 
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
 
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
 

Thesis defence of Dall'Olio Giovanni Marco. Applications of network theory to human population genetics: from pathways to genotype networks

  • 1. Applications of network theory to human population genetics: from pathways to genotype networks Giovanni Marco Dall'Olio Pompeu Fabra University, Barcelona Advisors: Jaume Bertranpetit and Hafid Laayouni
  • 2. Acknowledgments ● I would like to thank: – My PhD supervisors, Jaume Bertranpetit and Hafid Laayouni – My committee: Dr. Mauro Santos, Dr. Ricard Solé, Prof. Guido Barbujani, Dr. Ferran Casals, Dra. Yolanda Espinosa – The Evolutionary Systems Biology group at UPF – The Institut of Biologia Evolutiva 2
  • 3. Topics ● Context and motivations ● My research: – – Pathway approach on the N-Glycosylation pathway – The Genotype Network Approach – ● Annotating the N-Glycosylation pathway The Human Selection Browser and Biostar Conclusions 3
  • 4. Context of the thesis ● ● The first anatomically modern humans appeared about 200,000 years ago How can we understand the signals of genetic adaptation in our genome, since then? 4
  • 5. Factors that influenced recent human evolution New climates Diseases Agriculture 5
  • 6. The opportunity ● ● We have access to large datasets of human sequences Better annotations on gene function and role 6
  • 7. Contributions ● Find applications of network theory to understand genetic adaptation in the human species 7
  • 8. Applications of network theory ● ● The Pathway approach The Genotype Network approach 8
  • 9. Topics ● Context and motivations ● My research: – – Pathway approach on the N-Glycosylation pathway – The Genotype Network Approach – ● Annotating the N-Glycosylation pathway The Human Selection Browser and Biostar Conclusions 9
  • 10. The Pathway approach ● ● Genes are organized in pathways Any eventual selection constraint will be distributed among all the genes of a pathway 10
  • 11. Distribution of Selection forces in a pathway ● Some positions of the pathway will be more likely to have stronger signals of selection 11
  • 12. Pathway Approach - outline ● ● ● Build a Network representation of a pathway Execute a test for positive selection on each gene Determine how the signals of selection are distributed on the network 12
  • 13. Pathway approach on the N-Glycosylation pathway ● ● Asparagine N-Glycosylation is a metabolic pathway for a type of protein modification The structure of this pathway is easy to represent as a network 13
  • 14. N-glycosylation - upstream part ● ● Produces a single sugar called “N-Glycan precursor” This sugar is required for the proper folding of most membrane proteins 14 Adapted from Stanley, P., Schachter, H., & Taniguchi, N. (2009). N-Glycans. Essentials of Glycobiology.
  • 15. N-Glycosylation and protein folding ● The product of the upstream part of N-glycosylation is used as a signal to distinguish folded and unfolded proteins Folded protein Un-Folded protein 15
  • 16. N-glycosylation - downstream part ● ● Complex pathway composed by thousands of reactions Produces multiple glycans, important for cell-to-cell interactions 16 Hossler, P., Mulukutla, B. C., & Hu, W.-S. (2007). Systems analysis of N-glycan processing in mammalian cells. PloS one, 2(1), e713. doi:10.1371/journal.pone.0000713
  • 17. Glycans on the cell surface ● ● The surface of a cell is similar to a forest of glycosylated proteins Each organism and cell has a specific repertoire of glycans 17 A. Doeer, Glycoproteomics. Nature Methods, 2011. doi:10.1038/nmeth.1821
  • 18. Annotating the N-Glycosylation pathway ● In order to build a correct network model for the N-Glycosylation pathway, we annotated it first in the Reactome database 18
  • 20. The KEGG entry for N-Glycosylation is incomplete Downstream N-Glycosylation in KEGG Real representation of downstream N-Glycosylation 20
  • 21. Another error for N-Glycosylation in KEGG 21
  • 22. Erroneous annotation in String ● There are two genes with the symbol ALG2: – – ● ALG2 (Asparagine Linked Glycosylation 2) ALG-2 (Apoptosis Linked Gene – 2) In String, these two were confused 22
  • 23. Ambigous interpretation of the term N-Glycosylation in GO N-Glycosylated pathway Merged N-Glycosylated protein 23
  • 24. Annotating the N-Glycosylation pathway ● Annotated ~100 reactions in Reactome ● Fixed ~50 Gene Ontology terms ● Fixed key errors in String and KEGG 24
  • 26. Dataset used ● The CEPH-HGDP 650,000 Illumina chip dataset ● 940 individuals, from 50 human populations 26
  • 27. Methods used ● ● The FST index → measure of population differentiation The iHS test → identification of signals of recent positive selection 27
  • 28. FST – Population differentiation ● ● FST is a measure of population differentiation If the FST between two population is 1, it means that the two populations are fixed for different alleles 28
  • 29. Signatures of population differentiation in the N-Glycosylation pathway FST signals are concentrated in the downstream part, and in the substrates biosynthesis 29
  • 30. Population Differentiation and network position ● ● Node degree correlates with the distribution of FST signals Genes with high FST are generally more connected 30
  • 31. IHS and Long range haplotypes ● ● A selective sweep may cause the appearance of long homozygous haplotypes at a high frequency Example: a long homozygous haplotype present in the LCT gene in North-European populations Vitti et al, Trends in genetics, 2012 31
  • 32. IHS and Long range haplotypes: iHS: Compares the Extended Haplotype Homozygosity decay (EHH decay) between ancestral and derived allele Voight et al., PLoS Genetics 2006 32
  • 33. Signatures of selection in the N-Glycosylation pathway No difference in the distribution of iHS signals between upstream and downstream 33
  • 34. Signatures of selection in the N-Glycosylation pathway GCS1: redirects to protein folding quality control MGAT3: redirects to Hybrid Glycans MAN2A1: redirects to Complex Glycans 34
  • 35. Pathway approach on N-Glycosylation ● There is a difference in the patterns of population differentiation between the two parts of the N-Glycosylation pathway ● Signals of positive selection are more likely on key genes ● One of the few works applying the pathway approach on human genetics 35
  • 36. Topics ● Context and motivations ● My research: – – Pathway approach on the N-Glycosylation pathway – The Genotype Network Approach – ● Annotating the N-Glycosylation pathway The Human Selection Browser and Biostar Conclusions 36
  • 37. The Genotype Network approach ● Genotype Networks have been used to study the “innovability” and evolvability of a genetic system 37
  • 38. The Genotype Network approach ● ● Genotype Networks have been used to study the “innovability” and evolvability of a genetic system Never applied to population genetics data, because they require too much data! 38
  • 39. Genotype Networks - theory ● John Maynard-Smith: the concept of a Protein Space, which is explored by populations 39
  • 40. Genotype Networks - theory ● John Maynard-Smith: the concept of a Protein Space, which is explored by populations “if evolution by natural selection is to occur, functional proteins [or DNA sequences] must form a continuous network which can be traversed by unit mutational steps without passing through nonfunctional intermediates” 40
  • 41. Neutralism and Selectionism ● ● Neutralism: most mutations are neutral or deleterious Selectionism: positive mutations drive evolution 41
  • 42. Genotype Networks help recoincile Neutralism and Selectionism ● ● Cycles of Neutral evolution, alterned by cycles of Selection Even neutral or negative mutations can beneficial on the long run, because they allow to explore the genotype space 42
  • 43. The Genotype Network - definitions ● ● The Genotype Space of a region of 5 SNPs can be represented as a network Each node is a possible genotype, and edge connect nodes with only one difference 43
  • 44. The Genotype Network - definitions ● ● Green nodes are sequences observed in a population This is the Genotype Network of a population 44
  • 45. Average Path Length of a Genotype Network ● ● This figure represents two populations The yellow one has an higher Average Path Length than the blue one 45
  • 46. Average Degree ● ● ● ● This population has an high Average Degree It is more robust to mutations This population has a low Average Degree Mutations are more likely to fall outside the Genotype Network 46
  • 47. Dataset analyzed ● ● 1000genomes data, phase 1 850 individuals genotyped, grouped into three continental groups (AFR, EUR and ASN) 47
  • 48. The VCF2Space library ● ● ● Suite of Python scripts to calculate Genotype Networks from a VCF file ~400,000 lines of code ~350 unit tests 48
  • 49. Splitting the genome into windows of 11 SNPs ● ● Less than 11 SNPs -> networks are too small and condensed More than 11 SNPs -> networks are too large and sparse Small network Large network 49
  • 50. Why windows of 11 SNPs? 50
  • 51. Genotype Network properties of the human genome http://genome.ucsc.edu/cgi-bin/hgTracks? db=hg19&hubUrl=http://bioevo.upf.edu/~gdallolio/genotype_space/hub.txt 51
  • 52. Coding & Non-Coding regions ● Coding regions have higher average path length and degree than non coding regions 52
  • 53. Genotype Networks and Selection (simulated data) Selection Neutral 53
  • 54. ● ● ● Coding networks: high average path lenght and degree Non coding networks: low average path lenght and degree Recent selection: lower average path lenght and degree 54
  • 56. Topics ● Context and motivations ● My research: – – Pathway approach on the N-Glycosylation pathway – The Genotype Network Approach – ● Annotating the N-Glycosylation pathway The Human Selection Browser and Biostar Conclusions 56
  • 57. Other works: The Human Selection Browser ● We applied 21 tests for positive selection to the 1,000 Genomes dataset – ● FST, CLR, iHS, etc... This dataset will be published and made freely available as a genome browser 57
  • 58. Other works: Biostar ● An online forum for bioinformatics ● About 150,000 visits per month ● Helped thousands of bioinformaticians! 58
  • 59. Topics ● Context and motivations ● My research: – – Pathway approach on the N-Glycosylation pathway – The Genotype Network Approach – ● Annotating the N-Glycosylation pathway The Human Selection Browser and Biostar Conclusions 59
  • 60. Conclusions (I) ● ● ● ● We developed two applications of network theory to the study of human population genetics. We produced a network model of the N-Glycosylation pathway, contributing it to the Reactome database and improving the annotations in other databases. We showed that the downstream part of the N-Glycosylation pathway shows more signatures of genetic differentiation than the upstream part. This is compatible with the role and structure of this part of the pathway. We showed that key genes of the N-Glycosylation pathway, such as GCS1, MGAT3 and MAN2A1, show signatures of recent positive selection in human populations. 60
  • 61. Conclusions (II) ● ● ● We produced a suite of Python scripts, called VCF2Space, to apply the concept of Genotype Networks to Single Nucleotide Polimorphism data Our genome-wide application of Genotype Networks showed that coding regions tend to have networks with higher average degree and path length than non-coding regions We contributed positively to the bioinformatics community, providing resources such as the 1000 Genomes Selection Browser and Biostar 61
  • 62. 63
  • 63. Figures credits ● ● ● Slide 5: humans: http://blogs.ancestry.com/ancestry/ star trek: http://en.wikipedia.org/wiki/Star_Trek:_The_Original_Series Slide 6: Malaria: http://science.psu.edu/news-and-events/2012-news/Read7-2012 Climates: http://www.ancienteco.com/2012/03/climate-change-drives-human-evolution.html Agriculture: http://en.wikipedia.org/wiki/History_of_agriculture Slide 7: – ● Slide 14: – ● Cover of Science, 23 March 2001 Slide 15: – ● 1000 Genomes, CEPH-HGDP panel, UK10K, Hapmap websites Adapted from Stanley, P., Schachter, H., & Taniguchi, N. (2009). N-Glycans. Essentials of Glycobiology. Slide 17: – Glycosylation, downstream: Hossler, P., Mulukutla, B. C., & Hu, W.-S. (2007). Systems analysis of N-glycan processing in mammalian cells. PloS one, 2(1), e713. doi:10.1371/journal.pone.0000713 64
  • 64. Figures credits ● ● ● ● Slide 27: http://www.cephb.fr/en/hgdp/diversity.php/ Slide 29: http://www.rationalskepticism.org Slide 32 Adapted from Vitti et al, 2012 Slide 42: – wikipedia 65
  • 65. The Pathway approach Stronger Selection on Genes with high connectivity or upstream of a pathway 66
  • 66. N-glycosylation – how does it work ● All the N-glycans are generated from a single sugar with a very conserved structure, called N-glycan precursor N-glycan precursor Signal for folded proteins Millions of different 67 glycans
  • 67. The FST test Almost all the highest signals of FST are in genes of the downstream part 68
  • 68. The iHS test GCS1 in EUR MAN2A1 in SSAFR and EASIA MGAT3 in EASIA 69
  • 69. Combining p-values ● ● ● From Peng et al, Eur J Hum Genet. 2010 Fisher's combination test ZF follows a χ2(2K) distribution SNPs from the same gene may violate the assumption of independency, but still the method is robust to errors 70
  • 70. Comparing upstream and downstream N-Glycosylation ● χ2 test comparing the number of events observed in the each part of the pathway, against what is the number expected if there were no pathway structure 71
  • 71. How to convert genotypes to networks ● Two haplotypes per individual ● Reference allele → 0; Alternative allele → 1 Individual 1 AC AC AA GG TT TG CA TG Ancestral alleles: A A A G T T C T haplotype a 00000000 haplotype b 11000111 72