SlideShare a Scribd company logo
1 of 36
A functional and evolutionary perspective on
transcription factor binding in Arabidopsis thaliana
Potsdam, October 2014
Comparative & Integrative Genomics group
Department of Plant Biotechnology and Bioinformatics, Ghent University
Department of Plant Systems Biology, VIB - Belgium
plaza_genomics
OVERVIEW
1. Transcriptional gene regulation in plants
2. Inference of transcriptonal networks using an ensemble
framework for phylogenetic footprinting in plants
3. An integrated gene regulatory network using experimental ChIP
data of 27 transcription factors in Arabidopsis
4. Conclusions
Jan Van de Velde
Ken Heyndrickx
1. TRANSCRIPTIONAL GENE REGULATION
Mejia-Guerra et al., 2012
Arabidopsis
-1,700-2,500 Transcription Factors
- 180-791 miRNA
- 2,708 expressed lncRNA
49MB non-coding DNA
11,000 regulatory
interactions (AtRegNet)
EXPERIMENTAL CHARACTERIZATION OF REGULATORY
INTERACTIONS
Mejia-Guerra et al., 2012
EMSA
Y1H
SELEX
PBM
ChIP-Seq
COMPUTATIONAL ANALYSIS OF CIS-REGULATORY
ELEMENTS
 Mapping of known TF binding sites on promoter sequences
 False positives
 Low quality motifs (PWMs) + many motifs lack information about
binding factor
 Motif redundancy & multi-gene transcription factor families
Database # CRE Species
PLACE 469 Vascular plants
AGRIS 99 Arabidopsis thaliana
AtProbe 172 Arabidopsis thaliana
PlantCARE 435 Monocots and dicots
2. PHYLOGENETIC FOOTPRINTING: DETECTION OF
CONSERVED NON-CODING SEQUENCES (CNS)
 Comparative analysis of noncoding DNA sequences to identify
candidate regulatory elements (in orthologous genes)
 Regulatory elements are conserved during evolution due to
functional constraint (vs. neutral carry-over)
 The power of phylogenetic footprinting is enhanced significantly
when data from a number of related species, which diverged
sufficiently, is available
DEVELOPING AN ENSEMBLE FRAMEWORK FOR
PHYLOGENETIC FOOTPRINTING IN PLANTS
 Application of motif mapping and
different pairwise alignment tools
 Aggregate alignments in multi-
species footprint using 11
comparator dicot genomes
 Evaluate statistical signifcance incl.
FDR analysis
AtProbe
Feature map
@ RSAT
144 regulatory elements (63 genes)
774 DNA motifs
FROM PAIRWISE ALIGNMENTS TO MULTI-SPECIES FOOTPRINTS
 Generate all pairwise alignments
between Arabidopsis query gene
and its orthologs
 Map all pairwise alignments back
to reference promoter
 Count per position the #species
that support a footprint
 Significance estimation
Van Bel, M., Proost, S., Wischnitzki, E., Movahedi, S., Scheerlinck, C., Van de Peer, Y., Vandepoele, K. (2012) Dissecting plant
genomes with the PLAZA comparative genomics platform. Plant Physiology 158:590-600
EVALUATION ATPROBE EXPERIMENTAL CIS-
REGULATORY ELEMENTS
Significance
Experimental
motifs
Scmm ACGTGGC = 0.54
P value < 0.001
G-box
Scmm ATAGATAA = 0.09
P value 0.48
GA motif
Scmm GATAAGATT = 0.36
P value < 0.001
I-box
RBCS1A
Scmm TATATATA = 0.7
P value < 0.001
GAPA
ACA motif
C-motif
PROPERTIES CNS
 69,361 CNSs associated with 17,895 genes
 Protein-coding genes (99%), miRNA genes (1%)
 Median length: 11nt (min-max: 5-514nt)
 CNS cover 1,070kb of the non-coding Arabidopsis genome
DETECTION OF EXPERIMENTAL REGULATORY
ELEMENTS
• Black boxes: percentage of
recovered elements
• White boxes: percentage
of uniquely recovered
elements in this study
RECOVERY OF IN VIVO FUNCTIONAL TARGETS
USING CNS INFORMATION
• White boxes: fold enrichment for CNSs
• Black boxes: fold enrichment naïve motif mapping
High-quality dataset 15 TFs
ChIP-Seq binding + TF binding site + differentially expressed TF perturbation (n=2708)
GENOME-WIDE REGULATORY ANNOTATION
Collapsed TF-target module network
 40,758 TF-target interactions (157 TFs)
 9/13 TFs significant overlap with experimentally
confirmed targets (AtRegNet/Hussey et al., 2013)
 Various functional genomics metrics confirm quality
predicted GRN
Van de Velde, J.*, Heyndrickx, K.S.*, and Vandepoele, K. (2014). Inference of Transcriptional Networks in Arabidopsis through
Conserved Noncoding Sequence Analysis. Plant Cell.
A CONDITION-SPECIFIC SECONDARY CELL WALL
GENE REGULATORY NETWORK.
2. AN INTEGRATED GENE REGULATORY NETWORK USING CHIP
DATA OF 27 TRANSCRIPTION FACTORS
 How is TF binding organised across different target genes?
 Have Highly Occupied Target (HOT) genes in plants the same
distinct regulatory features as in organisms?
 To what extent is binding linked to differential expression TF
binding site presence?
* Heyndrickx KS, Vandepoele K (2012) Systematic Identification of Functional Plant Modules through the Integration of Complementary Data
Sources. Plant physiology 159: 884-901
TF PROTEIN-PROTEIN INTERACTION NETWORK
De Bodt S, Hollunder J, Nelissen H, Meulemeester N, Inze D (2012) CORNET 2.0: integrating plant coexpression, protein-protein
interactions, regulatory interactions, gene associations and functional annotations. The New phytologist 195: 707-720
Flowering
Light Response
GENOMIC REGIONS BOUND BY TFS
GENOMIC REGIONS BOUND BY TFS
GENE LOCATION ANALYSIS TF BINDING EVENTS
 89% of upstream sites lie < 2kb
 23,891 / 26,717
 91% of downstream sites lie < 2kb
 11,687 / 12,828
COORDINATED REGULATION BY DIFFERENT TFS
BINDING SITE ORGANISATION
A
B
DH I sites: Zhang W, Zhang T, Wu Y, Jiang J (2012) Genome-Wide Identification of Regulatory DNA Elements and Protein-Binding
Footprints Using Signatures of Open Chromatin in Arabidopsis. The Plant cell
A
B
C
D
R
COOPERATIVE TF BINDING: HUB & HOT
#TargetGenes
# Bound TFs
Gene Complexity
A-D
 Hub genes
 1,170 genes bound by ≥ 8 TFs
 Significantly Enriched for TFs and miRNAs
 Highly Occupied Target (HOT) regions
 1,179 regions bound by ≥ 7 TFs
COOPERATIVE TF BINDING: HUB & HOT
A-D
Enrichment for regulatory genes
(TFs, kinases), response to
stimuli & developmental genes
CHROMATIN STATES AND NUCLEOTIDE DIVERSITY OF TF-
BOUND REGIONS
Sequeira-Mendes et al., … Gutierrez, C. (2014). The Functional Topography of the Arabidopsis Genome Is Organized in a Reduced
Number of Linear Motifs of Chromatin States. Plant Cell.
Population sequence diversity based on
369 Arabidopsis strains (Weigel lab)
HOT-ASSOCIATED GENES ARE FACTOR-RESPONSIVE
HOT-ASSOCIATED GENES ARE FACTOR-RESPONSIVE
EXPRESSION LEVELS ARE CORRELATED WITH
THE TOTAL NUMBER OF BOUND TFS
Low: < 3 TFs; Intermediate: >= 3 TFs and < 8TFs; hub: >= 8 TFs
(n=406 flowering-related genes)
A SINGLE MOTIF DOES NOT EXPLAIN BINDING
A SINGLE MOTIF DOES NOT EXPLAIN BINDING (II)
A SINGLE MOTIF DOES NOT EXPLAIN BINDING (II)
SEC. MOTIFS ARE ENRICHED FOR DE AS WELL
PIF5: ChIP-Seq
SEC. MOTIFS ARE ENRICHED FOR DE AS WELL
NEW HYPOTHESES ON CO-BINDING AND
TETHERING
AP1
PIF3
PRR7
FHY3AP1 SEP3
Co-binding Tethering
PRR5 PRR7
CONCLUSIONS & PERSPECTIVES
 Integration of CNS with complementary experimental data sources
offers new possibilities for regulatory gene annotation in plants
 High specificity to predict TF-target interactions
 Complementary to exerperimental TF-target detection methods
 Study GRN conservation and rewiring across species
 Integrated 27 TF ChIP-Seq gene regulatory network reveals
 Complexly regulated are enriched for regulatory genes
 HOT-associated regions represent functional binding events
 Open chromatin
 Sequence constraint
 TF binding sites
 Enrichment for regulated target genes
 Co-binding and tethering patterns could explain the apparent
discrepancy between binding and regulation in ChIP-chip/Seq
studies
FURTHER READING
Van de Velde, J.*, Heyndrickx, K.S.*, and Vandepoele, K. (2014). Inference of
Transcriptional Networks in Arabidopsis through Conserved Noncoding Sequence
Analysis. The Plant Cell 26(7):2729-2745
Proost, S., Van Bel, M. … and Vandepoele, K. (2015). PLAZA 3.0: an access point
for plant comparative genomics. Nucleic Acids Research (accepted)
Heyndrickx, K.S.*, Van de Velde, J.*, and Vandepoele, K. (2014). A functional
and evolutionary perspective on transcription factor binding in Arabidopsis
thaliana. The Plant Cell (accepted)

More Related Content

What's hot

Engineering plant immunity using crispr cas9 to generate virus resistance
Engineering plant immunity using crispr cas9 to generate virus resistanceEngineering plant immunity using crispr cas9 to generate virus resistance
Engineering plant immunity using crispr cas9 to generate virus resistance
Sheikh Mansoor
 
The new frontier of genome engineering
The new frontier of genome engineeringThe new frontier of genome engineering
The new frontier of genome engineering
PricyBark0
 
OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009
Sean Davis
 

What's hot (20)

Engineering plant immunity using crispr cas9 to generate virus resistance
Engineering plant immunity using crispr cas9 to generate virus resistanceEngineering plant immunity using crispr cas9 to generate virus resistance
Engineering plant immunity using crispr cas9 to generate virus resistance
 
Genome editing in Plants with crispr/cas9
Genome editing in Plants with crispr/cas9Genome editing in Plants with crispr/cas9
Genome editing in Plants with crispr/cas9
 
The new frontier of genome engineering
The new frontier of genome engineeringThe new frontier of genome engineering
The new frontier of genome engineering
 
GBS: Genotyping by sequencing
GBS: Genotyping by sequencingGBS: Genotyping by sequencing
GBS: Genotyping by sequencing
 
OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009
 
Plant genome sequencing and crop improvement
Plant genome sequencing and crop improvementPlant genome sequencing and crop improvement
Plant genome sequencing and crop improvement
 
CRISPR /Cas9
CRISPR /Cas9CRISPR /Cas9
CRISPR /Cas9
 
CRISPR, cas9 in plant disease resistance
CRISPR, cas9 in plant disease resistance CRISPR, cas9 in plant disease resistance
CRISPR, cas9 in plant disease resistance
 
MSU Transgenic and Genome Editing Facility
MSU Transgenic and Genome Editing FacilityMSU Transgenic and Genome Editing Facility
MSU Transgenic and Genome Editing Facility
 
Impact of advanced technology in biology
Impact of advanced technology in biologyImpact of advanced technology in biology
Impact of advanced technology in biology
 
Gene editing 1
Gene editing 1Gene editing 1
Gene editing 1
 
CRISPR – a novel tool for genome editing
CRISPR – a novel tool for genome editingCRISPR – a novel tool for genome editing
CRISPR – a novel tool for genome editing
 
Genome Editing CRISPR-Cas9
Genome Editing CRISPR-Cas9 Genome Editing CRISPR-Cas9
Genome Editing CRISPR-Cas9
 
181214 Bioinformática vegetal
181214 Bioinformática vegetal181214 Bioinformática vegetal
181214 Bioinformática vegetal
 
Dr amar Sharma
Dr amar SharmaDr amar Sharma
Dr amar Sharma
 
Who owns CRISPR? - An update on the Interference.
Who owns CRISPR? - An update on the Interference.Who owns CRISPR? - An update on the Interference.
Who owns CRISPR? - An update on the Interference.
 
Genotyping in Breeding programs
Genotyping in Breeding programsGenotyping in Breeding programs
Genotyping in Breeding programs
 
Crispr cas9
Crispr cas9Crispr cas9
Crispr cas9
 
Genome Sequencing in Finger Millet
Genome Sequencing in Finger MilletGenome Sequencing in Finger Millet
Genome Sequencing in Finger Millet
 
Est database
Est databaseEst database
Est database
 

Viewers also liked

Evolutionary perspective
Evolutionary perspectiveEvolutionary perspective
Evolutionary perspective
mpape
 
An Evolutionary Take on Human Nature
An Evolutionary Take on Human NatureAn Evolutionary Take on Human Nature
An Evolutionary Take on Human Nature
TEDtoChina
 
Major Theoretical Perspectives in Sociology
Major Theoretical Perspectives in SociologyMajor Theoretical Perspectives in Sociology
Major Theoretical Perspectives in Sociology
Kostyk Elf
 
The sociological perspectives
The sociological perspectivesThe sociological perspectives
The sociological perspectives
Qurat Ul Ain Ali
 
Major theoretical perspectives in sociology
Major theoretical perspectives in sociologyMajor theoretical perspectives in sociology
Major theoretical perspectives in sociology
Seth Allen
 

Viewers also liked (10)

Prediction of transcription factor binding to DNA using rule induction methods
Prediction of transcription factor binding to DNA using rule induction methodsPrediction of transcription factor binding to DNA using rule induction methods
Prediction of transcription factor binding to DNA using rule induction methods
 
A knowledge-based strategy for structural recognition of transcription factor...
A knowledge-based strategy for structural recognition of transcription factor...A knowledge-based strategy for structural recognition of transcription factor...
A knowledge-based strategy for structural recognition of transcription factor...
 
Transcription regulatory elements
Transcription regulatory elementsTranscription regulatory elements
Transcription regulatory elements
 
Evolutionary perspective
Evolutionary perspectiveEvolutionary perspective
Evolutionary perspective
 
Protein dna interaction
Protein dna interactionProtein dna interaction
Protein dna interaction
 
An Evolutionary Take on Human Nature
An Evolutionary Take on Human NatureAn Evolutionary Take on Human Nature
An Evolutionary Take on Human Nature
 
Major Theoretical Perspectives in Sociology
Major Theoretical Perspectives in SociologyMajor Theoretical Perspectives in Sociology
Major Theoretical Perspectives in Sociology
 
The sociological perspectives
The sociological perspectivesThe sociological perspectives
The sociological perspectives
 
Subtypes of Associated Protein-DNA (Transcription Factor-Transcription Factor...
Subtypes of Associated Protein-DNA (Transcription Factor-Transcription Factor...Subtypes of Associated Protein-DNA (Transcription Factor-Transcription Factor...
Subtypes of Associated Protein-DNA (Transcription Factor-Transcription Factor...
 
Major theoretical perspectives in sociology
Major theoretical perspectives in sociologyMajor theoretical perspectives in sociology
Major theoretical perspectives in sociology
 

Similar to A functional and evolutionary perspective on transcription factor binding in Arabidopsis thaliana

Architecture of the human regulatory network derived from encode data
Architecture of the human regulatory network derived from encode dataArchitecture of the human regulatory network derived from encode data
Architecture of the human regulatory network derived from encode data
Anax Fotopoulos
 
Akhtar et al. Stem Cell Reports (2015)
Akhtar et al. Stem Cell Reports (2015)Akhtar et al. Stem Cell Reports (2015)
Akhtar et al. Stem Cell Reports (2015)
Aslam Akhtar, MS
 

Similar to A functional and evolutionary perspective on transcription factor binding in Arabidopsis thaliana (20)

Inferring gene functions and regulatory interactions in plants using differen...
Inferring gene functions and regulatory interactions in plants using differen...Inferring gene functions and regulatory interactions in plants using differen...
Inferring gene functions and regulatory interactions in plants using differen...
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
 
Integrative inference of transcriptional networks in Arabidopsis yields novel...
Integrative inference of transcriptional networks in Arabidopsis yields novel...Integrative inference of transcriptional networks in Arabidopsis yields novel...
Integrative inference of transcriptional networks in Arabidopsis yields novel...
 
Austin Neurology & Neurosciences
Austin Neurology & NeurosciencesAustin Neurology & Neurosciences
Austin Neurology & Neurosciences
 
TF2Network: unravelling gene regulatory networks and transcription factor fun...
TF2Network: unravelling gene regulatory networks and transcription factor fun...TF2Network: unravelling gene regulatory networks and transcription factor fun...
TF2Network: unravelling gene regulatory networks and transcription factor fun...
 
Gene expression group presentation at GAW 19
Gene expression group presentation at GAW 19Gene expression group presentation at GAW 19
Gene expression group presentation at GAW 19
 
S. prasanth kumar young scientist awarded presentation
S. prasanth kumar young scientist awarded presentationS. prasanth kumar young scientist awarded presentation
S. prasanth kumar young scientist awarded presentation
 
Tair workshop stanford2017
Tair workshop stanford2017Tair workshop stanford2017
Tair workshop stanford2017
 
Leveraging functional genomics analytics for target discovery
Leveraging functional genomics analytics for target discoveryLeveraging functional genomics analytics for target discovery
Leveraging functional genomics analytics for target discovery
 
Genome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleGenome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattle
 
ACSESS200808
ACSESS200808ACSESS200808
ACSESS200808
 
RT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationRT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferation
 
Genomics Technologies
Genomics TechnologiesGenomics Technologies
Genomics Technologies
 
Application of molecular probes
Application of molecular probesApplication of molecular probes
Application of molecular probes
 
Architecture of the human regulatory network derived from encode data
Architecture of the human regulatory network derived from encode dataArchitecture of the human regulatory network derived from encode data
Architecture of the human regulatory network derived from encode data
 
Prediction Of Regulatory Elements
Prediction Of Regulatory ElementsPrediction Of Regulatory Elements
Prediction Of Regulatory Elements
 
Akhtar et al. Stem Cell Reports (2015)
Akhtar et al. Stem Cell Reports (2015)Akhtar et al. Stem Cell Reports (2015)
Akhtar et al. Stem Cell Reports (2015)
 
Roleoffunctionalgenomicsincropimprovement ashishgautam
Roleoffunctionalgenomicsincropimprovement ashishgautamRoleoffunctionalgenomicsincropimprovement ashishgautam
Roleoffunctionalgenomicsincropimprovement ashishgautam
 
Systemic analysis of data combined from genetic qtl's and gene expression dat...
Systemic analysis of data combined from genetic qtl's and gene expression dat...Systemic analysis of data combined from genetic qtl's and gene expression dat...
Systemic analysis of data combined from genetic qtl's and gene expression dat...
 

Recently uploaded

THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
ANSARKHAN96
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
Silpa
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
Silpa
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 

Recently uploaded (20)

Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
 
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsGenetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditions
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 

A functional and evolutionary perspective on transcription factor binding in Arabidopsis thaliana

  • 1. A functional and evolutionary perspective on transcription factor binding in Arabidopsis thaliana Potsdam, October 2014 Comparative & Integrative Genomics group Department of Plant Biotechnology and Bioinformatics, Ghent University Department of Plant Systems Biology, VIB - Belgium plaza_genomics
  • 2. OVERVIEW 1. Transcriptional gene regulation in plants 2. Inference of transcriptonal networks using an ensemble framework for phylogenetic footprinting in plants 3. An integrated gene regulatory network using experimental ChIP data of 27 transcription factors in Arabidopsis 4. Conclusions Jan Van de Velde Ken Heyndrickx
  • 3. 1. TRANSCRIPTIONAL GENE REGULATION Mejia-Guerra et al., 2012 Arabidopsis -1,700-2,500 Transcription Factors - 180-791 miRNA - 2,708 expressed lncRNA 49MB non-coding DNA 11,000 regulatory interactions (AtRegNet)
  • 4. EXPERIMENTAL CHARACTERIZATION OF REGULATORY INTERACTIONS Mejia-Guerra et al., 2012 EMSA Y1H SELEX PBM ChIP-Seq
  • 5. COMPUTATIONAL ANALYSIS OF CIS-REGULATORY ELEMENTS  Mapping of known TF binding sites on promoter sequences  False positives  Low quality motifs (PWMs) + many motifs lack information about binding factor  Motif redundancy & multi-gene transcription factor families Database # CRE Species PLACE 469 Vascular plants AGRIS 99 Arabidopsis thaliana AtProbe 172 Arabidopsis thaliana PlantCARE 435 Monocots and dicots
  • 6. 2. PHYLOGENETIC FOOTPRINTING: DETECTION OF CONSERVED NON-CODING SEQUENCES (CNS)  Comparative analysis of noncoding DNA sequences to identify candidate regulatory elements (in orthologous genes)  Regulatory elements are conserved during evolution due to functional constraint (vs. neutral carry-over)  The power of phylogenetic footprinting is enhanced significantly when data from a number of related species, which diverged sufficiently, is available
  • 7. DEVELOPING AN ENSEMBLE FRAMEWORK FOR PHYLOGENETIC FOOTPRINTING IN PLANTS  Application of motif mapping and different pairwise alignment tools  Aggregate alignments in multi- species footprint using 11 comparator dicot genomes  Evaluate statistical signifcance incl. FDR analysis AtProbe Feature map @ RSAT 144 regulatory elements (63 genes) 774 DNA motifs
  • 8. FROM PAIRWISE ALIGNMENTS TO MULTI-SPECIES FOOTPRINTS  Generate all pairwise alignments between Arabidopsis query gene and its orthologs  Map all pairwise alignments back to reference promoter  Count per position the #species that support a footprint  Significance estimation Van Bel, M., Proost, S., Wischnitzki, E., Movahedi, S., Scheerlinck, C., Van de Peer, Y., Vandepoele, K. (2012) Dissecting plant genomes with the PLAZA comparative genomics platform. Plant Physiology 158:590-600
  • 9. EVALUATION ATPROBE EXPERIMENTAL CIS- REGULATORY ELEMENTS Significance Experimental motifs Scmm ACGTGGC = 0.54 P value < 0.001 G-box Scmm ATAGATAA = 0.09 P value 0.48 GA motif Scmm GATAAGATT = 0.36 P value < 0.001 I-box RBCS1A Scmm TATATATA = 0.7 P value < 0.001 GAPA ACA motif C-motif
  • 10. PROPERTIES CNS  69,361 CNSs associated with 17,895 genes  Protein-coding genes (99%), miRNA genes (1%)  Median length: 11nt (min-max: 5-514nt)  CNS cover 1,070kb of the non-coding Arabidopsis genome
  • 11. DETECTION OF EXPERIMENTAL REGULATORY ELEMENTS • Black boxes: percentage of recovered elements • White boxes: percentage of uniquely recovered elements in this study
  • 12. RECOVERY OF IN VIVO FUNCTIONAL TARGETS USING CNS INFORMATION • White boxes: fold enrichment for CNSs • Black boxes: fold enrichment naïve motif mapping High-quality dataset 15 TFs ChIP-Seq binding + TF binding site + differentially expressed TF perturbation (n=2708)
  • 13. GENOME-WIDE REGULATORY ANNOTATION Collapsed TF-target module network  40,758 TF-target interactions (157 TFs)  9/13 TFs significant overlap with experimentally confirmed targets (AtRegNet/Hussey et al., 2013)  Various functional genomics metrics confirm quality predicted GRN Van de Velde, J.*, Heyndrickx, K.S.*, and Vandepoele, K. (2014). Inference of Transcriptional Networks in Arabidopsis through Conserved Noncoding Sequence Analysis. Plant Cell.
  • 14. A CONDITION-SPECIFIC SECONDARY CELL WALL GENE REGULATORY NETWORK.
  • 15. 2. AN INTEGRATED GENE REGULATORY NETWORK USING CHIP DATA OF 27 TRANSCRIPTION FACTORS  How is TF binding organised across different target genes?  Have Highly Occupied Target (HOT) genes in plants the same distinct regulatory features as in organisms?  To what extent is binding linked to differential expression TF binding site presence?
  • 16. * Heyndrickx KS, Vandepoele K (2012) Systematic Identification of Functional Plant Modules through the Integration of Complementary Data Sources. Plant physiology 159: 884-901
  • 17. TF PROTEIN-PROTEIN INTERACTION NETWORK De Bodt S, Hollunder J, Nelissen H, Meulemeester N, Inze D (2012) CORNET 2.0: integrating plant coexpression, protein-protein interactions, regulatory interactions, gene associations and functional annotations. The New phytologist 195: 707-720 Flowering Light Response
  • 20. GENE LOCATION ANALYSIS TF BINDING EVENTS  89% of upstream sites lie < 2kb  23,891 / 26,717  91% of downstream sites lie < 2kb  11,687 / 12,828
  • 21. COORDINATED REGULATION BY DIFFERENT TFS
  • 22. BINDING SITE ORGANISATION A B DH I sites: Zhang W, Zhang T, Wu Y, Jiang J (2012) Genome-Wide Identification of Regulatory DNA Elements and Protein-Binding Footprints Using Signatures of Open Chromatin in Arabidopsis. The Plant cell A B C D R
  • 23. COOPERATIVE TF BINDING: HUB & HOT #TargetGenes # Bound TFs Gene Complexity A-D
  • 24.  Hub genes  1,170 genes bound by ≥ 8 TFs  Significantly Enriched for TFs and miRNAs  Highly Occupied Target (HOT) regions  1,179 regions bound by ≥ 7 TFs COOPERATIVE TF BINDING: HUB & HOT A-D Enrichment for regulatory genes (TFs, kinases), response to stimuli & developmental genes
  • 25. CHROMATIN STATES AND NUCLEOTIDE DIVERSITY OF TF- BOUND REGIONS Sequeira-Mendes et al., … Gutierrez, C. (2014). The Functional Topography of the Arabidopsis Genome Is Organized in a Reduced Number of Linear Motifs of Chromatin States. Plant Cell. Population sequence diversity based on 369 Arabidopsis strains (Weigel lab)
  • 26. HOT-ASSOCIATED GENES ARE FACTOR-RESPONSIVE
  • 27. HOT-ASSOCIATED GENES ARE FACTOR-RESPONSIVE
  • 28. EXPRESSION LEVELS ARE CORRELATED WITH THE TOTAL NUMBER OF BOUND TFS Low: < 3 TFs; Intermediate: >= 3 TFs and < 8TFs; hub: >= 8 TFs (n=406 flowering-related genes)
  • 29. A SINGLE MOTIF DOES NOT EXPLAIN BINDING
  • 30. A SINGLE MOTIF DOES NOT EXPLAIN BINDING (II)
  • 31. A SINGLE MOTIF DOES NOT EXPLAIN BINDING (II)
  • 32. SEC. MOTIFS ARE ENRICHED FOR DE AS WELL PIF5: ChIP-Seq
  • 33. SEC. MOTIFS ARE ENRICHED FOR DE AS WELL
  • 34. NEW HYPOTHESES ON CO-BINDING AND TETHERING AP1 PIF3 PRR7 FHY3AP1 SEP3 Co-binding Tethering PRR5 PRR7
  • 35. CONCLUSIONS & PERSPECTIVES  Integration of CNS with complementary experimental data sources offers new possibilities for regulatory gene annotation in plants  High specificity to predict TF-target interactions  Complementary to exerperimental TF-target detection methods  Study GRN conservation and rewiring across species  Integrated 27 TF ChIP-Seq gene regulatory network reveals  Complexly regulated are enriched for regulatory genes  HOT-associated regions represent functional binding events  Open chromatin  Sequence constraint  TF binding sites  Enrichment for regulated target genes  Co-binding and tethering patterns could explain the apparent discrepancy between binding and regulation in ChIP-chip/Seq studies
  • 36. FURTHER READING Van de Velde, J.*, Heyndrickx, K.S.*, and Vandepoele, K. (2014). Inference of Transcriptional Networks in Arabidopsis through Conserved Noncoding Sequence Analysis. The Plant Cell 26(7):2729-2745 Proost, S., Van Bel, M. … and Vandepoele, K. (2015). PLAZA 3.0: an access point for plant comparative genomics. Nucleic Acids Research (accepted) Heyndrickx, K.S.*, Van de Velde, J.*, and Vandepoele, K. (2014). A functional and evolutionary perspective on transcription factor binding in Arabidopsis thaliana. The Plant Cell (accepted)