SlideShare uma empresa Scribd logo
1 de 59
Manjappa
Ph. D. Scholar
Dept. of Genetics & Plant
Breeding
UAS, GKVK, Bengaluru, India
Catalyzing plant science research with RNA-seq
1
Central dogma of molecular biology
Transcriptome
(mRNA, rRNA, tRNA, and
other non-coding RNA)
2
Why to study transcriptome ?
It reflects the genes that are being actively expressed at any
given time (expression profiling)
Expression level of mRNAs in a given cell population varies
How an organism adapt to the developmental cues and
environmental fluctuations.
3
Quantify the changing expression levels of each transcript
during development and under different conditions
Catalogue all species of transcript (mRNAs, non-coding
RNAs & small RNAs)
Determine the transcriptional structure of genes, in terms
of their start sites, 5′ and 3′ ends, splicing patterns and
other post-transcriptional modifications
Aim of transcriptomics
4
Microarray Technology
5
Limitation in microarray technique
Reliance upon knowledge of genome sequence
High background levels owing to cross hybridization
Limited dynamic range of detection owing to background &
saturation of signals
Comparing expression levels across different experiments is often
difficult & can require complicated normalization methods
Sanger sequencing of cDNA or EST libraries:
- Relatively low throughput, expensive & generally not quantitative
Tag-based methods (SAGE, CAGE & MPSS):
 high throughput & precise, ‘digital’ gene expression levels
Most are based on expensive Sanger sequencing technology, & a
significant portion of the short tags cannot be uniquely mapped
to the reference genome
only a portion of the transcript is analyzed and isoforms are
generally indistinguishable from each other
6
Wang et. al, RNA-Seq: a revolutionary tool for transcriptomics, Nat. Rev. Genetics 10, 57-63, 2009).
Next generation sequencing (NGS)
Sample preparation
Data analysis:
Mapping reads
Visualization (Gbrowser)
De novo assembly
Quantification
RNA-sequencing
7
RNA-seq vs. microarray
• RNA-seq can be used to characterize novel transcripts and splicing
variants as well as to profile the expression levels of known
transcripts (but hybridization-based techniques are limited to detect
transcripts corresponding to known genomic sequences)
• Detect large dynamic range of expression levels (9,000 fold)
compared to microarray (100-few-hundred fold
• RNA-seq has higher resolution than whole genome tiling array
analysis
• In principle, mRNA can achieve single-base resolution, where
the resolution of tiling array depends on the density of probes
• High levels of reproducibility, for both technical and biological
replicates
• RNA-seq can apply the same experimental protocol to various
purposes, whereas specialized arrays need to be designed in these
cases
• Detecting SNPs (needs SNP array otherwise)
• Mapping exon junctions (needs junction array otherwise)
• Detecting gene fusions (needs gene fusion array otherwise)
8
RNA-seq and microarray agree fairly well only for
genes with medium levels of expression
Saccharomyces cerevisiae cells grown in nutrient-rich media. Correlation is very low
for genes with either low or high expression levels.
9
Advantages of RNA-Seq compared with other
transcriptomics methods
10
RNA Seq helps to look at
Alternative gene spliced transcripts
Post-transcriptional modifications
Gene fusion
Mutations/SNPs
Changes in gene expression
• Used to determine exon/intron boundaries
• Verify or amend previously annotated 5’ and 3’ gene
boundaries.
• Also includes miRNA, tRNA, and rRNA profiling
11
Alternative splicing
12
Library construction
RNA fragmentation (RNA
hydrolysis or nebulization) &
cDNA fragmentation (DNase I
treatment or sonication)
Bioinformatic challenges
Devt. of efficient methods to
store, retrieve and process
large amounts of data, which
must reduce errors in image
analysis and base-calling and
remove low-quality reads.
13
Challenges for RNA-Seq
bias at depleted 5′ and 3′ ends
bias at 3′ ends
14
Applications of RNA Seq
• First draft of Arabidopsis thaliana genome sequence (2000); its
annotation continues to be improved
• Large amounts of Sanger sequencing-generated EST data provided the
initial basis for gene identification and expression profiling
Expensive, time consuming, inherently biased against low-
abundance transcripts & are typically enriched in transcript
termini
• RNA-seq circumvents these limitations and provides accurate
resolution of splice junctions and alternative splicing events
• Arabidopsis transcriptome survey using Illumina shows
- At least 42% of intron-containing genes are alternatively
spliced (Filichkin et al., 2010)
- 61% when only multi-exonic genes are sampled
- ~48% of rice genes (Lu et al.,2010)
1. IMPROVING GENOME ANNOTATION WITH TRANSCRIPTOMIC DATA
15
Contd…
• Mining RNA seq data in search of TSS variation is improving gene
structure annotation and alternative TSSs have been detected in
∼10,000 loci in Arabidopsis and rice
(Tanaka et al., 2009).
• An ideal genome annotation would identify
 Genes that show invariant transcript sequences
 Those that exhibit alternative splicing and
 Link these events to specific spatial, temporal, developmental,
and/or environmental cues.
• Abiotic stress in Arabidopsis can increase or decrease the proportions of
apparently unproductive isoforms for some key regulatory genes,
supports alternative splicing is an important mechanism in the
regulation of gene function
(Filichkin et al., 2010)
16
Contd…
• Polymorphisms between different A. thaliana accessions is
one SNP per ∼200 bp.
• Complete re-sequencing of the transcriptomes and
annotation of different accessions helps to interpret the
functional consequences of polymorphism
• Utilizing genomic and transcriptomic data for in silico gene
prediction results in a more reliable annotated genome,
with Information on SNPs, indels, splice variants and
expression variation
17
Generating genomic and enabling proteomic
resources for “non-model” species
• Published plant genome sequences represents very small fraction of plant
taxonomic diversity
• Study of “non-model” species challenging
• de novo sequencing of the transcriptome to generate genetic resources
1. Eucalyptus (mizrachi et al., 2010)
2. Garlic (sun et al., 2012)
3. Pea (franssen et al., 2011),
4. Chestnut (barakat et al., 2009)
5. Chickpea (garg et al., 2011)
6. Olive (alagna et al., 2009)
7. Safflower (lulin et al., 2012
8. Japanese knotweed (Hao et al., 2011).
Gene annotation relies on identifying homologs, & ideally orthologs, in
species with an annotated genome (if no appropriate EST databases are
available)
If not, A. thaliana genome sequence (Gold std.)
Further confirmation; interrogating additional plant databases
Annotation with pre-existing EST database Eg: melon (Dai et al., 2011)
Same
function
different
function
18
• De novo RNA-seq to identify genetic polymorphisms
(molecular breeding), wherein multiple cultivars or close-
related species with variations in traits of interest are
sequenced and genetic variation is identified.
Allows generation of molecular markers to facilitate
progeny selection and molecular genetics research
Ex: 12,000 SSRs in a single RNA-seq analysis of sesame
(earlier only 80 SSRs), on average 1 genic-SSR per ∼8 kb
(Zhang et al., 2012)
5,234 SNPs in transcriptomes of five winter rye inbred
lines. Used in a high-throughput SNP genotyping array
(Haseneyer et al., 2011)
• Comparative sequence analysis of radish RNA- seq data and
Brassica rapa genome sequence lead to the discovery of
14,641 SSRs
Contd…
(Wang et al., 2012)
19
RNA Seq application to advance the field of
proteomics.
• Effective proteome profiling is generally considered to
depend heavily on the availability of a high-quality DNA
reference database
• High-throughput mass spectrometry-based protein
identification relies on the availability of an extensive DNA
sequence database in order to match experimentally
determined peptide masses with the theoretical proteome
generated by computationally translating transcripts
• RNA- seq based transcriptome profiling can provide an
effective data set for proteomic analysis of non-model
organisms
20
“RNA-Seq, facilitates the
matching of peptide mass
spectra with cognate gene
sequence”
• To test this, quantitative
analysis of the proteomes of
pollen from domesticated
tomato (Solanum
lycopersicum) and two wild
relatives
• RNA-Seq (454
pyrosequencing); >1200
proteins were identified
No major qualitative or quantitative differences were observed in the characterized proteomes
either with a highly curated community database of tomato sequences or the RNA-Seq database21
Characterizing temporal, spatial, regulatory,
and evolutionary transcriptome landscapes
Temporal transcriptome
• RNA-seq is increasingly being adopted to examine
transcriptional dynamics
• Ananalysis of transcriptome of grape berries during three
stages of devt. identified >6,500 genes that were expressed
in a stage-specific manner (Zenoni et al.,2010)
• Radish >21,000 genes differentially expressed at two
developmental stages of roots, includes genes strongly
linking root development with starch and sucrose
metabolism and with phenylpropanoid biosynthesis.
(Wang et al., 2012)
22
Objective: To understand the molecular mechanisms underlying tuberous root
formation and development.
• Radish (R. sativus) cultivar ‘Weixianqing’.
• Samples; cultivar ‘Weixianqing’.
• hypocotyl (1 cm, 7DAS) & true root (1 cm, 20 DAS (RLSS, the stage of cortex splitting),
10 seedlings of each were pooled together
• Illumina paired-end sequencing technology GAII platform (BGI; Shenzhen, China)
• Gene annotation: Comparative genome analysis between radish and Brassica rapa.
Unigenes were aligned with sequences in NCBI non-redundant protein (Nr) database,
Swiss-Prot protein database, Kyoto Encyclopedia of Genes and Genomes (KEGG)
pathway database & Cluster of Orthologous Groups (COG) database using BLASTx
Annotation by using Blast2GO program
23
• Sequence similarity search was conducted against the NCBI
Nr (85.51 %), Nt (90.18%) and Swiss-Prot protein databases
(54%) using the BLASTx algorithm
• 21,109 unigenes were assigned GO terms.
Functional annotation of all non-redundant unigenes
Gene Ontology classification of assembled unigenes
(9,271; 43.92 %)
24
Transcript differences between RESS and RLSS
13,453
8,389
To understand the functions of
DEGs, mapped all the DEGs to
terms in the KEGG database &
found 29 pathways were
significantly enriched.
carbohydrate, energy, lipid,
amino acid, other amino acids,
terpenoids and polyketides,
metabolism and biosynthesis
of other secondary metabolites
20 (starch and sucrose metabolism) and 25 (phenylpropanoid
biosynthesis) unigenes significantly up-regulated, play a critical roles in
regulating radish tuberous root formation. Also confirm finding of radish
root is rich with carbohydrates and phenolic compounds. 25
starch and sucrose metabolism
(303) and phenylpropanoid
biosynthesis (177) two
predominant groups
EST-derived SSR detection
26
• Previous gene expression studies using EST sequencing, spotted
microarrays & Affymetrix Gene Chip tech. (based on prior sequence)
• Provides only a fragmented picture of transcript accumulation patterns.
• RNA Seq to 7 tissues (leaf, flower, pod, two stages of pod-shell, root,
nodule) & 7 seed devt. Stages of BC5F5 plant G. max
• Compare transcript reads with recent genome sequence (assembly
Glyma1.01)
• Potential model for future RNA-Seq atlases
27
Mapping of short-read sequences:
• Illumina Genome analyser-II: produced 5.8-8.9 mill. 36-bp reads
for 7 non-seed tissues & 2.7-9.6 mill. 36-bp reads for seed tissues
• Alignment program GSNAP was used to map the reads to two
reference genomes: G. max and Bradyrhizobium japonicum.
• Digital gene expression analysis: 46,430 genes identified as “high
confidence” (correlation to full length cDNAs, ESTs, homology, &
ab initio methods)
• Of which 41,975 (90.4%) genes were transcriptionally active
Expression and gene structure
Coding regions of transcriptionally inactive genes were smaller and had a lower GC content
28
Hierarchical clustering of transcriptional
profiles in 14 tissues.
Tissue-specific analysis of the soybean transcriptome
Relative expression levels based on Z-score
analysis (3.4-3.6 more tissue specific)
early seed devt. stages late seed devt. stages
aerial tissues underground tissues
Z = (X-μ)/sd
Tissue
specific
Tissue
specific
29
Heatmap of the Legume Specific GenesHeatmap of top 500 highest expressed genes
Some legume specific genes
have tissue specific transcription
Glyma06g08290
Glyma04g08220 (Oleosin)
Glyma02g01590 (lectin precursor 1)
30
General trends in expression profiles for all genes tissue by tissue comparison (Fishers Exact test)
Higher transcriptional level
Importance: Understand Gene functions and molecular process occur during two stages.
GOslim analysis; between Seed 25 DAF & seed 28 DAF, seed 35 & 42 DAF, stably expressed
nutrient reservoir activity & urease activity. Which are imp activity in seed devt. 31
Genes structure and tissue specific gene expression
• Underground tissue have larger first exon, aerial has higher # of
exons.
• Significant difference in total transcription length among tissue
due to varying intron length.
• No significant difference between GC content and tissue
specificity
32
Boxplot Dendrogram of preferential expressed genes in seed development
RPKM normalized log2-
transformed expression
gene profiles
33
Summary
• RNA Seq-Atlas provides
• A record of high-resolution gene expression in a set of
14 diverse tissues
• Hierarchical clustering of transcriptional profiles for 14
tissues
• Relationship between gene structure and gene
expression
• Tissue-specific gene expression of both the most highly-
expressed genes and the genes specific to legumes in
seed development and nodule tissues
• A means of evaluating existing gene model annotations
for the Glycine max genome
34
Spatial transcriptome
• Most RNA-seq analyses target whole organs, or sets of
organs, which inherently prevents the identification of cell
or tissue type transcripts, and thus spatially coordinated
structural and regulatory gene networks.
• RNA-seq analysis of discrete tissues or cell types: Spatial
information and increase the depth of sequence coverage
Ex: >1000 genes have specifically or preferentially
expressed in Arabidopsis male meiocytes
• Acquiring tissue or cell-specific samples with any degree of
precision and minimal contamination is often technically
difficult
35
Methods of isolation of single cells
36
Contd…
• Matas et al. (2011): LCM + RNA-seq (454 pyrosequencing)
transcriptomes of 5 principal tissues of the developing
tomato fruit pericarp.
~ 21,000 unigenes identified & more than half showed
ubiquitous (57%) expression, while other showed cell
type-specific expression
• Takacs et al. (2012): LCM + RNA-seq (Illumina-based NGS)
study of the ontogeny of maize shoot apical meristem
59% of genes expressed ubiquitously
• A number of mammalian tissues also shown a high
proportion of ubiquitously expressed transcripts
“These studies may indicate that this is a common feature of
eukaryotes” (Ramsköld et al., 2009).
37
To study plant responses and adaptations to abiotic
and biotic stresses
Aim: Elucidate genes and gene networks that contribute to
sorghum’s tolerance to water-limiting environments with a long-
term aim of developing strategies to improve plant productivity
under drought
 Discovered >50 previously unknown drought- responsive genes.
38
Up Down
ABA ~2,300 ~2,600
PEG ~1,650 ~700
20 μM
8th day,
57.1 μM
20% PEG-8000
Transcript Analysis
in Response to
ABA and Osmotic
Stress
Method
ABA in response to plant
stress, and its central role
in other pathways,
(dormancy in leaf & seed)
LEA protein
WSI18 protein
dehydrin
sugar
substrate
transporter
peroxidase 6
39
A, brassinosteroid biosynthesis
B, cytokinins degradation
C, cytokinins glucoside biosynthesis
D, ent-kaurene biosynthesis
E, ethylene biosynthesis from methionine
F, gibberellin biosynthesis
G, gibberellin inactivation
H, IAA conjugate biosynthesis
I, jasmonic acid
Networks of hormone pathways in ABA-treated plants
Shoots Roots
Box=Hormone-related
Circle=non-hormone-related
Down Up DE genes
Dark blue solid lines= ≥10 blue
long-dashed lines=6-9 light
blue short-dashed= ≤5
 Only the brassinosteroid and JA biosynthesis pathways, and cytokinin glucoside and IAA
conjugate biosynthesis pathways are directly connected via DE genes.
 Indirect ‘cross-talk’ between the various hormones in response to osmotic stress and ABA
40
Determining the genes of unknown
function that respond to drought or
ABA treatment across species
Decision tree used to determine which
genes and their orthologs were regulated by
drought/ABA across different species
Overlap of drought-responsive sorghum
genes of unknown function that had drought-
responsive orthologs of unknown function in
other species
41
(51) (82)
(183)
• RNA-seq used reveal massive changes in
metabolism and cellular physiology of the green
alga Chlamydomonas reinhardtii when the cells
become deprived of sulfur
• studies of plant responses to pathogens
Ex: sorghum Bipolaris sorghicola
(Mizunoetal.,2012)
• Complexities of the metabolic pathways associated
with plant defense mechanisms
42
Study plant evolution and polyploidy.
• A comparison of the leaf transcriptome of an allopolyploid relative of
soybean with two species that contributed to its homoelogous genome,
allowed the determination of the contribution of the different genomes
to the transcriptome (Ilut et al., 2012)
• Maize endosperm trascriptome analysis; discovered 179 imprinted
genes and 38 imprinted long ncRNAs (Zhang et al., 2011)
• Transcriptome of 9 distinct tissues of three species of the Poaceae
family (Brachypodium, sorghum & rice) to determine whether
orthologous genes from these three species exhibit the same expression
patterns (Davidson et al., 2012)
 Only a fraction of orthologous genes exhibit conserved expression
patterns
 Orthologs in syntenic genomic blocks are more likely to share
correlated expression patterns compared with non-syntenic
orthologs.
 These findings are important for crop improvement (seq transfer)
43
Hierarchical clustering of 27 tissues (9 tissues x 3
species) based on correlations of log2 FPKM mapped
expression values of 3-taxa single-copy (3x3) genes
Classification of Brachypodium, rice, and
sorghum genes into orthologous groups
clustering of
corresponding
tissue
extensive expression divergence within 3 · 3 genes
Red: single copy (2 X 2 & 3 X 3)
Black: multicopy (2 X N & 3 x N)
OrthoMCL 44
Genes within each k-means co-expression
cluster were categorized based on OrthoMCL
category assignments or as lineage-specific
single-copy (1 x 1) genes.
Co-expression analyses identify conservation
of expression among orthologs and paralogs
Proportions of genes with at least one corresponding paralog or ortholog in the same cluster
Portion of Poaceae orthologs and paralogs share same
expression patterns across reproductive tissues
Some genes exhibited different expression phenotypes
45
Similar expression pattern in Poacea
which biological processes were over-represented in
orthologs/paralogs category ?
Ortholog/paral
ogs
Gene ontology (GO) annotation
2 x N genes Stress-related functions (‘response to biotic stimulus’, ‘defense response’,
‘apoptosis’), lipid transport, secretion (‘exocytosis’), and general
oxidation–reduction reactions.
3 x N (higher
substitution
rates)
Core metabolic functions; ‘translation, ATP biosynthesis, nucleosome
assembly, and biosynthetic process & oxidation–reduction, response to
wounding, sexual reproduction.
3 x 3 genes Essential functions: regulation of transcription’ (>1000 genes),
‘protein folding’ (253 genes), ‘intracellular protein transport’
(123 genes), and ‘glycolysis (91 genes)
2 x 2 genes protein amino acid phosphorylation, ‘regulation of transcription &
response to oxidative stress
46
Relationship between synteny and expression patterns of orthologs
Syntenic gene pairs within collinear blocks of at
least five genes were identified for all pairwise
combinations of three Poaceae species
Distributions of Pearson’s correlation coefficients (PCC)
synteny plays a significant role in
evolution of gene expression, especially
in the case of duplicate and multicopy
genes
47
Identifying and characterizing novel non-coding RNAs
• Insilico analysis provides a rapid way to identify putative sRNA
genes
• RNA-seq technology represents an excellent means for sRNA
discovery and validation
• Characterization of miRNAs regulatory functions to be facilitated
by determining tissue-specific expression pattern
• RNA-seq was used to identify sRNAs from five Arabidopsis root
tissues.
Some sRNAs expressed in all 5 tissues while others were
tissue and developmental zone specific
• The frequency of alternative slicing at miRNA binding sites is
significantly higher than that at other regions, suggesting that
alternative splicing is a significant regulatory mechanism.
• sRNAs have been recently characterized in the context of
association with epigenome modifications, including cytosine
methylation of genomic DNA
48
From co-expression networks to integrative data
analysis
• Sequencing whole transcriptomes provides a high degree of detail,
but deriving useful biological information from a long list of
expressed genes is typically not trivial
• Construct networks of co-expressed genes and to use gene ontology
(GO) information to help highlight important gene candidates as
critical components of functional networks
• Gene ontology enrichment analysis of RNA-seq data often illustrates
the complexity of interacting pathways
Robust Functiona
networks
Transcriptome: RNA-seq
proteomics
metabolomics
No correlation
ex: Soybean
protein X
Correlation
Ex:Oil plam
mesocarp Fatty
acid
49
Bulked Segregant RNA-Seq
SNP 2 being closely
related to the mutation to
map
linkage disequilibrium
between markers and
causal gene is determined
by quantifying the allelic
frequencies between two
samples
advantages:
(i) Having a reference genome is not
a prerequisite
(ii) Markers can be generated from
the experimental data
(iii) Differential expression profiles
(iv) Info on effects of mutant on
global patterns of gene
expression
(v) Provide map position of a gene
Liu et al. (2012)
BSA requires polymorphic markers
50
>64,000 SNPs
 Two alleles of a given SNP site should be
detected in approximately equal numbers of
RNA-Seq reads when considering both pools of
RNASeq data.
 Only one allele of a SNP that is completely
linked to the causal gene should be present
among the RNA-Seq reads from the mutant
pool
 In practice, as a consequence of Allele Specific
Expression and sampling bias, genes expressed
at low levels, single allele of many SNPs are
detected in the mutant pool.
 Empirical Bayesian approach used to estimate
linkage probability, i.e. probability of a SNP
exhibiting complete linkage disequilibrium
with the causal gene. 51
>64,000 SNPs
 Two alleles of a given SNP site should be
detected in approximately equal numbers of
RNA-Seq reads when considering both pools of
RNASeq data.
 Only one allele of a SNP that is completely
linked to the causal gene should be present
among the RNA-Seq reads from the mutant
pool
 In practice, as a consequence of Allele Specific
Expression and sampling bias, genes expressed
at low levels, single allele of many SNPs are
detected in the mutant pool.
 Empirical Bayesian approach used to estimate
linkage probability, i.e. probability of a SNP
exhibiting complete linkage disequilibrium
with the causal gene. 52
>64,000 SNPs
 Two alleles of a given SNP site should be
detected in approximately equal numbers of
RNA-Seq reads when considering both pools of
RNASeq data.
 Only one allele of a SNP that is completely
linked to the causal gene should be present
among the RNA-Seq reads from the mutant
pool
 In practice, as a consequence of Allele Specific
Expression and sampling bias, genes expressed
at low levels, single allele of many SNPs are
detected in the mutant pool.
 Empirical Bayesian approach used to estimate
linkage probability, i.e. probability of a SNP
exhibiting complete linkage disequilibrium
with the causal gene.
gl3-ref allele in a non-B73/B73
53
Reference genome
The top 10 windows with the highest median
linkage probability were located at physical
position ,183.5–185.2 Mb.
Fine mapping of gene.
1.Mutant gene expression will often be
down-regulated compared to the WT pool.
2. Collections of SNPs tightly linked to
mutant gene
3. SNPs linked to mutated gene can be used
for gene cloning via chromosome walking.
54
• Not necessary to use tissue with mutant gene
expression for BSR-Seq.
• However, if we collect tissue with expression we
can also get additional expression data.
Resolution of mapping depends on
1. # of individuals included in the bulks
2. Sequencing depth
3. Density of polymorphisms in mapping population
55
• International multi-disciplinary consortium; 1,000 plant sps. transcriptome data
• It is PPP project; funding of 75% from Govt. of Alberta, 25% by Musea
Ventures. BGI-Shenzhen- sequencing at reduced costs & iPlant collaborative -
computational informatics.
• Objectives:
1. Resolve many of the lingering uncertainties in species relationships,
especially in the early lineages of streptophyte green algae and land plants
2. To identify gene changes associated with the major innovations in
Viridiplantae evolution, such as multi-cellularity, transitions from marine to
freshwater or terrestrial environments, maternal retention of zygotes and
embryos, complex life history involving haploid and diploid phases, vascular
systems, seeds and flowers
• Species selection; representations of all major lineages across the Viridiplantae
(green plants), representing ~1 billion years of evolution, including flowering
plants, conifers, ferns, mosses and streptophyte green algae.
56
Resources available
1. Access to raw and processed data:
Content; transcriptome assemblies, putative coding
sequences, orthogroups and gene and species trees with
related sequence alignments.
2. High performance computing and cloud-based services:
iPlant discovery environment (DE) web interface (tutorials and
teaching materials available)
57
Phenylpropanoid synthesis pathway for Colchicum autumnale. Labelled rectangles are
proteins. Small circles are metabolites. Black lines show the KEGG pathway. Red lines show the
BioGRID interactions emanating from protein (K12355), which was interactively selected. A right-
click on the protein will display the inferred function and a link to the sequence(s)
Interactions & pathways
58
Conclusion
• RNA-sequencing is now well-established as a versatile
platform with applications in an ever growing number of
fields of plant biology research
• Ongoing developments in sequencing technologies, such as
increased read lengths, greater numbers of reads per run
• Advanced computational tools to facilitate sequence
assembly, analysis, and integration with orthogonal data
sets will further accelerate the breadth and frequency of its
adoption by plant scientists
59

Mais conteúdo relacionado

Mais procurados

Omics in plant breeding
Omics in plant breedingOmics in plant breeding
Omics in plant breedingpoornimakn04
 
Tilling and eco tilling
Tilling and eco tillingTilling and eco tilling
Tilling and eco tillingSuresh Antre
 
Genotyping by sequencing
Genotyping by sequencingGenotyping by sequencing
Genotyping by sequencingBhavya Sree
 
RNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionRNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionJatinder Singh
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisUniversity of California, Davis
 
Allele mining, tilling and eco tilling
Allele mining, tilling and eco tillingAllele mining, tilling and eco tilling
Allele mining, tilling and eco tillingkundan Jadhao
 
Transcriptomics: A time efficient tool for crop improvement
Transcriptomics: A time efficient tool for crop improvementTranscriptomics: A time efficient tool for crop improvement
Transcriptomics: A time efficient tool for crop improvementSajid Sheikh
 
COMPARATIVE GENOMICS.ppt
COMPARATIVE GENOMICS.pptCOMPARATIVE GENOMICS.ppt
COMPARATIVE GENOMICS.pptSilpa87
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsAmol Kunde
 
Association mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mappingAssociation mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mappingMahesh Biradar
 
Cereals genomics and protiomics
Cereals genomics and protiomicsCereals genomics and protiomics
Cereals genomics and protiomicsUsman Arshad
 
Transcriptomics and metabolomics
Transcriptomics and metabolomicsTranscriptomics and metabolomics
Transcriptomics and metabolomicsSukhjinder Singh
 

Mais procurados (20)

Transcriptome analysis
Transcriptome analysisTranscriptome analysis
Transcriptome analysis
 
SNp mining in crops
SNp mining in cropsSNp mining in crops
SNp mining in crops
 
TILLING & ECO-TILLING
TILLING & ECO-TILLINGTILLING & ECO-TILLING
TILLING & ECO-TILLING
 
Omics in plant breeding
Omics in plant breedingOmics in plant breeding
Omics in plant breeding
 
Tilling and eco tilling
Tilling and eco tillingTilling and eco tilling
Tilling and eco tilling
 
Genotyping by sequencing
Genotyping by sequencingGenotyping by sequencing
Genotyping by sequencing
 
RNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionRNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential Expression
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
 
Allele mining, tilling and eco tilling
Allele mining, tilling and eco tillingAllele mining, tilling and eco tilling
Allele mining, tilling and eco tilling
 
Transcriptomics: A time efficient tool for crop improvement
Transcriptomics: A time efficient tool for crop improvementTranscriptomics: A time efficient tool for crop improvement
Transcriptomics: A time efficient tool for crop improvement
 
Genome wide association mapping
Genome wide association mappingGenome wide association mapping
Genome wide association mapping
 
RNAseq Analysis
RNAseq AnalysisRNAseq Analysis
RNAseq Analysis
 
COMPARATIVE GENOMICS.ppt
COMPARATIVE GENOMICS.pptCOMPARATIVE GENOMICS.ppt
COMPARATIVE GENOMICS.ppt
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Association mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mappingAssociation mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mapping
 
markers and their role
markers and their rolemarkers and their role
markers and their role
 
Transcriptomics approaches
Transcriptomics approachesTranscriptomics approaches
Transcriptomics approaches
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Cereals genomics and protiomics
Cereals genomics and protiomicsCereals genomics and protiomics
Cereals genomics and protiomics
 
Transcriptomics and metabolomics
Transcriptomics and metabolomicsTranscriptomics and metabolomics
Transcriptomics and metabolomics
 

Destaque

Comparison between RNASeq and Microarray for Gene Expression Analysis
Comparison between RNASeq and Microarray for Gene Expression AnalysisComparison between RNASeq and Microarray for Gene Expression Analysis
Comparison between RNASeq and Microarray for Gene Expression AnalysisYaoyu Wang
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...VHIR Vall d’Hebron Institut de Recerca
 
2012 august 16 systems biology rna seq v2
2012 august 16 systems biology rna seq v22012 august 16 systems biology rna seq v2
2012 august 16 systems biology rna seq v2Anne Deslattes Mays
 
Part 1 of RNA-seq for DE analysis: Defining the goal
Part 1 of RNA-seq for DE analysis: Defining the goalPart 1 of RNA-seq for DE analysis: Defining the goal
Part 1 of RNA-seq for DE analysis: Defining the goalJoachim Jacob
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment DesignYaoyu Wang
 
Rnaseq basics ngs_application1
Rnaseq basics ngs_application1Rnaseq basics ngs_application1
Rnaseq basics ngs_application1Yaoyu Wang
 
RNA-seq differential expression analysis
RNA-seq differential expression analysisRNA-seq differential expression analysis
RNA-seq differential expression analysismikaelhuss
 
RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...
RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...
RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...Ann Loraine
 
Introduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seqIntroduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seqTimothy Tickle
 
Introduction to Galaxy and RNA-Seq
Introduction to Galaxy and RNA-SeqIntroduction to Galaxy and RNA-Seq
Introduction to Galaxy and RNA-SeqEnis Afgan
 
Next-Generation Sequencing and its Applications in RNA-Seq
Next-Generation Sequencing and its Applications in RNA-SeqNext-Generation Sequencing and its Applications in RNA-Seq
Next-Generation Sequencing and its Applications in RNA-Seqb0rAAs
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAGRF_Ltd
 
Kogo 2013 RNA-seq analysis
Kogo 2013 RNA-seq analysisKogo 2013 RNA-seq analysis
Kogo 2013 RNA-seq analysisJunsu Ko
 
Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...
Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...
Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...Bioo Scientific
 
Galaxy RNA-Seq Analysis: Tuxedo Protocol
Galaxy RNA-Seq Analysis: Tuxedo ProtocolGalaxy RNA-Seq Analysis: Tuxedo Protocol
Galaxy RNA-Seq Analysis: Tuxedo ProtocolHong ChangBum
 
Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...
Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...
Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...Christos Argyropoulos
 
RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: general concept, goal and experimental design - part 1RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: general concept, goal and experimental design - part 1BITS
 

Destaque (20)

Comparison between RNASeq and Microarray for Gene Expression Analysis
Comparison between RNASeq and Microarray for Gene Expression AnalysisComparison between RNASeq and Microarray for Gene Expression Analysis
Comparison between RNASeq and Microarray for Gene Expression Analysis
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
 
2012 august 16 systems biology rna seq v2
2012 august 16 systems biology rna seq v22012 august 16 systems biology rna seq v2
2012 august 16 systems biology rna seq v2
 
Part 1 of RNA-seq for DE analysis: Defining the goal
Part 1 of RNA-seq for DE analysis: Defining the goalPart 1 of RNA-seq for DE analysis: Defining the goal
Part 1 of RNA-seq for DE analysis: Defining the goal
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment Design
 
Rnaseq basics ngs_application1
Rnaseq basics ngs_application1Rnaseq basics ngs_application1
Rnaseq basics ngs_application1
 
RNA-seq differential expression analysis
RNA-seq differential expression analysisRNA-seq differential expression analysis
RNA-seq differential expression analysis
 
bioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics databioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics data
 
RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...
RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...
RNA-Seq analysis of blueberry fruit identifies candidate genes involved in ri...
 
Introduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seqIntroduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seq
 
Introduction to Galaxy and RNA-Seq
Introduction to Galaxy and RNA-SeqIntroduction to Galaxy and RNA-Seq
Introduction to Galaxy and RNA-Seq
 
Next-Generation Sequencing and its Applications in RNA-Seq
Next-Generation Sequencing and its Applications in RNA-SeqNext-Generation Sequencing and its Applications in RNA-Seq
Next-Generation Sequencing and its Applications in RNA-Seq
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysis
 
Kogo 2013 RNA-seq analysis
Kogo 2013 RNA-seq analysisKogo 2013 RNA-seq analysis
Kogo 2013 RNA-seq analysis
 
Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...
Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...
Bioo Scientific - Reduced Bias Small RNA Library Prep with Gel-Free or Low-In...
 
Galaxy RNA-Seq Analysis: Tuxedo Protocol
Galaxy RNA-Seq Analysis: Tuxedo ProtocolGalaxy RNA-Seq Analysis: Tuxedo Protocol
Galaxy RNA-Seq Analysis: Tuxedo Protocol
 
RNA-seq Analysis
RNA-seq AnalysisRNA-seq Analysis
RNA-seq Analysis
 
Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...
Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...
Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...
 
Introduction to next generation sequencing
Introduction to next generation sequencingIntroduction to next generation sequencing
Introduction to next generation sequencing
 
RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: general concept, goal and experimental design - part 1RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: general concept, goal and experimental design - part 1
 

Semelhante a Catalyzing Plant Science Research with RNA-seq

Genomics(functional genomics)
Genomics(functional genomics)Genomics(functional genomics)
Genomics(functional genomics)IndrajaDoradla
 
RNA Seq Data Analysis
RNA Seq Data AnalysisRNA Seq Data Analysis
RNA Seq Data AnalysisRavi Gandham
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGLong Pei
 
The research and application progress of transcriptome sequencing technology (i)
The research and application progress of transcriptome sequencing technology (i)The research and application progress of transcriptome sequencing technology (i)
The research and application progress of transcriptome sequencing technology (i)creativebiolabs11
 
Molecular markers by tahura mariyam ansari
Molecular markers by tahura mariyam ansariMolecular markers by tahura mariyam ansari
Molecular markers by tahura mariyam ansariTahura Mariyam Ansari
 
Please describe in detaila. How to create and to use a transpo.pdf
Please describe in detaila. How to create and to use a transpo.pdfPlease describe in detaila. How to create and to use a transpo.pdf
Please describe in detaila. How to create and to use a transpo.pdf4babies2010
 
16S Ribosomal DNA Sequence Analysis
16S Ribosomal DNA Sequence Analysis16S Ribosomal DNA Sequence Analysis
16S Ribosomal DNA Sequence AnalysisAbdulrahman Muhammad
 
Presentation1..gymno..non specific markers n microsatellites..by Nikita Patha...
Presentation1..gymno..non specific markers n microsatellites..by Nikita Patha...Presentation1..gymno..non specific markers n microsatellites..by Nikita Patha...
Presentation1..gymno..non specific markers n microsatellites..by Nikita Patha...NIKITAPATHANIA
 
Molecular markers: Outlook
Molecular markers: OutlookMolecular markers: Outlook
Molecular markers: OutlookAdhiyamaan Raj
 
Present status and recent developments on available molecular marker.pptx
Present status and recent developments on available molecular marker.pptxPresent status and recent developments on available molecular marker.pptx
Present status and recent developments on available molecular marker.pptxPrabhatSingh628463
 
DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification Senthil Natesan
 
Transcriptomics,techniqes, applications.pdf
Transcriptomics,techniqes, applications.pdfTranscriptomics,techniqes, applications.pdf
Transcriptomics,techniqes, applications.pdfshinycthomas
 
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...VHIR Vall d’Hebron Institut de Recerca
 
Molecular systematics.pdf
Molecular systematics.pdfMolecular systematics.pdf
Molecular systematics.pdfAartisoni17
 
Genomics Technologies
Genomics TechnologiesGenomics Technologies
Genomics TechnologiesSean Davis
 
Application of molecular probes
Application of molecular probesApplication of molecular probes
Application of molecular probesAyush Jain
 
15 molecular markers techniques
15 molecular markers techniques15 molecular markers techniques
15 molecular markers techniquesAVINASH KUSHWAHA
 

Semelhante a Catalyzing Plant Science Research with RNA-seq (20)

Genomics(functional genomics)
Genomics(functional genomics)Genomics(functional genomics)
Genomics(functional genomics)
 
RNA Seq Data Analysis
RNA Seq Data AnalysisRNA Seq Data Analysis
RNA Seq Data Analysis
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEG
 
The research and application progress of transcriptome sequencing technology (i)
The research and application progress of transcriptome sequencing technology (i)The research and application progress of transcriptome sequencing technology (i)
The research and application progress of transcriptome sequencing technology (i)
 
Gene expression profiling
Gene expression profilingGene expression profiling
Gene expression profiling
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Molecular markers by tahura mariyam ansari
Molecular markers by tahura mariyam ansariMolecular markers by tahura mariyam ansari
Molecular markers by tahura mariyam ansari
 
Please describe in detaila. How to create and to use a transpo.pdf
Please describe in detaila. How to create and to use a transpo.pdfPlease describe in detaila. How to create and to use a transpo.pdf
Please describe in detaila. How to create and to use a transpo.pdf
 
16S Ribosomal DNA Sequence Analysis
16S Ribosomal DNA Sequence Analysis16S Ribosomal DNA Sequence Analysis
16S Ribosomal DNA Sequence Analysis
 
Presentation1..gymno..non specific markers n microsatellites..by Nikita Patha...
Presentation1..gymno..non specific markers n microsatellites..by Nikita Patha...Presentation1..gymno..non specific markers n microsatellites..by Nikita Patha...
Presentation1..gymno..non specific markers n microsatellites..by Nikita Patha...
 
Molecular markers: Outlook
Molecular markers: OutlookMolecular markers: Outlook
Molecular markers: Outlook
 
Present status and recent developments on available molecular marker.pptx
Present status and recent developments on available molecular marker.pptxPresent status and recent developments on available molecular marker.pptx
Present status and recent developments on available molecular marker.pptx
 
DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification
 
Transcriptomics,techniqes, applications.pdf
Transcriptomics,techniqes, applications.pdfTranscriptomics,techniqes, applications.pdf
Transcriptomics,techniqes, applications.pdf
 
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
 
Molecular systematics.pdf
Molecular systematics.pdfMolecular systematics.pdf
Molecular systematics.pdf
 
Genomics Technologies
Genomics TechnologiesGenomics Technologies
Genomics Technologies
 
Application of molecular probes
Application of molecular probesApplication of molecular probes
Application of molecular probes
 
EiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.DEiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.D
 
15 molecular markers techniques
15 molecular markers techniques15 molecular markers techniques
15 molecular markers techniques
 

Mais de Manjappa Ganiger

Molecular basis of heterosis in crop plants
Molecular basis of heterosis in crop plantsMolecular basis of heterosis in crop plants
Molecular basis of heterosis in crop plantsManjappa Ganiger
 
The final split: Regulation of anther dehiscence
The final split: Regulation of anther dehiscenceThe final split: Regulation of anther dehiscence
The final split: Regulation of anther dehiscenceManjappa Ganiger
 
Gene introgression from wild relatives to cultivated plants
Gene introgression from wild relatives to cultivated plantsGene introgression from wild relatives to cultivated plants
Gene introgression from wild relatives to cultivated plantsManjappa Ganiger
 
Floral biology and crossing techniques in groundnut
Floral biology and crossing techniques in groundnutFloral biology and crossing techniques in groundnut
Floral biology and crossing techniques in groundnutManjappa Ganiger
 
Traditional knowledge changing scenario in india
Traditional knowledge changing scenario in indiaTraditional knowledge changing scenario in india
Traditional knowledge changing scenario in indiaManjappa Ganiger
 
Quality parameters in cucurbits & gourds
Quality parameters in cucurbits & gourdsQuality parameters in cucurbits & gourds
Quality parameters in cucurbits & gourdsManjappa Ganiger
 
Floral biology and crossing techniques in greengram
Floral biology and crossing techniques in greengramFloral biology and crossing techniques in greengram
Floral biology and crossing techniques in greengramManjappa Ganiger
 
Age related resisitance in plants
Age related resisitance in plantsAge related resisitance in plants
Age related resisitance in plantsManjappa Ganiger
 

Mais de Manjappa Ganiger (9)

Molecular basis of heterosis in crop plants
Molecular basis of heterosis in crop plantsMolecular basis of heterosis in crop plants
Molecular basis of heterosis in crop plants
 
The final split: Regulation of anther dehiscence
The final split: Regulation of anther dehiscenceThe final split: Regulation of anther dehiscence
The final split: Regulation of anther dehiscence
 
Gene introgression from wild relatives to cultivated plants
Gene introgression from wild relatives to cultivated plantsGene introgression from wild relatives to cultivated plants
Gene introgression from wild relatives to cultivated plants
 
Floral biology and crossing techniques in groundnut
Floral biology and crossing techniques in groundnutFloral biology and crossing techniques in groundnut
Floral biology and crossing techniques in groundnut
 
Traditional knowledge changing scenario in india
Traditional knowledge changing scenario in indiaTraditional knowledge changing scenario in india
Traditional knowledge changing scenario in india
 
Quality parameters in cucurbits & gourds
Quality parameters in cucurbits & gourdsQuality parameters in cucurbits & gourds
Quality parameters in cucurbits & gourds
 
Monosomics
MonosomicsMonosomics
Monosomics
 
Floral biology and crossing techniques in greengram
Floral biology and crossing techniques in greengramFloral biology and crossing techniques in greengram
Floral biology and crossing techniques in greengram
 
Age related resisitance in plants
Age related resisitance in plantsAge related resisitance in plants
Age related resisitance in plants
 

Último

Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuinethapagita
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubaikojalkojal131
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptxpallavirawat456
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxEran Akiva Sinbar
 
Introduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxIntroduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxMedical College
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxRitchAndruAgustin
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxmaryFF1
 
Ai in communication electronicss[1].pptx
Ai in communication electronicss[1].pptxAi in communication electronicss[1].pptx
Ai in communication electronicss[1].pptxsubscribeus100
 
Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫qfactory1
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptJoemSTuliba
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests GlycosidesNandakishor Bhaurao Deshmukh
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS
 

Último (20)

Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptx
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptx
 
Introduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxIntroduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptx
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
 
Ai in communication electronicss[1].pptx
Ai in communication electronicss[1].pptxAi in communication electronicss[1].pptx
Ai in communication electronicss[1].pptx
 
Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.ppt
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical Engineering
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
 

Catalyzing Plant Science Research with RNA-seq

  • 1. Manjappa Ph. D. Scholar Dept. of Genetics & Plant Breeding UAS, GKVK, Bengaluru, India Catalyzing plant science research with RNA-seq 1
  • 2. Central dogma of molecular biology Transcriptome (mRNA, rRNA, tRNA, and other non-coding RNA) 2
  • 3. Why to study transcriptome ? It reflects the genes that are being actively expressed at any given time (expression profiling) Expression level of mRNAs in a given cell population varies How an organism adapt to the developmental cues and environmental fluctuations. 3
  • 4. Quantify the changing expression levels of each transcript during development and under different conditions Catalogue all species of transcript (mRNAs, non-coding RNAs & small RNAs) Determine the transcriptional structure of genes, in terms of their start sites, 5′ and 3′ ends, splicing patterns and other post-transcriptional modifications Aim of transcriptomics 4
  • 6. Limitation in microarray technique Reliance upon knowledge of genome sequence High background levels owing to cross hybridization Limited dynamic range of detection owing to background & saturation of signals Comparing expression levels across different experiments is often difficult & can require complicated normalization methods Sanger sequencing of cDNA or EST libraries: - Relatively low throughput, expensive & generally not quantitative Tag-based methods (SAGE, CAGE & MPSS):  high throughput & precise, ‘digital’ gene expression levels Most are based on expensive Sanger sequencing technology, & a significant portion of the short tags cannot be uniquely mapped to the reference genome only a portion of the transcript is analyzed and isoforms are generally indistinguishable from each other 6
  • 7. Wang et. al, RNA-Seq: a revolutionary tool for transcriptomics, Nat. Rev. Genetics 10, 57-63, 2009). Next generation sequencing (NGS) Sample preparation Data analysis: Mapping reads Visualization (Gbrowser) De novo assembly Quantification RNA-sequencing 7
  • 8. RNA-seq vs. microarray • RNA-seq can be used to characterize novel transcripts and splicing variants as well as to profile the expression levels of known transcripts (but hybridization-based techniques are limited to detect transcripts corresponding to known genomic sequences) • Detect large dynamic range of expression levels (9,000 fold) compared to microarray (100-few-hundred fold • RNA-seq has higher resolution than whole genome tiling array analysis • In principle, mRNA can achieve single-base resolution, where the resolution of tiling array depends on the density of probes • High levels of reproducibility, for both technical and biological replicates • RNA-seq can apply the same experimental protocol to various purposes, whereas specialized arrays need to be designed in these cases • Detecting SNPs (needs SNP array otherwise) • Mapping exon junctions (needs junction array otherwise) • Detecting gene fusions (needs gene fusion array otherwise) 8
  • 9. RNA-seq and microarray agree fairly well only for genes with medium levels of expression Saccharomyces cerevisiae cells grown in nutrient-rich media. Correlation is very low for genes with either low or high expression levels. 9
  • 10. Advantages of RNA-Seq compared with other transcriptomics methods 10
  • 11. RNA Seq helps to look at Alternative gene spliced transcripts Post-transcriptional modifications Gene fusion Mutations/SNPs Changes in gene expression • Used to determine exon/intron boundaries • Verify or amend previously annotated 5’ and 3’ gene boundaries. • Also includes miRNA, tRNA, and rRNA profiling 11
  • 13. Library construction RNA fragmentation (RNA hydrolysis or nebulization) & cDNA fragmentation (DNase I treatment or sonication) Bioinformatic challenges Devt. of efficient methods to store, retrieve and process large amounts of data, which must reduce errors in image analysis and base-calling and remove low-quality reads. 13 Challenges for RNA-Seq bias at depleted 5′ and 3′ ends bias at 3′ ends
  • 15. • First draft of Arabidopsis thaliana genome sequence (2000); its annotation continues to be improved • Large amounts of Sanger sequencing-generated EST data provided the initial basis for gene identification and expression profiling Expensive, time consuming, inherently biased against low- abundance transcripts & are typically enriched in transcript termini • RNA-seq circumvents these limitations and provides accurate resolution of splice junctions and alternative splicing events • Arabidopsis transcriptome survey using Illumina shows - At least 42% of intron-containing genes are alternatively spliced (Filichkin et al., 2010) - 61% when only multi-exonic genes are sampled - ~48% of rice genes (Lu et al.,2010) 1. IMPROVING GENOME ANNOTATION WITH TRANSCRIPTOMIC DATA 15
  • 16. Contd… • Mining RNA seq data in search of TSS variation is improving gene structure annotation and alternative TSSs have been detected in ∼10,000 loci in Arabidopsis and rice (Tanaka et al., 2009). • An ideal genome annotation would identify  Genes that show invariant transcript sequences  Those that exhibit alternative splicing and  Link these events to specific spatial, temporal, developmental, and/or environmental cues. • Abiotic stress in Arabidopsis can increase or decrease the proportions of apparently unproductive isoforms for some key regulatory genes, supports alternative splicing is an important mechanism in the regulation of gene function (Filichkin et al., 2010) 16
  • 17. Contd… • Polymorphisms between different A. thaliana accessions is one SNP per ∼200 bp. • Complete re-sequencing of the transcriptomes and annotation of different accessions helps to interpret the functional consequences of polymorphism • Utilizing genomic and transcriptomic data for in silico gene prediction results in a more reliable annotated genome, with Information on SNPs, indels, splice variants and expression variation 17
  • 18. Generating genomic and enabling proteomic resources for “non-model” species • Published plant genome sequences represents very small fraction of plant taxonomic diversity • Study of “non-model” species challenging • de novo sequencing of the transcriptome to generate genetic resources 1. Eucalyptus (mizrachi et al., 2010) 2. Garlic (sun et al., 2012) 3. Pea (franssen et al., 2011), 4. Chestnut (barakat et al., 2009) 5. Chickpea (garg et al., 2011) 6. Olive (alagna et al., 2009) 7. Safflower (lulin et al., 2012 8. Japanese knotweed (Hao et al., 2011). Gene annotation relies on identifying homologs, & ideally orthologs, in species with an annotated genome (if no appropriate EST databases are available) If not, A. thaliana genome sequence (Gold std.) Further confirmation; interrogating additional plant databases Annotation with pre-existing EST database Eg: melon (Dai et al., 2011) Same function different function 18
  • 19. • De novo RNA-seq to identify genetic polymorphisms (molecular breeding), wherein multiple cultivars or close- related species with variations in traits of interest are sequenced and genetic variation is identified. Allows generation of molecular markers to facilitate progeny selection and molecular genetics research Ex: 12,000 SSRs in a single RNA-seq analysis of sesame (earlier only 80 SSRs), on average 1 genic-SSR per ∼8 kb (Zhang et al., 2012) 5,234 SNPs in transcriptomes of five winter rye inbred lines. Used in a high-throughput SNP genotyping array (Haseneyer et al., 2011) • Comparative sequence analysis of radish RNA- seq data and Brassica rapa genome sequence lead to the discovery of 14,641 SSRs Contd… (Wang et al., 2012) 19
  • 20. RNA Seq application to advance the field of proteomics. • Effective proteome profiling is generally considered to depend heavily on the availability of a high-quality DNA reference database • High-throughput mass spectrometry-based protein identification relies on the availability of an extensive DNA sequence database in order to match experimentally determined peptide masses with the theoretical proteome generated by computationally translating transcripts • RNA- seq based transcriptome profiling can provide an effective data set for proteomic analysis of non-model organisms 20
  • 21. “RNA-Seq, facilitates the matching of peptide mass spectra with cognate gene sequence” • To test this, quantitative analysis of the proteomes of pollen from domesticated tomato (Solanum lycopersicum) and two wild relatives • RNA-Seq (454 pyrosequencing); >1200 proteins were identified No major qualitative or quantitative differences were observed in the characterized proteomes either with a highly curated community database of tomato sequences or the RNA-Seq database21
  • 22. Characterizing temporal, spatial, regulatory, and evolutionary transcriptome landscapes Temporal transcriptome • RNA-seq is increasingly being adopted to examine transcriptional dynamics • Ananalysis of transcriptome of grape berries during three stages of devt. identified >6,500 genes that were expressed in a stage-specific manner (Zenoni et al.,2010) • Radish >21,000 genes differentially expressed at two developmental stages of roots, includes genes strongly linking root development with starch and sucrose metabolism and with phenylpropanoid biosynthesis. (Wang et al., 2012) 22
  • 23. Objective: To understand the molecular mechanisms underlying tuberous root formation and development. • Radish (R. sativus) cultivar ‘Weixianqing’. • Samples; cultivar ‘Weixianqing’. • hypocotyl (1 cm, 7DAS) & true root (1 cm, 20 DAS (RLSS, the stage of cortex splitting), 10 seedlings of each were pooled together • Illumina paired-end sequencing technology GAII platform (BGI; Shenzhen, China) • Gene annotation: Comparative genome analysis between radish and Brassica rapa. Unigenes were aligned with sequences in NCBI non-redundant protein (Nr) database, Swiss-Prot protein database, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database & Cluster of Orthologous Groups (COG) database using BLASTx Annotation by using Blast2GO program 23
  • 24. • Sequence similarity search was conducted against the NCBI Nr (85.51 %), Nt (90.18%) and Swiss-Prot protein databases (54%) using the BLASTx algorithm • 21,109 unigenes were assigned GO terms. Functional annotation of all non-redundant unigenes Gene Ontology classification of assembled unigenes (9,271; 43.92 %) 24
  • 25. Transcript differences between RESS and RLSS 13,453 8,389 To understand the functions of DEGs, mapped all the DEGs to terms in the KEGG database & found 29 pathways were significantly enriched. carbohydrate, energy, lipid, amino acid, other amino acids, terpenoids and polyketides, metabolism and biosynthesis of other secondary metabolites 20 (starch and sucrose metabolism) and 25 (phenylpropanoid biosynthesis) unigenes significantly up-regulated, play a critical roles in regulating radish tuberous root formation. Also confirm finding of radish root is rich with carbohydrates and phenolic compounds. 25 starch and sucrose metabolism (303) and phenylpropanoid biosynthesis (177) two predominant groups
  • 27. • Previous gene expression studies using EST sequencing, spotted microarrays & Affymetrix Gene Chip tech. (based on prior sequence) • Provides only a fragmented picture of transcript accumulation patterns. • RNA Seq to 7 tissues (leaf, flower, pod, two stages of pod-shell, root, nodule) & 7 seed devt. Stages of BC5F5 plant G. max • Compare transcript reads with recent genome sequence (assembly Glyma1.01) • Potential model for future RNA-Seq atlases 27
  • 28. Mapping of short-read sequences: • Illumina Genome analyser-II: produced 5.8-8.9 mill. 36-bp reads for 7 non-seed tissues & 2.7-9.6 mill. 36-bp reads for seed tissues • Alignment program GSNAP was used to map the reads to two reference genomes: G. max and Bradyrhizobium japonicum. • Digital gene expression analysis: 46,430 genes identified as “high confidence” (correlation to full length cDNAs, ESTs, homology, & ab initio methods) • Of which 41,975 (90.4%) genes were transcriptionally active Expression and gene structure Coding regions of transcriptionally inactive genes were smaller and had a lower GC content 28
  • 29. Hierarchical clustering of transcriptional profiles in 14 tissues. Tissue-specific analysis of the soybean transcriptome Relative expression levels based on Z-score analysis (3.4-3.6 more tissue specific) early seed devt. stages late seed devt. stages aerial tissues underground tissues Z = (X-μ)/sd Tissue specific Tissue specific 29
  • 30. Heatmap of the Legume Specific GenesHeatmap of top 500 highest expressed genes Some legume specific genes have tissue specific transcription Glyma06g08290 Glyma04g08220 (Oleosin) Glyma02g01590 (lectin precursor 1) 30
  • 31. General trends in expression profiles for all genes tissue by tissue comparison (Fishers Exact test) Higher transcriptional level Importance: Understand Gene functions and molecular process occur during two stages. GOslim analysis; between Seed 25 DAF & seed 28 DAF, seed 35 & 42 DAF, stably expressed nutrient reservoir activity & urease activity. Which are imp activity in seed devt. 31
  • 32. Genes structure and tissue specific gene expression • Underground tissue have larger first exon, aerial has higher # of exons. • Significant difference in total transcription length among tissue due to varying intron length. • No significant difference between GC content and tissue specificity 32
  • 33. Boxplot Dendrogram of preferential expressed genes in seed development RPKM normalized log2- transformed expression gene profiles 33
  • 34. Summary • RNA Seq-Atlas provides • A record of high-resolution gene expression in a set of 14 diverse tissues • Hierarchical clustering of transcriptional profiles for 14 tissues • Relationship between gene structure and gene expression • Tissue-specific gene expression of both the most highly- expressed genes and the genes specific to legumes in seed development and nodule tissues • A means of evaluating existing gene model annotations for the Glycine max genome 34
  • 35. Spatial transcriptome • Most RNA-seq analyses target whole organs, or sets of organs, which inherently prevents the identification of cell or tissue type transcripts, and thus spatially coordinated structural and regulatory gene networks. • RNA-seq analysis of discrete tissues or cell types: Spatial information and increase the depth of sequence coverage Ex: >1000 genes have specifically or preferentially expressed in Arabidopsis male meiocytes • Acquiring tissue or cell-specific samples with any degree of precision and minimal contamination is often technically difficult 35
  • 36. Methods of isolation of single cells 36
  • 37. Contd… • Matas et al. (2011): LCM + RNA-seq (454 pyrosequencing) transcriptomes of 5 principal tissues of the developing tomato fruit pericarp. ~ 21,000 unigenes identified & more than half showed ubiquitous (57%) expression, while other showed cell type-specific expression • Takacs et al. (2012): LCM + RNA-seq (Illumina-based NGS) study of the ontogeny of maize shoot apical meristem 59% of genes expressed ubiquitously • A number of mammalian tissues also shown a high proportion of ubiquitously expressed transcripts “These studies may indicate that this is a common feature of eukaryotes” (Ramsköld et al., 2009). 37
  • 38. To study plant responses and adaptations to abiotic and biotic stresses Aim: Elucidate genes and gene networks that contribute to sorghum’s tolerance to water-limiting environments with a long- term aim of developing strategies to improve plant productivity under drought  Discovered >50 previously unknown drought- responsive genes. 38
  • 39. Up Down ABA ~2,300 ~2,600 PEG ~1,650 ~700 20 μM 8th day, 57.1 μM 20% PEG-8000 Transcript Analysis in Response to ABA and Osmotic Stress Method ABA in response to plant stress, and its central role in other pathways, (dormancy in leaf & seed) LEA protein WSI18 protein dehydrin sugar substrate transporter peroxidase 6 39
  • 40. A, brassinosteroid biosynthesis B, cytokinins degradation C, cytokinins glucoside biosynthesis D, ent-kaurene biosynthesis E, ethylene biosynthesis from methionine F, gibberellin biosynthesis G, gibberellin inactivation H, IAA conjugate biosynthesis I, jasmonic acid Networks of hormone pathways in ABA-treated plants Shoots Roots Box=Hormone-related Circle=non-hormone-related Down Up DE genes Dark blue solid lines= ≥10 blue long-dashed lines=6-9 light blue short-dashed= ≤5  Only the brassinosteroid and JA biosynthesis pathways, and cytokinin glucoside and IAA conjugate biosynthesis pathways are directly connected via DE genes.  Indirect ‘cross-talk’ between the various hormones in response to osmotic stress and ABA 40
  • 41. Determining the genes of unknown function that respond to drought or ABA treatment across species Decision tree used to determine which genes and their orthologs were regulated by drought/ABA across different species Overlap of drought-responsive sorghum genes of unknown function that had drought- responsive orthologs of unknown function in other species 41 (51) (82) (183)
  • 42. • RNA-seq used reveal massive changes in metabolism and cellular physiology of the green alga Chlamydomonas reinhardtii when the cells become deprived of sulfur • studies of plant responses to pathogens Ex: sorghum Bipolaris sorghicola (Mizunoetal.,2012) • Complexities of the metabolic pathways associated with plant defense mechanisms 42
  • 43. Study plant evolution and polyploidy. • A comparison of the leaf transcriptome of an allopolyploid relative of soybean with two species that contributed to its homoelogous genome, allowed the determination of the contribution of the different genomes to the transcriptome (Ilut et al., 2012) • Maize endosperm trascriptome analysis; discovered 179 imprinted genes and 38 imprinted long ncRNAs (Zhang et al., 2011) • Transcriptome of 9 distinct tissues of three species of the Poaceae family (Brachypodium, sorghum & rice) to determine whether orthologous genes from these three species exhibit the same expression patterns (Davidson et al., 2012)  Only a fraction of orthologous genes exhibit conserved expression patterns  Orthologs in syntenic genomic blocks are more likely to share correlated expression patterns compared with non-syntenic orthologs.  These findings are important for crop improvement (seq transfer) 43
  • 44. Hierarchical clustering of 27 tissues (9 tissues x 3 species) based on correlations of log2 FPKM mapped expression values of 3-taxa single-copy (3x3) genes Classification of Brachypodium, rice, and sorghum genes into orthologous groups clustering of corresponding tissue extensive expression divergence within 3 · 3 genes Red: single copy (2 X 2 & 3 X 3) Black: multicopy (2 X N & 3 x N) OrthoMCL 44
  • 45. Genes within each k-means co-expression cluster were categorized based on OrthoMCL category assignments or as lineage-specific single-copy (1 x 1) genes. Co-expression analyses identify conservation of expression among orthologs and paralogs Proportions of genes with at least one corresponding paralog or ortholog in the same cluster Portion of Poaceae orthologs and paralogs share same expression patterns across reproductive tissues Some genes exhibited different expression phenotypes 45 Similar expression pattern in Poacea
  • 46. which biological processes were over-represented in orthologs/paralogs category ? Ortholog/paral ogs Gene ontology (GO) annotation 2 x N genes Stress-related functions (‘response to biotic stimulus’, ‘defense response’, ‘apoptosis’), lipid transport, secretion (‘exocytosis’), and general oxidation–reduction reactions. 3 x N (higher substitution rates) Core metabolic functions; ‘translation, ATP biosynthesis, nucleosome assembly, and biosynthetic process & oxidation–reduction, response to wounding, sexual reproduction. 3 x 3 genes Essential functions: regulation of transcription’ (>1000 genes), ‘protein folding’ (253 genes), ‘intracellular protein transport’ (123 genes), and ‘glycolysis (91 genes) 2 x 2 genes protein amino acid phosphorylation, ‘regulation of transcription & response to oxidative stress 46
  • 47. Relationship between synteny and expression patterns of orthologs Syntenic gene pairs within collinear blocks of at least five genes were identified for all pairwise combinations of three Poaceae species Distributions of Pearson’s correlation coefficients (PCC) synteny plays a significant role in evolution of gene expression, especially in the case of duplicate and multicopy genes 47
  • 48. Identifying and characterizing novel non-coding RNAs • Insilico analysis provides a rapid way to identify putative sRNA genes • RNA-seq technology represents an excellent means for sRNA discovery and validation • Characterization of miRNAs regulatory functions to be facilitated by determining tissue-specific expression pattern • RNA-seq was used to identify sRNAs from five Arabidopsis root tissues. Some sRNAs expressed in all 5 tissues while others were tissue and developmental zone specific • The frequency of alternative slicing at miRNA binding sites is significantly higher than that at other regions, suggesting that alternative splicing is a significant regulatory mechanism. • sRNAs have been recently characterized in the context of association with epigenome modifications, including cytosine methylation of genomic DNA 48
  • 49. From co-expression networks to integrative data analysis • Sequencing whole transcriptomes provides a high degree of detail, but deriving useful biological information from a long list of expressed genes is typically not trivial • Construct networks of co-expressed genes and to use gene ontology (GO) information to help highlight important gene candidates as critical components of functional networks • Gene ontology enrichment analysis of RNA-seq data often illustrates the complexity of interacting pathways Robust Functiona networks Transcriptome: RNA-seq proteomics metabolomics No correlation ex: Soybean protein X Correlation Ex:Oil plam mesocarp Fatty acid 49
  • 50. Bulked Segregant RNA-Seq SNP 2 being closely related to the mutation to map linkage disequilibrium between markers and causal gene is determined by quantifying the allelic frequencies between two samples advantages: (i) Having a reference genome is not a prerequisite (ii) Markers can be generated from the experimental data (iii) Differential expression profiles (iv) Info on effects of mutant on global patterns of gene expression (v) Provide map position of a gene Liu et al. (2012) BSA requires polymorphic markers 50
  • 51. >64,000 SNPs  Two alleles of a given SNP site should be detected in approximately equal numbers of RNA-Seq reads when considering both pools of RNASeq data.  Only one allele of a SNP that is completely linked to the causal gene should be present among the RNA-Seq reads from the mutant pool  In practice, as a consequence of Allele Specific Expression and sampling bias, genes expressed at low levels, single allele of many SNPs are detected in the mutant pool.  Empirical Bayesian approach used to estimate linkage probability, i.e. probability of a SNP exhibiting complete linkage disequilibrium with the causal gene. 51
  • 52. >64,000 SNPs  Two alleles of a given SNP site should be detected in approximately equal numbers of RNA-Seq reads when considering both pools of RNASeq data.  Only one allele of a SNP that is completely linked to the causal gene should be present among the RNA-Seq reads from the mutant pool  In practice, as a consequence of Allele Specific Expression and sampling bias, genes expressed at low levels, single allele of many SNPs are detected in the mutant pool.  Empirical Bayesian approach used to estimate linkage probability, i.e. probability of a SNP exhibiting complete linkage disequilibrium with the causal gene. 52
  • 53. >64,000 SNPs  Two alleles of a given SNP site should be detected in approximately equal numbers of RNA-Seq reads when considering both pools of RNASeq data.  Only one allele of a SNP that is completely linked to the causal gene should be present among the RNA-Seq reads from the mutant pool  In practice, as a consequence of Allele Specific Expression and sampling bias, genes expressed at low levels, single allele of many SNPs are detected in the mutant pool.  Empirical Bayesian approach used to estimate linkage probability, i.e. probability of a SNP exhibiting complete linkage disequilibrium with the causal gene. gl3-ref allele in a non-B73/B73 53
  • 54. Reference genome The top 10 windows with the highest median linkage probability were located at physical position ,183.5–185.2 Mb. Fine mapping of gene. 1.Mutant gene expression will often be down-regulated compared to the WT pool. 2. Collections of SNPs tightly linked to mutant gene 3. SNPs linked to mutated gene can be used for gene cloning via chromosome walking. 54
  • 55. • Not necessary to use tissue with mutant gene expression for BSR-Seq. • However, if we collect tissue with expression we can also get additional expression data. Resolution of mapping depends on 1. # of individuals included in the bulks 2. Sequencing depth 3. Density of polymorphisms in mapping population 55
  • 56. • International multi-disciplinary consortium; 1,000 plant sps. transcriptome data • It is PPP project; funding of 75% from Govt. of Alberta, 25% by Musea Ventures. BGI-Shenzhen- sequencing at reduced costs & iPlant collaborative - computational informatics. • Objectives: 1. Resolve many of the lingering uncertainties in species relationships, especially in the early lineages of streptophyte green algae and land plants 2. To identify gene changes associated with the major innovations in Viridiplantae evolution, such as multi-cellularity, transitions from marine to freshwater or terrestrial environments, maternal retention of zygotes and embryos, complex life history involving haploid and diploid phases, vascular systems, seeds and flowers • Species selection; representations of all major lineages across the Viridiplantae (green plants), representing ~1 billion years of evolution, including flowering plants, conifers, ferns, mosses and streptophyte green algae. 56
  • 57. Resources available 1. Access to raw and processed data: Content; transcriptome assemblies, putative coding sequences, orthogroups and gene and species trees with related sequence alignments. 2. High performance computing and cloud-based services: iPlant discovery environment (DE) web interface (tutorials and teaching materials available) 57
  • 58. Phenylpropanoid synthesis pathway for Colchicum autumnale. Labelled rectangles are proteins. Small circles are metabolites. Black lines show the KEGG pathway. Red lines show the BioGRID interactions emanating from protein (K12355), which was interactively selected. A right- click on the protein will display the inferred function and a link to the sequence(s) Interactions & pathways 58
  • 59. Conclusion • RNA-sequencing is now well-established as a versatile platform with applications in an ever growing number of fields of plant biology research • Ongoing developments in sequencing technologies, such as increased read lengths, greater numbers of reads per run • Advanced computational tools to facilitate sequence assembly, analysis, and integration with orthogonal data sets will further accelerate the breadth and frequency of its adoption by plant scientists 59

Notas do Editor

  1. Human genes 20,000-25,000 & rice 56,081 genes. All will not express all time and tissues.
  2. SAGE: a tag of only 9 or 10 bases, from a specific position in the mRNA, is enough to identify the transcript. MPSS= Massively Parallel Signature Sequencing, 16-20 bp signature, CAGE=27nt 5' ends of capped transcripts
  3. Use of close-related species is preferable, but if none is available, use A. thaliana
  4. Gene Ontology (GO) database includes information on biological processes, molecular functions and cellular components and is an international standardized gene functional classification system which offers a dynamically updated controlled vocabulary and a strictly defined concept that describes the properties of genes and their products in any organism.
  5. aerial and underground tissues are distinguished from seed tissue by a bimodal expression pattern with more genes from aerial and underground tissues shifted toward higher expression
  6. Z-score help in further annotation process & investigation especially for those genes with no known annotation. Oleosin and Lectin precursor 1 gene; no expression in seed 10 DAF & high in seed 42 DAF, this warrants investigation of determine how their similar expression profiles in seed development is affected by the negative correlation between protein and oil seed content
  7. Web interface is provided to this table, which links each table cell to a downloadable list of genes.
  8. Underground tissue have larger first exon, aerial has higher # of exons. Significant difference in total transcription length among tissue due to varying intron length.
  9. transcripts that are expressed at extremely low levels, or that are specific to an uncommon cell type in a complex organ or tissue, may be diluted below the limit of detection.
  10. Overlap between differentially responsive genes following treatment with ABA and PEG. LEA proteins are water-binding molecules, membrane-stabilizers, and ion modulators induced by drought stress and ABA, a subclass of which includes dehydrins.
  11. because sequence information is transferred from model species to crop plants without sequenced genomes
  12. Single copy= orthologs, multicopy= paralogs The highest number of two-taxon groups (2 x 2 and 2 x N) was observed between rice and sorghum compared to Brachypodium with either rice or sorghum, consistent with genome reduction and gene loss during Brachypodium evolution.
  13. The integration of transcriptomics, proteomics, and metabolomics can expose complex biological and biochemical interactions, paving the way to elucidate relation- ships between genotype and phenotype.
  14. The expression patterns of genes within the mapping interval can be used to prioritize candidate genes .
  15. In Bayesian analysis approach, a SNP is classified as having a high probability of being in complete linkage disequilibrium with the causal gene only if it is ‘‘fixed’’ in the mutant pool. If mis-inclusion of mutant in nonmutant or vice versa also it takes care.