SlideShare uma empresa Scribd logo
1 de 36
Extracting biological meaning from large gene list with DAVID Huang et al., CurrProtoc Bioinformatics (2009) http://david.abcc.ncifcrf.gov/home.jsp Francesco Mattia Mancuso (francesco.mancuso@crg.es) Bioinfarmatics Core Facility Short Tutorial
Introduction ,[object Object]
 Proteomics
 Expression microarray
 Promoter microarray
ChIP-on-CHIPs
 …significant capabilities to study a large variety of biological mechanisms, including associations with diseases large ‘interesting’ gene list (ranging in size from hundreds to thousands of genes) involved in studied biological conditions.  Data Analysis of genes/proteins list ,[object Object]
Challenging task,[object Object]
Released in 2003 (Dennis et al., Genome Biol.; Hosack et al., Genome Biol.)
able to extract biological features/meaning associated with large gene lists
able to handle any type of gene listCommon strategy with other tool: ,[object Object]
gene ontology terms
to statistically highlight the most overrepresented biological annotation
enrichment,[object Object]
Main objectives of GO project Compile and provide GO terms; Use of structured vocabularies in the annotation of gene products; Provide open access to the GO database and Web resource. Independent sets of vocabularies Molecular Function (MF) – elemental activity or task performed, or potentially performed, by individual gene products (e.g. “DNA binding” and “catalytic activity”); Cellular Component (CC) – location of action for a gene product (e.g. “organelle membrane” and “cytoskeleton”); Biological Process (BP) – broad biological objective or goal in which a gene product participates. (e.g. “DNA replication” and “response to stimulus”).
[object Object]
The accession ID belongs with the definition.
if a term changes (e.g., from “chromatin” to “structural component of chromatin”), but not the definition of the term, the accession ID will remain the same.Directed acyclic graphs (DAGs) Semantic relationships between parent and child terms: ,[object Object]
part_of: the child is a component of the parent, such as a subprocess or physical part (e.g. nucleolus is part of nuclear lumen),[object Object]
Enrichment  and p-valuescalculatedwith a hypergeometricdistribution N = all genes (universe) M = all genes belonging to a pathway n = your gene list m = genes of your gene list that belongs to the pathway Other well-known statistical methods: χ2, Fisher’s exact test, Binomial probability
A 'good' gene list Contains many important genes (marker genes) as expected; Reasonable number of genes ranging from hundreds to thousands (e.g., 100–2,000 genes), not extremely low or high; Most of the genes significantly pass the statistical threshold; Portion of up- or down-regulated genes are involved in certain interesting biological processes, rather than being randomly spread throughout all possible biological processes; Consistently contain more enriched biology than that of a random list in the same size range; High reproducibility to generate a similar gene list under the same conditions; Data high quality can be confirmed by other independent experiments.
DAVIDhomepage:   http://david.abcc.ncifcrf.gov/home.jsp
The wide-range collection of heterogeneous functional annotations in the DAVID Knowledgebase
Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
GENE LIST MANAGEMENT PANEL: SUBMIT AND MANAGE USER’S GENE LISTS
Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
GENE NAME BATCH VIEWER: EXPLORE GENE NAMES BASED ON USER’S GENE IDs
ID CONVERSION TOOL: CONVERT USERS’ GENE IDs TO DIFFERENT TYPES
Exercise 1 Submit data and convert the IDs Cicala, C. et al. HIV envelope induces a cascade of cell signals in non-proliferating target cells that favor virus replication. Proc. Natl. Acad. Sci. USA 99, 9380–9385 (2002). “Freshly isolated peripheral blood mononuclear cells were treated with an HIV envelope protein (gp120) and genome-wide gene expression changes were observed using Affymetrix U95A microarray chips. The aim of the experiment was to investigate cellular responses to viral envelope protein infection, which may help in understanding the mechanisms for HIV replication in resting or sub-optimally activated peripheral blood mononuclear cells.” DOWNLOAD THE DATASET FROM : http://www.nature.com/nprot/journal/v4/n1/suppinfo/nprot.2008.211_S1.html Supplementary Data 2
Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
GENE FUNCTIONAL CLASSIFICATION TOOL: CLASSIFY USERS’ GENES INTO CO-FUNCTIONAL GENE GROUPS
Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
FUNCTIONAL ANNOTATION TOOL: IDENTIFY ENRICHEDBIOLOGY WITHIN USERS’ GENE LISTS

Mais conteúdo relacionado

Mais procurados

Lecture 6 candidate gene association full
Lecture 6 candidate gene association fullLecture 6 candidate gene association full
Lecture 6 candidate gene association fullLekki Frazier-Wood
 
Forward and reverse genetics
Forward and reverse geneticsForward and reverse genetics
Forward and reverse geneticsSrishti Singh
 
Lecture 9 slides: Machine learning for Protein Structure ...
Lecture 9 slides: Machine learning for Protein Structure ...Lecture 9 slides: Machine learning for Protein Structure ...
Lecture 9 slides: Machine learning for Protein Structure ...butest
 
Transgenesis, Intragenesis and Cisgenesis: A Brief Review
Transgenesis, Intragenesis and Cisgenesis: A Brief ReviewTransgenesis, Intragenesis and Cisgenesis: A Brief Review
Transgenesis, Intragenesis and Cisgenesis: A Brief ReviewHuda Nazeer
 
Assembly and gene_prediction
Assembly and gene_predictionAssembly and gene_prediction
Assembly and gene_predictionBas van Breukelen
 
Comparitive genome mapping and model systems
Comparitive genome mapping and model systemsComparitive genome mapping and model systems
Comparitive genome mapping and model systemsHimanshi Chauhan
 
Sequencing, Alignment and Assembly
Sequencing, Alignment and AssemblySequencing, Alignment and Assembly
Sequencing, Alignment and AssemblyShaun Jackman
 
Threading modeling methods
Threading modeling methodsThreading modeling methods
Threading modeling methodsratanvishwas
 
Application of bioinformatics in agriculture sector
Application of bioinformatics in agriculture sectorApplication of bioinformatics in agriculture sector
Application of bioinformatics in agriculture sectorSuraj Singh
 
Softwares For Phylogentic Analysis
Softwares For Phylogentic AnalysisSoftwares For Phylogentic Analysis
Softwares For Phylogentic AnalysisPrasanthperceptron
 
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...mohd younus wani
 
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...Torsten Seemann
 
Activation tagging in plants
Activation tagging in plantsActivation tagging in plants
Activation tagging in plantsAmandeep Kaur
 
Genomic selection, prediction models, GEBV values, genomic selection in plant...
Genomic selection, prediction models, GEBV values, genomic selection in plant...Genomic selection, prediction models, GEBV values, genomic selection in plant...
Genomic selection, prediction models, GEBV values, genomic selection in plant...Mahesh Biradar
 

Mais procurados (20)

SNp mining in crops
SNp mining in cropsSNp mining in crops
SNp mining in crops
 
Lecture 6 candidate gene association full
Lecture 6 candidate gene association fullLecture 6 candidate gene association full
Lecture 6 candidate gene association full
 
Forward and reverse genetics
Forward and reverse geneticsForward and reverse genetics
Forward and reverse genetics
 
Lecture 9 slides: Machine learning for Protein Structure ...
Lecture 9 slides: Machine learning for Protein Structure ...Lecture 9 slides: Machine learning for Protein Structure ...
Lecture 9 slides: Machine learning for Protein Structure ...
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
Transgenesis, Intragenesis and Cisgenesis: A Brief Review
Transgenesis, Intragenesis and Cisgenesis: A Brief ReviewTransgenesis, Intragenesis and Cisgenesis: A Brief Review
Transgenesis, Intragenesis and Cisgenesis: A Brief Review
 
Assembly and gene_prediction
Assembly and gene_predictionAssembly and gene_prediction
Assembly and gene_prediction
 
Comparitive genome mapping and model systems
Comparitive genome mapping and model systemsComparitive genome mapping and model systems
Comparitive genome mapping and model systems
 
NGS: Mapping and de novo assembly
NGS: Mapping and de novo assemblyNGS: Mapping and de novo assembly
NGS: Mapping and de novo assembly
 
Sequencing, Alignment and Assembly
Sequencing, Alignment and AssemblySequencing, Alignment and Assembly
Sequencing, Alignment and Assembly
 
Genome Mapping
Genome MappingGenome Mapping
Genome Mapping
 
Threading modeling methods
Threading modeling methodsThreading modeling methods
Threading modeling methods
 
Application of bioinformatics in agriculture sector
Application of bioinformatics in agriculture sectorApplication of bioinformatics in agriculture sector
Application of bioinformatics in agriculture sector
 
Softwares For Phylogentic Analysis
Softwares For Phylogentic AnalysisSoftwares For Phylogentic Analysis
Softwares For Phylogentic Analysis
 
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
 
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
 
Association mapping
Association mappingAssociation mapping
Association mapping
 
Activation tagging in plants
Activation tagging in plantsActivation tagging in plants
Activation tagging in plants
 
Neb cutter
Neb cutterNeb cutter
Neb cutter
 
Genomic selection, prediction models, GEBV values, genomic selection in plant...
Genomic selection, prediction models, GEBV values, genomic selection in plant...Genomic selection, prediction models, GEBV values, genomic selection in plant...
Genomic selection, prediction models, GEBV values, genomic selection in plant...
 

Semelhante a David

Bioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolBioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolJesminBinti
 
Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomicsAisha Kalsoom
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformaticaMartín Arrieta
 
BIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And ChallengesBIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And ChallengesAmos Watentena
 
Functional annotation
Functional annotationFunctional annotation
Functional annotationRavi Gandham
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein functionLars Juhl Jensen
 
X-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage toolX-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage toolGenomika Diagnósticos
 
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONCOMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONcsandit
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data setsimprovemed
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europeopen_phacts
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSMSCW Mysore
 
IJSRED-V2I1P5
IJSRED-V2I1P5IJSRED-V2I1P5
IJSRED-V2I1P5IJSRED
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirKAUSHAL SAHU
 
Utilizing literature for biological discovery
Utilizing literature for biological discoveryUtilizing literature for biological discovery
Utilizing literature for biological discoveryLars Juhl Jensen
 

Semelhante a David (20)

Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
 
Bioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolBioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST Tool
 
Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomics
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformatica
 
bioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics databioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics data
 
BIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And ChallengesBIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And Challenges
 
Functional annotation
Functional annotationFunctional annotation
Functional annotation
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
 
X-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage toolX-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage tool
 
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONCOMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
 
Major biological nucleotide databases
Major biological nucleotide databasesMajor biological nucleotide databases
Major biological nucleotide databases
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
 
IJSRED-V2I1P5
IJSRED-V2I1P5IJSRED-V2I1P5
IJSRED-V2I1P5
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sir
 
Utilizing literature for biological discovery
Utilizing literature for biological discoveryUtilizing literature for biological discovery
Utilizing literature for biological discovery
 

Último

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 

Último (20)

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 

David

  • 1. Extracting biological meaning from large gene list with DAVID Huang et al., CurrProtoc Bioinformatics (2009) http://david.abcc.ncifcrf.gov/home.jsp Francesco Mattia Mancuso (francesco.mancuso@crg.es) Bioinfarmatics Core Facility Short Tutorial
  • 2.
  • 7.
  • 8.
  • 9. Released in 2003 (Dennis et al., Genome Biol.; Hosack et al., Genome Biol.)
  • 10. able to extract biological features/meaning associated with large gene lists
  • 11.
  • 13. to statistically highlight the most overrepresented biological annotation
  • 14.
  • 15. Main objectives of GO project Compile and provide GO terms; Use of structured vocabularies in the annotation of gene products; Provide open access to the GO database and Web resource. Independent sets of vocabularies Molecular Function (MF) – elemental activity or task performed, or potentially performed, by individual gene products (e.g. “DNA binding” and “catalytic activity”); Cellular Component (CC) – location of action for a gene product (e.g. “organelle membrane” and “cytoskeleton”); Biological Process (BP) – broad biological objective or goal in which a gene product participates. (e.g. “DNA replication” and “response to stimulus”).
  • 16.
  • 17. The accession ID belongs with the definition.
  • 18.
  • 19.
  • 20. Enrichment and p-valuescalculatedwith a hypergeometricdistribution N = all genes (universe) M = all genes belonging to a pathway n = your gene list m = genes of your gene list that belongs to the pathway Other well-known statistical methods: χ2, Fisher’s exact test, Binomial probability
  • 21. A 'good' gene list Contains many important genes (marker genes) as expected; Reasonable number of genes ranging from hundreds to thousands (e.g., 100–2,000 genes), not extremely low or high; Most of the genes significantly pass the statistical threshold; Portion of up- or down-regulated genes are involved in certain interesting biological processes, rather than being randomly spread throughout all possible biological processes; Consistently contain more enriched biology than that of a random list in the same size range; High reproducibility to generate a similar gene list under the same conditions; Data high quality can be confirmed by other independent experiments.
  • 22. DAVIDhomepage: http://david.abcc.ncifcrf.gov/home.jsp
  • 23. The wide-range collection of heterogeneous functional annotations in the DAVID Knowledgebase
  • 24. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 25. GENE LIST MANAGEMENT PANEL: SUBMIT AND MANAGE USER’S GENE LISTS
  • 26. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 27. GENE NAME BATCH VIEWER: EXPLORE GENE NAMES BASED ON USER’S GENE IDs
  • 28. ID CONVERSION TOOL: CONVERT USERS’ GENE IDs TO DIFFERENT TYPES
  • 29.
  • 30. Exercise 1 Submit data and convert the IDs Cicala, C. et al. HIV envelope induces a cascade of cell signals in non-proliferating target cells that favor virus replication. Proc. Natl. Acad. Sci. USA 99, 9380–9385 (2002). “Freshly isolated peripheral blood mononuclear cells were treated with an HIV envelope protein (gp120) and genome-wide gene expression changes were observed using Affymetrix U95A microarray chips. The aim of the experiment was to investigate cellular responses to viral envelope protein infection, which may help in understanding the mechanisms for HIV replication in resting or sub-optimally activated peripheral blood mononuclear cells.” DOWNLOAD THE DATASET FROM : http://www.nature.com/nprot/journal/v4/n1/suppinfo/nprot.2008.211_S1.html Supplementary Data 2
  • 31. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 32. GENE FUNCTIONAL CLASSIFICATION TOOL: CLASSIFY USERS’ GENES INTO CO-FUNCTIONAL GENE GROUPS
  • 33.
  • 34.
  • 35. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 36. FUNCTIONAL ANNOTATION TOOL: IDENTIFY ENRICHEDBIOLOGY WITHIN USERS’ GENE LISTS
  • 37.
  • 38. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 39.
  • 41. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 43. Analytic tools/modules in DAVID Huang et al., Nature Protocols, 2009
  • 44. Functional Annotation Table There is no statistics applied in this report.
  • 45. Attention!!!!! DAVID enrichment analysis is more of an exploratory procedure than a pure statistical solution. “The final interpretation and analytic result decisions (in terms of accepting the results that make sense biologically in the context of the study, or rejecting ones that do not) should be made by the biologists/analysts themselves, rather than by any of the tools.” (Huang et al., 2009)
  • 46.
  • 47.
  • 48. EASE Score Threshold (Maximum Probability): the threshold of EASE Score, a modified Fisher Exact P-value, for gene-enrichment analysis. It ranges from 0 to 1. Fisher Exact P-Value = 0 represents perfect enrichment.
  • 49. The Fold Enrichment is defined as the ratio of the two proportions. For example, if 40/400 (i.e. 10%) of your input genes involved in "kinase activity" and the background information is 300/30000 genes (i.e.  1%) associating with "kinase activity", roughly 10% / 1% = 10 fold enrichment.
  • 50. In DAVID annotation system, Fisher Exact is adopted to measure the gene-enrichment in annotation terms. When members of two independent groups can fall into one of two mutually exclusive categories, Fisher Exact test is used to determine whether the proportions of those falling into each category differs by group.
  • 51. Benjamini-Hochberg, Bonferroni, FDR (False Discovery Rate) are different 'standard' statistics for multiple comparison corrections. They correct P-values to be more conservative in order to lower family-wise false discovery rate.
  • 52. LT (list total): number of genes in your gene list mapped to any term in this ontology ("system”)
  • 53. PH (population hits): number of genes with this GO term on the background list (the whole chip)
  • 54. PT (population total): number of genes on the background list (the whole chip) mapped to any term in this ontology ("system”)

Notas do Editor

  1. GoMiner, GOstat, Onto-express, GoToolBox, FatiGO, GFINDer and GSEA
  2. 3 - (e.g., selecting genes by comparing gene expression between control and experimental cells with t-test statistics: fold changes greater than or equal to 2 and P-values less than or equal to 0.05)6 - e.g., by independent experiments under the same conditions or by leave-one-out statistical test
  3. Functional classification: ability for investigators to explore and view functionally related genes together, as a unit, to concentrate on the larger biological network rather than at the level of an individual gene.Functional Annotation chart: provides typical gene–term enrichment (overrepresented) analysis to identify the most relevant (overrepresented) biological terms associated with a given gene listFunctional Annotation Clustering: uses a similar fuzzy clustering concept as functional classification by measuring relationships among the annotation terms on the basis of the degree of their coassociation with genes within the user’s list to cluster somewhat heterogeneous, yet highly similar annotation into functional annotation groupsFunctional annotation table: is a query engine for the DAVID knowledgebase, without statistical calculations. For a given gene list, the tool can quickly query corresponding annotation for each gene and present them in a table format.