SlideShare uma empresa Scribd logo
1 de 28
Baixar para ler offline
NGS at
geneXplain GmbH
   Introducing GenomeTraveler
geneXplain GmbH




Products
   Bioinformatics and systems biology: geneXplain platform
   Bioinformatics and NGS: IMC and GenomeTraveler
   Cheminformatics: PASS, PharmaExpert, GUSAR




                          info@genexplain.com | www.genexplain.com   2
Systems Biology and Cheminformatics

   geneXplain platform
    ◦ Collection, storage and analysis of experimental data
    ◦ Network clustering and search for master regulators
    ◦ Graphical programming of workflows
    ◦ Possibility to write new scripts and add-ons


   PASS and PharmaExpert
    ◦ Qualitative biological activity prediction for libraries of chemical compounds
    ◦ Drug-drug interactions and identification of most promising candidates


   GUSAR
    ◦ Quantitative biological activity prediction for libraries of chemical compounds
    ◦ (Q)SAR model building




                                      info@genexplain.com | www.genexplain.com          3
geneXplain GmbH




Products
   Bioinformatics and systems biology: geneXplain platform
   Bioinformatics and NGS: IMC and GenomeTraveler
   Cheminformatics: PASS, PharmaExpert, GUSAR




                          info@genexplain.com | www.genexplain.com   4
Contents
   Primary analysis                       Comparative
    ◦ de novo assembly                      genomics
    ◦ Scaffolding                           ◦ in silico cloning
    ◦ Mapping                               ◦ Genome analysis
    ◦ Import of mapping                     ◦ Array analysis
      results


   Secondary analysis
                                           Why you should try
    ◦ Profile analysis                      GenomeTraveler
    ◦ Mutation analysis
    ◦ Expression analysis


                                             info@genexplain.com |
                            info@genexplain.com | www.genexplain.com
                                               www.genexplain.com      5
Primary Analysis
   Getting started with NGS
   data




          info@genexplain.com |
            www.genexplain.com    6
General Information

   GT supports NGS data from the following
    sources:
    ◦ ABI SOLiD (CSFastA)
    ◦ Illumina GA (FastQ)
    ◦ Roche/454 GS FLX/Junior

   It is possible to map NGS sequences against
    sequence data in any of these formats:
    ◦ ABI/SCF
    ◦ GenBank
    ◦ FastA


                        info@genexplain.com | www.genexplain.com   7
de novo Assembly
   de novo assembly can be
    done with 200 bp or larger
    reads,    saved    in  the
    following formats:
    ◦   ABI/SCF/SFF/PHD
    ◦   FastA/FastQ/CSFastA
    ◦   Solexa Raw
    ◦   GenBank/EMBL
    ◦   TXT

   GT also supports the
    import of assembly files
    from:
    ◦ Velvet (AFG)
    ◦ Phrap (ACE)                         Example: assembly results imported
                                          from Phrap.


                              info@genexplain.com | www.genexplain.com         8
Scaffolding

The de novo assembly of large genomes results
in a large number of contigs. These can be
combined using paired-end short reads.
Contig positions in a scaffold can be viewed in
GT's "Scaffold Viewer".




                   info@genexplain.com | www.genexplain.com   9
Mapping Functions
The mapping algorithm is based on NCBI
Blast results for reference sequences
compared with NGS short reads.

         fragment                 paired-end
        short reads               short reads



                                  multiple
       multiple sites          chromosomes
                                 or contigs


                   info@genexplain.com | www.genexplain.com   10
Import of Mapping Results
   GenomeTraveler    can
    also import mapping
    results   from  other
    software, namely:
    ◦ SAMtool (SAM/BAM)
    ◦ SOLiD (GFF)


   Mapping and import
    results can be viewed
    in the "Profile and
    Alignment Viewer" of
    GT.                                     Example: imported GFF file.



                          info@genexplain.com | www.genexplain.com        11
Primary Analysis Summary

       Long read de novo
    assembly and import of      Scaffolding using paired-
     assembly results from          end short reads.
       Phrap and Velvet.

                  GenomeTraveler has a
                  number of options for
                 the primary analysis of
                        NGS data.

     Mapping: single-end,
                                Import of mapping result
      paired-end, multiple
                                 files from SAMtool and
    chromosome/contig/site
                                          SOLiD.
           mapping.




                      info@genexplain.com | www.genexplain.com   12
Secondary Analysis
      Analysis of mapped data




             info@genexplain.com |
               www.genexplain.com    13
General Information
   Once NGS data is
    mapped, it can be
    analyzed     in    the
    "Profile and Alignment
    Viewer".

   The viewer consists of
    four panels:
    ◦ Whole genome
      navigation panel
    ◦ Profile panel
    ◦ Genome (feature) map
    ◦ Sequence alignment


                        info@genexplain.com | www.genexplain.com   14
Profile Analysis

   Peak detection can be done for one, a few
    or all contigs in one alignment.

   Peaks can be
    saved as CSV
    file or inserted
    as          new
    features    into
    the     genome
    map.


                       info@genexplain.com | www.genexplain.com   15
Mutation Analysis

   The "Disagreement List" is a list of bases
    which differ between the reference and the
    consensus of mapped reads (example).

   SNP detection
    ◦ Based on
      disagreement list
    ◦ Feature mapping
    ◦ Link generation
      with JSNP

   Indel detection

                          info@genexplain.com | www.genexplain.com   16
Expression Analysis

   GT provides profiles and a comparison with
    expression profiles of tiling arrays for:
    ◦ RNA-Seq
    ◦ ChIP-Seq




                    info@genexplain.com | www.genexplain.com   17
Secondary Analysis Summary

                                    Mutation analysis: find
    Profile analysis: detect        divergent bases, SNPs
      and register peaks.           and insertion/deletion
                                            sites.

                     GenomeTraveler
                  provides three fields for
                   secondary analysis of
                        NGS data.

                                    These are GT specific
    Expression analysis: GT
                                    functions that can be
   provides profiles for RNA-
                                   accessed in the "Profile
      and ChIP-Seq data.
                                   and Alignment Viewer".




                        info@genexplain.com | www.genexplain.com   18
Comparative
  Genomics
Functions inherited from IMC




       info@genexplain.com |
         www.genexplain.com    19
General Information
GenomeTraveler is based
on      IMC   (in    silico                              Cloning
Molecular        Cloning),
which offers three major
in silico experimentation                               Genome
areas.                                                  analysis
These are also featured
in GT and can be
accessed in the "Profile                                 Array
and Alignment Viewer".                                  analysis



                      info@genexplain.com | www.genexplain.com     20
in silico Cloning
The "Cloning" menu lists a number of possible
actions for in silico molecular cloning experiments.
Included are all necessary functions for:
  ◦ PCR (including gel electrophoresis of results)
  ◦ Ligation and sequence editing
  ◦ Restriction enzymes and digestion (example below)




                                    info@genexplain.com | www.genexplain.com   21
Genome Analysis
The "Genome Analysis" menu is filled with functions from
statistics to visualization to working with sequences:
  ◦   Translation
  ◦   Annotation
  ◦   EST and trace mapping
  ◦   Homology searches
  ◦   Dot plot, Venn diagram (examples below), KEGG pathways




                                     info@genexplain.com | www.genexplain.com   22
Genome Analysis
The genome rearrangement map, as part
of the genome analysis functionality,
compares the distribution of features on
your mapped sequence compared to any
reference   sequence     you   set    (in
GenBank/EMBL/GPFF formats), as seen in
the example below.




                       info@genexplain.com | www.genexplain.com   23
Array Analysis
The "Array Analysis" menu contains all functions
necessary for the creation and analysis of array
data:
  ◦ Probe design (example below)
  ◦ Import of probe and array data
  ◦ Statistics and peak detection
  ◦ Two expression graphs and clustering




                                     info@genexplain.com | www.genexplain.com   24
Additional Functions
The     "Profile     and
Alignment Viewer" has
more to offer, though:
   Search for pattern matches,
    keywords, and more.
   View your sequence         in
    different ways.
   Customize   the  genome
    feature map and hide the
    program's  quick  access
    buttons.




                          info@genexplain.com | www.genexplain.com   25
Comparative Genomics Summary

    in silico cloning options            Genome analysis
    include PCR, restriction           features translation,
          and ligation of             annotation, homology
            sequences.                 searches and more.


                    With GenomeTraveler
                    you can work in silico
                     with your NGS data.


     Array analysis offers          The additional functions
      probe design, data               of the "Profile and
    import, peak detection         Alignment Viewer" render
          and more.                    GT more flexible.




                         info@genexplain.com | www.genexplain.com   26
Why You Should
Try GenomeTraveler
    Advantages of the Software




           info@genexplain.com |
             www.genexplain.com    27
Key Notes

   GenomeTraveler has a number of options for
    the primary analysis of NGS data.
   GenomeTraveler provides three                                fields   for
    secondary analysis of NGS data.
   With GenomeTraveler you can work in silico
    with your NGS data.


   You can download a free two-week trial version
    of GenomeTraveler at
    http://www.genexplain.com/genome-traveler

                      info@genexplain.com | www.genexplain.com                  28

Mais conteúdo relacionado

Semelhante a GenomeTraveler

Artificial Intelligence Database Performance Tuning
Artificial Intelligence Database Performance TuningArtificial Intelligence Database Performance Tuning
Artificial Intelligence Database Performance TuningRoel Van de Paar
 
CRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowCRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowHorizonDiscovery
 
TGAC Browser bosc 2014
TGAC Browser bosc 2014TGAC Browser bosc 2014
TGAC Browser bosc 2014Anil Thanki
 
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...GenomeInABottle
 
BG7, a new system for bacterial genome annotation designed for NGS data
BG7, a new system for bacterial genome annotation designed for NGS data BG7, a new system for bacterial genome annotation designed for NGS data
BG7, a new system for bacterial genome annotation designed for NGS data Marina Manrique
 
Publicly available tools and open resources in Bioinformatics
Publicly available  tools and open resources in BioinformaticsPublicly available  tools and open resources in Bioinformatics
Publicly available tools and open resources in BioinformaticsArindam Ghosh
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchDavid Ruau
 
Bioinformatics MiRON
Bioinformatics MiRONBioinformatics MiRON
Bioinformatics MiRONPrabin Shakya
 
Apollo Collaborative genome annotation editing
Apollo Collaborative genome annotation editing Apollo Collaborative genome annotation editing
Apollo Collaborative genome annotation editing Monica Munoz-Torres
 
Visualizing the genome: Techniques for presenting genome data and annotations
Visualizing the genome: Techniques for presenting genome data and annotationsVisualizing the genome: Techniques for presenting genome data and annotations
Visualizing the genome: Techniques for presenting genome data and annotationsAnn Loraine
 
Neo4j_Cypher.pdf
Neo4j_Cypher.pdfNeo4j_Cypher.pdf
Neo4j_Cypher.pdfJaberRad1
 
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...Daniel Su
 

Semelhante a GenomeTraveler (20)

Artificial Intelligence Database Performance Tuning
Artificial Intelligence Database Performance TuningArtificial Intelligence Database Performance Tuning
Artificial Intelligence Database Performance Tuning
 
ChIP-seq Theory
ChIP-seq TheoryChIP-seq Theory
ChIP-seq Theory
 
CRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowCRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and How
 
TGAC Browser bosc 2014
TGAC Browser bosc 2014TGAC Browser bosc 2014
TGAC Browser bosc 2014
 
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
 
BG7, a new system for bacterial genome annotation designed for NGS data
BG7, a new system for bacterial genome annotation designed for NGS data BG7, a new system for bacterial genome annotation designed for NGS data
BG7, a new system for bacterial genome annotation designed for NGS data
 
Publicly available tools and open resources in Bioinformatics
Publicly available  tools and open resources in BioinformaticsPublicly available  tools and open resources in Bioinformatics
Publicly available tools and open resources in Bioinformatics
 
ChipSeq Data Analysis
ChipSeq Data AnalysisChipSeq Data Analysis
ChipSeq Data Analysis
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
 
Bioinformatics MiRON
Bioinformatics MiRONBioinformatics MiRON
Bioinformatics MiRON
 
Arraygen_Brochure
Arraygen_BrochureArraygen_Brochure
Arraygen_Brochure
 
Apollo Collaborative genome annotation editing
Apollo Collaborative genome annotation editing Apollo Collaborative genome annotation editing
Apollo Collaborative genome annotation editing
 
P0126557 slides
P0126557 slidesP0126557 slides
P0126557 slides
 
Visualizing the genome: Techniques for presenting genome data and annotations
Visualizing the genome: Techniques for presenting genome data and annotationsVisualizing the genome: Techniques for presenting genome data and annotations
Visualizing the genome: Techniques for presenting genome data and annotations
 
Neo4j_Cypher.pdf
Neo4j_Cypher.pdfNeo4j_Cypher.pdf
Neo4j_Cypher.pdf
 
Snapgene
SnapgeneSnapgene
Snapgene
 
May 15 workshop
May 15  workshopMay 15  workshop
May 15 workshop
 
May workshop
May workshopMay workshop
May workshop
 
Molecular Biology Software Links
Molecular Biology Software LinksMolecular Biology Software Links
Molecular Biology Software Links
 
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
 

Último

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 

Último (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

GenomeTraveler

  • 1. NGS at geneXplain GmbH Introducing GenomeTraveler
  • 2. geneXplain GmbH Products  Bioinformatics and systems biology: geneXplain platform  Bioinformatics and NGS: IMC and GenomeTraveler  Cheminformatics: PASS, PharmaExpert, GUSAR info@genexplain.com | www.genexplain.com 2
  • 3. Systems Biology and Cheminformatics  geneXplain platform ◦ Collection, storage and analysis of experimental data ◦ Network clustering and search for master regulators ◦ Graphical programming of workflows ◦ Possibility to write new scripts and add-ons  PASS and PharmaExpert ◦ Qualitative biological activity prediction for libraries of chemical compounds ◦ Drug-drug interactions and identification of most promising candidates  GUSAR ◦ Quantitative biological activity prediction for libraries of chemical compounds ◦ (Q)SAR model building info@genexplain.com | www.genexplain.com 3
  • 4. geneXplain GmbH Products  Bioinformatics and systems biology: geneXplain platform  Bioinformatics and NGS: IMC and GenomeTraveler  Cheminformatics: PASS, PharmaExpert, GUSAR info@genexplain.com | www.genexplain.com 4
  • 5. Contents  Primary analysis  Comparative ◦ de novo assembly genomics ◦ Scaffolding ◦ in silico cloning ◦ Mapping ◦ Genome analysis ◦ Import of mapping ◦ Array analysis results  Secondary analysis  Why you should try ◦ Profile analysis GenomeTraveler ◦ Mutation analysis ◦ Expression analysis info@genexplain.com | info@genexplain.com | www.genexplain.com www.genexplain.com 5
  • 6. Primary Analysis Getting started with NGS data info@genexplain.com | www.genexplain.com 6
  • 7. General Information  GT supports NGS data from the following sources: ◦ ABI SOLiD (CSFastA) ◦ Illumina GA (FastQ) ◦ Roche/454 GS FLX/Junior  It is possible to map NGS sequences against sequence data in any of these formats: ◦ ABI/SCF ◦ GenBank ◦ FastA info@genexplain.com | www.genexplain.com 7
  • 8. de novo Assembly  de novo assembly can be done with 200 bp or larger reads, saved in the following formats: ◦ ABI/SCF/SFF/PHD ◦ FastA/FastQ/CSFastA ◦ Solexa Raw ◦ GenBank/EMBL ◦ TXT  GT also supports the import of assembly files from: ◦ Velvet (AFG) ◦ Phrap (ACE) Example: assembly results imported from Phrap. info@genexplain.com | www.genexplain.com 8
  • 9. Scaffolding The de novo assembly of large genomes results in a large number of contigs. These can be combined using paired-end short reads. Contig positions in a scaffold can be viewed in GT's "Scaffold Viewer". info@genexplain.com | www.genexplain.com 9
  • 10. Mapping Functions The mapping algorithm is based on NCBI Blast results for reference sequences compared with NGS short reads. fragment paired-end short reads short reads multiple multiple sites chromosomes or contigs info@genexplain.com | www.genexplain.com 10
  • 11. Import of Mapping Results  GenomeTraveler can also import mapping results from other software, namely: ◦ SAMtool (SAM/BAM) ◦ SOLiD (GFF)  Mapping and import results can be viewed in the "Profile and Alignment Viewer" of GT. Example: imported GFF file. info@genexplain.com | www.genexplain.com 11
  • 12. Primary Analysis Summary Long read de novo assembly and import of Scaffolding using paired- assembly results from end short reads. Phrap and Velvet. GenomeTraveler has a number of options for the primary analysis of NGS data. Mapping: single-end, Import of mapping result paired-end, multiple files from SAMtool and chromosome/contig/site SOLiD. mapping. info@genexplain.com | www.genexplain.com 12
  • 13. Secondary Analysis Analysis of mapped data info@genexplain.com | www.genexplain.com 13
  • 14. General Information  Once NGS data is mapped, it can be analyzed in the "Profile and Alignment Viewer".  The viewer consists of four panels: ◦ Whole genome navigation panel ◦ Profile panel ◦ Genome (feature) map ◦ Sequence alignment info@genexplain.com | www.genexplain.com 14
  • 15. Profile Analysis  Peak detection can be done for one, a few or all contigs in one alignment.  Peaks can be saved as CSV file or inserted as new features into the genome map. info@genexplain.com | www.genexplain.com 15
  • 16. Mutation Analysis  The "Disagreement List" is a list of bases which differ between the reference and the consensus of mapped reads (example).  SNP detection ◦ Based on disagreement list ◦ Feature mapping ◦ Link generation with JSNP  Indel detection info@genexplain.com | www.genexplain.com 16
  • 17. Expression Analysis  GT provides profiles and a comparison with expression profiles of tiling arrays for: ◦ RNA-Seq ◦ ChIP-Seq info@genexplain.com | www.genexplain.com 17
  • 18. Secondary Analysis Summary Mutation analysis: find Profile analysis: detect divergent bases, SNPs and register peaks. and insertion/deletion sites. GenomeTraveler provides three fields for secondary analysis of NGS data. These are GT specific Expression analysis: GT functions that can be provides profiles for RNA- accessed in the "Profile and ChIP-Seq data. and Alignment Viewer". info@genexplain.com | www.genexplain.com 18
  • 19. Comparative Genomics Functions inherited from IMC info@genexplain.com | www.genexplain.com 19
  • 20. General Information GenomeTraveler is based on IMC (in silico Cloning Molecular Cloning), which offers three major in silico experimentation Genome areas. analysis These are also featured in GT and can be accessed in the "Profile Array and Alignment Viewer". analysis info@genexplain.com | www.genexplain.com 20
  • 21. in silico Cloning The "Cloning" menu lists a number of possible actions for in silico molecular cloning experiments. Included are all necessary functions for: ◦ PCR (including gel electrophoresis of results) ◦ Ligation and sequence editing ◦ Restriction enzymes and digestion (example below) info@genexplain.com | www.genexplain.com 21
  • 22. Genome Analysis The "Genome Analysis" menu is filled with functions from statistics to visualization to working with sequences: ◦ Translation ◦ Annotation ◦ EST and trace mapping ◦ Homology searches ◦ Dot plot, Venn diagram (examples below), KEGG pathways info@genexplain.com | www.genexplain.com 22
  • 23. Genome Analysis The genome rearrangement map, as part of the genome analysis functionality, compares the distribution of features on your mapped sequence compared to any reference sequence you set (in GenBank/EMBL/GPFF formats), as seen in the example below. info@genexplain.com | www.genexplain.com 23
  • 24. Array Analysis The "Array Analysis" menu contains all functions necessary for the creation and analysis of array data: ◦ Probe design (example below) ◦ Import of probe and array data ◦ Statistics and peak detection ◦ Two expression graphs and clustering info@genexplain.com | www.genexplain.com 24
  • 25. Additional Functions The "Profile and Alignment Viewer" has more to offer, though:  Search for pattern matches, keywords, and more.  View your sequence in different ways.  Customize the genome feature map and hide the program's quick access buttons. info@genexplain.com | www.genexplain.com 25
  • 26. Comparative Genomics Summary in silico cloning options Genome analysis include PCR, restriction features translation, and ligation of annotation, homology sequences. searches and more. With GenomeTraveler you can work in silico with your NGS data. Array analysis offers The additional functions probe design, data of the "Profile and import, peak detection Alignment Viewer" render and more. GT more flexible. info@genexplain.com | www.genexplain.com 26
  • 27. Why You Should Try GenomeTraveler Advantages of the Software info@genexplain.com | www.genexplain.com 27
  • 28. Key Notes  GenomeTraveler has a number of options for the primary analysis of NGS data.  GenomeTraveler provides three fields for secondary analysis of NGS data.  With GenomeTraveler you can work in silico with your NGS data.  You can download a free two-week trial version of GenomeTraveler at http://www.genexplain.com/genome-traveler info@genexplain.com | www.genexplain.com 28