Assembly tools and Visualisation
Matthias Haimel




                          EBI is an Outstation of the European Molecular Biology Laboratory.
Overview
    • Assemblers
               • ABySS
               • SOAPdenovo
    • Visualisation
               • Tablet
               • ABySS-Explorer
    • Read mapping
               • SAM / BAM
    • Visualisation
               • Artemis
               • IGV - Integrative Genomics Viewer



2   25.04.11             Assemblers
ABySS                                 Assembly By Short Sequences

    • Genome Sciences Centre, Vancouver
               • http://www.bcgsc.ca/platform/bioinfo/software/abyss
               • Open source, BCCA Licence
    • de Bruijn graph
               • Trimming (tip clipping), bubble popping
               • Use paired-end information: resolve ambiguities between contigs
               • parallel (use cluster)
    • Files
               • Fasta / Fastq
               • Sam/Bam
               • colour-space
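
As a rough illustration of the de Bruijn graph idea named above, the following Python sketch builds a graph whose nodes are (k-1)-mers and whose edges are the k-mers seen in the reads. It is a toy under stated assumptions, not ABySS's implementation: the function name `build_de_bruijn` and the two reads are made up for this example, reverse complements are ignored, and no tip clipping or bubble popping is performed.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Return {prefix (k-1)-mer: {suffix (k-1)-mer: k-mer count}}."""
    graph = defaultdict(lambda: defaultdict(int))
    for read in reads:
        # Slide a window of length k along the read.
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            # Each k-mer is an edge from its prefix to its suffix.
            graph[kmer[:-1]][kmer[1:]] += 1
    return {node: dict(edges) for node, edges in graph.items()}


# Two hypothetical overlapping reads (not from the slides).
reads = ["ACGTAC", "GTACGA"]
graph = build_de_bruijn(reads, k=4)
for node in sorted(graph):
    print(node, "->", graph[node])
```

The k-mer shared by both reads shows up as an edge with count 2, and the node `ACG` has two outgoing edges, which is the kind of ambiguity that paired-end information can help resolve.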



ABySS
    • ABYSS (single end)
               • e.g. ABYSS -k27 single.fastq -o contigs.fa
    • abyss-pe (paired end)
               • e.g. abyss-pe k=27 n=10 in='read_1.fastq read_2.fastq' name=ecli
    • Multiple libraries
               •   ... lib='read1 read2' read1='read1_1.fa read1_2.fa' read2='read2_1.fa read2_2.fa'
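
The multiple-library form can be sketched as a small Python helper that assembles the `abyss-pe` argument list from a mapping of library names to read files. The helper name `abyss_pe_command` and the file names are hypothetical; only the `k`, `n`, `name`, `lib` and per-library parameters shown on the slide are used, and nothing is executed.

```python
import shlex


def abyss_pe_command(k, name, libraries, n=10):
    """Build an abyss-pe argument list from {library name: [read files]}."""
    args = ["abyss-pe", f"k={k}", f"n={n}", f"name={name}"]
    # lib= lists the library names; each name then maps to its read files.
    args.append("lib=" + " ".join(libraries))
    for lib_name, files in libraries.items():
        args.append(f"{lib_name}=" + " ".join(files))
    return args


# Hypothetical library layout, mirroring the slide's naming scheme.
libraries = {
    "read1": ["read1_1.fa", "read1_2.fa"],
    "read2": ["read2_1.fa", "read2_2.fa"],
}
cmd = abyss_pe_command(k=27, name="ecli", libraries=libraries)
print(shlex.join(cmd))  # shell-quoted form for copy and paste
```

Passing the list to `subprocess.run(cmd, check=True)` would avoid shell quoting issues altogether, assuming `abyss-pe` is on the `PATH`.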




SOAPdenovo
    • Beijing Genomics Institute (BGI), China
               • http://soap.genomics.org.cn/soapdenovo.html
               • Panda genome
               • Source available
    • de Bruijn graph
               • Pre-set k-mer frequency threshold
               • Bubble removal
    • Build scaffold
               • mapping reads to contigs
               • gap filling




SOAPdenovo
    • Full run
               • e.g. SOAPdenovo all -s read.config -K 27 -o contigs.fa
    • Run sub steps
               •   pregraph: build the k-mer graph (comparable to velveth)
               •   contig: build contigs (comparable to velvetg)
               •   map: map reads to contigs
               •   scaff: scaffolding
    • Configuration
               • Config file input instead of read files
               • Specify rank, usage (assembly/scaffolding), insert size
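
To make the configuration bullet concrete, here is a hedged Python sketch that writes a config file with one `[LIB]` section per library. The key names (`max_rd_len`, `avg_ins`, `reverse_seq`, `asm_flags`, `rank`, `q1`, `q2`) follow the SOAPdenovo manual as best recalled and are not taken from the slides, so check them against the installed version; the helper name and all values are hypothetical.

```python
def soapdenovo_config(libraries, max_rd_len=100):
    """Render a SOAPdenovo-style config with one [LIB] section per library."""
    lines = [f"max_rd_len={max_rd_len}"]
    for lib in libraries:
        lines += [
            "[LIB]",
            f"avg_ins={lib['avg_ins']}",                 # average insert size
            f"reverse_seq={lib.get('reverse_seq', 0)}",  # 0 = forward-reverse pairs
            f"asm_flags={lib['asm_flags']}",             # 1 contigs, 2 scaffolds, 3 both
            f"rank={lib['rank']}",                       # order of use in scaffolding
            f"q1={lib['q1']}",
            f"q2={lib['q2']}",
        ]
    return "\n".join(lines) + "\n"


# Hypothetical libraries: a short-insert one for contigs and scaffolds,
# and a long-insert one used for scaffolding only.
config = soapdenovo_config([
    {"avg_ins": 200, "asm_flags": 3, "rank": 1,
     "q1": "read1_1.fq", "q2": "read1_2.fq"},
    {"avg_ins": 2000, "reverse_seq": 1, "asm_flags": 2, "rank": 2,
     "q1": "read2_1.fq", "q2": "read2_2.fq"},
])
print(config)
```

Writing the returned text to `read.config` would give a file of the kind passed with `-s` in the full-run example.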




Visualisation                     http://bioinf.scri.ac.uk/tablet/

    • Tablet
               • Lightweight
               • Easy to use
    • Formats
               •   ACE
               •   AFG
               •   BAM
               •   BANK (AMOS)




Visualisation - Velvet
    • Tablet
               • velvetg ... -amos_file yes
    • GraphViz
               •   Transform velvet graph into GraphViz format
               •   Contributed by Paul Harrison
               •   <velvet>/contrib/layout/
               •   Velvet -> .dot file (Python script)
               •   .dot -> png (graphviz)
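
The graph-to-GraphViz step can be pictured with a minimal Python sketch that renders a list of edges as DOT text. This is not the contributed script from the Velvet distribution; the function `to_dot`, the node IDs and the coverage labels are invented for illustration and are not read from a Velvet graph file.

```python
def to_dot(edges, name="assembly"):
    """Render (source, target, label) edges as GraphViz DOT text."""
    lines = [f"digraph {name} {{"]
    for source, target, label in edges:
        lines.append(f'    "{source}" -> "{target}" [label="{label}"];')
    lines.append("}")
    return "\n".join(lines)


# Hypothetical node IDs and coverage labels.
edges = [("1", "2", "cov 12"), ("2", "3", "cov 9"), ("2", "-3", "cov 1")]
dot_text = to_dot(edges)
print(dot_text)
```

Saved as `graph.dot`, the text can be rendered with GraphViz, for example `dot -Tpng graph.dot -o graph.png`.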




Visualisation                   http://www.bcgsc.ca/platform/bioinfo/software/abyss-explorer


    • ABySS-Explorer
               • Visualizes ABySS assemblies
               • Interactive graph structure
               • Filter contigs




Assembler - Practical
     • Assemblers
                • ABySS
                • SOAPdenovo
     • Visualisation
                • Tablet
                • ABySS-Explorer




Read mapping                             http://samtools.sourceforge.net/SAM1.pdf

     • SAM / BAM
                •   Sequence Alignment / Map format (SAM)
                •   Binary form of SAM (BAM)
                •   generic format
                •   Flexible and simple
                •   Compact (BAM)
                •   Allow indexing
                •   Load regions
                •   Support streaming




SAM
     • Header
                •   File format version information
                •   Sequence dictionary (name/length/..)
                •   Read group (platform/library/...)
                •   Program info
     • Body
                • Alignment information




SAM Header
     • '@' followed by record type (two characters)
                @HD   VN:1.0
                @SQ   SN:chr20 LN:62435964
                @RG   ID:L1 PU:SC_1_10 LB:SC_1 SM:NA12891
                @RG   ID:L2 PU:SC_2_12 LB:SC_2 SM:NA12891
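
The header layout above can be read with a few lines of Python. This is a minimal sketch, assuming tab-separated `TAG:VALUE` fields as in the example; the function name `parse_sam_header` is made up here, and free-text `@CO` comment lines are not handled.

```python
def parse_sam_header(lines):
    """Group header records by type: {"SQ": [{"SN": ..., "LN": ...}], ...}."""
    header = {}
    for line in lines:
        if not line.startswith("@"):
            break  # the alignment body starts here
        record_type, *fields = line.rstrip("\n").split("\t")
        # Each field is TAG:VALUE; split on the first colon only.
        tags = dict(field.split(":", 1) for field in fields)
        header.setdefault(record_type[1:], []).append(tags)
    return header


# The header lines from the slide, with explicit tab separators.
header_lines = [
    "@HD\tVN:1.0",
    "@SQ\tSN:chr20\tLN:62435964",
    "@RG\tID:L1\tPU:SC_1_10\tLB:SC_1\tSM:NA12891",
    "@RG\tID:L2\tPU:SC_2_12\tLB:SC_2\tSM:NA12891",
]
header = parse_sam_header(header_lines)
print(header["SQ"][0]["LN"])
print([rg["ID"] for rg in header["RG"]])
```

Record types that may occur more than once, such as `@SQ` and `@RG`, are kept as lists so that no entry overwrites another.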




SAM Alignment
     • Tab delimited lines

      read_28833_29006_6945 99 chr20 28833 20 10M1D25M = 28993 195 AGCT... <<<<... NM:i:1 RG:Z:L1
      read_28701_28881_323b 147 chr20 28834 30 35M = 28701 -168 ACCT... <<7;:... MF:i:18 RG:Z:L2
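
The two records above can be unpacked with a short Python sketch that splits a line into the 11 mandatory fields, decodes the FLAG bits and computes how many reference bases the CIGAR string covers. The field names follow the current SAM specification (older versions call the mate fields MRNM, MPOS and ISIZE); the helper names are made up, only the first eight flag bits are listed, and the sequence and quality strings stay truncated exactly as on the slide.

```python
import re

FLAG_BITS = {
    0x1: "paired",
    0x2: "proper pair",
    0x4: "unmapped",
    0x8: "mate unmapped",
    0x10: "reverse strand",
    0x20: "mate reverse strand",
    0x40: "first in pair",
    0x80: "second in pair",
}

FIELDS = ["QNAME", "FLAG", "RNAME", "POS", "MAPQ", "CIGAR",
          "RNEXT", "PNEXT", "TLEN", "SEQ", "QUAL"]


def parse_alignment(line):
    """Split one SAM body line into the mandatory fields plus optional tags."""
    columns = line.rstrip("\n").split("\t")
    record = dict(zip(FIELDS, columns[:11]))
    for key in ("FLAG", "POS", "MAPQ", "PNEXT", "TLEN"):
        record[key] = int(record[key])
    record["TAGS"] = columns[11:]
    return record


def decode_flag(flag):
    """List the names of the bits set in a SAM FLAG value."""
    return [name for bit, name in FLAG_BITS.items() if flag & bit]


def reference_span(cigar):
    """Reference bases covered: M, D, N, = and X consume the reference."""
    operations = re.findall(r"(\d+)([MIDNSHP=X])", cigar)
    return sum(int(length) for length, op in operations if op in "MDN=X")


# First record from the slide; SEQ and QUAL are truncated there too.
line = ("read_28833_29006_6945\t99\tchr20\t28833\t20\t10M1D25M\t=\t28993\t195\t"
        "AGCT...\t<<<<...\tNM:i:1\tRG:Z:L1")
record = parse_alignment(line)
flags = decode_flag(record["FLAG"])
span = reference_span(record["CIGAR"])
end = record["POS"] + span - 1  # last reference position covered (1-based)
print(flags)
print(span, end)
```

FLAG 99 decodes to a properly paired first read whose mate is on the reverse strand, and FLAG 147 of the second record to the matching reverse-strand second read. The one-base deletion in `10M1D25M` makes the alignment cover 36 reference bases.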




15   25.04.11     Assemblers
Tools
     • Mapping Reads
                • BWA
                • Bowtie
                • SSAHA2
     • Manipulate SAM/BAM
                • SAM Tools package
                • Picard




16   25.04.11            Assemblers
BWA
     • Burrows-Wheeler Alignment Tool
                • Map (single/paired-end/long) reads to a sequence
     • Index database
                • bwa index -a bwtsw database.fasta
     • Align reads
                • bwa aln database.fasta short_read.fastq > aln_sa.sai
     • Generate alignments
                •   bwa sampe database.fasta aln_sa1.sai aln_sa2.sai read1.fq read2.fq > aln.sam
     • Long reads
                • bwa bwasw database.fasta long_read.fastq > aln.sam
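
For paired-end data the steps above chain together as index, one `aln` run per read file, then `sampe`. The Python sketch below only builds the commands and prints them; the helper name `bwa_paired_end_commands` is hypothetical, and the file names are the placeholders used on the slide.

```python
def bwa_paired_end_commands(reference, read1, read2, out_sam="aln.sam"):
    """Return (argv, stdout file) pairs for index -> aln (twice) -> sampe."""
    sai1, sai2 = "aln_sa1.sai", "aln_sa2.sai"
    return [
        (["bwa", "index", "-a", "bwtsw", reference], None),
        # bwa aln writes to stdout, so each run is redirected to its own .sai file.
        (["bwa", "aln", reference, read1], sai1),
        (["bwa", "aln", reference, read2], sai2),
        (["bwa", "sampe", reference, sai1, sai2, read1, read2], out_sam),
    ]


steps = bwa_paired_end_commands("database.fasta", "read1.fq", "read2.fq")
for argv, stdout_file in steps:
    redirect = f" > {stdout_file}" if stdout_file else ""
    print(" ".join(argv) + redirect)
```

To run the steps rather than print them, each `argv` could be passed to `subprocess.run` with `stdout` set to the opened output file, assuming `bwa` is installed.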




17   25.04.11                 Assemblers
SAM tools
     • Utilities for SAM format
                • samtools <command> ...
     • Commands:
                •   view: convert SAM <-> BAM
                •   sort: sort a BAM file
                •   index: build a BAM file index
                •   merge: merge multiple BAM files
                •   pileup: output the alignment in pileup format
                •   tview: integrated text alignment viewer
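
A typical use of these commands is to turn a SAM file into a sorted, indexed BAM file that a viewer can load by region. The Python sketch below builds the three commands without running them. It assumes the option syntax of samtools 1.x (`view -b -o`, `sort -o`); the 0.1.x releases current when these slides were written expected `view -S` for SAM input and an output prefix for `sort`. The helper name and file names are hypothetical.

```python
def samtools_prepare_commands(sam_path, prefix="aln"):
    """Return argv lists for SAM -> BAM -> sorted BAM -> index (samtools 1.x syntax)."""
    bam = f"{prefix}.bam"
    sorted_bam = f"{prefix}.sorted.bam"
    return [
        ["samtools", "view", "-b", "-o", bam, sam_path],  # SAM to BAM
        ["samtools", "sort", "-o", sorted_bam, bam],      # sort by coordinate
        ["samtools", "index", sorted_bam],                # build the BAM index
    ]


commands = samtools_prepare_commands("aln.sam")
for argv in commands:
    print(" ".join(argv))
```

Sorting has to come before indexing, because the index relies on alignments being ordered by reference position.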




18   25.04.11               Assemblers
Visualisation                    Integrative Genomics Viewer
                                       http://www.broadinstitute.org/igv/
     • IGV
                • Good integration
     • Formats
                •   DAS
                •   BAM
                •   GFF
                •   ...
     • Tools
                • Run scripts
                • Export region
                • ...



19   25.04.11             Assemblers
Visualisation
                                   http://www.sanger.ac.uk/resources/software/artemis/

     • Artemis
          • Sequence Viewer
          • Annotation tool
     • Formats
          •     EMBL
          •     GENBANK
          •     GFF
          •     FASTA
          •     BAM




20   25.04.11         Assemblers
Mapping - Practical
     • Mapping reads + prepare for visualisation
                • BWA
                • samtools
     • Visualisation
                • IGV




21   25.04.11            Assemblers

Mais conteúdo relacionado

Semelhante a 2011-04-26_various-assemblers-presentation

CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...
CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...
CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...StampedeCon
 
Zero to the Cloud with @NetflixOSS
Zero to the Cloud with @NetflixOSSZero to the Cloud with @NetflixOSS
Zero to the Cloud with @NetflixOSSMike McGarr
 
Australian OpenStack User Group August 2012: Chef for OpenStack
Australian OpenStack User Group August 2012: Chef for OpenStackAustralian OpenStack User Group August 2012: Chef for OpenStack
Australian OpenStack User Group August 2012: Chef for OpenStackMatt Ray
 
PLAT-16 Using Enterprise Content in Grails
PLAT-16 Using Enterprise Content in GrailsPLAT-16 Using Enterprise Content in Grails
PLAT-16 Using Enterprise Content in GrailsAlfresco Software
 
Chef for OpenStack - OpenStack Fall 2012 Summit
Chef for OpenStack  - OpenStack Fall 2012 SummitChef for OpenStack  - OpenStack Fall 2012 Summit
Chef for OpenStack - OpenStack Fall 2012 SummitMatt Ray
 
Sutol How To Be A Lion Tamer
Sutol How To Be A Lion TamerSutol How To Be A Lion Tamer
Sutol How To Be A Lion TamerSharon James
 
Rails - getting started
Rails - getting startedRails - getting started
Rails - getting startedTrue North
 
JBoss AS 7 from a user perspective
JBoss AS 7 from a user perspectiveJBoss AS 7 from a user perspective
JBoss AS 7 from a user perspectiveMax Andersen
 
Java Night 2010 SteamCannon
Java Night 2010 SteamCannonJava Night 2010 SteamCannon
Java Night 2010 SteamCannonmarekgoldmann
 
Achieving Infrastructure Portability with Chef
Achieving Infrastructure Portability with ChefAchieving Infrastructure Portability with Chef
Achieving Infrastructure Portability with ChefMatt Ray
 
How to be a lion tamer
How to be a lion tamerHow to be a lion tamer
How to be a lion tamerWannes Rams
 
How to be a lion tamer
How to be a lion tamerHow to be a lion tamer
How to be a lion tamerSharon James
 
Implementing FaaS on Kubernetes using Kubeless
Implementing FaaS on Kubernetes using KubelessImplementing FaaS on Kubernetes using Kubeless
Implementing FaaS on Kubernetes using KubelessAhmed Misbah
 
OpenStack Deployments with Chef
OpenStack Deployments with ChefOpenStack Deployments with Chef
OpenStack Deployments with ChefMatt Ray
 
Play Framework and Activator
Play Framework and ActivatorPlay Framework and Activator
Play Framework and ActivatorKevin Webber
 
Optaros Surf Code Camp Introduction
Optaros Surf Code Camp IntroductionOptaros Surf Code Camp Introduction
Optaros Surf Code Camp IntroductionJeff Potts
 

Semelhante a 2011-04-26_various-assemblers-presentation (20)

CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...
CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...
CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...
 
Zero to the Cloud with @NetflixOSS
Zero to the Cloud with @NetflixOSSZero to the Cloud with @NetflixOSS
Zero to the Cloud with @NetflixOSS
 
Australian OpenStack User Group August 2012: Chef for OpenStack
Australian OpenStack User Group August 2012: Chef for OpenStackAustralian OpenStack User Group August 2012: Chef for OpenStack
Australian OpenStack User Group August 2012: Chef for OpenStack
 
PLAT-16 Using Enterprise Content in Grails
PLAT-16 Using Enterprise Content in GrailsPLAT-16 Using Enterprise Content in Grails
PLAT-16 Using Enterprise Content in Grails
 
Chef for OpenStack - OpenStack Fall 2012 Summit
Chef for OpenStack  - OpenStack Fall 2012 SummitChef for OpenStack  - OpenStack Fall 2012 Summit
Chef for OpenStack - OpenStack Fall 2012 Summit
 
Chef for OpenStack- Fall 2012.pdf
Chef for OpenStack- Fall 2012.pdfChef for OpenStack- Fall 2012.pdf
Chef for OpenStack- Fall 2012.pdf
 
Sutol How To Be A Lion Tamer
Sutol How To Be A Lion TamerSutol How To Be A Lion Tamer
Sutol How To Be A Lion Tamer
 
Rails - getting started
Rails - getting startedRails - getting started
Rails - getting started
 
JBoss AS 7 from a user perspective
JBoss AS 7 from a user perspectiveJBoss AS 7 from a user perspective
JBoss AS 7 from a user perspective
 
spring-cloud.pptx
spring-cloud.pptxspring-cloud.pptx
spring-cloud.pptx
 
Java Night 2010 SteamCannon
Java Night 2010 SteamCannonJava Night 2010 SteamCannon
Java Night 2010 SteamCannon
 
Achieving Infrastructure Portability with Chef
Achieving Infrastructure Portability with ChefAchieving Infrastructure Portability with Chef
Achieving Infrastructure Portability with Chef
 
How to be a lion tamer
How to be a lion tamerHow to be a lion tamer
How to be a lion tamer
 
How to be a lion tamer
How to be a lion tamerHow to be a lion tamer
How to be a lion tamer
 
Implementing FaaS on Kubernetes using Kubeless
Implementing FaaS on Kubernetes using KubelessImplementing FaaS on Kubernetes using Kubeless
Implementing FaaS on Kubernetes using Kubeless
 
OpenStack Deployments with Chef
OpenStack Deployments with ChefOpenStack Deployments with Chef
OpenStack Deployments with Chef
 
Inexpensive storage
Inexpensive storageInexpensive storage
Inexpensive storage
 
Play Framework and Activator
Play Framework and ActivatorPlay Framework and Activator
Play Framework and Activator
 
Django In The Real World
Django In The Real WorldDjango In The Real World
Django In The Real World
 
Optaros Surf Code Camp Introduction
Optaros Surf Code Camp IntroductionOptaros Surf Code Camp Introduction
Optaros Surf Code Camp Introduction
 

Último

ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationRosabel UA
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 

Último (20)

FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translation
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
