SlideShare uma empresa Scribd logo
1 de 19
Miloš Bošković Cell Processor Based Sequence Alignment
Prof. Vassil Alexandrov Janko Straßburg University of Reading, Aristotle University, University Carlos III European Commission Acknowledgments
Sequence Alignment on the Playstation 3 Six SPEs Smith-Waterman Algorithm only Accelerating Multiple SequenceAlignment with the Cell BE Processor Designed to accelerate a particular sequence alignment application Modeling and SchedulingWavefront  Computations on the Cell Broadband Engine Smith-Waterman Algorithm State of the art
Cell BroadBand Engine Cell Processor
SIMD approach Working with vectors Parallelisation Using multiple SPEs Cell Processor features
Sequence Alignment ,[object Object]
Used for aligning sequences of DNA nucleotides or amino acids (proteins)
Great amount of data, requiring lots of computational power,[object Object]
Needleman-Wunsch Scoring matrix Each cell’s value is based on its upper, left, and upper-left neighbour Main issue – data dependencies Sequence Alignment
Traceback      HEAGAWGHEE      -PA--WHEAE Sequence Alignment
Main issue – data dependencies Parallelisation
Two possible approaches Use the existing code and modify it Develop the code from scratch Chose the latter one Code development
One SPE – one row Each SPE one cell behind the previous one Not efficient DMA overhead Potential solution
Grouping cells into tiles Tile size 8 X 8 up to 64 x 64  Solution
Tiles grouped into blocks Each block is 16 tiles high or more Algorithm first covers one block, then moves to the next one Solution
Wavefront algorithm also applied on the tile level One antidiagonal – one or more vectors Vectorisation
Always try to transfer as much as possible Maximum transfer allowed – 16 KB Integer size – 4 B If tile size is 64, the transfer size is    64 X 64 X 2 = 8192 X 4 = 32768 B Solution – short integers New transfer size - 64 X 64 X 2 = 8192 X 2 = 	= 16 384 B = 16 KB Technical issues
Double buffering
Each SPE – two tiles Double buffering

Mais conteúdo relacionado

Semelhante a Cell Processor Based Sequence Alignment

Effects of change propagation resulting from adaptive preprocessing in multic...
Effects of change propagation resulting from adaptive preprocessing in multic...Effects of change propagation resulting from adaptive preprocessing in multic...
Effects of change propagation resulting from adaptive preprocessing in multic...
Manuel Martín
 

Semelhante a Cell Processor Based Sequence Alignment (20)

VLSI ARCHITECTURE FOR NANO WIRE BASED ADVANCED ENCRYPTION STANDARD (AES) WITH...
VLSI ARCHITECTURE FOR NANO WIRE BASED ADVANCED ENCRYPTION STANDARD (AES) WITH...VLSI ARCHITECTURE FOR NANO WIRE BASED ADVANCED ENCRYPTION STANDARD (AES) WITH...
VLSI ARCHITECTURE FOR NANO WIRE BASED ADVANCED ENCRYPTION STANDARD (AES) WITH...
 
IEEE 2014 JAVA NETWORKING PROJECTS Snapshot and continuous data collection in...
IEEE 2014 JAVA NETWORKING PROJECTS Snapshot and continuous data collection in...IEEE 2014 JAVA NETWORKING PROJECTS Snapshot and continuous data collection in...
IEEE 2014 JAVA NETWORKING PROJECTS Snapshot and continuous data collection in...
 
Ngs de novo assembly progresses and challenges
Ngs de novo assembly progresses and challengesNgs de novo assembly progresses and challenges
Ngs de novo assembly progresses and challenges
 
DefenseTalk_Trimmed
DefenseTalk_TrimmedDefenseTalk_Trimmed
DefenseTalk_Trimmed
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC SystemsImproving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
 
HACC: Fitting the Universe Inside a Supercomputer
HACC: Fitting the Universe Inside a SupercomputerHACC: Fitting the Universe Inside a Supercomputer
HACC: Fitting the Universe Inside a Supercomputer
 
Associative memory implementation with artificial neural networks
Associative memory implementation with artificial neural networksAssociative memory implementation with artificial neural networks
Associative memory implementation with artificial neural networks
 
Concurrent Replication of Parallel and Distributed Simulations
Concurrent Replication of Parallel and Distributed SimulationsConcurrent Replication of Parallel and Distributed Simulations
Concurrent Replication of Parallel and Distributed Simulations
 
Cliff sugerman
Cliff sugermanCliff sugerman
Cliff sugerman
 
A Parallel Algorithm Template for Updating Single-Source Shortest Paths in La...
A Parallel Algorithm Template for Updating Single-Source Shortest Paths in La...A Parallel Algorithm Template for Updating Single-Source Shortest Paths in La...
A Parallel Algorithm Template for Updating Single-Source Shortest Paths in La...
 
(Slides) A Method for Distributed Computaion of Semi-Optimal Multicast Tree i...
(Slides) A Method for Distributed Computaion of Semi-Optimal Multicast Tree i...(Slides) A Method for Distributed Computaion of Semi-Optimal Multicast Tree i...
(Slides) A Method for Distributed Computaion of Semi-Optimal Multicast Tree i...
 
Multiscale Dataflow Computing: Competitive Advantage at the Exascale Frontier
Multiscale Dataflow Computing: Competitive Advantage at the Exascale FrontierMultiscale Dataflow Computing: Competitive Advantage at the Exascale Frontier
Multiscale Dataflow Computing: Competitive Advantage at the Exascale Frontier
 
APPLICATION OF GENETIC ALGORITHM IN DESIGNING A SECURITY MODEL FOR MOBILE ADH...
APPLICATION OF GENETIC ALGORITHM IN DESIGNING A SECURITY MODEL FOR MOBILE ADH...APPLICATION OF GENETIC ALGORITHM IN DESIGNING A SECURITY MODEL FOR MOBILE ADH...
APPLICATION OF GENETIC ALGORITHM IN DESIGNING A SECURITY MODEL FOR MOBILE ADH...
 
AWS Customer Presentation- University of Maryland
AWS Customer Presentation- University of MarylandAWS Customer Presentation- University of Maryland
AWS Customer Presentation- University of Maryland
 
2005: A Matlab Tour on Artificial Immune Systems
2005: A Matlab Tour on Artificial Immune Systems2005: A Matlab Tour on Artificial Immune Systems
2005: A Matlab Tour on Artificial Immune Systems
 
Effects of change propagation resulting from adaptive preprocessing in multic...
Effects of change propagation resulting from adaptive preprocessing in multic...Effects of change propagation resulting from adaptive preprocessing in multic...
Effects of change propagation resulting from adaptive preprocessing in multic...
 
JPN1406 Snapshot and Continuous Data Collection in Probabilistic Wireless S...
JPN1406   Snapshot and Continuous Data Collection in Probabilistic Wireless S...JPN1406   Snapshot and Continuous Data Collection in Probabilistic Wireless S...
JPN1406 Snapshot and Continuous Data Collection in Probabilistic Wireless S...
 
Fault Detection AES
Fault Detection AESFault Detection AES
Fault Detection AES
 
PPT
PPTPPT
PPT
 

Mais de Miloš Bošković

Mais de Miloš Bošković (7)

Kutija za alate jednog (frontend) web developera
Kutija za alate jednog (frontend) web developeraKutija za alate jednog (frontend) web developera
Kutija za alate jednog (frontend) web developera
 
Should you move Up to TypeScript and why is the answer probably Yes???
Should you move Up to TypeScript and why is the answer probably Yes???Should you move Up to TypeScript and why is the answer probably Yes???
Should you move Up to TypeScript and why is the answer probably Yes???
 
Dinosaurusi su umreli od Angular2
Dinosaurusi su umreli od Angular2Dinosaurusi su umreli od Angular2
Dinosaurusi su umreli od Angular2
 
Freelancing
FreelancingFreelancing
Freelancing
 
AngularJs - From Heedless Meddler to Superheroic Assistant
AngularJs - From Heedless Meddler to Superheroic AssistantAngularJs - From Heedless Meddler to Superheroic Assistant
AngularJs - From Heedless Meddler to Superheroic Assistant
 
Angularizam kao takav u novijoj JS istoriji
Angularizam kao takav u novijoj JS istorijiAngularizam kao takav u novijoj JS istoriji
Angularizam kao takav u novijoj JS istoriji
 
AngularJs - Taming the Superhero
AngularJs - Taming the SuperheroAngularJs - Taming the Superhero
AngularJs - Taming the Superhero
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Último (20)

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 

Cell Processor Based Sequence Alignment

  • 1. Miloš Bošković Cell Processor Based Sequence Alignment
  • 2. Prof. Vassil Alexandrov Janko Straßburg University of Reading, Aristotle University, University Carlos III European Commission Acknowledgments
  • 3. Sequence Alignment on the Playstation 3 Six SPEs Smith-Waterman Algorithm only Accelerating Multiple SequenceAlignment with the Cell BE Processor Designed to accelerate a particular sequence alignment application Modeling and SchedulingWavefront Computations on the Cell Broadband Engine Smith-Waterman Algorithm State of the art
  • 4. Cell BroadBand Engine Cell Processor
  • 5. SIMD approach Working with vectors Parallelisation Using multiple SPEs Cell Processor features
  • 6.
  • 7. Used for aligning sequences of DNA nucleotides or amino acids (proteins)
  • 8.
  • 9. Needleman-Wunsch Scoring matrix Each cell’s value is based on its upper, left, and upper-left neighbour Main issue – data dependencies Sequence Alignment
  • 10. Traceback HEAGAWGHEE -PA--WHEAE Sequence Alignment
  • 11. Main issue – data dependencies Parallelisation
  • 12. Two possible approaches Use the existing code and modify it Develop the code from scratch Chose the latter one Code development
  • 13. One SPE – one row Each SPE one cell behind the previous one Not efficient DMA overhead Potential solution
  • 14. Grouping cells into tiles Tile size 8 X 8 up to 64 x 64 Solution
  • 15. Tiles grouped into blocks Each block is 16 tiles high or more Algorithm first covers one block, then moves to the next one Solution
  • 16. Wavefront algorithm also applied on the tile level One antidiagonal – one or more vectors Vectorisation
  • 17. Always try to transfer as much as possible Maximum transfer allowed – 16 KB Integer size – 4 B If tile size is 64, the transfer size is 64 X 64 X 2 = 8192 X 4 = 32768 B Solution – short integers New transfer size - 64 X 64 X 2 = 8192 X 2 = = 16 384 B = 16 KB Technical issues
  • 19. Each SPE – two tiles Double buffering
  • 20. Results (8 KB sequence size)
  • 21. It is possible to efficiently employ Cell Broadband Engine for Sequence alignment Further optimisation needed Reduction of context creations Inter-SPE communication Implementing sequence alignment across multiple pairs of sequences Using ALF – Accelerated Library Framework Conclusion
  • 22. Thank you for your time!
  • 23. Results (4 KB sequence size)
  • 24. Results (2 KB sequence size)