SlideShare uma empresa Scribd logo
1 de 25
Baixar para ler offline
Parallel Computing Scenarios
and the new challenges for the Software Architect




Fabrizio Giudici        Emmanuele Sordini
Tidalwave s.a.s.        BloomingStars.com
880
2




Goals of this presentation
> Parallel/distributed computing has been around for decades
  – Mainly in some niches (science, military, finance)
  – Seti@Home, BOINC, ... paved the way for the rest of the world
> It's going to become important for “general” software architects
  – Because of recent technology evolution
  – From (maybe) scared customers to new business opportunities
> Learn how to design WORA parallel applications
  – Exploiting parallelism in different scenarios
  – Good design, patterns, technologies
3




AGENDA
>   MOTIVATIONS
>   CASE STUDY
>   ARCHITECTURE
>   CONCLUSION
4




Motivations
> User's need grow
  – Demand for increased
     computing power
  – Multimedia processing
  – Also on the desktop
> Technological challenges
  – Multi-core computers (10s of cores        from Computer Architecture,
     in a few years, clock stable or              a Quantitative approach:
                                              Patterson & Hennessy, 2006
     decreasing)
> New opportunities
  – Easy “local mini grids“ (e.g Jini, Rio)
  – Massive grid computing as a
     service (e.g. Sun Grid)
5




Jini, Rio, Sun Grid
> Jini
  – A Java technology for creating “federations of services“
  – Spontaneous networking, discovery, mobile code
  – http://www.jini.org
> Rio
  – Based on Jini
  – Adds container / component paradigm, Quality of Service, etc...
  – http://rio.dev.java.net
> Sun Grid
  – A massive (1.000s nodes) grid platform accessible as a service
  – $1 / CPU / hour
  – Since May 2007 available in 24 countries outside USA
  – http://www.network.com
6




Parallel computing: easy or hard?
> Harder than single-thread computing
> Largely depends on the context
  – Course grain
  – Fine grain
> Future language enhancements?
  – Java syntax extensions
  – New languages
  – Virtual Machine optimizations (transparent to code?)
> Let's focus on what we have now
  – Design
  – Architecture
  – Patterns
7




An important point: ROI
> In the past distributed computing required ad-hoc hardware
  – Highly expensive
  – High start-up costs
  – Strive for high parallelism exploitation to justify costs
       50% efficiency with 100.000€ expense means 50.000€ wasted
> Today is different
  – Local “mini grids” can be easily set up with standard hardware
  – Large facilities available as a service on a “pay-per-hour” basis
> A different ROI policy
  – No or negligible start-up costs
  – Probably you can live with medium efficiency in parallelism exploitation
       50% efficiency on the Sun Grid means $1 per hour wasted
8




AGENDA
>   MOTIVATIONS
>   CASE STUDY
>   ARCHITECTURE
>   CONCLUSION
9




Case study
> Concrete problem and potential community
  – To be addressed with state-of-the art design
  – Focused on image processing, but many things are general purpose
> Different scenarios considered
  – Single core, multi-core
  – “Local mini grids” made with Jini and Rio
  – Sun Grid
> Mistral – the imaging framework
  – http://mistral.tidalwave.it
> Pleiades – the application
  – http://pleiades.bloomingstars.com
10




Hires imaging of solar system bodies
> For decades only with pro equipment
> Since 90s within the reach of amateurs
  – availability of decent quality optics
  – good cameras at reasonable prices
> Key concept
  – take multiple exposures
  – stack and align them
  – improve S/N ratio by averaging
> Bottom line: consumer's perspective fit with the scenario
  – Recall previous slide about users and computing power
11




Algorithm
12




Parallel decomposition
13




AGENDA
>   MOTIVATIONS
>   CASE STUDY
>   ARCHITECTURE
>   CONCLUSION
14




Mistral
> Abstract Imaging layer
  – Wraps Java2D, JAI, ImageJ, others
> Flexible
  – Operations and/or engines can be plugged in
> Versatile Imaging Processor
  – Based on the Master/Worker pattern
> Built-in support for profiling
  – Benchmarking is a must
15




Mistral ImageProcessor
> Master / Worker pattern
> Polymorphic implementation
  – Local (multi core)
  – Rio
  – Sun Grid
> Images wrapped in an opaque
  container (EditableImage)
16




Phase controller
reference = createReference();

for each image
  {
    schedule new RegisterTask(image, reference);
  }

when (at least 2 RegisterTasks completed)
  {
    image1 = registerTask1.getResult();
    image2 = registerTask2.getResult();
    schedule new AddTask(image1, image2);
  }

when (at least 2 AddTasks completed)
  {
    image1 = addTask1.getResult();
    image2 = addTask2.getResult();
    schedule new AddTask(image1, image2);
  }

// Detects last phase
when (no more pending tasks && only 1 completed AddTask has not been processed)
  {
    sumImage = addTask.getResult();
    normalize(sumImage);
    return;
  }
17




Best practices
> Fallacies #2, #3 of distributed computing
  – “Latency is zero”
  – “Bandwidth is infinite”
> In other words
  – Don't pretend the network is not there
  – Serialization overhead is an issue
> Memory is also a problem
  – Memory is not infinite
  – Typical of imaging or large-datasets problems
  – Was not an issue with sequential processing
       Queuing intermediate results vs process one by one
18




Adapt to scenarios
> Consume results as soon as possible
  – Avoid accumulation of idle results in queues
  – Post tasks with priority
> Adapt the most to the environment capabilities
  – Synthetic images are re-computed at each node
  – Filesystem-based exchange is used when NFS is available (e.g. Sun Grid)
  – Distributed image cache otherwise
  – Optimized routing
19




EditableImage serialization
> Opaque holder of the real image
> Each instance has its own unique UUID
  – Serializable – but only moves UUID around
> NFS implementation
  – Each serialized image gets stored on the disk
  – Upon use on a remote node, it is loaded from the disk
> Otherwise
  – Images are pulled from a distributed cache
  – Pull vs push approach for cache
20




Optimized routing
> Optimize scheduling to minimize data motion
  – Multi-phase: the operands come from a previous phase
  – For instance, while adding two images
> The scheduler
  – Queries the pending Task about the needed images
  – Looks up the distributed image cache
  – Finds the node where most of needed images are
  – Schedule the task to that node
21




AGENDA
>   MOTIVATIONS
>   CASE STUDY
>   ARCHITECTURE
>   CONCLUSION
22




Conclusion
> Parallel computing is coming among us
  – Hard, but in reach
  – It's an opportunity, not a scary thing
> In different fashions
  – Multi core
  – Local mini grids (Jini, Rio)
  – Massive grids as a facility (Sun Grid)
> Can be dealt with more patterns in our catalog
> You don't need always the “optimal” solution
  – Just pull most out of the CPUs with trade-off design
23




Desktop too – and demo
> Mistral and Pleiades
  integrated in blueMarine, a
  desktop photo application
> Demos at Jazoon:
  – “blueMarine - a desktop
      app for the open source
      photographic workflow“
  – Tuesday, 2007-06-26,
      12:00 - 12:50, Arena 2
  – Thursday, 2007-06-28,
      14:00 - 14:50, Arena 1
24




Q&A
> Question time
Fabrizio Giudici    www.tidalwave.it
Tidalwave s.a.s     fabrizio.giudici@tidalwave.it


Emmanuele Sordini   www.bloomingstars.com
                    emmanuele@sordini.com

Mais conteúdo relacionado

Mais procurados

Graphic Processing Unit
Graphic Processing UnitGraphic Processing Unit
Graphic Processing Unit
Kamran Ashraf
 

Mais procurados (12)

Gpu and The Brick Wall
Gpu and The Brick WallGpu and The Brick Wall
Gpu and The Brick Wall
 
graphics processing unit ppt
graphics processing unit pptgraphics processing unit ppt
graphics processing unit ppt
 
Cloud computing technology
Cloud computing technologyCloud computing technology
Cloud computing technology
 
Nvidia (History, GPU Architecture and New Pascal Architecture)
Nvidia (History, GPU Architecture and New Pascal Architecture)Nvidia (History, GPU Architecture and New Pascal Architecture)
Nvidia (History, GPU Architecture and New Pascal Architecture)
 
GRAPHICS PROCESSING UNIT (GPU)
GRAPHICS PROCESSING UNIT (GPU)GRAPHICS PROCESSING UNIT (GPU)
GRAPHICS PROCESSING UNIT (GPU)
 
Virtualization in cloud computing
Virtualization in cloud computingVirtualization in cloud computing
Virtualization in cloud computing
 
Gpu Systems
Gpu SystemsGpu Systems
Gpu Systems
 
Graphics processing unit ppt
Graphics processing unit pptGraphics processing unit ppt
Graphics processing unit ppt
 
GPU Algorithms and trends 2018
GPU Algorithms and trends 2018GPU Algorithms and trends 2018
GPU Algorithms and trends 2018
 
High performance computing with accelarators
High performance computing with accelaratorsHigh performance computing with accelarators
High performance computing with accelarators
 
Graphic Processing Unit
Graphic Processing UnitGraphic Processing Unit
Graphic Processing Unit
 
Lec04 gpu architecture
Lec04 gpu architectureLec04 gpu architecture
Lec04 gpu architecture
 

Destaque (10)

AMS 250 - High-Performance, Massively Parallel Computing with FLASH
AMS 250 - High-Performance, Massively Parallel Computing with FLASH AMS 250 - High-Performance, Massively Parallel Computing with FLASH
AMS 250 - High-Performance, Massively Parallel Computing with FLASH
 
GPU Computing
GPU ComputingGPU Computing
GPU Computing
 
Parallel architecture-programming
Parallel architecture-programmingParallel architecture-programming
Parallel architecture-programming
 
Full introduction to_parallel_computing
Full introduction to_parallel_computingFull introduction to_parallel_computing
Full introduction to_parallel_computing
 
Parallel Wireless: Rural HetNet Case Study
Parallel Wireless: Rural HetNet Case StudyParallel Wireless: Rural HetNet Case Study
Parallel Wireless: Rural HetNet Case Study
 
Router architectures in no c
Router architectures in no cRouter architectures in no c
Router architectures in no c
 
Chat application
Chat applicationChat application
Chat application
 
Parallel Computing
Parallel ComputingParallel Computing
Parallel Computing
 
SRS FOR CHAT APPLICATION
SRS FOR CHAT APPLICATIONSRS FOR CHAT APPLICATION
SRS FOR CHAT APPLICATION
 
A project report on chat application
A project report on chat applicationA project report on chat application
A project report on chat application
 

Semelhante a Parallel Computing Scenarios and the new challenges for the Software Architect

blueMarine a desktop app for the open source photographic workflow
blueMarine  a desktop app for the open source photographic workflowblueMarine  a desktop app for the open source photographic workflow
blueMarine a desktop app for the open source photographic workflow
Fabrizio Giudici
 
EEDC Intelligent Placement of Datacenters
EEDC Intelligent Placement of DatacentersEEDC Intelligent Placement of Datacenters
EEDC Intelligent Placement of Datacenters
Roger Rafanell Mas
 
The world is the computer and the programmer is you
The world is the computer and the programmer is youThe world is the computer and the programmer is you
The world is the computer and the programmer is you
Davide Carboni
 
Introduction to NetBSD kernel
Introduction to NetBSD kernelIntroduction to NetBSD kernel
Introduction to NetBSD kernel
Mahendra M
 

Semelhante a Parallel Computing Scenarios and the new challenges for the Software Architect (20)

blueMarine a desktop app for the open source photographic workflow
blueMarine  a desktop app for the open source photographic workflowblueMarine  a desktop app for the open source photographic workflow
blueMarine a desktop app for the open source photographic workflow
 
OpenNebula out in the Open, Ander Astudillo, SURFsara
OpenNebula out in the Open, Ander Astudillo, SURFsaraOpenNebula out in the Open, Ander Astudillo, SURFsara
OpenNebula out in the Open, Ander Astudillo, SURFsara
 
A Container-based Sizing Framework for Apache Hadoop/Spark Clusters
A Container-based Sizing Framework for Apache Hadoop/Spark ClustersA Container-based Sizing Framework for Apache Hadoop/Spark Clusters
A Container-based Sizing Framework for Apache Hadoop/Spark Clusters
 
Introduction to parallel_computing
Introduction to parallel_computingIntroduction to parallel_computing
Introduction to parallel_computing
 
GPU Renderfarm with Integrated Asset Management & Production System (AMPS)
GPU Renderfarm with Integrated Asset Management & Production System (AMPS)GPU Renderfarm with Integrated Asset Management & Production System (AMPS)
GPU Renderfarm with Integrated Asset Management & Production System (AMPS)
 
Expectations for optical network from the viewpoint of system software research
Expectations for optical network from the viewpoint of system software researchExpectations for optical network from the viewpoint of system software research
Expectations for optical network from the viewpoint of system software research
 
EEDC Intelligent Placement of Datacenters
EEDC Intelligent Placement of DatacentersEEDC Intelligent Placement of Datacenters
EEDC Intelligent Placement of Datacenters
 
Journey to Containerized Application / Google Container Engine
Journey to Containerized Application / Google Container EngineJourney to Containerized Application / Google Container Engine
Journey to Containerized Application / Google Container Engine
 
Exascale Capabl
Exascale CapablExascale Capabl
Exascale Capabl
 
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...
 
Unit-1 ESD.pptx
Unit-1 ESD.pptxUnit-1 ESD.pptx
Unit-1 ESD.pptx
 
PACT19, MOSAIC : Heterogeneity-, Communication-, and Constraint-Aware Model ...
PACT19, MOSAIC : Heterogeneity-, Communication-, and Constraint-Aware Model ...PACT19, MOSAIC : Heterogeneity-, Communication-, and Constraint-Aware Model ...
PACT19, MOSAIC : Heterogeneity-, Communication-, and Constraint-Aware Model ...
 
数据中心网络研究:机遇与挑战
数据中心网络研究:机遇与挑战数据中心网络研究:机遇与挑战
数据中心网络研究:机遇与挑战
 
The world is the computer and the programmer is you
The world is the computer and the programmer is youThe world is the computer and the programmer is you
The world is the computer and the programmer is you
 
Introduction to NetBSD kernel
Introduction to NetBSD kernelIntroduction to NetBSD kernel
Introduction to NetBSD kernel
 
Cloud Computing Berkeley.pdf
Cloud Computing Berkeley.pdfCloud Computing Berkeley.pdf
Cloud Computing Berkeley.pdf
 
Building a Front End for a Sensor Data Cloud
Building a Front End for a Sensor Data CloudBuilding a Front End for a Sensor Data Cloud
Building a Front End for a Sensor Data Cloud
 
Virtualisation For Network Testing & Staff Training
Virtualisation For Network Testing & Staff TrainingVirtualisation For Network Testing & Staff Training
Virtualisation For Network Testing & Staff Training
 
OpenDS_Jazoon2010
OpenDS_Jazoon2010OpenDS_Jazoon2010
OpenDS_Jazoon2010
 
Performance and Scale: Billions of request per day (DDD2019)
Performance and Scale: Billions of request per day (DDD2019)Performance and Scale: Billions of request per day (DDD2019)
Performance and Scale: Billions of request per day (DDD2019)
 

Mais de Fabrizio Giudici

NOSQL also means RDF stores: an Android case study
NOSQL also means RDF stores: an Android case studyNOSQL also means RDF stores: an Android case study
NOSQL also means RDF stores: an Android case study
Fabrizio Giudici
 
Tools for an effective software factory
Tools for an effective software factoryTools for an effective software factory
Tools for an effective software factory
Fabrizio Giudici
 
blueMarine photographic workflow with Java
blueMarine photographic workflow with JavablueMarine photographic workflow with Java
blueMarine photographic workflow with Java
Fabrizio Giudici
 
blueMarine Sailing with NetBeans Platform
blueMarine Sailing with NetBeans PlatformblueMarine Sailing with NetBeans Platform
blueMarine Sailing with NetBeans Platform
Fabrizio Giudici
 
NASA World Wind for Java API Overview
NASA World Wind for Java  API OverviewNASA World Wind for Java  API Overview
NASA World Wind for Java API Overview
Fabrizio Giudici
 
Rich Internet Applications con JavaFX e NetBeans
Rich Internet Applications  con JavaFX e NetBeans Rich Internet Applications  con JavaFX e NetBeans
Rich Internet Applications con JavaFX e NetBeans
Fabrizio Giudici
 
Web Development with Apache Struts 2
Web Development with  Apache Struts 2Web Development with  Apache Struts 2
Web Development with Apache Struts 2
Fabrizio Giudici
 
blueMarine Or Why You Should Really Ship Swing Applications
blueMarine  Or Why You Should Really Ship Swing  Applications blueMarine  Or Why You Should Really Ship Swing  Applications
blueMarine Or Why You Should Really Ship Swing Applications
Fabrizio Giudici
 
Designing a JavaFX Mobile application
Designing a JavaFX Mobile applicationDesigning a JavaFX Mobile application
Designing a JavaFX Mobile application
Fabrizio Giudici
 

Mais de Fabrizio Giudici (16)

Building Android apps with Maven
Building Android apps with MavenBuilding Android apps with Maven
Building Android apps with Maven
 
DCI - Data, Context and Interaction @ Jug Lugano May 2011
DCI - Data, Context and Interaction @ Jug Lugano May 2011 DCI - Data, Context and Interaction @ Jug Lugano May 2011
DCI - Data, Context and Interaction @ Jug Lugano May 2011
 
DCI - Data, Context and Interaction @ Jug Genova April 2011
DCI - Data, Context and Interaction @ Jug Genova April 2011DCI - Data, Context and Interaction @ Jug Genova April 2011
DCI - Data, Context and Interaction @ Jug Genova April 2011
 
NOSQL also means RDF stores: an Android case study
NOSQL also means RDF stores: an Android case studyNOSQL also means RDF stores: an Android case study
NOSQL also means RDF stores: an Android case study
 
Netbeans+platform+maven
Netbeans+platform+mavenNetbeans+platform+maven
Netbeans+platform+maven
 
Tools for an effective software factory
Tools for an effective software factoryTools for an effective software factory
Tools for an effective software factory
 
blueMarine photographic workflow with Java
blueMarine photographic workflow with JavablueMarine photographic workflow with Java
blueMarine photographic workflow with Java
 
blueMarine Sailing with NetBeans Platform
blueMarine Sailing with NetBeans PlatformblueMarine Sailing with NetBeans Platform
blueMarine Sailing with NetBeans Platform
 
NASA World Wind for Java API Overview
NASA World Wind for Java  API OverviewNASA World Wind for Java  API Overview
NASA World Wind for Java API Overview
 
Rich Internet Applications con JavaFX e NetBeans
Rich Internet Applications  con JavaFX e NetBeans Rich Internet Applications  con JavaFX e NetBeans
Rich Internet Applications con JavaFX e NetBeans
 
The VRC Project
The VRC ProjectThe VRC Project
The VRC Project
 
Web Development with Apache Struts 2
Web Development with  Apache Struts 2Web Development with  Apache Struts 2
Web Development with Apache Struts 2
 
blueMarine Or Why You Should Really Ship Swing Applications
blueMarine  Or Why You Should Really Ship Swing  Applications blueMarine  Or Why You Should Really Ship Swing  Applications
blueMarine Or Why You Should Really Ship Swing Applications
 
Designing a JavaFX Mobile application
Designing a JavaFX Mobile applicationDesigning a JavaFX Mobile application
Designing a JavaFX Mobile application
 
Android java fx-jme@jug-lugano
Android java fx-jme@jug-luganoAndroid java fx-jme@jug-lugano
Android java fx-jme@jug-lugano
 
Mercurial
MercurialMercurial
Mercurial
 

Último

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 

Parallel Computing Scenarios and the new challenges for the Software Architect

  • 1. Parallel Computing Scenarios and the new challenges for the Software Architect Fabrizio Giudici Emmanuele Sordini Tidalwave s.a.s. BloomingStars.com 880
  • 2. 2 Goals of this presentation > Parallel/distributed computing has been around for decades – Mainly in some niches (science, military, finance) – Seti@Home, BOINC, ... paved the way for the rest of the world > It's going to become important for “general” software architects – Because of recent technology evolution – From (maybe) scared customers to new business opportunities > Learn how to design WORA parallel applications – Exploiting parallelism in different scenarios – Good design, patterns, technologies
  • 3. 3 AGENDA > MOTIVATIONS > CASE STUDY > ARCHITECTURE > CONCLUSION
  • 4. 4 Motivations > User's need grow – Demand for increased computing power – Multimedia processing – Also on the desktop > Technological challenges – Multi-core computers (10s of cores from Computer Architecture, in a few years, clock stable or a Quantitative approach: Patterson & Hennessy, 2006 decreasing) > New opportunities – Easy “local mini grids“ (e.g Jini, Rio) – Massive grid computing as a service (e.g. Sun Grid)
  • 5. 5 Jini, Rio, Sun Grid > Jini – A Java technology for creating “federations of services“ – Spontaneous networking, discovery, mobile code – http://www.jini.org > Rio – Based on Jini – Adds container / component paradigm, Quality of Service, etc... – http://rio.dev.java.net > Sun Grid – A massive (1.000s nodes) grid platform accessible as a service – $1 / CPU / hour – Since May 2007 available in 24 countries outside USA – http://www.network.com
  • 6. 6 Parallel computing: easy or hard? > Harder than single-thread computing > Largely depends on the context – Course grain – Fine grain > Future language enhancements? – Java syntax extensions – New languages – Virtual Machine optimizations (transparent to code?) > Let's focus on what we have now – Design – Architecture – Patterns
  • 7. 7 An important point: ROI > In the past distributed computing required ad-hoc hardware – Highly expensive – High start-up costs – Strive for high parallelism exploitation to justify costs  50% efficiency with 100.000€ expense means 50.000€ wasted > Today is different – Local “mini grids” can be easily set up with standard hardware – Large facilities available as a service on a “pay-per-hour” basis > A different ROI policy – No or negligible start-up costs – Probably you can live with medium efficiency in parallelism exploitation  50% efficiency on the Sun Grid means $1 per hour wasted
  • 8. 8 AGENDA > MOTIVATIONS > CASE STUDY > ARCHITECTURE > CONCLUSION
  • 9. 9 Case study > Concrete problem and potential community – To be addressed with state-of-the art design – Focused on image processing, but many things are general purpose > Different scenarios considered – Single core, multi-core – “Local mini grids” made with Jini and Rio – Sun Grid > Mistral – the imaging framework – http://mistral.tidalwave.it > Pleiades – the application – http://pleiades.bloomingstars.com
  • 10. 10 Hires imaging of solar system bodies > For decades only with pro equipment > Since 90s within the reach of amateurs – availability of decent quality optics – good cameras at reasonable prices > Key concept – take multiple exposures – stack and align them – improve S/N ratio by averaging > Bottom line: consumer's perspective fit with the scenario – Recall previous slide about users and computing power
  • 13. 13 AGENDA > MOTIVATIONS > CASE STUDY > ARCHITECTURE > CONCLUSION
  • 14. 14 Mistral > Abstract Imaging layer – Wraps Java2D, JAI, ImageJ, others > Flexible – Operations and/or engines can be plugged in > Versatile Imaging Processor – Based on the Master/Worker pattern > Built-in support for profiling – Benchmarking is a must
  • 15. 15 Mistral ImageProcessor > Master / Worker pattern > Polymorphic implementation – Local (multi core) – Rio – Sun Grid > Images wrapped in an opaque container (EditableImage)
  • 16. 16 Phase controller reference = createReference(); for each image { schedule new RegisterTask(image, reference); } when (at least 2 RegisterTasks completed) { image1 = registerTask1.getResult(); image2 = registerTask2.getResult(); schedule new AddTask(image1, image2); } when (at least 2 AddTasks completed) { image1 = addTask1.getResult(); image2 = addTask2.getResult(); schedule new AddTask(image1, image2); } // Detects last phase when (no more pending tasks && only 1 completed AddTask has not been processed) { sumImage = addTask.getResult(); normalize(sumImage); return; }
  • 17. 17 Best practices > Fallacies #2, #3 of distributed computing – “Latency is zero” – “Bandwidth is infinite” > In other words – Don't pretend the network is not there – Serialization overhead is an issue > Memory is also a problem – Memory is not infinite – Typical of imaging or large-datasets problems – Was not an issue with sequential processing  Queuing intermediate results vs process one by one
  • 18. 18 Adapt to scenarios > Consume results as soon as possible – Avoid accumulation of idle results in queues – Post tasks with priority > Adapt the most to the environment capabilities – Synthetic images are re-computed at each node – Filesystem-based exchange is used when NFS is available (e.g. Sun Grid) – Distributed image cache otherwise – Optimized routing
  • 19. 19 EditableImage serialization > Opaque holder of the real image > Each instance has its own unique UUID – Serializable – but only moves UUID around > NFS implementation – Each serialized image gets stored on the disk – Upon use on a remote node, it is loaded from the disk > Otherwise – Images are pulled from a distributed cache – Pull vs push approach for cache
  • 20. 20 Optimized routing > Optimize scheduling to minimize data motion – Multi-phase: the operands come from a previous phase – For instance, while adding two images > The scheduler – Queries the pending Task about the needed images – Looks up the distributed image cache – Finds the node where most of needed images are – Schedule the task to that node
  • 21. 21 AGENDA > MOTIVATIONS > CASE STUDY > ARCHITECTURE > CONCLUSION
  • 22. 22 Conclusion > Parallel computing is coming among us – Hard, but in reach – It's an opportunity, not a scary thing > In different fashions – Multi core – Local mini grids (Jini, Rio) – Massive grids as a facility (Sun Grid) > Can be dealt with more patterns in our catalog > You don't need always the “optimal” solution – Just pull most out of the CPUs with trade-off design
  • 23. 23 Desktop too – and demo > Mistral and Pleiades integrated in blueMarine, a desktop photo application > Demos at Jazoon: – “blueMarine - a desktop app for the open source photographic workflow“ – Tuesday, 2007-06-26, 12:00 - 12:50, Arena 2 – Thursday, 2007-06-28, 14:00 - 14:50, Arena 1
  • 25. Fabrizio Giudici www.tidalwave.it Tidalwave s.a.s fabrizio.giudici@tidalwave.it Emmanuele Sordini www.bloomingstars.com emmanuele@sordini.com