SlideShare uma empresa Scribd logo
1 de 50
Baixar para ler offline
High-Throughput
Scientific Computing
        Hanspeter Pfister
    pfister@seas.harvard.edu
Themes
• How is the brain wired?
• How did the Universe start?
How is the brain wired?
      The Connectome Project
Connectome Team
• Harvard Center for Brain Science
  – Jeff Lichtman & Clay Reid
• Microsoft Research / UW
  – Michael Cohen
• Kitware Inc.
  – Will Schroeder, Charles Law, Rusty Blue
• VRVis Vienna
  – Markus Hadwiger, Johanna Beyer
• IIC
  – Amelio Vazquez, Eric Miller (Tufts)
  – Won-Ki Seung, Hanspeter Pfister
The Scientific Challenge




                      composite from Roe et al. 1989,
                    Sutton and Brunso-Bechtold 1991
Confocal Microscopy:
             Brainbow




Adapted from OlympusConfocal.com
Electron Microscopy:
      ATLUM
Serial Sectioning
                  y
    x


z



    .
    .
    .                                          Section i, i   (1, …,N)
        Adapted from http://parasol.tamu.edu
              Texas A&M University
40,000x40,000 pixels
          1.6 GB
  120x120 µm (3 nm/pixel)


Here shown 40x undersampled




                              6 15mu EM big view
5 8mu rlp
4 3mu rlp
3 1mu rlp
2 300 nm rlp
The Data Challenge
• 1 mm3           ~= mouse thalamus ~= 1 petabyte
• 1 cm            ~= mouse brain               ~= 1 exabyte
        3


• 1000 cm         ~= human brain               ~= 1 zettabyte
              3



  All of Google’s world-wide storage today ~= 1 exabyte
Addressing the Data
     Challenge
• Multi-Scale Imaging
• Hierarchical Data Representation
• Distributed Heterogeneous Computing
 • Visualization
 • Segmentation
 • Analysis
Addressing the Data
     Challenge
• Multi-Scale Imaging
• Hierarchical Data Representation
• Distributed Heterogeneous Computing
 • Visualization
 • Segmentation
 • Analysis
Direct Volume Rendering




              MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Ray Casting
• Image-order ray shooting
 • Interpolate
 • Assign color & opacity
 • Composite
• Simple to implement
• Very flexible
  (adaptive sampling, …)
• Correct perspective




                            MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Transfer Functions
• Mapping of density to optical properties
• Simplest: color table with opacity over density




                           MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Connectome: EM Data




             MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Single-Pass Ray Casting
• Enabled by conditional loops
• Substitute multiple passes with single loop and early
  loop exit


• Volume rendering example
  in NVIDIA CUDA SDK
  (procedural ray setup)




                           MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Basic Ray Setup / Termination
• Two main approaches:

 • Procedural ray/box intersection
  [Röttger et al., 2003], [Green, 2004]

 • Rasterize bounding box
  [Krüger and Westermann, 2003]




                               MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Procedural Ray Setup / Term.
• Procedural ray / box intersection
  • Everything handled in
    fragment shader

• Ray given by camera position
  and volume entry position
• Exit criterion needed


• Pro: simple and self-contained
• Con: full load on fragment shader



                            MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
quot;Image-Basedquot; Ray Setup / Term.
• Rasterize bounding box
  front faces and back faces

• Ray start positions:
  front faces
• Direction vectors:
  back faces − front faces


         -           =

• Independent of projection (orthogonal/perspective)

                             MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Kernel
•   Image-based
    ray setup
    •   Ray start image
    •   Direction image




•   Ray-cast loop
    •   Sample volume
    •   Accumulate
        color and opacity




•   Terminate

•   Store output




                            MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Standard Ray Casting Optim. (1)
Early ray termination
  • Isosurfaces:
    stop when surface hit
  • Direct volume rendering:
    stop when opacity >= threshold

• Several possibilities
  • Current GPUs: early loop exit works well




                           MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Standard Ray Casting Optim. (2)
Empty space skipping
  • Skip transparent samples
  • Depends on transfer function
  • Start casting close to first hit

• Several possibilities
  • Per-sample check of opacity (expensive)
  • Hierarchical data store (e.g., octree with stack-less
    traversal [Gobbetti et al., 2008] )

  • These are image-order:
    what about object-order?


                            MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Object-Order Empty Space Skip. (1)
• Modify initial rasterization step




rasterize bounding box   rasterize “tightquot; bounding geometry
                            MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Object-Order Empty Space Skip. (2)
• Store min-max values of volume blocks
• Cull blocks against transfer function or isovalue
• Rasterize front and back faces of active blocks




                           MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Connectome: Fluorescence Data




              MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Connectome: Implicit Surfaces




              MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
Addressing the Data
     Challenge
• Multi-Scale Imaging
• Hierarchical Data Representation
• Distributed Heterogeneous Computing
 • Visualization
 • Segmentation
 • Analysis
Active Ribbons
Active Ribbon:
A set of two non-intersecting and
coupled Active Contours

Active Contour: 
Deformable closed curve that can be
used to segment objects in an image
                                      Outer Active Inner Active
                                                     Contour
                                       Contour

                                          Active Ribbon
Results (Matlab)
Axon Segmentation
Interactive Analysis
How did the Universe
       start?
                  The MWA Project

Kevin Dale, Richard Edgar, Daniel Mitchell, Randall Wayth,
       Lincoln Greenhill, and Hanspeter Pfister
MWA CfA / IIC Team
• Harvard Center for Astrophysics /
  Smithsonian Astrophysical
  Observatory
  –   Lincoln Greenhill
  –   Daniel Mitchell
  –   Randall Wayth
  –   Stephen Ord
• IIC / SEAS
  – Richard Edgar
  – Kevin Dale, Hanspeter Pfister
The Scientific Goals
• Epoch of Re-                       ionized

  Inonisation (EOR)                  neutral


• Heliospheric and
                                      (H)




                         The “Gap”
  Ionospheric
• Transient detection
• Pulsars, Surveys,
  Interstellar Medium,               ionized


  Galactic Magnetic
  Field, …
The Murchison Widefield Array (MWA)



•   Located in the remote Australian
    outback
•   Extremely wide fields of view for radio
    astronomy in the 80-300 MHz band
•   512 tiles, each a 4x4 array of dipoles,
    scattered over ~ 1.5 km
•   Data center for real-time processing
    co-located with the array


                                           http://www.haystack.mit.edu/ast/arrays/mwa/index.html
© Murchison Wide-field Array Project
(MIT/Harvard/Smithsonian/ANU/Curtin U./U.Melb./UWA/CSIRO)
© Murchison Wide-field Array Project
(MIT/Harvard/Smithsonian/ANU/Curtin U./U.Melb./UWA/CSIRO)
Calibration


                                  Ionospheric offsets




  Ungridded
visibilities with
bright sources
    peeled
                                                        Imaging
The Data Rate
                     Challenge
             ent
v. p             ang
     arall             led               Calibration Loop
          el c
               om
                   put
                         atio
                              n

              FFT
              Averaging ( !)
              Gridding
              Vector Rotation              Mapping
                                    (1) GB/s
16 GB/s
                                                            Science
                                  8s cadence
0.5s cadence
Implementation
•   Hardware

    •   2.4 GHz dual-core AMD Opteron, 4GB RAM

    •   NVIDIA Quadro FX 5600

•   Software

    •   AMD Core Math Library (ACML)

    •   NVIDIA CUDA (CUBLAS, CUFFT)

    •   OpenGL
Single-GPU Speedup
                                           CPUGPU speedup
                                                               Image Formation
                           Imaging


Mostly OpenGL            Gridding *


              UnpeelTileResponse

                                                               Calibration Loop
                 PeelTileResponse


                ReRotateVisibilities


             MeasureTileResponse


         MeasureIonosphericOffset


    RotateAndAccumulateVisibilities


                                       0   10   20   30   40    50   60   70
Example Results
      •   Noisy images from test data




GPU                               Reference
Scaling to a Cluster
• 1000 frequency channels, 65 sources every
  8 seconds, and 16002 output image
• 20-40 frequencies / GPU
• 32-64 GPUs, i.e., 16 Tesla S1070s
• Need MPI for internal data transfer
Conclusions
• GPUs enable high-throughput scientific
  computing
• Performance gains of 10-100x
• CUDA makes life easier (but not perfect)
• Rasterization / OpenGL still useful
• Need CUDA MPI for clusters

Mais conteúdo relacionado

Mais procurados

極紫外線散射儀於先進製程檢測應用
極紫外線散射儀於先進製程檢測應用極紫外線散射儀於先進製程檢測應用
極紫外線散射儀於先進製程檢測應用CHENHuiMei
 
Real Time Human Posture Detection with Multiple Depth Sensors
Real Time Human Posture Detection with Multiple Depth SensorsReal Time Human Posture Detection with Multiple Depth Sensors
Real Time Human Posture Detection with Multiple Depth SensorsWassim Filali
 
Chen Liu_Laser scanning damage detection - Copy
Chen Liu_Laser scanning damage detection - CopyChen Liu_Laser scanning damage detection - Copy
Chen Liu_Laser scanning damage detection - CopyChen Liu
 
CERN: Machine Protection Systems
CERN: Machine Protection SystemsCERN: Machine Protection Systems
CERN: Machine Protection Systemsdrbtodd
 
Panoramic Video in Environmental Monitoring Software Development and Applica...
Panoramic Video in Environmental Monitoring Software Development and Applica...Panoramic Video in Environmental Monitoring Software Development and Applica...
Panoramic Video in Environmental Monitoring Software Development and Applica...pycontw
 
Using Photonics to Prototype the Research Campus Infrastructure of the Future...
Using Photonics to Prototype the Research Campus Infrastructure of the Future...Using Photonics to Prototype the Research Campus Infrastructure of the Future...
Using Photonics to Prototype the Research Campus Infrastructure of the Future...Larry Smarr
 
Francisco Guinea-Recent advances in graphene research
Francisco Guinea-Recent advances in graphene researchFrancisco Guinea-Recent advances in graphene research
Francisco Guinea-Recent advances in graphene researchFundación Ramón Areces
 
RFP_2016_Zhenjie_CEN
RFP_2016_Zhenjie_CENRFP_2016_Zhenjie_CEN
RFP_2016_Zhenjie_CENZhenjie Cen
 
Multispectral Purkinje Imaging
 Multispectral Purkinje Imaging Multispectral Purkinje Imaging
Multispectral Purkinje ImagingPetteriTeikariPhD
 
Multimodal RGB-D+RF-based sensing for human movement analysis
Multimodal RGB-D+RF-based sensing for human movement analysisMultimodal RGB-D+RF-based sensing for human movement analysis
Multimodal RGB-D+RF-based sensing for human movement analysisPetteriTeikariPhD
 
Project - Sound Model Similarity Search
Project - Sound Model Similarity SearchProject - Sound Model Similarity Search
Project - Sound Model Similarity SearchSudarshan Bala
 

Mais procurados (13)

fNIRS data analysis
fNIRS data analysisfNIRS data analysis
fNIRS data analysis
 
極紫外線散射儀於先進製程檢測應用
極紫外線散射儀於先進製程檢測應用極紫外線散射儀於先進製程檢測應用
極紫外線散射儀於先進製程檢測應用
 
Real Time Human Posture Detection with Multiple Depth Sensors
Real Time Human Posture Detection with Multiple Depth SensorsReal Time Human Posture Detection with Multiple Depth Sensors
Real Time Human Posture Detection with Multiple Depth Sensors
 
Chen Liu_Laser scanning damage detection - Copy
Chen Liu_Laser scanning damage detection - CopyChen Liu_Laser scanning damage detection - Copy
Chen Liu_Laser scanning damage detection - Copy
 
CERN: Machine Protection Systems
CERN: Machine Protection SystemsCERN: Machine Protection Systems
CERN: Machine Protection Systems
 
Panoramic Video in Environmental Monitoring Software Development and Applica...
Panoramic Video in Environmental Monitoring Software Development and Applica...Panoramic Video in Environmental Monitoring Software Development and Applica...
Panoramic Video in Environmental Monitoring Software Development and Applica...
 
Using Photonics to Prototype the Research Campus Infrastructure of the Future...
Using Photonics to Prototype the Research Campus Infrastructure of the Future...Using Photonics to Prototype the Research Campus Infrastructure of the Future...
Using Photonics to Prototype the Research Campus Infrastructure of the Future...
 
Francisco Guinea-Recent advances in graphene research
Francisco Guinea-Recent advances in graphene researchFrancisco Guinea-Recent advances in graphene research
Francisco Guinea-Recent advances in graphene research
 
RFP_2016_Zhenjie_CEN
RFP_2016_Zhenjie_CENRFP_2016_Zhenjie_CEN
RFP_2016_Zhenjie_CEN
 
Twnss 1-jsg-yano
Twnss 1-jsg-yanoTwnss 1-jsg-yano
Twnss 1-jsg-yano
 
Multispectral Purkinje Imaging
 Multispectral Purkinje Imaging Multispectral Purkinje Imaging
Multispectral Purkinje Imaging
 
Multimodal RGB-D+RF-based sensing for human movement analysis
Multimodal RGB-D+RF-based sensing for human movement analysisMultimodal RGB-D+RF-based sensing for human movement analysis
Multimodal RGB-D+RF-based sensing for human movement analysis
 
Project - Sound Model Similarity Search
Project - Sound Model Similarity SearchProject - Sound Model Similarity Search
Project - Sound Model Similarity Search
 

Destaque

IAP09 CUDA@MIT 6.963 - Lecture 01: GPU Computing using CUDA (David Luebke, NV...
IAP09 CUDA@MIT 6.963 - Lecture 01: GPU Computing using CUDA (David Luebke, NV...IAP09 CUDA@MIT 6.963 - Lecture 01: GPU Computing using CUDA (David Luebke, NV...
IAP09 CUDA@MIT 6.963 - Lecture 01: GPU Computing using CUDA (David Luebke, NV...npinto
 
IAP09 CUDA@MIT 6.963 - Lecture 04: CUDA Advanced #1 (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 04: CUDA Advanced #1 (Nicolas Pinto, MIT)IAP09 CUDA@MIT 6.963 - Lecture 04: CUDA Advanced #1 (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 04: CUDA Advanced #1 (Nicolas Pinto, MIT)npinto
 
IAP09 CUDA@MIT 6.963 - Lecture 03: CUDA Basics #2 (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 03: CUDA Basics #2 (Nicolas Pinto, MIT)IAP09 CUDA@MIT 6.963 - Lecture 03: CUDA Basics #2 (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 03: CUDA Basics #2 (Nicolas Pinto, MIT)npinto
 
IAP09 CUDA@MIT 6.963 - Lecture 07: CUDA Advanced #2 (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 07: CUDA Advanced #2 (Nicolas Pinto, MIT)IAP09 CUDA@MIT 6.963 - Lecture 07: CUDA Advanced #2 (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 07: CUDA Advanced #2 (Nicolas Pinto, MIT)npinto
 
IAP09 CUDA@MIT 6.963 - Lecture 01: Logistics (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 01: Logistics (Nicolas Pinto, MIT)IAP09 CUDA@MIT 6.963 - Lecture 01: Logistics (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 01: Logistics (Nicolas Pinto, MIT)npinto
 

Destaque (6)

IAP09 CUDA@MIT 6.963 - Lecture 01: GPU Computing using CUDA (David Luebke, NV...
IAP09 CUDA@MIT 6.963 - Lecture 01: GPU Computing using CUDA (David Luebke, NV...IAP09 CUDA@MIT 6.963 - Lecture 01: GPU Computing using CUDA (David Luebke, NV...
IAP09 CUDA@MIT 6.963 - Lecture 01: GPU Computing using CUDA (David Luebke, NV...
 
A Survey of Compressed GPU-based Direct Volume Rendering
A Survey of Compressed GPU-based Direct Volume RenderingA Survey of Compressed GPU-based Direct Volume Rendering
A Survey of Compressed GPU-based Direct Volume Rendering
 
IAP09 CUDA@MIT 6.963 - Lecture 04: CUDA Advanced #1 (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 04: CUDA Advanced #1 (Nicolas Pinto, MIT)IAP09 CUDA@MIT 6.963 - Lecture 04: CUDA Advanced #1 (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 04: CUDA Advanced #1 (Nicolas Pinto, MIT)
 
IAP09 CUDA@MIT 6.963 - Lecture 03: CUDA Basics #2 (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 03: CUDA Basics #2 (Nicolas Pinto, MIT)IAP09 CUDA@MIT 6.963 - Lecture 03: CUDA Basics #2 (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 03: CUDA Basics #2 (Nicolas Pinto, MIT)
 
IAP09 CUDA@MIT 6.963 - Lecture 07: CUDA Advanced #2 (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 07: CUDA Advanced #2 (Nicolas Pinto, MIT)IAP09 CUDA@MIT 6.963 - Lecture 07: CUDA Advanced #2 (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 07: CUDA Advanced #2 (Nicolas Pinto, MIT)
 
IAP09 CUDA@MIT 6.963 - Lecture 01: Logistics (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 01: Logistics (Nicolas Pinto, MIT)IAP09 CUDA@MIT 6.963 - Lecture 01: Logistics (Nicolas Pinto, MIT)
IAP09 CUDA@MIT 6.963 - Lecture 01: Logistics (Nicolas Pinto, MIT)
 

Semelhante a High-Throughput Scientific Computing for Brain Mapping and Radio Astronomy

ieee nss mic 2016 poster N30-21
ieee nss mic 2016 poster N30-21ieee nss mic 2016 poster N30-21
ieee nss mic 2016 poster N30-21Dae Woon Kim
 
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...Yan Xu
 
J Fernandes Hst 2009
J Fernandes Hst 2009J Fernandes Hst 2009
J Fernandes Hst 2009jlfernandes85
 
ApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTRApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTRLucaCinquini
 
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)Matthew O'Toole
 
SIGGRAPH 2018 - Full Rays Ahead! From Raster to Real-Time Raytracing
SIGGRAPH 2018 - Full Rays Ahead! From Raster to Real-Time RaytracingSIGGRAPH 2018 - Full Rays Ahead! From Raster to Real-Time Raytracing
SIGGRAPH 2018 - Full Rays Ahead! From Raster to Real-Time RaytracingElectronic Arts / DICE
 
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ..."Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...Edge AI and Vision Alliance
 
Ieee 2016 nss mic poster N30-21
Ieee 2016 nss mic poster N30-21Ieee 2016 nss mic poster N30-21
Ieee 2016 nss mic poster N30-21Dae Woon Kim
 
Efficient architecture to condensate visual information driven by attention ...
Efficient architecture to condensate visual information driven by attention ...Efficient architecture to condensate visual information driven by attention ...
Efficient architecture to condensate visual information driven by attention ...Sara Granados Cabeza
 
Integrated Detector Electronics (IDEAS) ASIC product update
Integrated Detector Electronics (IDEAS) ASIC product updateIntegrated Detector Electronics (IDEAS) ASIC product update
Integrated Detector Electronics (IDEAS) ASIC product updateGunnar Maehlum
 
Object Detection Beyond Mask R-CNN and RetinaNet I
Object Detection Beyond Mask R-CNN and RetinaNet IObject Detection Beyond Mask R-CNN and RetinaNet I
Object Detection Beyond Mask R-CNN and RetinaNet IWanjin Yu
 
Combining remote sensing earth observations and in situ networks: detection o...
Combining remote sensing earth observations and in situ networks: detection o...Combining remote sensing earth observations and in situ networks: detection o...
Combining remote sensing earth observations and in situ networks: detection o...Integrated Carbon Observation System (ICOS)
 
Nuc chemsummerschool 072106-v2
Nuc chemsummerschool 072106-v2Nuc chemsummerschool 072106-v2
Nuc chemsummerschool 072106-v2Dean Quezon
 
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...Sergey Karayev
 
Sif Project
Sif ProjectSif Project
Sif Projectpathapks
 

Semelhante a High-Throughput Scientific Computing for Brain Mapping and Radio Astronomy (20)

Defying Nyquist in Analog to Digital Conversion
Defying Nyquist in Analog to Digital ConversionDefying Nyquist in Analog to Digital Conversion
Defying Nyquist in Analog to Digital Conversion
 
ieee nss mic 2016 poster N30-21
ieee nss mic 2016 poster N30-21ieee nss mic 2016 poster N30-21
ieee nss mic 2016 poster N30-21
 
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
 
J Fernandes Hst 2009
J Fernandes Hst 2009J Fernandes Hst 2009
J Fernandes Hst 2009
 
ApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTRApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTR
 
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
 
SIGGRAPH 2018 - Full Rays Ahead! From Raster to Real-Time Raytracing
SIGGRAPH 2018 - Full Rays Ahead! From Raster to Real-Time RaytracingSIGGRAPH 2018 - Full Rays Ahead! From Raster to Real-Time Raytracing
SIGGRAPH 2018 - Full Rays Ahead! From Raster to Real-Time Raytracing
 
9553477.ppt
9553477.ppt9553477.ppt
9553477.ppt
 
DefenseTalk_Trimmed
DefenseTalk_TrimmedDefenseTalk_Trimmed
DefenseTalk_Trimmed
 
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ..."Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
 
Ieee 2016 nss mic poster N30-21
Ieee 2016 nss mic poster N30-21Ieee 2016 nss mic poster N30-21
Ieee 2016 nss mic poster N30-21
 
Efficient architecture to condensate visual information driven by attention ...
Efficient architecture to condensate visual information driven by attention ...Efficient architecture to condensate visual information driven by attention ...
Efficient architecture to condensate visual information driven by attention ...
 
Integrated Detector Electronics (IDEAS) ASIC product update
Integrated Detector Electronics (IDEAS) ASIC product updateIntegrated Detector Electronics (IDEAS) ASIC product update
Integrated Detector Electronics (IDEAS) ASIC product update
 
Object Detection Beyond Mask R-CNN and RetinaNet I
Object Detection Beyond Mask R-CNN and RetinaNet IObject Detection Beyond Mask R-CNN and RetinaNet I
Object Detection Beyond Mask R-CNN and RetinaNet I
 
Combining remote sensing earth observations and in situ networks: detection o...
Combining remote sensing earth observations and in situ networks: detection o...Combining remote sensing earth observations and in situ networks: detection o...
Combining remote sensing earth observations and in situ networks: detection o...
 
23AFMC_Beamer.pdf
23AFMC_Beamer.pdf23AFMC_Beamer.pdf
23AFMC_Beamer.pdf
 
Nuc chemsummerschool 072106-v2
Nuc chemsummerschool 072106-v2Nuc chemsummerschool 072106-v2
Nuc chemsummerschool 072106-v2
 
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
 
Loca2005
Loca2005Loca2005
Loca2005
 
Sif Project
Sif ProjectSif Project
Sif Project
 

Mais de npinto

"AI" for Blockchain Security (Case Study: Cosmos)
"AI" for Blockchain Security (Case Study: Cosmos)"AI" for Blockchain Security (Case Study: Cosmos)
"AI" for Blockchain Security (Case Study: Cosmos)npinto
 
High-Performance Computing Needs Machine Learning... And Vice Versa (NIPS 201...
High-Performance Computing Needs Machine Learning... And Vice Versa (NIPS 201...High-Performance Computing Needs Machine Learning... And Vice Versa (NIPS 201...
High-Performance Computing Needs Machine Learning... And Vice Versa (NIPS 201...npinto
 
[Harvard CS264] 16 - Managing Dynamic Parallelism on GPUs: A Case Study of Hi...
[Harvard CS264] 16 - Managing Dynamic Parallelism on GPUs: A Case Study of Hi...[Harvard CS264] 16 - Managing Dynamic Parallelism on GPUs: A Case Study of Hi...
[Harvard CS264] 16 - Managing Dynamic Parallelism on GPUs: A Case Study of Hi...npinto
 
[Harvard CS264] 15a - The Onset of Parallelism, Changes in Computer Architect...
[Harvard CS264] 15a - The Onset of Parallelism, Changes in Computer Architect...[Harvard CS264] 15a - The Onset of Parallelism, Changes in Computer Architect...
[Harvard CS264] 15a - The Onset of Parallelism, Changes in Computer Architect...npinto
 
[Harvard CS264] 15a - Jacket: Visual Computing (James Malcolm, Accelereyes)
[Harvard CS264] 15a - Jacket: Visual Computing (James Malcolm, Accelereyes)[Harvard CS264] 15a - Jacket: Visual Computing (James Malcolm, Accelereyes)
[Harvard CS264] 15a - Jacket: Visual Computing (James Malcolm, Accelereyes)npinto
 
[Harvard CS264] 14 - Dynamic Compilation for Massively Parallel Processors (G...
[Harvard CS264] 14 - Dynamic Compilation for Massively Parallel Processors (G...[Harvard CS264] 14 - Dynamic Compilation for Massively Parallel Processors (G...
[Harvard CS264] 14 - Dynamic Compilation for Massively Parallel Processors (G...npinto
 
[Harvard CS264] 13 - The R-Stream High-Level Program Transformation Tool / Pr...
[Harvard CS264] 13 - The R-Stream High-Level Program Transformation Tool / Pr...[Harvard CS264] 13 - The R-Stream High-Level Program Transformation Tool / Pr...
[Harvard CS264] 13 - The R-Stream High-Level Program Transformation Tool / Pr...npinto
 
[Harvard CS264] 12 - Irregular Parallelism on the GPU: Algorithms and Data St...
[Harvard CS264] 12 - Irregular Parallelism on the GPU: Algorithms and Data St...[Harvard CS264] 12 - Irregular Parallelism on the GPU: Algorithms and Data St...
[Harvard CS264] 12 - Irregular Parallelism on the GPU: Algorithms and Data St...npinto
 
[Harvard CS264] 11b - Analysis-Driven Performance Optimization with CUDA (Cli...
[Harvard CS264] 11b - Analysis-Driven Performance Optimization with CUDA (Cli...[Harvard CS264] 11b - Analysis-Driven Performance Optimization with CUDA (Cli...
[Harvard CS264] 11b - Analysis-Driven Performance Optimization with CUDA (Cli...npinto
 
[Harvard CS264] 11a - Programming the Memory Hierarchy with Sequoia (Mike Bau...
[Harvard CS264] 11a - Programming the Memory Hierarchy with Sequoia (Mike Bau...[Harvard CS264] 11a - Programming the Memory Hierarchy with Sequoia (Mike Bau...
[Harvard CS264] 11a - Programming the Memory Hierarchy with Sequoia (Mike Bau...npinto
 
[Harvard CS264] 10b - cl.oquence: High-Level Language Abstractions for Low-Le...
[Harvard CS264] 10b - cl.oquence: High-Level Language Abstractions for Low-Le...[Harvard CS264] 10b - cl.oquence: High-Level Language Abstractions for Low-Le...
[Harvard CS264] 10b - cl.oquence: High-Level Language Abstractions for Low-Le...npinto
 
[Harvard CS264] 10a - Easy, Effective, Efficient: GPU Programming in Python w...
[Harvard CS264] 10a - Easy, Effective, Efficient: GPU Programming in Python w...[Harvard CS264] 10a - Easy, Effective, Efficient: GPU Programming in Python w...
[Harvard CS264] 10a - Easy, Effective, Efficient: GPU Programming in Python w...npinto
 
[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...
[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...
[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...npinto
 
[Harvard CS264] 08a - Cloud Computing, Amazon EC2, MIT StarCluster (Justin Ri...
[Harvard CS264] 08a - Cloud Computing, Amazon EC2, MIT StarCluster (Justin Ri...[Harvard CS264] 08a - Cloud Computing, Amazon EC2, MIT StarCluster (Justin Ri...
[Harvard CS264] 08a - Cloud Computing, Amazon EC2, MIT StarCluster (Justin Ri...npinto
 
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)npinto
 
[Harvard CS264] 07 - GPU Cluster Programming (MPI & ZeroMQ)
[Harvard CS264] 07 - GPU Cluster Programming (MPI & ZeroMQ)[Harvard CS264] 07 - GPU Cluster Programming (MPI & ZeroMQ)
[Harvard CS264] 07 - GPU Cluster Programming (MPI & ZeroMQ)npinto
 
[Harvard CS264] 06 - CUDA Ninja Tricks: GPU Scripting, Meta-programming & Aut...
[Harvard CS264] 06 - CUDA Ninja Tricks: GPU Scripting, Meta-programming & Aut...[Harvard CS264] 06 - CUDA Ninja Tricks: GPU Scripting, Meta-programming & Aut...
[Harvard CS264] 06 - CUDA Ninja Tricks: GPU Scripting, Meta-programming & Aut...npinto
 
[Harvard CS264] 05 - Advanced-level CUDA Programming
[Harvard CS264] 05 - Advanced-level CUDA Programming[Harvard CS264] 05 - Advanced-level CUDA Programming
[Harvard CS264] 05 - Advanced-level CUDA Programmingnpinto
 
[Harvard CS264] 04 - Intermediate-level CUDA Programming
[Harvard CS264] 04 - Intermediate-level CUDA Programming[Harvard CS264] 04 - Intermediate-level CUDA Programming
[Harvard CS264] 04 - Intermediate-level CUDA Programmingnpinto
 
[Harvard CS264] 03 - Introduction to GPU Computing, CUDA Basics
[Harvard CS264] 03 - Introduction to GPU Computing, CUDA Basics[Harvard CS264] 03 - Introduction to GPU Computing, CUDA Basics
[Harvard CS264] 03 - Introduction to GPU Computing, CUDA Basicsnpinto
 

Mais de npinto (20)

"AI" for Blockchain Security (Case Study: Cosmos)
"AI" for Blockchain Security (Case Study: Cosmos)"AI" for Blockchain Security (Case Study: Cosmos)
"AI" for Blockchain Security (Case Study: Cosmos)
 
High-Performance Computing Needs Machine Learning... And Vice Versa (NIPS 201...
High-Performance Computing Needs Machine Learning... And Vice Versa (NIPS 201...High-Performance Computing Needs Machine Learning... And Vice Versa (NIPS 201...
High-Performance Computing Needs Machine Learning... And Vice Versa (NIPS 201...
 
[Harvard CS264] 16 - Managing Dynamic Parallelism on GPUs: A Case Study of Hi...
[Harvard CS264] 16 - Managing Dynamic Parallelism on GPUs: A Case Study of Hi...[Harvard CS264] 16 - Managing Dynamic Parallelism on GPUs: A Case Study of Hi...
[Harvard CS264] 16 - Managing Dynamic Parallelism on GPUs: A Case Study of Hi...
 
[Harvard CS264] 15a - The Onset of Parallelism, Changes in Computer Architect...
[Harvard CS264] 15a - The Onset of Parallelism, Changes in Computer Architect...[Harvard CS264] 15a - The Onset of Parallelism, Changes in Computer Architect...
[Harvard CS264] 15a - The Onset of Parallelism, Changes in Computer Architect...
 
[Harvard CS264] 15a - Jacket: Visual Computing (James Malcolm, Accelereyes)
[Harvard CS264] 15a - Jacket: Visual Computing (James Malcolm, Accelereyes)[Harvard CS264] 15a - Jacket: Visual Computing (James Malcolm, Accelereyes)
[Harvard CS264] 15a - Jacket: Visual Computing (James Malcolm, Accelereyes)
 
[Harvard CS264] 14 - Dynamic Compilation for Massively Parallel Processors (G...
[Harvard CS264] 14 - Dynamic Compilation for Massively Parallel Processors (G...[Harvard CS264] 14 - Dynamic Compilation for Massively Parallel Processors (G...
[Harvard CS264] 14 - Dynamic Compilation for Massively Parallel Processors (G...
 
[Harvard CS264] 13 - The R-Stream High-Level Program Transformation Tool / Pr...
[Harvard CS264] 13 - The R-Stream High-Level Program Transformation Tool / Pr...[Harvard CS264] 13 - The R-Stream High-Level Program Transformation Tool / Pr...
[Harvard CS264] 13 - The R-Stream High-Level Program Transformation Tool / Pr...
 
[Harvard CS264] 12 - Irregular Parallelism on the GPU: Algorithms and Data St...
[Harvard CS264] 12 - Irregular Parallelism on the GPU: Algorithms and Data St...[Harvard CS264] 12 - Irregular Parallelism on the GPU: Algorithms and Data St...
[Harvard CS264] 12 - Irregular Parallelism on the GPU: Algorithms and Data St...
 
[Harvard CS264] 11b - Analysis-Driven Performance Optimization with CUDA (Cli...
[Harvard CS264] 11b - Analysis-Driven Performance Optimization with CUDA (Cli...[Harvard CS264] 11b - Analysis-Driven Performance Optimization with CUDA (Cli...
[Harvard CS264] 11b - Analysis-Driven Performance Optimization with CUDA (Cli...
 
[Harvard CS264] 11a - Programming the Memory Hierarchy with Sequoia (Mike Bau...
[Harvard CS264] 11a - Programming the Memory Hierarchy with Sequoia (Mike Bau...[Harvard CS264] 11a - Programming the Memory Hierarchy with Sequoia (Mike Bau...
[Harvard CS264] 11a - Programming the Memory Hierarchy with Sequoia (Mike Bau...
 
[Harvard CS264] 10b - cl.oquence: High-Level Language Abstractions for Low-Le...
[Harvard CS264] 10b - cl.oquence: High-Level Language Abstractions for Low-Le...[Harvard CS264] 10b - cl.oquence: High-Level Language Abstractions for Low-Le...
[Harvard CS264] 10b - cl.oquence: High-Level Language Abstractions for Low-Le...
 
[Harvard CS264] 10a - Easy, Effective, Efficient: GPU Programming in Python w...
[Harvard CS264] 10a - Easy, Effective, Efficient: GPU Programming in Python w...[Harvard CS264] 10a - Easy, Effective, Efficient: GPU Programming in Python w...
[Harvard CS264] 10a - Easy, Effective, Efficient: GPU Programming in Python w...
 
[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...
[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...
[Harvard CS264] 09 - Machine Learning on Big Data: Lessons Learned from Googl...
 
[Harvard CS264] 08a - Cloud Computing, Amazon EC2, MIT StarCluster (Justin Ri...
[Harvard CS264] 08a - Cloud Computing, Amazon EC2, MIT StarCluster (Justin Ri...[Harvard CS264] 08a - Cloud Computing, Amazon EC2, MIT StarCluster (Justin Ri...
[Harvard CS264] 08a - Cloud Computing, Amazon EC2, MIT StarCluster (Justin Ri...
 
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
 
[Harvard CS264] 07 - GPU Cluster Programming (MPI & ZeroMQ)
[Harvard CS264] 07 - GPU Cluster Programming (MPI & ZeroMQ)[Harvard CS264] 07 - GPU Cluster Programming (MPI & ZeroMQ)
[Harvard CS264] 07 - GPU Cluster Programming (MPI & ZeroMQ)
 
[Harvard CS264] 06 - CUDA Ninja Tricks: GPU Scripting, Meta-programming & Aut...
[Harvard CS264] 06 - CUDA Ninja Tricks: GPU Scripting, Meta-programming & Aut...[Harvard CS264] 06 - CUDA Ninja Tricks: GPU Scripting, Meta-programming & Aut...
[Harvard CS264] 06 - CUDA Ninja Tricks: GPU Scripting, Meta-programming & Aut...
 
[Harvard CS264] 05 - Advanced-level CUDA Programming
[Harvard CS264] 05 - Advanced-level CUDA Programming[Harvard CS264] 05 - Advanced-level CUDA Programming
[Harvard CS264] 05 - Advanced-level CUDA Programming
 
[Harvard CS264] 04 - Intermediate-level CUDA Programming
[Harvard CS264] 04 - Intermediate-level CUDA Programming[Harvard CS264] 04 - Intermediate-level CUDA Programming
[Harvard CS264] 04 - Intermediate-level CUDA Programming
 
[Harvard CS264] 03 - Introduction to GPU Computing, CUDA Basics
[Harvard CS264] 03 - Introduction to GPU Computing, CUDA Basics[Harvard CS264] 03 - Introduction to GPU Computing, CUDA Basics
[Harvard CS264] 03 - Introduction to GPU Computing, CUDA Basics
 

Último

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...RKavithamani
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 

Último (20)

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 

High-Throughput Scientific Computing for Brain Mapping and Radio Astronomy

  • 1. High-Throughput Scientific Computing Hanspeter Pfister pfister@seas.harvard.edu
  • 2. Themes • How is the brain wired? • How did the Universe start?
  • 3. How is the brain wired? The Connectome Project
  • 4. Connectome Team • Harvard Center for Brain Science – Jeff Lichtman & Clay Reid • Microsoft Research / UW – Michael Cohen • Kitware Inc. – Will Schroeder, Charles Law, Rusty Blue • VRVis Vienna – Markus Hadwiger, Johanna Beyer • IIC – Amelio Vazquez, Eric Miller (Tufts) – Won-Ki Seung, Hanspeter Pfister
  • 5. The Scientific Challenge composite from Roe et al. 1989, Sutton and Brunso-Bechtold 1991
  • 6. Confocal Microscopy: Brainbow Adapted from OlympusConfocal.com
  • 8. Serial Sectioning y x z . . . Section i, i (1, …,N) Adapted from http://parasol.tamu.edu Texas A&M University
  • 9. 40,000x40,000 pixels 1.6 GB 120x120 µm (3 nm/pixel) Here shown 40x undersampled 6 15mu EM big view
  • 13. 2 300 nm rlp
  • 14. The Data Challenge • 1 mm3 ~= mouse thalamus ~= 1 petabyte • 1 cm ~= mouse brain ~= 1 exabyte 3 • 1000 cm ~= human brain ~= 1 zettabyte 3 All of Google’s world-wide storage today ~= 1 exabyte
  • 15. Addressing the Data Challenge • Multi-Scale Imaging • Hierarchical Data Representation • Distributed Heterogeneous Computing • Visualization • Segmentation • Analysis
  • 16. Addressing the Data Challenge • Multi-Scale Imaging • Hierarchical Data Representation • Distributed Heterogeneous Computing • Visualization • Segmentation • Analysis
  • 17. Direct Volume Rendering MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 18. Ray Casting • Image-order ray shooting • Interpolate • Assign color & opacity • Composite • Simple to implement • Very flexible (adaptive sampling, …) • Correct perspective MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 19. Transfer Functions • Mapping of density to optical properties • Simplest: color table with opacity over density MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 20. Connectome: EM Data MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 21. Single-Pass Ray Casting • Enabled by conditional loops • Substitute multiple passes with single loop and early loop exit • Volume rendering example in NVIDIA CUDA SDK (procedural ray setup) MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 22. Basic Ray Setup / Termination • Two main approaches: • Procedural ray/box intersection [Röttger et al., 2003], [Green, 2004] • Rasterize bounding box [Krüger and Westermann, 2003] MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 23. Procedural Ray Setup / Term. • Procedural ray / box intersection • Everything handled in fragment shader • Ray given by camera position and volume entry position • Exit criterion needed • Pro: simple and self-contained • Con: full load on fragment shader MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 24. quot;Image-Basedquot; Ray Setup / Term. • Rasterize bounding box front faces and back faces • Ray start positions: front faces • Direction vectors: back faces − front faces - = • Independent of projection (orthogonal/perspective) MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 25. Kernel • Image-based ray setup • Ray start image • Direction image • Ray-cast loop • Sample volume • Accumulate color and opacity • Terminate • Store output MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 26. Standard Ray Casting Optim. (1) Early ray termination • Isosurfaces: stop when surface hit • Direct volume rendering: stop when opacity >= threshold • Several possibilities • Current GPUs: early loop exit works well MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 27. Standard Ray Casting Optim. (2) Empty space skipping • Skip transparent samples • Depends on transfer function • Start casting close to first hit • Several possibilities • Per-sample check of opacity (expensive) • Hierarchical data store (e.g., octree with stack-less traversal [Gobbetti et al., 2008] ) • These are image-order: what about object-order? MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 28. Object-Order Empty Space Skip. (1) • Modify initial rasterization step rasterize bounding box rasterize “tightquot; bounding geometry MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 29. Object-Order Empty Space Skip. (2) • Store min-max values of volume blocks • Cull blocks against transfer function or isovalue • Rasterize front and back faces of active blocks MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 30. Connectome: Fluorescence Data MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 31. Connectome: Implicit Surfaces MARKUS HADWIGER, VRVIS RESEARCH CENTER, VIENNA, AUSTRIA
  • 32. Addressing the Data Challenge • Multi-Scale Imaging • Hierarchical Data Representation • Distributed Heterogeneous Computing • Visualization • Segmentation • Analysis
  • 33. Active Ribbons Active Ribbon: A set of two non-intersecting and coupled Active Contours Active Contour: Deformable closed curve that can be used to segment objects in an image Outer Active Inner Active Contour Contour Active Ribbon
  • 37. How did the Universe start? The MWA Project Kevin Dale, Richard Edgar, Daniel Mitchell, Randall Wayth, Lincoln Greenhill, and Hanspeter Pfister
  • 38. MWA CfA / IIC Team • Harvard Center for Astrophysics / Smithsonian Astrophysical Observatory – Lincoln Greenhill – Daniel Mitchell – Randall Wayth – Stephen Ord • IIC / SEAS – Richard Edgar – Kevin Dale, Hanspeter Pfister
  • 39. The Scientific Goals • Epoch of Re- ionized Inonisation (EOR) neutral • Heliospheric and (H) The “Gap” Ionospheric • Transient detection • Pulsars, Surveys, Interstellar Medium, ionized Galactic Magnetic Field, …
  • 40. The Murchison Widefield Array (MWA) • Located in the remote Australian outback • Extremely wide fields of view for radio astronomy in the 80-300 MHz band • 512 tiles, each a 4x4 array of dipoles, scattered over ~ 1.5 km • Data center for real-time processing co-located with the array http://www.haystack.mit.edu/ast/arrays/mwa/index.html
  • 41. © Murchison Wide-field Array Project (MIT/Harvard/Smithsonian/ANU/Curtin U./U.Melb./UWA/CSIRO)
  • 42. © Murchison Wide-field Array Project (MIT/Harvard/Smithsonian/ANU/Curtin U./U.Melb./UWA/CSIRO)
  • 43.
  • 44. Calibration Ionospheric offsets Ungridded visibilities with bright sources peeled Imaging
  • 45. The Data Rate Challenge ent v. p ang arall led Calibration Loop el c om put atio n FFT Averaging ( !) Gridding Vector Rotation Mapping (1) GB/s 16 GB/s Science 8s cadence 0.5s cadence
  • 46. Implementation • Hardware • 2.4 GHz dual-core AMD Opteron, 4GB RAM • NVIDIA Quadro FX 5600 • Software • AMD Core Math Library (ACML) • NVIDIA CUDA (CUBLAS, CUFFT) • OpenGL
  • 47. Single-GPU Speedup CPUGPU speedup Image Formation Imaging Mostly OpenGL Gridding * UnpeelTileResponse Calibration Loop PeelTileResponse ReRotateVisibilities MeasureTileResponse MeasureIonosphericOffset RotateAndAccumulateVisibilities 0 10 20 30 40 50 60 70
  • 48. Example Results • Noisy images from test data GPU Reference
  • 49. Scaling to a Cluster • 1000 frequency channels, 65 sources every 8 seconds, and 16002 output image • 20-40 frequencies / GPU • 32-64 GPUs, i.e., 16 Tesla S1070s • Need MPI for internal data transfer
  • 50. Conclusions • GPUs enable high-throughput scientific computing • Performance gains of 10-100x • CUDA makes life easier (but not perfect) • Rasterization / OpenGL still useful • Need CUDA MPI for clusters