SlideShare a Scribd company logo
1 of 20
Download to read offline
HPC + D + A = HPDA?
Barry Bolding, PhD
SVP & Chief Strategy Officer
Cray Inc.
bbolding@cray.com
1
Era of Mobile
Age of Context
https://medium.com/crossing-the-pond/into-the-
age-of-context-f0aed15171d7
Age of Context - Robert Scoble and Shel Israel
2
Context can be powerful, but…
Age of Context
Contextual
interpolation
Age of
Insight
Insightful
extrapolation
3
Cray’s Vision:
The Fusion of Supercomputing and Big & Fast Data
Modeling The World
Cray Supercomputers solving “grand challenges” in science, engineering and analytics
Compute Store Analyze
Data-
Intensive
Processing
High throughput event
processing & data
capture from sensors,
data feeds and
instruments
Math Models
Modeling and
simulation augmented
with data to provide
the highest fidelity
virtual reality results
Data Models
Integration of datasets
and math models for
search, analysis,
predictive modeling
and knowledge
discovery
4
Cray’s Unique Strengths in Supercomputing +
Analytics
5
Productivity
based ROI
Customer
Community
Systems by
Design
The Cray community of
customers and partners are
the most demanding and
enabling. Allows Cray to
push the boundaries of
scalable systems.
We design our systems
for production
computing and
analytics at scale.
Cray takes the productive
applications view of ROI:
• Match system designs to the
applications
• Enable highly productive
programming environments.
• Build solutions that are
flexible and upgradeable.
• Lower TCO than commodity
clusters and cloud for data-
intensive applications
The Alan Turing Institute
● UK national institute for data sciences
● to break new boundaries in how we use big
data in a fast moving, competitive world
● Joint venture: founder Universities and
EPSRC
● Cambridge, Edinburgh, Oxford, UCL, Warwick
● Launch partners
● Lloyds Register Foundation, GCHQ and Cray
● UK Government investment in “Eight
Great Technologies”
● https://turing.ac.uk/
6
7
“We don’t just want insights, we
want actionable insights.”
Peter Grindrod, University of Oxford
Human Genome Assembly – Under 9 Minutes on
“Edison”
● Cray XC30 “Edison” reduces the assembly
time of a complete human genome to 8.4
minutes
● A UPC modified version of the Meraculous
code (Hip-Mer) was 170x faster.
● Eliminating the analytics back-log and
fueling the drive towards precision
medicine
● Cray’s CX30 with Aries interconnect
coupled with high memory bandwidth and
memory on each node provided the I/O
bandwidth required for genome assembly
work
The end-to-end scaling of the team’s HipMer genome assembler approach
showing the human genome scalability on Edison on the left and the more
complex wheat genome on the right (both axes are in log scale).
References: http://www.theplatform.net/2015/08/20/supercomputer-force-knocks-human-genome-assembly-under-9-minutes/
http://gauss.cs.ucsb.edu/~aydin/sc15_genome.pdf
8
9
“Using our HipMer technology
enables for the first time assembly
throughput to exceed the capability
of all the world's sequencers”
HipMer: an Etreme-Scale DeNovo Genome Assembler
Georganas, Buluc, Chapman, Hofmeyr, Aluru, Egan, Oliker,
Rokshar and Yelick
Magnus (CX30) Whitefly study at IVEC
10
● Dr. Laura Boykin
● funded by the Gates
Foundation and a TED
Fellowship.
● Awarded time on “Magnus,”
Her team is marrying
genomics, supercomputing
and evolutionary history to
help African farmers
develop management
strategies and breed pest-
resistant cropsThey are battling a species complex
of at least 34 morphologically
indistinguishable species.
11
“It’s a massive problem. I’m one of
15 principal investigators working
on a new project whose mission is
to give farmers a cassava plant
that’s resistant to the viruses and
the whiteflies.”
Dr. Laura Boykin
Copyright 2015 Cray Inc.
Protein Folding – Mixed Simulation and Analytics
● Focuses on how protein folding happens
● Model possible paths to folded end-state
● Temporal resolution matters, but drives data size
Markov State
model of a fold
Model
Data
geometry
biology
physics
variables
of interest
to create
the model
to drive
the simulation
to compare with
simulation results
input
conditions
solution
process
solved variables
of interest
Simulation
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3673555/pdf/nihms428070.pdf
Curr Opin Struct Biol. 2013 February ; 23(1): 58–65. doi:10.1016/j.sbi.2012.11.002.
Copyright 2015 Cray Inc.
Protein Folding – Mixed Simulation and Analytics
Model
Data
geometry
biology
physics
variables
of interest
to create
the model
to drive
the simulation
to compare with
simulation results
input
conditions
solution
process
solved variables
of interest
Simulation
Automated classification in protein databases
(a) (b) (c)
Figure 1: One conformation of the villin HP-35 protein (a); part of its distance matrix using only its backbone atoms in
the conformation (b); and three eigenvectors and the associated eigenvalues capturing and synthesizing the conformation
geometry (c).
III. METHODOLOGY
Our method involves three steps. Given a trajectory
composed of frames generated in a distributed fashion
and containing up to 400 consecutively folding protein
conformations, our method first extracts relevant features
from each conformation into alocal metadata representation.
In our case, the relevant feature is the local knowledge
on the geometric conformation of the protein structure.
Second, the method maps each conformation to either a
meta-stable or a transition stage using the local knowledge
on the conformational features of the structures in the frame
only. Finally, the method rebuilds the global knowledge
(i.e., stages and conformation space explored) of the given
trajectories through a reduction operation.
A. Extraction of conformational features
Our method extracts the shape of each single folding
conformation into a metadata representation that preserves
therelevant information contained in thedatabut reduces the
overall data size. Rather than working with the atom coordi-
nates of the complete protein molecule, our method first rep-
resents each protein conformation using an N ⇤N distance
matrix (DM), where N is the number of backbone carbon
atoms in the protein (i.e., the alpha carbon of each amino
acid). More specifically, the matrix contains the distance
from each backbone atom to the other backbone atoms in the
same protein conformation. Figure 1(a) shows an example
of a folding conformation for the protein HP-35 NleNle
(i.e., a variant of the villin headpiece subdomain) taken
from folding trajectories generated by Folding@Home [9].
Figure 1(b) shows part of the corresponding distance matrix
representing the same conformation in a different format.
The representation of each protein conformation is trans-
formed from a set of 3 ⇤M floating point numbers (where
3 identifies the coordinates in the Cartesian space of each
atom, and M is the number of atoms in the entire protein,
with M > N ) to a N ⇤N matrix of floating point numbers.
Once the method has mapped each conformation to a
distance matrix, it applies classical multidimensional scaling
(MDS) to each distance matrix separately. MDS reduces the
N ⇤N symmetric distance matrix into a lower dimensional
matrix while maintaining the original information on the
protein conformation (i.e., the distance of each backbone
atom to all the other backbone atoms). Because of simplicity
and ease of human interpretation, we applied MDS to reduce
the data dimensionality to a N ⇤3 matrix. MDS also
generates a set of three N ⇤1 column vectors (q1, q2, q3),
and each eigenvector comes with its eigenvalue (λ1, λ2, λ3).
Each eigenvalue represents the amount of variations in the
data associated to the corresponding eigenvector [10]. In
this case we use the information contained in the leading
eigenvalues to summarize the conformational features of the
protein at a given time in the folding process. Thus, we
are able to reduce the protein dimensionality into a single
point in the 3-D Euclidean space. In other words, the three
eigenvalues represent the variance or curves of the backbone
atoms with respect to each other in the protein’s alpha-
helices or beta-sheets. Figure 1(c) shows the three eigenval-
ues and eigenvectors obtained for the matrix in Figure 1(b).
The figure also shows the reduced representation of the
protein conformation into the 3-D point. In summary, the
method maps each protein conformation from 3⇤M atom’s
coordinates to 3 ⇤1 floating point numbers. It performs the
mapping for each conformation in a frame separately from
the other mapped conformations in a concurrent manner.
B. Classification of meta-stable and transition stages
As the folding evolves, the protein changes between
meta-stable and transition stages. Each frame composing a
trajectory may contain up to 400 protein conformations that
can be clustered into one or more of these two stage cate-
In-situ analysis of folds –
Dimensionality reduction using PCA & MDS
14
Thomas J. Lane, Diwakar Shukla, Kyle A. Beauchamp, and Vijay S.
Pande
Astronomy
Physics Light Sources
Genomics
Climate
DOE Facilities are Facing a Data Deluge
15
Cray/AMPLab/LBL collaboration
Top Data Analytics Problems for NERSC users
1. Generative Model for the Visible Universe
● Joint Inference across telescope images
2. Pattern Detection for Climate
● Scalable Deep Learning on Cori
3. Stephen Hawking Device
● Machine Learning for speech prosthetic
4. Google Maps for BioImaging
● Semantic databases
5. Genome Assembly for Wheat
● Graph Analytics
17
Data Characteristics
Science
Domain
Data Source Data Characteristics Data Volume Analysis
Challenge
Cosmology Multiple
Telescopes
Noisy, multi-band,
artifacts
O(100) TB Data Fusion,
Inference
HEP Anti-Neutrino
detectors
Noisy, artifacts, spatio-
temporal
O(10) TB Pattern/Anomaly
Detection
BioImaging Mass-spec
instruments
Noisy, artifacts, multi-
modal
O(10) TB Dim. Reduction
Clustering, Pattern
Detection
Genomics Sequencers Missing data, errors O(1-10) TB Clustering, Pattern
Detection
Neuroscience Neural Recorders Spatio-temporal, high
dimensional, noisy
O(1) TB Dim. Reduction,
Pattern Detection
Climate Satellites,
Simulation o/p
Multi-variate, spatio-
temporal
O(10) TB Pattern/Anomaly
Detection
18
19
“Current HPC machines are optimized for
scientific simulations, arithmetic-intensive
workloads, and regular computation, whereas
data-analytics applications requires hardware
optimized for bandwidth, throughput,
concurrency and all-to-all communication.”
Tumeo & Feo (IEEE Computer, Aug)
20
“The age of insight will require hardware and
software infrastructures and ecosystems
optimized for data analytic modeling & data-
intensive simulations, including regular and
irregular data access, high bandwidth,
throughput, concurrency, all-to-all
communication and scalability.”

More Related Content

What's hot

Online Multi-Person Tracking Using Variance Magnitude of Image colors and Sol...
Online Multi-Person Tracking Using Variance Magnitude of Image colors and Sol...Online Multi-Person Tracking Using Variance Magnitude of Image colors and Sol...
Online Multi-Person Tracking Using Variance Magnitude of Image colors and Sol...Pourya Jafarzadeh
 
Image Segmentation Using Two Weighted Variable Fuzzy K Means
Image Segmentation Using Two Weighted Variable Fuzzy K MeansImage Segmentation Using Two Weighted Variable Fuzzy K Means
Image Segmentation Using Two Weighted Variable Fuzzy K MeansEditor IJCATR
 
Report-de Bruijn Graph
Report-de Bruijn GraphReport-de Bruijn Graph
Report-de Bruijn GraphAshwani kumar
 
A PSO-Based Subtractive Data Clustering Algorithm
A PSO-Based Subtractive Data Clustering AlgorithmA PSO-Based Subtractive Data Clustering Algorithm
A PSO-Based Subtractive Data Clustering AlgorithmIJORCS
 
A Dependent Set Based Approach for Large Graph Analysis
A Dependent Set Based Approach for Large Graph AnalysisA Dependent Set Based Approach for Large Graph Analysis
A Dependent Set Based Approach for Large Graph AnalysisEditor IJCATR
 
A Kernel Approach for Semi-Supervised Clustering Framework for High Dimension...
A Kernel Approach for Semi-Supervised Clustering Framework for High Dimension...A Kernel Approach for Semi-Supervised Clustering Framework for High Dimension...
A Kernel Approach for Semi-Supervised Clustering Framework for High Dimension...IJCSIS Research Publications
 
A fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming dataA fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming dataAlexander Decker
 
Dimension Reduction And Visualization Of Large High Dimensional Data Via Inte...
Dimension Reduction And Visualization Of Large High Dimensional Data Via Inte...Dimension Reduction And Visualization Of Large High Dimensional Data Via Inte...
Dimension Reduction And Visualization Of Large High Dimensional Data Via Inte...wl820609
 
A general weighted_fuzzy_clustering_algorithm
A general weighted_fuzzy_clustering_algorithmA general weighted_fuzzy_clustering_algorithm
A general weighted_fuzzy_clustering_algorithmTA Minh Thuy
 
The Gaussian Process Latent Variable Model (GPLVM)
The Gaussian Process Latent Variable Model (GPLVM)The Gaussian Process Latent Variable Model (GPLVM)
The Gaussian Process Latent Variable Model (GPLVM)James McMurray
 
AN IMPROVED MULTI-SOM ALGORITHM
AN IMPROVED MULTI-SOM ALGORITHMAN IMPROVED MULTI-SOM ALGORITHM
AN IMPROVED MULTI-SOM ALGORITHMIJNSA Journal
 
Macromodel of High Speed Interconnect using Vector Fitting Algorithm
Macromodel of High Speed Interconnect using Vector Fitting AlgorithmMacromodel of High Speed Interconnect using Vector Fitting Algorithm
Macromodel of High Speed Interconnect using Vector Fitting Algorithmijsrd.com
 
Bayesian Co clustering
Bayesian Co clusteringBayesian Co clustering
Bayesian Co clusteringlau
 
COMPARISON OF VOLUME AND DISTANCE CONSTRAINT ON HYPERSPECTRAL UNMIXING
COMPARISON OF VOLUME AND DISTANCE CONSTRAINT ON HYPERSPECTRAL UNMIXINGCOMPARISON OF VOLUME AND DISTANCE CONSTRAINT ON HYPERSPECTRAL UNMIXING
COMPARISON OF VOLUME AND DISTANCE CONSTRAINT ON HYPERSPECTRAL UNMIXINGcsandit
 
Architecture neural network deep optimizing based on self organizing feature ...
Architecture neural network deep optimizing based on self organizing feature ...Architecture neural network deep optimizing based on self organizing feature ...
Architecture neural network deep optimizing based on self organizing feature ...journalBEEI
 
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...lauratoni4
 
Learning Graph Representation for Data-Efficiency RL
Learning Graph Representation for Data-Efficiency RLLearning Graph Representation for Data-Efficiency RL
Learning Graph Representation for Data-Efficiency RLlauratoni4
 

What's hot (20)

Online Multi-Person Tracking Using Variance Magnitude of Image colors and Sol...
Online Multi-Person Tracking Using Variance Magnitude of Image colors and Sol...Online Multi-Person Tracking Using Variance Magnitude of Image colors and Sol...
Online Multi-Person Tracking Using Variance Magnitude of Image colors and Sol...
 
Image Segmentation Using Two Weighted Variable Fuzzy K Means
Image Segmentation Using Two Weighted Variable Fuzzy K MeansImage Segmentation Using Two Weighted Variable Fuzzy K Means
Image Segmentation Using Two Weighted Variable Fuzzy K Means
 
Report-de Bruijn Graph
Report-de Bruijn GraphReport-de Bruijn Graph
Report-de Bruijn Graph
 
A PSO-Based Subtractive Data Clustering Algorithm
A PSO-Based Subtractive Data Clustering AlgorithmA PSO-Based Subtractive Data Clustering Algorithm
A PSO-Based Subtractive Data Clustering Algorithm
 
Self-organizing map
Self-organizing mapSelf-organizing map
Self-organizing map
 
A Dependent Set Based Approach for Large Graph Analysis
A Dependent Set Based Approach for Large Graph AnalysisA Dependent Set Based Approach for Large Graph Analysis
A Dependent Set Based Approach for Large Graph Analysis
 
A Kernel Approach for Semi-Supervised Clustering Framework for High Dimension...
A Kernel Approach for Semi-Supervised Clustering Framework for High Dimension...A Kernel Approach for Semi-Supervised Clustering Framework for High Dimension...
A Kernel Approach for Semi-Supervised Clustering Framework for High Dimension...
 
A fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming dataA fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming data
 
Dimension Reduction And Visualization Of Large High Dimensional Data Via Inte...
Dimension Reduction And Visualization Of Large High Dimensional Data Via Inte...Dimension Reduction And Visualization Of Large High Dimensional Data Via Inte...
Dimension Reduction And Visualization Of Large High Dimensional Data Via Inte...
 
A general weighted_fuzzy_clustering_algorithm
A general weighted_fuzzy_clustering_algorithmA general weighted_fuzzy_clustering_algorithm
A general weighted_fuzzy_clustering_algorithm
 
The Gaussian Process Latent Variable Model (GPLVM)
The Gaussian Process Latent Variable Model (GPLVM)The Gaussian Process Latent Variable Model (GPLVM)
The Gaussian Process Latent Variable Model (GPLVM)
 
AN IMPROVED MULTI-SOM ALGORITHM
AN IMPROVED MULTI-SOM ALGORITHMAN IMPROVED MULTI-SOM ALGORITHM
AN IMPROVED MULTI-SOM ALGORITHM
 
Macromodel of High Speed Interconnect using Vector Fitting Algorithm
Macromodel of High Speed Interconnect using Vector Fitting AlgorithmMacromodel of High Speed Interconnect using Vector Fitting Algorithm
Macromodel of High Speed Interconnect using Vector Fitting Algorithm
 
Bayesian Co clustering
Bayesian Co clusteringBayesian Co clustering
Bayesian Co clustering
 
Beyond Bag of Features: Adaptive Hilbert Scan Based Tree for Image Retrieval
Beyond Bag of Features: Adaptive Hilbert Scan Based Tree for Image RetrievalBeyond Bag of Features: Adaptive Hilbert Scan Based Tree for Image Retrieval
Beyond Bag of Features: Adaptive Hilbert Scan Based Tree for Image Retrieval
 
COMPARISON OF VOLUME AND DISTANCE CONSTRAINT ON HYPERSPECTRAL UNMIXING
COMPARISON OF VOLUME AND DISTANCE CONSTRAINT ON HYPERSPECTRAL UNMIXINGCOMPARISON OF VOLUME AND DISTANCE CONSTRAINT ON HYPERSPECTRAL UNMIXING
COMPARISON OF VOLUME AND DISTANCE CONSTRAINT ON HYPERSPECTRAL UNMIXING
 
Architecture neural network deep optimizing based on self organizing feature ...
Architecture neural network deep optimizing based on self organizing feature ...Architecture neural network deep optimizing based on self organizing feature ...
Architecture neural network deep optimizing based on self organizing feature ...
 
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
 
Learning Graph Representation for Data-Efficiency RL
Learning Graph Representation for Data-Efficiency RLLearning Graph Representation for Data-Efficiency RL
Learning Graph Representation for Data-Efficiency RL
 
A280108
A280108A280108
A280108
 

Similar to Cray HPC + D + A = HPDA

Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...
Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...
Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...Melissa Moody
 
Making effective use of graphics processing units (GPUs) in computations
Making effective use of graphics processing units (GPUs) in computationsMaking effective use of graphics processing units (GPUs) in computations
Making effective use of graphics processing units (GPUs) in computationsOregon State University
 
Unsupervised Clustering Classify theCancer Data withtheHelp of FCM Algorithm
Unsupervised Clustering Classify theCancer Data withtheHelp of FCM AlgorithmUnsupervised Clustering Classify theCancer Data withtheHelp of FCM Algorithm
Unsupervised Clustering Classify theCancer Data withtheHelp of FCM AlgorithmIOSR Journals
 
Executive SummaryIntroductionProtein engineering
Executive SummaryIntroductionProtein engineeringExecutive SummaryIntroductionProtein engineering
Executive SummaryIntroductionProtein engineeringBetseyCalderon89
 
ENHANCING SEGMENTATION APPROACHES FROM FUZZY K-C-MEANS TO FUZZY-MPSO BASED LI...
ENHANCING SEGMENTATION APPROACHES FROM FUZZY K-C-MEANS TO FUZZY-MPSO BASED LI...ENHANCING SEGMENTATION APPROACHES FROM FUZZY K-C-MEANS TO FUZZY-MPSO BASED LI...
ENHANCING SEGMENTATION APPROACHES FROM FUZZY K-C-MEANS TO FUZZY-MPSO BASED LI...Christo Ananth
 
An Automatic Clustering Technique for Optimal Clusters
An Automatic Clustering Technique for Optimal ClustersAn Automatic Clustering Technique for Optimal Clusters
An Automatic Clustering Technique for Optimal ClustersIJCSEA Journal
 
Data Mining Protein Structures' Topological Properties to Enhance Contact Ma...
Data Mining Protein Structures' Topological Properties  to Enhance Contact Ma...Data Mining Protein Structures' Topological Properties  to Enhance Contact Ma...
Data Mining Protein Structures' Topological Properties to Enhance Contact Ma...jaumebp
 
Analysis of Cholesterol Quantity Detection and ANN Classification
Analysis of Cholesterol Quantity Detection and ANN ClassificationAnalysis of Cholesterol Quantity Detection and ANN Classification
Analysis of Cholesterol Quantity Detection and ANN ClassificationIJCSIS Research Publications
 
Project Presentation
Project PresentationProject Presentation
Project Presentationbutest
 
Multiscale and integrative single-cell Hi-C analysis with Higashi.pdf
Multiscale and integrative single-cell Hi-C analysis with Higashi.pdfMultiscale and integrative single-cell Hi-C analysis with Higashi.pdf
Multiscale and integrative single-cell Hi-C analysis with Higashi.pdfAtiaGohar1
 
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...Dominic Suciu
 
Study on Reconstruction Accuracy using shapiness index of morphological trans...
Study on Reconstruction Accuracy using shapiness index of morphological trans...Study on Reconstruction Accuracy using shapiness index of morphological trans...
Study on Reconstruction Accuracy using shapiness index of morphological trans...ijcseit
 
Comparison Between Clustering Algorithms for Microarray Data Analysis
Comparison Between Clustering Algorithms for Microarray Data AnalysisComparison Between Clustering Algorithms for Microarray Data Analysis
Comparison Between Clustering Algorithms for Microarray Data AnalysisIOSR Journals
 
Multisite UTE 31P Rosette MRSI(PETALUTE)
Multisite UTE 31P Rosette MRSI(PETALUTE)Multisite UTE 31P Rosette MRSI(PETALUTE)
Multisite UTE 31P Rosette MRSI(PETALUTE)Uzay Emir
 
EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...
EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...
EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...ijasuc
 
EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...
EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...
EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...ijasuc
 
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...Machine Learning Algorithms for Image Classification of Hand Digits and Face ...
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...IRJET Journal
 
ADAPTIVE SEGMENTATION OF CELLS AND PARTICLES IN FLUORESCENT MICROSCOPE IMAGE
ADAPTIVE SEGMENTATION OF CELLS AND PARTICLES IN FLUORESCENT MICROSCOPE IMAGEADAPTIVE SEGMENTATION OF CELLS AND PARTICLES IN FLUORESCENT MICROSCOPE IMAGE
ADAPTIVE SEGMENTATION OF CELLS AND PARTICLES IN FLUORESCENT MICROSCOPE IMAGEJournal For Research
 

Similar to Cray HPC + D + A = HPDA (20)

Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...
Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...
Deep Learning Meets Biology: How Does a Protein Helix Know Where to Start and...
 
Making effective use of graphics processing units (GPUs) in computations
Making effective use of graphics processing units (GPUs) in computationsMaking effective use of graphics processing units (GPUs) in computations
Making effective use of graphics processing units (GPUs) in computations
 
Unsupervised Clustering Classify theCancer Data withtheHelp of FCM Algorithm
Unsupervised Clustering Classify theCancer Data withtheHelp of FCM AlgorithmUnsupervised Clustering Classify theCancer Data withtheHelp of FCM Algorithm
Unsupervised Clustering Classify theCancer Data withtheHelp of FCM Algorithm
 
Executive SummaryIntroductionProtein engineering
Executive SummaryIntroductionProtein engineeringExecutive SummaryIntroductionProtein engineering
Executive SummaryIntroductionProtein engineering
 
ENHANCING SEGMENTATION APPROACHES FROM FUZZY K-C-MEANS TO FUZZY-MPSO BASED LI...
ENHANCING SEGMENTATION APPROACHES FROM FUZZY K-C-MEANS TO FUZZY-MPSO BASED LI...ENHANCING SEGMENTATION APPROACHES FROM FUZZY K-C-MEANS TO FUZZY-MPSO BASED LI...
ENHANCING SEGMENTATION APPROACHES FROM FUZZY K-C-MEANS TO FUZZY-MPSO BASED LI...
 
Recursive
RecursiveRecursive
Recursive
 
An Automatic Clustering Technique for Optimal Clusters
An Automatic Clustering Technique for Optimal ClustersAn Automatic Clustering Technique for Optimal Clusters
An Automatic Clustering Technique for Optimal Clusters
 
Data Mining Protein Structures' Topological Properties to Enhance Contact Ma...
Data Mining Protein Structures' Topological Properties  to Enhance Contact Ma...Data Mining Protein Structures' Topological Properties  to Enhance Contact Ma...
Data Mining Protein Structures' Topological Properties to Enhance Contact Ma...
 
Analysis of Cholesterol Quantity Detection and ANN Classification
Analysis of Cholesterol Quantity Detection and ANN ClassificationAnalysis of Cholesterol Quantity Detection and ANN Classification
Analysis of Cholesterol Quantity Detection and ANN Classification
 
Project Presentation
Project PresentationProject Presentation
Project Presentation
 
Multiscale and integrative single-cell Hi-C analysis with Higashi.pdf
Multiscale and integrative single-cell Hi-C analysis with Higashi.pdfMultiscale and integrative single-cell Hi-C analysis with Higashi.pdf
Multiscale and integrative single-cell Hi-C analysis with Higashi.pdf
 
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
 
Study on Reconstruction Accuracy using shapiness index of morphological trans...
Study on Reconstruction Accuracy using shapiness index of morphological trans...Study on Reconstruction Accuracy using shapiness index of morphological trans...
Study on Reconstruction Accuracy using shapiness index of morphological trans...
 
Comparison Between Clustering Algorithms for Microarray Data Analysis
Comparison Between Clustering Algorithms for Microarray Data AnalysisComparison Between Clustering Algorithms for Microarray Data Analysis
Comparison Between Clustering Algorithms for Microarray Data Analysis
 
Multisite UTE 31P Rosette MRSI(PETALUTE)
Multisite UTE 31P Rosette MRSI(PETALUTE)Multisite UTE 31P Rosette MRSI(PETALUTE)
Multisite UTE 31P Rosette MRSI(PETALUTE)
 
EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...
EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...
EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...
 
EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...
EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...
EBCD: A ROUTING ALGORITHM BASED ON BEE COLONY FOR ENERGY CONSUMPTION REDUCTIO...
 
T180203125133
T180203125133T180203125133
T180203125133
 
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...Machine Learning Algorithms for Image Classification of Hand Digits and Face ...
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...
 
ADAPTIVE SEGMENTATION OF CELLS AND PARTICLES IN FLUORESCENT MICROSCOPE IMAGE
ADAPTIVE SEGMENTATION OF CELLS AND PARTICLES IN FLUORESCENT MICROSCOPE IMAGEADAPTIVE SEGMENTATION OF CELLS AND PARTICLES IN FLUORESCENT MICROSCOPE IMAGE
ADAPTIVE SEGMENTATION OF CELLS AND PARTICLES IN FLUORESCENT MICROSCOPE IMAGE
 

More from inside-BigData.com

Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networksinside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networksinside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoringinside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecastsinside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Updateinside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuninginside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODinside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Accelerationinside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficientlyinside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Erainside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computinginside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Clusterinside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Recently uploaded

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Cray HPC + D + A = HPDA

  • 1. HPC + D + A = HPDA? Barry Bolding, PhD SVP & Chief Strategy Officer Cray Inc. bbolding@cray.com 1
  • 2. Era of Mobile Age of Context https://medium.com/crossing-the-pond/into-the- age-of-context-f0aed15171d7 Age of Context - Robert Scoble and Shel Israel 2
  • 3. Context can be powerful, but… Age of Context Contextual interpolation Age of Insight Insightful extrapolation 3
  • 4. Cray’s Vision: The Fusion of Supercomputing and Big & Fast Data Modeling The World Cray Supercomputers solving “grand challenges” in science, engineering and analytics Compute Store Analyze Data- Intensive Processing High throughput event processing & data capture from sensors, data feeds and instruments Math Models Modeling and simulation augmented with data to provide the highest fidelity virtual reality results Data Models Integration of datasets and math models for search, analysis, predictive modeling and knowledge discovery 4
  • 5. Cray’s Unique Strengths in Supercomputing + Analytics 5 Productivity based ROI Customer Community Systems by Design The Cray community of customers and partners are the most demanding and enabling. Allows Cray to push the boundaries of scalable systems. We design our systems for production computing and analytics at scale. Cray takes the productive applications view of ROI: • Match system designs to the applications • Enable highly productive programming environments. • Build solutions that are flexible and upgradeable. • Lower TCO than commodity clusters and cloud for data- intensive applications
  • 6. The Alan Turing Institute ● UK national institute for data sciences ● to break new boundaries in how we use big data in a fast moving, competitive world ● Joint venture: founder Universities and EPSRC ● Cambridge, Edinburgh, Oxford, UCL, Warwick ● Launch partners ● Lloyds Register Foundation, GCHQ and Cray ● UK Government investment in “Eight Great Technologies” ● https://turing.ac.uk/ 6
  • 7. 7 “We don’t just want insights, we want actionable insights.” Peter Grindrod, University of Oxford
  • 8. Human Genome Assembly – Under 9 Minutes on “Edison” ● Cray XC30 “Edison” reduces the assembly time of a complete human genome to 8.4 minutes ● A UPC modified version of the Meraculous code (Hip-Mer) was 170x faster. ● Eliminating the analytics back-log and fueling the drive towards precision medicine ● Cray’s CX30 with Aries interconnect coupled with high memory bandwidth and memory on each node provided the I/O bandwidth required for genome assembly work The end-to-end scaling of the team’s HipMer genome assembler approach showing the human genome scalability on Edison on the left and the more complex wheat genome on the right (both axes are in log scale). References: http://www.theplatform.net/2015/08/20/supercomputer-force-knocks-human-genome-assembly-under-9-minutes/ http://gauss.cs.ucsb.edu/~aydin/sc15_genome.pdf 8
  • 9. 9 “Using our HipMer technology enables for the first time assembly throughput to exceed the capability of all the world's sequencers” HipMer: an Etreme-Scale DeNovo Genome Assembler Georganas, Buluc, Chapman, Hofmeyr, Aluru, Egan, Oliker, Rokshar and Yelick
  • 10. Magnus (CX30) Whitefly study at IVEC 10 ● Dr. Laura Boykin ● funded by the Gates Foundation and a TED Fellowship. ● Awarded time on “Magnus,” Her team is marrying genomics, supercomputing and evolutionary history to help African farmers develop management strategies and breed pest- resistant cropsThey are battling a species complex of at least 34 morphologically indistinguishable species.
  • 11. 11 “It’s a massive problem. I’m one of 15 principal investigators working on a new project whose mission is to give farmers a cassava plant that’s resistant to the viruses and the whiteflies.” Dr. Laura Boykin
  • 12. Copyright 2015 Cray Inc. Protein Folding – Mixed Simulation and Analytics ● Focuses on how protein folding happens ● Model possible paths to folded end-state ● Temporal resolution matters, but drives data size Markov State model of a fold Model Data geometry biology physics variables of interest to create the model to drive the simulation to compare with simulation results input conditions solution process solved variables of interest Simulation http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3673555/pdf/nihms428070.pdf Curr Opin Struct Biol. 2013 February ; 23(1): 58–65. doi:10.1016/j.sbi.2012.11.002.
  • 13. Copyright 2015 Cray Inc. Protein Folding – Mixed Simulation and Analytics Model Data geometry biology physics variables of interest to create the model to drive the simulation to compare with simulation results input conditions solution process solved variables of interest Simulation Automated classification in protein databases (a) (b) (c) Figure 1: One conformation of the villin HP-35 protein (a); part of its distance matrix using only its backbone atoms in the conformation (b); and three eigenvectors and the associated eigenvalues capturing and synthesizing the conformation geometry (c). III. METHODOLOGY Our method involves three steps. Given a trajectory composed of frames generated in a distributed fashion and containing up to 400 consecutively folding protein conformations, our method first extracts relevant features from each conformation into alocal metadata representation. In our case, the relevant feature is the local knowledge on the geometric conformation of the protein structure. Second, the method maps each conformation to either a meta-stable or a transition stage using the local knowledge on the conformational features of the structures in the frame only. Finally, the method rebuilds the global knowledge (i.e., stages and conformation space explored) of the given trajectories through a reduction operation. A. Extraction of conformational features Our method extracts the shape of each single folding conformation into a metadata representation that preserves therelevant information contained in thedatabut reduces the overall data size. Rather than working with the atom coordi- nates of the complete protein molecule, our method first rep- resents each protein conformation using an N ⇤N distance matrix (DM), where N is the number of backbone carbon atoms in the protein (i.e., the alpha carbon of each amino acid). More specifically, the matrix contains the distance from each backbone atom to the other backbone atoms in the same protein conformation. Figure 1(a) shows an example of a folding conformation for the protein HP-35 NleNle (i.e., a variant of the villin headpiece subdomain) taken from folding trajectories generated by Folding@Home [9]. Figure 1(b) shows part of the corresponding distance matrix representing the same conformation in a different format. The representation of each protein conformation is trans- formed from a set of 3 ⇤M floating point numbers (where 3 identifies the coordinates in the Cartesian space of each atom, and M is the number of atoms in the entire protein, with M > N ) to a N ⇤N matrix of floating point numbers. Once the method has mapped each conformation to a distance matrix, it applies classical multidimensional scaling (MDS) to each distance matrix separately. MDS reduces the N ⇤N symmetric distance matrix into a lower dimensional matrix while maintaining the original information on the protein conformation (i.e., the distance of each backbone atom to all the other backbone atoms). Because of simplicity and ease of human interpretation, we applied MDS to reduce the data dimensionality to a N ⇤3 matrix. MDS also generates a set of three N ⇤1 column vectors (q1, q2, q3), and each eigenvector comes with its eigenvalue (λ1, λ2, λ3). Each eigenvalue represents the amount of variations in the data associated to the corresponding eigenvector [10]. In this case we use the information contained in the leading eigenvalues to summarize the conformational features of the protein at a given time in the folding process. Thus, we are able to reduce the protein dimensionality into a single point in the 3-D Euclidean space. In other words, the three eigenvalues represent the variance or curves of the backbone atoms with respect to each other in the protein’s alpha- helices or beta-sheets. Figure 1(c) shows the three eigenval- ues and eigenvectors obtained for the matrix in Figure 1(b). The figure also shows the reduced representation of the protein conformation into the 3-D point. In summary, the method maps each protein conformation from 3⇤M atom’s coordinates to 3 ⇤1 floating point numbers. It performs the mapping for each conformation in a frame separately from the other mapped conformations in a concurrent manner. B. Classification of meta-stable and transition stages As the folding evolves, the protein changes between meta-stable and transition stages. Each frame composing a trajectory may contain up to 400 protein conformations that can be clustered into one or more of these two stage cate- In-situ analysis of folds – Dimensionality reduction using PCA & MDS
  • 14. 14 Thomas J. Lane, Diwakar Shukla, Kyle A. Beauchamp, and Vijay S. Pande
  • 15. Astronomy Physics Light Sources Genomics Climate DOE Facilities are Facing a Data Deluge 15
  • 17. Top Data Analytics Problems for NERSC users 1. Generative Model for the Visible Universe ● Joint Inference across telescope images 2. Pattern Detection for Climate ● Scalable Deep Learning on Cori 3. Stephen Hawking Device ● Machine Learning for speech prosthetic 4. Google Maps for BioImaging ● Semantic databases 5. Genome Assembly for Wheat ● Graph Analytics 17
  • 18. Data Characteristics Science Domain Data Source Data Characteristics Data Volume Analysis Challenge Cosmology Multiple Telescopes Noisy, multi-band, artifacts O(100) TB Data Fusion, Inference HEP Anti-Neutrino detectors Noisy, artifacts, spatio- temporal O(10) TB Pattern/Anomaly Detection BioImaging Mass-spec instruments Noisy, artifacts, multi- modal O(10) TB Dim. Reduction Clustering, Pattern Detection Genomics Sequencers Missing data, errors O(1-10) TB Clustering, Pattern Detection Neuroscience Neural Recorders Spatio-temporal, high dimensional, noisy O(1) TB Dim. Reduction, Pattern Detection Climate Satellites, Simulation o/p Multi-variate, spatio- temporal O(10) TB Pattern/Anomaly Detection 18
  • 19. 19 “Current HPC machines are optimized for scientific simulations, arithmetic-intensive workloads, and regular computation, whereas data-analytics applications requires hardware optimized for bandwidth, throughput, concurrency and all-to-all communication.” Tumeo & Feo (IEEE Computer, Aug)
  • 20. 20 “The age of insight will require hardware and software infrastructures and ecosystems optimized for data analytic modeling & data- intensive simulations, including regular and irregular data access, high bandwidth, throughput, concurrency, all-to-all communication and scalability.”