Alluxio + Eckerson Webinar | Simplifying and Accelerating Data Access for AI/ML Model Training

Alluxio, Inc.
Alluxio, Inc.Alluxio, Inc.
© Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com
Simplifying and Accelerating Data Access
for AI/ML Model Training
Kevin Petrie
Vice President of Research
Sridhar Venkatesh
SVP of Product
© Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com
The Rise of Generative AI
After lots of training, language models generate strings of words
that become logical sentences and paragraphs
• A neural network whose nodes
share inputs and outputs
• Learns, summarizes, and
generates content
• Creates textual answers to
natural language questions
Source: The Economist
WHAT IS A LANGUAGE MODEL?
© Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com
The Risk: Robots Get Things Wrong Too
Data teams must inspect, validate, and govern language model outputs
RISKS OF LANGUAGE MODELS
DATA QUALITY
Inaccuracies due to inaccurate/insufficient inputs, lack of context
EXPLAINABILITY
Vague/unknown sources or reasoning
PRIVACY
Exposure or theft due to user tracking
INTELLECTUAL PROPERTY
Liability for mishandled trademarks, copyrights, etc.
FAIRNESS
Perpetuation of bias in training data
© Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com
Time to Get Domain Specific
Domain-specific, “small” language models reduce risk and boost productivity
by providing more governed and specialized outputs
• Enriched, detailed user
prompts
• Fine-tuned training on
enterprise data
• Augmented outputs; e.g.,
from multiple models
Small
Language
Model (SLM)
Large
Language
Model (LLM)
More
Governed
Generic Specialty
Less
Governed
ENTER THE SMALL
LANGUAGE MODEL
© Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com
Small Language Models Will Drive the GenAI Boom
30% of data practitioners are building or training their own language models now. 20% more plan to do so*
*Source: Active LinkedIn survey of 55 respondents to date
“We believe in a world where
everyone is empowered to build
and train their own models,
imbued with their own opinions
and viewpoints.”
- Naveen Rao, Co-Founder and CEO,
MosaicML
© Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com
“We hold these truths to be self-evident…”
TEXT
TOKENS We hold these …
VECTORS
truths to
[.45, 6.3, .99]
[7.6, .04, 19] [84, .13, 1.6]
VECTOR DB [.45, 6.3, .99] [7.6, .04, 19] [84, .13, 1.6]
LANGUAGE
MODEL
QUERY ONE QUERY TWO
1
2
3
4
5
Data teams must design and build new pipelines to feed their domain-specific data into language models
Data Processing for Language Models
Assemble unstructured text from various files
Convert words and punctuation marks to tokens
Use embeddings to convert tokens into numerical
vectors that describe their semantics
Load, organize, and index these vectors in a vector
database
Use a language model to search and query the
vectors while responding to real-time user prompts
NEW DATA PIPELINE
© Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com
ON PREMISES | HYBRID | CLOUD | MULTI CLOUD
GENERATIVE AI EMBEDDED
CATALOG GOVERN
OBSERVE
INTEGRATE MASTER
STRUCTURED DATA
(DB TABLES)
SEMI STRUCTURED
(LOGS, CLICKSTREAMS, SENSORS…)
UNSTRUCTURED
(TEXT, IMAGES…)
CATALOG
INTEGRATE MASTER
ANALYTICS OPERATIONS
As companies embed generative AI into their workflows, they must manage
and process multi-structured data in a more holistic and efficient manner
The New Generative AI Data Stack
© Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com
AI/ML Initiatives Need Fast and Simple Data Access
AI/ML initiatives require companies to balance, optimize, and secure workloads
across distributed datasets and compute resources
• Data access. View and process data wherever
it resides
• Performance. Retrieve data with low
latency/high throughput
• Portability. Run applications wherever suitable
compute resides
• Cost visibility. Monitor and control compute
cycles
• Multi tenancy. Isolate application compute to
safeguard performance
• Security. Restrict data access to minimize risk
of breaches
REQUIREMENTS
© Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com
The Data Access Layer: Architecture
The data access layer continuously adjusts workloads, storage, and compute
• Namespace. Unified interface for
all data access
• APIs. Dynamic communication
between applications and storage
• Caching. Tier data by priority:
memory, SSDs, object store
• Metadata. Centralize descriptions
of data objects and resources
• Security. Authenticate users,
authorize access, log actions
© Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com
Use Cases
AI initiatives have a range of use cases that require fast and simple data access
DATA CENTER CLOUD 1 CLOUD 2
ANALYTICS & AI IN A HYBRID ENVIRONMENT
ANALYTICS & AI ACROSS CLOUDS
WORKLOAD BURSTS
PROJECT EXPANSIONS
MIGRATIONS
COST OPTIMIZATION
© Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com
Guiding Principles
Data leaders should evaluate where and how to implement a data access layer
to support generative AI initiatives
FIND THE BOTTLENECK
DECIDE WHETHER TO BUILD OR BUY
PLAN FOR GROWTH
© Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com
Questions?
I’m listening!
13
Retooling the enterprise data infrastructure
Legacy data centers can’t keep up
High Performance
Computing
Specialized
Hardware
Varied
Workloads
We're seeing incredible orders to retool
the world's data centers… a 10-year
transition to basically recycle or reclaim
the world's data centers and build it out
as accelerated computing.
Jensen Huang
Nvidia CEO
“
Challenges as you try to scale
14
GPUs are this year’s
toilet paper.
Wall Street Journal
“
GPUs are
scarce
GPUs are
expensive
Low GPU
Utilization
Business Pressures Complex & Costly Solutions
GPUs are
scarce
GPUs are
expensive
Low GPU
Utilization
Faster model
development times
Increased
freshness
Higher accuracy
and traceability
Rapidly growing
datasets
Extensive data engineering
managing data copies
Specialized storage
Out of control cloud and
infra costs
15
16
Alluxio Data Platform
High Performance data access, unified global view
1.Faster Time-to-Market
50%
Hundreds of thousands of dollars saved annually
compared to previous deployment.
2-3X Model Training Performance Cost Reduction, Performance Boost
International B2C with a multi-cloud, cross-region AI platform, serving LLMs and training
models from object storage. They optimized their AI platform with Alluxio to speed data
delivery to training clusters and facilitate faster model deployment in latency sensitive
production use cases.
Models Deployed in Minutes vs Days
Faster model deployment times
2. Higher GPU Utilization
“In a cloud environment, where GPU hardware is paid for as a function of time, you need
fast, performant, reliable, and cost effective data for your model training pipelines to keep
your GPU utilization close to 99%.”
20-30%
Average reported GPU utilization
based on direct access from remote
storage
GPU Utilization accessing commodity storage
GPU Utilization accessing Alluxio
Alluxio serves high throughput data to K8s training
workloads.
90
%
GPU utilization from Alluxio serving
data pulled from object storage. In
increase from 50% utilization via s3fs-
fuse.
3. Reduction in Personnel
Increase in Productivity
Pre-Processed
Data
Data
Management
Pre-
Processed
Data
Training
Clusters
Data scientists
send requests to
AI platform
teams. Platform
teams set up
individual data
pipelines.
With Alluxio, data
scientists just
access their data.
Alluxio
consolidates many
pipelines into an
access layer.
Pipeline or
Scheduler
Training
Clusters
20
4. Reduction in Infrastructure Spend
Alluxio optimizes data platforms to increase efficiency
Data Engineering
Pipelines
Data workflows improved by on-
demand access from Alluxio cache
S3 Egress and API
Fees
Fees significantly reduced via
granular caching and data
reuse
High Performance
Computing
Replaceable with low-cost hardware
at comparable performance
Reduced or Eliminated
Network Congestion Network congestion reduced by
serving files locally
5. Cloud Vendor Leverage
Multi-cloud strategies with cost-effective benefits
Respond to Limited GPU Availability
Demand for GPUs has exploded
Organizations use Alluxio to supply high performance data access
to remote GPU clusters wherever they find capacity.
Increase Cloud Agility
Competing CSPs may provide attractive discounts
Alluxio empowers organizations to capitalize on hardware discounts
or cost-effective storage in real-time. Users access data wherever it
resides.
Avoid Vendor Lock-In
Negotiate with CSPs from a stronger position
Single cloud deployments are convenient, but that may become an
obstacle in negotiations. Alluxio facilitates hybrid and multi-cloud.
Twitter.com/alluxio
Linkedin.com/alluxio
Website
www.alluxio.io
Slack
http://slackin.alluxio.io/
@
Social Media
Q&A
1 de 22

Recomendados

2022 Trends in Enterprise Analytics por
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise AnalyticsDATAVERSITY
511 visualizações36 slides
Curiosity and fourTheorem present: From Coverage Guesswork to Targeted Test G... por
Curiosity and fourTheorem present: From Coverage Guesswork to Targeted Test G...Curiosity and fourTheorem present: From Coverage Guesswork to Targeted Test G...
Curiosity and fourTheorem present: From Coverage Guesswork to Targeted Test G...Curiosity Software Ireland
29 visualizações33 slides
Data Science Salon: Applying Machine Learning to Modernize Business Processes por
Data Science Salon: Applying Machine Learning to Modernize Business ProcessesData Science Salon: Applying Machine Learning to Modernize Business Processes
Data Science Salon: Applying Machine Learning to Modernize Business ProcessesFormulatedby
783 visualizações22 slides
Analytics and Lakehouse Integration Options for Oracle Applications por
Analytics and Lakehouse Integration Options for Oracle ApplicationsAnalytics and Lakehouse Integration Options for Oracle Applications
Analytics and Lakehouse Integration Options for Oracle ApplicationsRay Février
420 visualizações45 slides
Data science lab enabling flexibility por
Data science lab   enabling flexibilityData science lab   enabling flexibility
Data science lab enabling flexibilityKognitio
362 visualizações18 slides
Estimating the Total Costs of Your Cloud Analytics Platform por
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformDATAVERSITY
335 visualizações39 slides

Mais conteúdo relacionado

Similar a Alluxio + Eckerson Webinar | Simplifying and Accelerating Data Access for AI/ML Model Training

Privacy preserving public auditing for secured cloud storage por
Privacy preserving public auditing for secured cloud storagePrivacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storagedbpublications
32 visualizações9 slides
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19 por
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
2.5K visualizações35 slides
Building a Logical Data Fabric using Data Virtualization (ASEAN) por
Building a Logical Data Fabric using Data Virtualization (ASEAN)Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)Denodo
271 visualizações40 slides
Data Architecture Best Practices for Advanced Analytics por
Data Architecture Best Practices for Advanced AnalyticsData Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced AnalyticsDATAVERSITY
918 visualizações41 slides
Big Data Fabric: A Necessity For Any Successful Big Data Initiative por
Big Data Fabric: A Necessity For Any Successful Big Data InitiativeBig Data Fabric: A Necessity For Any Successful Big Data Initiative
Big Data Fabric: A Necessity For Any Successful Big Data InitiativeDenodo
588 visualizações24 slides
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014 por
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014Amazon Web Services
1.3K visualizações17 slides

Similar a Alluxio + Eckerson Webinar | Simplifying and Accelerating Data Access for AI/ML Model Training(20)

Privacy preserving public auditing for secured cloud storage por dbpublications
Privacy preserving public auditing for secured cloud storagePrivacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storage
dbpublications32 visualizações
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19 por Cloudera, Inc.
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Cloudera, Inc.2.5K visualizações
Building a Logical Data Fabric using Data Virtualization (ASEAN) por Denodo
Building a Logical Data Fabric using Data Virtualization (ASEAN)Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)
Denodo 271 visualizações
Data Architecture Best Practices for Advanced Analytics por DATAVERSITY
Data Architecture Best Practices for Advanced AnalyticsData Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced Analytics
DATAVERSITY918 visualizações
Big Data Fabric: A Necessity For Any Successful Big Data Initiative por Denodo
Big Data Fabric: A Necessity For Any Successful Big Data InitiativeBig Data Fabric: A Necessity For Any Successful Big Data Initiative
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Denodo 588 visualizações
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014 por Amazon Web Services
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
Amazon Web Services1.3K visualizações
Insurtech, Cloud and Cybersecurity - Chartered Insurance Institute por Henrique Centieiro
Insurtech, Cloud and Cybersecurity -  Chartered Insurance InstituteInsurtech, Cloud and Cybersecurity -  Chartered Insurance Institute
Insurtech, Cloud and Cybersecurity - Chartered Insurance Institute
Henrique Centieiro106 visualizações
THE SURVEY ON REFERENCE MODEL FOR OPEN STORAGE SYSTEMS INTERCONNECTION MASS S... por IRJET Journal
THE SURVEY ON REFERENCE MODEL FOR OPEN STORAGE SYSTEMS INTERCONNECTION MASS S...THE SURVEY ON REFERENCE MODEL FOR OPEN STORAGE SYSTEMS INTERCONNECTION MASS S...
THE SURVEY ON REFERENCE MODEL FOR OPEN STORAGE SYSTEMS INTERCONNECTION MASS S...
IRJET Journal2 visualizações
IRJET - Efficient and Verifiable Queries over Encrypted Data in Cloud por IRJET Journal
 IRJET - Efficient and Verifiable Queries over Encrypted Data in Cloud IRJET - Efficient and Verifiable Queries over Encrypted Data in Cloud
IRJET - Efficient and Verifiable Queries over Encrypted Data in Cloud
IRJET Journal12 visualizações
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale por Robb Boyd
Infrastructure Solutions for Deploying AI/ML/DL Workloads at ScaleInfrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Robb Boyd357 visualizações
Cloudera federal summit por Matt Carroll
Cloudera federal summitCloudera federal summit
Cloudera federal summit
Matt Carroll462 visualizações
Advanced Analytics and Machine Learning with Data Virtualization (India) por Denodo
Advanced Analytics and Machine Learning with Data Virtualization (India)Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)
Denodo 109 visualizações
Building ML Pipelines with DCOS por QAware GmbH
Building ML Pipelines with DCOSBuilding ML Pipelines with DCOS
Building ML Pipelines with DCOS
QAware GmbH277 visualizações
Databricks on AWS.pptx por Wasm1953
Databricks on AWS.pptxDatabricks on AWS.pptx
Databricks on AWS.pptx
Wasm1953161 visualizações
Building Data Ecosystems for Accelerated Discovery por adamkraut
Building Data Ecosystems for Accelerated DiscoveryBuilding Data Ecosystems for Accelerated Discovery
Building Data Ecosystems for Accelerated Discovery
adamkraut33 visualizações
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris... por DATAVERSITY
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
DATAVERSITY829 visualizações
Breed data scientists_ A Presentation.pptx por GautamPopli1
Breed data scientists_ A Presentation.pptxBreed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptx
GautamPopli16 visualizações
BEST FINAL YEAR PROJECT IEEE 2015 BY SPECTRUM SOLUTIONS PONDICHERRY por Raushan Kumar Singh
BEST FINAL YEAR PROJECT IEEE 2015 BY SPECTRUM SOLUTIONS PONDICHERRYBEST FINAL YEAR PROJECT IEEE 2015 BY SPECTRUM SOLUTIONS PONDICHERRY
BEST FINAL YEAR PROJECT IEEE 2015 BY SPECTRUM SOLUTIONS PONDICHERRY
Raushan Kumar Singh982 visualizações
ADV Slides: Building and Growing Organizational Analytics with Data Lakes por DATAVERSITY
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
DATAVERSITY567 visualizações
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc... por Denodo
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
Denodo 73 visualizações

Mais de Alluxio, Inc.

Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI... por
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...Alluxio, Inc.
12 visualizações24 slides
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca... por
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...Alluxio, Inc.
19 visualizações22 slides
AI Infra Day | The Generative AI Market And Intel AI Strategy and Product Up... por
AI Infra Day | The Generative AI Market  And Intel AI Strategy and Product Up...AI Infra Day | The Generative AI Market  And Intel AI Strategy and Product Up...
AI Infra Day | The Generative AI Market And Intel AI Strategy and Product Up...Alluxio, Inc.
32 visualizações9 slides
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS por
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWSAlluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWSAlluxio, Inc.
74 visualizações28 slides
Alluxio Product school Webinar - Distributed Caching for Generative AI por
Alluxio Product school Webinar - Distributed Caching for Generative AIAlluxio Product school Webinar - Distributed Caching for Generative AI
Alluxio Product school Webinar - Distributed Caching for Generative AIAlluxio, Inc.
115 visualizações15 slides
Alluxio Product School Webinar - Get Started with Alluxio on Kubernetes por
Alluxio Product School Webinar - Get Started with Alluxio on KubernetesAlluxio Product School Webinar - Get Started with Alluxio on Kubernetes
Alluxio Product School Webinar - Get Started with Alluxio on KubernetesAlluxio, Inc.
67 visualizações14 slides

Mais de Alluxio, Inc.(20)

Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI... por Alluxio, Inc.
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...
Alluxio Monthly Webinar | Why NFS/NAS on Object Storage May Not Solve Your AI...
Alluxio, Inc.12 visualizações
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca... por Alluxio, Inc.
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...
AI Infra Day | Accelerate Your Model Training and Serving with Distributed Ca...
Alluxio, Inc.19 visualizações
AI Infra Day | The Generative AI Market And Intel AI Strategy and Product Up... por Alluxio, Inc.
AI Infra Day | The Generative AI Market  And Intel AI Strategy and Product Up...AI Infra Day | The Generative AI Market  And Intel AI Strategy and Product Up...
AI Infra Day | The Generative AI Market And Intel AI Strategy and Product Up...
Alluxio, Inc.32 visualizações
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS por Alluxio, Inc.
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWSAlluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS
Alluxio Monthly Webinar | Efficient Data Loading for Model Training on AWS
Alluxio, Inc.74 visualizações
Alluxio Product school Webinar - Distributed Caching for Generative AI por Alluxio, Inc.
Alluxio Product school Webinar - Distributed Caching for Generative AIAlluxio Product school Webinar - Distributed Caching for Generative AI
Alluxio Product school Webinar - Distributed Caching for Generative AI
Alluxio, Inc.115 visualizações
Alluxio Product School Webinar - Get Started with Alluxio on Kubernetes por Alluxio, Inc.
Alluxio Product School Webinar - Get Started with Alluxio on KubernetesAlluxio Product School Webinar - Get Started with Alluxio on Kubernetes
Alluxio Product School Webinar - Get Started with Alluxio on Kubernetes
Alluxio, Inc.67 visualizações
Alluxio Product School Webinar - Boosting Trino Performance. por Alluxio, Inc.
Alluxio Product School Webinar - Boosting Trino Performance.Alluxio Product School Webinar - Boosting Trino Performance.
Alluxio Product School Webinar - Boosting Trino Performance.
Alluxio, Inc.132 visualizações
Alluxio Product School Webinar - Transparent URI por Alluxio, Inc.
Alluxio Product School Webinar - Transparent URIAlluxio Product School Webinar - Transparent URI
Alluxio Product School Webinar - Transparent URI
Alluxio, Inc.73 visualizações
Alluxio 2.9 Release Overview por Alluxio, Inc.
Alluxio 2.9 Release OverviewAlluxio 2.9 Release Overview
Alluxio 2.9 Release Overview
Alluxio, Inc.173 visualizações
Building a Distributed File System for the Cloud-Native Era por Alluxio, Inc.
Building a Distributed File System for the Cloud-Native EraBuilding a Distributed File System for the Cloud-Native Era
Building a Distributed File System for the Cloud-Native Era
Alluxio, Inc.135 visualizações
Zookeeper vs Raft: Stateful distributed coordination with HA and Fault Tolerance por Alluxio, Inc.
Zookeeper vs Raft: Stateful distributed coordination with HA and Fault ToleranceZookeeper vs Raft: Stateful distributed coordination with HA and Fault Tolerance
Zookeeper vs Raft: Stateful distributed coordination with HA and Fault Tolerance
Alluxio, Inc.512 visualizações
Modernize Your Data Platform for Analytics and AI por Alluxio, Inc.
Modernize Your Data Platform for Analytics and AIModernize Your Data Platform for Analytics and AI
Modernize Your Data Platform for Analytics and AI
Alluxio, Inc.149 visualizações
The Power of Data Orchestration: Storage Acceleration and Servitization at Sh... por Alluxio, Inc.
The Power of Data Orchestration: Storage Acceleration and Servitization at Sh...The Power of Data Orchestration: Storage Acceleration and Servitization at Sh...
The Power of Data Orchestration: Storage Acceleration and Servitization at Sh...
Alluxio, Inc.45 visualizações
Apache Hudi: Community-Driven Development por Alluxio, Inc.
Apache Hudi: Community-Driven DevelopmentApache Hudi: Community-Driven Development
Apache Hudi: Community-Driven Development
Alluxio, Inc.43 visualizações
Modern Data Stack in Motion por Alluxio, Inc.
Modern Data Stack in MotionModern Data Stack in Motion
Modern Data Stack in Motion
Alluxio, Inc.233 visualizações
Unified Data API for Distributed Cloud Analytics and AI por Alluxio, Inc.
Unified Data API for Distributed Cloud Analytics and AIUnified Data API for Distributed Cloud Analytics and AI
Unified Data API for Distributed Cloud Analytics and AI
Alluxio, Inc.31 visualizações
Real-Time Analytics: Going Beyond Stream Processing With Apache Pinot por Alluxio, Inc.
Real-Time Analytics: Going Beyond Stream Processing With Apache PinotReal-Time Analytics: Going Beyond Stream Processing With Apache Pinot
Real-Time Analytics: Going Beyond Stream Processing With Apache Pinot
Alluxio, Inc.32 visualizações
ML-Based SQL Query Resource Usage Prediction por Alluxio, Inc.
ML-Based SQL Query Resource Usage PredictionML-Based SQL Query Resource Usage Prediction
ML-Based SQL Query Resource Usage Prediction
Alluxio, Inc.47 visualizações
Accelerating Cloud Training With Alluxio por Alluxio, Inc.
Accelerating Cloud Training With AlluxioAccelerating Cloud Training With Alluxio
Accelerating Cloud Training With Alluxio
Alluxio, Inc.38 visualizações
Deconstructing a Machine Learning Pipeline with Virtual Data Lake por Alluxio, Inc.
Deconstructing a Machine Learning Pipeline with Virtual Data LakeDeconstructing a Machine Learning Pipeline with Virtual Data Lake
Deconstructing a Machine Learning Pipeline with Virtual Data Lake
Alluxio, Inc.82 visualizações

Último

DSD-INT 2023 European Digital Twin Ocean and Delft3D FM - Dols por
DSD-INT 2023 European Digital Twin Ocean and Delft3D FM - DolsDSD-INT 2023 European Digital Twin Ocean and Delft3D FM - Dols
DSD-INT 2023 European Digital Twin Ocean and Delft3D FM - DolsDeltares
7 visualizações23 slides
DevsRank por
DevsRankDevsRank
DevsRankdevsrank786
11 visualizações1 slide
DSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - Afternoon por
DSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - AfternoonDSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - Afternoon
DSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - AfternoonDeltares
15 visualizações43 slides
DSD-INT 2023 Delft3D FM Suite 2024.01 2D3D - New features + Improvements - Ge... por
DSD-INT 2023 Delft3D FM Suite 2024.01 2D3D - New features + Improvements - Ge...DSD-INT 2023 Delft3D FM Suite 2024.01 2D3D - New features + Improvements - Ge...
DSD-INT 2023 Delft3D FM Suite 2024.01 2D3D - New features + Improvements - Ge...Deltares
17 visualizações12 slides
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -... por
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...Deltares
6 visualizações15 slides
SAP FOR TYRE INDUSTRY.pdf por
SAP FOR TYRE INDUSTRY.pdfSAP FOR TYRE INDUSTRY.pdf
SAP FOR TYRE INDUSTRY.pdfVirendra Rai, PMP
24 visualizações3 slides

Último(20)

DSD-INT 2023 European Digital Twin Ocean and Delft3D FM - Dols por Deltares
DSD-INT 2023 European Digital Twin Ocean and Delft3D FM - DolsDSD-INT 2023 European Digital Twin Ocean and Delft3D FM - Dols
DSD-INT 2023 European Digital Twin Ocean and Delft3D FM - Dols
Deltares7 visualizações
DevsRank por devsrank786
DevsRankDevsRank
DevsRank
devsrank78611 visualizações
DSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - Afternoon por Deltares
DSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - AfternoonDSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - Afternoon
DSD-INT 2023 - Delft3D User Days - Welcome - Day 3 - Afternoon
Deltares15 visualizações
DSD-INT 2023 Delft3D FM Suite 2024.01 2D3D - New features + Improvements - Ge... por Deltares
DSD-INT 2023 Delft3D FM Suite 2024.01 2D3D - New features + Improvements - Ge...DSD-INT 2023 Delft3D FM Suite 2024.01 2D3D - New features + Improvements - Ge...
DSD-INT 2023 Delft3D FM Suite 2024.01 2D3D - New features + Improvements - Ge...
Deltares17 visualizações
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -... por Deltares
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...
DSD-INT 2023 Simulating a falling apron in Delft3D 4 - Engineering Practice -...
Deltares6 visualizações
SAP FOR TYRE INDUSTRY.pdf por Virendra Rai, PMP
SAP FOR TYRE INDUSTRY.pdfSAP FOR TYRE INDUSTRY.pdf
SAP FOR TYRE INDUSTRY.pdf
Virendra Rai, PMP24 visualizações
Generic or specific? Making sensible software design decisions por Bert Jan Schrijver
Generic or specific? Making sensible software design decisionsGeneric or specific? Making sensible software design decisions
Generic or specific? Making sensible software design decisions
Bert Jan Schrijver6 visualizações
Advanced API Mocking Techniques por Dimpy Adhikary
Advanced API Mocking TechniquesAdvanced API Mocking Techniques
Advanced API Mocking Techniques
Dimpy Adhikary19 visualizações
Roadmap y Novedades de producto por Neo4j
Roadmap y Novedades de productoRoadmap y Novedades de producto
Roadmap y Novedades de producto
Neo4j50 visualizações
WebAssembly por Jens Siebert
WebAssemblyWebAssembly
WebAssembly
Jens Siebert35 visualizações
MariaDB stored procedures and why they should be improved por Federico Razzoli
MariaDB stored procedures and why they should be improvedMariaDB stored procedures and why they should be improved
MariaDB stored procedures and why they should be improved
Federico Razzoli8 visualizações
Unmasking the Dark Art of Vectored Exception Handling: Bypassing XDR and EDR ... por Donato Onofri
Unmasking the Dark Art of Vectored Exception Handling: Bypassing XDR and EDR ...Unmasking the Dark Art of Vectored Exception Handling: Bypassing XDR and EDR ...
Unmasking the Dark Art of Vectored Exception Handling: Bypassing XDR and EDR ...
Donato Onofri773 visualizações
HarshithAkkapelli_Presentation.pdf por harshithakkapelli
HarshithAkkapelli_Presentation.pdfHarshithAkkapelli_Presentation.pdf
HarshithAkkapelli_Presentation.pdf
harshithakkapelli11 visualizações
Dev-Cloud Conference 2023 - Continuous Deployment Showdown: Traditionelles CI... por Marc Müller
Dev-Cloud Conference 2023 - Continuous Deployment Showdown: Traditionelles CI...Dev-Cloud Conference 2023 - Continuous Deployment Showdown: Traditionelles CI...
Dev-Cloud Conference 2023 - Continuous Deployment Showdown: Traditionelles CI...
Marc Müller37 visualizações
DSD-INT 2023 Delft3D FM Suite 2024.01 1D2D - Beta testing programme - Geertsema por Deltares
DSD-INT 2023 Delft3D FM Suite 2024.01 1D2D - Beta testing programme - GeertsemaDSD-INT 2023 Delft3D FM Suite 2024.01 1D2D - Beta testing programme - Geertsema
DSD-INT 2023 Delft3D FM Suite 2024.01 1D2D - Beta testing programme - Geertsema
Deltares17 visualizações
ict act 1.pptx por sanjaniarun08
ict act 1.pptxict act 1.pptx
ict act 1.pptx
sanjaniarun0813 visualizações
DSD-INT 2023 Leveraging the results of a 3D hydrodynamic model to improve the... por Deltares
DSD-INT 2023 Leveraging the results of a 3D hydrodynamic model to improve the...DSD-INT 2023 Leveraging the results of a 3D hydrodynamic model to improve the...
DSD-INT 2023 Leveraging the results of a 3D hydrodynamic model to improve the...
Deltares6 visualizações
DSD-INT 2023 FloodAdapt - A decision-support tool for compound flood risk mit... por Deltares
DSD-INT 2023 FloodAdapt - A decision-support tool for compound flood risk mit...DSD-INT 2023 FloodAdapt - A decision-support tool for compound flood risk mit...
DSD-INT 2023 FloodAdapt - A decision-support tool for compound flood risk mit...
Deltares13 visualizações
DSD-INT 2023 Simulation of Coastal Hydrodynamics and Water Quality in Hong Ko... por Deltares
DSD-INT 2023 Simulation of Coastal Hydrodynamics and Water Quality in Hong Ko...DSD-INT 2023 Simulation of Coastal Hydrodynamics and Water Quality in Hong Ko...
DSD-INT 2023 Simulation of Coastal Hydrodynamics and Water Quality in Hong Ko...
Deltares12 visualizações
SAP FOR CONTRACT MANUFACTURING.pdf por Virendra Rai, PMP
SAP FOR CONTRACT MANUFACTURING.pdfSAP FOR CONTRACT MANUFACTURING.pdf
SAP FOR CONTRACT MANUFACTURING.pdf
Virendra Rai, PMP11 visualizações

Alluxio + Eckerson Webinar | Simplifying and Accelerating Data Access for AI/ML Model Training

  • 1. © Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com Simplifying and Accelerating Data Access for AI/ML Model Training Kevin Petrie Vice President of Research Sridhar Venkatesh SVP of Product
  • 2. © Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com The Rise of Generative AI After lots of training, language models generate strings of words that become logical sentences and paragraphs • A neural network whose nodes share inputs and outputs • Learns, summarizes, and generates content • Creates textual answers to natural language questions Source: The Economist WHAT IS A LANGUAGE MODEL?
  • 3. © Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com The Risk: Robots Get Things Wrong Too Data teams must inspect, validate, and govern language model outputs RISKS OF LANGUAGE MODELS DATA QUALITY Inaccuracies due to inaccurate/insufficient inputs, lack of context EXPLAINABILITY Vague/unknown sources or reasoning PRIVACY Exposure or theft due to user tracking INTELLECTUAL PROPERTY Liability for mishandled trademarks, copyrights, etc. FAIRNESS Perpetuation of bias in training data
  • 4. © Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com Time to Get Domain Specific Domain-specific, “small” language models reduce risk and boost productivity by providing more governed and specialized outputs • Enriched, detailed user prompts • Fine-tuned training on enterprise data • Augmented outputs; e.g., from multiple models Small Language Model (SLM) Large Language Model (LLM) More Governed Generic Specialty Less Governed ENTER THE SMALL LANGUAGE MODEL
  • 5. © Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com Small Language Models Will Drive the GenAI Boom 30% of data practitioners are building or training their own language models now. 20% more plan to do so* *Source: Active LinkedIn survey of 55 respondents to date “We believe in a world where everyone is empowered to build and train their own models, imbued with their own opinions and viewpoints.” - Naveen Rao, Co-Founder and CEO, MosaicML
  • 6. © Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com “We hold these truths to be self-evident…” TEXT TOKENS We hold these … VECTORS truths to [.45, 6.3, .99] [7.6, .04, 19] [84, .13, 1.6] VECTOR DB [.45, 6.3, .99] [7.6, .04, 19] [84, .13, 1.6] LANGUAGE MODEL QUERY ONE QUERY TWO 1 2 3 4 5 Data teams must design and build new pipelines to feed their domain-specific data into language models Data Processing for Language Models Assemble unstructured text from various files Convert words and punctuation marks to tokens Use embeddings to convert tokens into numerical vectors that describe their semantics Load, organize, and index these vectors in a vector database Use a language model to search and query the vectors while responding to real-time user prompts NEW DATA PIPELINE
  • 7. © Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com ON PREMISES | HYBRID | CLOUD | MULTI CLOUD GENERATIVE AI EMBEDDED CATALOG GOVERN OBSERVE INTEGRATE MASTER STRUCTURED DATA (DB TABLES) SEMI STRUCTURED (LOGS, CLICKSTREAMS, SENSORS…) UNSTRUCTURED (TEXT, IMAGES…) CATALOG INTEGRATE MASTER ANALYTICS OPERATIONS As companies embed generative AI into their workflows, they must manage and process multi-structured data in a more holistic and efficient manner The New Generative AI Data Stack
  • 8. © Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com AI/ML Initiatives Need Fast and Simple Data Access AI/ML initiatives require companies to balance, optimize, and secure workloads across distributed datasets and compute resources • Data access. View and process data wherever it resides • Performance. Retrieve data with low latency/high throughput • Portability. Run applications wherever suitable compute resides • Cost visibility. Monitor and control compute cycles • Multi tenancy. Isolate application compute to safeguard performance • Security. Restrict data access to minimize risk of breaches REQUIREMENTS
  • 9. © Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com The Data Access Layer: Architecture The data access layer continuously adjusts workloads, storage, and compute • Namespace. Unified interface for all data access • APIs. Dynamic communication between applications and storage • Caching. Tier data by priority: memory, SSDs, object store • Metadata. Centralize descriptions of data objects and resources • Security. Authenticate users, authorize access, log actions
  • 10. © Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com Use Cases AI initiatives have a range of use cases that require fast and simple data access DATA CENTER CLOUD 1 CLOUD 2 ANALYTICS & AI IN A HYBRID ENVIRONMENT ANALYTICS & AI ACROSS CLOUDS WORKLOAD BURSTS PROJECT EXPANSIONS MIGRATIONS COST OPTIMIZATION
  • 11. © Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com Guiding Principles Data leaders should evaluate where and how to implement a data access layer to support generative AI initiatives FIND THE BOTTLENECK DECIDE WHETHER TO BUILD OR BUY PLAN FOR GROWTH
  • 12. © Eckerson Group 2021 Twitter: @eckersongroup www.eckerson.com Questions? I’m listening!
  • 13. 13 Retooling the enterprise data infrastructure Legacy data centers can’t keep up High Performance Computing Specialized Hardware Varied Workloads We're seeing incredible orders to retool the world's data centers… a 10-year transition to basically recycle or reclaim the world's data centers and build it out as accelerated computing. Jensen Huang Nvidia CEO “
  • 14. Challenges as you try to scale 14 GPUs are this year’s toilet paper. Wall Street Journal “ GPUs are scarce GPUs are expensive Low GPU Utilization
  • 15. Business Pressures Complex & Costly Solutions GPUs are scarce GPUs are expensive Low GPU Utilization Faster model development times Increased freshness Higher accuracy and traceability Rapidly growing datasets Extensive data engineering managing data copies Specialized storage Out of control cloud and infra costs 15
  • 16. 16 Alluxio Data Platform High Performance data access, unified global view
  • 17. 1.Faster Time-to-Market 50% Hundreds of thousands of dollars saved annually compared to previous deployment. 2-3X Model Training Performance Cost Reduction, Performance Boost International B2C with a multi-cloud, cross-region AI platform, serving LLMs and training models from object storage. They optimized their AI platform with Alluxio to speed data delivery to training clusters and facilitate faster model deployment in latency sensitive production use cases. Models Deployed in Minutes vs Days Faster model deployment times
  • 18. 2. Higher GPU Utilization “In a cloud environment, where GPU hardware is paid for as a function of time, you need fast, performant, reliable, and cost effective data for your model training pipelines to keep your GPU utilization close to 99%.” 20-30% Average reported GPU utilization based on direct access from remote storage GPU Utilization accessing commodity storage GPU Utilization accessing Alluxio Alluxio serves high throughput data to K8s training workloads. 90 % GPU utilization from Alluxio serving data pulled from object storage. In increase from 50% utilization via s3fs- fuse.
  • 19. 3. Reduction in Personnel Increase in Productivity Pre-Processed Data Data Management Pre- Processed Data Training Clusters Data scientists send requests to AI platform teams. Platform teams set up individual data pipelines. With Alluxio, data scientists just access their data. Alluxio consolidates many pipelines into an access layer. Pipeline or Scheduler Training Clusters
  • 20. 20 4. Reduction in Infrastructure Spend Alluxio optimizes data platforms to increase efficiency Data Engineering Pipelines Data workflows improved by on- demand access from Alluxio cache S3 Egress and API Fees Fees significantly reduced via granular caching and data reuse High Performance Computing Replaceable with low-cost hardware at comparable performance Reduced or Eliminated Network Congestion Network congestion reduced by serving files locally
  • 21. 5. Cloud Vendor Leverage Multi-cloud strategies with cost-effective benefits Respond to Limited GPU Availability Demand for GPUs has exploded Organizations use Alluxio to supply high performance data access to remote GPU clusters wherever they find capacity. Increase Cloud Agility Competing CSPs may provide attractive discounts Alluxio empowers organizations to capitalize on hardware discounts or cost-effective storage in real-time. Users access data wherever it resides. Avoid Vendor Lock-In Negotiate with CSPs from a stronger position Single cloud deployments are convenient, but that may become an obstacle in negotiations. Alluxio facilitates hybrid and multi-cloud.