Marv Wexler - Transform Your with AI.pdf

SOLTUIONSpeople, THINKubators, THINKathons
SOLTUIONSpeople, THINKubators, THINKathonsThinkubator 💡Innovation Experience Designer 💡 Linkedin's #1 Most Connected Innovator in The World em SOLTUIONSpeople, THINKubators, THINKathons
Confidential
Transform Your
Business With AI
Transform Your
Business With AI
AI Summit
Marv Wexler
GM Technical Services
September 21, 2023
AI Summit
Marv Wexler
GM Technical Services
September 21, 2023
Better Faster Greener™ © 2023 Supermicro
Confidential
Where are we on the AI journey ?
9/20/2023 Better Faster Greener™ © 2023 Supermicro
2
“Once a new technology rolls over you, if you're not part of the steamroller, you're
part of the road.” - Stewart Brand
Confidential
9/20/2023 Better Faster Greener™ © 2023 Supermicro
3
Current AI Trends
• Democratization of AI will continue
• AI is a fundamental differentiator for businesses
• Find deeper insights in data, real-time and at scale
-Else your competitors surely will
• Generative AI is becoming commercialized
• AI ethics a top priority
• Biased algorithms, Deep fakes, “Hallucinations” as a
feature
• Generative AI applications reign : Microsoft (Designer),
Adobe (Firefly), Meta (Ad creation)
• New regulations for safe and responsible practices
• EU AI Act: Set of new rules that establish obligations for risks
from artificial intelligence
Confidential
AI Applications
9/20/2023 Better Faster Greener™ © 2023 Supermicro
4
Deep Learning
Solving complex
problems
Computer model taught to
learn actions using images,
texts and sounds
Machine Learning
Machines making
decisions
Building Machines with
predictive algorithm and
create predictive models
Artificial Intelligence
Simulate intelligence
Building Smart Machines
capable of performing
intelligent tasks
Confidential
9/20/2023 Better Faster Greener™ © 2023 Supermicro
5
Text
Image
Audio
Video
Games
Text/ Voice prompt
Generative AI models
(also Large Language
LLM, or Foundational
Models)
User Input
What is Generative AI?
Generative AI models are models that, when receiving a text prompt, give an output related to
that input. The output can be text, image, audio, video, code etc.
The ability for generative AI to produce useful, impressively synthesized text, images, and other types of content
almost effortlessly based on a few text cues has already become an important business capability worthy of
providing immense value to most knowledge workers
Confidential
The far-reaching impacts of Generative AI
9/20/2023 Better Faster Greener™ © 2023 Supermicro
6
Around 75% of the technology's value will be seen across four areas:
• customer operations
• marketing and sales
• software engineering
• research and development
automating conversations with customers
creating personalized messages for customers
generating code
generative design
Confidential
Customizable AI infrastructure for Generative AI
9/20/2023 Better Faster Greener™ © 2023 Supermicro
7
Training
•compute intensive
•massive datasets
involved
Fine-Tuning
•Requires relatively less
computational power
Inferencing
•Accelerators may be
needed depending on
type of application
(batch/real-time)
Various stages in building a
Generative AI Application
At Supermicro, We have you covered all the way with affordable, customizable
and scalable solutions
Confidential
Application Details
9/20/2023 Better Faster Greener™ © 2023 Supermicro
8
LangChain Instructor Embeddings WizardLM / LLAMA
• Ask questions to your documents AND learn from your documents using the
power of LLMs.
• 100% private, no data leaves your execution environment at any point.
• You can ingest documents and ask questions without an internet connection!
localGPT
BUILT WITH
• Text pre processed
into chunks
• Embedded in a
vector space
• Query search for
similar chunks
An instruction-finetuned text
embedding model that can generate
text embeddings tailored to any
task by simply providing the task
instruction, without any finetuning.
Instructor achieves SOTA on 70
diverse embedding tasks!
(e.g., classification, retrieval,
clustering, text evaluation, etc.) and
domains (e.g., science, finance, etc.)
• WizardLM is a Llama variant
trained with
complex instructions
• Evol-Instruct which
leverages AI to
"evolve" instructions
Confidential
Application Details
9/20/2023 Better Faster Greener™ © 2023 Supermicro
9
Ingest.py
• uses LangChain tools to parse the document and create
embeddings locally using Instructor Embeddings
Chroma
vector store
• local vector database that stores the created
embeddings
Run_localGPT • uses local LLM to understand questions and create
answers.
Similarity
Search
• used to extract right piece of context
from the local vector store
Confidential
10
©2023 Supermicro
Large Scale AI Training
• Key Technologies
• NVIDIA HGX H100 SXM 8-GPU/4-GPU with 900GB/s NVLink interconnect
• Dedicated, lots of high performance, high bandwidth GPU memory - HBM3, HBM2e
• 400GbE networking (Ethernet or InfiniBand), PCIe 5.0 storage for fast AI data pipe
• NVIDIA GPUDirect RDMA and Storage to keep feeding data to GPUs with minimum latency
• Liquid cooling for GPUs and CPUs
• All-flash storage and file systems to support petabytes of hot-tier data cache
• NVIDIA HGX H100 SXM5
board with 4- GPU or 8-
GPU
• NVLink and NVSwitch
• 80GB HBM3 per GPU
• Up to 700W TDP
• NVIDIA ConnectX-7
• Up to 400GbE or 400G NDR InfiniBand
• x16/x32 PCIe 5.0
Confidential
Supermicro AI
Experience
Supermicro AI
Experience
Marv Wexler
August 2023
Marv Wexler
August 2023
Better Faster Greener™ © 2023 Supermicro
Confidential
9/20/2023 Better Faster Greener™ © 2023 Supermicro
12
Confidential
Evolving to an AI / Total IT Solutions Partner
9/20/2023 Better Faster Greener™ © 2022 Supermicro
13
 5S: Software, Services,
Switch, Storage, Security
and more
 Total Solutions: Enterprise,
OEM- Appliance / Cloud
 Complete Systems
 Sub-systems and
Components
~5X+ Faster growth rate than the
industry avg rate over the past 12+
months (~50% YoY)
~5X+ Faster growth rate than the
industry avg rate over the past 12+
months (~50% YoY)
Our Momentum:
SMCI 1.0
Components &
Subsystems
SMCI 2.0
Servers &
Storage Systems
SMCI 3.0
Total IT
Solutions
Today
1993
$5B
$10B
Confidential
SMCI AI Strategy
9/20/2023 Better Faster Greener™ © 2023 Supermicro
14
• Partner with the Leaders
• Provide the best picks and shovels for the gold miners (Apps, YOU)
• Do not be religious with Products Offerings (multi-vendor, multi-platform)
Confidential
SMCI AI Business Results
9/20/2023 Better Faster Greener™ © 2023 Supermicro
15
• Bring up platform partner for virtually all AI Solutions / GPU offerings
• Lead supplier for virtually all Large Language Model Cloud Deployments
(ChatGPT, BARD, Bing, etc.)
The Next Platform, August 16, 2023
Confidential
16
©2023 Supermicro
GPU Optimized Systems by Workloads
• Large Scale AI Training • HPC/AI Workloads
H100 PCIe
Grace Hopper Superchip (Grace
CPU + H100 GPU)
H100 NVL
HGX H100 SXM
8-GPU or 4-GPU
4U 4-GPU System (HGX H100 SXM)
(codenamed: Redstone-Next)
SYS-421GU-TNXR, SYS-521GU-TNXR
8U 8-GPU System (HGX H100 SXM)
(codenamed: Delta-Next)
SYS-821GE-TNHR, AS -8125GS-TNHR
4U 4-GPU System (HGX H100 SXM)
SYS-421GU-TNXR
4U/5U 8-10 GPU System
SYS-521GE-TNRT, SYS-421GE-TNRT/TNRT3
AS -4125GS-TNRT/TNRT1/TNRT2
1U Grace Hopper MGX System
SYS-421GU-TNXR / SYS-521GU-TNXR
8U SuperBlade (Up to 20 nodes)
SBI-411E-1G / SBI-411E-5G
Petabyte Scale All-Flash Storage
SSG-121E-NE316R, ASG-1115S-NE316R
Confidential
Scales to thousands of nodes in 32-node increments
(SRS-42UHPC-32SU-01)
Accelerate AI Development by Supermicro
Supermicro 8U Delta-Next (SYS-821GE-TNHR)
A Proven Platform, Purpose Built for AI
H100 SXM5 GPU ConnectX-7 SmartNICs
H100 Rack Scale SuperPod Scalable Unit
8x NVIDIA H100 SXM5 GPUs | 640GB HBM3 GPU Memory 2TB
System Memory | 3.2Tbps Network B/W | Superior I/O
32x HGX H100 | 1+ EFLOPS AI | 20TB HBM3 GPU Memory 102.4Tbps
Network B/W Non-blocking | InfiniBand NDR
Software: NVIDIA BCM | NGC | NVAIE | SLURM | Kubernetes
Full Turnkey AI Supercomputer for Enterprises
9/20/2023 Better Faster Greener™ © 2023 Supermicro
17
Confidential
Supermicro Rack Integration Services
• Full rack integration up to L11 and L12
• Broad portfolio of compute, power, cooling
and networking options
• Liquid cooling integration
• Cooling Distribution Unit (CDU)
• Direct to Chip cold plate
• Manifold and tubing
• Design, assembly, configuration, testing
and deployment
• Start running applications from Day 1
Confidential
Supermicro CDU
80kW to 120kW, 45°C Warm Water
Liquid Cooling Option for Rack Scale H100 SuperPods
9/20/2023 Better Faster Greener™ © 2023 Supermicro
19
Confidential
Onsite Rack Services
9/20/2023 Better Faster Greener™ © 2023 Supermicro
20
Simplifying Your Solution Deployment Needs
• White glove custom service from beginning to end
• Onsite rack & stack of the custom solution
• Onsite integration ensuring proper installation and
connectivity, providing for reliable operation and reduced
downtime
• Onsite software installation with application configurations
• Onsite benchmark testing ensuring solution meets the
requirements of the customer
• Delivery of a customized rack solution that meets all
requirements
• SMC Cooling tower product line is available to enable
facility level water connections for CDU/CDM/RDHX
Reliable – Repeatable – Reproducible
Confidential
DISCLAIMER
Super Micro Computer, Inc. may make changes to specifications and product descriptions at any time, without notice. The
information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions
and typographical errors. Any performance tests and ratings are measured using systems that reflect the approximate
performance of Super Micro Computer, Inc. products as measured by those tests. Any differences in software or hardware
configuration may affect actual performance, and Super Micro Computer, Inc. does not control the design or implementation of
third party benchmarks or websites referenced in this document. The information contained herein is subject to change and may
be rendered inaccurate for many reasons, including but not limited to any changes in product and/or roadmap, component and
hardware revision changes, new model and/or product releases, software changes, firmware changes, or the like. Super Micro
Computer, Inc. assumes no obligation to update or otherwise correct or revise this information.
SUPER MICRO COMPUTER, INC. MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE
CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT
MAY APPEAR IN THIS INFORMATION.
SUPER MICRO COMPUTER, INC. SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR
FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL SUPER MICRO COMPUTER, INC. BE LIABLE TO ANY
PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF
ANY INFORMATION CONTAINED HEREIN, EVEN IF SUPER MICRO COMPUTER, Inc. IS EXPRESSLY ADVISED OF THE
POSSIBILITY OF SUCH DAMAGES.
ATTRIBUTION
© 2023 Super Micro Computer, Inc. All rights reserved.
9/20/2023 Better Faster Greener™ © 2023 Supermicro
21
Confidential
www.supermicro.com
1 de 22

Recomendados

Steve Cunningham - AI Innovation Summit.pdf por
Steve Cunningham - AI Innovation Summit.pdfSteve Cunningham - AI Innovation Summit.pdf
Steve Cunningham - AI Innovation Summit.pdfSOLTUIONSpeople, THINKubators, THINKathons
243 visualizações20 slides
James Feldman - AII Powered Business Tools.pdf por
James Feldman - AII Powered Business Tools.pdfJames Feldman - AII Powered Business Tools.pdf
James Feldman - AII Powered Business Tools.pdfSOLTUIONSpeople, THINKubators, THINKathons
251 visualizações19 slides
Theresa Fesinstine - AI Forward.pdf por
Theresa Fesinstine - AI Forward.pdfTheresa Fesinstine - AI Forward.pdf
Theresa Fesinstine - AI Forward.pdfSOLTUIONSpeople, THINKubators, THINKathons
344 visualizações20 slides
Neha Shukla - Future of the AI Revolution - Building Ethical and Equitable T... por
 Neha Shukla - Future of the AI Revolution - Building Ethical and Equitable T... Neha Shukla - Future of the AI Revolution - Building Ethical and Equitable T...
Neha Shukla - Future of the AI Revolution - Building Ethical and Equitable T...SOLTUIONSpeople, THINKubators, THINKathons
338 visualizações16 slides
Andy Roy - Conversational AI - Why We Must Build.pdf por
Andy Roy - Conversational AI - Why We Must Build.pdfAndy Roy - Conversational AI - Why We Must Build.pdf
Andy Roy - Conversational AI - Why We Must Build.pdfSOLTUIONSpeople, THINKubators, THINKathons
225 visualizações20 slides
Nils Vesk - Building an Innovative, Productive, AI empowered Culture.pdf por
Nils Vesk - Building an Innovative, Productive, AI empowered Culture.pdfNils Vesk - Building an Innovative, Productive, AI empowered Culture.pdf
Nils Vesk - Building an Innovative, Productive, AI empowered Culture.pdfSOLTUIONSpeople, THINKubators, THINKathons
404 visualizações20 slides

Mais conteúdo relacionado

Mais procurados

Ben Bressington - Buy Back Your Time and Increase Profits - 1 AI Strategy You... por
Ben Bressington - Buy Back Your Time and Increase Profits - 1 AI Strategy You...Ben Bressington - Buy Back Your Time and Increase Profits - 1 AI Strategy You...
Ben Bressington - Buy Back Your Time and Increase Profits - 1 AI Strategy You...SOLTUIONSpeople, THINKubators, THINKathons
298 visualizações20 slides
Terry Proto - AI Accelerates XR.pdf por
Terry Proto - AI Accelerates XR.pdfTerry Proto - AI Accelerates XR.pdf
Terry Proto - AI Accelerates XR.pdfSOLTUIONSpeople, THINKubators, THINKathons
285 visualizações19 slides
Josh Cavalier - ChatGPT Prompt Strategies.pdf por
Josh Cavalier - ChatGPT Prompt Strategies.pdfJosh Cavalier - ChatGPT Prompt Strategies.pdf
Josh Cavalier - ChatGPT Prompt Strategies.pdfSOLTUIONSpeople, THINKubators, THINKathons
555 visualizações20 slides
Matt Lewis - The Hardest Thing-Final to Host.pdf por
Matt Lewis - The Hardest Thing-Final to Host.pdfMatt Lewis - The Hardest Thing-Final to Host.pdf
Matt Lewis - The Hardest Thing-Final to Host.pdfSOLTUIONSpeople, THINKubators, THINKathons
396 visualizações20 slides
Maisa Penha - Art of Possible.pdf por
Maisa Penha - Art of Possible.pdfMaisa Penha - Art of Possible.pdf
Maisa Penha - Art of Possible.pdfSOLTUIONSpeople, THINKubators, THINKathons
623 visualizações20 slides
Jordan Wilson - Expert Chats Train ChatGPT to be your employee with the PPP m... por
Jordan Wilson - Expert Chats Train ChatGPT to be your employee with the PPP m...Jordan Wilson - Expert Chats Train ChatGPT to be your employee with the PPP m...
Jordan Wilson - Expert Chats Train ChatGPT to be your employee with the PPP m...SOLTUIONSpeople, THINKubators, THINKathons
636 visualizações20 slides

Mais procurados(20)

Leveraging Generative AI & Best practices por DianaGray10
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
DianaGray101.7K visualizações
AI FOR BUSINESS LEADERS por Andre Muscat
AI FOR BUSINESS LEADERSAI FOR BUSINESS LEADERS
AI FOR BUSINESS LEADERS
Andre Muscat928 visualizações
Introduction to AI with Business Use Cases por Jack C Crawford
Introduction to AI with Business Use CasesIntroduction to AI with Business Use Cases
Introduction to AI with Business Use Cases
Jack C Crawford1.4K visualizações

Similar a Marv Wexler - Transform Your with AI.pdf

Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable por
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super AffordableSupermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super AffordableRebekah Rodriguez
185 visualizações61 slides
Design - Changing Perceptions of Infrastructure as a Service por
Design - Changing Perceptions of Infrastructure as a ServiceDesign - Changing Perceptions of Infrastructure as a Service
Design - Changing Perceptions of Infrastructure as a ServiceLaurenWendler
111 visualizações13 slides
Accelerating Innovation from Edge to Cloud por
Accelerating Innovation from Edge to CloudAccelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to CloudRebekah Rodriguez
283 visualizações31 slides
SUPERMICRO Innovative Computing Architecture por
SUPERMICRO Innovative Computing ArchitectureSUPERMICRO Innovative Computing Architecture
SUPERMICRO Innovative Computing ArchitectureIntel IT Center
1K visualizações17 slides
How Cloud Providers are Playing with Traditional Data Center por
How Cloud Providers are Playing with Traditional Data CenterHow Cloud Providers are Playing with Traditional Data Center
How Cloud Providers are Playing with Traditional Data CenterHostway|HOSTING
442 visualizações34 slides
Cimteq CableBuilder Go por
Cimteq CableBuilder GoCimteq CableBuilder Go
Cimteq CableBuilder GoCimteq
144 visualizações16 slides

Similar a Marv Wexler - Transform Your with AI.pdf(20)

Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable por Rebekah Rodriguez
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super AffordableSupermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Rebekah Rodriguez185 visualizações
Design - Changing Perceptions of Infrastructure as a Service por LaurenWendler
Design - Changing Perceptions of Infrastructure as a ServiceDesign - Changing Perceptions of Infrastructure as a Service
Design - Changing Perceptions of Infrastructure as a Service
LaurenWendler111 visualizações
Accelerating Innovation from Edge to Cloud por Rebekah Rodriguez
Accelerating Innovation from Edge to CloudAccelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to Cloud
Rebekah Rodriguez283 visualizações
SUPERMICRO Innovative Computing Architecture por Intel IT Center
SUPERMICRO Innovative Computing ArchitectureSUPERMICRO Innovative Computing Architecture
SUPERMICRO Innovative Computing Architecture
Intel IT Center1K visualizações
How Cloud Providers are Playing with Traditional Data Center por Hostway|HOSTING
How Cloud Providers are Playing with Traditional Data CenterHow Cloud Providers are Playing with Traditional Data Center
How Cloud Providers are Playing with Traditional Data Center
Hostway|HOSTING442 visualizações
Cimteq CableBuilder Go por Cimteq
Cimteq CableBuilder GoCimteq CableBuilder Go
Cimteq CableBuilder Go
Cimteq144 visualizações
Webinar: Microprocessadores 32 bits, suas principais aplicações no mercado br... por Embarcados
Webinar: Microprocessadores 32 bits, suas principais aplicações no mercado br...Webinar: Microprocessadores 32 bits, suas principais aplicações no mercado br...
Webinar: Microprocessadores 32 bits, suas principais aplicações no mercado br...
Embarcados115 visualizações
Cisco connect montreal 2018 compute v final por Cisco Canada
Cisco connect montreal 2018   compute v finalCisco connect montreal 2018   compute v final
Cisco connect montreal 2018 compute v final
Cisco Canada1.6K visualizações
Building Efficient Edge Nodes for Content Delivery Networks por Rebekah Rodriguez
Building Efficient Edge Nodes for Content Delivery NetworksBuilding Efficient Edge Nodes for Content Delivery Networks
Building Efficient Edge Nodes for Content Delivery Networks
Rebekah Rodriguez107 visualizações
New high-density storage server - IBM System x3650 M4 HD por Cliff Kinard
New high-density storage server - IBM System x3650 M4 HDNew high-density storage server - IBM System x3650 M4 HD
New high-density storage server - IBM System x3650 M4 HD
Cliff Kinard4.3K visualizações
IBM SoftLayer - overview of Cloud Infrastructure por Avinaba Basu
IBM SoftLayer - overview of Cloud Infrastructure IBM SoftLayer - overview of Cloud Infrastructure
IBM SoftLayer - overview of Cloud Infrastructure
Avinaba Basu2.7K visualizações
What is ThousandEyes Webinar por ThousandEyes
What is ThousandEyes WebinarWhat is ThousandEyes Webinar
What is ThousandEyes Webinar
ThousandEyes62 visualizações
abiquo por guestf5c2fa
abiquoabiquo
abiquo
guestf5c2fa299 visualizações
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck por IBM Events
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & WieckIBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM Events4.3K visualizações
Adding Recurring Revenue with Cloud Computing ProfitBricks por ProfitBricks
Adding Recurring Revenue with Cloud Computing ProfitBricksAdding Recurring Revenue with Cloud Computing ProfitBricks
Adding Recurring Revenue with Cloud Computing ProfitBricks
ProfitBricks529 visualizações
Cloud computing case studies with ProfitBricks IaaS por ProfitBricks
Cloud computing case studies with ProfitBricks IaaSCloud computing case studies with ProfitBricks IaaS
Cloud computing case studies with ProfitBricks IaaS
ProfitBricks1.1K visualizações
ProfitBricks Cloud Computing IaaS An Introduction por ProfitBricks
ProfitBricks Cloud Computing IaaS An IntroductionProfitBricks Cloud Computing IaaS An Introduction
ProfitBricks Cloud Computing IaaS An Introduction
ProfitBricks1.1K visualizações
TechWiseTV Workshop: ASR 9000 por Robb Boyd
TechWiseTV Workshop: ASR 9000 TechWiseTV Workshop: ASR 9000
TechWiseTV Workshop: ASR 9000
Robb Boyd735 visualizações
InSource 2017 Roadshow: Analyzing Data por InSource Solutions
InSource 2017 Roadshow: Analyzing DataInSource 2017 Roadshow: Analyzing Data
InSource 2017 Roadshow: Analyzing Data
InSource Solutions1.3K visualizações
Softlayer an IBM Compay . Connaissez vous le cloud de l'avenir por Patrick Bouillaud
Softlayer an IBM Compay . Connaissez vous le cloud de l'avenir Softlayer an IBM Compay . Connaissez vous le cloud de l'avenir
Softlayer an IBM Compay . Connaissez vous le cloud de l'avenir
Patrick Bouillaud4.1K visualizações

Mais de SOLTUIONSpeople, THINKubators, THINKathons

George Boretos & FutureUP-AI the big picture.pdf por
George Boretos & FutureUP-AI the big picture.pdfGeorge Boretos & FutureUP-AI the big picture.pdf
George Boretos & FutureUP-AI the big picture.pdfSOLTUIONSpeople, THINKubators, THINKathons
397 visualizações20 slides
Kai Wang - AI for Innovation1.1r.pdf por
Kai Wang - AI for Innovation1.1r.pdfKai Wang - AI for Innovation1.1r.pdf
Kai Wang - AI for Innovation1.1r.pdfSOLTUIONSpeople, THINKubators, THINKathons
284 visualizações20 slides
Lars Tvede - How We Boosted Total Productivity in Our Company by 20x w.pdf por
Lars Tvede - How We Boosted Total Productivity in Our Company by 20x w.pdfLars Tvede - How We Boosted Total Productivity in Our Company by 20x w.pdf
Lars Tvede - How We Boosted Total Productivity in Our Company by 20x w.pdfSOLTUIONSpeople, THINKubators, THINKathons
227 visualizações28 slides
Kelly Dowd - Leading Digital Transformation with AI and Human-Centered Design... por
Kelly Dowd - Leading Digital Transformation with AI and Human-Centered Design...Kelly Dowd - Leading Digital Transformation with AI and Human-Centered Design...
Kelly Dowd - Leading Digital Transformation with AI and Human-Centered Design...SOLTUIONSpeople, THINKubators, THINKathons
203 visualizações10 slides
arbar https://www.slideshare.net/Solutionman/charles-caldwell-improve-your-li... por
arbar https://www.slideshare.net/Solutionman/charles-caldwell-improve-your-li...arbar https://www.slideshare.net/Solutionman/charles-caldwell-improve-your-li...
arbar https://www.slideshare.net/Solutionman/charles-caldwell-improve-your-li...SOLTUIONSpeople, THINKubators, THINKathons
307 visualizações21 slides
George Pace - Keeping Pace with ChatGPT.pdf por
George Pace - Keeping Pace with ChatGPT.pdfGeorge Pace - Keeping Pace with ChatGPT.pdf
George Pace - Keeping Pace with ChatGPT.pdfSOLTUIONSpeople, THINKubators, THINKathons
560 visualizações20 slides

Último

Integrating Talent Management Practices por
Integrating Talent Management PracticesIntegrating Talent Management Practices
Integrating Talent Management PracticesSeta Wicaksana
134 visualizações29 slides
Basic of Air Ticketing & IATA Geography por
Basic of Air Ticketing & IATA GeographyBasic of Air Ticketing & IATA Geography
Basic of Air Ticketing & IATA GeographyMd Shaifullar Rabbi
67 visualizações27 slides
Business Process Reengineering (BPR) por
Business Process Reengineering (BPR)Business Process Reengineering (BPR)
Business Process Reengineering (BPR)Operational Excellence Consulting (Singapore)
24 visualizações30 slides
Accel_Series_2023Autumn_En.pptx por
Accel_Series_2023Autumn_En.pptxAccel_Series_2023Autumn_En.pptx
Accel_Series_2023Autumn_En.pptxNTTDATA INTRAMART
203 visualizações75 slides
terms_2.pdf por
terms_2.pdfterms_2.pdf
terms_2.pdfJAWADIQBAL40
18 visualizações8 slides
Presentation on proposed acquisition of leading European asset manager Aermon... por
Presentation on proposed acquisition of leading European asset manager Aermon...Presentation on proposed acquisition of leading European asset manager Aermon...
Presentation on proposed acquisition of leading European asset manager Aermon...KeppelCorporation
240 visualizações11 slides

Último(20)

Integrating Talent Management Practices por Seta Wicaksana
Integrating Talent Management PracticesIntegrating Talent Management Practices
Integrating Talent Management Practices
Seta Wicaksana134 visualizações
Basic of Air Ticketing & IATA Geography por Md Shaifullar Rabbi
Basic of Air Ticketing & IATA GeographyBasic of Air Ticketing & IATA Geography
Basic of Air Ticketing & IATA Geography
Md Shaifullar Rabbi 67 visualizações
Accel_Series_2023Autumn_En.pptx por NTTDATA INTRAMART
Accel_Series_2023Autumn_En.pptxAccel_Series_2023Autumn_En.pptx
Accel_Series_2023Autumn_En.pptx
NTTDATA INTRAMART203 visualizações
terms_2.pdf por JAWADIQBAL40
terms_2.pdfterms_2.pdf
terms_2.pdf
JAWADIQBAL4018 visualizações
Presentation on proposed acquisition of leading European asset manager Aermon... por KeppelCorporation
Presentation on proposed acquisition of leading European asset manager Aermon...Presentation on proposed acquisition of leading European asset manager Aermon...
Presentation on proposed acquisition of leading European asset manager Aermon...
KeppelCorporation240 visualizações
bookmyshow-1.pptx por 125071035
bookmyshow-1.pptxbookmyshow-1.pptx
bookmyshow-1.pptx
12507103515 visualizações
December 2023 - Meat on the Bones por NZSG
December 2023 - Meat on the BonesDecember 2023 - Meat on the Bones
December 2023 - Meat on the Bones
NZSG24 visualizações
PMU Launch - Guaranteed Slides por pmulaunch
PMU Launch - Guaranteed SlidesPMU Launch - Guaranteed Slides
PMU Launch - Guaranteed Slides
pmulaunch16 visualizações
Building Careers at Specialty TRE 2023 por Jennifer Sanborn
Building Careers at Specialty TRE 2023Building Careers at Specialty TRE 2023
Building Careers at Specialty TRE 2023
Jennifer Sanborn50 visualizações
Navigating EUDR Compliance within the Coffee Industry por Peter Horsten
Navigating EUDR Compliance within the Coffee IndustryNavigating EUDR Compliance within the Coffee Industry
Navigating EUDR Compliance within the Coffee Industry
Peter Horsten44 visualizações
Amazing Opportunities: PCD Pharma Franchise in Kerala.pptx por SaphnixMedicure1
Amazing Opportunities: PCD Pharma Franchise in Kerala.pptxAmazing Opportunities: PCD Pharma Franchise in Kerala.pptx
Amazing Opportunities: PCD Pharma Franchise in Kerala.pptx
SaphnixMedicure120 visualizações
Imports Next Level.pdf por Bloomerang
Imports Next Level.pdfImports Next Level.pdf
Imports Next Level.pdf
Bloomerang120 visualizações
Coomes Consulting Business Profile por Chris Coomes
Coomes Consulting Business ProfileCoomes Consulting Business Profile
Coomes Consulting Business Profile
Chris Coomes52 visualizações
Pitch Deck Teardown: Scalestack's $1M AI sales tech Seed deck por HajeJanKamps
Pitch Deck Teardown: Scalestack's $1M AI sales tech Seed deckPitch Deck Teardown: Scalestack's $1M AI sales tech Seed deck
Pitch Deck Teardown: Scalestack's $1M AI sales tech Seed deck
HajeJanKamps597 visualizações
Bloomerang_Forecasting Your Fundraising Revenue 2024.pptx.pdf por Bloomerang
Bloomerang_Forecasting Your Fundraising Revenue 2024.pptx.pdfBloomerang_Forecasting Your Fundraising Revenue 2024.pptx.pdf
Bloomerang_Forecasting Your Fundraising Revenue 2024.pptx.pdf
Bloomerang146 visualizações
Bloomerang Thank Yous Dec 2023.pdf por Bloomerang
Bloomerang Thank Yous Dec 2023.pdfBloomerang Thank Yous Dec 2023.pdf
Bloomerang Thank Yous Dec 2023.pdf
Bloomerang123 visualizações
On the Concept of Discovery Power of Enterprise Modeling Languages and its Re... por Ilia Bider
On the Concept of Discovery Power of Enterprise Modeling Languages and its Re...On the Concept of Discovery Power of Enterprise Modeling Languages and its Re...
On the Concept of Discovery Power of Enterprise Modeling Languages and its Re...
Ilia Bider15 visualizações

Marv Wexler - Transform Your with AI.pdf

  • 1. Confidential Transform Your Business With AI Transform Your Business With AI AI Summit Marv Wexler GM Technical Services September 21, 2023 AI Summit Marv Wexler GM Technical Services September 21, 2023 Better Faster Greener™ © 2023 Supermicro
  • 2. Confidential Where are we on the AI journey ? 9/20/2023 Better Faster Greener™ © 2023 Supermicro 2 “Once a new technology rolls over you, if you're not part of the steamroller, you're part of the road.” - Stewart Brand
  • 3. Confidential 9/20/2023 Better Faster Greener™ © 2023 Supermicro 3 Current AI Trends • Democratization of AI will continue • AI is a fundamental differentiator for businesses • Find deeper insights in data, real-time and at scale -Else your competitors surely will • Generative AI is becoming commercialized • AI ethics a top priority • Biased algorithms, Deep fakes, “Hallucinations” as a feature • Generative AI applications reign : Microsoft (Designer), Adobe (Firefly), Meta (Ad creation) • New regulations for safe and responsible practices • EU AI Act: Set of new rules that establish obligations for risks from artificial intelligence
  • 4. Confidential AI Applications 9/20/2023 Better Faster Greener™ © 2023 Supermicro 4 Deep Learning Solving complex problems Computer model taught to learn actions using images, texts and sounds Machine Learning Machines making decisions Building Machines with predictive algorithm and create predictive models Artificial Intelligence Simulate intelligence Building Smart Machines capable of performing intelligent tasks
  • 5. Confidential 9/20/2023 Better Faster Greener™ © 2023 Supermicro 5 Text Image Audio Video Games Text/ Voice prompt Generative AI models (also Large Language LLM, or Foundational Models) User Input What is Generative AI? Generative AI models are models that, when receiving a text prompt, give an output related to that input. The output can be text, image, audio, video, code etc. The ability for generative AI to produce useful, impressively synthesized text, images, and other types of content almost effortlessly based on a few text cues has already become an important business capability worthy of providing immense value to most knowledge workers
  • 6. Confidential The far-reaching impacts of Generative AI 9/20/2023 Better Faster Greener™ © 2023 Supermicro 6 Around 75% of the technology's value will be seen across four areas: • customer operations • marketing and sales • software engineering • research and development automating conversations with customers creating personalized messages for customers generating code generative design
  • 7. Confidential Customizable AI infrastructure for Generative AI 9/20/2023 Better Faster Greener™ © 2023 Supermicro 7 Training •compute intensive •massive datasets involved Fine-Tuning •Requires relatively less computational power Inferencing •Accelerators may be needed depending on type of application (batch/real-time) Various stages in building a Generative AI Application At Supermicro, We have you covered all the way with affordable, customizable and scalable solutions
  • 8. Confidential Application Details 9/20/2023 Better Faster Greener™ © 2023 Supermicro 8 LangChain Instructor Embeddings WizardLM / LLAMA • Ask questions to your documents AND learn from your documents using the power of LLMs. • 100% private, no data leaves your execution environment at any point. • You can ingest documents and ask questions without an internet connection! localGPT BUILT WITH • Text pre processed into chunks • Embedded in a vector space • Query search for similar chunks An instruction-finetuned text embedding model that can generate text embeddings tailored to any task by simply providing the task instruction, without any finetuning. Instructor achieves SOTA on 70 diverse embedding tasks! (e.g., classification, retrieval, clustering, text evaluation, etc.) and domains (e.g., science, finance, etc.) • WizardLM is a Llama variant trained with complex instructions • Evol-Instruct which leverages AI to "evolve" instructions
  • 9. Confidential Application Details 9/20/2023 Better Faster Greener™ © 2023 Supermicro 9 Ingest.py • uses LangChain tools to parse the document and create embeddings locally using Instructor Embeddings Chroma vector store • local vector database that stores the created embeddings Run_localGPT • uses local LLM to understand questions and create answers. Similarity Search • used to extract right piece of context from the local vector store
  • 10. Confidential 10 ©2023 Supermicro Large Scale AI Training • Key Technologies • NVIDIA HGX H100 SXM 8-GPU/4-GPU with 900GB/s NVLink interconnect • Dedicated, lots of high performance, high bandwidth GPU memory - HBM3, HBM2e • 400GbE networking (Ethernet or InfiniBand), PCIe 5.0 storage for fast AI data pipe • NVIDIA GPUDirect RDMA and Storage to keep feeding data to GPUs with minimum latency • Liquid cooling for GPUs and CPUs • All-flash storage and file systems to support petabytes of hot-tier data cache • NVIDIA HGX H100 SXM5 board with 4- GPU or 8- GPU • NVLink and NVSwitch • 80GB HBM3 per GPU • Up to 700W TDP • NVIDIA ConnectX-7 • Up to 400GbE or 400G NDR InfiniBand • x16/x32 PCIe 5.0
  • 11. Confidential Supermicro AI Experience Supermicro AI Experience Marv Wexler August 2023 Marv Wexler August 2023 Better Faster Greener™ © 2023 Supermicro
  • 12. Confidential 9/20/2023 Better Faster Greener™ © 2023 Supermicro 12
  • 13. Confidential Evolving to an AI / Total IT Solutions Partner 9/20/2023 Better Faster Greener™ © 2022 Supermicro 13  5S: Software, Services, Switch, Storage, Security and more  Total Solutions: Enterprise, OEM- Appliance / Cloud  Complete Systems  Sub-systems and Components ~5X+ Faster growth rate than the industry avg rate over the past 12+ months (~50% YoY) ~5X+ Faster growth rate than the industry avg rate over the past 12+ months (~50% YoY) Our Momentum: SMCI 1.0 Components & Subsystems SMCI 2.0 Servers & Storage Systems SMCI 3.0 Total IT Solutions Today 1993 $5B $10B
  • 14. Confidential SMCI AI Strategy 9/20/2023 Better Faster Greener™ © 2023 Supermicro 14 • Partner with the Leaders • Provide the best picks and shovels for the gold miners (Apps, YOU) • Do not be religious with Products Offerings (multi-vendor, multi-platform)
  • 15. Confidential SMCI AI Business Results 9/20/2023 Better Faster Greener™ © 2023 Supermicro 15 • Bring up platform partner for virtually all AI Solutions / GPU offerings • Lead supplier for virtually all Large Language Model Cloud Deployments (ChatGPT, BARD, Bing, etc.) The Next Platform, August 16, 2023
  • 16. Confidential 16 ©2023 Supermicro GPU Optimized Systems by Workloads • Large Scale AI Training • HPC/AI Workloads H100 PCIe Grace Hopper Superchip (Grace CPU + H100 GPU) H100 NVL HGX H100 SXM 8-GPU or 4-GPU 4U 4-GPU System (HGX H100 SXM) (codenamed: Redstone-Next) SYS-421GU-TNXR, SYS-521GU-TNXR 8U 8-GPU System (HGX H100 SXM) (codenamed: Delta-Next) SYS-821GE-TNHR, AS -8125GS-TNHR 4U 4-GPU System (HGX H100 SXM) SYS-421GU-TNXR 4U/5U 8-10 GPU System SYS-521GE-TNRT, SYS-421GE-TNRT/TNRT3 AS -4125GS-TNRT/TNRT1/TNRT2 1U Grace Hopper MGX System SYS-421GU-TNXR / SYS-521GU-TNXR 8U SuperBlade (Up to 20 nodes) SBI-411E-1G / SBI-411E-5G Petabyte Scale All-Flash Storage SSG-121E-NE316R, ASG-1115S-NE316R
  • 17. Confidential Scales to thousands of nodes in 32-node increments (SRS-42UHPC-32SU-01) Accelerate AI Development by Supermicro Supermicro 8U Delta-Next (SYS-821GE-TNHR) A Proven Platform, Purpose Built for AI H100 SXM5 GPU ConnectX-7 SmartNICs H100 Rack Scale SuperPod Scalable Unit 8x NVIDIA H100 SXM5 GPUs | 640GB HBM3 GPU Memory 2TB System Memory | 3.2Tbps Network B/W | Superior I/O 32x HGX H100 | 1+ EFLOPS AI | 20TB HBM3 GPU Memory 102.4Tbps Network B/W Non-blocking | InfiniBand NDR Software: NVIDIA BCM | NGC | NVAIE | SLURM | Kubernetes Full Turnkey AI Supercomputer for Enterprises 9/20/2023 Better Faster Greener™ © 2023 Supermicro 17
  • 18. Confidential Supermicro Rack Integration Services • Full rack integration up to L11 and L12 • Broad portfolio of compute, power, cooling and networking options • Liquid cooling integration • Cooling Distribution Unit (CDU) • Direct to Chip cold plate • Manifold and tubing • Design, assembly, configuration, testing and deployment • Start running applications from Day 1
  • 19. Confidential Supermicro CDU 80kW to 120kW, 45°C Warm Water Liquid Cooling Option for Rack Scale H100 SuperPods 9/20/2023 Better Faster Greener™ © 2023 Supermicro 19
  • 20. Confidential Onsite Rack Services 9/20/2023 Better Faster Greener™ © 2023 Supermicro 20 Simplifying Your Solution Deployment Needs • White glove custom service from beginning to end • Onsite rack & stack of the custom solution • Onsite integration ensuring proper installation and connectivity, providing for reliable operation and reduced downtime • Onsite software installation with application configurations • Onsite benchmark testing ensuring solution meets the requirements of the customer • Delivery of a customized rack solution that meets all requirements • SMC Cooling tower product line is available to enable facility level water connections for CDU/CDM/RDHX Reliable – Repeatable – Reproducible
  • 21. Confidential DISCLAIMER Super Micro Computer, Inc. may make changes to specifications and product descriptions at any time, without notice. The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors. Any performance tests and ratings are measured using systems that reflect the approximate performance of Super Micro Computer, Inc. products as measured by those tests. Any differences in software or hardware configuration may affect actual performance, and Super Micro Computer, Inc. does not control the design or implementation of third party benchmarks or websites referenced in this document. The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to any changes in product and/or roadmap, component and hardware revision changes, new model and/or product releases, software changes, firmware changes, or the like. Super Micro Computer, Inc. assumes no obligation to update or otherwise correct or revise this information. SUPER MICRO COMPUTER, INC. MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION. SUPER MICRO COMPUTER, INC. SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL SUPER MICRO COMPUTER, INC. BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF SUPER MICRO COMPUTER, Inc. IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. ATTRIBUTION © 2023 Super Micro Computer, Inc. All rights reserved. 9/20/2023 Better Faster Greener™ © 2023 Supermicro 21