Generative AI for the rest of us

Massimo Ferre'
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
Massimo Re Ferrè
Senior Principal Technologist, AWS
Generative AI for the rest of us
© 2023, Amazon Web Services, Inc. or its affiliates. 2
Mainframes
Zooming out a bit
Technology wave #1
Data center
© 2023, Amazon Web Services, Inc. or its affiliates. 3
Mainframes
Zooming out a bit
Personal Computers
Technology wave #2
Technology wave #1
Data center
© 2023, Amazon Web Services, Inc. or its affiliates. 4
Mainframes
Zooming out a bit
Phyisical Servers
Virtual Machines
Personal Computers
Technology wave #2
Technology wave #1
Data center
© 2023, Amazon Web Services, Inc. or its affiliates. 5
Mainframes
Zooming out a bit
Phyisical Servers
Virtual Machines
Personal Computers
Technology wave #2
Technology wave #1
T
e
c
h
n
o
l
o
g
y
d
e
l
i
v
e
r
y
m
o
d
e
l
Data center
Cloud
© 2023, Amazon Web Services, Inc. or its affiliates. 6
Mainframes
Zooming out a bit
Phyisical Servers
Virtual Machines
Personal Computers
Containers
Functions
Technology wave #2
Technology wave #1
T
e
c
h
n
o
l
o
g
y
d
e
l
i
v
e
r
y
m
o
d
e
l
Data center
Cloud
© 2023, Amazon Web Services, Inc. or its affiliates. 7
Mainframes
Zooming out a bit
Phyisical Servers
Virtual Machines
Personal Computers
Containers
Functions
Generative AI
Technology wave #3
Technology wave #2
Technology wave #1
T
e
c
h
n
o
l
o
g
y
d
e
l
i
v
e
r
y
m
o
d
e
l
Data center
Cloud
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
What is Generative AI?
8
© 2023, Amazon Web Services, Inc. or its affiliates.
What is Generative AI (in simple terms)
9
- Traditional AI/ML: “Is this a picture of Rome or Florence?”
- [ Discriminative ]
- Gen AI: “Compare Rome Vs. Florence for someone interested in history”
- [ Generative ]
© 2023, Amazon Web Services, Inc. or its affiliates.
Gen AI “prompt”
10
A T I T S V E R Y C O R E ( T H E L L M - L A R G E L A N G U A G E M O D E L ) , G E N A I I S A F A K E . B U T A U S E F U L O N E
submit
© 2023, Amazon Web Services, Inc. or its affiliates.
This is how I like to think about an LLM
11
* or any profession that has nothing to do with a job in IT for that matter
q An LLM is akin to a … windsurfer professional*
qVery proficient in English
qAnd that had memorized all Wikipedia and all IT forums out there (and a lot more)
q They know Stack Overflow inside out! But don’t have a window to check the
weather (or a watch to check the time, etc)
q On their own, they have no relation to reality (beyond what they read)
q But they are great at generating free form content based on what they know
“have seen"
© 2023, Amazon Web Services, Inc. or its affiliates.
Why is Gen AI useful? [ the builder view ]
Source of unstructured knowledge
How can I use this knowledge and reason about it to create a new asset?
An asset being a piece of code, a whole program, a blog, an architecture, a troubleshooting
workflow, a db query and more outside of the IT realm (a poem, a picture, a receipt …)
© 2023, Amazon Web Services, Inc. or its affiliates.
Why is Gen AI useful? [ the builder view ]
Read and
memorize it all
(LOL – yeah sure)
(1)
Source of unstructured knowledge
You
© 2023, Amazon Web Services, Inc. or its affiliates.
Why is Gen AI useful? [ the builder view ]
You
Read and
memorize it all
(LOL – yeah sure)
Search engines (possibly
not relevant and still hard
- you are the integrator
and generator of a new
asset – text or code)
(1)
(2)
Source of unstructured knowledge
asset
© 2023, Amazon Web Services, Inc. or its affiliates.
Why is Gen AI useful? [ the builder view ]
15
You
Read and
memorize it all
(LOL – yeah sure)
LLM
Train on it
(doable)
(1)
(2)
(3a)
Natural language
conversation
(3b)
Source of unstructured knowledge
asset
asset
Search engines (possibly
not relevant and still hard
- you are the integrator
and generator of a new
asset – text or code)
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
My first Gen AI application
16
© 2023, Amazon Web Services, Inc. or its affiliates.
Real life use case
17
B A C K G R O U N D : I H A T E W H A T S A P P V O C A L M E S S A G E S
!!!!
© 2023, Amazon Web Services, Inc. or its affiliates.
Real life use case – the ClickOps version
18
B A C K G R O U N D : I H A T E W H A T S A P P V O C A L M E S S A G E S
Audio
file
Text
file
LLM
Audio to text translation Text summarization
Text
file
© 2023, Amazon Web Services, Inc. or its affiliates.
Real life use case – the ClickOps version
19
B A C K G R O U N D : I H A T E W H A T S A P P V O C A L M E S S A G E S
Prompt
Output
(generated asset)
© 2023, Amazon Web Services, Inc. or its affiliates.
Real life use case – the application version
20
M Y F I R S T ( N O N T U T O R I A L - B A S E D H E L L O - W O R L D ) G E N E R A T I V E A I A P P L I C A T I O N
© 2023, Amazon Web Services, Inc. or its affiliates. 21
Real life use case – the application version
L A M B D A C A L L S A N E X T E R N A L L L M S E R V I C E
https://it20.info/2023/08/building-a-generative-ai-application-using-aws-step-functions/
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
Making LLMs useful
22
© 2023, Amazon Web Services, Inc. or its affiliates. 23
Why are people talking about things like Agents, Tools, RAG..
q The LLM is just one (fundamental) component of Generative AI
q The LLM could hallucinate, don’t have knowledge of recent / private / live
information, can’t do advanced math, may have limited reasoning capabilities, etc.
q You need something to complement its capabilities and guide/help it
q Especially for “real” business use cases that go beyond “toying around”
© 2023, Amazon Web Services, Inc. or its affiliates. 24
Why are people talking about things like Agents, Tools, RAG..
T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S )
Developing code
Debugging code
Living life
Deploying code
Domains
Writing a novel
Organizing travels
© 2023, Amazon Web Services, Inc. or its affiliates. 25
Why are people talking about things like Agents, Tools, RAG..
T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S )
Developing code
Debugging code
Deploying code
Domains
Autocomplete a
function method
Build a new ERP
from scratch
Resolve an error
message
Rearchitect the
app to avoid this
error at scale
Suggest what I
could do today
Organize my
whole life for the
next 10 years
Level of abstraction
Simple task Complex task
Writing a novel
Living life
Organizing travels
Tell me how long
it takes driving
from Florence to
Rome
Plan in details all
my 1-year long
sabbatical
© 2023, Amazon Web Services, Inc. or its affiliates. 26
Why are people talking about things like Agents, Tools, RAG..
T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S )
Domains
Level of abstraction
Simple task Complex task
Progressive complexity
Completion Chat Reasoning Acting
à
à
à
coverage
Domain
A
function
of
the
corpus
data
© 2023, Amazon Web Services, Inc. or its affiliates. 27
Why are people talking about things like Agents, Tools, RAG..
T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S )
Domains
Level of abstraction
Simple task Complex task
Large Language model
Smaller
purpose
built/tuned
model
Models may need to be
helped / guided to achieve
goals where task complexity
is too high or simply for
missing domain knowledge
© 2023, Amazon Web Services, Inc. or its affiliates. 28
Why are people talking about things like Agents, Tools, RAG..
A N E X A M P L E O F C O T ( C H A I N O F T H O U G H T S )
https://arxiv.org/abs/2201.11903
But sometimes in-prompt Chain of Thoughts (CoT) isn’t enough for the LLM to reason properly
W
elcom
e to
the
m
agic world
of
“prom
pt engineering”
© 2023, Amazon Web Services, Inc. or its affiliates. 29
Why are people talking about things like Agents, Tools, RAG..
A N E X A M P L E O F T H E F A C T C H E C K I N G W I T H P R O M P T C H A I N I N G P R O C E S S
https://it20.info/2023/6/the-dark-zone-between-the-magic-genai-experience-and-the-large-language-model/
Q: What is the biggest clock in the world?
© 2023, Amazon Web Services, Inc. or its affiliates. 30
Why are people talking about things like Agents, Tools, RAG..
LLM
A N E X A M P L E O F T O O L S
You
“what’s the weather like
today in Rome?”
Math function
code
Web search
code
“Calculate <very complex
formula>”
(1a)
(2a)
(1b)
(2b)
© 2023, Amazon Web Services, Inc. or its affiliates. 31
Why are people talking about things like Agents, Tools, RAG..
A N E X A M P L E O F R E A C T ( R E A S O N I N G A N D A C T I N G )
https://arxiv.org/abs/2210.03629
© 2023, Amazon Web Services, Inc. or its affiliates. 32
Why are people talking about things like Agents, Tools, RAG..
LLM
A N E X A M P L E O F R E A C T ( R E A S O N I N G A N D A C T I N G )
You
Iterating
reasoning
code
“Write the solution for
<very complex task>” (1)
(2)
https://arxiv.org/abs/2210.03629
Tool
© 2023, Amazon Web Services, Inc. or its affiliates. 33
Why are people talking about things like Agents, Tools, RAG..
LLM
Vector
DB
A N E X A M P L E O F R A G ( R E T R I E V A L - A U G M E N T E D G E N E R A T I O N )
You
Private corpus of data
embedding
“Write a draft email on <specific
company secret topic>”
(1)
(2)
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
Prompt context Vs. RAG Vs. fine-tuning
34
© 2023, Amazon Web Services, Inc. or its affiliates. 35
Prompt context Vs. RAG Vs. fine-tuning: I am lost
q Fair. There are three ways to increase an LLM answer precision and correctness
1. Provide context in the prompt
2. Augment the LLM with an external source of vectorized data at inference time (RAG)
3. Fine tune the LLM with additional data
q There isn’t a global right or wrong approach. As often happens, it depends
q Also they are not mutually exclusive
q They could (and often should) be used together to achieve optimal results
© 2023, Amazon Web Services, Inc. or its affiliates. 36
q Rate of the change of the data source
q Limits, cost, latency, speed of prompt context tokens
q Cost of fine tuning
q including the work required to “prepare the data”
q Cost of creating and maintaining the vector store
Prompt context Vs. RAG Vs. fine-tuning: when to use what?
© 2023, Amazon Web Services, Inc. or its affiliates. 37
q Complexity of the architecture
q fine tuning may make the architecture easier (with an upfront fine-tuning investment)
q Shape and location of the data source
q Precision of the outcome
q no absolute rules exist, testing may be required
q Personal experience of the team building the solution
q “I have always used RAG and that’s what I am comfortable with”
Prompt context Vs. RAG Vs. fine-tuning: when to use what?
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
Who’s Gen AI for?
38
© 2023, Amazon Web Services, Inc. or its affiliates. 39
Who’s Gen AI for?
q For the developer that is writing code
q e.g. code assistants e.g. AWS CodeWhisperer
q For the developer that wants to use English as a programming language
q e.g. the example of the WhatsApp vocal messages
q For the ops person that does not want to write a SQL query to extract data
q e.g. https://www.honeycomb.io/blog/introducing-query-assistant
© 2023, Amazon Web Services, Inc. or its affiliates. 40
Who’s Gen AI for?
q For the business analyst that wants to create a report off of a spreadsheet
q For the journalist that wants to draft an article on a specific topic
q Etc. etc.
q Come see me later to chat about the story of my plumber impressed by “chat …
chat …. chat something” (true story)
© 2023, Amazon Web Services, Inc. or its affiliates. 41
Conclusions
q Get ready for this new wave. It’s coming and (I think) it’s staying.
q LLMs have moved the needle of the art of possible
q But LLMs alone are not enough. You need to … make LLMs useful.
q Gen AI is for everyone, not just for “builders”. It’s for “consumers” of tech too.
q Go explore! Go build!
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
Massimo Re Ferrè
Senior Principal Technologist, AWS
Twitter: @mreferre
E-mail: mreferre@amazon.com
Thanks!
1 de 42

Recomendados

Using the power of Generative AI at scale por
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scaleMaxim Salnikov
901 visualizações31 slides
Suresh Poopandi_Generative AI On AWS-MidWestCommunityDay-Final.pdf por
Suresh Poopandi_Generative AI On AWS-MidWestCommunityDay-Final.pdfSuresh Poopandi_Generative AI On AWS-MidWestCommunityDay-Final.pdf
Suresh Poopandi_Generative AI On AWS-MidWestCommunityDay-Final.pdfAWS Chicago
713 visualizações43 slides
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74) por
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)Naoki (Neo) SATO
1.2K visualizações47 slides
Exploring Opportunities in the Generative AI Value Chain.pdf por
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfDung Hoang
266 visualizações9 slides
Unlocking the Power of Generative AI An Executive's Guide.pdf por
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfPremNaraindas1
2.2K visualizações29 slides
An Introduction to Generative AI - May 18, 2023 por
An Introduction  to Generative AI - May 18, 2023An Introduction  to Generative AI - May 18, 2023
An Introduction to Generative AI - May 18, 2023CoriFaklaris1
904 visualizações28 slides

Mais conteúdo relacionado

Mais procurados

How Does Generative AI Actually Work? (a quick semi-technical introduction to... por
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...ssuser4edc93
938 visualizações14 slides
AzureOpenAI.pptx por
AzureOpenAI.pptxAzureOpenAI.pptx
AzureOpenAI.pptxUdaiappa Ramachandran
545 visualizações16 slides
Leveraging Generative AI & Best practices por
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesDianaGray10
1.7K visualizações21 slides
introduction Azure OpenAI by Usama wahab khan por
introduction  Azure OpenAI by Usama wahab khanintroduction  Azure OpenAI by Usama wahab khan
introduction Azure OpenAI by Usama wahab khanUsama Wahab Khan Cloud, Data and AI
944 visualizações26 slides
ChatGPT Evaluation for NLP por
ChatGPT Evaluation for NLPChatGPT Evaluation for NLP
ChatGPT Evaluation for NLPXiachongFeng
769 visualizações69 slides
Generative AI: Past, Present, and Future – A Practitioner's Perspective por
Generative AI: Past, Present, and Future – A Practitioner's PerspectiveGenerative AI: Past, Present, and Future – A Practitioner's Perspective
Generative AI: Past, Present, and Future – A Practitioner's PerspectiveHuahai Yang
549 visualizações48 slides

Mais procurados(20)

How Does Generative AI Actually Work? (a quick semi-technical introduction to... por ssuser4edc93
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
ssuser4edc93938 visualizações
Leveraging Generative AI & Best practices por DianaGray10
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
DianaGray101.7K visualizações
ChatGPT Evaluation for NLP por XiachongFeng
ChatGPT Evaluation for NLPChatGPT Evaluation for NLP
ChatGPT Evaluation for NLP
XiachongFeng769 visualizações
Generative AI: Past, Present, and Future – A Practitioner's Perspective por Huahai Yang
Generative AI: Past, Present, and Future – A Practitioner's PerspectiveGenerative AI: Past, Present, and Future – A Practitioner's Perspective
Generative AI: Past, Present, and Future – A Practitioner's Perspective
Huahai Yang549 visualizações
Best Practice on using Azure OpenAI Service por Kumton Suttiraksiri
Best Practice on using Azure OpenAI ServiceBest Practice on using Azure OpenAI Service
Best Practice on using Azure OpenAI Service
Kumton Suttiraksiri517 visualizações
AI and ML Series - Introduction to Generative AI and LLMs - Session 1 por DianaGray10
AI and ML Series - Introduction to Generative AI and LLMs - Session 1AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
DianaGray101.2K visualizações
Generative AI por lutzsuarnaba1
Generative AIGenerative AI
Generative AI
lutzsuarnaba1630 visualizações
Generative AI por All Things Open
Generative AIGenerative AI
Generative AI
All Things Open4.5K visualizações
Large Language Models Bootcamp por Data Science Dojo
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models Bootcamp
Data Science Dojo3.3K visualizações
Using Generative AI por Mark DeLoura
Using Generative AIUsing Generative AI
Using Generative AI
Mark DeLoura129 visualizações
An Introduction to Generative AI por Cori Faklaris
An Introduction  to Generative AIAn Introduction  to Generative AI
An Introduction to Generative AI
Cori Faklaris11K visualizações
OpenAI Chatgpt.pptx por Nawroz University
OpenAI Chatgpt.pptxOpenAI Chatgpt.pptx
OpenAI Chatgpt.pptx
Nawroz University842 visualizações
ChatGPT, Foundation Models and Web3.pptx por Jesus Rodriguez
ChatGPT, Foundation Models and Web3.pptxChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptx
Jesus Rodriguez799 visualizações
Using Generative AI in the Classroom .pptx por JonathanDietz3
Using Generative AI in the Classroom .pptxUsing Generative AI in the Classroom .pptx
Using Generative AI in the Classroom .pptx
JonathanDietz3110 visualizações
Generative-AI-in-enterprise-20230615.pdf por Liming Zhu
Generative-AI-in-enterprise-20230615.pdfGenerative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdf
Liming Zhu815 visualizações
AUGMENTING CREATIVITY USING GEN AI FOR DESIGN & INNOVATION | TOJIN T. EAPEN por Tojin Eapen, PhD
AUGMENTING CREATIVITY  USING GEN AI FOR DESIGN & INNOVATION | TOJIN T. EAPENAUGMENTING CREATIVITY  USING GEN AI FOR DESIGN & INNOVATION | TOJIN T. EAPEN
AUGMENTING CREATIVITY USING GEN AI FOR DESIGN & INNOVATION | TOJIN T. EAPEN
Tojin Eapen, PhD53 visualizações
Responsible Generative AI por CMassociates
Responsible Generative AIResponsible Generative AI
Responsible Generative AI
CMassociates232 visualizações
Generative AI and law.pptx por Chris Marsden
Generative AI and law.pptxGenerative AI and law.pptx
Generative AI and law.pptx
Chris Marsden625 visualizações

Similar a Generative AI for the rest of us

AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ... por
AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...
AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...Amazon Web Services
632 visualizações30 slides
MongoDB .local London 2019: Using AWS to Transform Customer Data in MongoDB i... por
MongoDB .local London 2019: Using AWS to Transform Customer Data in MongoDB i...MongoDB .local London 2019: Using AWS to Transform Customer Data in MongoDB i...
MongoDB .local London 2019: Using AWS to Transform Customer Data in MongoDB i...Lisa Roth, PMP
46 visualizações48 slides
Sviluppa, addestra e distribuisci modelli di machine learning.pdf por
Sviluppa, addestra e distribuisci modelli di machine learning.pdfSviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdfAmazon Web Services
985 visualizações37 slides
Machine Learning for the Enterprise, ft. Sony Interactive Entertainment (ENT2... por
Machine Learning for the Enterprise, ft. Sony Interactive Entertainment (ENT2...Machine Learning for the Enterprise, ft. Sony Interactive Entertainment (ENT2...
Machine Learning for the Enterprise, ft. Sony Interactive Entertainment (ENT2...Amazon Web Services
2.1K visualizações40 slides
Add Intelligence to Applications with AWS AI Services por
Add Intelligence to Applications with AWS AI ServicesAdd Intelligence to Applications with AWS AI Services
Add Intelligence to Applications with AWS AI ServicesNicholas Walsh
257 visualizações83 slides
AI/ML Week: Strengthen Cybersecurity por
AI/ML Week: Strengthen CybersecurityAI/ML Week: Strengthen Cybersecurity
AI/ML Week: Strengthen CybersecurityAmazon Web Services
2.3K visualizações36 slides

Similar a Generative AI for the rest of us(20)

AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ... por Amazon Web Services
AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...
AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...
Amazon Web Services632 visualizações
MongoDB .local London 2019: Using AWS to Transform Customer Data in MongoDB i... por Lisa Roth, PMP
MongoDB .local London 2019: Using AWS to Transform Customer Data in MongoDB i...MongoDB .local London 2019: Using AWS to Transform Customer Data in MongoDB i...
MongoDB .local London 2019: Using AWS to Transform Customer Data in MongoDB i...
Lisa Roth, PMP46 visualizações
Sviluppa, addestra e distribuisci modelli di machine learning.pdf por Amazon Web Services
Sviluppa, addestra e distribuisci modelli di machine learning.pdfSviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdf
Amazon Web Services985 visualizações
Machine Learning for the Enterprise, ft. Sony Interactive Entertainment (ENT2... por Amazon Web Services
Machine Learning for the Enterprise, ft. Sony Interactive Entertainment (ENT2...Machine Learning for the Enterprise, ft. Sony Interactive Entertainment (ENT2...
Machine Learning for the Enterprise, ft. Sony Interactive Entertainment (ENT2...
Amazon Web Services2.1K visualizações
Add Intelligence to Applications with AWS AI Services por Nicholas Walsh
Add Intelligence to Applications with AWS AI ServicesAdd Intelligence to Applications with AWS AI Services
Add Intelligence to Applications with AWS AI Services
Nicholas Walsh257 visualizações
AI/ML Week: Strengthen Cybersecurity por Amazon Web Services
AI/ML Week: Strengthen CybersecurityAI/ML Week: Strengthen Cybersecurity
AI/ML Week: Strengthen Cybersecurity
Amazon Web Services2.3K visualizações
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019 por AWS Summits
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
AWS Summits218 visualizações
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019 por Amazon Web Services
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
Amazon Web Services442 visualizações
Build Intelligent Apps Using AI Services.pdf por Amazon Web Services
Build Intelligent Apps Using AI Services.pdfBuild Intelligent Apps Using AI Services.pdf
Build Intelligent Apps Using AI Services.pdf
Amazon Web Services702 visualizações
Build Machine Learning Models with Amazon SageMaker (April 2019) por Julien SIMON
Build Machine Learning Models with Amazon SageMaker (April 2019)Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)
Julien SIMON1.6K visualizações
Machine learning for developers & data scientists with Amazon SageMaker - AIM... por Amazon Web Services
Machine learning for developers & data scientists with Amazon SageMaker - AIM...Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Amazon Web Services788 visualizações
Build_Intelligent_Apps_Using_AI_Services.pdf por Amazon Web Services
Build_Intelligent_Apps_Using_AI_Services.pdfBuild_Intelligent_Apps_Using_AI_Services.pdf
Build_Intelligent_Apps_Using_AI_Services.pdf
Amazon Web Services244 visualizações
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin... por Amazon Web Services
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon Web Services448 visualizações
AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ... por Amazon Web Services
AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...
AWS Summit Singapore 2019 | Building Business Outcomes with Machine Learning ...
Amazon Web Services776 visualizações
Build-Train-Deploy-Machine-Learning-Models-at-Any-Scale por Amazon Web Services
Build-Train-Deploy-Machine-Learning-Models-at-Any-ScaleBuild-Train-Deploy-Machine-Learning-Models-at-Any-Scale
Build-Train-Deploy-Machine-Learning-Models-at-Any-Scale
Amazon Web Services832 visualizações
Meetup Niort Data - AWS Intelligence Artificielle por Olivier Cahagne
Meetup Niort Data - AWS Intelligence ArtificielleMeetup Niort Data - AWS Intelligence Artificielle
Meetup Niort Data - AWS Intelligence Artificielle
Olivier Cahagne128 visualizações
Machine Learning at the Edge por Amazon Web Services
Machine Learning at the EdgeMachine Learning at the Edge
Machine Learning at the Edge
Amazon Web Services212 visualizações
The IoT Offering Explained in Plain English - IOT201 - re:Invent 2017 por Amazon Web Services
The IoT Offering Explained in Plain English - IOT201 - re:Invent 2017The IoT Offering Explained in Plain English - IOT201 - re:Invent 2017
The IoT Offering Explained in Plain English - IOT201 - re:Invent 2017
Amazon Web Services1.4K visualizações
Building Intelligent Applications Using AI Services por Amazon Web Services
Building Intelligent Applications Using AI ServicesBuilding Intelligent Applications Using AI Services
Building Intelligent Applications Using AI Services
Amazon Web Services306 visualizações

Mais de Massimo Ferre'

IDI_2023_MRF-Final.pdf por
IDI_2023_MRF-Final.pdfIDI_2023_MRF-Final.pdf
IDI_2023_MRF-Final.pdfMassimo Ferre'
30 visualizações17 slides
Web Assembly (on the server) por
Web Assembly (on the server)Web Assembly (on the server)
Web Assembly (on the server)Massimo Ferre'
699 visualizações40 slides
IDI 2022: Making sense of the '17 ways to run containers on AWS' por
IDI 2022: Making sense of the '17 ways to run containers on AWS'IDI 2022: Making sense of the '17 ways to run containers on AWS'
IDI 2022: Making sense of the '17 ways to run containers on AWS'Massimo Ferre'
166 visualizações25 slides
Codemotion 2020 - Containers Meet Serverless por
Codemotion 2020 - Containers Meet ServerlessCodemotion 2020 - Containers Meet Serverless
Codemotion 2020 - Containers Meet ServerlessMassimo Ferre'
161 visualizações17 slides
ContainerDay 2020 - Using Docker as a frontend for Amazon ECS and AWS Fargate por
ContainerDay 2020 - Using Docker as a frontend for Amazon ECS and AWS Fargate ContainerDay 2020 - Using Docker as a frontend for Amazon ECS and AWS Fargate
ContainerDay 2020 - Using Docker as a frontend for Amazon ECS and AWS Fargate Massimo Ferre'
275 visualizações21 slides
IDI 2020 - Containers Meet Serverless por
IDI 2020 - Containers Meet ServerlessIDI 2020 - Containers Meet Serverless
IDI 2020 - Containers Meet ServerlessMassimo Ferre'
229 visualizações28 slides

Mais de Massimo Ferre'(14)

IDI_2023_MRF-Final.pdf por Massimo Ferre'
IDI_2023_MRF-Final.pdfIDI_2023_MRF-Final.pdf
IDI_2023_MRF-Final.pdf
Massimo Ferre'30 visualizações
Web Assembly (on the server) por Massimo Ferre'
Web Assembly (on the server)Web Assembly (on the server)
Web Assembly (on the server)
Massimo Ferre'699 visualizações
IDI 2022: Making sense of the '17 ways to run containers on AWS' por Massimo Ferre'
IDI 2022: Making sense of the '17 ways to run containers on AWS'IDI 2022: Making sense of the '17 ways to run containers on AWS'
IDI 2022: Making sense of the '17 ways to run containers on AWS'
Massimo Ferre'166 visualizações
Codemotion 2020 - Containers Meet Serverless por Massimo Ferre'
Codemotion 2020 - Containers Meet ServerlessCodemotion 2020 - Containers Meet Serverless
Codemotion 2020 - Containers Meet Serverless
Massimo Ferre'161 visualizações
ContainerDay 2020 - Using Docker as a frontend for Amazon ECS and AWS Fargate por Massimo Ferre'
ContainerDay 2020 - Using Docker as a frontend for Amazon ECS and AWS Fargate ContainerDay 2020 - Using Docker as a frontend for Amazon ECS and AWS Fargate
ContainerDay 2020 - Using Docker as a frontend for Amazon ECS and AWS Fargate
Massimo Ferre'275 visualizações
IDI 2020 - Containers Meet Serverless por Massimo Ferre'
IDI 2020 - Containers Meet ServerlessIDI 2020 - Containers Meet Serverless
IDI 2020 - Containers Meet Serverless
Massimo Ferre'229 visualizações
Containers Meetup (AWS+CNCF) Milano Jan 15th 2020 por Massimo Ferre'
Containers Meetup (AWS+CNCF) Milano Jan 15th 2020Containers Meetup (AWS+CNCF) Milano Jan 15th 2020
Containers Meetup (AWS+CNCF) Milano Jan 15th 2020
Massimo Ferre'80 visualizações
Breaking the monolith (an example) por Massimo Ferre'
Breaking the monolith (an example)Breaking the monolith (an example)
Breaking the monolith (an example)
Massimo Ferre'92 visualizações
From 0 to Blue-Green deployments on AWS Fargate por Massimo Ferre'
From 0 to Blue-Green deployments on AWS Fargate From 0 to Blue-Green deployments on AWS Fargate
From 0 to Blue-Green deployments on AWS Fargate
Massimo Ferre'209 visualizações
AWS Summit Stockholm - Fargate: deploy containers, not infrastructure por Massimo Ferre'
AWS Summit Stockholm - Fargate: deploy containers, not infrastructureAWS Summit Stockholm - Fargate: deploy containers, not infrastructure
AWS Summit Stockholm - Fargate: deploy containers, not infrastructure
Massimo Ferre'757 visualizações
AWS Summit London 2019 - Containers on AWS por Massimo Ferre'
AWS Summit London 2019 - Containers on AWSAWS Summit London 2019 - Containers on AWS
AWS Summit London 2019 - Containers on AWS
Massimo Ferre'7K visualizações
Meetup CNCF Torino - Amazon EKS March 29th 2019 por Massimo Ferre'
Meetup CNCF Torino - Amazon EKS March 29th 2019 Meetup CNCF Torino - Amazon EKS March 29th 2019
Meetup CNCF Torino - Amazon EKS March 29th 2019
Massimo Ferre'56 visualizações
End-to-end CI/CD deployments of containerized applications using AWS services por Massimo Ferre'
End-to-end CI/CD deployments of containerized applications using AWS servicesEnd-to-end CI/CD deployments of containerized applications using AWS services
End-to-end CI/CD deployments of containerized applications using AWS services
Massimo Ferre'856 visualizações
Containers at AWS: State of the Union por Massimo Ferre'
Containers at AWS: State of the Union  Containers at AWS: State of the Union
Containers at AWS: State of the Union
Massimo Ferre'134 visualizações

Último

STUDY OF SMART MATERIALS USED IN CONSTRUCTION-1.pptx por
STUDY OF SMART MATERIALS USED IN CONSTRUCTION-1.pptxSTUDY OF SMART MATERIALS USED IN CONSTRUCTION-1.pptx
STUDY OF SMART MATERIALS USED IN CONSTRUCTION-1.pptxAnnieRachelJohn
31 visualizações34 slides
Saikat Chakraborty Java Oracle Certificate.pdf por
Saikat Chakraborty Java Oracle Certificate.pdfSaikat Chakraborty Java Oracle Certificate.pdf
Saikat Chakraborty Java Oracle Certificate.pdfSaikatChakraborty787148
14 visualizações1 slide
SWM L1-L14_drhasan (Part 1).pdf por
SWM L1-L14_drhasan (Part 1).pdfSWM L1-L14_drhasan (Part 1).pdf
SWM L1-L14_drhasan (Part 1).pdfMahmudHasan747870
48 visualizações150 slides
Update 42 models(Diode/General ) in SPICE PARK(DEC2023) por
Update 42 models(Diode/General ) in SPICE PARK(DEC2023)Update 42 models(Diode/General ) in SPICE PARK(DEC2023)
Update 42 models(Diode/General ) in SPICE PARK(DEC2023)Tsuyoshi Horigome
19 visualizações16 slides
What is Whirling Hygrometer.pdf por
What is Whirling Hygrometer.pdfWhat is Whirling Hygrometer.pdf
What is Whirling Hygrometer.pdfIIT KHARAGPUR
11 visualizações3 slides
DevOps to DevSecOps: Enhancing Software Security Throughout The Development L... por
DevOps to DevSecOps: Enhancing Software Security Throughout The Development L...DevOps to DevSecOps: Enhancing Software Security Throughout The Development L...
DevOps to DevSecOps: Enhancing Software Security Throughout The Development L...Anowar Hossain
12 visualizações34 slides

Último(20)

STUDY OF SMART MATERIALS USED IN CONSTRUCTION-1.pptx por AnnieRachelJohn
STUDY OF SMART MATERIALS USED IN CONSTRUCTION-1.pptxSTUDY OF SMART MATERIALS USED IN CONSTRUCTION-1.pptx
STUDY OF SMART MATERIALS USED IN CONSTRUCTION-1.pptx
AnnieRachelJohn31 visualizações
Saikat Chakraborty Java Oracle Certificate.pdf por SaikatChakraborty787148
Saikat Chakraborty Java Oracle Certificate.pdfSaikat Chakraborty Java Oracle Certificate.pdf
Saikat Chakraborty Java Oracle Certificate.pdf
SaikatChakraborty78714814 visualizações
SWM L1-L14_drhasan (Part 1).pdf por MahmudHasan747870
SWM L1-L14_drhasan (Part 1).pdfSWM L1-L14_drhasan (Part 1).pdf
SWM L1-L14_drhasan (Part 1).pdf
MahmudHasan74787048 visualizações
Update 42 models(Diode/General ) in SPICE PARK(DEC2023) por Tsuyoshi Horigome
Update 42 models(Diode/General ) in SPICE PARK(DEC2023)Update 42 models(Diode/General ) in SPICE PARK(DEC2023)
Update 42 models(Diode/General ) in SPICE PARK(DEC2023)
Tsuyoshi Horigome19 visualizações
What is Whirling Hygrometer.pdf por IIT KHARAGPUR
What is Whirling Hygrometer.pdfWhat is Whirling Hygrometer.pdf
What is Whirling Hygrometer.pdf
IIT KHARAGPUR 11 visualizações
DevOps to DevSecOps: Enhancing Software Security Throughout The Development L... por Anowar Hossain
DevOps to DevSecOps: Enhancing Software Security Throughout The Development L...DevOps to DevSecOps: Enhancing Software Security Throughout The Development L...
DevOps to DevSecOps: Enhancing Software Security Throughout The Development L...
Anowar Hossain12 visualizações
MSA Website Slideshow (16).pdf por msaucla
MSA Website Slideshow (16).pdfMSA Website Slideshow (16).pdf
MSA Website Slideshow (16).pdf
msaucla46 visualizações
7_DVD_Combinational_MOS_Logic_Circuits.pdf por Usha Mehta
7_DVD_Combinational_MOS_Logic_Circuits.pdf7_DVD_Combinational_MOS_Logic_Circuits.pdf
7_DVD_Combinational_MOS_Logic_Circuits.pdf
Usha Mehta59 visualizações
Digital Watermarking Of Audio Signals.pptx por AyushJaiswal781174
Digital Watermarking Of Audio Signals.pptxDigital Watermarking Of Audio Signals.pptx
Digital Watermarking Of Audio Signals.pptx
AyushJaiswal7811748 visualizações
2_DVD_ASIC_Design_FLow.pdf por Usha Mehta
2_DVD_ASIC_Design_FLow.pdf2_DVD_ASIC_Design_FLow.pdf
2_DVD_ASIC_Design_FLow.pdf
Usha Mehta19 visualizações
SWM L15-L28_drhasan (Part 2).pdf por MahmudHasan747870
SWM L15-L28_drhasan (Part 2).pdfSWM L15-L28_drhasan (Part 2).pdf
SWM L15-L28_drhasan (Part 2).pdf
MahmudHasan74787028 visualizações
Literature review and Case study on Commercial Complex in Nepal, Durbar mall,... por AakashShakya12
Literature review and Case study on Commercial Complex in Nepal, Durbar mall,...Literature review and Case study on Commercial Complex in Nepal, Durbar mall,...
Literature review and Case study on Commercial Complex in Nepal, Durbar mall,...
AakashShakya1257 visualizações
Thermal aware task assignment for multicore processors using genetic algorithm por IJECEIAES
Thermal aware task assignment for multicore processors using genetic algorithm Thermal aware task assignment for multicore processors using genetic algorithm
Thermal aware task assignment for multicore processors using genetic algorithm
IJECEIAES30 visualizações
13_DVD_Latch-up_prevention.pdf por Usha Mehta
13_DVD_Latch-up_prevention.pdf13_DVD_Latch-up_prevention.pdf
13_DVD_Latch-up_prevention.pdf
Usha Mehta10 visualizações
Machine Element II Course outline.pdf por odatadese1
Machine Element II Course outline.pdfMachine Element II Course outline.pdf
Machine Element II Course outline.pdf
odatadese17 visualizações
Codes and Conventions.pptx por IsabellaGraceAnkers
Codes and Conventions.pptxCodes and Conventions.pptx
Codes and Conventions.pptx
IsabellaGraceAnkers7 visualizações
Multi-objective distributed generation integration in radial distribution sy... por IJECEIAES
Multi-objective distributed generation integration in radial  distribution sy...Multi-objective distributed generation integration in radial  distribution sy...
Multi-objective distributed generation integration in radial distribution sy...
IJECEIAES15 visualizações

Generative AI for the rest of us

  • 1. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. Massimo Re Ferrè Senior Principal Technologist, AWS Generative AI for the rest of us
  • 2. © 2023, Amazon Web Services, Inc. or its affiliates. 2 Mainframes Zooming out a bit Technology wave #1 Data center
  • 3. © 2023, Amazon Web Services, Inc. or its affiliates. 3 Mainframes Zooming out a bit Personal Computers Technology wave #2 Technology wave #1 Data center
  • 4. © 2023, Amazon Web Services, Inc. or its affiliates. 4 Mainframes Zooming out a bit Phyisical Servers Virtual Machines Personal Computers Technology wave #2 Technology wave #1 Data center
  • 5. © 2023, Amazon Web Services, Inc. or its affiliates. 5 Mainframes Zooming out a bit Phyisical Servers Virtual Machines Personal Computers Technology wave #2 Technology wave #1 T e c h n o l o g y d e l i v e r y m o d e l Data center Cloud
  • 6. © 2023, Amazon Web Services, Inc. or its affiliates. 6 Mainframes Zooming out a bit Phyisical Servers Virtual Machines Personal Computers Containers Functions Technology wave #2 Technology wave #1 T e c h n o l o g y d e l i v e r y m o d e l Data center Cloud
  • 7. © 2023, Amazon Web Services, Inc. or its affiliates. 7 Mainframes Zooming out a bit Phyisical Servers Virtual Machines Personal Computers Containers Functions Generative AI Technology wave #3 Technology wave #2 Technology wave #1 T e c h n o l o g y d e l i v e r y m o d e l Data center Cloud
  • 8. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. What is Generative AI? 8
  • 9. © 2023, Amazon Web Services, Inc. or its affiliates. What is Generative AI (in simple terms) 9 - Traditional AI/ML: “Is this a picture of Rome or Florence?” - [ Discriminative ] - Gen AI: “Compare Rome Vs. Florence for someone interested in history” - [ Generative ]
  • 10. © 2023, Amazon Web Services, Inc. or its affiliates. Gen AI “prompt” 10 A T I T S V E R Y C O R E ( T H E L L M - L A R G E L A N G U A G E M O D E L ) , G E N A I I S A F A K E . B U T A U S E F U L O N E submit
  • 11. © 2023, Amazon Web Services, Inc. or its affiliates. This is how I like to think about an LLM 11 * or any profession that has nothing to do with a job in IT for that matter q An LLM is akin to a … windsurfer professional* qVery proficient in English qAnd that had memorized all Wikipedia and all IT forums out there (and a lot more) q They know Stack Overflow inside out! But don’t have a window to check the weather (or a watch to check the time, etc) q On their own, they have no relation to reality (beyond what they read) q But they are great at generating free form content based on what they know “have seen"
  • 12. © 2023, Amazon Web Services, Inc. or its affiliates. Why is Gen AI useful? [ the builder view ] Source of unstructured knowledge How can I use this knowledge and reason about it to create a new asset? An asset being a piece of code, a whole program, a blog, an architecture, a troubleshooting workflow, a db query and more outside of the IT realm (a poem, a picture, a receipt …)
  • 13. © 2023, Amazon Web Services, Inc. or its affiliates. Why is Gen AI useful? [ the builder view ] Read and memorize it all (LOL – yeah sure) (1) Source of unstructured knowledge You
  • 14. © 2023, Amazon Web Services, Inc. or its affiliates. Why is Gen AI useful? [ the builder view ] You Read and memorize it all (LOL – yeah sure) Search engines (possibly not relevant and still hard - you are the integrator and generator of a new asset – text or code) (1) (2) Source of unstructured knowledge asset
  • 15. © 2023, Amazon Web Services, Inc. or its affiliates. Why is Gen AI useful? [ the builder view ] 15 You Read and memorize it all (LOL – yeah sure) LLM Train on it (doable) (1) (2) (3a) Natural language conversation (3b) Source of unstructured knowledge asset asset Search engines (possibly not relevant and still hard - you are the integrator and generator of a new asset – text or code)
  • 16. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. My first Gen AI application 16
  • 17. © 2023, Amazon Web Services, Inc. or its affiliates. Real life use case 17 B A C K G R O U N D : I H A T E W H A T S A P P V O C A L M E S S A G E S !!!!
  • 18. © 2023, Amazon Web Services, Inc. or its affiliates. Real life use case – the ClickOps version 18 B A C K G R O U N D : I H A T E W H A T S A P P V O C A L M E S S A G E S Audio file Text file LLM Audio to text translation Text summarization Text file
  • 19. © 2023, Amazon Web Services, Inc. or its affiliates. Real life use case – the ClickOps version 19 B A C K G R O U N D : I H A T E W H A T S A P P V O C A L M E S S A G E S Prompt Output (generated asset)
  • 20. © 2023, Amazon Web Services, Inc. or its affiliates. Real life use case – the application version 20 M Y F I R S T ( N O N T U T O R I A L - B A S E D H E L L O - W O R L D ) G E N E R A T I V E A I A P P L I C A T I O N
  • 21. © 2023, Amazon Web Services, Inc. or its affiliates. 21 Real life use case – the application version L A M B D A C A L L S A N E X T E R N A L L L M S E R V I C E https://it20.info/2023/08/building-a-generative-ai-application-using-aws-step-functions/
  • 22. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. Making LLMs useful 22
  • 23. © 2023, Amazon Web Services, Inc. or its affiliates. 23 Why are people talking about things like Agents, Tools, RAG.. q The LLM is just one (fundamental) component of Generative AI q The LLM could hallucinate, don’t have knowledge of recent / private / live information, can’t do advanced math, may have limited reasoning capabilities, etc. q You need something to complement its capabilities and guide/help it q Especially for “real” business use cases that go beyond “toying around”
  • 24. © 2023, Amazon Web Services, Inc. or its affiliates. 24 Why are people talking about things like Agents, Tools, RAG.. T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S ) Developing code Debugging code Living life Deploying code Domains Writing a novel Organizing travels
  • 25. © 2023, Amazon Web Services, Inc. or its affiliates. 25 Why are people talking about things like Agents, Tools, RAG.. T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S ) Developing code Debugging code Deploying code Domains Autocomplete a function method Build a new ERP from scratch Resolve an error message Rearchitect the app to avoid this error at scale Suggest what I could do today Organize my whole life for the next 10 years Level of abstraction Simple task Complex task Writing a novel Living life Organizing travels Tell me how long it takes driving from Florence to Rome Plan in details all my 1-year long sabbatical
  • 26. © 2023, Amazon Web Services, Inc. or its affiliates. 26 Why are people talking about things like Agents, Tools, RAG.. T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S ) Domains Level of abstraction Simple task Complex task Progressive complexity Completion Chat Reasoning Acting à à à coverage Domain A function of the corpus data
  • 27. © 2023, Amazon Web Services, Inc. or its affiliates. 27 Why are people talking about things like Agents, Tools, RAG.. T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S ) Domains Level of abstraction Simple task Complex task Large Language model Smaller purpose built/tuned model Models may need to be helped / guided to achieve goals where task complexity is too high or simply for missing domain knowledge
  • 28. © 2023, Amazon Web Services, Inc. or its affiliates. 28 Why are people talking about things like Agents, Tools, RAG.. A N E X A M P L E O F C O T ( C H A I N O F T H O U G H T S ) https://arxiv.org/abs/2201.11903 But sometimes in-prompt Chain of Thoughts (CoT) isn’t enough for the LLM to reason properly W elcom e to the m agic world of “prom pt engineering”
  • 29. © 2023, Amazon Web Services, Inc. or its affiliates. 29 Why are people talking about things like Agents, Tools, RAG.. A N E X A M P L E O F T H E F A C T C H E C K I N G W I T H P R O M P T C H A I N I N G P R O C E S S https://it20.info/2023/6/the-dark-zone-between-the-magic-genai-experience-and-the-large-language-model/ Q: What is the biggest clock in the world?
  • 30. © 2023, Amazon Web Services, Inc. or its affiliates. 30 Why are people talking about things like Agents, Tools, RAG.. LLM A N E X A M P L E O F T O O L S You “what’s the weather like today in Rome?” Math function code Web search code “Calculate <very complex formula>” (1a) (2a) (1b) (2b)
  • 31. © 2023, Amazon Web Services, Inc. or its affiliates. 31 Why are people talking about things like Agents, Tools, RAG.. A N E X A M P L E O F R E A C T ( R E A S O N I N G A N D A C T I N G ) https://arxiv.org/abs/2210.03629
  • 32. © 2023, Amazon Web Services, Inc. or its affiliates. 32 Why are people talking about things like Agents, Tools, RAG.. LLM A N E X A M P L E O F R E A C T ( R E A S O N I N G A N D A C T I N G ) You Iterating reasoning code “Write the solution for <very complex task>” (1) (2) https://arxiv.org/abs/2210.03629 Tool
  • 33. © 2023, Amazon Web Services, Inc. or its affiliates. 33 Why are people talking about things like Agents, Tools, RAG.. LLM Vector DB A N E X A M P L E O F R A G ( R E T R I E V A L - A U G M E N T E D G E N E R A T I O N ) You Private corpus of data embedding “Write a draft email on <specific company secret topic>” (1) (2)
  • 34. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. Prompt context Vs. RAG Vs. fine-tuning 34
  • 35. © 2023, Amazon Web Services, Inc. or its affiliates. 35 Prompt context Vs. RAG Vs. fine-tuning: I am lost q Fair. There are three ways to increase an LLM answer precision and correctness 1. Provide context in the prompt 2. Augment the LLM with an external source of vectorized data at inference time (RAG) 3. Fine tune the LLM with additional data q There isn’t a global right or wrong approach. As often happens, it depends q Also they are not mutually exclusive q They could (and often should) be used together to achieve optimal results
  • 36. © 2023, Amazon Web Services, Inc. or its affiliates. 36 q Rate of the change of the data source q Limits, cost, latency, speed of prompt context tokens q Cost of fine tuning q including the work required to “prepare the data” q Cost of creating and maintaining the vector store Prompt context Vs. RAG Vs. fine-tuning: when to use what?
  • 37. © 2023, Amazon Web Services, Inc. or its affiliates. 37 q Complexity of the architecture q fine tuning may make the architecture easier (with an upfront fine-tuning investment) q Shape and location of the data source q Precision of the outcome q no absolute rules exist, testing may be required q Personal experience of the team building the solution q “I have always used RAG and that’s what I am comfortable with” Prompt context Vs. RAG Vs. fine-tuning: when to use what?
  • 38. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. Who’s Gen AI for? 38
  • 39. © 2023, Amazon Web Services, Inc. or its affiliates. 39 Who’s Gen AI for? q For the developer that is writing code q e.g. code assistants e.g. AWS CodeWhisperer q For the developer that wants to use English as a programming language q e.g. the example of the WhatsApp vocal messages q For the ops person that does not want to write a SQL query to extract data q e.g. https://www.honeycomb.io/blog/introducing-query-assistant
  • 40. © 2023, Amazon Web Services, Inc. or its affiliates. 40 Who’s Gen AI for? q For the business analyst that wants to create a report off of a spreadsheet q For the journalist that wants to draft an article on a specific topic q Etc. etc. q Come see me later to chat about the story of my plumber impressed by “chat … chat …. chat something” (true story)
  • 41. © 2023, Amazon Web Services, Inc. or its affiliates. 41 Conclusions q Get ready for this new wave. It’s coming and (I think) it’s staying. q LLMs have moved the needle of the art of possible q But LLMs alone are not enough. You need to … make LLMs useful. q Gen AI is for everyone, not just for “builders”. It’s for “consumers” of tech too. q Go explore! Go build!
  • 42. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. Massimo Re Ferrè Senior Principal Technologist, AWS Twitter: @mreferre E-mail: mreferre@amazon.com Thanks!