SlideShare uma empresa Scribd logo
1 de 11
Baixar para ler offline
zzzzzzz
Fact based
Generative AI
Leverage existing knowledge to generate
specific, up-to-date yet tailored results.
Stefan Weber
Senior Director Software Development
Telelink Business Services
OutSystems MVP – AWS Community Builder
2
Topics 1. Challenge – Why AI does not tell the truth
2. Solution – Retrievable Augmented Generation
and Fine Tuning a Large Language Model
3. Demo – Munich Airport QnA Tailored Answering
4. Flow – Implementing a RAG Pipeline with
OutSystems, OpenAI and Qdrant.
5. Run – Choose where to host your Large Language
Model
6. Forge – Ready made components for your RAG
flow.
Challenge
Large Language Models (LLM) exhibit inconsistency. On occasion, they excel in
providing accurate responses to inquiries, while at other times, they simply parrot
unrelated facts extracted from their training corpus. Their occasional lapses into
inconsistency are due to their systemic limitations.
LLMs possess a statistical understanding of word relationships but lack genuine
comprehension of meaning.
3
3
4
Retrievable Augmented Generation (RAG)
RAG is a technique for improving the quality of
generated responses by an LLM. In this
process, information from external knowledge
sources, along with further instructions, is
provided to generate fact-based results.
Solution
Model Fine-Tuning
LLM fine-tuning is a process of adjusting and adapting
a pre-trained large language model to perform specific
tasks or to cater to a particular domain more
effectively. While fine-tuning proves effective in
emulating behaviors, it's not the best fit for cases that
require extensive domain knowledge, such as legal or
financial sectors.
RAG and Model Fine-Tuning are not mutually exclusive but should be used in combination to ensure high-quality and uniform
results.
Demo – Munich Airport
QnA Tailored
Answering
5
5
RAG Flow
Turn information into data – Extract data
from information sources and create
semantic vector embeddings.
 Query – Perform semantic similarity
search across vectorized data.
 Synthesize – Prepare one-shot or
chain of thought prompt instructions
and inject search results.
 Generate – Let LLM completions
generate tailored results based on
prompt.
6
Building a custom Retrievable Augmented
Generation Pipeline – Building Blocks
7
Text Cleaning
Document
Segmentation
Deduplication
Entity
Resolution
Corpus
Diversity
Annotations
8
Vendor
Using the public APIs of LLM vendors
 OpenAI
 Aleph Alpha
 Cohere
 Anthropic
 …
Using a Vendor Public API is the most
cost-effective way to get started with
LLMs and generative AI.
At the same time, you have no influence
on the lifecycle of data and there are
fine-tuning limitations.
Running Large Language Models
Public Cloud Runtimes
Hosting a model using a runtime of a
public cloud provider
 AWS Sagemaker / Bedrock
 Azure OpenAI
 Huggingface
Full control of data lifecycle and security.
Possibility to offload parts of data
transformation to the platform to reduce
latency.
Own Datacenter
Build your own runtime environment or
use a prebuilt runtime.
9
Forge Components
Integration Components
 Azure OpenAI – OutSystems Platform Maintenance
Team
 OpenAI Embeddings – Stefan Weber
 Qdrant Vector Database – Stefan Weber
 AWS Bedrock Runtime – Stefan Weber
Demo Application
 Vector Embeddings Demo – Stefan Weber
Information Extraction Components
 Adobe Acrobat Services – Stefan Weber
 AWS Textract – OutSystems Platform Maintenance Team
Prompt Templating
 Handlebars.Net – Miguel Antunes
Custom Code
 Microsoft Semantic Kernel – Microsoft
 LangChain – LangChain Inc. (e.g. via AWS Lambda Integration)
10
Links
 OutSystems, OpenAI Embeddings and Qdrant Vector
Database—Find Similar
 OutSystems, OpenAI Embeddings and Qdrant Vector
Database—Answer Right
 Get Started with OutSystems and Amazon Bedrock
 Master Prompt Engineering
 RAG vs Fine Tuning (Medium Member Article)
 OpenAI
 Qdrant Vector Database
 Amazon Bedrock
Stefan Weber
Senior Director Software Development
Telelink Business Services
OutSystems MVP – AWS Community Builder
stefan.weber@tbs.tech
+49 1590 1888452
https://www.tbs.tech
https://www.linkedin.com/in/stefanweber1/

Mais conteúdo relacionado

Semelhante a Fact based Generative AI Leverage existing knowledge to generate specific, up-to-date yet tailored results

Real time data processing and model inferncing platform with Kafka streams (N...
Real time data processing and model inferncing platform with Kafka streams (N...Real time data processing and model inferncing platform with Kafka streams (N...
Real time data processing and model inferncing platform with Kafka streams (N...KafkaZone
 
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...Mark Tabladillo
 
Build, Train, and Deploy ML Models at Scale
Build, Train, and Deploy ML Models at ScaleBuild, Train, and Deploy ML Models at Scale
Build, Train, and Deploy ML Models at ScaleAmazon Web Services
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024Timothy Spann
 
Integrating Splunk into your Spring Applications
Integrating Splunk into your Spring ApplicationsIntegrating Splunk into your Spring Applications
Integrating Splunk into your Spring ApplicationsDamien Dallimore
 
Innovation with ai at scale on the edge vt sept 2019 v0
Innovation with ai at scale  on the edge vt sept 2019 v0Innovation with ai at scale  on the edge vt sept 2019 v0
Innovation with ai at scale on the edge vt sept 2019 v0Ganesan Narayanasamy
 
Latest trends in information technology
Latest trends in information technologyLatest trends in information technology
Latest trends in information technologyEldos Kuriakose
 
Stream Processing with CompletableFuture and Flow in Java 9
Stream Processing with CompletableFuture and Flow in Java 9Stream Processing with CompletableFuture and Flow in Java 9
Stream Processing with CompletableFuture and Flow in Java 9Trayan Iliev
 
Stream SQL eventflow visual programming for real programmers presentation
Stream SQL eventflow visual programming for real programmers presentationStream SQL eventflow visual programming for real programmers presentation
Stream SQL eventflow visual programming for real programmers presentationstreambase
 
Linux Assignment 3
Linux Assignment 3Linux Assignment 3
Linux Assignment 3Diane Allen
 
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningPaige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningEdunomica
 
Why Cloud Management Makes Sense
Why Cloud Management Makes SenseWhy Cloud Management Makes Sense
Why Cloud Management Makes SenseRightScale
 
Ibm watson - who what why
Ibm   watson - who what whyIbm   watson - who what why
Ibm watson - who what whyRick Bouter
 
Recommendations for Building Machine Learning Software
Recommendations for Building Machine Learning SoftwareRecommendations for Building Machine Learning Software
Recommendations for Building Machine Learning SoftwareJustin Basilico
 
Venkata Sateesh_BigData_Latest-Resume
Venkata Sateesh_BigData_Latest-ResumeVenkata Sateesh_BigData_Latest-Resume
Venkata Sateesh_BigData_Latest-Resumevenkata sateeshs
 
Build Applications on the Microsoft Platform Using Eclipse, Java, Ruby and PHP!
Build Applications on the Microsoft Platform Using Eclipse, Java, Ruby and PHP!Build Applications on the Microsoft Platform Using Eclipse, Java, Ruby and PHP!
Build Applications on the Microsoft Platform Using Eclipse, Java, Ruby and PHP!goodfriday
 
MY NEWEST RESUME
MY NEWEST RESUMEMY NEWEST RESUME
MY NEWEST RESUMEHan Yan
 

Semelhante a Fact based Generative AI Leverage existing knowledge to generate specific, up-to-date yet tailored results (20)

Real time data processing and model inferncing platform with Kafka streams (N...
Real time data processing and model inferncing platform with Kafka streams (N...Real time data processing and model inferncing platform with Kafka streams (N...
Real time data processing and model inferncing platform with Kafka streams (N...
 
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
 
Build, Train, and Deploy ML Models at Scale
Build, Train, and Deploy ML Models at ScaleBuild, Train, and Deploy ML Models at Scale
Build, Train, and Deploy ML Models at Scale
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
 
Integrating Splunk into your Spring Applications
Integrating Splunk into your Spring ApplicationsIntegrating Splunk into your Spring Applications
Integrating Splunk into your Spring Applications
 
DevOps for DataScience
DevOps for DataScienceDevOps for DataScience
DevOps for DataScience
 
Games en
Games enGames en
Games en
 
Innovation with ai at scale on the edge vt sept 2019 v0
Innovation with ai at scale  on the edge vt sept 2019 v0Innovation with ai at scale  on the edge vt sept 2019 v0
Innovation with ai at scale on the edge vt sept 2019 v0
 
Latest trends in information technology
Latest trends in information technologyLatest trends in information technology
Latest trends in information technology
 
Stream Processing with CompletableFuture and Flow in Java 9
Stream Processing with CompletableFuture and Flow in Java 9Stream Processing with CompletableFuture and Flow in Java 9
Stream Processing with CompletableFuture and Flow in Java 9
 
Mohammed_Murtuza_EAI
Mohammed_Murtuza_EAIMohammed_Murtuza_EAI
Mohammed_Murtuza_EAI
 
Stream SQL eventflow visual programming for real programmers presentation
Stream SQL eventflow visual programming for real programmers presentationStream SQL eventflow visual programming for real programmers presentation
Stream SQL eventflow visual programming for real programmers presentation
 
Linux Assignment 3
Linux Assignment 3Linux Assignment 3
Linux Assignment 3
 
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningPaige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
 
Why Cloud Management Makes Sense
Why Cloud Management Makes SenseWhy Cloud Management Makes Sense
Why Cloud Management Makes Sense
 
Ibm watson - who what why
Ibm   watson - who what whyIbm   watson - who what why
Ibm watson - who what why
 
Recommendations for Building Machine Learning Software
Recommendations for Building Machine Learning SoftwareRecommendations for Building Machine Learning Software
Recommendations for Building Machine Learning Software
 
Venkata Sateesh_BigData_Latest-Resume
Venkata Sateesh_BigData_Latest-ResumeVenkata Sateesh_BigData_Latest-Resume
Venkata Sateesh_BigData_Latest-Resume
 
Build Applications on the Microsoft Platform Using Eclipse, Java, Ruby and PHP!
Build Applications on the Microsoft Platform Using Eclipse, Java, Ruby and PHP!Build Applications on the Microsoft Platform Using Eclipse, Java, Ruby and PHP!
Build Applications on the Microsoft Platform Using Eclipse, Java, Ruby and PHP!
 
MY NEWEST RESUME
MY NEWEST RESUMEMY NEWEST RESUME
MY NEWEST RESUME
 

Último

Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in NoidaBuds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in Noidabntitsolutionsrishis
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...Technogeeks
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfStefano Stabellini
 
How to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfHow to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfLivetecs LLC
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 

Último (20)

Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva
 
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in NoidaBuds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdf
 
How to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfHow to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdf
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 

Fact based Generative AI Leverage existing knowledge to generate specific, up-to-date yet tailored results

  • 1. zzzzzzz Fact based Generative AI Leverage existing knowledge to generate specific, up-to-date yet tailored results. Stefan Weber Senior Director Software Development Telelink Business Services OutSystems MVP – AWS Community Builder
  • 2. 2 Topics 1. Challenge – Why AI does not tell the truth 2. Solution – Retrievable Augmented Generation and Fine Tuning a Large Language Model 3. Demo – Munich Airport QnA Tailored Answering 4. Flow – Implementing a RAG Pipeline with OutSystems, OpenAI and Qdrant. 5. Run – Choose where to host your Large Language Model 6. Forge – Ready made components for your RAG flow.
  • 3. Challenge Large Language Models (LLM) exhibit inconsistency. On occasion, they excel in providing accurate responses to inquiries, while at other times, they simply parrot unrelated facts extracted from their training corpus. Their occasional lapses into inconsistency are due to their systemic limitations. LLMs possess a statistical understanding of word relationships but lack genuine comprehension of meaning. 3 3
  • 4. 4 Retrievable Augmented Generation (RAG) RAG is a technique for improving the quality of generated responses by an LLM. In this process, information from external knowledge sources, along with further instructions, is provided to generate fact-based results. Solution Model Fine-Tuning LLM fine-tuning is a process of adjusting and adapting a pre-trained large language model to perform specific tasks or to cater to a particular domain more effectively. While fine-tuning proves effective in emulating behaviors, it's not the best fit for cases that require extensive domain knowledge, such as legal or financial sectors. RAG and Model Fine-Tuning are not mutually exclusive but should be used in combination to ensure high-quality and uniform results.
  • 5. Demo – Munich Airport QnA Tailored Answering 5 5
  • 6. RAG Flow Turn information into data – Extract data from information sources and create semantic vector embeddings.  Query – Perform semantic similarity search across vectorized data.  Synthesize – Prepare one-shot or chain of thought prompt instructions and inject search results.  Generate – Let LLM completions generate tailored results based on prompt. 6
  • 7. Building a custom Retrievable Augmented Generation Pipeline – Building Blocks 7 Text Cleaning Document Segmentation Deduplication Entity Resolution Corpus Diversity Annotations
  • 8. 8 Vendor Using the public APIs of LLM vendors  OpenAI  Aleph Alpha  Cohere  Anthropic  … Using a Vendor Public API is the most cost-effective way to get started with LLMs and generative AI. At the same time, you have no influence on the lifecycle of data and there are fine-tuning limitations. Running Large Language Models Public Cloud Runtimes Hosting a model using a runtime of a public cloud provider  AWS Sagemaker / Bedrock  Azure OpenAI  Huggingface Full control of data lifecycle and security. Possibility to offload parts of data transformation to the platform to reduce latency. Own Datacenter Build your own runtime environment or use a prebuilt runtime.
  • 9. 9 Forge Components Integration Components  Azure OpenAI – OutSystems Platform Maintenance Team  OpenAI Embeddings – Stefan Weber  Qdrant Vector Database – Stefan Weber  AWS Bedrock Runtime – Stefan Weber Demo Application  Vector Embeddings Demo – Stefan Weber Information Extraction Components  Adobe Acrobat Services – Stefan Weber  AWS Textract – OutSystems Platform Maintenance Team Prompt Templating  Handlebars.Net – Miguel Antunes Custom Code  Microsoft Semantic Kernel – Microsoft  LangChain – LangChain Inc. (e.g. via AWS Lambda Integration)
  • 10. 10 Links  OutSystems, OpenAI Embeddings and Qdrant Vector Database—Find Similar  OutSystems, OpenAI Embeddings and Qdrant Vector Database—Answer Right  Get Started with OutSystems and Amazon Bedrock  Master Prompt Engineering  RAG vs Fine Tuning (Medium Member Article)  OpenAI  Qdrant Vector Database  Amazon Bedrock
  • 11. Stefan Weber Senior Director Software Development Telelink Business Services OutSystems MVP – AWS Community Builder stefan.weber@tbs.tech +49 1590 1888452 https://www.tbs.tech https://www.linkedin.com/in/stefanweber1/