SlideShare uma empresa Scribd logo
1 de 7
Baixar para ler offline
NOTES FROM AN INDUSTRY PRACTITIONER ROUNDTABLE
NEW YORK CITY, 3RD
MAY 2023
Managing the risk of large
language models in
financial services
Context
Over the last 2-3 years, Financial Institutions (FIs) have been making changes to their
internal policies, standards and processes to manage the risks arising from the adoption
of ML models. However, in many FIs, these mechanisms, created for small scale
experimentation and adoption, have failed to scale up over time. The challenge has
become even more acute as FIs grapple with the implications of recent advances in
Large Language Models (LLMs).
This roundtable brought together practitioners from 10-15 leading FIs (including 6 of the
top 10 US banks). Representatives from across first line Data and Analytics/ AI teams,
Model Risk, Data Management and analytics-intensive functions spent over 2 hours
comparing notes on
• The most relevant LLM use cases
• Emerging risk considerations
• Approaches to managing such risks, including but not limited to Model Risk
• Implications for people, skills and ways of working
The event kicked off with short presentations by Agus Sudjianto, EVP and Head of
Corporate Model Risk at Wells Fargo, Sri Krishnamurthy, founder of Quant
University, and Anupam Datta, co-founder of TruEra and a former professor at
Carnegie Mellon University. This was followed by an open, Chatham House rules
discussion with all round-table participants.
The following pages summarize the key points of discussion during the 2–3-hour
discussion. There was broad agreement on some topics, and differences in opinion on
other
Key Takeaways
LLM Use Cases
Q&A on internal knowledge corpus is the first “killer app”
Text summarization/ translation is popular too
“Traditional” NLP use cases are on steroids with LLMs
Writing/ reading code may be the Next Big Thing
Appetite for customer-facing use seems limited for now
Risks – New and Old
Accuracy/ hallucination concerns are top-of-mind
Security/ Privacy feature heavily with hosted LLMs
There is uncertainty over IP obligations and rights
Reputation risk is holding back direct customer use
Bias will become important with customer-facing use cases
Managing Risks – Model Risk and
Beyond
LLM risk is an end-to-end system risk (not just model risk)
Most risk managers seem to be in exploratory mode
Access to embeddings seems essential for Model Validation
Testing output quality in generative AI is a major challenge
People, Skills, Ways of Working
Everyone agrees that the nature of roles will change
There is less alignment on the extent/ pace of job losses
“Democratization” of AI will have significant impact
Will Specialists win over Generalists? Jury is still out
1. LLM Use Cases in Finance
The first wave of use cases in FIs are a mix of some in the “too difficult to
solve” category, and others in which LLMs represent significant
improvements over existing alternatives.
“Killer app” – information retrieval/ contextual search (q&a)
Q&A applications that allow staff to access an internal corpus of relevant knowledge in a simple
and reliable way, with a much lower level of curation needed compared to alternatives.
Examples: financial advisors querying internal research on investment opportunities; staff
querying FI’s internal policies to get HR or compliance advice.
FI has control over the quality and provenance of the internal corpus. Testing for quality is
easier if the embeddings created by the LLM on the internal corpus are available.
Text summarization (potentially also with translation)
Creating document summaries, typically for review and use by an expert human – e.g., ‘reading’
multiple documents to create a Know-Your-Customer profile, translating and summarizing a
200-page report in German for a US-based analyst to consume in 5 minutes.
‘Traditional’ NLP apps - on steroids now!
Traditional NLP use cases – e.g., extracting relevant data from unstructured data sources for
operational automation, sentiment analysis on customer complaints or external media reports,
analyzing trading room communication for potential misconduct – set for a boom, as LLMs
• Promise greater accuracy on existing data extraction use cases
• Make NLP viable for smaller, “long tail” use cases (previously inaccessible due to need
for labelled data and custom model development/ training)
• Add complementary ‘last mile’ generative AI capabilities on top of data extraction (e.g.
text summarization) that increase end-to-end automation levels
Reading/ writing code – next big thing?
Significant medium-term potential in LLM applications that can translate code into non-technical
language and vice versa.
Customer-facing applications – low appetite for now!
Potential to automate aspects of customer engagement – e.g., automating service requests –
considered but mostly deferred for now. At this stage, FIs are choosing to attack lower hanging
fruit (above) with fewer privacy, security, bias and reputation risks.
2. Risks with LLM Use
Accuracy/ hallucination concerns dominate, but privacy/ security/ IP/
reputation risks matter too.
Accuracy/ “hallucinations”
Most LLM applications prone to hallucinations, weak in math and unable to provide traceability
of answers, at least for now. Except where applying LLMs on FI’s internal knowledge corpuses.
Accuracy-related risk perception changes significantly based on the end user. Responses from
an LLM-based Q&A system could be:
• Low risk if being reviewed by an expert user before being acted upon
• Somewhat riskier if the end user is inside the FI but lacks domain expertise
• Very risky if put in front of an end customer.
Accuracy is also heavily impacted by the exact wording of the question. “English language
programming” can be too flexible and may need to be translated to resemble more structured
‘pseudo-code’ to ensure repeatability/ reliability.
Security and privacy
Concerns around security and privacy breaches, due to the need to expose internal FI data to
the LLM provider. Some FIs have opted to host LLMs inside their own networks (using open
source LLMs or through specific arrangements with the likes of Azure/ Open AI).
IP considerations
Risks of inadvertent IP/ copyright breaches by the FI, in case the LLM was originally trained on
external data without appropriate permissions. Uncertainty over IP ownership when content or
code is created using LLMs.
Reputation
A particular source of concern in customer-facing use cases (e.g., a chatbot that is ‘tricked’ into
using inappropriate language by an end-user trying to embarrass the FI). Concerns around
unjust bias not yet significant in the group, since none of the participants is focusing on
customer-facing use cases.
***
Beyond the risks themselves, there was also concern about the potential originators of such
risks– e.g., fintechs attempting to compete with regulated entities but without the internal
guardrails that the latter have in place; staff inside regulated FIs who find themselves using
models for the first time due to the LLM-led “democratization” of AI.
3. Managing LLM Risks
Broad agreement that the approach to risk management should be “end-
to-end”, rather than solely focused on model risk.
Overall - everyone still in experimental mode
A majority of the participants claimed that ChatGPT was ‘banned’ from use in their
organizations, except in narrow supervised conditions.
Several participants acknowledged that they were still trying to put their arms around the risks
of LLM. They were in learning mode, and had implemented ad-hoc controls (e.g., a single
channel to approve any LLM use case around the FI) in the interim. One participant felt that it
was important to let low risk use cases move forward, and use those experiences to flesh out
the control framework for higher risk ones.
Model risk – still working through implications
Many FIs opined that MRM departments were under pressure from all parts of their
organizations to review/ approve potential LLM use cases. Some felt unprepared for the sudden
spike in interest.
Some participants expressed confidence that when the LLMs are used in the embedding space
(e.g., with the info retrieval type use cases), it was possible to apply robust MRM controls. Some
pointed out that this was already underway with BERT style models.
However, where the embeddings are not made available, outcome-based testing is the only
option right now. This naturally limits the extent to which MRM can approve such models for
high stakes use cases, and/or results in imposition of additional human controls to compensate.
Outcome-based testing is hard since exhaustive testing is not possible in most LLM use cases.
Different prompts and/or end-user queries can produce different outcomes with same
underlying model.
The challenges of testing generative ai output quality
For Generative AI, an additional testing challenge is to identify the quality of the output itself.
Human feedback, where available, is of course ideal. However, in many situations, that is not
a viable or scalable option. Here, considering a combination of automated feedback functions
may be helpful (e.g., relevance testing or sentiment analysis of the answers)
The analogy is to data labelling in traditional ML, which was initially all human, but subsequently
used automated techniques. Similar programmatic approaches may provide a way to test the
quality of Generative AI system outputs.
4. The People Dimension
A sense that LLMs will have profound impact on the nature, and possibly
the number, of jobs in the industry, but not much clarity or agreement
beyond that.
Elimination of roles – unclear at this stage
Opinion seemed divided on if/ how quickly LLMs will result in large scale reduction of jobs.
Some participants felt that there may be fewer roles in coding and data science; others foresaw
significant reductions in operations. However, few offered a view on the timing or scale of such
losses
Consensus that many existing roles will change
There was general agreement among participants that over time, several roles inside FIs will
certainly change a lot. One participant highlighted the potential for LLMs to automate the
interpretation of code and/or the generation of new code from natural language instructions. At
current capability levels, this would still leave a role for coders, but much more in an expert
‘checker’ role.
Another opined that LLMs are democratizing data science (since “anyone can implement LLM
based applications”), and this would have significant implications, positive and negative, for
data scientists.
The talent pipeline challenge
One potential challenge is the need for apprenticeship in order to build the necessary domain
expertise needed to work with advanced AI tools: if most of the ‘grunt work’ that was previously
done by entry level/ junior staff can be automated, then how would the pipeline for tomorrow’s
experts be built?
Experts vs. Generalists: no consensus yet
An unresolved question was whether widespread use of LLMs will tilt the balance more towards
generalists or specialists.
• Will LLMs empower generalists to overcome the disadvantages arising from their lack
of specialist knowledge, or
• Will they further entrench the value of specialists who can complement their deep
domain knowledge with the outputs from LLM applications

Mais conteúdo relacionado

Mais procurados

Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...David Talby
 
LanGCHAIN Framework
LanGCHAIN FrameworkLanGCHAIN Framework
LanGCHAIN FrameworkKeymate.AI
 
Using Generative AI
Using Generative AIUsing Generative AI
Using Generative AIMark DeLoura
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scaleMaxim Salnikov
 
The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021Steve Omohundro
 
Generative AI and law.pptx
Generative AI and law.pptxGenerative AI and law.pptx
Generative AI and law.pptxChris Marsden
 
Generative AI Art - The Dark Side
Generative AI Art - The Dark SideGenerative AI Art - The Dark Side
Generative AI Art - The Dark SideAbhinav Gupta
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...ssuser4edc93
 
Generative AI Risks & Concerns
Generative AI Risks & ConcernsGenerative AI Risks & Concerns
Generative AI Risks & ConcernsAjitesh Kumar
 
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITYGENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITYAndre Muscat
 
Generative AI for the rest of us
Generative AI for the rest of usGenerative AI for the rest of us
Generative AI for the rest of usMassimo Ferre'
 
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1DianaGray10
 
ChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptxChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptxJesus Rodriguez
 
Cavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures | Deep Dive: Generative AICavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures | Deep Dive: Generative AICavalry Ventures
 
Large Language Models Bootcamp
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models BootcampData Science Dojo
 
The Future is in Responsible Generative AI
The Future is in Responsible Generative AIThe Future is in Responsible Generative AI
The Future is in Responsible Generative AISaeed Al Dhaheri
 
Introduction to LLMs
Introduction to LLMsIntroduction to LLMs
Introduction to LLMsLoic Merckel
 
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...DataScienceConferenc1
 

Mais procurados (20)

Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
 
LanGCHAIN Framework
LanGCHAIN FrameworkLanGCHAIN Framework
LanGCHAIN Framework
 
LLMs Bootcamp
LLMs BootcampLLMs Bootcamp
LLMs Bootcamp
 
Using Generative AI
Using Generative AIUsing Generative AI
Using Generative AI
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scale
 
The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021
 
Generative AI and law.pptx
Generative AI and law.pptxGenerative AI and law.pptx
Generative AI and law.pptx
 
Generative AI Art - The Dark Side
Generative AI Art - The Dark SideGenerative AI Art - The Dark Side
Generative AI Art - The Dark Side
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
 
Generative AI Risks & Concerns
Generative AI Risks & ConcernsGenerative AI Risks & Concerns
Generative AI Risks & Concerns
 
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITYGENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
 
Generative AI
Generative AIGenerative AI
Generative AI
 
Generative AI for the rest of us
Generative AI for the rest of usGenerative AI for the rest of us
Generative AI for the rest of us
 
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
 
ChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptxChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptx
 
Cavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures | Deep Dive: Generative AICavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures | Deep Dive: Generative AI
 
Large Language Models Bootcamp
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models Bootcamp
 
The Future is in Responsible Generative AI
The Future is in Responsible Generative AIThe Future is in Responsible Generative AI
The Future is in Responsible Generative AI
 
Introduction to LLMs
Introduction to LLMsIntroduction to LLMs
Introduction to LLMs
 
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
 

Semelhante a Managing-the-Risks-of-LLMs-in-FS-Industry-Roundtable-TruEra-QuantU.pdf

How to test LLMs in production.pdf
How to test LLMs in production.pdfHow to test LLMs in production.pdf
How to test LLMs in production.pdfAnastasiaSteele10
 
Train foundation model for domain-specific language model
Train foundation model for domain-specific language modelTrain foundation model for domain-specific language model
Train foundation model for domain-specific language modelBenjaminlapid1
 
Concerns Examiners Have About Your Spreadsheets and How to Respond
Concerns Examiners Have About Your Spreadsheets and How to RespondConcerns Examiners Have About Your Spreadsheets and How to Respond
Concerns Examiners Have About Your Spreadsheets and How to RespondLibby Bierman
 
Governing risk from decision-making learning algorithms (DMLAs)
Governing risk from decision-making learning algorithms(DMLAs)Governing risk from decision-making learning algorithms(DMLAs)
Governing risk from decision-making learning algorithms (DMLAs)Anca Georgiana Rusu
 
B potential pitfalls_of_process_modeling_part_b-2
B potential pitfalls_of_process_modeling_part_b-2B potential pitfalls_of_process_modeling_part_b-2
B potential pitfalls_of_process_modeling_part_b-2Jean-François Périé
 
AI MODELS USAGE IN FINTECH PRODUCTS: PM APPROACH & BEST PRACTICES by Kasthuri...
AI MODELS USAGE IN FINTECH PRODUCTS: PM APPROACH & BEST PRACTICES by Kasthuri...AI MODELS USAGE IN FINTECH PRODUCTS: PM APPROACH & BEST PRACTICES by Kasthuri...
AI MODELS USAGE IN FINTECH PRODUCTS: PM APPROACH & BEST PRACTICES by Kasthuri...ISPMAIndia
 
Which generation of siem?
Which generation of siem?Which generation of siem?
Which generation of siem?Ertugrul Akbas
 
Simulation Strategy Show
Simulation Strategy ShowSimulation Strategy Show
Simulation Strategy Showjsturman
 
How adversaries interfere with AI and ML systems
How adversaries interfere with AI and ML systemsHow adversaries interfere with AI and ML systems
How adversaries interfere with AI and ML systemsaNumak & Company
 
Interpretable Machine Learning_ Techniques for Model Explainability.
Interpretable Machine Learning_ Techniques for Model Explainability.Interpretable Machine Learning_ Techniques for Model Explainability.
Interpretable Machine Learning_ Techniques for Model Explainability.Tyrion Lannister
 
The President Of A Company
The President Of A CompanyThe President Of A Company
The President Of A CompanyNavy Savchenko
 
Js Sim Strategy 2010
Js Sim Strategy 2010Js Sim Strategy 2010
Js Sim Strategy 2010jsturman
 
Real-world Strategies for Debugging Machine Learning Systems
Real-world Strategies for Debugging Machine Learning SystemsReal-world Strategies for Debugging Machine Learning Systems
Real-world Strategies for Debugging Machine Learning SystemsDatabricks
 
Technolody Acceptance Model
Technolody Acceptance ModelTechnolody Acceptance Model
Technolody Acceptance ModelKimberly Thomas
 
Transformation of legacy landscape in the insurance world
Transformation of legacy landscape in the insurance worldTransformation of legacy landscape in the insurance world
Transformation of legacy landscape in the insurance worldNIIT Technologies
 
IRJET- Social Network Message Credibility: An Agent-based Approach
IRJET- Social Network Message Credibility: An Agent-based ApproachIRJET- Social Network Message Credibility: An Agent-based Approach
IRJET- Social Network Message Credibility: An Agent-based ApproachIRJET Journal
 
IRJET - Social Network Message Credibility: An Agent-based Approach
IRJET -  	  Social Network Message Credibility: An Agent-based ApproachIRJET -  	  Social Network Message Credibility: An Agent-based Approach
IRJET - Social Network Message Credibility: An Agent-based ApproachIRJET Journal
 
Five Mistakes of Vulnerability Management
Five Mistakes of Vulnerability ManagementFive Mistakes of Vulnerability Management
Five Mistakes of Vulnerability ManagementAnton Chuvakin
 

Semelhante a Managing-the-Risks-of-LLMs-in-FS-Industry-Roundtable-TruEra-QuantU.pdf (20)

How to test LLMs in production.pdf
How to test LLMs in production.pdfHow to test LLMs in production.pdf
How to test LLMs in production.pdf
 
Train foundation model for domain-specific language model
Train foundation model for domain-specific language modelTrain foundation model for domain-specific language model
Train foundation model for domain-specific language model
 
Concerns Examiners Have About Your Spreadsheets and How to Respond
Concerns Examiners Have About Your Spreadsheets and How to RespondConcerns Examiners Have About Your Spreadsheets and How to Respond
Concerns Examiners Have About Your Spreadsheets and How to Respond
 
Governing risk from decision-making learning algorithms (DMLAs)
Governing risk from decision-making learning algorithms(DMLAs)Governing risk from decision-making learning algorithms(DMLAs)
Governing risk from decision-making learning algorithms (DMLAs)
 
B potential pitfalls_of_process_modeling_part_b-2
B potential pitfalls_of_process_modeling_part_b-2B potential pitfalls_of_process_modeling_part_b-2
B potential pitfalls_of_process_modeling_part_b-2
 
Article-V16
Article-V16Article-V16
Article-V16
 
AI MODELS USAGE IN FINTECH PRODUCTS: PM APPROACH & BEST PRACTICES by Kasthuri...
AI MODELS USAGE IN FINTECH PRODUCTS: PM APPROACH & BEST PRACTICES by Kasthuri...AI MODELS USAGE IN FINTECH PRODUCTS: PM APPROACH & BEST PRACTICES by Kasthuri...
AI MODELS USAGE IN FINTECH PRODUCTS: PM APPROACH & BEST PRACTICES by Kasthuri...
 
Technovision
TechnovisionTechnovision
Technovision
 
Which generation of siem?
Which generation of siem?Which generation of siem?
Which generation of siem?
 
Simulation Strategy Show
Simulation Strategy ShowSimulation Strategy Show
Simulation Strategy Show
 
How adversaries interfere with AI and ML systems
How adversaries interfere with AI and ML systemsHow adversaries interfere with AI and ML systems
How adversaries interfere with AI and ML systems
 
Interpretable Machine Learning_ Techniques for Model Explainability.
Interpretable Machine Learning_ Techniques for Model Explainability.Interpretable Machine Learning_ Techniques for Model Explainability.
Interpretable Machine Learning_ Techniques for Model Explainability.
 
The President Of A Company
The President Of A CompanyThe President Of A Company
The President Of A Company
 
Js Sim Strategy 2010
Js Sim Strategy 2010Js Sim Strategy 2010
Js Sim Strategy 2010
 
Real-world Strategies for Debugging Machine Learning Systems
Real-world Strategies for Debugging Machine Learning SystemsReal-world Strategies for Debugging Machine Learning Systems
Real-world Strategies for Debugging Machine Learning Systems
 
Technolody Acceptance Model
Technolody Acceptance ModelTechnolody Acceptance Model
Technolody Acceptance Model
 
Transformation of legacy landscape in the insurance world
Transformation of legacy landscape in the insurance worldTransformation of legacy landscape in the insurance world
Transformation of legacy landscape in the insurance world
 
IRJET- Social Network Message Credibility: An Agent-based Approach
IRJET- Social Network Message Credibility: An Agent-based ApproachIRJET- Social Network Message Credibility: An Agent-based Approach
IRJET- Social Network Message Credibility: An Agent-based Approach
 
IRJET - Social Network Message Credibility: An Agent-based Approach
IRJET -  	  Social Network Message Credibility: An Agent-based ApproachIRJET -  	  Social Network Message Credibility: An Agent-based Approach
IRJET - Social Network Message Credibility: An Agent-based Approach
 
Five Mistakes of Vulnerability Management
Five Mistakes of Vulnerability ManagementFive Mistakes of Vulnerability Management
Five Mistakes of Vulnerability Management
 

Mais de QuantUniversity

EU Artificial Intelligence Act 2024 passed !
EU Artificial Intelligence Act 2024 passed !EU Artificial Intelligence Act 2024 passed !
EU Artificial Intelligence Act 2024 passed !QuantUniversity
 
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALSPYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALSQuantUniversity
 
Qu for India - QuantUniversity FundRaiser
Qu for India  - QuantUniversity FundRaiserQu for India  - QuantUniversity FundRaiser
Qu for India - QuantUniversity FundRaiserQuantUniversity
 
Ml master class for CFA Dallas
Ml master class for CFA DallasMl master class for CFA Dallas
Ml master class for CFA DallasQuantUniversity
 
Algorithmic auditing 1.0
Algorithmic auditing 1.0Algorithmic auditing 1.0
Algorithmic auditing 1.0QuantUniversity
 
Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...
Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...
Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...QuantUniversity
 
Machine Learning: Considerations for Fairly and Transparently Expanding Acces...
Machine Learning: Considerations for Fairly and Transparently Expanding Acces...Machine Learning: Considerations for Fairly and Transparently Expanding Acces...
Machine Learning: Considerations for Fairly and Transparently Expanding Acces...QuantUniversity
 
Seeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewSeeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewQuantUniversity
 
AI Explainability and Model Risk Management
AI Explainability and Model Risk ManagementAI Explainability and Model Risk Management
AI Explainability and Model Risk ManagementQuantUniversity
 
Algorithmic auditing 1.0
Algorithmic auditing 1.0Algorithmic auditing 1.0
Algorithmic auditing 1.0QuantUniversity
 
Machine Learning in Finance: 10 Things You Need to Know in 2021
Machine Learning in Finance: 10 Things You Need to Know in 2021Machine Learning in Finance: 10 Things You Need to Know in 2021
Machine Learning in Finance: 10 Things You Need to Know in 2021QuantUniversity
 
Bayesian Portfolio Allocation
Bayesian Portfolio AllocationBayesian Portfolio Allocation
Bayesian Portfolio AllocationQuantUniversity
 
Constructing Private Asset Benchmarks
Constructing Private Asset BenchmarksConstructing Private Asset Benchmarks
Constructing Private Asset BenchmarksQuantUniversity
 
Machine Learning Interpretability
Machine Learning InterpretabilityMachine Learning Interpretability
Machine Learning InterpretabilityQuantUniversity
 
Responsible AI in Action
Responsible AI in ActionResponsible AI in Action
Responsible AI in ActionQuantUniversity
 
Qu speaker series 14: Synthetic Data Generation in Finance
Qu speaker series 14: Synthetic Data Generation in FinanceQu speaker series 14: Synthetic Data Generation in Finance
Qu speaker series 14: Synthetic Data Generation in FinanceQuantUniversity
 
Qu speaker series:Ethical Use of AI in Financial Markets
Qu speaker series:Ethical Use of AI in Financial MarketsQu speaker series:Ethical Use of AI in Financial Markets
Qu speaker series:Ethical Use of AI in Financial MarketsQuantUniversity
 

Mais de QuantUniversity (20)

EU Artificial Intelligence Act 2024 passed !
EU Artificial Intelligence Act 2024 passed !EU Artificial Intelligence Act 2024 passed !
EU Artificial Intelligence Act 2024 passed !
 
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALSPYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
 
Qu for India - QuantUniversity FundRaiser
Qu for India  - QuantUniversity FundRaiserQu for India  - QuantUniversity FundRaiser
Qu for India - QuantUniversity FundRaiser
 
Ml master class for CFA Dallas
Ml master class for CFA DallasMl master class for CFA Dallas
Ml master class for CFA Dallas
 
Algorithmic auditing 1.0
Algorithmic auditing 1.0Algorithmic auditing 1.0
Algorithmic auditing 1.0
 
Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...
Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...
Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...
 
Machine Learning: Considerations for Fairly and Transparently Expanding Acces...
Machine Learning: Considerations for Fairly and Transparently Expanding Acces...Machine Learning: Considerations for Fairly and Transparently Expanding Acces...
Machine Learning: Considerations for Fairly and Transparently Expanding Acces...
 
Seeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewSeeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper review
 
AI Explainability and Model Risk Management
AI Explainability and Model Risk ManagementAI Explainability and Model Risk Management
AI Explainability and Model Risk Management
 
Algorithmic auditing 1.0
Algorithmic auditing 1.0Algorithmic auditing 1.0
Algorithmic auditing 1.0
 
Machine Learning in Finance: 10 Things You Need to Know in 2021
Machine Learning in Finance: 10 Things You Need to Know in 2021Machine Learning in Finance: 10 Things You Need to Know in 2021
Machine Learning in Finance: 10 Things You Need to Know in 2021
 
Bayesian Portfolio Allocation
Bayesian Portfolio AllocationBayesian Portfolio Allocation
Bayesian Portfolio Allocation
 
The API Jungle
The API JungleThe API Jungle
The API Jungle
 
Explainable AI Workshop
Explainable AI WorkshopExplainable AI Workshop
Explainable AI Workshop
 
Constructing Private Asset Benchmarks
Constructing Private Asset BenchmarksConstructing Private Asset Benchmarks
Constructing Private Asset Benchmarks
 
Machine Learning Interpretability
Machine Learning InterpretabilityMachine Learning Interpretability
Machine Learning Interpretability
 
Responsible AI in Action
Responsible AI in ActionResponsible AI in Action
Responsible AI in Action
 
Qu speaker series 14: Synthetic Data Generation in Finance
Qu speaker series 14: Synthetic Data Generation in FinanceQu speaker series 14: Synthetic Data Generation in Finance
Qu speaker series 14: Synthetic Data Generation in Finance
 
Qwafafew meeting 5
Qwafafew meeting 5Qwafafew meeting 5
Qwafafew meeting 5
 
Qu speaker series:Ethical Use of AI in Financial Markets
Qu speaker series:Ethical Use of AI in Financial MarketsQu speaker series:Ethical Use of AI in Financial Markets
Qu speaker series:Ethical Use of AI in Financial Markets
 

Último

microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersChitralekhaTherkar
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 

Último (20)

microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of Powders
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 

Managing-the-Risks-of-LLMs-in-FS-Industry-Roundtable-TruEra-QuantU.pdf

  • 1. NOTES FROM AN INDUSTRY PRACTITIONER ROUNDTABLE NEW YORK CITY, 3RD MAY 2023 Managing the risk of large language models in financial services
  • 2. Context Over the last 2-3 years, Financial Institutions (FIs) have been making changes to their internal policies, standards and processes to manage the risks arising from the adoption of ML models. However, in many FIs, these mechanisms, created for small scale experimentation and adoption, have failed to scale up over time. The challenge has become even more acute as FIs grapple with the implications of recent advances in Large Language Models (LLMs). This roundtable brought together practitioners from 10-15 leading FIs (including 6 of the top 10 US banks). Representatives from across first line Data and Analytics/ AI teams, Model Risk, Data Management and analytics-intensive functions spent over 2 hours comparing notes on • The most relevant LLM use cases • Emerging risk considerations • Approaches to managing such risks, including but not limited to Model Risk • Implications for people, skills and ways of working The event kicked off with short presentations by Agus Sudjianto, EVP and Head of Corporate Model Risk at Wells Fargo, Sri Krishnamurthy, founder of Quant University, and Anupam Datta, co-founder of TruEra and a former professor at Carnegie Mellon University. This was followed by an open, Chatham House rules discussion with all round-table participants. The following pages summarize the key points of discussion during the 2–3-hour discussion. There was broad agreement on some topics, and differences in opinion on other
  • 3. Key Takeaways LLM Use Cases Q&A on internal knowledge corpus is the first “killer app” Text summarization/ translation is popular too “Traditional” NLP use cases are on steroids with LLMs Writing/ reading code may be the Next Big Thing Appetite for customer-facing use seems limited for now Risks – New and Old Accuracy/ hallucination concerns are top-of-mind Security/ Privacy feature heavily with hosted LLMs There is uncertainty over IP obligations and rights Reputation risk is holding back direct customer use Bias will become important with customer-facing use cases Managing Risks – Model Risk and Beyond LLM risk is an end-to-end system risk (not just model risk) Most risk managers seem to be in exploratory mode Access to embeddings seems essential for Model Validation Testing output quality in generative AI is a major challenge People, Skills, Ways of Working Everyone agrees that the nature of roles will change There is less alignment on the extent/ pace of job losses “Democratization” of AI will have significant impact Will Specialists win over Generalists? Jury is still out
  • 4. 1. LLM Use Cases in Finance The first wave of use cases in FIs are a mix of some in the “too difficult to solve” category, and others in which LLMs represent significant improvements over existing alternatives. “Killer app” – information retrieval/ contextual search (q&a) Q&A applications that allow staff to access an internal corpus of relevant knowledge in a simple and reliable way, with a much lower level of curation needed compared to alternatives. Examples: financial advisors querying internal research on investment opportunities; staff querying FI’s internal policies to get HR or compliance advice. FI has control over the quality and provenance of the internal corpus. Testing for quality is easier if the embeddings created by the LLM on the internal corpus are available. Text summarization (potentially also with translation) Creating document summaries, typically for review and use by an expert human – e.g., ‘reading’ multiple documents to create a Know-Your-Customer profile, translating and summarizing a 200-page report in German for a US-based analyst to consume in 5 minutes. ‘Traditional’ NLP apps - on steroids now! Traditional NLP use cases – e.g., extracting relevant data from unstructured data sources for operational automation, sentiment analysis on customer complaints or external media reports, analyzing trading room communication for potential misconduct – set for a boom, as LLMs • Promise greater accuracy on existing data extraction use cases • Make NLP viable for smaller, “long tail” use cases (previously inaccessible due to need for labelled data and custom model development/ training) • Add complementary ‘last mile’ generative AI capabilities on top of data extraction (e.g. text summarization) that increase end-to-end automation levels Reading/ writing code – next big thing? Significant medium-term potential in LLM applications that can translate code into non-technical language and vice versa. Customer-facing applications – low appetite for now! Potential to automate aspects of customer engagement – e.g., automating service requests – considered but mostly deferred for now. At this stage, FIs are choosing to attack lower hanging fruit (above) with fewer privacy, security, bias and reputation risks.
  • 5. 2. Risks with LLM Use Accuracy/ hallucination concerns dominate, but privacy/ security/ IP/ reputation risks matter too. Accuracy/ “hallucinations” Most LLM applications prone to hallucinations, weak in math and unable to provide traceability of answers, at least for now. Except where applying LLMs on FI’s internal knowledge corpuses. Accuracy-related risk perception changes significantly based on the end user. Responses from an LLM-based Q&A system could be: • Low risk if being reviewed by an expert user before being acted upon • Somewhat riskier if the end user is inside the FI but lacks domain expertise • Very risky if put in front of an end customer. Accuracy is also heavily impacted by the exact wording of the question. “English language programming” can be too flexible and may need to be translated to resemble more structured ‘pseudo-code’ to ensure repeatability/ reliability. Security and privacy Concerns around security and privacy breaches, due to the need to expose internal FI data to the LLM provider. Some FIs have opted to host LLMs inside their own networks (using open source LLMs or through specific arrangements with the likes of Azure/ Open AI). IP considerations Risks of inadvertent IP/ copyright breaches by the FI, in case the LLM was originally trained on external data without appropriate permissions. Uncertainty over IP ownership when content or code is created using LLMs. Reputation A particular source of concern in customer-facing use cases (e.g., a chatbot that is ‘tricked’ into using inappropriate language by an end-user trying to embarrass the FI). Concerns around unjust bias not yet significant in the group, since none of the participants is focusing on customer-facing use cases. *** Beyond the risks themselves, there was also concern about the potential originators of such risks– e.g., fintechs attempting to compete with regulated entities but without the internal guardrails that the latter have in place; staff inside regulated FIs who find themselves using models for the first time due to the LLM-led “democratization” of AI.
  • 6. 3. Managing LLM Risks Broad agreement that the approach to risk management should be “end- to-end”, rather than solely focused on model risk. Overall - everyone still in experimental mode A majority of the participants claimed that ChatGPT was ‘banned’ from use in their organizations, except in narrow supervised conditions. Several participants acknowledged that they were still trying to put their arms around the risks of LLM. They were in learning mode, and had implemented ad-hoc controls (e.g., a single channel to approve any LLM use case around the FI) in the interim. One participant felt that it was important to let low risk use cases move forward, and use those experiences to flesh out the control framework for higher risk ones. Model risk – still working through implications Many FIs opined that MRM departments were under pressure from all parts of their organizations to review/ approve potential LLM use cases. Some felt unprepared for the sudden spike in interest. Some participants expressed confidence that when the LLMs are used in the embedding space (e.g., with the info retrieval type use cases), it was possible to apply robust MRM controls. Some pointed out that this was already underway with BERT style models. However, where the embeddings are not made available, outcome-based testing is the only option right now. This naturally limits the extent to which MRM can approve such models for high stakes use cases, and/or results in imposition of additional human controls to compensate. Outcome-based testing is hard since exhaustive testing is not possible in most LLM use cases. Different prompts and/or end-user queries can produce different outcomes with same underlying model. The challenges of testing generative ai output quality For Generative AI, an additional testing challenge is to identify the quality of the output itself. Human feedback, where available, is of course ideal. However, in many situations, that is not a viable or scalable option. Here, considering a combination of automated feedback functions may be helpful (e.g., relevance testing or sentiment analysis of the answers) The analogy is to data labelling in traditional ML, which was initially all human, but subsequently used automated techniques. Similar programmatic approaches may provide a way to test the quality of Generative AI system outputs.
  • 7. 4. The People Dimension A sense that LLMs will have profound impact on the nature, and possibly the number, of jobs in the industry, but not much clarity or agreement beyond that. Elimination of roles – unclear at this stage Opinion seemed divided on if/ how quickly LLMs will result in large scale reduction of jobs. Some participants felt that there may be fewer roles in coding and data science; others foresaw significant reductions in operations. However, few offered a view on the timing or scale of such losses Consensus that many existing roles will change There was general agreement among participants that over time, several roles inside FIs will certainly change a lot. One participant highlighted the potential for LLMs to automate the interpretation of code and/or the generation of new code from natural language instructions. At current capability levels, this would still leave a role for coders, but much more in an expert ‘checker’ role. Another opined that LLMs are democratizing data science (since “anyone can implement LLM based applications”), and this would have significant implications, positive and negative, for data scientists. The talent pipeline challenge One potential challenge is the need for apprenticeship in order to build the necessary domain expertise needed to work with advanced AI tools: if most of the ‘grunt work’ that was previously done by entry level/ junior staff can be automated, then how would the pipeline for tomorrow’s experts be built? Experts vs. Generalists: no consensus yet An unresolved question was whether widespread use of LLMs will tilt the balance more towards generalists or specialists. • Will LLMs empower generalists to overcome the disadvantages arising from their lack of specialist knowledge, or • Will they further entrench the value of specialists who can complement their deep domain knowledge with the outputs from LLM applications