1StoryStream.ai Dr Janet Bastiman @yssybyl
AI Fails:
how can you begin to overcome bias in design and test
The world’s leading automotive content platform
About StoryStream
StoryStream is a dedicated automotive content platform, trusted by some of the
world’s leading car brands. Specifically created to help automotive brands
provide a more relevant, engaging customer experience, fuelled with authentic
content and designed for efficiently scaling content operations across global
teams.
The Core StoryStream Benefits
● Grow customer engagement and conversions by up to 25%
● Reduce content creation and management costs by up to 60%
● Provide a more authentic customer experience
● Understand your customer in a deeper way
Tonight I’m going to be looking at why so many big companies
have a problem with bias and what checks and balances you
can put in place to help prevent falling victim to these types of
errors.
For argument’s sake, I’m using AI as a superset of machine
learning, deep learning and all other techniques that lead to a
system that appears to make intelligent decisions.
Also, since this is a short talk, bear in mind that each one of
these slides warrants a full presentation in itself, so this will be
a high-level introduction to get you thinking about things.
AI fails - Bias
***
All those headlines make us feel uncomfortable, and rightly so.
They cover Amazon’s sexist recruitment AI, Google’s image
tagging and the COMPAS system to predict re-offense rates. I
think everyone in this room would nod sagely about how bad
these are and claim it would never happen on their watch.
So why do we keep seeing this happen?
What about the AI that doesn’t make the headlines, the ones
quietly deciding whether you get sent a special offer, a credit
card, a cancer diagnosis? The things most of us are working
on. What if our work is flawed but never makes the headlines
– would we know?
Nobody in this field sets out to make a bad AI, so why does
this happen?
It’d be easy to say have a diverse team and diverse data, but
that’s not good enough.
What is bias?
an unwarranted correlation between input
variable and output classification
IMPACT is more important than ACCURACY
Let’s take a step back and look at “What is bias?”
If I gave you all a test no doubt you’d write answers around
underfitting and overfitting – mathematical answers.
Focus on the more descriptive definition: an unwarranted
correlation.
For many of us, from a position of privilege, it’s hard to really
understand the impact of being on the receiving end of these
correlations.
So how are these biases introduced? Let’s take a look at the
maths…
Maths
“Fairness” assumes:
A. Calibration within groups
B. Balance for negative class
C. Balance for positive class
Can only be achieved if prediction is perfect or there are completely equal base rates
Conclusions
● You cannot balance everything
● Either:
○ prediction is unbiased
○ or error is unbiased
● Fairness is personal
Chouldechova: https://arxiv.org/abs/1610.07524
Kleinberg et al: https://arxiv.org/abs/1609.05807
Both of these papers were studies into whether the COMPAS
system was deliberately biased, and come to the same
conclusions via different proofs. They are both well worth a
read. As a side note, there’s a minor mathematical error in the
Kleinberg paper (which does not affect the proof), but it is worth
noting that you shouldn’t just blindly implement what you read in
papers.
Starting with a definition of fairness in both papers they
conclude that:
Unless you have perfect prediction and a balanced population,
you will have either bias in positive prediction or bias in error
rates.
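To make that trade-off concrete, here is a minimal sketch rather than anything taken from either paper's code. It uses the identity that follows directly from the confusion-matrix definitions, FPR = p / (1 - p) × (1 - PPV) / PPV × (1 - FNR), where p is a group's base rate. The function name, the two groups and all the numbers are assumptions made up for illustration.

```python
def implied_false_positive_rate(base_rate, ppv, fnr):
    """False positive rate forced by a group's base rate, PPV and FNR.

    Derived from the confusion-matrix definitions:
    FPR = p / (1 - p) * (1 - PPV) / PPV * (1 - FNR)
    """
    if not 0 < base_rate < 1 or not 0 < ppv <= 1 or not 0 <= fnr <= 1:
        raise ValueError("rates must be valid probabilities")
    return base_rate / (1 - base_rate) * (1 - ppv) / ppv * (1 - fnr)


# Hypothetical groups: identical PPV and FNR, different base rates.
PPV, FNR = 0.7, 0.3
fpr_group_a = implied_false_positive_rate(0.5, PPV, FNR)
fpr_group_b = implied_false_positive_rate(0.3, PPV, FNR)

print(f"group A false positive rate: {fpr_group_a:.3f}")
print(f"group B false positive rate: {fpr_group_b:.3f}")
```

With the positive predictions equally trustworthy in both groups and the same miss rate, the false positive rates still come out different because the base rates differ. Setting the base rates equal, or PPV to 1, is the only way to make them match.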
So to avoid bias, mathematically, we need to live in an
unbiased world. Sadly this is not the case.
You cannot have positive parity and error parity at the same
time; you can only choose which is least unacceptable.
COMPAS chose to minimise false negatives and as a result
created something that was racially biased.
For all real problems you will violate one of the fairness
measures. Typically we focus on overall accuracy and most
practitioners don’t think further.
We are post-GDPR now, so if you are making inferences
against protected variables, make sure you are storing them
correctly and have some explainability (*whole other talk)
Data Errors
● Selection bias
● Random sampling
● Overcoverage
● Undercoverage
● Measurement (Response) error
● Processing errors
● Participation bias
In addition to the mathematics of creating AI, bias creeps in
earlier in the chain. Unless you are lucky enough to get a full
view of your data pipeline, you may not have a good
understanding of how you’ve ended up with the data in front of
you.
If you’ve done statistical sampling theory then you’ll be aware
of this, but here’s a taster. There are seven key data sampling
errors that you should know and be able to ask about before
building any model.
The data available to any company is by nature limited to a
subset of all possible data. The graph shows how widely the
measured accuracy of a system with a true 50% average score
can spread depending on the size of the sample. Small data
sets can cause large variations.
Extrapolate to your own data – where are the holes?
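The effect of sample size can be sketched in a few lines. This is not the calculation behind the slide's graph, which isn't reproduced here. It is a rough illustration using the normal approximation to a proportion, and the function name and the sample sizes are assumptions.

```python
import math


def accuracy_interval(true_rate, sample_size, z=1.96):
    """Approximate 95% range for an observed rate (normal approximation)."""
    standard_error = math.sqrt(true_rate * (1 - true_rate) / sample_size)
    return (max(0.0, true_rate - z * standard_error),
            min(1.0, true_rate + z * standard_error))


# How far can a measured score drift from a true 50% at each sample size?
for n in (10, 100, 1_000, 10_000):
    low, high = accuracy_interval(0.5, n)
    print(f"n={n:>6}: {low:.3f} to {high:.3f}")
```

At 100 samples the range is roughly 0.40 to 0.60, so two teams measuring the same system could honestly report quite different numbers.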
Example: Is Oxford racially biased for admissions?
A couple of years ago, admissions data from Oxford University showed
that there was a much lower proportion of BAME students offered places
than were in the general population. While I’m not discounting that there
was racial bias occurring, let’s look at some of the data biases involved:
- Students at private schools are more likely to apply than state school
students with the same grades (selection)
- Students at private schools are mostly white (undercoverage)
- Students from state schools are more likely to apply to popular /
oversubscribed courses due to curriculum restrictions (participation)
All of these affect the perceived outcome and can exacerbate or mask a
true result. Know the provenance of your data and where the sampling
impacts your results.
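Here is a small sketch of how application patterns alone can change a headline rate. Every number below is invented purely for illustration and says nothing about real admissions data. The group and course names are placeholders, and both groups are given identical offer rates on each course by construction.

```python
# Entirely made-up applicant counts, chosen only to illustrate the effect.
# (group, course) -> (applications, offers)
applications = {
    ("group_x", "oversubscribed"): (800, 80),     # 10% offer rate
    ("group_x", "undersubscribed"): (200, 80),    # 40% offer rate
    ("group_y", "oversubscribed"): (200, 20),     # 10% offer rate
    ("group_y", "undersubscribed"): (800, 320),   # 40% offer rate
}


def rate_for(group=None, course=None):
    """Offer rate over all rows matching the given group and/or course."""
    rows = [counts for (g, c), counts in applications.items()
            if (group is None or g == group) and (course is None or c == course)]
    applied = sum(a for a, _ in rows)
    offered = sum(o for _, o in rows)
    return offered / applied


for group in ("group_x", "group_y"):
    print(f"{group} overall: {rate_for(group=group):.0%}")
    for course in ("oversubscribed", "undersubscribed"):
        print(f"  {course}: {rate_for(group=group, course=course):.0%}")
```

Per course the two groups are treated identically, yet the overall rates differ by more than a factor of two, purely because of where each group applied. The same arithmetic can hide a real difference just as easily as it can manufacture one.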
Everyone is biased
You are no exception
This is really important. Accept that everyone is biased in some way.
We are biased by our experiences (positive and negative) and we are
biased by the comments from the networks we trust. Every day our
biases are reinforced.
Our data sets are affected by our biases.
Our test sets are affected by our biases.
We need to get into a different mindset.
The image on the next slide is from:
https://www.designhacks.co/products/cognitive-bias-codex-poster
Buy a copy and put it somewhere you can see every day. I have!
These are your biases and why – please give the people who created
this the traffic and buy the poster!
This is how you are manipulated. This is how you justify bad
behaviour. Apply this to your day-to-day life. Question yourself if you
find yourself agreeing or disagreeing on “gut instinct”. Stop yourself if
you make sweeping generalisations.
Challenge yourself. This is why we are bad at gathering data and why
we are bad at analysing it. We are primed to see patterns even when
they are not there. I’ve had blazing rows with more than one C-level
exec because they have seen something in the data that just isn’t
there. Saying a model is wrong because it doesn’t fit expectations is
just as bad as saying it is correct just because it does fit your own
biases.
Without understanding your biases you will create data sets that fit
your own experience profile. You will discard data points that don’t fit
without being conscious of it. If you get the results you expect you will
not test them as thoroughly as if they disagreed with your
expectations. You will twist your models for an experience that makes
you comfortable at the expense of others.
Be cognisant of your own biases. Diverse teams help here, but even
with this, challenge yourselves.
Which brings me to testing… Most AI practitioners validate their
models but do not test them in the way that test engineers do…
AI testing is not TESTING
What happens if your model
gets bad data?
Humans just love to prove
superiority over tech.
Learn how to break everything you create
https://www.sempf.net/post/On-Testing1
Sure, you test your models against known data and you probably have
a golden test set and do final validation against that. You may even
have a pipeline for constant sampling and retest as live data goes
through your system. The problem is that fails are accepted as part of
the overall statistics: “it’s only 1%”, “the system wasn’t designed for
that”, “that failed because of [thing you’re not going to change]”.
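One way to see why "it's only 1%" can be misleading is to break the failures down by slice. The sketch below uses synthetic records and hypothetical slice names. It is an illustration of the check, not a description of any particular pipeline.

```python
from collections import defaultdict


def error_rate_by_slice(records):
    """records: iterable of (slice_name, was_correct) pairs."""
    totals = defaultdict(int)
    errors = defaultdict(int)
    for slice_name, was_correct in records:
        totals[slice_name] += 1
        if not was_correct:
            errors[slice_name] += 1
    return {name: errors[name] / totals[name] for name in totals}


# Synthetic results: 1,000 predictions, 10 failures, all in the small slice.
records = ([("majority", True)] * 950
           + [("minority", True)] * 40
           + [("minority", False)] * 10)

overall = sum(not ok for _, ok in records) / len(records)
per_slice = error_rate_by_slice(records)

print(f"overall error: {overall:.1%}")
for name, rate in per_slice.items():
    print(f"{name}: {rate:.1%}")
```

A 1% overall failure rate here is a 20% failure rate for the people in the smaller slice.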
The issue is that most people are reluctant to really and thoroughly test
their systems. If you’ve come from a software engineering
background then you should be familiar with these concepts, but
optimisation and testing are the two biggest omissions in every AI
course I’ve seen.
Learn to break your models…
All models fail in some circumstances – find those situations, go out of
your way to understand your models so thoroughly that you are never
surprised. Test them with the broadest range of data you can. Do
your own adversarial attacks. I regularly test my team’s models with
pictures of my cats and static…
Read Bill Sempf’s blog post for a great example of how to test a
simple input box that expects a number. Extrapolate this to your
systems (*whole other talk in this!)
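As a taste of that mindset, here is a minimal sketch of attacking an input that expects a number. The validator, its limits and the list of hostile inputs are all hypothetical and are not taken from the blog post. The same habit of listing everything that should be refused carries over to whatever sits in front of a model.

```python
def parse_quantity(raw, minimum=0, maximum=1_000):
    """Hypothetical validator for an input box that expects a whole number."""
    if not isinstance(raw, str):
        raise ValueError("expected text input")
    text = raw.strip()
    # isascii() rules out digits from other scripts that isdigit() accepts.
    if not text.isascii() or not text.isdigit():
        raise ValueError(f"not a plain whole number: {raw!r}")
    value = int(text)
    if not minimum <= value <= maximum:
        raise ValueError(f"out of range: {value}")
    return value


# Inputs a determined tester (or user) might try.
HOSTILE_INPUTS = ["", " ", "-1", "1e3", "NaN", "0x10", "٣", "99999999999",
                  "1; DROP TABLE orders", None, 3.5]


def rejected(raw):
    """True if the validator refuses the input with a ValueError."""
    try:
        parse_quantity(raw)
    except ValueError:
        return True
    return False


for raw in HOSTILE_INPUTS:
    print(f"{raw!r:>26} rejected={rejected(raw)}")
```

Any input that slips through, or that raises something other than the expected error, is a test case worth keeping.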
This doesn’t mean that systems have to be perfect to be released.
We live in the real world where you will be pushed (probably by people
like me) to get solutions out. Push back, be clear on limitations so that
decisions can be made about the risks. Add in safeguards for when
your model is wrong.
Even with all this testing, the thing that should be at the front of your
mind is not accuracy but impact.
Impact > accuracy
All businesses should care about the impact of the AI they create.
Rather than talking about accuracy, recall and precision, let’s shift to impact. What
is the impact of mislabelling a car? Getting someone’s gender incorrect?
Refusing a loan? Incarcerating an innocent person? Missing a diagnosis of
a terminal illness?
The answer may be different for different individuals – what might be a non-issue
for one person could be life-changing for another. Stop thinking from
your own position of privilege and your own biases and take a broader
view.
How is the information used – will there be a human in the loop? Put
yourself in the position of the most vulnerable and marginalised users of
your system and ask what is the impact of a false positive or false negative
on them. Don’t brush off those results if they don’t fit with your experience.
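A rough way to put impact ahead of accuracy is to weight each kind of error by what it costs the person on the receiving end. The two models, their error counts and the cost weights below are all assumptions invented for this sketch. In practice the weights are a judgement call that should involve the people affected.

```python
def total_impact(false_positives, false_negatives, cost_fp, cost_fn):
    """Total harm score for a set of errors under assumed per-error costs."""
    return false_positives * cost_fp + false_negatives * cost_fn


# Two hypothetical models evaluated on the same 1,000 cases.
model_a = {"false_positives": 10, "false_negatives": 30}  # 40 errors: 96% accurate
model_b = {"false_positives": 45, "false_negatives": 5}   # 50 errors: 95% accurate

# Assumed costs for a screening task where a missed case is far worse
# than a false alarm.
COSTS = {"cost_fp": 1, "cost_fn": 20}

impact_a = total_impact(**model_a, **COSTS)
impact_b = total_impact(**model_b, **COSTS)

print(f"model A impact score: {impact_a}")
print(f"model B impact score: {impact_b}")
```

Under these assumed costs the less accurate model does far less harm. Change the weights to reflect a different user's situation and the ranking can flip, which is the point: the choice of weights matters more than the last percentage point of accuracy.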
Summary
You are biased
Challenge your biases
Understand data provenance
Break everything you create
Create AI mindful of impact on the individual

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

AI Fails: Avoiding bias in your systems

  • 1. 1StoryStream.ai Dr Janet Bastiman @yssybyl AI Fails: how can you begin to overcome bias in design and test
  • 2. The world’s leading automotive content platform. About StoryStream: StoryStream is a dedicated automotive content platform, trusted by some of the world’s leading car brands. Specifically created to help automotive brands provide a more relevant, engaging customer experience, fuelled with authentic content and designed for efficiently scaling content operations across global teams. The Core StoryStream Benefits: ● Grow customer engagement and conversions by up to 25% ● Reduce content creation and management costs by up to 60% ● Provide a more authentic customer experience ● Understand your customer in a deeper way
  • 3. Tonight I’m going to be looking at why so many big companies have a problem with bias and what checks and balances you can put in place to help prevent falling victim to these types of errors. For argument’s sake, I’m using AI as a superset of machine learning, deep learning and all other techniques that lead to a system that appears to make intelligent decisions. Also, since this is a short talk, bear in mind that each one of these slides warrants a full presentation in itself, so this will be a high level introduction to get you thinking about things.
  • 4. AI fails - Bias ***
  • 5. All those headlines make us feel uncomfortable, and rightly so. They cover Amazon’s sexist recruitment AI, Google’s image tagging and the COMPAS system to predict re-offense rates. I think everyone in this room would nod sagely about how bad these are and claim it would never happen on their watch. So why do we keep seeing this happen? What about the AI that doesn’t make the headlines, the ones quietly deciding whether you get sent a special offer, a credit card, a cancer diagnosis? The things most of us are working on. What if our work is flawed but never makes the headlines – would we know? Nobody in this field sets out to make a bad AI, so why does this happen? It’d be easy to say have a diverse team and diverse data, but that’s not good enough.
  • 6. What is bias? An unwarranted correlation between input variable and output classification. IMPACT is more important than ACCURACY
  • 7. Let’s take a step back and look at “What is bias?” If I gave you all a test no doubt you’d write answers around underfitting and overfitting – mathematical answers. Focus on the more descriptive definition: an unwarranted correlation. For many of us, from a position of privilege, it’s hard to really understand the impact of being on the receiving end of these correlations. So how are these biases introduced? Let’s take a look at the maths…
  • 8. Maths. “Fairness” assumes: A. Calibration within groups B. Balance for negative class C. Balance for positive class. All three can only be achieved if prediction is perfect or there are completely equal base rates. Conclusions: ● You cannot balance everything ● Either: ○ prediction is unbiased ○ or error is unbiased ● Fairness is personal. References: Chouldechova https://arxiv.org/abs/1610.07524; Kleinberg et al https://arxiv.org/abs/1609.05807
  • 9. Both of these papers were studies into whether the COMPAS system was deliberately biased, and they come to the same conclusions via different proofs. They are both well worth a read. As a side note, there’s a minor mathematical error in the Kleinberg paper (which does not affect the proof), but it is worth noting that you shouldn’t just blindly implement what you read in papers. Starting from a definition of fairness, both papers conclude that unless you have perfect prediction or equal base rates across groups, you will have either bias in positive prediction or bias in error rates.
  • 10. So to avoid bias, mathematically, we need to live in an unbiased world. Sadly this is not the case. You cannot have positive parity and error parity at the same time, you can only choose which is least unacceptable. COMPAS chose to minimise false negatives and as a result created something that was racially biased. For all real problems you will violate one of the fairness measures. Typically we focus on overall accuracy and most practitioners don’t think further. We are post-GDPR now, so if you are making inferences against protected variables make sure you are storing them correctly and have some explainability (*whole other talk)
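To make the trade-off concrete, here is a minimal sketch that reports the base rate, false positive rate, false negative rate and positive predictive value for each group. The function name `group_rates` and the toy labels are assumptions made up for illustration; they are not taken from COMPAS or from either paper.

```python
from collections import defaultdict

def group_rates(y_true, y_pred, groups):
    """Per-group base rate, false positive/negative rates and precision (PPV)."""
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "tn": 0, "fn": 0})
    for truth, pred, g in zip(y_true, y_pred, groups):
        # "t"/"f" = prediction was right/wrong, "p"/"n" = predicted class
        key = ("t" if truth == pred else "f") + ("p" if pred else "n")
        counts[g][key] += 1
    report = {}
    for g, c in counts.items():
        pos = c["tp"] + c["fn"]        # actual positives in this group
        neg = c["tn"] + c["fp"]        # actual negatives in this group
        flagged = c["tp"] + c["fp"]    # everything predicted positive
        report[g] = {
            "base_rate": pos / (pos + neg),
            "fpr": c["fp"] / neg if neg else None,
            "fnr": c["fn"] / pos if pos else None,
            "ppv": c["tp"] / flagged if flagged else None,
        }
    return report

# Illustrative toy data only: group "a" has a 50% base rate, group "b" 25%.
y_true = [1, 1, 0, 0, 1, 0, 0, 0]
y_pred = [1, 0, 1, 0, 1, 1, 0, 0]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]
report = group_rates(y_true, y_pred, groups)
for g, rates in report.items():
    print(g, rates)
```

In this toy data both groups get the same positive predictive value, yet their false positive and false negative rates differ, which is the kind of split the two papers describe when base rates are unequal. Reporting these numbers per group, rather than one overall accuracy, is what lets you choose which imbalance is least unacceptable.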
  • 11. Data Errors ● Selection bias ● Random Sampling ● Over coverage ● Undercoverage ● Measurement (Response) error ● Processing errors ● Participation bias
  • 12. In addition to the mathematics of creating AI, bias creeps in earlier in the chain. Unless you are lucky enough to get a full view of your data pipeline, you may not have a good understanding of how you’ve ended up with the data in front of you. If you’ve done statistical sampling theory then you’ll be aware of this, but here’s a taster. There are seven key data sampling errors that you should know and be able to ask about before building any model. The data available to any company is by nature limited to a subset of all possible data. The graph shows the mathematical spread of accuracy of a system predicting a 50% average score based on population size of sample. Small data sets can cause large variations. Extrapolate to your own data – where are the holes?
  • 13. Example: Is Oxford racially biased for admissions? A couple of years ago, admissions data from Oxford University showed that there was a much lower proportion of BAME students offered places than were in the general population. While I’m not discounting that there was racial bias occurring, let’s look at some of the data biases involved: - Students at private schools are more likely to apply than state school students with the same grades (selection) - Students at private schools are mostly white (undercoverage) - Students from state schools are more likely to apply to popular / oversubscribed courses due to curriculum restrictions (participation) All of these affect the perceived outcome and can exacerbate or mask a true result. Know the provenance of your data and where the sampling impacts your results.
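The effect of sample size on that spread can be sketched with the standard error of a proportion. This is a rough illustration using the normal approximation, not a reconstruction of the graph on the slide; the function name `proportion_interval` and the 95% level (z = 1.96) are assumptions chosen for the example.

```python
import math

def proportion_interval(p, n, z=1.96):
    """Approximate range of the observed proportion when the true rate is p
    and n independent samples are drawn (normal approximation)."""
    se = math.sqrt(p * (1 - p) / n)   # standard error of a proportion
    return max(0.0, p - z * se), min(1.0, p + z * se)

# How far can a true 50% rate appear to drift at different sample sizes?
for n in (10, 100, 1000, 10000):
    low, high = proportion_interval(0.5, n)
    print(f"n={n:>5}: {low:.3f} to {high:.3f}")
```

With 100 samples a true 50% rate can plausibly show up as anything from about 40% to 60%; with 10 samples the range is roughly 19% to 81%. If a subgroup in your data is that small, any rate you measure for it deserves the same caution.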
  • 14. Everyone is biased. You are no exception
  • 15. This is really important. Accept that everyone is biased in some way. We are biased by our experiences (positive and negative) and we are biased by the comments from the networks we trust. Every day our biases are reinforced. Our data sets are affected by our biases. Our test sets are affected by our biases. We need to get into a different mindset. The image on the next slide is from: https://www.designhacks.co/products/cognitive-bias-codex-poster Buy a copy and put it somewhere you can see every day. I have!
  • 16. (Image: the Cognitive Bias Codex poster)
  • 17. These are your biases and why – please give the people who created this the traffic and buy the poster! This is how you are manipulated. This is how you justify bad behaviour. Apply this to your day to day life. Question yourself if you find yourself agreeing or disagreeing on “gut instinct”. Stop yourself if you make sweeping generalisations. Challenge yourself. This is why we are bad at gathering data and why we are bad at analysing it. We are primed to see patterns even when they are not there. I’ve had blazing rows with more than one C-level exec because they have seen something in the data that just isn’t there. Saying a model is wrong because it doesn’t fit expectations is just as bad as saying it is correct just because it does fit your own biases.
  • 18. Without understanding your biases you will create data sets that fit your own experience profile. You will discard data points that don’t fit without being conscious of it. If you get the results you expect you will not test them as thoroughly as if they disagreed with your expectations. You will twist your models for an experience that makes you comfortable at the expense of others. Be cognisant of your own biases. Diverse teams help here, but even with this, challenge yourselves. Which brings me to testing… Most AI practitioners validate their models but do not test them in the way that test engineers do…
  • 19. AI testing is not TESTING. What happens if your model gets bad data? Humans just love to prove superiority over tech. Learn how to break everything you create https://www.sempf.net/post/On-Testing1
  • 20. Sure, you test your models against known data and you probably have a golden test set and do final validation against that. You may even have a pipeline for constant sampling and retest as live data goes through your system. The problem is that fails are accepted as part of the overall statistics: “it’s only 1%”, “the system wasn’t designed for that”, “that failed because of [thing you’re not going to change]”. The issue is that most people are reluctant to test their systems really thoroughly. If you’ve come from a software engineering background then you should be familiar with these concepts, but optimisation and testing are the two biggest omissions in every AI course I’ve seen. Learn to break your models…
  • 21. All models fail in some circumstances – find those situations, go out of your way to understand your models so thoroughly you should never be surprised. Test them with the broadest range of data you can. Do your own adversarial attacks. I regularly test my team’s models with pictures of my cats and static… Read Bill Sempf’s blog post for a great example of how to test a simple input box that expects a number. Extrapolate this to your systems (*whole other talk in this!) This doesn’t mean that systems have to be perfect to be released. We live in the real world where you will be pushed (probably by people like me) to get solutions out. Push back, be clear on limitations so that decisions can be made about the risks. Add in safeguards for when your model is wrong. Even with all this testing, the thing that should be at the front of your mind is not accuracy but impact.
  • 22. Impact > accuracy
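As a small sketch of what “learn to break it” can look like for the simplest case, an input that expects a number, the snippet below throws deliberately hostile values at a validator and lists anything that gets through. `parse_quantity`, its 0 to 999 range and the `HOSTILE_INPUTS` list are hypothetical names and values invented for this example, not taken from the blog post.

```python
def parse_quantity(raw, lowest=0, highest=999):
    """Return an int in [lowest, highest], or raise ValueError."""
    if not isinstance(raw, str):
        raise ValueError("expected text input")
    text = raw.strip()
    # Only plain ASCII digits: rejects signs, decimals, exponents, other scripts
    if not text.isascii() or not text.isdigit():
        raise ValueError(f"not a plain non-negative integer: {raw!r}")
    value = int(text)
    if not lowest <= value <= highest:
        raise ValueError(f"out of range: {value}")
    return value

# Inputs nobody designed for, but somebody will eventually send.
HOSTILE_INPUTS = [
    "", "   ", "abc", "-1", "1.5", "1e3", "999999999999",
    "٣", "0x1F", None, 12, "1; DROP TABLE orders",
]

def rejected(raw):
    """True if the validator refuses the input cleanly with ValueError."""
    try:
        parse_quantity(raw)
    except ValueError:
        return True
    return False

# Anything left in this list slipped past the safeguard.
survivors = [raw for raw in HOSTILE_INPUTS if not rejected(raw)]
print("slipped through:", survivors)
```

The same habit carries over to models: keep a growing list of inputs the system was never designed for (cats and static included) and check that each one is either handled sensibly or refused by a safeguard, rather than absorbed into the overall statistics.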
  • 23. All businesses should care about the impact of the AI they create. Rather than talking accuracy, recall and precision, let’s shift to impact. What is the impact of mislabelling a car? Getting someone’s gender incorrect? Refusing a loan? Incarcerating an innocent person? Missing a diagnosis of a terminal illness? The answer may be different for different individuals – what might be a non-issue for one person could be life-changing for another. Stop thinking from your own position of privilege and your own biases and take a broader view. How is the information used – will there be a human in the loop? Put yourself in the position of the most vulnerable and marginalised users of your system and ask what is the impact of a false positive or false negative on them. Don’t brush off those results if they don’t fit with your experience.
  • 24. Summary: You are biased. Challenge your biases. Understand data provenance. Break everything you create. Create AI mindful of impact on the individual.