THE DATA LORAX
PLANTING THE SEEDS OF FAIRNESS IN DATA PRODUCTS
OMAYMA SAID
DATA SCIENTIST
THE ONCELER
LET’S UNLOCK THE VALUE OF
THE THNEED!
THE LORAX
I SPEAK FOR
THE TREES!
EVERYBODY NEEDS A THNEED!
THNEED AT SCALE!
UNLESS WHAT?
DATA
IS THE NEW
OIL
“Data is the new oil,
in the way that oil is a ubiquitous
commodity that requires incredible
resource allocation to extract value
from, deep expertise to manage –
and even when all that goes well –
can have universally consequential
negative externalities.”*
Drew Conway
Founder & CEO
AI-POWERED [----]
ML-ENABLED [----]
MACHINES LEARN
WHO IS THE
TEACHER?
KODAK
SHIRLEY CARDS
(1960s & 1970s)
SHIRLEY CARDS
Several “Shirley cards” from the 1960s and 1970s.
1960s & 1970s
SHIRLEY CARDS
Mixed-color photos by Walt Jabsco
1960s & 1970s
PRODUCT
FAILED
DUE TO SOMETHING
INDIVIDUALS CAN’T
CHANGE ABOUT
THEMSELVES!
IMAGE DATASETS
(NOW)
NOW
IN A DIFFERENT CONTEXT
OPEN IMAGES
NOW
MORE DIVERSITY?
OPEN IMAGES
IN A DIFFERENT CONTEXT
No Classification without Representation:
Assessing Geodiversity Issues in Open
Data Sets for the Developing World*
Shreya Shankar, Yoni Halpern, Eric Breck,
James Atwood, Jimbo Wilson, D. Sculley
Google Brain Team
[Figure: country of origin of images in Open Images and ImageNet; the US is labeled in both charts]
WEDDING PHOTOS
Photos of bridegrooms from different countries aligned by the log-likelihood that
the classifier trained on Open Images assigns to the bridegroom class (Source)
BETTER AND MORE
CONSISTENT
CLASSIFICATION
The WEIRDest people in the world?
Joseph Henrich, Steven J. Heine, Ara Norenzayan
University of British Columbia*
The WEIRDest people in the world?
Western
Educated
Industrialized
Rich
Democratic
CLICKBAIT?
“Amazon’s system TAUGHT ITSELF that
male candidates were preferable. It penalized
resumes that included the word “women’s,” as in “women’s
chess club captain.” And it downgraded graduates of two
all-women’s colleges, according to people familiar with the matter.
They did not specify the names of the schools.”
LEARNED FROM HUMANS
HUMAN BIAS
AMPLIFICATION
UNFAIRNESS
@ SCALE
PRODUCT-X
“- Removes steps like resume reviews, phone screens, and
traditional assessments from recruiting processes.
- Uses AI to give you more insight into candidates, so you
can make better decisions.”
“Practitioners consistently:
- overestimate their model’s accuracy.
- propagate feedback loops.
- fail to notice data leaks.”
“Why Should I Trust You?”: Explaining the Predictions of Any Classifier
https://arxiv.org/pdf/1602.04938.pdf
IT IS HUMANS WHO
- COLLECT/LABEL DATA
- WRITE ALGORITHMS
- DEFINE METRICS
IT IS HUMANS WHO
- COLLECT/LABEL DATA
  BIAS IN:
  - REPRESENTATION
  - DISTRIBUTION
  - LABELS
  AND MORE…
- WRITE ALGORITHMS
- DEFINE METRICS
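The representation bias named above can be checked before any model is trained. The following is a minimal sketch, not a method from the talk: it compares each group's share of a dataset against a reference share and flags groups that fall well short. The function names, the `tolerance` threshold, the region codes, and the reference shares are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def group_shares(labels):
    """Return each group's share of the dataset as a fraction of all rows."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {group: count / total for group, count in counts.items()}


def underrepresented(labels, reference, tolerance=0.5):
    """List groups whose observed share is below tolerance * reference share.

    `reference` maps each group to the share we would expect to see,
    for example the share of the intended user base (an assumption here).
    """
    shares = group_shares(labels)
    return sorted(
        group
        for group, expected in reference.items()
        if shares.get(group, 0.0) < tolerance * expected
    )


# Hypothetical toy data: region of origin for 10 images.
image_regions = ["US"] * 7 + ["UK"] * 2 + ["IN"] * 1
# Hypothetical reference shares; these are not real statistics.
reference_shares = {"US": 0.3, "UK": 0.1, "IN": 0.4, "NG": 0.2}

shares = group_shares(image_regions)
flagged = underrepresented(image_regions, reference_shares)
print(shares)
print(flagged)
```

A check like this only surfaces gaps in representation. It says nothing about label quality or about how the model behaves on the groups that are present.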
IT IS HUMANS WHO
- COLLECT/LABEL DATA
- WRITE ALGORITHMS
  - TRAIN/TEST SPLIT
  - FEATURES/PROXIES
  - BLACK-BOX MODELS
  AND MORE…
- DEFINE METRICS
IT IS HUMANS WHO
- COLLECT/LABEL DATA
- WRITE ALGORITHMS
- DEFINE METRICS
  - WHAT IS THE IMPACT OF
    DIFFERENT ERROR TYPES
    ON DIFFERENT GROUPS?
  - WHAT DO YOU OPTIMIZE
    FOR?
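One way to start answering the question about error types and groups is to break a classifier's errors down per group instead of reporting a single accuracy number. The sketch below is an illustration under stated assumptions, not the speaker's method: it assumes binary labels (1 = positive, 0 = negative), and the function name, group names, and toy data are hypothetical.

```python
from collections import defaultdict


def error_rates_by_group(y_true, y_pred, groups):
    """Compute false positive and false negative rates for each group.

    Assumes binary labels: 1 = positive, 0 = negative.
    A rate is None when the group has no examples of the relevant class.
    """
    counts = defaultdict(lambda: {"fp": 0, "fn": 0, "neg": 0, "pos": 0})
    for truth, pred, group in zip(y_true, y_pred, groups):
        c = counts[group]
        if truth == 1:
            c["pos"] += 1
            if pred == 0:
                c["fn"] += 1  # positive example wrongly rejected
        else:
            c["neg"] += 1
            if pred == 1:
                c["fp"] += 1  # negative example wrongly accepted
    return {
        group: {
            "fpr": c["fp"] / c["neg"] if c["neg"] else None,
            "fnr": c["fn"] / c["pos"] if c["pos"] else None,
        }
        for group, c in counts.items()
    }


# Hypothetical toy data: both groups have 75% and 50% accuracy respectively,
# but the kind of error each group experiences is different.
y_true = [1, 1, 0, 0, 1, 1, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]

rates = error_rates_by_group(y_true, y_pred, groups)
for group, r in sorted(rates.items()):
    print(group, r)
```

In this toy data, group "a" only sees false positives while group "b" only sees false negatives. Which of the two is more harmful depends on the product, which is why the choice of metric to optimize is a human decision.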
EXTRA READINGS
THINGS WON’T GET BETTER
“UNLESS SOMEONE LIKE YOU CARES
A WHOLE AWFUL LOT, NOTHING IS
GOING TO GET BETTER, IT’S NOT!”

Data Natives Cologne v 4.0 | "The Data Lorax: Planting the Seeds of Fairness in Data Products" - Omayma Said

Editor's Notes

  1. Truffula Trees
  2. https://www.reuters.com/article/us-amazon-com-jobs-automation-insight/amazon-scraps-secret-ai-recruiting-tool-that-showed-bias-against-women-idUSKCN1MK08G