SlideShare uma empresa Scribd logo
1 de 55
Baixar para ler offline
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
#TechSEOBoost | @CatalystSEM
THANK YOU TO THIS YEAR’S SPONSORS
Keynote: Bias in Search and Recommender
Systems
Ricardo Baeza-Yates, NTENT
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Ricardo Baeza-Yates
CTO, NTENT
Biases in Search &
Recommender Systems
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
NTENT ntent.com
Marketing Engineering Operations International Applied Research
ntent.com
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Prologue
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
A Bit of History
Data Volume
Complexity
IR
DB
Two different points
of view for data
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Data Understanding
Data
Query
Unstructured Structured
Explicit
Information
Retrieval
(Relational)
Databases
Implicit Recommender Systems
Unknown Data Mining
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
What is Bias?
‱ Statistical: significant systematic
deviation from a prior (unknown)
distribution;
‱ Cultural: interpretations and judgments
phenomena acquired through our life;
‱ Cognitive: systematic pattern of deviation
from norm or rationality in judgment;
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
So (Observational) Human Data has Bias
‱ Gender
‱ Racial
‱ Sexual
‱ Age
‱ Religious
‱ Social
‱ Linguistic
‱ Geographic
‱ Political
‱ Educational
‱ Economic
‱ Technological
â–Ș Gathering process
â–Ș Sampling process
â–Ș Validity (e.g. temporal)
â–Ș Completeness
â–Ș Noise, spam
Many people extrapolate results of
a sample to the whole population
(e.g., social media analysis)
In addition there is bias when
measuring bias as well as bias
towards measuring it!
Attempt of an unbiased (personal) view on bias in Search & RS
Cultural Biases Statistical Biases Cognitive Biases
Self-selection
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Impact in Search and Recommender Systems
‱ Most web systems are optimized by using implicit user feedback
‱ However, user data is partly biased to the choices that these systems make
‱ Clicks can only be done on things that are shown to us
‱ As those systems are usually based in ML, they learn to reinforce their own biases,
yielding self-fulfilled prophecies and/or sub-optimal solutions
‱ For example, personalization and filter bubbles for users
‱ but also echo chambers for (recommender) systems
‱ Moreover, sometimes these systems compete among themselves, learning also
biases of other systems rather than real user behavior
‱ Even more, an improvement in one system might be just a degradation in another
system that uses a different (even inversely correlated) optimization function
‱ For example, user experience vs. monetization
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
A Non-Technical Question
Algorithm
Biased
Data
Neutral?
Fair?
Same
Bias
Garbage In
Garbage Out
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
What is being fair?
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
A Non-Technical Question
Algorithm
Biased
Data
Neutral?
Fair?
Same
Bias
Not
Always!
Debias the input
Tune the algorithm
Debias the output
Bias awareness!
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
January 2017 ACM US Statement on
Algorithm Transparency and Accountability
1. Awareness
2. Access and redress
3. Accountability
4. Explanation
5. Data Provenance
6. Auditability
7. Validation and Testing
Systems do not need to be perfect,
they just need to be better than us
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Biases Everywhere!
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Data bias
Biases on Search & RS: Web Case Study
Web Spam
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
[Baeza-Yates, Castillo & LĂłpez,
Cybermetrics, 2005]
Number of linked domains
Exports(thousandsofUS$)
Economic Bias in Links
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
17
[Baeza-Yates & Castillo, WWW2006]
Economic Bias in Links
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
18
Minimal effortShameCultural Bias in Websites
[Baeza-Yates, Castillo,
Efthimiadis, TOIT 2007]
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Language Bias
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
[Bolukbasi at al, NIPS 2016]
‱ Word embedding’s in w2vNEWS
Yes, about 60 to 70% at work
although at college is the inverse
Gender Bias in Content
Most journalists are men?
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
[E. Graells-Garrido et al,.
ACM Hypertext’15]
Systemic bias?
Equal opportunity?
Gender Bias in Content
Wikipedia
Partial
information
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Data bias
Activity bias
Bias on Usage
Actions
People
We are all in the long tail!
[Goel et al., WSDM 2010]
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Popularity Bias in Recommender Systems
Items
Users
Popular
items
Rest of items
(long tail)
‱ Take care to recommended items that are
not too popular
‱ Metrics
‱ Novelty enhancement
‱ Problem solved! 
really?
𝑛𝑜𝑣 𝑖 = 1 −
# ratings of 𝑖
# users
Items
#interactions
More
novel
Less
novel
𝑎 𝑏
[Vargas & Castells, RecSys 2011]
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
A Self-fulfilling Prophecy?
Popular items
(short head)
Rest of items
(long tail)
Observed user-item interaction
Unobserved preference
Items
Users
Ratings are missing
not at random (MNAR)
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Test data (relevant items)
Training data
Unobserved preference
Items
Users
Popular items
(short head)
Rest of items
(long tail)
avg P@𝑘 ∌
+
𝑘
Judgments are missing
not at random (MNAR)
Worse yet: user-system
reinforcement loop (more later)
A Self-fulfilling Prophecy?
[Marlin et al., RecSys 2010]
[Steck, RecSys 2010, 2011]
[Fleder & Hossanagar,
Management Sciences 2009]
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
1.E-06
1.E-05
1.E-04
1.E-03
0 0.2 0.4 0.6 0.8 1
A Problem for IR Evaluation Methodology!
30 TREC collections
Items / documents
#ratings/judgments
(infraction)
[BellogĂ­n, Castells & Cantador, IRJ 2017]
To how many queries
is a document relevant?
25% queries can be
answered with less than
1% of the URLs!
[Baeza-Yates, Boldi,
Chierichetti, WWW 2015]
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Get Rid of the Popularity Bias!
‱ In the rating split
[BellogĂ­n, Castells & Cantador, IRJ 2017]
‱ In the metrics
‱ Stratified recall
[Steck, RecSys 2011]
‱ Importance propensity scoring
[Yang et al., RecSys 2018]
‱ In the algorithms
[Steck, RecSys 2011]
[Lobato et al., ICML 2014]
[Jannach et al., UMUAI 2015]
[Cañamares & Castells, SIGIR 2018,
best paper award]
Test data (relevant items)
Training data
Unobserved preference
Items Items
#ratings
Flat test Popularity strata
#ratings
Time
Temporal
split
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Activity Bias also Affects Content
[Baeza-Yates & Saez-Trumper, ACM Hypertext 2015]
Most users are passive (i.e., more than 90% are lurkers)
Then, which percentage of active users produce 50% of the content?
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
October 2015
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
[Baeza-Yates & Saez-Trumper, ACM Hypertext 2015]
Which percentage of active users produce 50% of the content?
Wisdom of crowds is a partial illusion
Activity Bias: The Wisdom of a Few
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Content Never Seen
[Baeza-Yates & Saez-Trumper,
ACM Hypertext 2015]
The Digital Desert
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Data bias
Activity bias
Sampling
(size) bias
Algorithmic bias
Search or
Recommender System
Bias on the Web
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
[E. Graells-Garrido & M. Lalmas, ACM Hypertext’14]
Geographical Bias in Recommender Systems
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
‱ If we want to estimate the frequency of queries that appear with probability at
least p with a certain relative error, ∊, we can use the standard binomial error
formula which works well for p near œ but not for p near 0
‱ Better is the Agresti-Coull technique (also called Take 2) which gives:
where Z is the inverse of the standard normal distribution,
1 − đ›Œ is the confidence interval and
‱ If p = 0.1, 1 − đ›Œ is 80% and ∊ is 10%, the standard formula
gives n = 900, while with A-C we get n = 2342.
[Brown, Cai & DasGupta, Statistical Science, 2001]
[Baeza-Yates, SIGIR 2015, Industry track]
Sample Size?
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
‱ Standard technique:
‱ A good sample should cover well all the items distribution, but this
does not work with very skewed distributions.
10
0
10
1
10
2
10
3
Rank
10
0
10
1
10
2
10
3
10
4
10
5
Frequency
50M
10M
100K
1K
10
0
10
1
10
2
10
3
10
4
10
5
10
6
10
7
Rank
10
0
10
1
10
2
10
3
10
4
10
5
Frequency
50M
10M
100K
1K
[Zaragoza et al, CIKM 2010]
Sampling Queries
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
36
Stratified Sampling Example [Baeza-Yates, SIGIR 2015,
Industry track]
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Data bias
Activity bias
Sampling
bias
Interaction bias
(Self) selection bias
Bias in the User Interaction
Search or
Recommender System
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Position bias
Ranking bias
Presentation or exposure bias
Social bias
Interaction bias
Bias in the Interaction
Amazon.com
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Position bias
Presentation
bias
Social bias
Interaction bias
Ranking bias
Click bias
Scrolling bias
Mouse
movement
bias
Data and algorithmic bias Self-selection bias
Dependencies: A Cascade of Biases!
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Ranking Bias in Web Search
[Mediative Study, 2014]
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Ranking Bias: Click Bias in Web Search
‱ Ranking & next page bias
Navigational queries
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
CTR
(log)
1 11 21 Rank
Learning to Rank with bias
[Joachims et al., WSDM 2017, best paper]
+ many other papers
Fair rankings
[Zehlike et al., CIKM 2017]
Clicks as implicit positive user feedback
Debiasing Search Clicks and Other Biases
[Dupret & Piwowarski, SIGIR 2008]
[Chapelle & Zhang, WWW 2009]
[Dupret & Liao, WSDM 2010, best paper]
Debias the input
Tune the algorithm
Debias the output
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Data bias
Activity bias
Sampling
bias
Interaction bias
(Self) selection bias
Second-order bias
Vicious Cycle of Bias
Search or
Recommender System
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
[Baeza-Yates, Pereira & Ziviani, WWW 2008]
Person
Web content is redundant (> 20%)
Query
Ranking bias in new content
Redundancy grows (35%)
Search results
New page
Second Order Bias in Web Content
[Fortunato, Flammini, Menczer & Vespignani. PNAS 2006]
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Data bias
Activity bias
Sampling
bias
Interaction bias
(Self) selection bias
Vicious Cycle of Bias
Search or
Recommender System
Feedback loop bias
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Bias due to Personalization
‱ The effect of self-selection bias
‱ Avoid the rich get richer and
poor get poorer syndrome
‱ Avoid the echo chamber by
empowering the tail
Cold start problem solution: Explore & Exploit
Partial solutions:
‱ Diversity
‱ Novelty
‱ Serendipity
‱ My dark side Wikipedia
[Eli Pariser, Penguin 2011]
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Users’ Eco Chambers in Recommender Systems
â–Ș Filter bubbles
â–Ș Degenerate feedback loops
(e.g., YouTube autoplay)
[Jiang et al., AAAI 2019]
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Eco Chamber of the Recommender System
‱ Short-term greedy optimization, partial knowledge of the world
‱ Long-term revenue optimization is not achieved
‱ Views from new users should balance the exploration for new items
‱ Disparate impact: unfair ecommerce/information markets
[Baeza-Yates & Ribaudo,
to appear]
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Epilogue
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Recap
Bias  Type Statistical Cultural Cognitive
Algorithmic ï‚š ? ?
Presentation ï‚š
Position ï‚š ï‚š ï‚š
Data ï‚š ï‚š
Sampling ï‚š ï‚š ï‚š
Activity ï‚š
Self-selection ï‚š ï‚š
Interaction ï‚š ï‚š
Social ï‚š ï‚š
Second order ï‚š ï‚š ï‚š
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
[Silberzahn et al., COS, Univ. of Virginia, 2015]
Professional Bias?
➔ 61 analysts, 29 teams: 20 yes and 9 no
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
If Systems Reflects Its Designers:
What we can/should do?
â–Ș Data
â–Ș Analyze for known and unknown biases, debias when possible/needed
â–Ș Recollect more data for difficult/sparse regions of the problem
â–Ș Delete attributes associated directly/indirectly with harmful bias
â–Ș Interaction
â–Ș Make sure that the user is aware of the biases all the time
â–Ș Give more control to the user
â–Ș Design and Implementation
‱ Let experts/colleagues/users contest every step of the process
â–Ș Evaluation
‱ Do not fool yourself!
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Final Take-Home Message
â–Ș Systems are a mirror of us, the good, the bad and the ugly
â–Ș The Web amplifies everything, but always leaves traces
â–Ș We need to be aware of our own biases!
â–Ș We must be aware of the biases and contrarrest them to stop
the vicious bias cycle
â–Ș Plenty of open (research) problems!
Big Data of People is huge
..

.. but it is tiny compared to the future
Big Data of the Internet of Things (IoT)
No activity bias!
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Thank You! Any Questions?
–
rbaeza@acm.org | http://www.baeza.cl/ | http://fairness-measures.org
Biased Questions?
ASIST 2012
Book of the
Year Award
(Biased Ad)
Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost
Thanks for Viewing the Slideshare!
–
Watch the Recording: https://youtube.com/session-example
Or
Contact us today to discover how Catalyst can deliver unparalleled SEO
results for your business. https://www.catalystdigital.com/

Mais conteĂșdo relacionado

Mais procurados

ăƒ‡ăƒŒă‚żăƒ™ăƒŒă‚č10 - æ­ŁèŠćŒ–
ăƒ‡ăƒŒă‚żăƒ™ăƒŒă‚č10 - æ­ŁèŠćŒ–ăƒ‡ăƒŒă‚żăƒ™ăƒŒă‚č10 - æ­ŁèŠćŒ–
ăƒ‡ăƒŒă‚żăƒ™ăƒŒă‚č10 - æ­ŁèŠćŒ–Kenta Oku
 
Vector databases and neural search
Vector databases and neural searchVector databases and neural search
Vector databases and neural searchDmitry Kan
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender systemStanley Wang
 
Vector database
Vector databaseVector database
Vector databaseGuy Korland
 
Recsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem RevisitedRecsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem RevisitedXavier Amatriain
 
Enterprise Knowledge - Taxonomy Design Best Practices and Methodology
Enterprise Knowledge - Taxonomy Design Best Practices and MethodologyEnterprise Knowledge - Taxonomy Design Best Practices and Methodology
Enterprise Knowledge - Taxonomy Design Best Practices and MethodologyEnterprise Knowledge
 
How does ChatGPT work: an Information Retrieval perspective
How does ChatGPT work: an Information Retrieval perspectiveHow does ChatGPT work: an Information Retrieval perspective
How does ChatGPT work: an Information Retrieval perspectiveSease
 
Recommendation System Explained
Recommendation System ExplainedRecommendation System Explained
Recommendation System ExplainedCrossing Minds
 
10 MĂ©tricas para Monitoramento de MĂ­dias Sociais
10 MĂ©tricas para Monitoramento de MĂ­dias Sociais10 MĂ©tricas para Monitoramento de MĂ­dias Sociais
10 MĂ©tricas para Monitoramento de MĂ­dias SociaisSocial Figures
 
Learning to Rank Presentation (v2) at LexisNexis Search Guild
Learning to Rank Presentation (v2) at LexisNexis Search GuildLearning to Rank Presentation (v2) at LexisNexis Search Guild
Learning to Rank Presentation (v2) at LexisNexis Search GuildSujit Pal
 
Natural Language Processing: L02 words
Natural Language Processing: L02 wordsNatural Language Processing: L02 words
Natural Language Processing: L02 wordsananth
 
MyrrorBot: a Digital Assistant Based on Holistic User Models for Personalize...
MyrrorBot: a Digital Assistant Based on Holistic User Models forPersonalize...MyrrorBot: a Digital Assistant Based on Holistic User Models forPersonalize...
MyrrorBot: a Digital Assistant Based on Holistic User Models for Personalize...Cataldo Musto
 
Real-Time Recommendations with Hopsworks and OpenSearch - MLOps World 2022
Real-Time Recommendations  with Hopsworks and OpenSearch - MLOps World 2022Real-Time Recommendations  with Hopsworks and OpenSearch - MLOps World 2022
Real-Time Recommendations with Hopsworks and OpenSearch - MLOps World 2022Jim Dowling
 
Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Tutorial on Deep Learning in Recommender System, Lars summer school 2019Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Tutorial on Deep Learning in Recommender System, Lars summer school 2019Anoop Deoras
 
Introduction to Knowledge Graphs
Introduction to Knowledge GraphsIntroduction to Knowledge Graphs
Introduction to Knowledge Graphsmukuljoshi
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Xavier Amatriain
 
Improving Search Relevance in Elasticsearch Using Machine Learning - Milorad ...
Improving Search Relevance in Elasticsearch Using Machine Learning - Milorad ...Improving Search Relevance in Elasticsearch Using Machine Learning - Milorad ...
Improving Search Relevance in Elasticsearch Using Machine Learning - Milorad ...Institute of Contemporary Sciences
 

Mais procurados (20)

ăƒ‡ăƒŒă‚żăƒ™ăƒŒă‚č10 - æ­ŁèŠćŒ–
ăƒ‡ăƒŒă‚żăƒ™ăƒŒă‚č10 - æ­ŁèŠćŒ–ăƒ‡ăƒŒă‚żăƒ™ăƒŒă‚č10 - æ­ŁèŠćŒ–
ăƒ‡ăƒŒă‚żăƒ™ăƒŒă‚č10 - æ­ŁèŠćŒ–
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Vector databases and neural search
Vector databases and neural searchVector databases and neural search
Vector databases and neural search
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
 
Vector database
Vector databaseVector database
Vector database
 
Recsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem RevisitedRecsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem Revisited
 
Enterprise Knowledge - Taxonomy Design Best Practices and Methodology
Enterprise Knowledge - Taxonomy Design Best Practices and MethodologyEnterprise Knowledge - Taxonomy Design Best Practices and Methodology
Enterprise Knowledge - Taxonomy Design Best Practices and Methodology
 
How does ChatGPT work: an Information Retrieval perspective
How does ChatGPT work: an Information Retrieval perspectiveHow does ChatGPT work: an Information Retrieval perspective
How does ChatGPT work: an Information Retrieval perspective
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
Recommendation System Explained
Recommendation System ExplainedRecommendation System Explained
Recommendation System Explained
 
10 MĂ©tricas para Monitoramento de MĂ­dias Sociais
10 MĂ©tricas para Monitoramento de MĂ­dias Sociais10 MĂ©tricas para Monitoramento de MĂ­dias Sociais
10 MĂ©tricas para Monitoramento de MĂ­dias Sociais
 
Learning to Rank Presentation (v2) at LexisNexis Search Guild
Learning to Rank Presentation (v2) at LexisNexis Search GuildLearning to Rank Presentation (v2) at LexisNexis Search Guild
Learning to Rank Presentation (v2) at LexisNexis Search Guild
 
Natural Language Processing: L02 words
Natural Language Processing: L02 wordsNatural Language Processing: L02 words
Natural Language Processing: L02 words
 
MyrrorBot: a Digital Assistant Based on Holistic User Models for Personalize...
MyrrorBot: a Digital Assistant Based on Holistic User Models forPersonalize...MyrrorBot: a Digital Assistant Based on Holistic User Models forPersonalize...
MyrrorBot: a Digital Assistant Based on Holistic User Models for Personalize...
 
Real-Time Recommendations with Hopsworks and OpenSearch - MLOps World 2022
Real-Time Recommendations  with Hopsworks and OpenSearch - MLOps World 2022Real-Time Recommendations  with Hopsworks and OpenSearch - MLOps World 2022
Real-Time Recommendations with Hopsworks and OpenSearch - MLOps World 2022
 
Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Tutorial on Deep Learning in Recommender System, Lars summer school 2019Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Tutorial on Deep Learning in Recommender System, Lars summer school 2019
 
Introduction to Knowledge Graphs
Introduction to Knowledge GraphsIntroduction to Knowledge Graphs
Introduction to Knowledge Graphs
 
珏15曞Solrć‹‰ćŒ·äŒš - Solr at Yahoo! JAPAN #SolrJP
珏15曞Solrć‹‰ćŒ·äŒš - Solr at Yahoo! JAPAN #SolrJP珏15曞Solrć‹‰ćŒ·äŒš - Solr at Yahoo! JAPAN #SolrJP
珏15曞Solrć‹‰ćŒ·äŒš - Solr at Yahoo! JAPAN #SolrJP
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
 
Improving Search Relevance in Elasticsearch Using Machine Learning - Milorad ...
Improving Search Relevance in Elasticsearch Using Machine Learning - Milorad ...Improving Search Relevance in Elasticsearch Using Machine Learning - Milorad ...
Improving Search Relevance in Elasticsearch Using Machine Learning - Milorad ...
 

Semelhante a Keynote: Bias in Search and Recommender Systems

Webinar - Harness the Power of Data with Tableau - 2016-02-18
Webinar - Harness the Power of Data with Tableau - 2016-02-18Webinar - Harness the Power of Data with Tableau - 2016-02-18
Webinar - Harness the Power of Data with Tableau - 2016-02-18TechSoup
 
What is the Value of SAS Analytics?
What is the Value of SAS Analytics?What is the Value of SAS Analytics?
What is the Value of SAS Analytics?SAS Canada
 
Audit Webinar: Surefire ways to succeed with Data Analytics
Audit Webinar: Surefire ways to succeed with Data AnalyticsAudit Webinar: Surefire ways to succeed with Data Analytics
Audit Webinar: Surefire ways to succeed with Data AnalyticsCaseWare IDEA
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Domino Data Lab
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeCaserta
 
A Brief Introduction to Enterprise Architecture
A Brief Introduction to  Enterprise Architecture A Brief Introduction to  Enterprise Architecture
A Brief Introduction to Enterprise Architecture Daljit Banger
 
Predictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing MeetupPredictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing MeetupCaserta
 
Balancing Data Governance and Innovation
Balancing Data Governance and InnovationBalancing Data Governance and Innovation
Balancing Data Governance and InnovationCaserta
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Caserta
 
Leadership beyond Agile XP2015
Leadership beyond Agile XP2015Leadership beyond Agile XP2015
Leadership beyond Agile XP2015samililja
 
Building Information Governance Policies and Workflows
Building Information Governance Policies and WorkflowsBuilding Information Governance Policies and Workflows
Building Information Governance Policies and WorkflowskCura_Relativity
 
Simplifying Analytics - by Novoniel Deb
Simplifying Analytics - by Novoniel DebSimplifying Analytics - by Novoniel Deb
Simplifying Analytics - by Novoniel DebNovoniel Deb
 
Part 2 - 20 Years in Healthcare Analytics & Data Warehousing: What did we lea...
Part 2 - 20 Years in Healthcare Analytics & Data Warehousing: What did we lea...Part 2 - 20 Years in Healthcare Analytics & Data Warehousing: What did we lea...
Part 2 - 20 Years in Healthcare Analytics & Data Warehousing: What did we lea...Health Catalyst
 
Measuring Impact: Towards a data citation metric
Measuring Impact: Towards a data citation metricMeasuring Impact: Towards a data citation metric
Measuring Impact: Towards a data citation metricEdward Baker
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation Caserta
 
Audit: Breaking Down Barriers to Increase the Use of Data Analytics
Audit: Breaking Down Barriers to Increase the Use of Data AnalyticsAudit: Breaking Down Barriers to Increase the Use of Data Analytics
Audit: Breaking Down Barriers to Increase the Use of Data AnalyticsCaseWare IDEA
 
Earley Executive Roundtable on Data Analytics - Session 2 - Mining Business I...
Earley Executive Roundtable on Data Analytics - Session 2 - Mining Business I...Earley Executive Roundtable on Data Analytics - Session 2 - Mining Business I...
Earley Executive Roundtable on Data Analytics - Session 2 - Mining Business I...Earley Information Science
 
The State of Streaming Analytics: The Need for Speed and Scale
The State of Streaming Analytics: The Need for Speed and ScaleThe State of Streaming Analytics: The Need for Speed and Scale
The State of Streaming Analytics: The Need for Speed and ScaleVoltDB
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceCaserta
 

Semelhante a Keynote: Bias in Search and Recommender Systems (20)

Webinar - Harness the Power of Data with Tableau - 2016-02-18
Webinar - Harness the Power of Data with Tableau - 2016-02-18Webinar - Harness the Power of Data with Tableau - 2016-02-18
Webinar - Harness the Power of Data with Tableau - 2016-02-18
 
What is the Value of SAS Analytics?
What is the Value of SAS Analytics?What is the Value of SAS Analytics?
What is the Value of SAS Analytics?
 
Audit Webinar: Surefire ways to succeed with Data Analytics
Audit Webinar: Surefire ways to succeed with Data AnalyticsAudit Webinar: Surefire ways to succeed with Data Analytics
Audit Webinar: Surefire ways to succeed with Data Analytics
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data Lake
 
A Brief Introduction to Enterprise Architecture
A Brief Introduction to  Enterprise Architecture A Brief Introduction to  Enterprise Architecture
A Brief Introduction to Enterprise Architecture
 
Predictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing MeetupPredictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing Meetup
 
Balancing Data Governance and Innovation
Balancing Data Governance and InnovationBalancing Data Governance and Innovation
Balancing Data Governance and Innovation
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)
 
Leadership beyond Agile XP2015
Leadership beyond Agile XP2015Leadership beyond Agile XP2015
Leadership beyond Agile XP2015
 
Building Information Governance Policies and Workflows
Building Information Governance Policies and WorkflowsBuilding Information Governance Policies and Workflows
Building Information Governance Policies and Workflows
 
Simplifying Analytics - by Novoniel Deb
Simplifying Analytics - by Novoniel DebSimplifying Analytics - by Novoniel Deb
Simplifying Analytics - by Novoniel Deb
 
Part 2 - 20 Years in Healthcare Analytics & Data Warehousing: What did we lea...
Part 2 - 20 Years in Healthcare Analytics & Data Warehousing: What did we lea...Part 2 - 20 Years in Healthcare Analytics & Data Warehousing: What did we lea...
Part 2 - 20 Years in Healthcare Analytics & Data Warehousing: What did we lea...
 
Measuring Impact: Towards a data citation metric
Measuring Impact: Towards a data citation metricMeasuring Impact: Towards a data citation metric
Measuring Impact: Towards a data citation metric
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation
 
Cerias talk on testing and evaluation
Cerias talk on testing and evaluationCerias talk on testing and evaluation
Cerias talk on testing and evaluation
 
Audit: Breaking Down Barriers to Increase the Use of Data Analytics
Audit: Breaking Down Barriers to Increase the Use of Data AnalyticsAudit: Breaking Down Barriers to Increase the Use of Data Analytics
Audit: Breaking Down Barriers to Increase the Use of Data Analytics
 
Earley Executive Roundtable on Data Analytics - Session 2 - Mining Business I...
Earley Executive Roundtable on Data Analytics - Session 2 - Mining Business I...Earley Executive Roundtable on Data Analytics - Session 2 - Mining Business I...
Earley Executive Roundtable on Data Analytics - Session 2 - Mining Business I...
 
The State of Streaming Analytics: The Need for Speed and Scale
The State of Streaming Analytics: The Need for Speed and ScaleThe State of Streaming Analytics: The Need for Speed and Scale
The State of Streaming Analytics: The Need for Speed and Scale
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 

Mais de Catalyst

Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...
Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...
Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...Catalyst
 
TechSEO Boost 2021 - Cultivating a Product Mindset for Success
TechSEO Boost 2021 - Cultivating a Product Mindset for SuccessTechSEO Boost 2021 - Cultivating a Product Mindset for Success
TechSEO Boost 2021 - Cultivating a Product Mindset for SuccessCatalyst
 
TechSEO Boost 2021 - SEO Experimentation
TechSEO Boost 2021 - SEO ExperimentationTechSEO Boost 2021 - SEO Experimentation
TechSEO Boost 2021 - SEO ExperimentationCatalyst
 
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...Catalyst
 
TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...
TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...
TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...Catalyst
 
10 Trends Changing Programmatic
10 Trends Changing Programmatic10 Trends Changing Programmatic
10 Trends Changing ProgrammaticCatalyst
 
New Commerce Conference: Charting a Course to Success with Your Retail Media ...
New Commerce Conference: Charting a Course to Success with Your Retail Media ...New Commerce Conference: Charting a Course to Success with Your Retail Media ...
New Commerce Conference: Charting a Course to Success with Your Retail Media ...Catalyst
 
The New Commerce Conference: The Omni-channel Imperative
The New Commerce Conference: The Omni-channel ImperativeThe New Commerce Conference: The Omni-channel Imperative
The New Commerce Conference: The Omni-channel ImperativeCatalyst
 
New Commerce Commerce: All Things Instacart
New Commerce Commerce: All Things InstacartNew Commerce Commerce: All Things Instacart
New Commerce Commerce: All Things InstacartCatalyst
 
The Power of SEO: Protect Your Bottom Line & Future Proof Your Brand
The Power of SEO: Protect Your Bottom Line & Future Proof Your BrandThe Power of SEO: Protect Your Bottom Line & Future Proof Your Brand
The Power of SEO: Protect Your Bottom Line & Future Proof Your BrandCatalyst
 
The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...
The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...
The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...Catalyst
 
Reignite Your Business with Performance Marketing: 4 Ways to Fuel Your Reopening
Reignite Your Business with Performance Marketing: 4 Ways to Fuel Your ReopeningReignite Your Business with Performance Marketing: 4 Ways to Fuel Your Reopening
Reignite Your Business with Performance Marketing: 4 Ways to Fuel Your ReopeningCatalyst
 
Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...
Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...
Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...Catalyst
 
Evolve Your Social Commerce Strategy: Thinking Beyond Facebook
Evolve Your Social Commerce Strategy: Thinking Beyond FacebookEvolve Your Social Commerce Strategy: Thinking Beyond Facebook
Evolve Your Social Commerce Strategy: Thinking Beyond FacebookCatalyst
 
B2B SEO: Increase Traffic & Leads in 2020
B2B SEO: Increase Traffic & Leads in 2020B2B SEO: Increase Traffic & Leads in 2020
B2B SEO: Increase Traffic & Leads in 2020Catalyst
 
TechSEO Boost 2019: Research Competition
TechSEO Boost 2019: Research CompetitionTechSEO Boost 2019: Research Competition
TechSEO Boost 2019: Research CompetitionCatalyst
 
NLP Powered Outreach Link Building
NLP Powered Outreach Link BuildingNLP Powered Outreach Link Building
NLP Powered Outreach Link BuildingCatalyst
 
Generating Qualitative Content with GPT-2 in All Languages
Generating Qualitative Content with GPT-2 in All LanguagesGenerating Qualitative Content with GPT-2 in All Languages
Generating Qualitative Content with GPT-2 in All LanguagesCatalyst
 
Automate, Create Tools, & Test Ideas Quickly with Google Apps Script
Automate, Create Tools, & Test Ideas Quickly with Google Apps ScriptAutomate, Create Tools, & Test Ideas Quickly with Google Apps Script
Automate, Create Tools, & Test Ideas Quickly with Google Apps ScriptCatalyst
 
NLP for SEO
NLP for SEONLP for SEO
NLP for SEOCatalyst
 

Mais de Catalyst (20)

Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...
Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...
Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...
 
TechSEO Boost 2021 - Cultivating a Product Mindset for Success
TechSEO Boost 2021 - Cultivating a Product Mindset for SuccessTechSEO Boost 2021 - Cultivating a Product Mindset for Success
TechSEO Boost 2021 - Cultivating a Product Mindset for Success
 
TechSEO Boost 2021 - SEO Experimentation
TechSEO Boost 2021 - SEO ExperimentationTechSEO Boost 2021 - SEO Experimentation
TechSEO Boost 2021 - SEO Experimentation
 
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
 
TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...
TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...
TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...
 
10 Trends Changing Programmatic
10 Trends Changing Programmatic10 Trends Changing Programmatic
10 Trends Changing Programmatic
 
New Commerce Conference: Charting a Course to Success with Your Retail Media ...
New Commerce Conference: Charting a Course to Success with Your Retail Media ...New Commerce Conference: Charting a Course to Success with Your Retail Media ...
New Commerce Conference: Charting a Course to Success with Your Retail Media ...
 
The New Commerce Conference: The Omni-channel Imperative
The New Commerce Conference: The Omni-channel ImperativeThe New Commerce Conference: The Omni-channel Imperative
The New Commerce Conference: The Omni-channel Imperative
 
New Commerce Commerce: All Things Instacart
New Commerce Commerce: All Things InstacartNew Commerce Commerce: All Things Instacart
New Commerce Commerce: All Things Instacart
 
The Power of SEO: Protect Your Bottom Line & Future Proof Your Brand
The Power of SEO: Protect Your Bottom Line & Future Proof Your BrandThe Power of SEO: Protect Your Bottom Line & Future Proof Your Brand
The Power of SEO: Protect Your Bottom Line & Future Proof Your Brand
 
The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...
The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...
The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...
 
Reignite Your Business with Performance Marketing: 4 Ways to Fuel Your Reopening
Reignite Your Business with Performance Marketing: 4 Ways to Fuel Your ReopeningReignite Your Business with Performance Marketing: 4 Ways to Fuel Your Reopening
Reignite Your Business with Performance Marketing: 4 Ways to Fuel Your Reopening
 
Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...
Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...
Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...
 
Evolve Your Social Commerce Strategy: Thinking Beyond Facebook
Evolve Your Social Commerce Strategy: Thinking Beyond FacebookEvolve Your Social Commerce Strategy: Thinking Beyond Facebook
Evolve Your Social Commerce Strategy: Thinking Beyond Facebook
 
B2B SEO: Increase Traffic & Leads in 2020
B2B SEO: Increase Traffic & Leads in 2020B2B SEO: Increase Traffic & Leads in 2020
B2B SEO: Increase Traffic & Leads in 2020
 
TechSEO Boost 2019: Research Competition
TechSEO Boost 2019: Research CompetitionTechSEO Boost 2019: Research Competition
TechSEO Boost 2019: Research Competition
 
NLP Powered Outreach Link Building
NLP Powered Outreach Link BuildingNLP Powered Outreach Link Building
NLP Powered Outreach Link Building
 
Generating Qualitative Content with GPT-2 in All Languages
Generating Qualitative Content with GPT-2 in All LanguagesGenerating Qualitative Content with GPT-2 in All Languages
Generating Qualitative Content with GPT-2 in All Languages
 
Automate, Create Tools, & Test Ideas Quickly with Google Apps Script
Automate, Create Tools, & Test Ideas Quickly with Google Apps ScriptAutomate, Create Tools, & Test Ideas Quickly with Google Apps Script
Automate, Create Tools, & Test Ideas Quickly with Google Apps Script
 
NLP for SEO
NLP for SEONLP for SEO
NLP for SEO
 

Último

Digital Marketing Spotlight: Lifecycle Advertising Strategies.pdf
Digital Marketing Spotlight: Lifecycle Advertising Strategies.pdfDigital Marketing Spotlight: Lifecycle Advertising Strategies.pdf
Digital Marketing Spotlight: Lifecycle Advertising Strategies.pdfDemandbase
 
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
The Impact of Digital Technologies
The Impact of Digital Technologies The Impact of Digital Technologies
The Impact of Digital Technologies bruguardarib
 
McDonald's: A Journey Through Time (PPT)
McDonald's: A Journey Through Time (PPT)McDonald's: A Journey Through Time (PPT)
McDonald's: A Journey Through Time (PPT)DEVARAJV16
 
Fueling A_B experiments with behavioral insights (1).pdf
Fueling A_B experiments with behavioral insights (1).pdfFueling A_B experiments with behavioral insights (1).pdf
Fueling A_B experiments with behavioral insights (1).pdfVWO
 
From Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEOFrom Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEOSzymon SƂowik
 
Inbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon Garside
Inbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon GarsideInbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon Garside
Inbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon Garsiderobwhite630290
 
TAM AdEx 2023 Cross Media Advertising Recap - Auto Sector
TAM AdEx 2023 Cross Media Advertising Recap - Auto SectorTAM AdEx 2023 Cross Media Advertising Recap - Auto Sector
TAM AdEx 2023 Cross Media Advertising Recap - Auto SectorSocial Samosa
 
Influencer Marketing Power point presentation
Influencer Marketing  Power point presentationInfluencer Marketing  Power point presentation
Influencer Marketing Power point presentationdgtivemarketingagenc
 
5 Digital Marketing Tips | Devherds Software Solutions
5 Digital Marketing Tips | Devherds Software Solutions5 Digital Marketing Tips | Devherds Software Solutions
5 Digital Marketing Tips | Devherds Software SolutionsDevherds Software Solutions
 
Most Influential HR Leaders Leading the Corporate World, 2024 (Final file).pdf
Most Influential HR Leaders Leading the Corporate World, 2024 (Final file).pdfMost Influential HR Leaders Leading the Corporate World, 2024 (Final file).pdf
Most Influential HR Leaders Leading the Corporate World, 2024 (Final file).pdfCIO Business World
 
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdfResearch and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdfVWO
 
Common Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic CreativityCommon Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic CreativityMonishka Adhikari
 
What are the 4 characteristics of CTAs that convert?
What are the 4 characteristics of CTAs that convert?What are the 4 characteristics of CTAs that convert?
What are the 4 characteristics of CTAs that convert?Juan Pineda
 
Exploring The World Of Adult Ad Networks.pdf
Exploring The World Of Adult Ad Networks.pdfExploring The World Of Adult Ad Networks.pdf
Exploring The World Of Adult Ad Networks.pdfadult marketing
 
The power of SEO-driven market intelligence
The power of SEO-driven market intelligenceThe power of SEO-driven market intelligence
The power of SEO-driven market intelligenceHinde Lamrani
 
Jai Institute for Parenting Program Guide
Jai Institute for Parenting Program GuideJai Institute for Parenting Program Guide
Jai Institute for Parenting Program Guidekiva6
 
Exploring Web 3.0 Growth marketing: Navigating the Future of the Internet
Exploring Web 3.0 Growth marketing: Navigating the Future of the InternetExploring Web 3.0 Growth marketing: Navigating the Future of the Internet
Exploring Web 3.0 Growth marketing: Navigating the Future of the Internetnehapardhi711
 
VIP Call Girls In Green Park 9654467111 Escorts Service
VIP Call Girls In Green Park 9654467111 Escorts ServiceVIP Call Girls In Green Park 9654467111 Escorts Service
VIP Call Girls In Green Park 9654467111 Escorts ServiceSapana Sha
 
ćźšćˆ¶(ULVæŻ•äžšèŻäčŠ)æ‹‰æ–‡ć€§ć­ŠæŻ•äžšèŻæˆç»©ć•ćŽŸç‰ˆäž€æŻ”äž€
ćźšćˆ¶(ULVæŻ•äžšèŻäčŠ)æ‹‰æ–‡ć€§ć­ŠæŻ•äžšèŻæˆç»©ć•ćŽŸç‰ˆäž€æŻ”äž€ćźšćˆ¶(ULVæŻ•äžšèŻäčŠ)æ‹‰æ–‡ć€§ć­ŠæŻ•äžšèŻæˆç»©ć•ćŽŸç‰ˆäž€æŻ”äž€
ćźšćˆ¶(ULVæŻ•äžšèŻäčŠ)æ‹‰æ–‡ć€§ć­ŠæŻ•äžšèŻæˆç»©ć•ćŽŸç‰ˆäž€æŻ”äž€s SS
 

Último (20)

Digital Marketing Spotlight: Lifecycle Advertising Strategies.pdf
Digital Marketing Spotlight: Lifecycle Advertising Strategies.pdfDigital Marketing Spotlight: Lifecycle Advertising Strategies.pdf
Digital Marketing Spotlight: Lifecycle Advertising Strategies.pdf
 
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
 
The Impact of Digital Technologies
The Impact of Digital Technologies The Impact of Digital Technologies
The Impact of Digital Technologies
 
McDonald's: A Journey Through Time (PPT)
McDonald's: A Journey Through Time (PPT)McDonald's: A Journey Through Time (PPT)
McDonald's: A Journey Through Time (PPT)
 
Fueling A_B experiments with behavioral insights (1).pdf
Fueling A_B experiments with behavioral insights (1).pdfFueling A_B experiments with behavioral insights (1).pdf
Fueling A_B experiments with behavioral insights (1).pdf
 
From Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEOFrom Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEO
 
Inbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon Garside
Inbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon GarsideInbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon Garside
Inbound Marekting 2.0 - The Paradigm Shift in Marketing | Axon Garside
 
TAM AdEx 2023 Cross Media Advertising Recap - Auto Sector
TAM AdEx 2023 Cross Media Advertising Recap - Auto SectorTAM AdEx 2023 Cross Media Advertising Recap - Auto Sector
TAM AdEx 2023 Cross Media Advertising Recap - Auto Sector
 
Influencer Marketing Power point presentation
Influencer Marketing  Power point presentationInfluencer Marketing  Power point presentation
Influencer Marketing Power point presentation
 
5 Digital Marketing Tips | Devherds Software Solutions
5 Digital Marketing Tips | Devherds Software Solutions5 Digital Marketing Tips | Devherds Software Solutions
5 Digital Marketing Tips | Devherds Software Solutions
 
Most Influential HR Leaders Leading the Corporate World, 2024 (Final file).pdf
Most Influential HR Leaders Leading the Corporate World, 2024 (Final file).pdfMost Influential HR Leaders Leading the Corporate World, 2024 (Final file).pdf
Most Influential HR Leaders Leading the Corporate World, 2024 (Final file).pdf
 
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdfResearch and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
 
Common Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic CreativityCommon Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic Creativity
 
What are the 4 characteristics of CTAs that convert?
What are the 4 characteristics of CTAs that convert?What are the 4 characteristics of CTAs that convert?
What are the 4 characteristics of CTAs that convert?
 
Exploring The World Of Adult Ad Networks.pdf
Exploring The World Of Adult Ad Networks.pdfExploring The World Of Adult Ad Networks.pdf
Exploring The World Of Adult Ad Networks.pdf
 
The power of SEO-driven market intelligence
The power of SEO-driven market intelligenceThe power of SEO-driven market intelligence
The power of SEO-driven market intelligence
 
Jai Institute for Parenting Program Guide
Jai Institute for Parenting Program GuideJai Institute for Parenting Program Guide
Jai Institute for Parenting Program Guide
 
Exploring Web 3.0 Growth marketing: Navigating the Future of the Internet
Exploring Web 3.0 Growth marketing: Navigating the Future of the InternetExploring Web 3.0 Growth marketing: Navigating the Future of the Internet
Exploring Web 3.0 Growth marketing: Navigating the Future of the Internet
 
VIP Call Girls In Green Park 9654467111 Escorts Service
VIP Call Girls In Green Park 9654467111 Escorts ServiceVIP Call Girls In Green Park 9654467111 Escorts Service
VIP Call Girls In Green Park 9654467111 Escorts Service
 
ćźšćˆ¶(ULVæŻ•äžšèŻäčŠ)æ‹‰æ–‡ć€§ć­ŠæŻ•äžšèŻæˆç»©ć•ćŽŸç‰ˆäž€æŻ”äž€
ćźšćˆ¶(ULVæŻ•äžšèŻäčŠ)æ‹‰æ–‡ć€§ć­ŠæŻ•äžšèŻæˆç»©ć•ćŽŸç‰ˆäž€æŻ”äž€ćźšćˆ¶(ULVæŻ•äžšèŻäčŠ)æ‹‰æ–‡ć€§ć­ŠæŻ•äžšèŻæˆç»©ć•ćŽŸç‰ˆäž€æŻ”äž€
ćźšćˆ¶(ULVæŻ•äžšèŻäčŠ)æ‹‰æ–‡ć€§ć­ŠæŻ•äžšèŻæˆç»©ć•ćŽŸç‰ˆäž€æŻ”äž€
 

Keynote: Bias in Search and Recommender Systems

  • 1. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost #TechSEOBoost | @CatalystSEM THANK YOU TO THIS YEAR’S SPONSORS Keynote: Bias in Search and Recommender Systems Ricardo Baeza-Yates, NTENT
  • 2. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Ricardo Baeza-Yates CTO, NTENT Biases in Search & Recommender Systems
  • 3. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost NTENT ntent.com Marketing Engineering Operations International Applied Research ntent.com
  • 4. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Prologue
  • 5. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost A Bit of History Data Volume Complexity IR DB Two different points of view for data
  • 6. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Data Understanding Data Query Unstructured Structured Explicit Information Retrieval (Relational) Databases Implicit Recommender Systems Unknown Data Mining
  • 7. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost What is Bias? ‱ Statistical: significant systematic deviation from a prior (unknown) distribution; ‱ Cultural: interpretations and judgments phenomena acquired through our life; ‱ Cognitive: systematic pattern of deviation from norm or rationality in judgment;
  • 8. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost So (Observational) Human Data has Bias ‱ Gender ‱ Racial ‱ Sexual ‱ Age ‱ Religious ‱ Social ‱ Linguistic ‱ Geographic ‱ Political ‱ Educational ‱ Economic ‱ Technological â–Ș Gathering process â–Ș Sampling process â–Ș Validity (e.g. temporal) â–Ș Completeness â–Ș Noise, spam Many people extrapolate results of a sample to the whole population (e.g., social media analysis) In addition there is bias when measuring bias as well as bias towards measuring it! Attempt of an unbiased (personal) view on bias in Search & RS Cultural Biases Statistical Biases Cognitive Biases Self-selection
  • 9. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Impact in Search and Recommender Systems ‱ Most web systems are optimized by using implicit user feedback ‱ However, user data is partly biased to the choices that these systems make ‱ Clicks can only be done on things that are shown to us ‱ As those systems are usually based in ML, they learn to reinforce their own biases, yielding self-fulfilled prophecies and/or sub-optimal solutions ‱ For example, personalization and filter bubbles for users ‱ but also echo chambers for (recommender) systems ‱ Moreover, sometimes these systems compete among themselves, learning also biases of other systems rather than real user behavior ‱ Even more, an improvement in one system might be just a degradation in another system that uses a different (even inversely correlated) optimization function ‱ For example, user experience vs. monetization
  • 10. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost A Non-Technical Question Algorithm Biased Data Neutral? Fair? Same Bias Garbage In Garbage Out
  • 11. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost What is being fair?
  • 12. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost A Non-Technical Question Algorithm Biased Data Neutral? Fair? Same Bias Not Always! Debias the input Tune the algorithm Debias the output Bias awareness!
  • 13. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost January 2017 ACM US Statement on Algorithm Transparency and Accountability 1. Awareness 2. Access and redress 3. Accountability 4. Explanation 5. Data Provenance 6. Auditability 7. Validation and Testing Systems do not need to be perfect, they just need to be better than us
  • 14. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Biases Everywhere!
  • 15. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Data bias Biases on Search & RS: Web Case Study Web Spam
  • 16. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost [Baeza-Yates, Castillo & LĂłpez, Cybermetrics, 2005] Number of linked domains Exports(thousandsofUS$) Economic Bias in Links
  • 17. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost 17 [Baeza-Yates & Castillo, WWW2006] Economic Bias in Links
  • 18. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost 18 Minimal effortShameCultural Bias in Websites [Baeza-Yates, Castillo, Efthimiadis, TOIT 2007]
  • 19. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Language Bias
  • 20. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost [Bolukbasi at al, NIPS 2016] ‱ Word embedding’s in w2vNEWS Yes, about 60 to 70% at work although at college is the inverse Gender Bias in Content Most journalists are men?
  • 21. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost [E. Graells-Garrido et al,. ACM Hypertext’15] Systemic bias? Equal opportunity? Gender Bias in Content Wikipedia Partial information
  • 22. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Data bias Activity bias Bias on Usage Actions People We are all in the long tail! [Goel et al., WSDM 2010]
  • 23. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Popularity Bias in Recommender Systems Items Users Popular items Rest of items (long tail) ‱ Take care to recommended items that are not too popular ‱ Metrics ‱ Novelty enhancement ‱ Problem solved! 
really? 𝑛𝑜𝑣 𝑖 = 1 − # ratings of 𝑖 # users Items #interactions More novel Less novel 𝑎 𝑏 [Vargas & Castells, RecSys 2011]
  • 24. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost A Self-fulfilling Prophecy? Popular items (short head) Rest of items (long tail) Observed user-item interaction Unobserved preference Items Users Ratings are missing not at random (MNAR)
  • 25. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Test data (relevant items) Training data Unobserved preference Items Users Popular items (short head) Rest of items (long tail) avg P@𝑘 ∌ + 𝑘 Judgments are missing not at random (MNAR) Worse yet: user-system reinforcement loop (more later) A Self-fulfilling Prophecy? [Marlin et al., RecSys 2010] [Steck, RecSys 2010, 2011] [Fleder & Hossanagar, Management Sciences 2009]
  • 26. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost 1.E-06 1.E-05 1.E-04 1.E-03 0 0.2 0.4 0.6 0.8 1 A Problem for IR Evaluation Methodology! 30 TREC collections Items / documents #ratings/judgments (infraction) [BellogĂ­n, Castells & Cantador, IRJ 2017] To how many queries is a document relevant? 25% queries can be answered with less than 1% of the URLs! [Baeza-Yates, Boldi, Chierichetti, WWW 2015]
  • 27. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Get Rid of the Popularity Bias! ‱ In the rating split [BellogĂ­n, Castells & Cantador, IRJ 2017] ‱ In the metrics ‱ Stratified recall [Steck, RecSys 2011] ‱ Importance propensity scoring [Yang et al., RecSys 2018] ‱ In the algorithms [Steck, RecSys 2011] [Lobato et al., ICML 2014] [Jannach et al., UMUAI 2015] [Cañamares & Castells, SIGIR 2018, best paper award] Test data (relevant items) Training data Unobserved preference Items Items #ratings Flat test Popularity strata #ratings Time Temporal split
  • 28. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Activity Bias also Affects Content [Baeza-Yates & Saez-Trumper, ACM Hypertext 2015] Most users are passive (i.e., more than 90% are lurkers) Then, which percentage of active users produce 50% of the content?
  • 29. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost October 2015
  • 30. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost [Baeza-Yates & Saez-Trumper, ACM Hypertext 2015] Which percentage of active users produce 50% of the content? Wisdom of crowds is a partial illusion Activity Bias: The Wisdom of a Few
  • 31. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Content Never Seen [Baeza-Yates & Saez-Trumper, ACM Hypertext 2015] The Digital Desert
  • 32. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Data bias Activity bias Sampling (size) bias Algorithmic bias Search or Recommender System Bias on the Web
  • 33. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost [E. Graells-Garrido & M. Lalmas, ACM Hypertext’14] Geographical Bias in Recommender Systems
  • 34. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost ‱ If we want to estimate the frequency of queries that appear with probability at least p with a certain relative error, ∊, we can use the standard binomial error formula which works well for p near Âœ but not for p near 0 ‱ Better is the Agresti-Coull technique (also called Take 2) which gives: where Z is the inverse of the standard normal distribution, 1 − đ›Œ is the confidence interval and ‱ If p = 0.1, 1 − đ›Œ is 80% and ∊ is 10%, the standard formula gives n = 900, while with A-C we get n = 2342. [Brown, Cai & DasGupta, Statistical Science, 2001] [Baeza-Yates, SIGIR 2015, Industry track] Sample Size?
  • 35. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost ‱ Standard technique: ‱ A good sample should cover well all the items distribution, but this does not work with very skewed distributions. 10 0 10 1 10 2 10 3 Rank 10 0 10 1 10 2 10 3 10 4 10 5 Frequency 50M 10M 100K 1K 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7 Rank 10 0 10 1 10 2 10 3 10 4 10 5 Frequency 50M 10M 100K 1K [Zaragoza et al, CIKM 2010] Sampling Queries
  • 36. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost 36 Stratified Sampling Example [Baeza-Yates, SIGIR 2015, Industry track]
  • 37. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Data bias Activity bias Sampling bias Interaction bias (Self) selection bias Bias in the User Interaction Search or Recommender System
  • 38. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Position bias Ranking bias Presentation or exposure bias Social bias Interaction bias Bias in the Interaction Amazon.com
  • 39. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Position bias Presentation bias Social bias Interaction bias Ranking bias Click bias Scrolling bias Mouse movement bias Data and algorithmic bias Self-selection bias Dependencies: A Cascade of Biases!
  • 40. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Ranking Bias in Web Search [Mediative Study, 2014]
  • 41. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Ranking Bias: Click Bias in Web Search ‱ Ranking & next page bias Navigational queries
  • 42. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost CTR (log) 1 11 21 Rank Learning to Rank with bias [Joachims et al., WSDM 2017, best paper] + many other papers Fair rankings [Zehlike et al., CIKM 2017] Clicks as implicit positive user feedback Debiasing Search Clicks and Other Biases [Dupret & Piwowarski, SIGIR 2008] [Chapelle & Zhang, WWW 2009] [Dupret & Liao, WSDM 2010, best paper] Debias the input Tune the algorithm Debias the output
  • 43. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Data bias Activity bias Sampling bias Interaction bias (Self) selection bias Second-order bias Vicious Cycle of Bias Search or Recommender System
  • 44. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost [Baeza-Yates, Pereira & Ziviani, WWW 2008] Person Web content is redundant (> 20%) Query Ranking bias in new content Redundancy grows (35%) Search results New page Second Order Bias in Web Content [Fortunato, Flammini, Menczer & Vespignani. PNAS 2006]
  • 45. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Data bias Activity bias Sampling bias Interaction bias (Self) selection bias Vicious Cycle of Bias Search or Recommender System Feedback loop bias
  • 46. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Bias due to Personalization ‱ The effect of self-selection bias ‱ Avoid the rich get richer and poor get poorer syndrome ‱ Avoid the echo chamber by empowering the tail Cold start problem solution: Explore & Exploit Partial solutions: ‱ Diversity ‱ Novelty ‱ Serendipity ‱ My dark side Wikipedia [Eli Pariser, Penguin 2011]
  • 47. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Users’ Eco Chambers in Recommender Systems â–Ș Filter bubbles â–Ș Degenerate feedback loops (e.g., YouTube autoplay) [Jiang et al., AAAI 2019]
  • 48. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Eco Chamber of the Recommender System ‱ Short-term greedy optimization, partial knowledge of the world ‱ Long-term revenue optimization is not achieved ‱ Views from new users should balance the exploration for new items ‱ Disparate impact: unfair ecommerce/information markets [Baeza-Yates & Ribaudo, to appear]
  • 49. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Epilogue
  • 50. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Recap Bias Type Statistical Cultural Cognitive Algorithmic ï‚š ? ? Presentation ï‚š Position ï‚š ï‚š ï‚š Data ï‚š ï‚š Sampling ï‚š ï‚š ï‚š Activity ï‚š Self-selection ï‚š ï‚š Interaction ï‚š ï‚š Social ï‚š ï‚š Second order ï‚š ï‚š ï‚š
  • 51. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost [Silberzahn et al., COS, Univ. of Virginia, 2015] Professional Bias? ➔ 61 analysts, 29 teams: 20 yes and 9 no
  • 52. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost If Systems Reflects Its Designers: What we can/should do? â–Ș Data â–Ș Analyze for known and unknown biases, debias when possible/needed â–Ș Recollect more data for difficult/sparse regions of the problem â–Ș Delete attributes associated directly/indirectly with harmful bias â–Ș Interaction â–Ș Make sure that the user is aware of the biases all the time â–Ș Give more control to the user â–Ș Design and Implementation ‱ Let experts/colleagues/users contest every step of the process â–Ș Evaluation ‱ Do not fool yourself!
  • 53. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Final Take-Home Message â–Ș Systems are a mirror of us, the good, the bad and the ugly â–Ș The Web amplifies everything, but always leaves traces â–Ș We need to be aware of our own biases! â–Ș We must be aware of the biases and contrarrest them to stop the vicious bias cycle â–Ș Plenty of open (research) problems! Big Data of People is huge
.. 
.. but it is tiny compared to the future Big Data of the Internet of Things (IoT) No activity bias!
  • 54. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Thank You! Any Questions? – rbaeza@acm.org | http://www.baeza.cl/ | http://fairness-measures.org Biased Questions? ASIST 2012 Book of the Year Award (Biased Ad)
  • 55. Ricardo Baeza-Yates © | @polarbearby | #TechSEOBoost Thanks for Viewing the Slideshare! – Watch the Recording: https://youtube.com/session-example Or Contact us today to discover how Catalyst can deliver unparalleled SEO results for your business. https://www.catalystdigital.com/