SlideShare uma empresa Scribd logo
1 de 32
Baixar para ler offline
October 28, 2017
Giuseppe “Pino” Di Fabbrizio
Rakuten Institute of Technology – Boston
3
• Motivations
• Traditional information retrieval models
• Learning-to-rank models
• Relevance
• Ranking Metrics
• Algorithms
• Ranking optimization
• Use cases
• Summary
• What is next?
Disclaimer: If not otherwise specified, images in this presentation
comply with the (CC) creative commons publishing license
4
• E-commerce growing faster than
traditional brick-and-mortar market
($4.06T by 2020)
• Mobile shopping adoption
increasing worldwide (46%
shoppers in Asia and 28% in North
America)
• Online catalogs offering broader
selections and competitive products
• Electronic money transactions
gaining more consumers’ trust
• Massive data collected during web
and mobile interactions providing
foundation for machine learning-
driven optimizations
1.61B
Shoppers
$1.86T
Sales
$150B*
Revenues
ML
*2016 Combined revenues for Amazon, Otto Group, and Rakuten
https://www.statista.com/topics/871/online-shopping/
5
6
250M+ Products
40k+ Categories
7
How do we find
the most relevant
products for a
search query?
www.rakuten.com
Oct 10, 2017
8Query
Ranking
function
Documents
www.rakuten.com
Nov 2016
1 2 3
4 5 6
7 8 9
9
• Relevance is estimated by
lexical matches of query
terms with document terms
• Examples:
• Boolean models
• Vector space models
• Latent semantic indexing
• Okapi BM25
Index
Indexer
Query
Documents
Scoring
model
Top-n retrieved
documents
On-line
Off-line
10
www.rakuten.com
Oct 10, 2017
Query (Q)
Document 1 (D1)
Document 2 (D2)
iphone
7
case
iphone 7 Case
Q 1 1 1
D1 2 2 2
D2 3 1 0
Q
D1
D2
11
• Basic ideas
• Lexical similarity metrics
• Penalizing repeated occurrences of the same term
• Penalizing term frequency for longer documents
• Only few features
• Manually hand-tuned feature weights based on heuristic
• Cannot include important search signals such as user’s
feedback, product popularity, purchase history, etc.
• Fast and scalable
12
• Data-driven approach
• Directly optimize products rank based on relevance (different
from classification and regression ML tasks)
• Handle thousands of features
• Robust to noisy data
• Handle personalization
• Industry & research state-of-the-art (Amazon, eBay,
Microsoft, Yahoo!, Yandex, etc.)
13
A document is relevant if contains the information the
user was looking for when submitted the query
Relevance is subjective and depends on many factors:
• context (what is displayed and how)
• task (purchase, search info, answer, etc.)
• novelty (unexpected data, ads, ext.)
• time and user’s effort involved
14
1
3
2
www.rakuten.com
Nov 2016
15
buyclick add
www.rakuten.com
Nov 2016
16
• Clickthrough data (user’s implicit
feedback) as source of relevance for
search query / document pairs
• Pros
• Abundant and easy to harvest
• Always fresh
• Unbiased
• Cons
• Noisy
• Long tail queries
• Simple relevance mapping:
• score = 0 (not relevant), score = 3 (highly
relevant)
• Purchase > cart > click > impression
Score User’s implicit feedback
3 Product purchased
2 Product added to the shopping cart
1 Product clicked
0 No clicks
17
Seen products
Potentially
seen products
Unseen
products
Browser
viewport
Click
www.rakuten.com
Aug 2017
18
Documents
Normalized and Discounted Cumulative Gain
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
1 2 3 4 5 6 7 8 9 10
NDCG
19
• Tree ensemble method
• Handle sparse data
• Handle missing values and various value types
• Robust to outliers
• Learn higher-order feature interactions
• Invariant to feature scaling
• Highly scalable and optimized open source
implementation (XGBoost)
20
Point-wise
• Input: single documents / Output: class labels or scores
• Classify each document as relevant or non-relevant.
• Adjust w to reduce classification errors
Pairwise ranking
• Input: document pairs / Output: partial order preferences
• Classify pairs of documents – D1 > D2?
• Adjust w to reduce discordant pairs
List-wise ranking
• Input: document collections / ranked document list
• Score permutations -- Is {D1,D2,…} > {D1’,D2’,…} ?
• Adjust w to directly maximize ranking measure of interest (NDCG)
Di
Q
Q
DjDi >
Q
DjDi > Dk>
21
Green = relevant
Gray = not-relevant
Blue arrows = boost for
pair-wise loss function
Red arrows = boost for
list-wise loss function
(a) is the perfect ranking;
(b) is ranking with 10 pairwise errors;
(c) is ranking with 8 pairwise errors
22
• Relevance: User’s behavior signals
• Ranking Metrics: NDCG
• Machine Learning Algorithm: Gradient Tree
Boosting
• Ranking optimization: List-wise with NDCG
metrics
23
Index
Indexer
Query
Documents
Scoring
model
Scores
Query
Features
Training
data
Learning
to rank
Re-ranking
model
Top-n ranked
documents (n > 1M)
Top-m re-ranked
documents (m < 1k)
On-line
Off-line
Relevance
24www.rakuten.com
Mar 2017
25
Search Query: “40inch tv”
Regular text
search
Search with user’s signals
and learning-to-rank models
Not relevant
Not relevant
Not relevant
26
Conversion Rate
(Simulation)
NDCG CTR Simulated
Queries
Relative gain 15.58% 7.50% 10,000
Depth /
Estimators
5 / 500 3 / 500 10 / 500 3 / 500
NDCG 0.687 0.688 0.685 0.689
Relative gain 15.14% 15.41% 14.92% 15.58%
Training time
(56 cores)
2:45:48 1:20:57 35:25:44 1:58:07
27
Automatic
Speech
Recognition
Computer
Vision
Natural
Language
Processing
Information
Retrieval
2011 2013 2013-2015 2017?
28
Bhaskar Mitra, Fernando Diaz, and Nick Craswell. 2017. Learning to Match using Local and Distributed Representations of
Text for Web Search. In Proceedings of the 26th International Conference on World Wide Web (WWW '17).
29
Bhaskar Mitra, Fernando Diaz, and Nick Craswell. 2017. Learning to Match using Local and Distributed Representations of
Text for Web Search. In Proceedings of the 26th International Conference on World Wide Web (WWW '17).
30
Bhaskar Mitra, Fernando Diaz, and Nick Craswell. 2017. Learning to Match using Local and Distributed Representations of
Text for Web Search. In Proceedings of the 26th International Conference on World Wide Web (WWW '17).
31
• Traditional IR methods do not scale to modern e-commerce needs
• User’s implicit feedback is a proxy for search query / document pairs
relevance
• Learning-to-rank (LTR) methods scale to thousand of features and are
robust to data noise
• LTR with listwise-based loss function substantially improve search
relevance (15.6% NDCG increase on e-commerce data)
• NDCG improvements directly correlate to conversion rates (7.5% CTR
increase on e-commerce data)
• DNN methods for IR are starting to outperform traditional ML methods
Find it! Nail it!Boosting e-commerce search conversions with machine learning at scale

Mais conteúdo relacionado

Mais procurados

Matrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsMatrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsYONG ZHENG
 
How to Build your Training Set for a Learning To Rank Project
How to Build your Training Set for a Learning To Rank ProjectHow to Build your Training Set for a Learning To Rank Project
How to Build your Training Set for a Learning To Rank ProjectSease
 
An introduction to Recommender Systems
An introduction to Recommender SystemsAn introduction to Recommender Systems
An introduction to Recommender SystemsDavid Zibriczky
 
Neural Learning to Rank
Neural Learning to RankNeural Learning to Rank
Neural Learning to RankBhaskar Mitra
 
Data/AI driven product development: from video streaming to telehealth
Data/AI driven product development: from video streaming to telehealthData/AI driven product development: from video streaming to telehealth
Data/AI driven product development: from video streaming to telehealthXavier Amatriain
 
Learning to Rank Datasets for Search with Oscar Castaneda
Learning to Rank Datasets for Search with Oscar CastanedaLearning to Rank Datasets for Search with Oscar Castaneda
Learning to Rank Datasets for Search with Oscar CastanedaDatabricks
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender systemStanley Wang
 
Correlation, causation and incrementally recommendation problems at netflix ...
Correlation, causation and incrementally  recommendation problems at netflix ...Correlation, causation and incrementally  recommendation problems at netflix ...
Correlation, causation and incrementally recommendation problems at netflix ...Roelof van Zwol
 
Machine Learning - Dataset Preparation
Machine Learning - Dataset PreparationMachine Learning - Dataset Preparation
Machine Learning - Dataset PreparationAndrew Ferlitsch
 
Lessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsLessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsXavier Amatriain
 
Learning to Rank for Recommender Systems - ACM RecSys 2013 tutorial
Learning to Rank for Recommender Systems -  ACM RecSys 2013 tutorialLearning to Rank for Recommender Systems -  ACM RecSys 2013 tutorial
Learning to Rank for Recommender Systems - ACM RecSys 2013 tutorialAlexandros Karatzoglou
 
Sequential Decision Making in Recommendations
Sequential Decision Making in RecommendationsSequential Decision Making in Recommendations
Sequential Decision Making in RecommendationsJaya Kawale
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender SystemsJustin Basilico
 
Steffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SFSteffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SFMLconf
 
System design for recommendations and search
System design for recommendations and searchSystem design for recommendations and search
System design for recommendations and searchEugene Yan Ziyou
 
Privacy-preserving Data Mining in Industry (WWW 2019 Tutorial)
Privacy-preserving Data Mining in Industry (WWW 2019 Tutorial)Privacy-preserving Data Mining in Industry (WWW 2019 Tutorial)
Privacy-preserving Data Mining in Industry (WWW 2019 Tutorial)Krishnaram Kenthapadi
 
Customer Segmentation using Clustering
Customer Segmentation using ClusteringCustomer Segmentation using Clustering
Customer Segmentation using ClusteringDessy Amirudin
 
Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021
Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021
Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021Mirko Marras
 

Mais procurados (20)

Matrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsMatrix Factorization In Recommender Systems
Matrix Factorization In Recommender Systems
 
How to Build your Training Set for a Learning To Rank Project
How to Build your Training Set for a Learning To Rank ProjectHow to Build your Training Set for a Learning To Rank Project
How to Build your Training Set for a Learning To Rank Project
 
An introduction to Recommender Systems
An introduction to Recommender SystemsAn introduction to Recommender Systems
An introduction to Recommender Systems
 
Neural Learning to Rank
Neural Learning to RankNeural Learning to Rank
Neural Learning to Rank
 
Data/AI driven product development: from video streaming to telehealth
Data/AI driven product development: from video streaming to telehealthData/AI driven product development: from video streaming to telehealth
Data/AI driven product development: from video streaming to telehealth
 
Learning to Rank Datasets for Search with Oscar Castaneda
Learning to Rank Datasets for Search with Oscar CastanedaLearning to Rank Datasets for Search with Oscar Castaneda
Learning to Rank Datasets for Search with Oscar Castaneda
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
 
Correlation, causation and incrementally recommendation problems at netflix ...
Correlation, causation and incrementally  recommendation problems at netflix ...Correlation, causation and incrementally  recommendation problems at netflix ...
Correlation, causation and incrementally recommendation problems at netflix ...
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine Learning - Dataset Preparation
Machine Learning - Dataset PreparationMachine Learning - Dataset Preparation
Machine Learning - Dataset Preparation
 
Lessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsLessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systems
 
Learning to Rank for Recommender Systems - ACM RecSys 2013 tutorial
Learning to Rank for Recommender Systems -  ACM RecSys 2013 tutorialLearning to Rank for Recommender Systems -  ACM RecSys 2013 tutorial
Learning to Rank for Recommender Systems - ACM RecSys 2013 tutorial
 
Sequential Decision Making in Recommendations
Sequential Decision Making in RecommendationsSequential Decision Making in Recommendations
Sequential Decision Making in Recommendations
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
Steffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SFSteffen Rendle, Research Scientist, Google at MLconf SF
Steffen Rendle, Research Scientist, Google at MLconf SF
 
System design for recommendations and search
System design for recommendations and searchSystem design for recommendations and search
System design for recommendations and search
 
Learn to Rank search results
Learn to Rank search resultsLearn to Rank search results
Learn to Rank search results
 
Privacy-preserving Data Mining in Industry (WWW 2019 Tutorial)
Privacy-preserving Data Mining in Industry (WWW 2019 Tutorial)Privacy-preserving Data Mining in Industry (WWW 2019 Tutorial)
Privacy-preserving Data Mining in Industry (WWW 2019 Tutorial)
 
Customer Segmentation using Clustering
Customer Segmentation using ClusteringCustomer Segmentation using Clustering
Customer Segmentation using Clustering
 
Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021
Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021
Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021
 

Destaque

Building Creative Learning Environments
Building Creative  Learning EnvironmentsBuilding Creative  Learning Environments
Building Creative Learning EnvironmentsRakuten Group, Inc.
 
Deep learning for e-commerce: current status and future prospects
Deep learning for e-commerce: current status and future prospectsDeep learning for e-commerce: current status and future prospects
Deep learning for e-commerce: current status and future prospectsRakuten Group, Inc.
 
Artificial Intelligence for Happiness of People
Artificial Intelligence for Happiness of PeopleArtificial Intelligence for Happiness of People
Artificial Intelligence for Happiness of PeopleRakuten Group, Inc.
 
Meer libby an augumented hybrid app jeff sterling
Meer libby an augumented hybrid app jeff sterlingMeer libby an augumented hybrid app jeff sterling
Meer libby an augumented hybrid app jeff sterlingRakuten Group, Inc.
 
From the browser to the desktop with node js and electron cyril maurel
From the browser to the desktop with node js and electron cyril maurelFrom the browser to the desktop with node js and electron cyril maurel
From the browser to the desktop with node js and electron cyril maurelRakuten Group, Inc.
 
楽天テクノロジーカンファレンス2017 の見どころ 日本語版
楽天テクノロジーカンファレンス2017 の見どころ 日本語版楽天テクノロジーカンファレンス2017 の見どころ 日本語版
楽天テクノロジーカンファレンス2017 の見どころ 日本語版Rakuten Group, Inc.
 
はてなのインフラの歴史、そしてMackerelへ至る道とこれから
はてなのインフラの歴史、そしてMackerelへ至る道とこれから はてなのインフラの歴史、そしてMackerelへ至る道とこれから
はてなのインフラの歴史、そしてMackerelへ至る道とこれから Rakuten Group, Inc.
 
トラブルシューティングのあれこれ Yoshihiko kamata
トラブルシューティングのあれこれ Yoshihiko kamataトラブルシューティングのあれこれ Yoshihiko kamata
トラブルシューティングのあれこれ Yoshihiko kamataRakuten Group, Inc.
 
AI AND FUNDAMENTAL GAME TECHNOLOGIESIN FINAL FANTASY XV
AI AND FUNDAMENTAL GAME TECHNOLOGIESIN FINAL FANTASY XVAI AND FUNDAMENTAL GAME TECHNOLOGIESIN FINAL FANTASY XV
AI AND FUNDAMENTAL GAME TECHNOLOGIESIN FINAL FANTASY XVRakuten Group, Inc.
 
Value Delivery through RakutenBig Data Intelligence Ecosystem and Technology
Value Delivery through RakutenBig Data Intelligence Ecosystem  and  TechnologyValue Delivery through RakutenBig Data Intelligence Ecosystem  and  Technology
Value Delivery through RakutenBig Data Intelligence Ecosystem and TechnologyRakuten Group, Inc.
 
Rakuten Technology Conference 2017 A Distributed SQL Database For Data Analy...
Rakuten Technology Conference 2017 A Distributed SQL Database  For Data Analy...Rakuten Technology Conference 2017 A Distributed SQL Database  For Data Analy...
Rakuten Technology Conference 2017 A Distributed SQL Database For Data Analy...Rakuten Group, Inc.
 
Life of an enginner in rakuten osaka diarmaid lindsay
Life of an enginner in rakuten osaka diarmaid lindsayLife of an enginner in rakuten osaka diarmaid lindsay
Life of an enginner in rakuten osaka diarmaid lindsayRakuten Group, Inc.
 
Challenge for statup's cto from big company nagaaki hoshi
Challenge for statup's cto from big company nagaaki hoshiChallenge for statup's cto from big company nagaaki hoshi
Challenge for statup's cto from big company nagaaki hoshiRakuten Group, Inc.
 
AI based language learning tools
AI based language learning toolsAI based language learning tools
AI based language learning toolsRakuten Group, Inc.
 

Destaque (20)

Recurrent Neural Networks
Recurrent Neural NetworksRecurrent Neural Networks
Recurrent Neural Networks
 
RTC2017 Ethereum Workshop
RTC2017 Ethereum WorkshopRTC2017 Ethereum Workshop
RTC2017 Ethereum Workshop
 
SEE CHANGE.CHANGE THE WORLD.
SEE CHANGE.CHANGE THE WORLD.SEE CHANGE.CHANGE THE WORLD.
SEE CHANGE.CHANGE THE WORLD.
 
Building Creative Learning Environments
Building Creative  Learning EnvironmentsBuilding Creative  Learning Environments
Building Creative Learning Environments
 
Deep learning for e-commerce: current status and future prospects
Deep learning for e-commerce: current status and future prospectsDeep learning for e-commerce: current status and future prospects
Deep learning for e-commerce: current status and future prospects
 
Artificial Intelligence for Happiness of People
Artificial Intelligence for Happiness of PeopleArtificial Intelligence for Happiness of People
Artificial Intelligence for Happiness of People
 
Meer libby an augumented hybrid app jeff sterling
Meer libby an augumented hybrid app jeff sterlingMeer libby an augumented hybrid app jeff sterling
Meer libby an augumented hybrid app jeff sterling
 
Riemannian Geometry in Egison
Riemannian Geometry in EgisonRiemannian Geometry in Egison
Riemannian Geometry in Egison
 
From the browser to the desktop with node js and electron cyril maurel
From the browser to the desktop with node js and electron cyril maurelFrom the browser to the desktop with node js and electron cyril maurel
From the browser to the desktop with node js and electron cyril maurel
 
楽天テクノロジーカンファレンス2017 の見どころ 日本語版
楽天テクノロジーカンファレンス2017 の見どころ 日本語版楽天テクノロジーカンファレンス2017 の見どころ 日本語版
楽天テクノロジーカンファレンス2017 の見どころ 日本語版
 
はてなのインフラの歴史、そしてMackerelへ至る道とこれから
はてなのインフラの歴史、そしてMackerelへ至る道とこれから はてなのインフラの歴史、そしてMackerelへ至る道とこれから
はてなのインフラの歴史、そしてMackerelへ至る道とこれから
 
トラブルシューティングのあれこれ Yoshihiko kamata
トラブルシューティングのあれこれ Yoshihiko kamataトラブルシューティングのあれこれ Yoshihiko kamata
トラブルシューティングのあれこれ Yoshihiko kamata
 
AI AND FUNDAMENTAL GAME TECHNOLOGIESIN FINAL FANTASY XV
AI AND FUNDAMENTAL GAME TECHNOLOGIESIN FINAL FANTASY XVAI AND FUNDAMENTAL GAME TECHNOLOGIESIN FINAL FANTASY XV
AI AND FUNDAMENTAL GAME TECHNOLOGIESIN FINAL FANTASY XV
 
Value Delivery through RakutenBig Data Intelligence Ecosystem and Technology
Value Delivery through RakutenBig Data Intelligence Ecosystem  and  TechnologyValue Delivery through RakutenBig Data Intelligence Ecosystem  and  Technology
Value Delivery through RakutenBig Data Intelligence Ecosystem and Technology
 
Rakuten Technology Conference 2017 A Distributed SQL Database For Data Analy...
Rakuten Technology Conference 2017 A Distributed SQL Database  For Data Analy...Rakuten Technology Conference 2017 A Distributed SQL Database  For Data Analy...
Rakuten Technology Conference 2017 A Distributed SQL Database For Data Analy...
 
Don't manage too hard!
Don't manage too hard! Don't manage too hard!
Don't manage too hard!
 
Life of an enginner in rakuten osaka diarmaid lindsay
Life of an enginner in rakuten osaka diarmaid lindsayLife of an enginner in rakuten osaka diarmaid lindsay
Life of an enginner in rakuten osaka diarmaid lindsay
 
Challenge for statup's cto from big company nagaaki hoshi
Challenge for statup's cto from big company nagaaki hoshiChallenge for statup's cto from big company nagaaki hoshi
Challenge for statup's cto from big company nagaaki hoshi
 
AI based language learning tools
AI based language learning toolsAI based language learning tools
AI based language learning tools
 
COBOL to Apache Spark
COBOL to Apache SparkCOBOL to Apache Spark
COBOL to Apache Spark
 

Semelhante a Find it! Nail it! Boosting e-commerce search conversions with machine learning at scale

Google Analytics Premium for Better Data-Driven Decisions With Swapnil Sinha
Google Analytics Premium for Better Data-Driven Decisions With Swapnil SinhaGoogle Analytics Premium for Better Data-Driven Decisions With Swapnil Sinha
Google Analytics Premium for Better Data-Driven Decisions With Swapnil SinhaTatvic Analytics
 
Power Up Competitive Price Intelligence with Web Data
Power Up Competitive Price Intelligence with Web DataPower Up Competitive Price Intelligence with Web Data
Power Up Competitive Price Intelligence with Web DataConnotate
 
Power Up Your Competitive Price Intelligence With Web Data
Power Up Your Competitive Price Intelligence With Web DataPower Up Your Competitive Price Intelligence With Web Data
Power Up Your Competitive Price Intelligence With Web DataConnotate
 
Steve Forbes: Empowering Marketing
Steve Forbes: Empowering MarketingSteve Forbes: Empowering Marketing
Steve Forbes: Empowering MarketingBMA Carolinas
 
Liberating data power of APIs
Liberating data power of APIsLiberating data power of APIs
Liberating data power of APIsBala Iyer
 
Converting Data-Driven Insights Into Revenue
Converting Data-Driven Insights Into RevenueConverting Data-Driven Insights Into Revenue
Converting Data-Driven Insights Into RevenueTechTarget
 
HacktoberFestPune - DSC MESCOE x DSC PVGCOET
HacktoberFestPune - DSC MESCOE x DSC PVGCOETHacktoberFestPune - DSC MESCOE x DSC PVGCOET
HacktoberFestPune - DSC MESCOE x DSC PVGCOETTanyaRaina3
 
Webanalytics with Microsoft BI
Webanalytics with Microsoft BIWebanalytics with Microsoft BI
Webanalytics with Microsoft BITillmann Eitelberg
 
Google Analytics Training - full 2017
Google Analytics Training - full 2017Google Analytics Training - full 2017
Google Analytics Training - full 2017Nate Plaunt
 
Clicks, Conversions and Crawls
Clicks, Conversions and CrawlsClicks, Conversions and Crawls
Clicks, Conversions and CrawlsMichelle Robbins
 
230286802015PPT.pptx
230286802015PPT.pptx230286802015PPT.pptx
230286802015PPT.pptxannalakshmi35
 
Why Human Annotated Data Matters for Search - Grant Ingersoll, Lucidworks & K...
Why Human Annotated Data Matters for Search - Grant Ingersoll, Lucidworks & K...Why Human Annotated Data Matters for Search - Grant Ingersoll, Lucidworks & K...
Why Human Annotated Data Matters for Search - Grant Ingersoll, Lucidworks & K...Lucidworks
 
Digital analytics lecture1
Digital analytics lecture1Digital analytics lecture1
Digital analytics lecture1Joni Salminen
 
Dataiku tatvic webinar presentation
Dataiku tatvic webinar presentationDataiku tatvic webinar presentation
Dataiku tatvic webinar presentationTatvic Analytics
 
Web Analytics 2.0 and Multiplicity - PixelMEDIA
Web Analytics 2.0 and Multiplicity - PixelMEDIAWeb Analytics 2.0 and Multiplicity - PixelMEDIA
Web Analytics 2.0 and Multiplicity - PixelMEDIAPixelMEDIA
 
Making your analytics talk business | Big Data Demystified
Making your analytics talk business | Big Data DemystifiedMaking your analytics talk business | Big Data Demystified
Making your analytics talk business | Big Data DemystifiedOmid Vahdaty
 
Decision Making with Data by PayPal Sr Product Manager
Decision Making with Data by PayPal Sr Product ManagerDecision Making with Data by PayPal Sr Product Manager
Decision Making with Data by PayPal Sr Product ManagerProduct School
 

Semelhante a Find it! Nail it! Boosting e-commerce search conversions with machine learning at scale (20)

Google Analytics Premium for Better Data-Driven Decisions With Swapnil Sinha
Google Analytics Premium for Better Data-Driven Decisions With Swapnil SinhaGoogle Analytics Premium for Better Data-Driven Decisions With Swapnil Sinha
Google Analytics Premium for Better Data-Driven Decisions With Swapnil Sinha
 
Power Up Competitive Price Intelligence with Web Data
Power Up Competitive Price Intelligence with Web DataPower Up Competitive Price Intelligence with Web Data
Power Up Competitive Price Intelligence with Web Data
 
Power Up Your Competitive Price Intelligence With Web Data
Power Up Your Competitive Price Intelligence With Web DataPower Up Your Competitive Price Intelligence With Web Data
Power Up Your Competitive Price Intelligence With Web Data
 
Steve Forbes: Empowering Marketing
Steve Forbes: Empowering MarketingSteve Forbes: Empowering Marketing
Steve Forbes: Empowering Marketing
 
Liberating data power of APIs
Liberating data power of APIsLiberating data power of APIs
Liberating data power of APIs
 
Converting Data-Driven Insights Into Revenue
Converting Data-Driven Insights Into RevenueConverting Data-Driven Insights Into Revenue
Converting Data-Driven Insights Into Revenue
 
HacktoberFestPune - DSC MESCOE x DSC PVGCOET
HacktoberFestPune - DSC MESCOE x DSC PVGCOETHacktoberFestPune - DSC MESCOE x DSC PVGCOET
HacktoberFestPune - DSC MESCOE x DSC PVGCOET
 
Webanalytics with Microsoft BI
Webanalytics with Microsoft BIWebanalytics with Microsoft BI
Webanalytics with Microsoft BI
 
Google Analytics Training - full 2017
Google Analytics Training - full 2017Google Analytics Training - full 2017
Google Analytics Training - full 2017
 
Analytics in Action - UPCEA Central
Analytics in Action - UPCEA CentralAnalytics in Action - UPCEA Central
Analytics in Action - UPCEA Central
 
uae views on big data
  uae views on  big data  uae views on  big data
uae views on big data
 
Clicks, Conversions and Crawls
Clicks, Conversions and CrawlsClicks, Conversions and Crawls
Clicks, Conversions and Crawls
 
230286802015PPT.pptx
230286802015PPT.pptx230286802015PPT.pptx
230286802015PPT.pptx
 
Metrics that Matter
Metrics that MatterMetrics that Matter
Metrics that Matter
 
Why Human Annotated Data Matters for Search - Grant Ingersoll, Lucidworks & K...
Why Human Annotated Data Matters for Search - Grant Ingersoll, Lucidworks & K...Why Human Annotated Data Matters for Search - Grant Ingersoll, Lucidworks & K...
Why Human Annotated Data Matters for Search - Grant Ingersoll, Lucidworks & K...
 
Digital analytics lecture1
Digital analytics lecture1Digital analytics lecture1
Digital analytics lecture1
 
Dataiku tatvic webinar presentation
Dataiku tatvic webinar presentationDataiku tatvic webinar presentation
Dataiku tatvic webinar presentation
 
Web Analytics 2.0 and Multiplicity - PixelMEDIA
Web Analytics 2.0 and Multiplicity - PixelMEDIAWeb Analytics 2.0 and Multiplicity - PixelMEDIA
Web Analytics 2.0 and Multiplicity - PixelMEDIA
 
Making your analytics talk business | Big Data Demystified
Making your analytics talk business | Big Data DemystifiedMaking your analytics talk business | Big Data Demystified
Making your analytics talk business | Big Data Demystified
 
Decision Making with Data by PayPal Sr Product Manager
Decision Making with Data by PayPal Sr Product ManagerDecision Making with Data by PayPal Sr Product Manager
Decision Making with Data by PayPal Sr Product Manager
 

Mais de Rakuten Group, Inc.

コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話Rakuten Group, Inc.
 
楽天における安全な秘匿情報管理への道のり
楽天における安全な秘匿情報管理への道のり楽天における安全な秘匿情報管理への道のり
楽天における安全な秘匿情報管理への道のりRakuten Group, Inc.
 
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...Rakuten Group, Inc.
 
DataSkillCultureを浸透させる楽天の取り組み
DataSkillCultureを浸透させる楽天の取り組みDataSkillCultureを浸透させる楽天の取り組み
DataSkillCultureを浸透させる楽天の取り組みRakuten Group, Inc.
 
大規模なリアルタイム監視の導入と展開
大規模なリアルタイム監視の導入と展開大規模なリアルタイム監視の導入と展開
大規模なリアルタイム監視の導入と展開Rakuten Group, Inc.
 
楽天における大規模データベースの運用
楽天における大規模データベースの運用楽天における大規模データベースの運用
楽天における大規模データベースの運用Rakuten Group, Inc.
 
楽天サービスを支えるネットワークインフラストラクチャー
楽天サービスを支えるネットワークインフラストラクチャー楽天サービスを支えるネットワークインフラストラクチャー
楽天サービスを支えるネットワークインフラストラクチャーRakuten Group, Inc.
 
楽天の規模とクラウドプラットフォーム統括部の役割
楽天の規模とクラウドプラットフォーム統括部の役割楽天の規模とクラウドプラットフォーム統括部の役割
楽天の規模とクラウドプラットフォーム統括部の役割Rakuten Group, Inc.
 
Rakuten Services and Infrastructure Team.pdf
Rakuten Services and Infrastructure Team.pdfRakuten Services and Infrastructure Team.pdf
Rakuten Services and Infrastructure Team.pdfRakuten Group, Inc.
 
The Data Platform Administration Handling the 100 PB.pdf
The Data Platform Administration Handling the 100 PB.pdfThe Data Platform Administration Handling the 100 PB.pdf
The Data Platform Administration Handling the 100 PB.pdfRakuten Group, Inc.
 
Supporting Internal Customers as Technical Account Managers.pdf
Supporting Internal Customers as Technical Account Managers.pdfSupporting Internal Customers as Technical Account Managers.pdf
Supporting Internal Customers as Technical Account Managers.pdfRakuten Group, Inc.
 
Making Cloud Native CI_CD Services.pdf
Making Cloud Native CI_CD Services.pdfMaking Cloud Native CI_CD Services.pdf
Making Cloud Native CI_CD Services.pdfRakuten Group, Inc.
 
How We Defined Our Own Cloud.pdf
How We Defined Our Own Cloud.pdfHow We Defined Our Own Cloud.pdf
How We Defined Our Own Cloud.pdfRakuten Group, Inc.
 
Travel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech infoTravel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech infoRakuten Group, Inc.
 
Travel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech infoTravel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech infoRakuten Group, Inc.
 
Introduction of GORA API Group technology
Introduction of GORA API Group technologyIntroduction of GORA API Group technology
Introduction of GORA API Group technologyRakuten Group, Inc.
 
100PBを越えるデータプラットフォームの実情
100PBを越えるデータプラットフォームの実情100PBを越えるデータプラットフォームの実情
100PBを越えるデータプラットフォームの実情Rakuten Group, Inc.
 
社内エンジニアを支えるテクニカルアカウントマネージャー
社内エンジニアを支えるテクニカルアカウントマネージャー社内エンジニアを支えるテクニカルアカウントマネージャー
社内エンジニアを支えるテクニカルアカウントマネージャーRakuten Group, Inc.
 

Mais de Rakuten Group, Inc. (20)

コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
 
楽天における安全な秘匿情報管理への道のり
楽天における安全な秘匿情報管理への道のり楽天における安全な秘匿情報管理への道のり
楽天における安全な秘匿情報管理への道のり
 
What Makes Software Green?
What Makes Software Green?What Makes Software Green?
What Makes Software Green?
 
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
 
DataSkillCultureを浸透させる楽天の取り組み
DataSkillCultureを浸透させる楽天の取り組みDataSkillCultureを浸透させる楽天の取り組み
DataSkillCultureを浸透させる楽天の取り組み
 
大規模なリアルタイム監視の導入と展開
大規模なリアルタイム監視の導入と展開大規模なリアルタイム監視の導入と展開
大規模なリアルタイム監視の導入と展開
 
楽天における大規模データベースの運用
楽天における大規模データベースの運用楽天における大規模データベースの運用
楽天における大規模データベースの運用
 
楽天サービスを支えるネットワークインフラストラクチャー
楽天サービスを支えるネットワークインフラストラクチャー楽天サービスを支えるネットワークインフラストラクチャー
楽天サービスを支えるネットワークインフラストラクチャー
 
楽天の規模とクラウドプラットフォーム統括部の役割
楽天の規模とクラウドプラットフォーム統括部の役割楽天の規模とクラウドプラットフォーム統括部の役割
楽天の規模とクラウドプラットフォーム統括部の役割
 
Rakuten Services and Infrastructure Team.pdf
Rakuten Services and Infrastructure Team.pdfRakuten Services and Infrastructure Team.pdf
Rakuten Services and Infrastructure Team.pdf
 
The Data Platform Administration Handling the 100 PB.pdf
The Data Platform Administration Handling the 100 PB.pdfThe Data Platform Administration Handling the 100 PB.pdf
The Data Platform Administration Handling the 100 PB.pdf
 
Supporting Internal Customers as Technical Account Managers.pdf
Supporting Internal Customers as Technical Account Managers.pdfSupporting Internal Customers as Technical Account Managers.pdf
Supporting Internal Customers as Technical Account Managers.pdf
 
Making Cloud Native CI_CD Services.pdf
Making Cloud Native CI_CD Services.pdfMaking Cloud Native CI_CD Services.pdf
Making Cloud Native CI_CD Services.pdf
 
How We Defined Our Own Cloud.pdf
How We Defined Our Own Cloud.pdfHow We Defined Our Own Cloud.pdf
How We Defined Our Own Cloud.pdf
 
Travel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech infoTravel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech info
 
Travel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech infoTravel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech info
 
OWASPTop10_Introduction
OWASPTop10_IntroductionOWASPTop10_Introduction
OWASPTop10_Introduction
 
Introduction of GORA API Group technology
Introduction of GORA API Group technologyIntroduction of GORA API Group technology
Introduction of GORA API Group technology
 
100PBを越えるデータプラットフォームの実情
100PBを越えるデータプラットフォームの実情100PBを越えるデータプラットフォームの実情
100PBを越えるデータプラットフォームの実情
 
社内エンジニアを支えるテクニカルアカウントマネージャー
社内エンジニアを支えるテクニカルアカウントマネージャー社内エンジニアを支えるテクニカルアカウントマネージャー
社内エンジニアを支えるテクニカルアカウントマネージャー
 

Último

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 

Último (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 

Find it! Nail it! Boosting e-commerce search conversions with machine learning at scale

  • 1. October 28, 2017 Giuseppe “Pino” Di Fabbrizio Rakuten Institute of Technology – Boston
  • 2.
  • 3. 3 • Motivations • Traditional information retrieval models • Learning-to-rank models • Relevance • Ranking Metrics • Algorithms • Ranking optimization • Use cases • Summary • What is next? Disclaimer: If not otherwise specified, images in this presentation comply with the (CC) creative commons publishing license
  • 4. 4 • E-commerce growing faster than traditional brick-and-mortar market ($4.06T by 2020) • Mobile shopping adoption increasing worldwide (46% shoppers in Asia and 28% in North America) • Online catalogs offering broader selections and competitive products • Electronic money transactions gaining more consumers’ trust • Massive data collected during web and mobile interactions providing foundation for machine learning- driven optimizations 1.61B Shoppers $1.86T Sales $150B* Revenues ML *2016 Combined revenues for Amazon, Otto Group, and Rakuten https://www.statista.com/topics/871/online-shopping/
  • 5. 5
  • 7. 7 How do we find the most relevant products for a search query? www.rakuten.com Oct 10, 2017
  • 9. 9 • Relevance is estimated by lexical matches of query terms with document terms • Examples: • Boolean models • Vector space models • Latent semantic indexing • Okapi BM25 Index Indexer Query Documents Scoring model Top-n retrieved documents On-line Off-line
  • 10. 10 www.rakuten.com Oct 10, 2017 Query (Q) Document 1 (D1) Document 2 (D2) iphone 7 case iphone 7 Case Q 1 1 1 D1 2 2 2 D2 3 1 0 Q D1 D2
  • 11. 11 • Basic ideas • Lexical similarity metrics • Penalizing repeated occurrences of the same term • Penalizing term frequency for longer documents • Only few features • Manually hand-tuned feature weights based on heuristic • Cannot include important search signals such as user’s feedback, product popularity, purchase history, etc. • Fast and scalable
  • 12. 12 • Data-driven approach • Directly optimize products rank based on relevance (different from classification and regression ML tasks) • Handle thousands of features • Robust to noisy data • Handle personalization • Industry & research state-of-the-art (Amazon, eBay, Microsoft, Yahoo!, Yandex, etc.)
  • 13. 13 A document is relevant if contains the information the user was looking for when submitted the query Relevance is subjective and depends on many factors: • context (what is displayed and how) • task (purchase, search info, answer, etc.) • novelty (unexpected data, ads, ext.) • time and user’s effort involved
  • 16. 16 • Clickthrough data (user’s implicit feedback) as source of relevance for search query / document pairs • Pros • Abundant and easy to harvest • Always fresh • Unbiased • Cons • Noisy • Long tail queries • Simple relevance mapping: • score = 0 (not relevant), score = 3 (highly relevant) • Purchase > cart > click > impression Score User’s implicit feedback 3 Product purchased 2 Product added to the shopping cart 1 Product clicked 0 No clicks
  • 18. 18 Documents Normalized and Discounted Cumulative Gain 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 2 3 4 5 6 7 8 9 10 NDCG
  • 19. 19 • Tree ensemble method • Handle sparse data • Handle missing values and various value types • Robust to outliers • Learn higher-order feature interactions • Invariant to feature scaling • Highly scalable and optimized open source implementation (XGBoost)
  • 20. 20 Point-wise • Input: single documents / Output: class labels or scores • Classify each document as relevant or non-relevant. • Adjust w to reduce classification errors Pairwise ranking • Input: document pairs / Output: partial order preferences • Classify pairs of documents – D1 > D2? • Adjust w to reduce discordant pairs List-wise ranking • Input: document collections / ranked document list • Score permutations -- Is {D1,D2,…} > {D1’,D2’,…} ? • Adjust w to directly maximize ranking measure of interest (NDCG) Di Q Q DjDi > Q DjDi > Dk>
  • 21. 21 Green = relevant Gray = not-relevant Blue arrows = boost for pair-wise loss function Red arrows = boost for list-wise loss function (a) is the perfect ranking; (b) is ranking with 10 pairwise errors; (c) is ranking with 8 pairwise errors
  • 22. 22 • Relevance: User’s behavior signals • Ranking Metrics: NDCG • Machine Learning Algorithm: Gradient Tree Boosting • Ranking optimization: List-wise with NDCG metrics
  • 25. 25 Search Query: “40inch tv” Regular text search Search with user’s signals and learning-to-rank models Not relevant Not relevant Not relevant
  • 26. 26 Conversion Rate (Simulation) NDCG CTR Simulated Queries Relative gain 15.58% 7.50% 10,000 Depth / Estimators 5 / 500 3 / 500 10 / 500 3 / 500 NDCG 0.687 0.688 0.685 0.689 Relative gain 15.14% 15.41% 14.92% 15.58% Training time (56 cores) 2:45:48 1:20:57 35:25:44 1:58:07
  • 28. 28 Bhaskar Mitra, Fernando Diaz, and Nick Craswell. 2017. Learning to Match using Local and Distributed Representations of Text for Web Search. In Proceedings of the 26th International Conference on World Wide Web (WWW '17).
  • 29. 29 Bhaskar Mitra, Fernando Diaz, and Nick Craswell. 2017. Learning to Match using Local and Distributed Representations of Text for Web Search. In Proceedings of the 26th International Conference on World Wide Web (WWW '17).
  • 30. 30 Bhaskar Mitra, Fernando Diaz, and Nick Craswell. 2017. Learning to Match using Local and Distributed Representations of Text for Web Search. In Proceedings of the 26th International Conference on World Wide Web (WWW '17).
  • 31. 31 • Traditional IR methods do not scale to modern e-commerce needs • User’s implicit feedback is a proxy for search query / document pairs relevance • Learning-to-rank (LTR) methods scale to thousand of features and are robust to data noise • LTR with listwise-based loss function substantially improve search relevance (15.6% NDCG increase on e-commerce data) • NDCG improvements directly correlate to conversion rates (7.5% CTR increase on e-commerce data) • DNN methods for IR are starting to outperform traditional ML methods