SlideShare uma empresa Scribd logo
1 de 23
Deep learning:
the future of recommendations
Balázs Hidasi
Head of Data Mining and Research
Gravity meetup @ Startup Safary
April 21, 2016
Deep learning in the headlines
Deep learning in the background
• Life improving services
 Speech recognition
 Personal assistants (e.g. Siri,
Cortana)
 Computer vision, object
recognition
 Machine translation
 Chatbot technology
 Natural Language Processing
 Face recognition
 Self driving cars
• For fun
 Text generation
 Composing music
 Painting pictures
 Etc.
What is deep learning?
• A class of machine learning algorithms
 that use a cascade of multiple non-linear processing layers
 and complex model structures
 to learn different representations of the data in each layer
 where higher level features are derived from lower level
features
 to form a hierarchical representation.
Deep learning is not a new topic
• First deep network proposed in the 1970s
• More papers in the 80s and 90s
• Why now?
 Older research was not used widely in practice
 Applications were much more simplistic that today’s
Neural networks: a brief overview
Neurons, neural networks
• Neuron: rough abstraction of the human neuron
 Receives inputs (signals)
 Sum weighted inputs is big enough  signal
 Amplifiers and inhibitors
 Basic pattern recognition
• Neural network: neurons connected to one another
• Feedforward networks: neurons are organized into
layers
 Connections only between subsequent layers
𝑦
𝑥1
𝑥2
𝑥3
𝑥4
𝑓(. )
𝑖=1
𝑁
𝑤𝑖 𝑥𝑖 + 𝑏
𝑥1
𝑥2
𝑥3
ℎ1
1
ℎ2
1
ℎ3
1
ℎ1
2
ℎ2
2
Networks that big enough: go deep not wide
• Feedforward neural networks are universal
approximators
 Can imitate any function if they are big enough
 (Also needs enough in-out pairs to learn)
• What is big enough?
 Number of layers / neurons
 Theoretical „big enough” conditions massively overshoot
• Go deep, not wide
 The number of neurons required for good approximation is
polynomial in the input if the network is deep enough
 Otherwise it is exponential
Training neural networks
• Forward pass: get the current estimate of the target
o 𝑠𝑗
1
= 𝑖 𝑤𝑖,𝑗
1
𝑥𝑖 + 𝑏𝑗
1
; ℎ𝑗
1
= 𝑓 𝑠𝑗
1
o 𝑠 𝑘
2
= 𝑗 𝑤𝑗,𝑘
2
ℎ𝑗
1
+ 𝑏 𝑘
2
; ℎ 𝑘
2
= 𝑓 𝑠 𝑘
2
o …
o 𝑠𝑙
𝑂
= 𝑘 𝑤 𝑘,𝑙
𝑁+1
ℎ 𝑘
𝑁
+ 𝑏𝑙
𝑂
; 𝑦𝑙 = 𝑓 𝑠𝑙
𝑂
• Backward pass: correct weights to reduce error
 Gradient descentLayer Error Gradient
(w.r.t. weights between current and prev. layer)
Output Defined loss
(e.g. 𝐿 = 𝑖=1
𝑁 𝑜
𝑦𝑖 − 𝑦𝑖
2
)
𝜕𝐿
𝜕𝑤𝑗,𝑖
(𝑁+1)
=
𝜕𝐿
𝜕𝑦𝑖
∗
𝜕𝑦𝑖
𝜕𝑠𝑖
𝑂 ∗
𝜕𝑠𝑖
𝑂
𝜕𝑤𝑗,𝑖
𝑁+1 =
𝜕𝐿
𝜕𝑦𝑖
𝑓′
𝑠𝑖
𝑂
ℎ𝑗
𝑁
𝑁 𝑡ℎ
hidden
𝛿𝑖
𝑁
=
𝜕𝐿
𝜕𝑦𝑖
∗
𝜕𝑦𝑖
𝜕𝑠𝑖
𝑂
𝜕𝐿
𝜕𝑤 𝑘,𝑗
𝑁 =
𝑖
𝜕𝐿
𝜕𝑦𝑖
∗
𝜕𝑦𝑖
𝜕𝑠𝑖
𝑂 ∗
𝜕𝑠𝑖
𝑂
𝜕ℎ𝑗
𝑁 ∗
𝜕ℎ𝑗
𝑁
𝜕𝑠𝑗
𝑁 ∗
𝜕𝑠𝑗
𝑁
𝜕𝑤𝑗,𝑖
𝑁 =
𝑖
𝛿𝑖
𝑁
𝑤𝑖,𝑗
𝑁+1
𝑓′ 𝑠𝑗
𝑁
ℎ 𝑘
𝑁−1
(𝑁 −
𝛿𝑗
𝑁−1
=
𝑖
𝛿𝑖
𝑁
𝑤𝑖,𝑗
𝑁+1
𝑓′ 𝑠𝑗
𝑁
𝜕𝐿
𝜕𝑤𝑙,𝑘
𝑁−1 =
𝑗
𝛿𝑗
𝑁−1
𝑤𝑗,𝑘
𝑁
𝑓′ 𝑠 𝑘
𝑁−1
ℎ𝑙
𝑁−2
…
1 𝑠𝑡
hidden
𝛿 𝑘
1 𝜕𝐿
𝜕𝑤𝑖,𝑗
1 =
𝑘
𝛿 𝑘
1
𝑤 𝑘,𝑙
2
𝑓′
𝑠𝑗
1
𝑥𝑖
Challenges of training deep networks
• Saturation
• Vanishing gradients
• Overfitting
• Slowness of second order methods
• Slow convergence, stucks in local optima with first
order methods
• (Exploding gradients)
Why now?
Breakthroughs in research
• Saturation & vanishing gradients
 Layer-by-layer training (2006)
 Non-saturating activation functions, e.g. ReLU (2013)
• Overfitting
 Dropout (2014)
• Convergence problems
 Adagrad, Adadelta, Adam, RMSProp, etc.
Computational power
• Natural increase in computational power
• GP GPU technology
Intermission
Don’t give in to the HYPE
• Deep learning is impressive but
 deep learning is not true AI
o it may be a component of it when
and if AI is created
 deep learning is not how the human
brain works
 95% of machine learning tasks don’t
require deep learning
 deep learning requires a lot of
computational power
• Deep learning is a tool
 which is successful in certain,
previously very challenging domains
(speech recognition, computer
vision, NLP, etc.)
 that excels in pattern recognition
You are here
Deep learning for RecSys
From the Netflix prize...
• Netflix prize (2006-2009)
 Gave a huge push to recommender systems research
 Determined the direction of research for years
 Task:
o Some (User, Item, Rating) known triplets
o (User, Item) pairs with unknown rating
o Predict the missing ratings (1-5)
... to recommenders in practice
• Ratings  events [implicit feedback]
 Lots of services don’t allow for rating
 Majority of users don’t rate
 Monitored passively  preferences have to be infered
• Rating prediction  ranking [top N recommendations]
 All that matters is the relevancy of the top N items
 Rating prediction is biased
• User  session / situation [session-based / context-driven
recommendation]
 Users are not logged in, identification is unreliable
 Accounts used by multiple users
 Aim of the session (e.g. buy a good laptop)
 Similar behavior of different users in a situation, different behavior of the same
user in different situations
Challenges in RecSys
• Session modeling
 Most of the algorithms are personalized
 A few are item-to-item
o Recommends similar items
o Also used for session-based recommendations (industry de facto standard)
 There are no good session based solutions
• Incorporating factors that influence user clicks
 Users click based on what they see
o Title
o Product image
o Description
 and on their knowledge of the product
o Usually harder to model
o Except when the product is content (e.g. music)
Deep learning to the rescue – Session modeling
• Recurrent Neural Networks (RNN)
 Sequence modeling
 Hidden state: next state is based on the previous hidden state and the current input
 „Infinite” depth
 More sophisticated versions: GRU, LSTM
• Needs to be adapted to the recommendation task
• GRU4Rec:
 Session-parallel minibatch training for handling the large variance in session lengths
 Sampling the output for reasonable training times, without losing much accuracy
 Ranking loss for better item ranking
• Results: 15-30% improvement over item-to-item recommendations
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
RSC15 VIDEO
Recall@20
Item-kNN
GRU4Rec
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
RSC15 VIDEO
MRR@20
Item-kNN
GRU4Rec
Other uses of deep learning for recsys
• Incorporating content directly
 Music, images, video, text
 User influencing aspects of the items
 Direct content representation
• Context-state modeling from sensory data
 IoT devices
 Lot of sensory data
 Some missing and noise
 Infer context state and recommend accordingly
• Interactive recommenders using chatbots
• Personalized content generation
 Today’s news
 Images in personalized style with personalized content
• Etc...
There is work to be done
• DL + RecSys research: just started
 Last year:
o 0 long papers, 1 short paper and 1 poster that is loosely connected
 This year:
o 10+ submissions to RecSys in this topic
o DLRS 2016 workshop @ RecSys
• Open questions
 (More) Application areas
 Adaptations required for the recsys problem
 Scalability
 Best practices
 ...
Thanks for your attention!

Mais conteúdo relacionado

Mais procurados

Word embeddings, RNN, GRU and LSTM
Word embeddings, RNN, GRU and LSTMWord embeddings, RNN, GRU and LSTM
Word embeddings, RNN, GRU and LSTMDivya Gera
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks남주 김
 
Deep learning based recommender systems (lab seminar paper review)
Deep learning based recommender systems (lab seminar paper review)Deep learning based recommender systems (lab seminar paper review)
Deep learning based recommender systems (lab seminar paper review)hyunsung lee
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf
 
Deep Reinforcement Learning
Deep Reinforcement LearningDeep Reinforcement Learning
Deep Reinforcement LearningUsman Qayyum
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systemsNAVER Engineering
 
[GAN by Hung-yi Lee]Part 1: General introduction of GAN
[GAN by Hung-yi Lee]Part 1: General introduction of GAN[GAN by Hung-yi Lee]Part 1: General introduction of GAN
[GAN by Hung-yi Lee]Part 1: General introduction of GANNAVER Engineering
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...Simplilearn
 
The Art of Social Media Analysis with Twitter & Python
The Art of Social Media Analysis with Twitter & PythonThe Art of Social Media Analysis with Twitter & Python
The Art of Social Media Analysis with Twitter & PythonKrishna Sankar
 
Deep Learning: Application & Opportunity
Deep Learning: Application & OpportunityDeep Learning: Application & Opportunity
Deep Learning: Application & OpportunityiTrain
 
Talk@rmit 09112017
Talk@rmit 09112017Talk@rmit 09112017
Talk@rmit 09112017Shuai Zhang
 
Deep Learning Explained
Deep Learning ExplainedDeep Learning Explained
Deep Learning ExplainedMelanie Swan
 
Unsupervised learning (clustering)
Unsupervised learning (clustering)Unsupervised learning (clustering)
Unsupervised learning (clustering)Pravinkumar Landge
 
System design for recommendations and search
System design for recommendations and searchSystem design for recommendations and search
System design for recommendations and searchEugene Yan Ziyou
 
2.2 decision tree
2.2 decision tree2.2 decision tree
2.2 decision treeKrish_ver2
 
Faster R-CNN - PR012
Faster R-CNN - PR012Faster R-CNN - PR012
Faster R-CNN - PR012Jinwon Lee
 

Mais procurados (20)

Word embeddings, RNN, GRU and LSTM
Word embeddings, RNN, GRU and LSTMWord embeddings, RNN, GRU and LSTM
Word embeddings, RNN, GRU and LSTM
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
 
Deep learning based recommender systems (lab seminar paper review)
Deep learning based recommender systems (lab seminar paper review)Deep learning based recommender systems (lab seminar paper review)
Deep learning based recommender systems (lab seminar paper review)
 
Rnn & Lstm
Rnn & LstmRnn & Lstm
Rnn & Lstm
 
LSTM Basics
LSTM BasicsLSTM Basics
LSTM Basics
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Deep Reinforcement Learning
Deep Reinforcement LearningDeep Reinforcement Learning
Deep Reinforcement Learning
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systems
 
[GAN by Hung-yi Lee]Part 1: General introduction of GAN
[GAN by Hung-yi Lee]Part 1: General introduction of GAN[GAN by Hung-yi Lee]Part 1: General introduction of GAN
[GAN by Hung-yi Lee]Part 1: General introduction of GAN
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
 
The Art of Social Media Analysis with Twitter & Python
The Art of Social Media Analysis with Twitter & PythonThe Art of Social Media Analysis with Twitter & Python
The Art of Social Media Analysis with Twitter & Python
 
Deep Learning: Application & Opportunity
Deep Learning: Application & OpportunityDeep Learning: Application & Opportunity
Deep Learning: Application & Opportunity
 
Talk@rmit 09112017
Talk@rmit 09112017Talk@rmit 09112017
Talk@rmit 09112017
 
Deep Learning Explained
Deep Learning ExplainedDeep Learning Explained
Deep Learning Explained
 
Recurrent Neural Networks
Recurrent Neural NetworksRecurrent Neural Networks
Recurrent Neural Networks
 
Unsupervised learning (clustering)
Unsupervised learning (clustering)Unsupervised learning (clustering)
Unsupervised learning (clustering)
 
System design for recommendations and search
System design for recommendations and searchSystem design for recommendations and search
System design for recommendations and search
 
2.2 decision tree
2.2 decision tree2.2 decision tree
2.2 decision tree
 
Autoencoder
AutoencoderAutoencoder
Autoencoder
 
Faster R-CNN - PR012
Faster R-CNN - PR012Faster R-CNN - PR012
Faster R-CNN - PR012
 

Semelhante a Deep learning: the future of recommendations

Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...Balázs Hidasi
 
Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017Balázs Hidasi
 
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
MDEC Data Matters Series: machine learning and Deep Learning, A PrimerMDEC Data Matters Series: machine learning and Deep Learning, A Primer
MDEC Data Matters Series: machine learning and Deep Learning, A PrimerPoo Kuan Hoong
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learningAmr Rashed
 
Big Data Malaysia - A Primer on Deep Learning
Big Data Malaysia - A Primer on Deep LearningBig Data Malaysia - A Primer on Deep Learning
Big Data Malaysia - A Primer on Deep LearningPoo Kuan Hoong
 
An Introduction to Deep Learning
An Introduction to Deep LearningAn Introduction to Deep Learning
An Introduction to Deep LearningPoo Kuan Hoong
 
Deep learning tutorial 9/2019
Deep learning tutorial 9/2019Deep learning tutorial 9/2019
Deep learning tutorial 9/2019Amr Rashed
 
Deep Learning Tutorial
Deep Learning TutorialDeep Learning Tutorial
Deep Learning TutorialAmr Rashed
 
Deep learning introduction
Deep learning introductionDeep learning introduction
Deep learning introductionAdwait Bhave
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101Felipe Prado
 
Deep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersDeep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersRoelof Pieters
 
Using Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical ApplicationsUsing Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical ApplicationsGreg Makowski
 
Visualization of Deep Learning
Visualization of Deep LearningVisualization of Deep Learning
Visualization of Deep LearningYaminiAlapati1
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learningdoppenhe
 
Deep Learning: a birds eye view
Deep Learning: a birds eye viewDeep Learning: a birds eye view
Deep Learning: a birds eye viewRoelof Pieters
 

Semelhante a Deep learning: the future of recommendations (20)

Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...
 
Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017
 
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
MDEC Data Matters Series: machine learning and Deep Learning, A PrimerMDEC Data Matters Series: machine learning and Deep Learning, A Primer
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
 
Deep learning - a primer
Deep learning - a primerDeep learning - a primer
Deep learning - a primer
 
Deep learning - a primer
Deep learning - a primerDeep learning - a primer
Deep learning - a primer
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
Big Data Malaysia - A Primer on Deep Learning
Big Data Malaysia - A Primer on Deep LearningBig Data Malaysia - A Primer on Deep Learning
Big Data Malaysia - A Primer on Deep Learning
 
An Introduction to Deep Learning
An Introduction to Deep LearningAn Introduction to Deep Learning
An Introduction to Deep Learning
 
Deep learning tutorial 9/2019
Deep learning tutorial 9/2019Deep learning tutorial 9/2019
Deep learning tutorial 9/2019
 
Deep Learning Tutorial
Deep Learning TutorialDeep Learning Tutorial
Deep Learning Tutorial
 
Deep learning introduction
Deep learning introductionDeep learning introduction
Deep learning introduction
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
Deep learning
Deep learningDeep learning
Deep learning
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101
 
Phx dl meetup
Phx dl meetupPhx dl meetup
Phx dl meetup
 
Deep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersDeep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ers
 
Using Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical ApplicationsUsing Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical Applications
 
Visualization of Deep Learning
Visualization of Deep LearningVisualization of Deep Learning
Visualization of Deep Learning
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
Deep Learning: a birds eye view
Deep Learning: a birds eye viewDeep Learning: a birds eye view
Deep Learning: a birds eye view
 

Mais de Balázs Hidasi

Egyedi termék kreatívok tömeges gyártása generatív AI segítségével
Egyedi termék kreatívok tömeges gyártása generatív AI segítségévelEgyedi termék kreatívok tömeges gyártása generatív AI segítségével
Egyedi termék kreatívok tömeges gyártása generatív AI segítségévelBalázs Hidasi
 
The Effect of Third Party Implementations on Reproducibility
The Effect of Third Party Implementations on ReproducibilityThe Effect of Third Party Implementations on Reproducibility
The Effect of Third Party Implementations on ReproducibilityBalázs Hidasi
 
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...Balázs Hidasi
 
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...Balázs Hidasi
 
Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...Balázs Hidasi
 
Context-aware preference modeling with factorization
Context-aware preference modeling with factorizationContext-aware preference modeling with factorization
Context-aware preference modeling with factorizationBalázs Hidasi
 
Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...Balázs Hidasi
 
Utilizing additional information in factorization methods (research overview,...
Utilizing additional information in factorization methods (research overview,...Utilizing additional information in factorization methods (research overview,...
Utilizing additional information in factorization methods (research overview,...Balázs Hidasi
 
Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...
Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...
Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...Balázs Hidasi
 
Context-aware similarities within the factorization framework (CaRR 2013 pres...
Context-aware similarities within the factorization framework (CaRR 2013 pres...Context-aware similarities within the factorization framework (CaRR 2013 pres...
Context-aware similarities within the factorization framework (CaRR 2013 pres...Balázs Hidasi
 
iTALS: implicit tensor factorization for context-aware recommendations (ECML/...
iTALS: implicit tensor factorization for context-aware recommendations (ECML/...iTALS: implicit tensor factorization for context-aware recommendations (ECML/...
iTALS: implicit tensor factorization for context-aware recommendations (ECML/...Balázs Hidasi
 
Initialization of matrix factorization (CaRR 2012 presentation)
Initialization of matrix factorization (CaRR 2012 presentation)Initialization of matrix factorization (CaRR 2012 presentation)
Initialization of matrix factorization (CaRR 2012 presentation)Balázs Hidasi
 
ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)
ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)
ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)Balázs Hidasi
 
ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)
ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)
ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)Balázs Hidasi
 
ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)
ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)
ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)Balázs Hidasi
 

Mais de Balázs Hidasi (15)

Egyedi termék kreatívok tömeges gyártása generatív AI segítségével
Egyedi termék kreatívok tömeges gyártása generatív AI segítségévelEgyedi termék kreatívok tömeges gyártása generatív AI segítségével
Egyedi termék kreatívok tömeges gyártása generatív AI segítségével
 
The Effect of Third Party Implementations on Reproducibility
The Effect of Third Party Implementations on ReproducibilityThe Effect of Third Party Implementations on Reproducibility
The Effect of Third Party Implementations on Reproducibility
 
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
 
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
 
Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...
 
Context-aware preference modeling with factorization
Context-aware preference modeling with factorizationContext-aware preference modeling with factorization
Context-aware preference modeling with factorization
 
Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...
 
Utilizing additional information in factorization methods (research overview,...
Utilizing additional information in factorization methods (research overview,...Utilizing additional information in factorization methods (research overview,...
Utilizing additional information in factorization methods (research overview,...
 
Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...
Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...
Az implicit ajánlási probléma és néhány megoldása (BME TMIT szeminárium előad...
 
Context-aware similarities within the factorization framework (CaRR 2013 pres...
Context-aware similarities within the factorization framework (CaRR 2013 pres...Context-aware similarities within the factorization framework (CaRR 2013 pres...
Context-aware similarities within the factorization framework (CaRR 2013 pres...
 
iTALS: implicit tensor factorization for context-aware recommendations (ECML/...
iTALS: implicit tensor factorization for context-aware recommendations (ECML/...iTALS: implicit tensor factorization for context-aware recommendations (ECML/...
iTALS: implicit tensor factorization for context-aware recommendations (ECML/...
 
Initialization of matrix factorization (CaRR 2012 presentation)
Initialization of matrix factorization (CaRR 2012 presentation)Initialization of matrix factorization (CaRR 2012 presentation)
Initialization of matrix factorization (CaRR 2012 presentation)
 
ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)
ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)
ShiftTree: model alapú idősor-osztályozó (VK 2009 előadás)
 
ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)
ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)
ShiftTree: model alapú idősor-osztályozó (ML@BP előadás, 2012)
 
ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)
ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)
ShiftTree: model based time series classifier (ECML/PKDD 2011 presentation)
 

Último

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 

Último (20)

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 

Deep learning: the future of recommendations

  • 1. Deep learning: the future of recommendations Balázs Hidasi Head of Data Mining and Research Gravity meetup @ Startup Safary April 21, 2016
  • 2. Deep learning in the headlines
  • 3. Deep learning in the background • Life improving services  Speech recognition  Personal assistants (e.g. Siri, Cortana)  Computer vision, object recognition  Machine translation  Chatbot technology  Natural Language Processing  Face recognition  Self driving cars • For fun  Text generation  Composing music  Painting pictures  Etc.
  • 4. What is deep learning? • A class of machine learning algorithms  that use a cascade of multiple non-linear processing layers  and complex model structures  to learn different representations of the data in each layer  where higher level features are derived from lower level features  to form a hierarchical representation.
  • 5. Deep learning is not a new topic • First deep network proposed in the 1970s • More papers in the 80s and 90s • Why now?  Older research was not used widely in practice  Applications were much more simplistic that today’s
  • 6. Neural networks: a brief overview
  • 7. Neurons, neural networks • Neuron: rough abstraction of the human neuron  Receives inputs (signals)  Sum weighted inputs is big enough  signal  Amplifiers and inhibitors  Basic pattern recognition • Neural network: neurons connected to one another • Feedforward networks: neurons are organized into layers  Connections only between subsequent layers 𝑦 𝑥1 𝑥2 𝑥3 𝑥4 𝑓(. ) 𝑖=1 𝑁 𝑤𝑖 𝑥𝑖 + 𝑏 𝑥1 𝑥2 𝑥3 ℎ1 1 ℎ2 1 ℎ3 1 ℎ1 2 ℎ2 2
  • 8. Networks that big enough: go deep not wide • Feedforward neural networks are universal approximators  Can imitate any function if they are big enough  (Also needs enough in-out pairs to learn) • What is big enough?  Number of layers / neurons  Theoretical „big enough” conditions massively overshoot • Go deep, not wide  The number of neurons required for good approximation is polynomial in the input if the network is deep enough  Otherwise it is exponential
  • 9. Training neural networks • Forward pass: get the current estimate of the target o 𝑠𝑗 1 = 𝑖 𝑤𝑖,𝑗 1 𝑥𝑖 + 𝑏𝑗 1 ; ℎ𝑗 1 = 𝑓 𝑠𝑗 1 o 𝑠 𝑘 2 = 𝑗 𝑤𝑗,𝑘 2 ℎ𝑗 1 + 𝑏 𝑘 2 ; ℎ 𝑘 2 = 𝑓 𝑠 𝑘 2 o … o 𝑠𝑙 𝑂 = 𝑘 𝑤 𝑘,𝑙 𝑁+1 ℎ 𝑘 𝑁 + 𝑏𝑙 𝑂 ; 𝑦𝑙 = 𝑓 𝑠𝑙 𝑂 • Backward pass: correct weights to reduce error  Gradient descentLayer Error Gradient (w.r.t. weights between current and prev. layer) Output Defined loss (e.g. 𝐿 = 𝑖=1 𝑁 𝑜 𝑦𝑖 − 𝑦𝑖 2 ) 𝜕𝐿 𝜕𝑤𝑗,𝑖 (𝑁+1) = 𝜕𝐿 𝜕𝑦𝑖 ∗ 𝜕𝑦𝑖 𝜕𝑠𝑖 𝑂 ∗ 𝜕𝑠𝑖 𝑂 𝜕𝑤𝑗,𝑖 𝑁+1 = 𝜕𝐿 𝜕𝑦𝑖 𝑓′ 𝑠𝑖 𝑂 ℎ𝑗 𝑁 𝑁 𝑡ℎ hidden 𝛿𝑖 𝑁 = 𝜕𝐿 𝜕𝑦𝑖 ∗ 𝜕𝑦𝑖 𝜕𝑠𝑖 𝑂 𝜕𝐿 𝜕𝑤 𝑘,𝑗 𝑁 = 𝑖 𝜕𝐿 𝜕𝑦𝑖 ∗ 𝜕𝑦𝑖 𝜕𝑠𝑖 𝑂 ∗ 𝜕𝑠𝑖 𝑂 𝜕ℎ𝑗 𝑁 ∗ 𝜕ℎ𝑗 𝑁 𝜕𝑠𝑗 𝑁 ∗ 𝜕𝑠𝑗 𝑁 𝜕𝑤𝑗,𝑖 𝑁 = 𝑖 𝛿𝑖 𝑁 𝑤𝑖,𝑗 𝑁+1 𝑓′ 𝑠𝑗 𝑁 ℎ 𝑘 𝑁−1 (𝑁 − 𝛿𝑗 𝑁−1 = 𝑖 𝛿𝑖 𝑁 𝑤𝑖,𝑗 𝑁+1 𝑓′ 𝑠𝑗 𝑁 𝜕𝐿 𝜕𝑤𝑙,𝑘 𝑁−1 = 𝑗 𝛿𝑗 𝑁−1 𝑤𝑗,𝑘 𝑁 𝑓′ 𝑠 𝑘 𝑁−1 ℎ𝑙 𝑁−2 … 1 𝑠𝑡 hidden 𝛿 𝑘 1 𝜕𝐿 𝜕𝑤𝑖,𝑗 1 = 𝑘 𝛿 𝑘 1 𝑤 𝑘,𝑙 2 𝑓′ 𝑠𝑗 1 𝑥𝑖
  • 10. Challenges of training deep networks • Saturation • Vanishing gradients • Overfitting • Slowness of second order methods • Slow convergence, stucks in local optima with first order methods • (Exploding gradients)
  • 12. Breakthroughs in research • Saturation & vanishing gradients  Layer-by-layer training (2006)  Non-saturating activation functions, e.g. ReLU (2013) • Overfitting  Dropout (2014) • Convergence problems  Adagrad, Adadelta, Adam, RMSProp, etc.
  • 13. Computational power • Natural increase in computational power • GP GPU technology
  • 15. Don’t give in to the HYPE • Deep learning is impressive but  deep learning is not true AI o it may be a component of it when and if AI is created  deep learning is not how the human brain works  95% of machine learning tasks don’t require deep learning  deep learning requires a lot of computational power • Deep learning is a tool  which is successful in certain, previously very challenging domains (speech recognition, computer vision, NLP, etc.)  that excels in pattern recognition You are here
  • 17. From the Netflix prize... • Netflix prize (2006-2009)  Gave a huge push to recommender systems research  Determined the direction of research for years  Task: o Some (User, Item, Rating) known triplets o (User, Item) pairs with unknown rating o Predict the missing ratings (1-5)
  • 18. ... to recommenders in practice • Ratings  events [implicit feedback]  Lots of services don’t allow for rating  Majority of users don’t rate  Monitored passively  preferences have to be infered • Rating prediction  ranking [top N recommendations]  All that matters is the relevancy of the top N items  Rating prediction is biased • User  session / situation [session-based / context-driven recommendation]  Users are not logged in, identification is unreliable  Accounts used by multiple users  Aim of the session (e.g. buy a good laptop)  Similar behavior of different users in a situation, different behavior of the same user in different situations
  • 19. Challenges in RecSys • Session modeling  Most of the algorithms are personalized  A few are item-to-item o Recommends similar items o Also used for session-based recommendations (industry de facto standard)  There are no good session based solutions • Incorporating factors that influence user clicks  Users click based on what they see o Title o Product image o Description  and on their knowledge of the product o Usually harder to model o Except when the product is content (e.g. music)
  • 20. Deep learning to the rescue – Session modeling • Recurrent Neural Networks (RNN)  Sequence modeling  Hidden state: next state is based on the previous hidden state and the current input  „Infinite” depth  More sophisticated versions: GRU, LSTM • Needs to be adapted to the recommendation task • GRU4Rec:  Session-parallel minibatch training for handling the large variance in session lengths  Sampling the output for reasonable training times, without losing much accuracy  Ranking loss for better item ranking • Results: 15-30% improvement over item-to-item recommendations 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 RSC15 VIDEO Recall@20 Item-kNN GRU4Rec 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 RSC15 VIDEO MRR@20 Item-kNN GRU4Rec
  • 21. Other uses of deep learning for recsys • Incorporating content directly  Music, images, video, text  User influencing aspects of the items  Direct content representation • Context-state modeling from sensory data  IoT devices  Lot of sensory data  Some missing and noise  Infer context state and recommend accordingly • Interactive recommenders using chatbots • Personalized content generation  Today’s news  Images in personalized style with personalized content • Etc...
  • 22. There is work to be done • DL + RecSys research: just started  Last year: o 0 long papers, 1 short paper and 1 poster that is loosely connected  This year: o 10+ submissions to RecSys in this topic o DLRS 2016 workshop @ RecSys • Open questions  (More) Application areas  Adaptations required for the recsys problem  Scalability  Best practices  ...
  • 23. Thanks for your attention!