SlideShare uma empresa Scribd logo
1 de 25
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Fabrikatyr Analytics
Uncover tangible truths amidst the noise of modern media
Recommendation service using Factorisation
Machines
PyCon Dublin - 2016
@Conr
@fabrikatyr
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Agenda
The Problem
Factorisation machines as a method
Getting the data right
Modelling and Deployment
Further Research
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation MachinesOct 2015 - PyCon Dublin 2015
Fabrikatyr – Increasing Customer Response Rate
Business Problem
Increase User Engagement by displaying content
which is both personalised and interesting
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Content can be User Generated or Taken from 3rd party
content provider;
Users Communities
A user can be in MANY
communities
Behaviours are consistent
across communities,
Content consumption is not
Interesting Content will generate
● Likes
● Comments
● Share
Content can be ‘EverGreen’
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation MachinesOct 2015 - PyCon Dublin 2015 Fabrikatyr – Increasing Customer Response Rate
Optimisation
Problem
5
General
recommender
Speed & Scalability
Sparse data set
How to select an method
to deploy which can
answer the challenges?
Accuracy is important, but the goal is
to generate recommendations which
are consumed
The system needs to respond quickly
to trends and topics across
communities
Lot of ‘hidden’ behaviours
Measurably engagement is Low so
the data set is very sparse
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Factorisation Machines appeared to be the method
which answered the challenge
Factorisation
Machines
General accuracy Quick Designed for it
Accuracy Speed Sparsity
Collaborative Filter Too Accurate Suitable Suitable
Support Vector
Machines
Too Accurate Suitable Unsuitable
Random Forest /
CART
General Accuracy Unsuitable Unsuitable
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation MachinesOct 2015 - PyCon Dublin 2015
Fabrikatyr – Increasing Customer Response Rate
Factorisation machines as a
method
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Factorisation Machine
- The Equation
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Factorisation Machine
- The Equation
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Factorisation Machine
- The Equation
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Limits of Factorisation Machines
Need to understand your features as the model
Not good with ‘dense’ data with binary outcomes
Relatively newer method, but supported by most languages
General model, so predictions are also general
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation MachinesOct 2015 - PyCon Dublin 2015
Fabrikatyr – Increasing Customer Response Rate
Getting the data right
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines 13
The data from the systems needs to be examined and
structured before executing the model
3 groups of
information
Users
Content
Context
Time was NOT a feature
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Important
Unimportant
● Is the user and Admin / Moderator
● Has the User ‘logged-in’
Not all USER behaviours are important when using
a generalised model
● User behaviour
● Engagement
● Count of Community membership
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Engagement
Keywords
● Did the user ‘Like’ the content
● Did the user ‘comment’ on the content
● Did the user ‘share’ the content
Content needs to be given ‘Context’ to be worked
with effectively
● Which keywords does the content have?
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
The general behaviour is that a set of users and
content generate most of the activity
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Final result was a ‘wide’ dataset per user
event with many columns
● Each time a user either saw content or it engaged
with it a row must be added to the data set
● Keywords, likes, etc. all receive a 1 or a 0 for ALL the
events
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation MachinesOct 2015 - PyCon Dublin 2015
Fabrikatyr – Increasing Customer Response Rate
Modelling and Deployment
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
We used an application which could deploy
our python model at scale
Turi Predictive Services supports model
predictions, hosting and managing
machine learning models as low-latency
RESTful services.
Turi was acquired by Apple Inc. for $200
mill
Domino Data labs is an alternative
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
sFrame versus Pandas - works for
Factorisation Machines
SFrame is an scalable, out-of-core
dataframe, which
Allows you to work with datasets that are
larger than the amount of RAM on your
system.
Similar to Spark RDD
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Content store
Solution Architecture
Guest data
Factorisation
Machine
model
Scoring Engine
&
Recommendations
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Machine Learning Service
Content store
URL’s
Guest behaviour
Balance between online / offline calcuations
Guest data
Factorisation
Machine
model
Content consumption
Guest classification
Scoring Engine
&
Recommendations
Content URL
C#
content
server
(Offline/batch)
(Online)
Model weights
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Further analysis
Injecting content into the model
Consumption tracking using A/B testing
Presentation Bias - does rank affect consumption
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
Fabrikatyr Analytics
Uncover tangible truths amidst the noise of modern media
Thank you - Any Questions?
PyCon Dublin - 2016
@Conr
@fabrikatyr
Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines
References
● http://www.csie.ntu.edu.tw/~b97053/paper/Rendle20
10FM.pdf
● https://github.com/ibayer/fastFM
● www.libfm.org
● https://github.com/zhengruifeng/spark-libFM
● https://github.com/scikit-learn-contrib/polylearn

Mais conteúdo relacionado

Semelhante a Content Recommendation using factorisation machines ; Pycon Ireland 2016

Building the Data-Driven Organization
Building the Data-Driven OrganizationBuilding the Data-Driven Organization
Building the Data-Driven OrganizationLora Cecere
 
Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...
Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...
Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...Seeling Cheung
 
Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...
Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...
Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...Lora Cecere
 
Getting to grips with a Service Level Agreement and how SLA-Ready can help
Getting to grips with a Service Level Agreement and how SLA-Ready can helpGetting to grips with a Service Level Agreement and how SLA-Ready can help
Getting to grips with a Service Level Agreement and how SLA-Ready can helpSLA-Ready Network
 
Data and Analytics: What Your Data Is Telling You and Why It's Worth Listening
Data and Analytics: What Your Data Is Telling You and Why It's Worth ListeningData and Analytics: What Your Data Is Telling You and Why It's Worth Listening
Data and Analytics: What Your Data Is Telling You and Why It's Worth ListeningSAP Ariba
 
Faurecia's roadmap to digitalization
Faurecia's roadmap to digitalizationFaurecia's roadmap to digitalization
Faurecia's roadmap to digitalizationJose Gascon
 
Enterprise Applications of Text Intelligence - Lecture slides
Enterprise Applications of Text Intelligence - Lecture slidesEnterprise Applications of Text Intelligence - Lecture slides
Enterprise Applications of Text Intelligence - Lecture slidesUniversity St. Gallen
 
Smart Factories Redefine Security Paradigms
Smart Factories Redefine Security ParadigmsSmart Factories Redefine Security Paradigms
Smart Factories Redefine Security Paradigmsteam-WIBU
 
Supply Chain Visibility via Cargo Tracking and Monitoring
Supply Chain Visibility via Cargo Tracking and MonitoringSupply Chain Visibility via Cargo Tracking and Monitoring
Supply Chain Visibility via Cargo Tracking and MonitoringInspiricon AG
 
Apps World Europe: Data Management panel.
Apps World Europe: Data Management panel.Apps World Europe: Data Management panel.
Apps World Europe: Data Management panel.Steef Klein
 
Journey to analytics in the cloud
Journey to analytics in the cloudJourney to analytics in the cloud
Journey to analytics in the cloudSaama
 
Capturing online customer data to create better insights and targeted actions...
Capturing online customer data to create better insights and targeted actions...Capturing online customer data to create better insights and targeted actions...
Capturing online customer data to create better insights and targeted actions...yalisassoon
 
Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...
Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...
Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...DataBench
 
Digitalize Manufacturing Through SAP S/4HANA Cloud
Digitalize Manufacturing Through SAP S/4HANA CloudDigitalize Manufacturing Through SAP S/4HANA Cloud
Digitalize Manufacturing Through SAP S/4HANA CloudCapgemini
 
Simpda 2014 - A living story: measuring quality of developments in a large in...
Simpda 2014 - A living story: measuring quality of developments in a large in...Simpda 2014 - A living story: measuring quality of developments in a large in...
Simpda 2014 - A living story: measuring quality of developments in a large in...SpagoWorld
 
Prodapt Corporate Overview Presentation
Prodapt Corporate Overview PresentationProdapt Corporate Overview Presentation
Prodapt Corporate Overview PresentationProdapt Solutions
 
Cwin16 tls-faurecia predictive maintenance
Cwin16 tls-faurecia predictive maintenanceCwin16 tls-faurecia predictive maintenance
Cwin16 tls-faurecia predictive maintenanceCapgemini
 
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Pactera_US
 

Semelhante a Content Recommendation using factorisation machines ; Pycon Ireland 2016 (20)

Building the Data-Driven Organization
Building the Data-Driven OrganizationBuilding the Data-Driven Organization
Building the Data-Driven Organization
 
Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...
Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...
Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...
 
Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...
Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...
Imagining Supply Chain Processes Outside-in. Building Value Networks at IBM t...
 
Getting to grips with a Service Level Agreement and how SLA-Ready can help
Getting to grips with a Service Level Agreement and how SLA-Ready can helpGetting to grips with a Service Level Agreement and how SLA-Ready can help
Getting to grips with a Service Level Agreement and how SLA-Ready can help
 
Are you SLA-Ready?
Are you SLA-Ready?Are you SLA-Ready?
Are you SLA-Ready?
 
Data and Analytics: What Your Data Is Telling You and Why It's Worth Listening
Data and Analytics: What Your Data Is Telling You and Why It's Worth ListeningData and Analytics: What Your Data Is Telling You and Why It's Worth Listening
Data and Analytics: What Your Data Is Telling You and Why It's Worth Listening
 
Faurecia's roadmap to digitalization
Faurecia's roadmap to digitalizationFaurecia's roadmap to digitalization
Faurecia's roadmap to digitalization
 
Enterprise Applications of Text Intelligence - Lecture slides
Enterprise Applications of Text Intelligence - Lecture slidesEnterprise Applications of Text Intelligence - Lecture slides
Enterprise Applications of Text Intelligence - Lecture slides
 
Smart Factories Redefine Security Paradigms
Smart Factories Redefine Security ParadigmsSmart Factories Redefine Security Paradigms
Smart Factories Redefine Security Paradigms
 
Supply Chain Visibility via Cargo Tracking and Monitoring
Supply Chain Visibility via Cargo Tracking and MonitoringSupply Chain Visibility via Cargo Tracking and Monitoring
Supply Chain Visibility via Cargo Tracking and Monitoring
 
Apps World Europe: Data Management panel.
Apps World Europe: Data Management panel.Apps World Europe: Data Management panel.
Apps World Europe: Data Management panel.
 
Journey to analytics in the cloud
Journey to analytics in the cloudJourney to analytics in the cloud
Journey to analytics in the cloud
 
Capturing online customer data to create better insights and targeted actions...
Capturing online customer data to create better insights and targeted actions...Capturing online customer data to create better insights and targeted actions...
Capturing online customer data to create better insights and targeted actions...
 
Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...
Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...
Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...
 
Digitalize Manufacturing Through SAP S/4HANA Cloud
Digitalize Manufacturing Through SAP S/4HANA CloudDigitalize Manufacturing Through SAP S/4HANA Cloud
Digitalize Manufacturing Through SAP S/4HANA Cloud
 
3D Printing and Entrepreneurship A Co-Evolution
3D Printing and Entrepreneurship A Co-Evolution3D Printing and Entrepreneurship A Co-Evolution
3D Printing and Entrepreneurship A Co-Evolution
 
Simpda 2014 - A living story: measuring quality of developments in a large in...
Simpda 2014 - A living story: measuring quality of developments in a large in...Simpda 2014 - A living story: measuring quality of developments in a large in...
Simpda 2014 - A living story: measuring quality of developments in a large in...
 
Prodapt Corporate Overview Presentation
Prodapt Corporate Overview PresentationProdapt Corporate Overview Presentation
Prodapt Corporate Overview Presentation
 
Cwin16 tls-faurecia predictive maintenance
Cwin16 tls-faurecia predictive maintenanceCwin16 tls-faurecia predictive maintenance
Cwin16 tls-faurecia predictive maintenance
 
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data
 

Último

Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 

Último (20)

Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 

Content Recommendation using factorisation machines ; Pycon Ireland 2016

  • 1. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Fabrikatyr Analytics Uncover tangible truths amidst the noise of modern media Recommendation service using Factorisation Machines PyCon Dublin - 2016 @Conr @fabrikatyr
  • 2. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Agenda The Problem Factorisation machines as a method Getting the data right Modelling and Deployment Further Research
  • 3. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation MachinesOct 2015 - PyCon Dublin 2015 Fabrikatyr – Increasing Customer Response Rate Business Problem Increase User Engagement by displaying content which is both personalised and interesting
  • 4. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Content can be User Generated or Taken from 3rd party content provider; Users Communities A user can be in MANY communities Behaviours are consistent across communities, Content consumption is not Interesting Content will generate ● Likes ● Comments ● Share Content can be ‘EverGreen’
  • 5. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation MachinesOct 2015 - PyCon Dublin 2015 Fabrikatyr – Increasing Customer Response Rate Optimisation Problem 5 General recommender Speed & Scalability Sparse data set How to select an method to deploy which can answer the challenges? Accuracy is important, but the goal is to generate recommendations which are consumed The system needs to respond quickly to trends and topics across communities Lot of ‘hidden’ behaviours Measurably engagement is Low so the data set is very sparse
  • 6. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Factorisation Machines appeared to be the method which answered the challenge Factorisation Machines General accuracy Quick Designed for it Accuracy Speed Sparsity Collaborative Filter Too Accurate Suitable Suitable Support Vector Machines Too Accurate Suitable Unsuitable Random Forest / CART General Accuracy Unsuitable Unsuitable
  • 7. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation MachinesOct 2015 - PyCon Dublin 2015 Fabrikatyr – Increasing Customer Response Rate Factorisation machines as a method
  • 8. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Factorisation Machine - The Equation
  • 9. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Factorisation Machine - The Equation
  • 10. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Factorisation Machine - The Equation
  • 11. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Limits of Factorisation Machines Need to understand your features as the model Not good with ‘dense’ data with binary outcomes Relatively newer method, but supported by most languages General model, so predictions are also general
  • 12. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation MachinesOct 2015 - PyCon Dublin 2015 Fabrikatyr – Increasing Customer Response Rate Getting the data right
  • 13. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines 13 The data from the systems needs to be examined and structured before executing the model 3 groups of information Users Content Context Time was NOT a feature
  • 14. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Important Unimportant ● Is the user and Admin / Moderator ● Has the User ‘logged-in’ Not all USER behaviours are important when using a generalised model ● User behaviour ● Engagement ● Count of Community membership
  • 15. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Engagement Keywords ● Did the user ‘Like’ the content ● Did the user ‘comment’ on the content ● Did the user ‘share’ the content Content needs to be given ‘Context’ to be worked with effectively ● Which keywords does the content have?
  • 16. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines The general behaviour is that a set of users and content generate most of the activity
  • 17. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Final result was a ‘wide’ dataset per user event with many columns ● Each time a user either saw content or it engaged with it a row must be added to the data set ● Keywords, likes, etc. all receive a 1 or a 0 for ALL the events
  • 18. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation MachinesOct 2015 - PyCon Dublin 2015 Fabrikatyr – Increasing Customer Response Rate Modelling and Deployment
  • 19. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines We used an application which could deploy our python model at scale Turi Predictive Services supports model predictions, hosting and managing machine learning models as low-latency RESTful services. Turi was acquired by Apple Inc. for $200 mill Domino Data labs is an alternative
  • 20. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines sFrame versus Pandas - works for Factorisation Machines SFrame is an scalable, out-of-core dataframe, which Allows you to work with datasets that are larger than the amount of RAM on your system. Similar to Spark RDD
  • 21. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Content store Solution Architecture Guest data Factorisation Machine model Scoring Engine & Recommendations
  • 22. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Machine Learning Service Content store URL’s Guest behaviour Balance between online / offline calcuations Guest data Factorisation Machine model Content consumption Guest classification Scoring Engine & Recommendations Content URL C# content server (Offline/batch) (Online) Model weights
  • 23. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Further analysis Injecting content into the model Consumption tracking using A/B testing Presentation Bias - does rank affect consumption
  • 24. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines Fabrikatyr Analytics Uncover tangible truths amidst the noise of modern media Thank you - Any Questions? PyCon Dublin - 2016 @Conr @fabrikatyr
  • 25. Nov 2016- PyCon Dublin 2016 Fabrikatyr – Factorisation Machines References ● http://www.csie.ntu.edu.tw/~b97053/paper/Rendle20 10FM.pdf ● https://github.com/ibayer/fastFM ● www.libfm.org ● https://github.com/zhengruifeng/spark-libFM ● https://github.com/scikit-learn-contrib/polylearn

Notas do Editor

  1. We use modern data gathering and machine learning techniques to fuel our visualisation platform
  2. 01/04/2015