SlideShare uma empresa Scribd logo
1 de 30
PERSONALIZED RETWEET PREDICTION IN TWITTER


        Liangjie Hong*, Lehigh University
        Aziz Doumith, Lehigh University
  1
        Brian D. Davison, Lehigh University
OVERVIEW
 Motivation
 Related Work

 Our Method

 Experimental Results




                         2
MOTIVATIONS
Social information platforms




                               3
MOTIVATIONS
Information overload




                       4
MOTIVATIONS
Information shortage




                       5
MOTIVATIONS




                                                                                          6


              Photo from: http://www.jenful.com/2011/06/google-1-and-the-filter-bubble/
MOTIVATIONS




                                                 7


              Photo from: http://graphlab.org/
MOTIVATIONS




                                               8


              Photo from: http://kexino.com/
MOTIVATIONS




                                                                        9


              Photo from: http://performancemarketingassociation.com/
TASKS
Given a target user and his/her friends, provide a ranked
list of tweets from these friends such that the tweets that
are potentially retweeted will be ranked higher.




                                                                              10


                    Photo from: http://performancemarketingassociation.com/
RELATED WORK
Generic Popular Tweets Analysis/Prediction
 [Suh et al., SocialCom 2010]

 [Y. Kim and K. Shim, ICDM, 2011]

 [Uysal and W. B. Croft, CIKM 2011]

 [Hong et al., WWW 2011]

Personalized Tweets Prediction
 [Chen et al., SIGIR 2012]

 [Peng et al., ICDM Workshop 2011]



                                             11
RELATED WORK
Generic Popular Tweets Analysis/Prediction
 [Suh et al., SocialCom 2010]

 [Y. Kim and K. Shim, ICDM, 2011]

 [Uysal and W. B. Croft, CIKM 2011]

 [Hong et al., WWW 2011]

Personalized Tweets Analysis/Prediction
 [Chen et al., SIGIR 2012]

 [Peng et al., ICDM Workshop 2011]


       Understanding users’ behaviors & content modeling
                                                           12
OUR METHOD
Design requirements
 Utilize users’ historical behaviors

 Collaborative filtering

 Incorporating a rich-set of features

 Coupled modeling with content

 Learning a correct objective function

 Scalability




                                          13
OUR METHOD
Design requirements
 Utilize users’ historical behaviors

 Collaborative filtering




   Latent factor models



                                        14
OUR METHOD
Design requirements
 Utilize users’ historical behaviors

 Collaborative filtering

 Incorporating a rich-set of features




   Latent factor models
       Factorization Machines [Rendle, ACM TIST 2012]
                                                         15
OUR METHOD
Factorization Machines
 Generic enough
       matrix factorization
       pairwise interaction tensor factorization
       SVD++
       neighborhood models
       …
   Technically mature
       [Rendle, ICDM 2010]
       [Rendle et al., SIGIR 2011]
       [Freudenthaler et al., NIPS Workshop 2011]
       [Rendle et al., WSDM 2012]
                                                     16
       [Rendle, ACM TIST 2012]
OUR METHOD
Extending Factorization Machines
 Non-negative decomposition of term-tweet matrix
       Compatible to standard topic models
   Co-Factorization Machines
       Multiple aspects of the dataset
         Shared feature paradigm
         Shared latent space paradigm

         Regularized latent space paradigm




                                                    17
OUR METHOD
Design requirements
 Utilize users’ historical behaviors

 Collaborative filtering

 Incorporating a rich-set of features

 Coupled modeling with content

 Learning a correct objective function

 Scalability




                                          18
OUR METHOD
Design requirements
 Learning objective functions for different aspects
     User decisions
         Ranking-based loss
          Weighted Approximately Rank Pairwise loss (WARP)
     Content modeling
       Log-Poisson loss
       Logistic loss




                                                             19
OUR METHOD
WARP loss
 Proposed by [Usunier et al., ICML 2009]

 Image retrieval tasks and IR tasks
    [Weston et al., Machine Learning 2010]
    [Weston et al., ICML 2012]
    [Weston et al., UAI 2012]
    [Bordes et al, AISTATS 2012]
   Can mimic many ranking measures
       NDCG, MAP, Precision@k


   Applied to collaborative filtering
                                             20
OUR METHOD
Design requirements
 Utilize users’ historical behaviors

 Collaborative filtering

 Incorporating a rich-set of features

 Coupled modeling with content

 Learning a correct objective function

 Scalability (Stochastic Gradient Descent)




                                              21
EXPERIMENTS
Twitter data
 0.7M target users with 11M tweets

 4.3M neighbor users with 27M tweets

 “Complete” sample for each target user

 Mean Average Precision (MAP) as measure

 Train/test on consecutive time periods




                                            22
EXPERIMENTS
Comparisons
 Matrix factorization (MF)

 Matrix factorization with attributes (MFA)

 CPTR [Chen et al, SIGIR 2012]

 Factorization machines with attributes (FMA)

 CoFM with shared features (CoFM-SF)

 CoFM with shared latent spaces (CoFM-SL)

 CoFM with latent space regularization (CoFM-REG)



                                                     23
EXPERIMENTS
Comparisons
 Matrix factorization (MF)

 Matrix factorization with attributes (MFA)

 CPTR [Chen et al, SIGIR 2012]

 Factorization machines with attributes (FMA)

 CoFM with shared features (CoFM-SF)

 CoFM with shared latent spaces (CoFM-SL)

 CoFM with latent space regularization (CoFM-REG)



                                                     24
EXPERIMENTS




              25
EXPERIMENTS




              26
EXPERIMENTS




              27
EXPERIMENTS
Examples of topics are shown. The terms are top ranked terms in
each topic. The topic names in bold are given by the authors.
Entertainment


album music lady artist video listen itunes apple produced movies #bieber bieber new songs

Finance

percent billion bank financial debt banks euro crisis rates greece bailout spain economy


Politics


party election budget tax president million obama money pay bill federal increase cuts
                                                                                             28
CONCLUSIONS
   Main contributions
     Propose Co-Factorization Machines (CoFM) to handle
      two (multiple) aspects of the dataset.
     Apply FM to text data with constraints to mimic topic
      models
     Introduce WARP loss into collaborative filtering/recsys
      models
     Explore a wide range of features and demonstrate the
      effectiveness of feature sets with significant
      improvement over several non-trival baselines.


                                                                29
THANK YOU.




   Liangjie Hong
   PhD candidate
   WUME Lab
   Lehigh University
   lih307@cse.lehigh.edu   30

Mais conteúdo relacionado

Semelhante a Personalized Retweet Prediction in Twitter

Towards Method Engineering of Model-Driven User Interface Development
Towards Method Engineering ofModel-Driven User Interface Development Towards Method Engineering ofModel-Driven User Interface Development
Towards Method Engineering of Model-Driven User Interface Development Jean Vanderdonckt
 
Automating Software Development Using Artificial Intelligence (AI)
Automating Software Development Using Artificial Intelligence (AI)Automating Software Development Using Artificial Intelligence (AI)
Automating Software Development Using Artificial Intelligence (AI)Jeremy Bradbury
 
A Context-aware Model for the Analysis of User Interaction and QoE in Mobile ...
A Context-aware Model for the Analysis of User Interaction and QoE in Mobile ...A Context-aware Model for the Analysis of User Interaction and QoE in Mobile ...
A Context-aware Model for the Analysis of User Interaction and QoE in Mobile ...Pedro Luis Mateo Navarro
 
REQUIREMENTS VARIABILITY SPECIFICATION FOR DATA INTENSIVE SOFTWARE
REQUIREMENTS VARIABILITY SPECIFICATION FOR DATA INTENSIVE SOFTWARE REQUIREMENTS VARIABILITY SPECIFICATION FOR DATA INTENSIVE SOFTWARE
REQUIREMENTS VARIABILITY SPECIFICATION FOR DATA INTENSIVE SOFTWARE mathsjournal
 
Requirements Variability Specification for Data Intensive Software
Requirements Variability Specification for Data Intensive Software Requirements Variability Specification for Data Intensive Software
Requirements Variability Specification for Data Intensive Software ijseajournal
 
Discreate eventsimulation idef
Discreate eventsimulation idefDiscreate eventsimulation idef
Discreate eventsimulation idefMandar Trivedi
 
Productivity mdd mdb_code_centric
Productivity mdd mdb_code_centricProductivity mdd mdb_code_centric
Productivity mdd mdb_code_centricSantiago Meliá
 
OO Development 1 - Introduction to Object-Oriented Development
OO Development 1 - Introduction to Object-Oriented DevelopmentOO Development 1 - Introduction to Object-Oriented Development
OO Development 1 - Introduction to Object-Oriented DevelopmentRandy Connolly
 
PresentationTest
PresentationTestPresentationTest
PresentationTestbolu804
 
Large Graph Mining
Large Graph MiningLarge Graph Mining
Large Graph MiningSabri Skhiri
 
Fundamentals of Deep Recommender Systems
 Fundamentals of Deep Recommender Systems Fundamentals of Deep Recommender Systems
Fundamentals of Deep Recommender SystemsWQ Fan
 
Enhanced Feature Analysis Framework for Comparative Analysis & Evaluation of ...
Enhanced Feature Analysis Framework for Comparative Analysis & Evaluation of ...Enhanced Feature Analysis Framework for Comparative Analysis & Evaluation of ...
Enhanced Feature Analysis Framework for Comparative Analysis & Evaluation of ...IJCSIS Research Publications
 
Systems variability modeling a textual model mixing class and feature concepts
Systems variability modeling a textual model mixing class and feature conceptsSystems variability modeling a textual model mixing class and feature concepts
Systems variability modeling a textual model mixing class and feature conceptsijcsit
 
A Model To Compare The Degree Of Refactoring Opportunities Of Three Projects ...
A Model To Compare The Degree Of Refactoring Opportunities Of Three Projects ...A Model To Compare The Degree Of Refactoring Opportunities Of Three Projects ...
A Model To Compare The Degree Of Refactoring Opportunities Of Three Projects ...acijjournal
 
A MODEL TO COMPARE THE DEGREE OF REFACTORING OPPORTUNITIES OF THREE PROJECTS ...
A MODEL TO COMPARE THE DEGREE OF REFACTORING OPPORTUNITIES OF THREE PROJECTS ...A MODEL TO COMPARE THE DEGREE OF REFACTORING OPPORTUNITIES OF THREE PROJECTS ...
A MODEL TO COMPARE THE DEGREE OF REFACTORING OPPORTUNITIES OF THREE PROJECTS ...acijjournal
 
Self-adaptation Driven by goals in SysML Models
Self-adaptation Driven by goals in SysML ModelsSelf-adaptation Driven by goals in SysML Models
Self-adaptation Driven by goals in SysML Modelsamalanda1
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...Dr. Haxel Consult
 

Semelhante a Personalized Retweet Prediction in Twitter (20)

Towards Method Engineering of Model-Driven User Interface Development
Towards Method Engineering ofModel-Driven User Interface Development Towards Method Engineering ofModel-Driven User Interface Development
Towards Method Engineering of Model-Driven User Interface Development
 
Automating Software Development Using Artificial Intelligence (AI)
Automating Software Development Using Artificial Intelligence (AI)Automating Software Development Using Artificial Intelligence (AI)
Automating Software Development Using Artificial Intelligence (AI)
 
A Context-aware Model for the Analysis of User Interaction and QoE in Mobile ...
A Context-aware Model for the Analysis of User Interaction and QoE in Mobile ...A Context-aware Model for the Analysis of User Interaction and QoE in Mobile ...
A Context-aware Model for the Analysis of User Interaction and QoE in Mobile ...
 
REQUIREMENTS VARIABILITY SPECIFICATION FOR DATA INTENSIVE SOFTWARE
REQUIREMENTS VARIABILITY SPECIFICATION FOR DATA INTENSIVE SOFTWARE REQUIREMENTS VARIABILITY SPECIFICATION FOR DATA INTENSIVE SOFTWARE
REQUIREMENTS VARIABILITY SPECIFICATION FOR DATA INTENSIVE SOFTWARE
 
Requirements Variability Specification for Data Intensive Software
Requirements Variability Specification for Data Intensive Software Requirements Variability Specification for Data Intensive Software
Requirements Variability Specification for Data Intensive Software
 
Discreate eventsimulation idef
Discreate eventsimulation idefDiscreate eventsimulation idef
Discreate eventsimulation idef
 
Productivity mdd mdb_code_centric
Productivity mdd mdb_code_centricProductivity mdd mdb_code_centric
Productivity mdd mdb_code_centric
 
OO Development 1 - Introduction to Object-Oriented Development
OO Development 1 - Introduction to Object-Oriented DevelopmentOO Development 1 - Introduction to Object-Oriented Development
OO Development 1 - Introduction to Object-Oriented Development
 
Sub1583
Sub1583Sub1583
Sub1583
 
PresentationTest
PresentationTestPresentationTest
PresentationTest
 
Large Graph Mining
Large Graph MiningLarge Graph Mining
Large Graph Mining
 
2008.11560v2.pdf
2008.11560v2.pdf2008.11560v2.pdf
2008.11560v2.pdf
 
Fundamentals of Deep Recommender Systems
 Fundamentals of Deep Recommender Systems Fundamentals of Deep Recommender Systems
Fundamentals of Deep Recommender Systems
 
Introduction to MDE
Introduction to MDEIntroduction to MDE
Introduction to MDE
 
Enhanced Feature Analysis Framework for Comparative Analysis & Evaluation of ...
Enhanced Feature Analysis Framework for Comparative Analysis & Evaluation of ...Enhanced Feature Analysis Framework for Comparative Analysis & Evaluation of ...
Enhanced Feature Analysis Framework for Comparative Analysis & Evaluation of ...
 
Systems variability modeling a textual model mixing class and feature concepts
Systems variability modeling a textual model mixing class and feature conceptsSystems variability modeling a textual model mixing class and feature concepts
Systems variability modeling a textual model mixing class and feature concepts
 
A Model To Compare The Degree Of Refactoring Opportunities Of Three Projects ...
A Model To Compare The Degree Of Refactoring Opportunities Of Three Projects ...A Model To Compare The Degree Of Refactoring Opportunities Of Three Projects ...
A Model To Compare The Degree Of Refactoring Opportunities Of Three Projects ...
 
A MODEL TO COMPARE THE DEGREE OF REFACTORING OPPORTUNITIES OF THREE PROJECTS ...
A MODEL TO COMPARE THE DEGREE OF REFACTORING OPPORTUNITIES OF THREE PROJECTS ...A MODEL TO COMPARE THE DEGREE OF REFACTORING OPPORTUNITIES OF THREE PROJECTS ...
A MODEL TO COMPARE THE DEGREE OF REFACTORING OPPORTUNITIES OF THREE PROJECTS ...
 
Self-adaptation Driven by goals in SysML Models
Self-adaptation Driven by goals in SysML ModelsSelf-adaptation Driven by goals in SysML Models
Self-adaptation Driven by goals in SysML Models
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
 

Último

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 

Último (20)

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

Personalized Retweet Prediction in Twitter

  • 1. PERSONALIZED RETWEET PREDICTION IN TWITTER Liangjie Hong*, Lehigh University Aziz Doumith, Lehigh University 1 Brian D. Davison, Lehigh University
  • 2. OVERVIEW  Motivation  Related Work  Our Method  Experimental Results 2
  • 6. MOTIVATIONS 6 Photo from: http://www.jenful.com/2011/06/google-1-and-the-filter-bubble/
  • 7. MOTIVATIONS 7 Photo from: http://graphlab.org/
  • 8. MOTIVATIONS 8 Photo from: http://kexino.com/
  • 9. MOTIVATIONS 9 Photo from: http://performancemarketingassociation.com/
  • 10. TASKS Given a target user and his/her friends, provide a ranked list of tweets from these friends such that the tweets that are potentially retweeted will be ranked higher. 10 Photo from: http://performancemarketingassociation.com/
  • 11. RELATED WORK Generic Popular Tweets Analysis/Prediction  [Suh et al., SocialCom 2010]  [Y. Kim and K. Shim, ICDM, 2011]  [Uysal and W. B. Croft, CIKM 2011]  [Hong et al., WWW 2011] Personalized Tweets Prediction  [Chen et al., SIGIR 2012]  [Peng et al., ICDM Workshop 2011] 11
  • 12. RELATED WORK Generic Popular Tweets Analysis/Prediction  [Suh et al., SocialCom 2010]  [Y. Kim and K. Shim, ICDM, 2011]  [Uysal and W. B. Croft, CIKM 2011]  [Hong et al., WWW 2011] Personalized Tweets Analysis/Prediction  [Chen et al., SIGIR 2012]  [Peng et al., ICDM Workshop 2011] Understanding users’ behaviors & content modeling 12
  • 13. OUR METHOD Design requirements  Utilize users’ historical behaviors  Collaborative filtering  Incorporating a rich-set of features  Coupled modeling with content  Learning a correct objective function  Scalability 13
  • 14. OUR METHOD Design requirements  Utilize users’ historical behaviors  Collaborative filtering  Latent factor models 14
  • 15. OUR METHOD Design requirements  Utilize users’ historical behaviors  Collaborative filtering  Incorporating a rich-set of features  Latent factor models  Factorization Machines [Rendle, ACM TIST 2012] 15
  • 16. OUR METHOD Factorization Machines  Generic enough  matrix factorization  pairwise interaction tensor factorization  SVD++  neighborhood models  …  Technically mature  [Rendle, ICDM 2010]  [Rendle et al., SIGIR 2011]  [Freudenthaler et al., NIPS Workshop 2011]  [Rendle et al., WSDM 2012] 16  [Rendle, ACM TIST 2012]
  • 17. OUR METHOD Extending Factorization Machines  Non-negative decomposition of term-tweet matrix  Compatible to standard topic models  Co-Factorization Machines  Multiple aspects of the dataset  Shared feature paradigm  Shared latent space paradigm  Regularized latent space paradigm 17
  • 18. OUR METHOD Design requirements  Utilize users’ historical behaviors  Collaborative filtering  Incorporating a rich-set of features  Coupled modeling with content  Learning a correct objective function  Scalability 18
  • 19. OUR METHOD Design requirements  Learning objective functions for different aspects  User decisions  Ranking-based loss Weighted Approximately Rank Pairwise loss (WARP)  Content modeling  Log-Poisson loss  Logistic loss 19
  • 20. OUR METHOD WARP loss  Proposed by [Usunier et al., ICML 2009]  Image retrieval tasks and IR tasks [Weston et al., Machine Learning 2010] [Weston et al., ICML 2012] [Weston et al., UAI 2012] [Bordes et al, AISTATS 2012]  Can mimic many ranking measures  NDCG, MAP, Precision@k  Applied to collaborative filtering 20
  • 21. OUR METHOD Design requirements  Utilize users’ historical behaviors  Collaborative filtering  Incorporating a rich-set of features  Coupled modeling with content  Learning a correct objective function  Scalability (Stochastic Gradient Descent) 21
  • 22. EXPERIMENTS Twitter data  0.7M target users with 11M tweets  4.3M neighbor users with 27M tweets  “Complete” sample for each target user  Mean Average Precision (MAP) as measure  Train/test on consecutive time periods 22
  • 23. EXPERIMENTS Comparisons  Matrix factorization (MF)  Matrix factorization with attributes (MFA)  CPTR [Chen et al, SIGIR 2012]  Factorization machines with attributes (FMA)  CoFM with shared features (CoFM-SF)  CoFM with shared latent spaces (CoFM-SL)  CoFM with latent space regularization (CoFM-REG) 23
  • 24. EXPERIMENTS Comparisons  Matrix factorization (MF)  Matrix factorization with attributes (MFA)  CPTR [Chen et al, SIGIR 2012]  Factorization machines with attributes (FMA)  CoFM with shared features (CoFM-SF)  CoFM with shared latent spaces (CoFM-SL)  CoFM with latent space regularization (CoFM-REG) 24
  • 28. EXPERIMENTS Examples of topics are shown. The terms are top ranked terms in each topic. The topic names in bold are given by the authors. Entertainment album music lady artist video listen itunes apple produced movies #bieber bieber new songs Finance percent billion bank financial debt banks euro crisis rates greece bailout spain economy Politics party election budget tax president million obama money pay bill federal increase cuts 28
  • 29. CONCLUSIONS  Main contributions  Propose Co-Factorization Machines (CoFM) to handle two (multiple) aspects of the dataset.  Apply FM to text data with constraints to mimic topic models  Introduce WARP loss into collaborative filtering/recsys models  Explore a wide range of features and demonstrate the effectiveness of feature sets with significant improvement over several non-trival baselines. 29
  • 30. THANK YOU. Liangjie Hong PhD candidate WUME Lab Lehigh University lih307@cse.lehigh.edu 30