SlideShare uma empresa Scribd logo
1 de 32
Baixar para ler offline
Ranking Resources in Folksonomies
      By Exploiting Semantic Information
                                        i-KNOW 2012, Saarbrücken



                                                     Users               Tags                Resources



                                                                         Research
                                                                           Talk



                                                                                                                                                           n
                                                                                                                                                Perso
                                                                                                                                    Typ
                                                                                                                                        e                      Loca
                                                                                                                                                                   tion
                                                                         Ranking
                                                                                                                                                Topic
                                                                        Algorithms
                                                                                                                                    Event                      Oth
                                                                                                                                              Act                 er
                                                                                                                                                 ivit
                                                                                                                                                      y



Thomas Rodenhausen                                                      Slideshare

Mojisola Anjorin
Renato Domínguez García
Christoph Rensing                                                                                                                                      Prof. Dr.-Ing. Ralf Steinmetz
Ralf Steinmetz                                                                                                                              KOM - Multimedia Communications Lab

Iknow_Ranking_Sem_Info_v9.0__2012.09.07_MA.pptx                                                                                                                             7-Sep-12
© author(s) of these slides including research results from the KOM research network and TU Darmstadt; otherwise it is specified at the respective slide
Social Tagging Applications

Social tagging applications are used to organize, classify, manage
 and share knowledge resources
 !  Tags are freely chosen keywords attached to resources
 !  Tags often describe an aspect of the resource




                                                                          Lora
                                                                                o
                                                                          Aroy     Graz
                                                               Key
                                                                   note    Dial E
                                                                I-know
                                                                              for
                                                                 2012       Events Sema
                                                                                        n
                                                                        Pre
                                                                            pa rin
                                                                                    W tic
                                                                                       eb
                                                                                   g
                                                                      Con for
                                                                         fer
                                                                             enc
                                                                                 e




                                                            KOM – Multimedia Communications Lab   2
                                                                                www.wordle.net
Recommendations in Social Tagging
Applications




                                    KOM – Multimedia Communications Lab   3
Overview

 !  Motivation
 !  Basics
    !  Folksonomy
    !  Folksonomy Extended by Tag Types
    !  Graph-based Resource Recommendation
    !  Challenge: Concept Drift
 !  AspectScore & InteliScore
 !  Evaluation Methodology and Metrics
 !  Results
 !  Conclusion & Future Work




                                             KOM – Multimedia Communications Lab   4
Folksonomy

                                 Users    Tags                Resources
A folksonomy is a quadruple
F:= (U, T, R, Y), where
                                          Research
                                            Talk

U – Users
T – Tags
R – Resources
Y ⊆ U ! T ! R - tag assignment            Ranking
                                         Algorithms




                                         Slideshare



 [Hotho et al. 2006]
                                          KOM – Multimedia Communications Lab   5
Folksonomy Extended by Tag Types
                                                                Users    Tags              Resources
An extended folksonomy
FA:= (U, T, R, A, Y) where
                                                                        Research
                                                                          Talk

U – Users                             Perso
                                              n
                          Typ
T – Tags                        e                 Loca
                                                         tion
                                      Topic
R – Resources             Event                   Oth
                                    Act              er
A – Tag Types                           ivity                            Ranking
                                                                        Algorithms

Y⊆U!T!RxA

A = {Topic, Resource Type, Location,
Person, Event, Activity, Other}
                                                                        Slideshare



[Böhnstedt et al. 2009]
                                                                        KOM – Multimedia Communications Lab   6
Graph-based Resource Recommendation



                                                                       Recommendation List

                                                                         Item              Score
               3               P2
P1    1                    1                  Graph-based                r1                0.9
                               2                Ranking                  r2                0.7
          4            1                       Algorithm
                   2           1                                         r3                0.5
          P3
                       2            P4                                   r4                0.2


     Folksonomy Graph                    Scorer e.g. PageRank, HITS      Ranked Resources




                                                                      KOM – Multimedia Communications Lab   7
Adapted PageRank
                                               [Hotho et al. 2006]



                                           "                    "    Dancing
$%&'()*+,&               Tango                         !                     l
                               0                                     Festiva
                                                                         0
                                   #           #
             "-.
                                                            "

                             #-.
    1
                                                   !
                                                           PageRank‘s intelligent surfer model
                   #-.

                         Buenos        "                   The ranking of a node is determined by how
        "-.               Aires                            often the surfer visits the node
                               0


               Buenos                                      Adjoining edges are followed with a certain
                Aires                                      probability – determined by the edge weights
                     0

                         "                                 The query node acts as the starting point and
                                                           focus i.e. the surfer returns to this node with
                    !                                      a certain probability – determined by the
                                                           node weights
                                                                                 KOM – Multimedia Communications Lab   8
Challenge: Concept Drift

Concept drift is a challenge for graph-based ranking algorithms
 !  e.g. Ambiguous tags can cause concept drift as a single tag might represent
 multiple semantic concepts


                                               ?


                                football                News about Messi




                                              ?
 FC Barcelona Website



                                                         Dallas Cowboys‘ Website
                                                           KOM – Multimedia Communications Lab   9
Overview

 !  Motivation
 !  Basics
    !  Folksonomy
    !  Folksonomy Extended by Tag Types
    !  Graph-based Resource Recommendation
    !  Challenge: Concept Drift
 !  AspectScore & InteliScore
 !  Evaluation Methodology and Metrics
 !  Results
 !  Conclusion & Future Work




                                             KOM – Multimedia Communications Lab 10
InteliScore

 The semantic information gained from the semantic relatedness
 between tags is used to reduce concept drift

                                    Dancing
                        Tango
                                    Festival


                                               Semantic Relatedness (XESA)


                                               0.005


                         Buenos
                          Aires

                                        XESA calculates the semantic
                                        relatedness between pairs of tokens
                                        (tags) using the English Wikipedia as
                                        reference corpus
                                                                      [Scholl et al. 2010]
Source: wikipedia.org                                       KOM – Multimedia Communications Lab 11
AspectScore

Tag types help to alleviate concept drift
 !  Tags are disambiguated with respect to different aspects of a resource that a
 user may describe while tagging




                            topic                      location
                                      Buenos
                                       Aires

 Tourism in Argentina
                                                                     News about Tango




                                                            KOM – Multimedia Communications Lab 12
AspectScore

Tag types help to alleviate concept drift
 !  e.g. by focusing on the tags describing the content of resources


                           topic                   location
                                    Buenos
                                     Aires

  Tourism in Argentina
                                                              News about Tango
                            topic


                                       Assumption:
                                       The tags of type „Topic“
                                       describe the content of the resources well,
  Google Map of Buenos Aires           therefore „Topic“ Tags are given priority.

                                                              KOM – Multimedia Communications Lab 13
AspectScore: Step 1

                                                              Buenos
 1. Transform Query Node                      Tango    Buenos Aires
      into Query Tags                            3      Aires    1
                                                            1

                           User query node is transformed into tag nodes,
                           weighted by the usage frequency of the user

                           Assumption:
                           Tags of a user describe the user‘s interests well
                                                                        [Abel 2011]




                                                      KOM – Multimedia Communications Lab 14
AspectScore: Step 1

                           Query Tag
 1. Transform Query Node
       in Query Tags
                                                                          Dancing
                                         Tango                            Festiva
                                                                                  l




                   Query Node

                                          Buenos
                                           Aires


                                       Buenos
                                        Aires      Query Tags




                                                    KOM – Multimedia Communications Lab 15
AspectScore: Step 2


 1. Transform Query Node
      into Query Tags

 2. Create Folksonomy
Graph for each Query Tag




                           KOM – Multimedia Communications Lab 16
AspectScore: Step 2


 1. Transform Query Node                                                 "                          "     Dancing
                           $%&'()*+,               Tango                              !                           l
      into Query Tags                                   3                                                 Festiva
                                                                                                               0
                                                                 #           #
                                           "
 2. Create Folksonomy                                                                           "
Graph for each Query Tag
                               !                        #
                                                                                 !

                                                   Buenos            "
                                               "
                                                    Aires
                                           "                0


                                       "       Buenos
                                                Aires
                                                    0           Depending on Ranking Algorithm
                                                                e.g. FolkRank
                                                   "



                                               !



                                                                                 KOM – Multimedia Communications Lab 17
AspectScore: Step 3


 1. Transform Query Node                                               "#"$                             "#"$     Dancing
                           '()*+",-.                 Tango                                   !                           l
      into Query Tags                                       3                                                    Festiva
                                                                                                                      0
                                                                    "%"$      "%"$
                                       "#"$
 2. Create Folksonomy                                                                                 "#"$
Graph for each Query Tag                                  "%"$
                               !
                                                                                     !

                                                      Buenos         "#"$
                                              "#"$
  3. Adapt Edge Weights                                Aires
                                       "#"$                     0


                                                Buenos
                                   "#"$&
                                                 Aires
                                                      0                     Edge Weights are adapted
                                                     "#"$
                                                                            (in several iteration steps)
                                                                            depending on Query Tag

                                                !



                                                                                         KOM – Multimedia Communications Lab 18
AspectScore: Step 4


 1. Transform Query Node                                                "#"$                             "#"$     Dancing
                            '()*+",-.                 Tango                                   !                           l
      into Query Tags                                        3                                                    Festiva
                                                                                                                       0
                                                                     "%"$      "%"$
                                        "#"$
 2. Create Folksonomy                                                                                  "#"$
Graph for each Query Tag                                   "%"$
                                !
                                                                                      !

                                                       Buenos         "#"$
                                               "#"$
  3. Adapt Edge Weights                                 Aires
                                        "#"$                     0


                                                 Buenos
                                    "#"$&
                                                  Aires
 4. Run Ranking Algorithm                              0                Run e.g. FolkRank on the
                                                      "#"$
                                                                        adapted folksonomy graph


                                                 !



                                                                                          KOM – Multimedia Communications Lab 19
AspectScore: Step 5


 1. Transform Query Node
      into Query Tags

 2. Create Folksonomy
Graph for each Query Tag


  3. Adapt Edge Weights

                            The resulting rankings are accumulated
                            giving preference to certain tag types
 4. Run Ranking Algorithm   e.g. topic tags
                                                             Buenos
                                           Tango      Buenos Aires
  5. Accumulate Results                                Aires 1δ Topic
                                           3δ Topic
   for each Query Node                                1δ Location


                                                 KOM – Multimedia Communications Lab 20
Overview

 !  Motivation
 !  Basics
    !  Folksonomy
    !  Folksonomy Extended by Tag Types
    !  Graph-based Resource Recommendation
    !  Challenge Concept Drift
 !  AspectScore & InteliScore
 !  Evaluation Methodology and Metrics
 !  Results
 !  Conclusion & Future Work




                                             KOM – Multimedia Communications Lab 21
Evaluation Methodology: LeavePostOut

A post is a Pu,r= {(u,r,t)|(u,r,t) ! Y}

                                   Dancing                                                   Dancing
        Tango                                       Tango
                                   Festival                                                  Festival




         Buenos                                      Buenos
          Aires                                       Aires




For LeavePostOut, the recommendation task
with user as input is harder as with tag as input
                                                                   [Jäschke et al. 2007]
                                                              KOM – Multimedia Communications Lab 22
Evaluation Methodology: LeaveRTOut

RTr,t= {(u,r,t)|(u,r,t) ! Y}

                                   Dancing                                                   Dancing
        Tango                                       Tango
                                   Festival                                                  Festival




         Buenos                                      Buenos
          Aires                                       Aires




For LeaveRTOut, the recommendation task
with tag as input is harder as with user as input

                                                              KOM – Multimedia Communications Lab 23
Evaluation Corpus

Bibsonomy corpus with a p-core extraction at level 5 to reduce noise
 and to focus on the dense portion of the corpus

                                   Before           After                    Tag Type                             Count
 Users                                   7243                69              Topic                                       2225
 Bookmark resources                  281550                    9             Other                                         486
 Bibtex resources                    469654                134               Resource Type                                 198
 Tags                                216094                179               Event                                         182
 Tag assignments                   2740834               3269                Person/Organisation                           143
 Bookmark posts                      330192                  51              Activity                                        35
 Bibtex posts                        526691                959


                                                                                   FReSET – Domínguez García et al 2012
                                                   http://www.kom.tu-darmstadt.de/research-results/downloads/software/freset/
                                                                                               KOM – Multimedia Communications Lab 24
         Knowledge and Data Engineering Group, University of Kassel: Benchmark Folksonomy Data from Bibsonomy, version of July 7th 2011
Evaluation Metrics

 Mean Average Precision:
                                          The mean of the Average
             |Q|     mj                   Precision over several queries Q
          1  1 
MAP(Q) =                Precision(Rjk )
         |Q| j=1 mj k=1                                 [Manning et al 2008]




Mean Normalized Precision:
                                          The mean of the normalized
                                          Precision at k
                |Q|
             1   Precisionj (k)          over several queries Q
MNP(Q, k) =
            |Q| j=1 Precisionmax,j (k)



                                                   KOM – Multimedia Communications Lab 25
Visualization of Results with Violin Plots

A violin plot is a combination of a box plot and a density trace




                                            3rd Quartile

Median
                                            1st Quartile
                                                           [Hintze et al. 1998]
                                                    KOM – Multimedia Communications Lab 26
Evaluation Results LeavePostOut

Evaluation results for the recommendation task having tag as input




                                                 KOM – Multimedia Communications Lab 27
Evaluation Results for LeavePostOut

Evaluation results for the recommendation task having tag as input

                                           Approaches MAP
                                           AspectScore 0.2240
                                           FolkRank          0.2136
                                           InteliScore       0.1801
                                           Popularity        0.0937




                                                  KOM – Multimedia Communications Lab 28
Evaluation Results for LeaveRTOut

Evaluation results for the recommendation task having tag as input




                                                 KOM – Multimedia Communications Lab 29
Evaluation Results for LeaveRTOut

Evaluation results for the recommendation task having tag as input


                                           Approaches MAP
                                           Popularity        0.0834
                                           AspectScore 0.0589
                                           FolkRank          0.0529
                                           InteliScore       0.0433




                                                  KOM – Multimedia Communications Lab 30
Conclusion and Future Work

Exploiting semantic information for resource ranking in folksonomies
 AspectScore                                                 InteliScore
 Tag disambiguation  importance of                          Based on semantic
 tags (based on type)                       n
                                                             relatedness between tags
                                     Perso
                           Typ
                              e                 Loca
                                                      tion
                                                             e.g. XESA
                                    Topic
                           Event                Oth
                                   Act             er
                                      ivit
                                          y




Limitations
 !  Manually labeled tag type dataset – error prone, subjective
 !  XESA based on English Wikipedia – No semantic relatedness measurable for
    27% of tags in corpus


Future Work
 !  Evaluation using CROKODIL corpus – an e-learning application with tag types
 !  User Study                                  www.crokodil.de
                                                                       KOM – Multimedia Communications Lab 31
Questions  Contact




                      KOM – Multimedia Communications Lab 32

Mais conteúdo relacionado

Mais procurados

Dan Hill - Creative City, Soft City
Dan Hill - Creative City, Soft CityDan Hill - Creative City, Soft City
Dan Hill - Creative City, Soft City
Shane Mitchell
 
Past for teachers
Past for teachersPast for teachers
Past for teachers
ejrock20
 
Constituent elements of mainframe processing
Constituent elements of mainframe processingConstituent elements of mainframe processing
Constituent elements of mainframe processing
Dennis Hoffman
 
jQuery Open Source Process (RIT 2011)
jQuery Open Source Process (RIT 2011)jQuery Open Source Process (RIT 2011)
jQuery Open Source Process (RIT 2011)
jeresig
 
Jerome Euzenat's presentation at SSSW 2011
Jerome Euzenat's presentation at SSSW 2011Jerome Euzenat's presentation at SSSW 2011
Jerome Euzenat's presentation at SSSW 2011
sssw2011
 
Rascal Devnology Code Fest
Rascal Devnology Code FestRascal Devnology Code Fest
Rascal Devnology Code Fest
Devnology
 

Mais procurados (20)

Bali Virtual Assistants
Bali Virtual AssistantsBali Virtual Assistants
Bali Virtual Assistants
 
Dan Hill - Creative City, Soft City
Dan Hill - Creative City, Soft CityDan Hill - Creative City, Soft City
Dan Hill - Creative City, Soft City
 
A4 slideshare
A4 slideshareA4 slideshare
A4 slideshare
 
I-KNOW 07: Creativity@Work
I-KNOW 07: Creativity@WorkI-KNOW 07: Creativity@Work
I-KNOW 07: Creativity@Work
 
20110413 founder institute
20110413 founder institute20110413 founder institute
20110413 founder institute
 
Past for teachers
Past for teachersPast for teachers
Past for teachers
 
Constituent elements of mainframe processing
Constituent elements of mainframe processingConstituent elements of mainframe processing
Constituent elements of mainframe processing
 
How should universities prepare information graduates for the working world?
How should universities prepare information graduates for the working world?How should universities prepare information graduates for the working world?
How should universities prepare information graduates for the working world?
 
jQuery Open Source Process (RIT 2011)
jQuery Open Source Process (RIT 2011)jQuery Open Source Process (RIT 2011)
jQuery Open Source Process (RIT 2011)
 
Alijfs 2011 Bro A5 Pdfw
Alijfs 2011 Bro A5 PdfwAlijfs 2011 Bro A5 Pdfw
Alijfs 2011 Bro A5 Pdfw
 
Open Source Process: jQuery by John Resig
Open Source Process: jQuery by John ResigOpen Source Process: jQuery by John Resig
Open Source Process: jQuery by John Resig
 
jQuery Open Source (Fronteer 2011)
jQuery Open Source (Fronteer 2011)jQuery Open Source (Fronteer 2011)
jQuery Open Source (Fronteer 2011)
 
Jerome Euzenat's presentation at SSSW 2011
Jerome Euzenat's presentation at SSSW 2011Jerome Euzenat's presentation at SSSW 2011
Jerome Euzenat's presentation at SSSW 2011
 
jerome Euzenat - Ontology Matching
jerome Euzenat - Ontology Matchingjerome Euzenat - Ontology Matching
jerome Euzenat - Ontology Matching
 
EDF2013: Language Technology Panel, Hans Uszkoreit: Key-enabling technology f...
EDF2013: Language Technology Panel, Hans Uszkoreit: Key-enabling technology f...EDF2013: Language Technology Panel, Hans Uszkoreit: Key-enabling technology f...
EDF2013: Language Technology Panel, Hans Uszkoreit: Key-enabling technology f...
 
jQuery Keynote 2011: Boston
jQuery Keynote 2011: BostonjQuery Keynote 2011: Boston
jQuery Keynote 2011: Boston
 
Open Service Design at the SDC 2010
Open Service Design at the SDC 2010Open Service Design at the SDC 2010
Open Service Design at the SDC 2010
 
Rascal Devnology Code Fest
Rascal Devnology Code FestRascal Devnology Code Fest
Rascal Devnology Code Fest
 
Measuring Social Media
Measuring Social MediaMeasuring Social Media
Measuring Social Media
 
Dc poster
Dc posterDc poster
Dc poster
 

Destaque (8)

Ectel sem_info_rec_learning_resources_v6.0_20120921_ma
Ectel  sem_info_rec_learning_resources_v6.0_20120921_maEctel  sem_info_rec_learning_resources_v6.0_20120921_ma
Ectel sem_info_rec_learning_resources_v6.0_20120921_ma
 
Eval rec algo_crowdsourcing__icalt_2014_ma
Eval rec algo_crowdsourcing__icalt_2014_maEval rec algo_crowdsourcing__icalt_2014_ma
Eval rec algo_crowdsourcing__icalt_2014_ma
 
Exploiting Semantic Information for Graph-based Recommendations of Learning R...
Exploiting Semantic Information for Graph-based Recommendations of Learning R...Exploiting Semantic Information for Graph-based Recommendations of Learning R...
Exploiting Semantic Information for Graph-based Recommendations of Learning R...
 
Similarity Measurement Preliminary Results
Similarity  Measurement  Preliminary ResultsSimilarity  Measurement  Preliminary Results
Similarity Measurement Preliminary Results
 
Rs web context_content__v4.0__20120908_ma
Rs web context_content__v4.0__20120908_maRs web context_content__v4.0__20120908_ma
Rs web context_content__v4.0__20120908_ma
 
Context Determines Content - An Approach to Resource Recommendation in Folkso...
Context Determines Content - An Approach to Resource Recommendation in Folkso...Context Determines Content - An Approach to Resource Recommendation in Folkso...
Context Determines Content - An Approach to Resource Recommendation in Folkso...
 
Towards Ranking in Folksonomies for Personalized Recommender Systems in E-Lea...
Towards Ranking in Folksonomies for Personalized Recommender Systems in E-Lea...Towards Ranking in Folksonomies for Personalized Recommender Systems in E-Lea...
Towards Ranking in Folksonomies for Personalized Recommender Systems in E-Lea...
 
Otot Manusia
Otot ManusiaOtot Manusia
Otot Manusia
 

Semelhante a Iknow ranking sem_info_v9.0__2012.09.07_anjorin

Itsme Lezione Usi Lugano
Itsme Lezione Usi LuganoItsme Lezione Usi Lugano
Itsme Lezione Usi Lugano
itsmesrl
 

Semelhante a Iknow ranking sem_info_v9.0__2012.09.07_anjorin (20)

TechWorld Alan Kell presentation
TechWorld Alan Kell presentationTechWorld Alan Kell presentation
TechWorld Alan Kell presentation
 
5 Ways Every Non-profit Should Use the Internet
5 Ways Every Non-profit Should Use the Internet  5 Ways Every Non-profit Should Use the Internet
5 Ways Every Non-profit Should Use the Internet
 
BPD Keynote: Design is How We Change the World
BPD Keynote: Design is How We Change the WorldBPD Keynote: Design is How We Change the World
BPD Keynote: Design is How We Change the World
 
TV Connect London 2013 Preview Brochure
TV Connect London 2013 Preview BrochureTV Connect London 2013 Preview Brochure
TV Connect London 2013 Preview Brochure
 
Are You Ready for the Future?
Are You Ready for the Future?Are You Ready for the Future?
Are You Ready for the Future?
 
High stakes world of Mobile Payments
High stakes world of Mobile PaymentsHigh stakes world of Mobile Payments
High stakes world of Mobile Payments
 
High stakes-world-of-mobile-payments-infographic
High stakes-world-of-mobile-payments-infographicHigh stakes-world-of-mobile-payments-infographic
High stakes-world-of-mobile-payments-infographic
 
The Workflow Abstraction
The Workflow AbstractionThe Workflow Abstraction
The Workflow Abstraction
 
The Workflow Abstraction
The Workflow AbstractionThe Workflow Abstraction
The Workflow Abstraction
 
Presentation for HLP
Presentation for HLPPresentation for HLP
Presentation for HLP
 
Krypton
KryptonKrypton
Krypton
 
HGI Overview
HGI OverviewHGI Overview
HGI Overview
 
Functional programming for optimization problems in Big Data
Functional programming for optimization problems in Big DataFunctional programming for optimization problems in Big Data
Functional programming for optimization problems in Big Data
 
Concept mapride bong_m[1]
Concept mapride bong_m[1]Concept mapride bong_m[1]
Concept mapride bong_m[1]
 
Exerpt From Exec Overview
Exerpt From Exec OverviewExerpt From Exec Overview
Exerpt From Exec Overview
 
Hacker Lab Summary
Hacker Lab SummaryHacker Lab Summary
Hacker Lab Summary
 
Hacker Lab Startup Community Hackerspace
Hacker Lab Startup Community HackerspaceHacker Lab Startup Community Hackerspace
Hacker Lab Startup Community Hackerspace
 
The Singles Lifecycle
The Singles LifecycleThe Singles Lifecycle
The Singles Lifecycle
 
Itsme Lezione Usi Lugano
Itsme Lezione Usi LuganoItsme Lezione Usi Lugano
Itsme Lezione Usi Lugano
 
Veyond Partners Introduction 200901
Veyond Partners Introduction 200901Veyond Partners Introduction 200901
Veyond Partners Introduction 200901
 

Último

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Último (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 

Iknow ranking sem_info_v9.0__2012.09.07_anjorin

  • 1. Ranking Resources in Folksonomies By Exploiting Semantic Information i-KNOW 2012, Saarbrücken Users Tags Resources Research Talk n Perso Typ e Loca tion Ranking Topic Algorithms Event Oth Act er ivit y Thomas Rodenhausen Slideshare Mojisola Anjorin Renato Domínguez García Christoph Rensing Prof. Dr.-Ing. Ralf Steinmetz Ralf Steinmetz KOM - Multimedia Communications Lab Iknow_Ranking_Sem_Info_v9.0__2012.09.07_MA.pptx 7-Sep-12 © author(s) of these slides including research results from the KOM research network and TU Darmstadt; otherwise it is specified at the respective slide
  • 2. Social Tagging Applications Social tagging applications are used to organize, classify, manage and share knowledge resources !  Tags are freely chosen keywords attached to resources !  Tags often describe an aspect of the resource Lora o Aroy Graz Key note Dial E I-know for 2012 Events Sema n Pre pa rin W tic eb g Con for fer enc e KOM – Multimedia Communications Lab 2 www.wordle.net
  • 3. Recommendations in Social Tagging Applications KOM – Multimedia Communications Lab 3
  • 4. Overview !  Motivation !  Basics !  Folksonomy !  Folksonomy Extended by Tag Types !  Graph-based Resource Recommendation !  Challenge: Concept Drift !  AspectScore & InteliScore !  Evaluation Methodology and Metrics !  Results !  Conclusion & Future Work KOM – Multimedia Communications Lab 4
  • 5. Folksonomy Users Tags Resources A folksonomy is a quadruple F:= (U, T, R, Y), where Research Talk U – Users T – Tags R – Resources Y ⊆ U ! T ! R - tag assignment Ranking Algorithms Slideshare [Hotho et al. 2006] KOM – Multimedia Communications Lab 5
  • 6. Folksonomy Extended by Tag Types Users Tags Resources An extended folksonomy FA:= (U, T, R, A, Y) where Research Talk U – Users Perso n Typ T – Tags e Loca tion Topic R – Resources Event Oth Act er A – Tag Types ivity Ranking Algorithms Y⊆U!T!RxA A = {Topic, Resource Type, Location, Person, Event, Activity, Other} Slideshare [Böhnstedt et al. 2009] KOM – Multimedia Communications Lab 6
  • 7. Graph-based Resource Recommendation Recommendation List Item Score 3 P2 P1 1 1 Graph-based r1 0.9 2 Ranking r2 0.7 4 1 Algorithm 2 1 r3 0.5 P3 2 P4 r4 0.2 Folksonomy Graph Scorer e.g. PageRank, HITS Ranked Resources KOM – Multimedia Communications Lab 7
  • 8. Adapted PageRank [Hotho et al. 2006] " " Dancing $%&'()*+,& Tango ! l 0 Festiva 0 # # "-. " #-. 1 ! PageRank‘s intelligent surfer model #-. Buenos " The ranking of a node is determined by how "-. Aires often the surfer visits the node 0 Buenos Adjoining edges are followed with a certain Aires probability – determined by the edge weights 0 " The query node acts as the starting point and focus i.e. the surfer returns to this node with ! a certain probability – determined by the node weights KOM – Multimedia Communications Lab 8
  • 9. Challenge: Concept Drift Concept drift is a challenge for graph-based ranking algorithms !  e.g. Ambiguous tags can cause concept drift as a single tag might represent multiple semantic concepts ? football News about Messi ? FC Barcelona Website Dallas Cowboys‘ Website KOM – Multimedia Communications Lab 9
  • 10. Overview !  Motivation !  Basics !  Folksonomy !  Folksonomy Extended by Tag Types !  Graph-based Resource Recommendation !  Challenge: Concept Drift !  AspectScore & InteliScore !  Evaluation Methodology and Metrics !  Results !  Conclusion & Future Work KOM – Multimedia Communications Lab 10
  • 11. InteliScore The semantic information gained from the semantic relatedness between tags is used to reduce concept drift Dancing Tango Festival Semantic Relatedness (XESA) 0.005 Buenos Aires XESA calculates the semantic relatedness between pairs of tokens (tags) using the English Wikipedia as reference corpus [Scholl et al. 2010] Source: wikipedia.org KOM – Multimedia Communications Lab 11
  • 12. AspectScore Tag types help to alleviate concept drift !  Tags are disambiguated with respect to different aspects of a resource that a user may describe while tagging topic location Buenos Aires Tourism in Argentina News about Tango KOM – Multimedia Communications Lab 12
  • 13. AspectScore Tag types help to alleviate concept drift !  e.g. by focusing on the tags describing the content of resources topic location Buenos Aires Tourism in Argentina News about Tango topic Assumption: The tags of type „Topic“ describe the content of the resources well, Google Map of Buenos Aires therefore „Topic“ Tags are given priority. KOM – Multimedia Communications Lab 13
  • 14. AspectScore: Step 1 Buenos 1. Transform Query Node Tango Buenos Aires into Query Tags 3 Aires 1 1 User query node is transformed into tag nodes, weighted by the usage frequency of the user Assumption: Tags of a user describe the user‘s interests well [Abel 2011] KOM – Multimedia Communications Lab 14
  • 15. AspectScore: Step 1 Query Tag 1. Transform Query Node in Query Tags Dancing Tango Festiva l Query Node Buenos Aires Buenos Aires Query Tags KOM – Multimedia Communications Lab 15
  • 16. AspectScore: Step 2 1. Transform Query Node into Query Tags 2. Create Folksonomy Graph for each Query Tag KOM – Multimedia Communications Lab 16
  • 17. AspectScore: Step 2 1. Transform Query Node " " Dancing $%&'()*+, Tango ! l into Query Tags 3 Festiva 0 # # " 2. Create Folksonomy " Graph for each Query Tag ! # ! Buenos " " Aires " 0 " Buenos Aires 0 Depending on Ranking Algorithm e.g. FolkRank " ! KOM – Multimedia Communications Lab 17
  • 18. AspectScore: Step 3 1. Transform Query Node "#"$ "#"$ Dancing '()*+",-. Tango ! l into Query Tags 3 Festiva 0 "%"$ "%"$ "#"$ 2. Create Folksonomy "#"$ Graph for each Query Tag "%"$ ! ! Buenos "#"$ "#"$ 3. Adapt Edge Weights Aires "#"$ 0 Buenos "#"$& Aires 0 Edge Weights are adapted "#"$ (in several iteration steps) depending on Query Tag ! KOM – Multimedia Communications Lab 18
  • 19. AspectScore: Step 4 1. Transform Query Node "#"$ "#"$ Dancing '()*+",-. Tango ! l into Query Tags 3 Festiva 0 "%"$ "%"$ "#"$ 2. Create Folksonomy "#"$ Graph for each Query Tag "%"$ ! ! Buenos "#"$ "#"$ 3. Adapt Edge Weights Aires "#"$ 0 Buenos "#"$& Aires 4. Run Ranking Algorithm 0 Run e.g. FolkRank on the "#"$ adapted folksonomy graph ! KOM – Multimedia Communications Lab 19
  • 20. AspectScore: Step 5 1. Transform Query Node into Query Tags 2. Create Folksonomy Graph for each Query Tag 3. Adapt Edge Weights The resulting rankings are accumulated giving preference to certain tag types 4. Run Ranking Algorithm e.g. topic tags Buenos Tango Buenos Aires 5. Accumulate Results Aires 1δ Topic 3δ Topic for each Query Node 1δ Location KOM – Multimedia Communications Lab 20
  • 21. Overview !  Motivation !  Basics !  Folksonomy !  Folksonomy Extended by Tag Types !  Graph-based Resource Recommendation !  Challenge Concept Drift !  AspectScore & InteliScore !  Evaluation Methodology and Metrics !  Results !  Conclusion & Future Work KOM – Multimedia Communications Lab 21
  • 22. Evaluation Methodology: LeavePostOut A post is a Pu,r= {(u,r,t)|(u,r,t) ! Y} Dancing Dancing Tango Tango Festival Festival Buenos Buenos Aires Aires For LeavePostOut, the recommendation task with user as input is harder as with tag as input [Jäschke et al. 2007] KOM – Multimedia Communications Lab 22
  • 23. Evaluation Methodology: LeaveRTOut RTr,t= {(u,r,t)|(u,r,t) ! Y} Dancing Dancing Tango Tango Festival Festival Buenos Buenos Aires Aires For LeaveRTOut, the recommendation task with tag as input is harder as with user as input KOM – Multimedia Communications Lab 23
  • 24. Evaluation Corpus Bibsonomy corpus with a p-core extraction at level 5 to reduce noise and to focus on the dense portion of the corpus Before After Tag Type Count Users 7243 69 Topic 2225 Bookmark resources 281550 9 Other 486 Bibtex resources 469654 134 Resource Type 198 Tags 216094 179 Event 182 Tag assignments 2740834 3269 Person/Organisation 143 Bookmark posts 330192 51 Activity 35 Bibtex posts 526691 959 FReSET – Domínguez García et al 2012 http://www.kom.tu-darmstadt.de/research-results/downloads/software/freset/ KOM – Multimedia Communications Lab 24 Knowledge and Data Engineering Group, University of Kassel: Benchmark Folksonomy Data from Bibsonomy, version of July 7th 2011
  • 25. Evaluation Metrics Mean Average Precision: The mean of the Average |Q| mj Precision over several queries Q 1 1 MAP(Q) = Precision(Rjk ) |Q| j=1 mj k=1 [Manning et al 2008] Mean Normalized Precision: The mean of the normalized Precision at k |Q| 1 Precisionj (k) over several queries Q MNP(Q, k) = |Q| j=1 Precisionmax,j (k) KOM – Multimedia Communications Lab 25
  • 26. Visualization of Results with Violin Plots A violin plot is a combination of a box plot and a density trace 3rd Quartile Median 1st Quartile [Hintze et al. 1998] KOM – Multimedia Communications Lab 26
  • 27. Evaluation Results LeavePostOut Evaluation results for the recommendation task having tag as input KOM – Multimedia Communications Lab 27
  • 28. Evaluation Results for LeavePostOut Evaluation results for the recommendation task having tag as input Approaches MAP AspectScore 0.2240 FolkRank 0.2136 InteliScore 0.1801 Popularity 0.0937 KOM – Multimedia Communications Lab 28
  • 29. Evaluation Results for LeaveRTOut Evaluation results for the recommendation task having tag as input KOM – Multimedia Communications Lab 29
  • 30. Evaluation Results for LeaveRTOut Evaluation results for the recommendation task having tag as input Approaches MAP Popularity 0.0834 AspectScore 0.0589 FolkRank 0.0529 InteliScore 0.0433 KOM – Multimedia Communications Lab 30
  • 31. Conclusion and Future Work Exploiting semantic information for resource ranking in folksonomies AspectScore InteliScore Tag disambiguation importance of Based on semantic tags (based on type) n relatedness between tags Perso Typ e Loca tion e.g. XESA Topic Event Oth Act er ivit y Limitations !  Manually labeled tag type dataset – error prone, subjective !  XESA based on English Wikipedia – No semantic relatedness measurable for 27% of tags in corpus Future Work !  Evaluation using CROKODIL corpus – an e-learning application with tag types !  User Study www.crokodil.de KOM – Multimedia Communications Lab 31
  • 32. Questions Contact KOM – Multimedia Communications Lab 32