SlideShare a Scribd company logo
1 of 37
Download to read offline
Semantics in
Social Tagging Systems

                           Andreas Hotho

Dominik Benz, Robert Jäschke, Beate Krause, Christoph Schmitz, Gerd Stumme

    Hertie-Lehrstuhl für Wissensverarbeitung
   Universität Kassel & Forschungszentrum L3S


             C. Cattuto, A. Baldassarri, V. Loreto, V. D. P. Servedio
         Physics Department, University of Roma “La Sapienza”, Italy
Map of Web 2.0




                artwork by R. Munroe   http://xkcd.com/
Andreas Hotho                                             27.09.08   2
Everybody is tagging…


                                             simple and intuitive way to
                                              organize resources, immediately
                                              useful
                                             uncontrolled vocabulary
                                             however: evidence for converging
                                              vocabulary / emergent semantics
                                              due to
                                                shared implicit knowledge
                                                mutual influence of users
                                                underlying social networks




                         http://xkcd.com/


                         resource
            tag   user
Andreas Hotho                                                         27.09.08   3
Agenda


  BibSonomy – a social
   bookmark and publication
   sharing system


                                       0.4


  Overview Tagging Systems
                                                                                              quot;blogquot;
                                                                                                quot;cssquot;
                                                                                           quot;designquot;
                                                                                              quot;linuxquot;
                                      0.35                                                  quot;musicquot;
                                                                                             quot;newsquot;
                                                                                    quot;programmingquot;
                                                                                         quot;softwarequot;
                                       0.3                                                     quot;webquot;



                                      0.25




                              rank
                                       0.2




  Semantics between Tags             0.15



                                       0.1



                                      0.05
                                              0    2    4    6            8    10               12           14
                                                                 month




  Summary and Outlook




Andreas Hotho                                                27.09.08                                   4
BibSonomy ― a cooperative publication management system




Large User Basis:                          We use the system
      100.051 registered users              for our daily scientific work,
      288.849 bookmarks                     in European and other projects
      258.633 publications                  and for evaluating our algorithms.
   + 986.458 publications from DBLP.

  Integrated a.o. in Citavi and JabRef.   http://www.bibsonomy.org
  Andreas Hotho                                                           27.09.08   5
Topic-specific collection of references (here: Social Network Analysis)
 Andreas Hotho                                                            27.09.08   6
Export in over 30 formats, including BibTeX and Endnote

 Andreas Hotho                                            27.09.08   7
Generates publication lists for individuals, research groups, and projects
 Andreas Hotho                                                               27.09.08   8
Entry point for conference proceedings
 Andreas Hotho                           27.09.08   9
Basket functionality for libraries
 Andreas Hotho                       27.09.08   10
Back reference to the library
 Andreas Hotho                  27.09.08   11
Posting a new publication is easy:
                 Highlight reference
                 Click on “Post Publication” button



Andreas Hotho                                          27.09.08   12
Posting a new bookmark/publication:
                 Information Extraction (Mallet) fills form for you.
                 Just add your favorite tags.




Andreas Hotho                                                           27.09.08   13
Posting a new bookmark/publication:
                 That’s it!

                 Other options:
                        Scrapers (> 60), eg for Citeseer, ACM
                       Upload BibTeX
                        Enter information manually
                       JabRef interface

Andreas Hotho                                         27.09.08   14
Agenda


  BibSonomy – a social
   bookmark and publication
   sharing system


                                       0.4


  Overview Tagging Systems
                                                                                              quot;blogquot;
                                                                                                quot;cssquot;
                                                                                           quot;designquot;
                                                                                              quot;linuxquot;
                                      0.35                                                  quot;musicquot;
                                                                                             quot;newsquot;
                                                                                    quot;programmingquot;
                                                                                         quot;softwarequot;
                                       0.3                                                     quot;webquot;



                                      0.25




                              rank
                                       0.2




  Semantics between Tags             0.15



                                       0.1



                                      0.05
                                              0    2    4    6            8    10               12            14
                                                                 month




  Summary and Outlook




Andreas Hotho                                                27.09.08                                   15
Social Tagging Systems / Delicious.com




Andreas Hotho                            27.09.08   16
Social Tagging Systems

 Simpy:
        free, “nicer” design
        special function: groups, a bookmark history function
 Mister Wong:
        Most popular system in Germany
        special function: every post has links to „recommended“ web
         sites.
    FURL and blinklist has a special rating function.
    Feed Me Links has a function to add bookmarks by mail.
    RawSugar provides an automatically generated hierarchy.
    backflip and AllMyFavorites.net uses folders.
    Chipmark, Spurl and Netvouz has tags and folders.
    http://www.simpy.com/, http://www.mister-wong.de/, http://www.furl.net/, http://
     www.blinklist.com/, http://feedmelinks.com/portal, http://www.rawsugar.com/, http://
     www.backflip.com/, http://www.allmyfavorites.net/, https://www.chipmark.com/Main,
     http://www.spurl.net/, http://www.netvouz.com/



Andreas Hotho                                                                           27.09.08   17
Social Cataloging Systems




Andreas Hotho               27.09.08   18
Social Cataloging Systems




Andreas Hotho               27.09.08   19
Social Cataloging Systems




Andreas Hotho               27.09.08   20
Social Cataloging Systems




Andreas Hotho               27.09.08   21
Social Cataloging Systems




Andreas Hotho               27.09.08   22
Social Cataloging Systems




Andreas Hotho               27.09.08   23
Agenda


  BibSonomy – a social
   bookmark and publication
   sharing system


                                       0.4


  Overview Tagging Systems
                                                                                              quot;blogquot;
                                                                                                quot;cssquot;
                                                                                           quot;designquot;
                                                                                              quot;linuxquot;
                                      0.35                                                  quot;musicquot;
                                                                                             quot;newsquot;
                                                                                    quot;programmingquot;
                                                                                         quot;softwarequot;
                                       0.3                                                     quot;webquot;



                                      0.25




                              rank
                                       0.2




  Semantics between Tags             0.15



                                       0.1



                                      0.05
                                              0    2    4    6            8    10               12            14
                                                                 month




  Summary and Outlook




Andreas Hotho                                                27.09.08                                   24
Andreas Hotho   27.09.08   25
Most related tags by cooccurrence / cosine simlarity

             art
          web2.0
                     design photography illustration blog graphics
                     ajax web tools blog webdesign                    freq
            news     blog technology politics media daily
           howto     tutorial reference tips linux programming
           video     music funny tv software media
            ajax     javascript web2.0 web programming webdesign
        tutorial     howto programming reference design css
      javascript     ajax programming css web webdesign




               art   graphic creative print portfolios nice    cosine
            web2.0   web2 web-2.0 webapp “web web_2.0
              news   blogs people weblog culture future
             howto   how-to guide tutorials help how_to
             video   entertainment awesome fun cool random
              ajax   dhtml dom js ecmascript webdev
          tutorial   tutorials tips coding code examples
        javascript   webdevelopment webdev example examples webprogramming




Andreas Hotho                                                        27.09.08   26
Semantic Grounding in WordNet



 WordNet is a large lexical database for English.

 Words with same meaning are grouped in synsets, which are ordered
  by an is-a hierarchy.

 Introduction of single artificial root node enables application of
  graph-based similarity metrics between pairs of nouns / pairs of
  verbs.

 Inclusion of top n del.icio.us tags in WordNet:
        100: 82%
     1,000: 79%
     5,000: 69%
     10,000: 61%

Andreas Hotho                                                 27.09.08   27
Example of Semantic Grounding




                                                Wordnet Synset Hierarchy:
 Original tag:
       „java“                                                     computers



 Most similar tag:                                 programming



       Freq, folkrank:      map
                                       design_patterns             languages
        „programming“
       Cosine:
        „python“                                            java                    python
                          Grounded
                          similarity




 Andreas Hotho                                                           27.09.08        28
shortest paths in WordNet




                               random




                                          siblings
                length of shortest path
                  to most related tag
Andreas Hotho                                        27.09.08   29
Results for delicious together with similarity pruning




Andreas Hotho                                            27.09.08   30
Results for delicious together with similarity pruning




Andreas Hotho                                            27.09.08   31
Association Rules           ≅ transactions

                            ≅ items

    K1 = (U £ R, T, I1)
    If users tag some resource with tag ti,
     they frequently also use tj for it.

    Usage:
       tag recommendations
       learning implications (tag hierarchy)




Andreas Hotho                                   27.09.08   32
Association Rules




    K2 = (T £ U, R, I2)
    If users tag a resource ri with a particular tag,
     they frequently also use this tag for rj .

    Usage:
       finding communities
       resource recommendations
Andreas Hotho                                            27.09.08   33
Association Rules




   K2 = (T £ U, R, I2)
   If users tag a resource ri with a particular tag,
    they frequently also use this tag for rj .

    Usage:
            finding communities
Andreas Hotho resource recommendations
                                                       27.09.08   34
Agenda


  BibSonomy – a social
   bookmark and publication
   sharing system


                                       0.4


  Overview Tagging Systems
                                                                                              quot;blogquot;
                                                                                                quot;cssquot;
                                                                                           quot;designquot;
                                                                                              quot;linuxquot;
                                      0.35                                                  quot;musicquot;
                                                                                             quot;newsquot;
                                                                                    quot;programmingquot;
                                                                                         quot;softwarequot;
                                       0.3                                                     quot;webquot;



                                      0.25




                              rank
                                       0.2




  Semantics between Tags             0.15



                                       0.1



                                      0.05
                                              0    2    4    6            8    10               12            14
                                                                 month




  Summary and Outlook




Andreas Hotho                                                27.09.08                                   35
Summary and Outlook

   Our FolkRank algorithm supports search in folksonomies.


   Relatedness measures on tags in folksonomies are a good basis
    to extract semantic relations

   Trend detection in Social Bookmarking Systems

   Tag Recommender allows to recommend user specific tags for new
    post

   Detecting Spam is a major challenge

   LogSonomies - analysing the structure of search engine query log
    files

   Learning some kind of synsets, relations and hierarchy of tags
Andreas Hotho                                                 27.09.08   36
Similar tags live on www.bibsonomy.org




                              Thanks for your attention!
                              contact:
                              hotho@cs.uni-kassel.de




Andreas Hotho                                       27.09.08   37

More Related Content

Similar to Semantics in Social Tagging Systems

Web 20- 2: Architecture Patterns And Models For The New Internet
Web 20- 2: Architecture Patterns And Models For The New InternetWeb 20- 2: Architecture Patterns And Models For The New Internet
Web 20- 2: Architecture Patterns And Models For The New Internettvawler
 
2007 KMWorld Presentation on Augmented Social Cognition Research at PARC
2007 KMWorld Presentation on Augmented Social Cognition Research at PARC2007 KMWorld Presentation on Augmented Social Cognition Research at PARC
2007 KMWorld Presentation on Augmented Social Cognition Research at PARCEd Chi
 
Understanding Research 2.0 from a Socio-technical Perspective
Understanding Research 2.0 from a Socio-technical PerspectiveUnderstanding Research 2.0 from a Socio-technical Perspective
Understanding Research 2.0 from a Socio-technical PerspectiveYuwei Lin
 
Semantic Search for Enterprise 2.0
Semantic Search for Enterprise 2.0Semantic Search for Enterprise 2.0
Semantic Search for Enterprise 2.0Alexandre Passant
 
Harnessing the Interactive Web
Harnessing the Interactive WebHarnessing the Interactive Web
Harnessing the Interactive WebBill Warters
 
Language Computer Corporation: Text Extraction Profile
Language Computer Corporation:  Text Extraction ProfileLanguage Computer Corporation:  Text Extraction Profile
Language Computer Corporation: Text Extraction ProfileAndy Hickl
 
X Tech2007, Open Data, Licensing
X Tech2007, Open Data, LicensingX Tech2007, Open Data, Licensing
X Tech2007, Open Data, Licensingmmmmmrob
 
Beyond web services: supporting mashup artists at Yahoo!
Beyond web services: supporting mashup artists at Yahoo!Beyond web services: supporting mashup artists at Yahoo!
Beyond web services: supporting mashup artists at Yahoo!Chad Dickerson
 
Analysis of-quality-of-pkgs-in-packagist-univ-20171024
Analysis of-quality-of-pkgs-in-packagist-univ-20171024Analysis of-quality-of-pkgs-in-packagist-univ-20171024
Analysis of-quality-of-pkgs-in-packagist-univ-20171024Clark Everetts
 
Archetype autoplugins
Archetype autopluginsArchetype autoplugins
Archetype autopluginsMark Schaake
 
Using Cascalog to build an app with City of Palo Alto Open Data
Using Cascalog to build an app with City of Palo Alto Open DataUsing Cascalog to build an app with City of Palo Alto Open Data
Using Cascalog to build an app with City of Palo Alto Open DataOSCON Byrum
 
OSCON 2013: Using Cascalog to build an app with City of Palo Alto Open Data
OSCON 2013: Using Cascalog to build an app with City of Palo Alto Open DataOSCON 2013: Using Cascalog to build an app with City of Palo Alto Open Data
OSCON 2013: Using Cascalog to build an app with City of Palo Alto Open DataPaco Nathan
 
Tagging and Folksonomy Schema Design for Scalability and Performance
Tagging and Folksonomy Schema Design for Scalability and PerformanceTagging and Folksonomy Schema Design for Scalability and Performance
Tagging and Folksonomy Schema Design for Scalability and PerformanceEduard Bondarenko
 

Similar to Semantics in Social Tagging Systems (20)

Web 20- 2: Architecture Patterns And Models For The New Internet
Web 20- 2: Architecture Patterns And Models For The New InternetWeb 20- 2: Architecture Patterns And Models For The New Internet
Web 20- 2: Architecture Patterns And Models For The New Internet
 
2007 KMWorld Presentation on Augmented Social Cognition Research at PARC
2007 KMWorld Presentation on Augmented Social Cognition Research at PARC2007 KMWorld Presentation on Augmented Social Cognition Research at PARC
2007 KMWorld Presentation on Augmented Social Cognition Research at PARC
 
Web2.0 and KM
Web2.0 and KMWeb2.0 and KM
Web2.0 and KM
 
Understanding Research 2.0 from a Socio-technical Perspective
Understanding Research 2.0 from a Socio-technical PerspectiveUnderstanding Research 2.0 from a Socio-technical Perspective
Understanding Research 2.0 from a Socio-technical Perspective
 
Semantic Search for Enterprise 2.0
Semantic Search for Enterprise 2.0Semantic Search for Enterprise 2.0
Semantic Search for Enterprise 2.0
 
Markup As An Api
Markup As An ApiMarkup As An Api
Markup As An Api
 
Harnessing the Interactive Web
Harnessing the Interactive WebHarnessing the Interactive Web
Harnessing the Interactive Web
 
Language Computer Corporation: Text Extraction Profile
Language Computer Corporation:  Text Extraction ProfileLanguage Computer Corporation:  Text Extraction Profile
Language Computer Corporation: Text Extraction Profile
 
IkeWiki Tutorial
IkeWiki TutorialIkeWiki Tutorial
IkeWiki Tutorial
 
X Tech2007, Open Data, Licensing
X Tech2007, Open Data, LicensingX Tech2007, Open Data, Licensing
X Tech2007, Open Data, Licensing
 
Beyond web services: supporting mashup artists at Yahoo!
Beyond web services: supporting mashup artists at Yahoo!Beyond web services: supporting mashup artists at Yahoo!
Beyond web services: supporting mashup artists at Yahoo!
 
Analysis of-quality-of-pkgs-in-packagist-univ-20171024
Analysis of-quality-of-pkgs-in-packagist-univ-20171024Analysis of-quality-of-pkgs-in-packagist-univ-20171024
Analysis of-quality-of-pkgs-in-packagist-univ-20171024
 
Social Media and Web 2.0
Social Media and Web 2.0Social Media and Web 2.0
Social Media and Web 2.0
 
Discus
DiscusDiscus
Discus
 
Archetype autoplugins
Archetype autopluginsArchetype autoplugins
Archetype autoplugins
 
Using Cascalog to build an app with City of Palo Alto Open Data
Using Cascalog to build an app with City of Palo Alto Open DataUsing Cascalog to build an app with City of Palo Alto Open Data
Using Cascalog to build an app with City of Palo Alto Open Data
 
OSCON 2013: Using Cascalog to build an app with City of Palo Alto Open Data
OSCON 2013: Using Cascalog to build an app with City of Palo Alto Open DataOSCON 2013: Using Cascalog to build an app with City of Palo Alto Open Data
OSCON 2013: Using Cascalog to build an app with City of Palo Alto Open Data
 
Building Web Hack Interfaces
Building Web Hack InterfacesBuilding Web Hack Interfaces
Building Web Hack Interfaces
 
Bollean Search - NageshRao
Bollean Search - NageshRaoBollean Search - NageshRao
Bollean Search - NageshRao
 
Tagging and Folksonomy Schema Design for Scalability and Performance
Tagging and Folksonomy Schema Design for Scalability and PerformanceTagging and Folksonomy Schema Design for Scalability and Performance
Tagging and Folksonomy Schema Design for Scalability and Performance
 

More from Jakob .

Einheitliche Normdatendienste der VZG
Einheitliche Normdatendienste der VZGEinheitliche Normdatendienste der VZG
Einheitliche Normdatendienste der VZGJakob .
 
Connections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedConnections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedJakob .
 
Linked Open Data in Bibliotheken, Archiven & Museen
Linked Open Data in Bibliotheken, Archiven & MuseenLinked Open Data in Bibliotheken, Archiven & Museen
Linked Open Data in Bibliotheken, Archiven & MuseenJakob .
 
Collaborative Creation of a Wikidata handbook
Collaborative Creation of a Wikidata handbookCollaborative Creation of a Wikidata handbook
Collaborative Creation of a Wikidata handbookJakob .
 
Another RDF Encoding Form
Another RDF Encoding FormAnother RDF Encoding Form
Another RDF Encoding FormJakob .
 
On the Way to a Holding Ontology
On the Way to a Holding OntologyOn the Way to a Holding Ontology
On the Way to a Holding OntologyJakob .
 
Stand und Planungen im Bereich der Schnittstellen in der VZG
Stand und Planungen im Bereich der Schnittstellen in der VZGStand und Planungen im Bereich der Schnittstellen in der VZG
Stand und Planungen im Bereich der Schnittstellen in der VZGJakob .
 
Verwaltung dokumentenorientierter DTDs für den Dokument- und Publikationsserv...
Verwaltung dokumentenorientierter DTDs für den Dokument- und Publikationsserv...Verwaltung dokumentenorientierter DTDs für den Dokument- und Publikationsserv...
Verwaltung dokumentenorientierter DTDs für den Dokument- und Publikationsserv...Jakob .
 
Beschreibung von Bibliotheks-Dienstleistungen mit Mikro-Ontologien
Beschreibung von Bibliotheks-Dienstleistungen mit Mikro-OntologienBeschreibung von Bibliotheks-Dienstleistungen mit Mikro-Ontologien
Beschreibung von Bibliotheks-Dienstleistungen mit Mikro-OntologienJakob .
 
Linking Folksonomies to Knowledge Organization Systems
Linking Folksonomies to Knowledge Organization SystemsLinking Folksonomies to Knowledge Organization Systems
Linking Folksonomies to Knowledge Organization SystemsJakob .
 
Encoding Patron Information in RDF
Encoding Patron Information in RDFEncoding Patron Information in RDF
Encoding Patron Information in RDFJakob .
 
Libraries in a data-centered environment
Libraries in a data-centered environmentLibraries in a data-centered environment
Libraries in a data-centered environmentJakob .
 
Was gibt's wie und wo? Informationen zu Standorten, Exemplaren und Dienstleis...
Was gibt's wie und wo? Informationen zu Standorten, Exemplaren und Dienstleis...Was gibt's wie und wo? Informationen zu Standorten, Exemplaren und Dienstleis...
Was gibt's wie und wo? Informationen zu Standorten, Exemplaren und Dienstleis...Jakob .
 
FRBR light with Simplified Ontology for Bibliographic Resource
FRBR light with Simplified Ontology for Bibliographic ResourceFRBR light with Simplified Ontology for Bibliographic Resource
FRBR light with Simplified Ontology for Bibliographic ResourceJakob .
 
RDF-Daten in eigenen Anwendungen nutzen
RDF-Daten in eigenen Anwendungen nutzenRDF-Daten in eigenen Anwendungen nutzen
RDF-Daten in eigenen Anwendungen nutzenJakob .
 
Linked Data Light - Linkaggregation mit BEACON
Linked Data Light - Linkaggregation mit BEACONLinked Data Light - Linkaggregation mit BEACON
Linked Data Light - Linkaggregation mit BEACONJakob .
 
Revealing digital documents - concealed structures in data
Revealing digital documents - concealed structures in dataRevealing digital documents - concealed structures in data
Revealing digital documents - concealed structures in dataJakob .
 
Wie kommen unsere Sacherschließungsdaten ins Semantic Web? Vom lokalen Normda...
Wie kommen unsere Sacherschließungsdaten ins Semantic Web? Vom lokalen Normda...Wie kommen unsere Sacherschließungsdaten ins Semantic Web? Vom lokalen Normda...
Wie kommen unsere Sacherschließungsdaten ins Semantic Web? Vom lokalen Normda...Jakob .
 
Herausforderungen und Lösungen bei der Publikation und Nutzung von Normdaten ...
Herausforderungen und Lösungen bei der Publikation und Nutzung von Normdaten ...Herausforderungen und Lösungen bei der Publikation und Nutzung von Normdaten ...
Herausforderungen und Lösungen bei der Publikation und Nutzung von Normdaten ...Jakob .
 
Linked Data: Die Zukunft der Nutzung von Katalogdaten
Linked Data: Die Zukunft der Nutzung von KatalogdatenLinked Data: Die Zukunft der Nutzung von Katalogdaten
Linked Data: Die Zukunft der Nutzung von KatalogdatenJakob .
 

More from Jakob . (20)

Einheitliche Normdatendienste der VZG
Einheitliche Normdatendienste der VZGEinheitliche Normdatendienste der VZG
Einheitliche Normdatendienste der VZG
 
Connections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedConnections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystified
 
Linked Open Data in Bibliotheken, Archiven & Museen
Linked Open Data in Bibliotheken, Archiven & MuseenLinked Open Data in Bibliotheken, Archiven & Museen
Linked Open Data in Bibliotheken, Archiven & Museen
 
Collaborative Creation of a Wikidata handbook
Collaborative Creation of a Wikidata handbookCollaborative Creation of a Wikidata handbook
Collaborative Creation of a Wikidata handbook
 
Another RDF Encoding Form
Another RDF Encoding FormAnother RDF Encoding Form
Another RDF Encoding Form
 
On the Way to a Holding Ontology
On the Way to a Holding OntologyOn the Way to a Holding Ontology
On the Way to a Holding Ontology
 
Stand und Planungen im Bereich der Schnittstellen in der VZG
Stand und Planungen im Bereich der Schnittstellen in der VZGStand und Planungen im Bereich der Schnittstellen in der VZG
Stand und Planungen im Bereich der Schnittstellen in der VZG
 
Verwaltung dokumentenorientierter DTDs für den Dokument- und Publikationsserv...
Verwaltung dokumentenorientierter DTDs für den Dokument- und Publikationsserv...Verwaltung dokumentenorientierter DTDs für den Dokument- und Publikationsserv...
Verwaltung dokumentenorientierter DTDs für den Dokument- und Publikationsserv...
 
Beschreibung von Bibliotheks-Dienstleistungen mit Mikro-Ontologien
Beschreibung von Bibliotheks-Dienstleistungen mit Mikro-OntologienBeschreibung von Bibliotheks-Dienstleistungen mit Mikro-Ontologien
Beschreibung von Bibliotheks-Dienstleistungen mit Mikro-Ontologien
 
Linking Folksonomies to Knowledge Organization Systems
Linking Folksonomies to Knowledge Organization SystemsLinking Folksonomies to Knowledge Organization Systems
Linking Folksonomies to Knowledge Organization Systems
 
Encoding Patron Information in RDF
Encoding Patron Information in RDFEncoding Patron Information in RDF
Encoding Patron Information in RDF
 
Libraries in a data-centered environment
Libraries in a data-centered environmentLibraries in a data-centered environment
Libraries in a data-centered environment
 
Was gibt's wie und wo? Informationen zu Standorten, Exemplaren und Dienstleis...
Was gibt's wie und wo? Informationen zu Standorten, Exemplaren und Dienstleis...Was gibt's wie und wo? Informationen zu Standorten, Exemplaren und Dienstleis...
Was gibt's wie und wo? Informationen zu Standorten, Exemplaren und Dienstleis...
 
FRBR light with Simplified Ontology for Bibliographic Resource
FRBR light with Simplified Ontology for Bibliographic ResourceFRBR light with Simplified Ontology for Bibliographic Resource
FRBR light with Simplified Ontology for Bibliographic Resource
 
RDF-Daten in eigenen Anwendungen nutzen
RDF-Daten in eigenen Anwendungen nutzenRDF-Daten in eigenen Anwendungen nutzen
RDF-Daten in eigenen Anwendungen nutzen
 
Linked Data Light - Linkaggregation mit BEACON
Linked Data Light - Linkaggregation mit BEACONLinked Data Light - Linkaggregation mit BEACON
Linked Data Light - Linkaggregation mit BEACON
 
Revealing digital documents - concealed structures in data
Revealing digital documents - concealed structures in dataRevealing digital documents - concealed structures in data
Revealing digital documents - concealed structures in data
 
Wie kommen unsere Sacherschließungsdaten ins Semantic Web? Vom lokalen Normda...
Wie kommen unsere Sacherschließungsdaten ins Semantic Web? Vom lokalen Normda...Wie kommen unsere Sacherschließungsdaten ins Semantic Web? Vom lokalen Normda...
Wie kommen unsere Sacherschließungsdaten ins Semantic Web? Vom lokalen Normda...
 
Herausforderungen und Lösungen bei der Publikation und Nutzung von Normdaten ...
Herausforderungen und Lösungen bei der Publikation und Nutzung von Normdaten ...Herausforderungen und Lösungen bei der Publikation und Nutzung von Normdaten ...
Herausforderungen und Lösungen bei der Publikation und Nutzung von Normdaten ...
 
Linked Data: Die Zukunft der Nutzung von Katalogdaten
Linked Data: Die Zukunft der Nutzung von KatalogdatenLinked Data: Die Zukunft der Nutzung von Katalogdaten
Linked Data: Die Zukunft der Nutzung von Katalogdaten
 

Recently uploaded

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 

Recently uploaded (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 

Semantics in Social Tagging Systems

  • 1. Semantics in Social Tagging Systems Andreas Hotho Dominik Benz, Robert Jäschke, Beate Krause, Christoph Schmitz, Gerd Stumme Hertie-Lehrstuhl für Wissensverarbeitung Universität Kassel & Forschungszentrum L3S C. Cattuto, A. Baldassarri, V. Loreto, V. D. P. Servedio Physics Department, University of Roma “La Sapienza”, Italy
  • 2. Map of Web 2.0 artwork by R. Munroe http://xkcd.com/ Andreas Hotho 27.09.08 2
  • 3. Everybody is tagging…  simple and intuitive way to organize resources, immediately useful  uncontrolled vocabulary  however: evidence for converging vocabulary / emergent semantics due to  shared implicit knowledge  mutual influence of users  underlying social networks http://xkcd.com/ resource tag user Andreas Hotho 27.09.08 3
  • 4. Agenda  BibSonomy – a social bookmark and publication sharing system  0.4  Overview Tagging Systems quot;blogquot; quot;cssquot; quot;designquot; quot;linuxquot;  0.35 quot;musicquot; quot;newsquot; quot;programmingquot; quot;softwarequot;  0.3 quot;webquot;  0.25 rank  0.2  Semantics between Tags  0.15  0.1  0.05  0  2  4  6  8  10  12  14 month  Summary and Outlook Andreas Hotho 27.09.08 4
  • 5. BibSonomy ― a cooperative publication management system Large User Basis: We use the system  100.051 registered users  for our daily scientific work,  288.849 bookmarks  in European and other projects  258.633 publications  and for evaluating our algorithms.  + 986.458 publications from DBLP.  Integrated a.o. in Citavi and JabRef. http://www.bibsonomy.org Andreas Hotho 27.09.08 5
  • 6. Topic-specific collection of references (here: Social Network Analysis) Andreas Hotho 27.09.08 6
  • 7. Export in over 30 formats, including BibTeX and Endnote Andreas Hotho 27.09.08 7
  • 8. Generates publication lists for individuals, research groups, and projects Andreas Hotho 27.09.08 8
  • 9. Entry point for conference proceedings Andreas Hotho 27.09.08 9
  • 10. Basket functionality for libraries Andreas Hotho 27.09.08 10
  • 11. Back reference to the library Andreas Hotho 27.09.08 11
  • 12. Posting a new publication is easy:  Highlight reference  Click on “Post Publication” button Andreas Hotho 27.09.08 12
  • 13. Posting a new bookmark/publication:  Information Extraction (Mallet) fills form for you.  Just add your favorite tags. Andreas Hotho 27.09.08 13
  • 14. Posting a new bookmark/publication:  That’s it!  Other options:  Scrapers (> 60), eg for Citeseer, ACM Upload BibTeX  Enter information manually JabRef interface Andreas Hotho 27.09.08 14
  • 15. Agenda  BibSonomy – a social bookmark and publication sharing system  0.4  Overview Tagging Systems quot;blogquot; quot;cssquot; quot;designquot; quot;linuxquot;  0.35 quot;musicquot; quot;newsquot; quot;programmingquot; quot;softwarequot;  0.3 quot;webquot;  0.25 rank  0.2  Semantics between Tags  0.15  0.1  0.05  0  2  4  6  8  10  12  14 month  Summary and Outlook Andreas Hotho 27.09.08 15
  • 16. Social Tagging Systems / Delicious.com Andreas Hotho 27.09.08 16
  • 17. Social Tagging Systems  Simpy:  free, “nicer” design  special function: groups, a bookmark history function  Mister Wong:  Most popular system in Germany  special function: every post has links to „recommended“ web sites.  FURL and blinklist has a special rating function.  Feed Me Links has a function to add bookmarks by mail.  RawSugar provides an automatically generated hierarchy.  backflip and AllMyFavorites.net uses folders.  Chipmark, Spurl and Netvouz has tags and folders.  http://www.simpy.com/, http://www.mister-wong.de/, http://www.furl.net/, http:// www.blinklist.com/, http://feedmelinks.com/portal, http://www.rawsugar.com/, http:// www.backflip.com/, http://www.allmyfavorites.net/, https://www.chipmark.com/Main, http://www.spurl.net/, http://www.netvouz.com/ Andreas Hotho 27.09.08 17
  • 24. Agenda  BibSonomy – a social bookmark and publication sharing system  0.4  Overview Tagging Systems quot;blogquot; quot;cssquot; quot;designquot; quot;linuxquot;  0.35 quot;musicquot; quot;newsquot; quot;programmingquot; quot;softwarequot;  0.3 quot;webquot;  0.25 rank  0.2  Semantics between Tags  0.15  0.1  0.05  0  2  4  6  8  10  12  14 month  Summary and Outlook Andreas Hotho 27.09.08 24
  • 25. Andreas Hotho 27.09.08 25
  • 26. Most related tags by cooccurrence / cosine simlarity art web2.0 design photography illustration blog graphics ajax web tools blog webdesign freq news blog technology politics media daily howto tutorial reference tips linux programming video music funny tv software media ajax javascript web2.0 web programming webdesign tutorial howto programming reference design css javascript ajax programming css web webdesign art graphic creative print portfolios nice cosine web2.0 web2 web-2.0 webapp “web web_2.0 news blogs people weblog culture future howto how-to guide tutorials help how_to video entertainment awesome fun cool random ajax dhtml dom js ecmascript webdev tutorial tutorials tips coding code examples javascript webdevelopment webdev example examples webprogramming Andreas Hotho 27.09.08 26
  • 27. Semantic Grounding in WordNet  WordNet is a large lexical database for English.  Words with same meaning are grouped in synsets, which are ordered by an is-a hierarchy.  Introduction of single artificial root node enables application of graph-based similarity metrics between pairs of nouns / pairs of verbs.  Inclusion of top n del.icio.us tags in WordNet:  100: 82%  1,000: 79%  5,000: 69%  10,000: 61% Andreas Hotho 27.09.08 27
  • 28. Example of Semantic Grounding Wordnet Synset Hierarchy:  Original tag:  „java“ computers  Most similar tag: programming  Freq, folkrank: map design_patterns languages „programming“  Cosine: „python“ java python Grounded similarity Andreas Hotho 27.09.08 28
  • 29. shortest paths in WordNet random siblings length of shortest path to most related tag Andreas Hotho 27.09.08 29
  • 30. Results for delicious together with similarity pruning Andreas Hotho 27.09.08 30
  • 31. Results for delicious together with similarity pruning Andreas Hotho 27.09.08 31
  • 32. Association Rules ≅ transactions ≅ items  K1 = (U £ R, T, I1)  If users tag some resource with tag ti, they frequently also use tj for it.  Usage:  tag recommendations  learning implications (tag hierarchy) Andreas Hotho 27.09.08 32
  • 33. Association Rules  K2 = (T £ U, R, I2)  If users tag a resource ri with a particular tag, they frequently also use this tag for rj .  Usage:  finding communities  resource recommendations Andreas Hotho 27.09.08 33
  • 34. Association Rules  K2 = (T £ U, R, I2)  If users tag a resource ri with a particular tag, they frequently also use this tag for rj .  Usage:  finding communities Andreas Hotho resource recommendations  27.09.08 34
  • 35. Agenda  BibSonomy – a social bookmark and publication sharing system  0.4  Overview Tagging Systems quot;blogquot; quot;cssquot; quot;designquot; quot;linuxquot;  0.35 quot;musicquot; quot;newsquot; quot;programmingquot; quot;softwarequot;  0.3 quot;webquot;  0.25 rank  0.2  Semantics between Tags  0.15  0.1  0.05  0  2  4  6  8  10  12  14 month  Summary and Outlook Andreas Hotho 27.09.08 35
  • 36. Summary and Outlook  Our FolkRank algorithm supports search in folksonomies.  Relatedness measures on tags in folksonomies are a good basis to extract semantic relations  Trend detection in Social Bookmarking Systems  Tag Recommender allows to recommend user specific tags for new post  Detecting Spam is a major challenge  LogSonomies - analysing the structure of search engine query log files  Learning some kind of synsets, relations and hierarchy of tags Andreas Hotho 27.09.08 36
  • 37. Similar tags live on www.bibsonomy.org Thanks for your attention! contact: hotho@cs.uni-kassel.de Andreas Hotho 27.09.08 37