DSNotify - Detecting and Fixing Broken Links in Linked Data Sets

WebS ’09 @ DEXA 2009
Linz, 02/09/2009
Bernhard Haslhofer and Niko Popitsch
Bernhard Haslhofer, Niko Popitsch
Summary




<mo:MusicGroup rdf:about="/music/artists/084308bd-1654-436f-ba03-df6697104e19#artist">
 <foaf:name>Green Day</foaf:name>
 <owl:sameAs rdf:resource="http://dbpedia.org/resource/Green_Day" />
 <mo:image rdf:resource="/music/images/artists/7col_in/084308bd-1654-436f-ba03-df6697104e19.jpg" />
 <foaf:page rdf:resource="/music/artists/084308bd-1654-436f-ba03-df6697104e19.html" />
 <mo:musicbrainz rdf:resource="http://musicbrainz.org/artist/084308bd-1654-436f-ba03-df6697104e19.html" />
 <mo:homepage rdf:resource="http://www.greenday.com/" />
 <mo:fanpage rdf:resource="http://www.greendayvideos.com/" />
 <mo:fanpage rdf:resource="http://www.greenday.net" />
 <mo:imdb rdf:resource="http://www.imdb.com/name/nm1554564/" />
 <mo:myspace rdf:resource="http://www.myspace.com/greenday" />
  ...
...
<rdf:Description rdf:about="http://dbpedia.org/resource/Green_Day">
      <dbpprop:abstract xmlns:dbpprop="http://dbpedia.org/property/" xml:lang="en">Green Day
      is an American rock trio formed in 1987. The band has consisted of Billie Joe Armstrong
      (vocals, guitar), Mike Dirnt, and Tré Cool for the majority of its existence...
      </dbpprop:abstract>
</rdf:Description>
...
<rdf:Description rdf:about="http://dbpedia.org/resource/Green_Day">
      <dbpprop:abstract xmlns:dbpprop="http://dbpedia.org/property/" xml:lang="de">Green Day
      [gɹiːn deɪ] ist eine US-amerikanische Punk-Rock-Band, mit der Anfang der 1990er das Punk-
      Revival begann. Die Band wurde 1987 von Billie Joe Armstrong und Mike Dirnt zusammen
      mit dem Schlagzeuger John Kiffmeyer alias Al Sobrante als The Sweet Children....
      </dbpprop:abstract>
</rdf:Description>
...
...but...




Some numbers...

        •     Events between DBpedia 3.2 (10/2008) and 3.3
              (05/2009)
             •     # resources created: 29449

             •     # resources removed: 4789

             •     # resources moved: 729




Link Integrity...
        •     is a qualitative property that is given when all links
              within and between a set of data sources are valid and
              deliver the result intended by the link creator.

        •     cf. referential integrity in RDBMS

        •     demands a solution that
             •     detects broken links between resources

             •     provides support for fixing broken links
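The detection half of this requirement can be sketched in a few lines. This is only an illustration: the `Link` triple, the `find_broken_links` name, and the hand-written set of available URIs are assumptions made for the example, not part of DSNotify.

```python
from typing import Iterable, List, NamedTuple, Set


class Link(NamedTuple):
    source: str
    predicate: str
    target: str


def find_broken_links(links: Iterable[Link], available: Set[str]) -> List[Link]:
    """Return every link whose target is not among the available URIs."""
    return [link for link in links if link.target not in available]


artist = "/music/artists/084308bd-1654-436f-ba03-df6697104e19#artist"
links = [
    Link(artist, "owl:sameAs", "http://dbpedia.org/resource/Green_Day"),
    Link(artist, "mo:homepage", "http://www.greenday.com/"),
]
# Hypothetical state: only the homepage can still be dereferenced.
available = {"http://www.greenday.com/"}
broken = find_broken_links(links, available)
```

Here `broken` holds only the owl:sameAs link. Fixing it needs more than this check, namely knowing where the target went.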


Types of broken links
        •     Removed link targets
             •     e.g., resource deleted, server not available anymore, etc.

        •     Moved link targets
             •     available at another Web location

             •     e.g., reorganization of Web resources

        •     Modified link targets


The DSNotify Approach
        •     periodically monitor items (resources) in a specific
              Linked Data source

        •     extract a descriptive feature vector for each item

        •     store item + feature vector in index

        •     use feature vectors to detect if items have been
              removed or moved to another location

        •     if moved, add relationship between “old” and “new”
              item
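One monitoring cycle along these lines could look like the sketch below. The property names used as features, the dictionary-based index, and the function names are assumptions for illustration; the slides do not say which features DSNotify extracts or how its index is stored.

```python
from typing import Dict, Set

FeatureVector = Dict[str, str]


def extract_features(description: Dict[str, str]) -> FeatureVector:
    """Keep a few descriptive properties (which ones is an assumption)."""
    keys = ("foaf:name", "dbpprop:abstract")
    return {k: description[k] for k in keys if k in description}


def monitor_once(index: Dict[str, FeatureVector],
                 snapshot: Dict[str, Dict[str, str]]) -> Set[str]:
    """Index every item in the snapshot; return indexed URIs that are now missing.

    Missing items stay in the index so their feature vectors can later be
    compared against newly found items.
    """
    missing = set(index) - set(snapshot)
    for uri, description in snapshot.items():
        index[uri] = extract_features(description)
    return missing


uri = "http://dbpedia.org/resource/Green_Day"
index: Dict[str, FeatureVector] = {}
monitor_once(index, {uri: {"foaf:name": "Green Day", "mo:myspace": "http://www.myspace.com/greenday"}})
missing = monitor_once(index, {})  # second cycle: the item has disappeared
```

After the second cycle `missing` contains the Green Day URI. Whether the item was removed or moved is decided in a separate step.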

Architecture
[Figure: DSNotify architecture. Labelled components: LOD "consuming" application; LOD source; LOD sources; owl:sameAs links between the LOD source and the LOD sources; Monitor (feature extraction); event log; indices (II, RII, AII); Move Detector (heuristic); Decider; LOD source updater; user. Labelled arrows: monitor, update, notifications, querying, decision making.]
Index Interaction
        II = Item Index, AII = Archived Item Index, RII = Removed Item Index; time runs from t1 to t4

        t1    II:  http://dbpedia.org/resource/Green_Day (band)

        t2    RII: http://dbpedia.org/resource/Green_Day (band)

        t3    II:  http://dbpedia.org/resource/band/Green_Day
              AII: http://dbpedia.org/resource/Green_Day (band)

        t4    II:  http://dbpedia.org/resource/band/Alternative/Green_Day
              AII: http://dbpedia.org/resource/band/Green_Day
                   http://dbpedia.org/resource/Green_Day (band)
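The t1 to t4 sequence can be replayed with a small in-memory model. Note the assumptions: the `Indices` class, its method names, and keeping the AII as a mapping from the current URI to its earlier URIs are one possible reading of the slide, not the DSNotify implementation.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class Indices:
    ii: Set[str] = field(default_factory=set)                # Item Index
    rii: Set[str] = field(default_factory=set)               # Removed Item Index
    aii: Dict[str, List[str]] = field(default_factory=dict)  # Archived Item Index

    def add(self, uri: str) -> None:
        self.ii.add(uri)

    def remove(self, uri: str) -> None:
        self.ii.discard(uri)
        self.rii.add(uri)

    def move(self, old: str, new: str) -> None:
        """Record that the removed item `old` is now available as `new`."""
        self.rii.discard(old)
        self.ii.discard(old)
        self.ii.add(new)
        # Carry the earlier URIs of `old` over to `new`, newest first.
        self.aii[new] = [old] + self.aii.pop(old, [])


GD = "http://dbpedia.org/resource/Green_Day (band)"
BAND_GD = "http://dbpedia.org/resource/band/Green_Day"
ALT_GD = "http://dbpedia.org/resource/band/Alternative/Green_Day"

idx = Indices()
idx.add(GD)                # t1: item is monitored
idx.remove(GD)             # t2: item no longer found
idx.move(GD, BAND_GD)      # t3: detected as moved
idx.remove(BAND_GD)        # intermediate step, not shown on the slide
idx.move(BAND_GD, ALT_GD)  # t4: moved again
```

At the end the II holds only the newest URI, the RII is empty, and the AII lists both earlier URIs for it, which matches the t4 row.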
Move Detection

        •     is a semi-automatic process

        •     calculate similarity between items based on their
              feature vectors using domain-specific heuristics

        •     probability > given threshold: automatic decision

        •     probability < given threshold: ask expert user
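The threshold rule can be sketched as follows. Jaccard overlap stands in here for the domain-specific heuristics, which the slides do not specify; the function names, the default threshold of 0.8, and sending a score exactly equal to the threshold to the expert are likewise assumptions.

```python
from typing import Dict


def jaccard_similarity(a: Dict[str, str], b: Dict[str, str]) -> float:
    """Share of (feature, value) pairs that the two vectors have in common."""
    pairs_a, pairs_b = set(a.items()), set(b.items())
    if not pairs_a and not pairs_b:
        return 0.0
    return len(pairs_a & pairs_b) / len(pairs_a | pairs_b)


def decide_move(removed: Dict[str, str], candidate: Dict[str, str],
                threshold: float = 0.8) -> str:
    """Return 'automatic' above the threshold, otherwise 'ask_expert'."""
    score = jaccard_similarity(removed, candidate)
    return "automatic" if score > threshold else "ask_expert"


old = {"name": "Green Day", "formed": "1987"}
same = {"name": "Green Day", "formed": "1987"}
other = {"name": "Green Day", "formed": "1990"}  # hypothetical near-match

clear_case = decide_move(old, same)
unclear_case = decide_move(old, other)
```

The identical vectors score 1.0 and are decided automatically; the near-match shares one of three distinct pairs and goes to the expert user.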



DSNotify HTTP Interface

        •     GET http://<server>:<port>/<dsnotify>/item/<uri>
             •      find out what happened with an item

        •     GET http://<server>:<port>/<dsnotify>/eventChoice
             •      retrieve pending event choices (move / remove)

        •     ...
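A client could assemble these request URLs as sketched below. The host, port, and application path are placeholder values, and percent-encoding the item URI is an assumption: the slide shows only `<uri>` and does not say how it must be encoded.

```python
from urllib.parse import quote


def item_url(server: str, port: int, app: str, item_uri: str) -> str:
    """URL for asking what happened to an item."""
    # Assumption: the item URI is percent-encoded into a single path segment.
    return f"http://{server}:{port}/{app}/item/{quote(item_uri, safe='')}"


def event_choice_url(server: str, port: int, app: str) -> str:
    """URL for retrieving pending event choices (move / remove)."""
    return f"http://{server}:{port}/{app}/eventChoice"


# Placeholder deployment values, chosen only for the example.
url = item_url("localhost", 8080, "dsnotify", "http://dbpedia.org/resource/Green_Day")
choices = event_choice_url("localhost", 8080, "dsnotify")
```

Either URL can then be fetched with a plain HTTP GET, for example via `urllib.request.urlopen`; the response format is not described on the slide.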



Evaluation Plan
[Figure: timeline of DBpedia releases: DBpedia 2.0 (t-n), ..., DBpedia 3.0 (t-2), DBpedia 3.1 (t-1), DBpedia 3.2 (t0). A diff is computed between consecutive releases, and each diff is manually classified into moves (mv) and removals (rm).]
Status / Future Work

        •     1st prototype (infrastructure) ready

        •     annotated test-data set based on DBpedia available

        •     Currently working on:
             •     system for simulating past modifications in DBpedia

             •     the DSNotify evaluation



Fixing Your Web since 2009
Backup




Evaluation Plan

        •     Monitor simulated DBpedia evolution (t-n to t0)

        •     Precision / recall of automatic move detection
             •     with different similarity thresholds

             •     with different heuristics and feature vectors
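Precision and recall for one run can be computed as sketched here, assuming each move is an (old URI, new URI) pair and the manually classified moves serve as ground truth. The function name and the convention of returning 0.0 for an empty set are assumptions; the pairs are made-up placeholders.

```python
from typing import Set, Tuple

Move = Tuple[str, str]  # (old URI, new URI)


def precision_recall(detected: Set[Move], actual: Set[Move]) -> Tuple[float, float]:
    """Precision and recall of detected moves against the manual classification."""
    true_positives = len(detected & actual)
    precision = true_positives / len(detected) if detected else 0.0
    recall = true_positives / len(actual) if actual else 0.0
    return precision, recall


# Placeholder data: two real moves, two detected moves, one of them correct.
actual = {("a", "b"), ("c", "d")}
detected = {("a", "b"), ("e", "f")}
precision, recall = precision_recall(detected, actual)
```

Repeating this for each similarity threshold and each heuristic gives the comparison the plan describes.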




Linked Data / Web of Data

        •     Data management paradigm on the basis of Web
              technologies

        •     HTTP, URI, and RDF/S are the key technologies

        •     Applications (not Web browsers) are data consumers

        •     Links between resources play a major role




Mais conteúdo relacionado

Semelhante a DSNotify - Detecting and Fixing Broken Links in Linked Data Sets

S. Dixon, C. Mesnage, B. Norton. LinkedBrainz Live
S. Dixon, C. Mesnage, B. Norton. LinkedBrainz LiveS. Dixon, C. Mesnage, B. Norton. LinkedBrainz Live
S. Dixon, C. Mesnage, B. Norton. LinkedBrainz LiveMusicNet
 
Node collaboration - Exported Resources and PuppetDB
Node collaboration - Exported Resources and PuppetDBNode collaboration - Exported Resources and PuppetDB
Node collaboration - Exported Resources and PuppetDBm_richardson
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?Ivan Herman
 
Linked Open Data (LOD) part 1
Linked Open Data (LOD) part 1Linked Open Data (LOD) part 1
Linked Open Data (LOD) part 1IPLODProject
 
A document-inspired way for tracking changes of RDF data - The case of the Op...
A document-inspired way for tracking changes of RDF data - The case of the Op...A document-inspired way for tracking changes of RDF data - The case of the Op...
A document-inspired way for tracking changes of RDF data - The case of the Op...University of Bologna
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero
Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila SueroLinked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero
Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila SueroBiblioteca Nacional de España
 
A Provenance-Aware Linked Data Application for Trip Management and Organization
A Provenance-Aware Linked Data Application for Trip Management and OrganizationA Provenance-Aware Linked Data Application for Trip Management and Organization
A Provenance-Aware Linked Data Application for Trip Management and OrganizationBoris Villazón-Terrazas
 
Getting Started with Hadoop
Getting Started with HadoopGetting Started with Hadoop
Getting Started with HadoopJosh Devins
 
W4 4 marc-alexandre-nolin-v2
W4 4 marc-alexandre-nolin-v2W4 4 marc-alexandre-nolin-v2
W4 4 marc-alexandre-nolin-v2nolmar01
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
 
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked KnowledgeFrom Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked KnowledgeSören Auer
 
RO-crate-FDO-ROHub
RO-crate-FDO-ROHubRO-crate-FDO-ROHub
RO-crate-FDO-ROHubRaul Palma
 
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseAnita de Waard
 
2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objectsStian Soiland-Reyes
 
ROHub-Argos integration
ROHub-Argos integrationROHub-Argos integration
ROHub-Argos integrationRaul Palma
 
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...Mark Wilkinson
 
Furore devdays 2017- rdf1(solbrig)
Furore devdays 2017- rdf1(solbrig)Furore devdays 2017- rdf1(solbrig)
Furore devdays 2017- rdf1(solbrig)DevDays
 
IBC FAIR Data Prototype Implementation slideshow
IBC FAIR Data Prototype Implementation   slideshowIBC FAIR Data Prototype Implementation   slideshow
IBC FAIR Data Prototype Implementation slideshowMark Wilkinson
 
Make our Scientific Datasets Accessible and Interoperable on the Web
Make our Scientific Datasets Accessible and Interoperable on the WebMake our Scientific Datasets Accessible and Interoperable on the Web
Make our Scientific Datasets Accessible and Interoperable on the WebFranck Michel
 

Semelhante a DSNotify - Detecting and Fixing Broken Links in Linked Data Sets (20)

S. Dixon, C. Mesnage, B. Norton. LinkedBrainz Live
S. Dixon, C. Mesnage, B. Norton. LinkedBrainz LiveS. Dixon, C. Mesnage, B. Norton. LinkedBrainz Live
S. Dixon, C. Mesnage, B. Norton. LinkedBrainz Live
 
Node collaboration - Exported Resources and PuppetDB
Node collaboration - Exported Resources and PuppetDBNode collaboration - Exported Resources and PuppetDB
Node collaboration - Exported Resources and PuppetDB
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?
 
Linked Open Data (LOD) part 1
Linked Open Data (LOD) part 1Linked Open Data (LOD) part 1
Linked Open Data (LOD) part 1
 
A document-inspired way for tracking changes of RDF data - The case of the Op...
A document-inspired way for tracking changes of RDF data - The case of the Op...A document-inspired way for tracking changes of RDF data - The case of the Op...
A document-inspired way for tracking changes of RDF data - The case of the Op...
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero
Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila SueroLinked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero
Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero
 
A Provenance-Aware Linked Data Application for Trip Management and Organization
A Provenance-Aware Linked Data Application for Trip Management and OrganizationA Provenance-Aware Linked Data Application for Trip Management and Organization
A Provenance-Aware Linked Data Application for Trip Management and Organization
 
Getting Started with Hadoop
Getting Started with HadoopGetting Started with Hadoop
Getting Started with Hadoop
 
W4 4 marc-alexandre-nolin-v2
W4 4 marc-alexandre-nolin-v2W4 4 marc-alexandre-nolin-v2
W4 4 marc-alexandre-nolin-v2
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
 
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked KnowledgeFrom Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
 
RO-crate-FDO-ROHub
RO-crate-FDO-ROHubRO-crate-FDO-ROHub
RO-crate-FDO-ROHub
 
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects
 
ROHub-Argos integration
ROHub-Argos integrationROHub-Argos integration
ROHub-Argos integration
 
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
 
Furore devdays 2017- rdf1(solbrig)
Furore devdays 2017- rdf1(solbrig)Furore devdays 2017- rdf1(solbrig)
Furore devdays 2017- rdf1(solbrig)
 
IBC FAIR Data Prototype Implementation slideshow
IBC FAIR Data Prototype Implementation   slideshowIBC FAIR Data Prototype Implementation   slideshow
IBC FAIR Data Prototype Implementation slideshow
 
Make our Scientific Datasets Accessible and Interoperable on the Web
Make our Scientific Datasets Accessible and Interoperable on the WebMake our Scientific Datasets Accessible and Interoperable on the Web
Make our Scientific Datasets Accessible and Interoperable on the Web
 

Mais de EuropeanaConnect

Europeana Connect All-Staff Meeting
Europeana Connect All-Staff MeetingEuropeana Connect All-Staff Meeting
Europeana Connect All-Staff MeetingEuropeanaConnect
 
Europeana v1.0 and Interdependencies with EuropeanaConnect
Europeana v1.0 and Interdependencies with EuropeanaConnectEuropeana v1.0 and Interdependencies with EuropeanaConnect
Europeana v1.0 and Interdependencies with EuropeanaConnectEuropeanaConnect
 
in Europeana and the projects
in Europeana and the projectsin Europeana and the projects
in Europeana and the projectsEuropeanaConnect
 
Europeana and linked cultural heritage data
Europeana and linked cultural heritage dataEuropeana and linked cultural heritage data
Europeana and linked cultural heritage dataEuropeanaConnect
 
EuropeanaConnect WP4 - Europeana Licensing Framework
EuropeanaConnect WP4 - Europeana Licensing Framework EuropeanaConnect WP4 - Europeana Licensing Framework
EuropeanaConnect WP4 - Europeana Licensing Framework EuropeanaConnect
 
Europeana - Digitale Bibliothek Europas. Fenster zur Welt für lokale, regiona...
Europeana - Digitale Bibliothek Europas. Fenster zur Welt für lokale, regiona...Europeana - Digitale Bibliothek Europas. Fenster zur Welt für lokale, regiona...
Europeana - Digitale Bibliothek Europas. Fenster zur Welt für lokale, regiona...EuropeanaConnect
 
Europeana: Europe's flagship web portal, making Europe's cultural heritage ac...
Europeana: Europe's flagship web portal, making Europe's cultural heritage ac...Europeana: Europe's flagship web portal, making Europe's cultural heritage ac...
Europeana: Europe's flagship web portal, making Europe's cultural heritage ac...EuropeanaConnect
 
Europeana and EuropeanaConnect
Europeana and EuropeanaConnect Europeana and EuropeanaConnect
Europeana and EuropeanaConnect EuropeanaConnect
 
Semantische Kontextualisierung von Museumsbestanden in Europeana
Semantische Kontextualisierung von Museumsbestanden in EuropeanaSemantische Kontextualisierung von Museumsbestanden in Europeana
Semantische Kontextualisierung von Museumsbestanden in EuropeanaEuropeanaConnect
 
EU-funded project Europeana - Europe's flagship web portal, making Europe's c...
EU-funded project Europeana - Europe's flagship web portal, making Europe's c...EU-funded project Europeana - Europe's flagship web portal, making Europe's c...
EU-funded project Europeana - Europe's flagship web portal, making Europe's c...EuropeanaConnect
 
Promoting Austrian Cultural and Scientific Heritage via EUROPEANA
Promoting Austrian Cultural and Scientific Heritage via EUROPEANAPromoting Austrian Cultural and Scientific Heritage via EUROPEANA
Promoting Austrian Cultural and Scientific Heritage via EUROPEANAEuropeanaConnect
 
Linked Data und Semantic Web-basierte Funktionalität in Europeana
Linked Data und Semantic Web-basierte Funktionalität in EuropeanaLinked Data und Semantic Web-basierte Funktionalität in Europeana
Linked Data und Semantic Web-basierte Funktionalität in EuropeanaEuropeanaConnect
 
Enhancing user access to european digital heritage
Enhancing user access to european digital heritageEnhancing user access to european digital heritage
Enhancing user access to european digital heritageEuropeanaConnect
 

Mais de EuropeanaConnect (18)

Europeana Connect All-Staff Meeting
Europeana Connect All-Staff MeetingEuropeana Connect All-Staff Meeting
Europeana Connect All-Staff Meeting
 
Europeana v1.0 and Interdependencies with EuropeanaConnect
Europeana v1.0 and Interdependencies with EuropeanaConnectEuropeana v1.0 and Interdependencies with EuropeanaConnect
Europeana v1.0 and Interdependencies with EuropeanaConnect
 
The Europeana Personas
The Europeana PersonasThe Europeana Personas
The Europeana Personas
 
in Europeana and the projects
in Europeana and the projectsin Europeana and the projects
in Europeana and the projects
 
Europeana and linked cultural heritage data
Europeana and linked cultural heritage dataEuropeana and linked cultural heritage data
Europeana and linked cultural heritage data
 
090626cc tech-summit
090626cc tech-summit090626cc tech-summit
090626cc tech-summit
 
eBooks on Demand
eBooks on Demand eBooks on Demand
eBooks on Demand
 
EuropeanaConnect WP4 - Europeana Licensing Framework
EuropeanaConnect WP4 - Europeana Licensing Framework EuropeanaConnect WP4 - Europeana Licensing Framework
EuropeanaConnect WP4 - Europeana Licensing Framework
 
Europeana - Digitale Bibliothek Europas. Fenster zur Welt für lokale, regiona...
Europeana - Digitale Bibliothek Europas. Fenster zur Welt für lokale, regiona...Europeana - Digitale Bibliothek Europas. Fenster zur Welt für lokale, regiona...
Europeana - Digitale Bibliothek Europas. Fenster zur Welt für lokale, regiona...
 
Europeana: Europe's flagship web portal, making Europe's cultural heritage ac...
Europeana: Europe's flagship web portal, making Europe's cultural heritage ac...Europeana: Europe's flagship web portal, making Europe's cultural heritage ac...
Europeana: Europe's flagship web portal, making Europe's cultural heritage ac...
 
Europeana and EuropeanaConnect
Europeana and EuropeanaConnect Europeana and EuropeanaConnect
Europeana and EuropeanaConnect
 
eBooks & more
eBooks & moreeBooks & more
eBooks & more
 
Semantische Kontextualisierung von Museumsbestanden in Europeana
Semantische Kontextualisierung von Museumsbestanden in EuropeanaSemantische Kontextualisierung von Museumsbestanden in Europeana
Semantische Kontextualisierung von Museumsbestanden in Europeana
 
EU-funded project Europeana - Europe's flagship web portal, making Europe's c...
EU-funded project Europeana - Europe's flagship web portal, making Europe's c...EU-funded project Europeana - Europe's flagship web portal, making Europe's c...
EU-funded project Europeana - Europe's flagship web portal, making Europe's c...
 
Promoting Austrian Cultural and Scientific Heritage via EUROPEANA
Promoting Austrian Cultural and Scientific Heritage via EUROPEANAPromoting Austrian Cultural and Scientific Heritage via EUROPEANA
Promoting Austrian Cultural and Scientific Heritage via EUROPEANA
 
Linked Data und Semantic Web-basierte Funktionalität in Europeana
Linked Data und Semantic Web-basierte Funktionalität in EuropeanaLinked Data und Semantic Web-basierte Funktionalität in Europeana
Linked Data und Semantic Web-basierte Funktionalität in Europeana
 
eBooks on demand
eBooks on demandeBooks on demand
eBooks on demand
 
Enhancing user access to european digital heritage
Enhancing user access to european digital heritageEnhancing user access to european digital heritage
Enhancing user access to european digital heritage
 

Último

4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationRosabel UA
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 

Último (20)

4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx

DSNotify - Detecting and Fixing Broken Links in Linked Data Sets

  • 1. DSNotify - Detecting and Fixing Broken Links in Linked Data Sets. WebS ’09 @ DEXA 2009, Linz, 02/09/2009. Bernhard Haslhofer and Niko Popitsch
  • 6.
      <mo:MusicGroup rdf:about="/music/artists/084308bd-1654-436f-ba03-df6697104e19#artist">
        <foaf:name>Green Day</foaf:name>
        <owl:sameAs rdf:resource="http://dbpedia.org/resource/Green_Day" />
        <mo:image rdf:resource="/music/images/artists/7col_in/084308bd-1654-436f-ba03-df6697104e19.jpg" />
        <foaf:page rdf:resource="/music/artists/084308bd-1654-436f-ba03-df6697104e19.html" />
        <mo:musicbrainz rdf:resource="http://musicbrainz.org/artist/084308bd-1654-436f-ba03-df6697104e19.html" />
        <mo:homepage rdf:resource="http://www.greenday.com/" />
        <mo:fanpage rdf:resource="http://www.greendayvideos.com/" />
        <mo:fanpage rdf:resource="http://www.greenday.net" />
        <mo:imdb rdf:resource="http://www.imdb.com/name/nm1554564/" />
        <mo:myspace rdf:resource="http://www.myspace.com/greenday" />
        ...
  • 7.
      ...
      <rdf:Description rdf:about="http://dbpedia.org/resource/Green_Day">
        <dbpprop:abstract xmlns:dbpprop="http://dbpedia.org/property/" xml:lang="en">Green Day
        is an American rock trio formed in 1987. The band has consisted of Billie Joe Armstrong
        (vocals, guitar), Mike Dirnt, and Tré Cool for the majority of its existence...
        </dbpprop:abstract>
      </rdf:Description>
      ...
      <rdf:Description rdf:about="http://dbpedia.org/resource/Green_Day">
        <dbpprop:abstract xmlns:dbpprop="http://dbpedia.org/property/" xml:lang="de">Green Day
        [gɹiːn deɪ] ist eine US-amerikanische Punk-Rock-Band, mit der Anfang der 1990er das
        Punk-Revival begann. Die Band wurde 1987 von Billie Joe Armstrong und Mike Dirnt zusammen
        mit dem Schlagzeuger John Kiffmeyer alias Al Sobrante als The Sweet Children....
        </dbpprop:abstract>
      </rdf:Description>
      ...
  • 9. Some numbers...
      • Events between DBpedia 3.2 (10/2008) and 3.3 (05/2009)
          • # resources created: 29449
          • # resources removed: 4789
          • # resources moved: 729
  • 11. Link Integrity...
      • is a qualitative property that is given when all links within and between a set of data sources are valid and deliver the result intended by the link creator.
          • cf. referential integrity in RDBMS
      • demands a solution that
          • detects broken links between resources
          • provides support for fixing broken links
  • 12. Types of broken links
      • Removed link targets
          • e.g., resource deleted, server not available anymore, etc.
      • Moved link targets
          • available at another Web location
          • e.g., reorganization of Web resources
      • Modified link targets
  • 13. The DSNotify Approach
      • periodically monitor items (resources) in a specific Linked Data source
      • extract a descriptive feature vector for each item
      • store item + feature vector in an index
      • use feature vectors to detect whether items have been removed or moved to another location
      • if moved, add a relationship between the “old” and the “new” item
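The monitoring steps above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the DSNotify implementation: the feature choice (lowercased label tokens), the dict-based index, and the names `extract_features` and `diff_snapshots` are all hypothetical.

```python
from typing import Dict, FrozenSet, Set, Tuple

FeatureVector = FrozenSet[str]


def extract_features(label: str) -> FeatureVector:
    """Hypothetical feature extraction: the lowercased tokens of an item's label."""
    return frozenset(label.lower().split())


def diff_snapshots(
    index: Dict[str, FeatureVector], snapshot: Dict[str, str]
) -> Tuple[Set[str], Set[str]]:
    """Compare the stored index with a fresh snapshot (URI -> label).

    Returns (disappeared, appeared) URIs. Newly seen items are added to the
    index; disappeared items stay in it so that their feature vectors remain
    available when deciding between "removed" and "moved".
    """
    disappeared = set(index) - set(snapshot)
    appeared = set(snapshot) - set(index)
    for uri in appeared:
        index[uri] = extract_features(snapshot[uri])
    return disappeared, appeared


# One monitoring cycle: the item is no longer found at its old URI,
# and an item with the same label shows up at a new URI.
index = {"http://dbpedia.org/resource/Green_Day": extract_features("Green Day")}
snapshot = {"http://dbpedia.org/resource/band/Green_Day": "Green Day"}
disappeared, appeared = diff_snapshots(index, snapshot)
```

Whether a disappeared item was removed or moved is a separate decision that compares its stored feature vector with those of the newly appeared items.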
  • 14. Architecture (diagram). Components shown: LOD-consuming application; LOD sources linked via owl:sameAs; within DSNotify: Monitor (feature extraction), Event log, LOD source updater, Indices (II, RII, AII), Decider (decision making), Move Detector (heuristic); user. Labelled interactions: monitor, update, notifications, querying.
  • 15. Index Interaction
      Indices: Item Index (II), Archived Item Index (AII), Removed Item Index (RII)
      • t1: http://dbpedia.org/resource/Green_Day (band)
      • t2: http://dbpedia.org/resource/Green_Day (band)
      • t3: http://dbpedia.org/resource/band/Green_Day, http://dbpedia.org/resource/Green_Day (band)
      • t4: http://dbpedia.org/resource/band/Alternative/Green_Day, http://dbpedia.org/resource/band/Green_Day, http://dbpedia.org/resource/Green_Day (band)
  • 16. Move Detection
      • is a semi-automatic process
      • calculate similarity between items based on their feature vectors using domain-specific heuristics
          • probability > given threshold: automatic decision
          • probability < given threshold: ask expert user
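The thresholded decision above could look roughly like this. The slides do not specify the domain-specific heuristics, so Jaccard similarity over token sets stands in for them here; the function names, the threshold value, and the sample data are assumptions made for illustration.

```python
from typing import Dict, FrozenSet, Optional, Tuple


def jaccard(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    """Jaccard similarity of two token sets (a stand-in heuristic)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def decide(
    removed: FrozenSet[str],
    candidates: Dict[str, FrozenSet[str]],
    threshold: float,
) -> Tuple[str, Optional[str], float]:
    """Return ("move", uri, score) if the best candidate exceeds the
    threshold, otherwise ("ask_user", best_uri_or_None, score)."""
    if not candidates:
        return ("ask_user", None, 0.0)
    best_uri, best_score = max(
        ((uri, jaccard(removed, features)) for uri, features in candidates.items()),
        key=lambda pair: pair[1],
    )
    if best_score > threshold:
        return ("move", best_uri, best_score)
    return ("ask_user", best_uri, best_score)


removed = frozenset({"green", "day", "band"})
candidates = {
    "http://dbpedia.org/resource/band/Green_Day": frozenset({"green", "day", "band"}),
    "http://example.org/Green_Bay": frozenset({"green", "bay"}),  # made-up item
}
automatic = decide(removed, candidates, threshold=0.8)
uncertain = decide(
    removed, {"http://example.org/Green_Bay": frozenset({"green", "bay"})}, threshold=0.8
)
```

In the first call the identical feature vector scores 1.0 and the move is accepted automatically; in the second the only candidate scores 0.25, so the case would be handed to an expert user.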
  • 17. DSNotify HTTP Interface
      • GET http://<server>:<port>/<dsnotify>/item/<uri>
          • find out what happened with an item
      • GET http://<server>:<port>/<dsnotify>/eventChoice
          • retrieve pending event choices (move / remove)
      • ...
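A client might build these two request URLs as sketched below. The slides do not say how the `<uri>` placeholder is encoded or what the responses look like, so percent-encoding the item URI is an assumption, and `localhost`, `8080`, and `dsnotify` are placeholder values; no request is actually sent.

```python
from urllib.parse import quote


def item_status_url(server: str, port: int, context: str, item_uri: str) -> str:
    """URL for asking what happened to an item.

    Assumption: the item URI is percent-encoded into a single path segment.
    """
    return f"http://{server}:{port}/{context}/item/{quote(item_uri, safe='')}"


def event_choice_url(server: str, port: int, context: str) -> str:
    """URL for retrieving pending event choices (move / remove)."""
    return f"http://{server}:{port}/{context}/eventChoice"


# Placeholder deployment values, not taken from the slides.
status_url = item_status_url(
    "localhost", 8080, "dsnotify", "http://dbpedia.org/resource/Green_Day"
)
choices_url = event_choice_url("localhost", 8080, "dsnotify")
```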
  • 18. Evaluation Plan (diagram). Timeline t-n ... t-2, t-1, t0 with DBpedia releases 2.0, 3.0, 3.1, 3.2; the diffs between consecutive releases are manually classified into moves (mv) and removals (rm).
  • 19. Status / Future Work
      • 1st prototype (infrastructure) ready
      • annotated test-data set based on DBpedia available
      • Currently working on:
          • system for simulating past modifications in DBpedia
          • the DSNotify evaluation
  • 20. Fixing Your Web since 2009
  • 22. Evaluation Plan
      • Monitor simulated DBpedia evolution (t-n to t0)
      • Precision / recall of automatic move detection
          • with different similarity thresholds
          • with different heuristics and feature vectors
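Precision and recall of move detection can be computed against the manually classified moves as sketched below. The URI pairs are made-up illustration data, not evaluation results, and `precision_recall` is a hypothetical helper name.

```python
from typing import Set, Tuple

Move = Tuple[str, str]  # (old URI, new URI)


def precision_recall(detected: Set[Move], actual: Set[Move]) -> Tuple[float, float]:
    """Precision and recall of detected moves against the manual classification."""
    true_positives = len(detected & actual)
    precision = true_positives / len(detected) if detected else 0.0
    recall = true_positives / len(actual) if actual else 0.0
    return precision, recall


# Illustration only: four manually classified moves, three detected,
# two of which are correct.
actual = {("a", "a2"), ("b", "b2"), ("c", "c2"), ("d", "d2")}
detected = {("a", "a2"), ("b", "b2"), ("x", "x2")}
precision, recall = precision_recall(detected, actual)
```

Repeating this for each similarity threshold and each heuristic gives one precision/recall pair per configuration.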
  • 23. Linked Data / Web of Data
      • Data management paradigm on the basis of Web technologies
      • HTTP, URI, and RDF/S are the key technologies
      • Applications (not Web browsers) are data consumers
      • Links between resources play a major role