SlideShare uma empresa Scribd logo
1 de 25
Baixar para ler offline
Assessing Linked Data Mappings using
                   Network Measures

   Christophe Guéret, Paul Groth, Claus Stadler, Jens Lehmann


                9th Extended Semantic Web Conference (ESWC)
                                May 29, 2012




   http://latc-project.eu
ESWC - May 2012                     http://aksw.org
                            Assessing Linked Data mappings   http://www.vu.nl   1/25
The next 25+5 minutes
     The impact of links in the Web of Data


     Main questions
         What is the impact of link creation?
         Can we detect “bad” links based on their impact?
         Is adding links always a good thing?


     Contributions
         A framework to assess the impact of links
         Results for 5 metrics
ESWC - May 2012          Assessing Linked Data mappings     2/25
Is this a good or a bad link ?




ESWC - May 2012      Assessing Linked Data mappings   3/25
Measuring the Web of Data
     Look at the topology using network analysis tools


     Impossible to get the complete graph
         Sampling of the graph focusing on specific nodes
         See the bigger picture through aggregation


     Build the local network around a resource


     Repeat the process a sufficient number of time

ESWC - May 2012         Assessing Linked Data mappings      4/25
Network sampling process
     Use SPARQL end point or de-reference the
     resources to get the descriptions




ESWC - May 2012    Assessing Linked Data mappings   5/25
Aggregation of local results


                                                   Observed
                                                   Target




                    …




ESWC - May 2012   Assessing Linked Data mappings         6/25
Metrics
     Compute local scores for a resource


     Criteria
         Use only the local network
         Representative of a global property
         Not sensitive to change of observation scale


     5 metrics currently available in LinkQA


ESWC - May 2012         Assessing Linked Data mappings   7/25
What do we want to see?
     Increase of connectivity within topical groups
         Increase chances of finding related information


     More bridges between topical groups
         Improve browsing capabilities


     More connectivity around hubs
         Decrease the dependency upon the hubs



ESWC - May 2012         Assessing Linked Data mappings     8/25
Metric 1 – Degree
                                      Metric
                                           Number of edges
                                           around the target node


                                      Target
                                           Power-law distribution
                                           of values


                                      Intuition
                                           Presence of hubs

ESWC - May 2012   Assessing Linked Data mappings                    9/25
Metric 2 – Clustering coefficient
                                      Metric
                                           Density of links around
                                           the target node


                                      Target
                                           Increase clustering
                                           around nodes


                                      Intuition
                                           Topical clusters

ESWC - May 2012   Assessing Linked Data mappings                 10/25
Metric 3 – Centrality
                                      Metric
                                           Ratio between outgoing
                                           and incoming links


                                      Target
                                           Lower the discrepancy
                                           between the values


                                      Intuition
                                           Hubs are sensitive

ESWC - May 2012   Assessing Linked Data mappings                11/25
Metric 4 – SameAs chains
                                      Metric
                                           Number of “open”
                                           sameAs chains


                                      Target
                                           No open sameAs


                                      Intuition
                                           Peer agreement


ESWC - May 2012   Assessing Linked Data mappings              12/25
Metric 5 – Description enrichment
                                      Metric
                                           Richness of resource
                                           description


                                      Target
                                            Increase as possible


                                      Intuition
                                           “SameAsed” resources
                                           are complementary

ESWC - May 2012   Assessing Linked Data mappings                   13/25
Under the hood of LinkQA




ESWC - May 2012         Assessing Linked Data mappings                           14/25
                                    http://www.flickr.com/photos/cradlehall/5747161514
Workflow of an analysis




ESWC - May 2012   Assessing Linked Data mappings   15/25
Output of an analysis
     Results on the node and aggregated scale


     Per metric:
         Indication of change with respect to the target
         Sorted list of outlier nodes, sorted by their distance to
         the target


     Plus, a global ranking of nodes


     => Input for manual inspection by an expert
ESWC - May 2012          Assessing Linked Data mappings              16/25
Experimental results




ESWC - May 2012       Assessing Linked Data mappings   17/25
Global impact of links
     Observe the distributions to detect bad links




ESWC - May 2012      Assessing Linked Data mappings   18/25
First evaluation
     160 linking specifications for Silk, developed in
     the context of LATC


     6 linking specifications with manual verification of
     results
         50 positive links
         50 negative links


     Execute LinkQA with 10 samples of 50 links

ESWC - May 2012          Assessing Linked Data mappings   19/25
Results of the detection




     “C” if change detected in > 50% of runs

ESWC - May 2012     Assessing Linked Data mappings   20/25
Some explanations
     Low sensitivity of metrics:
         Lack of data
         Stable change


     50/50 accuracy of detection:
         Targets may not be the right ones
         Sample may not be big enough
         Semantics agnostic measures are less performant



ESWC - May 2012          Assessing Linked Data mappings    21/25
A closer look at the outliers
     See if the outliers are necessarily bad links




ESWC - May 2012      Assessing Linked Data mappings   22/25
Second evaluation
     Linking specifications for Silk, developed in the
     context of LATC


     All linking specifications sampled to have
         45 positive links
         5 negative links


     Execute LinkQA five time, on five samples



ESWC - May 2012          Assessing Linked Data mappings   23/25
Rank of positive and negative links




ESWC - May 2012   Assessing Linked Data mappings   24/25
Take home message
     LinkQA is a node centric approach to measure the
     impact of links in the WoD network
         Scalable, can be distributed


     Current results show that
         The 5 metrics defines are to be improved
         Metrics considering Semantics perform better
         The network sample seems too small
         Outliers detection improves with the number of metrics


ESWC - May 2012         Assessing Linked Data mappings       25/25

Mais conteúdo relacionado

Semelhante a Assessing Impact of Links in Linked Data Using Network Measures

J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...SBGC
 
Literature review of attribute level and
Literature review of attribute level andLiterature review of attribute level and
Literature review of attribute level andIJDKP
 
Record matching
Record matchingRecord matching
Record matchingNishna Ma
 
Cross Domain Data Fusion
Cross Domain Data FusionCross Domain Data Fusion
Cross Domain Data FusionIRJET Journal
 
LPCNN: convolutional neural network for link prediction based on network stru...
LPCNN: convolutional neural network for link prediction based on network stru...LPCNN: convolutional neural network for link prediction based on network stru...
LPCNN: convolutional neural network for link prediction based on network stru...TELKOMNIKA JOURNAL
 
Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...
Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...
Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...AAP PreK-12 Learning Group
 
Survey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction TechniquesSurvey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction TechniquesIRJET Journal
 
SEBD2015_PresentationVitali
SEBD2015_PresentationVitaliSEBD2015_PresentationVitali
SEBD2015_PresentationVitaliMonica Vitali
 
Mca projects in gagner, chennai slideshare
Mca projects in gagner, chennai   slideshareMca projects in gagner, chennai   slideshare
Mca projects in gagner, chennai slideshareGagnertech
 
Mca projects in gagner
Mca projects in gagnerMca projects in gagner
Mca projects in gagnerGagnertech
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES cscpconf
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIESENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIEScsandit
 
Enhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologiesEnhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologiescsandit
 
Algorithm Solved IEEE Projects 2012 2013 Java @ Seabirdssolutions
Algorithm Solved IEEE Projects 2012 2013 Java @ SeabirdssolutionsAlgorithm Solved IEEE Projects 2012 2013 Java @ Seabirdssolutions
Algorithm Solved IEEE Projects 2012 2013 Java @ SeabirdssolutionsSBGC
 
Ieee projects 2012 for cse
Ieee projects 2012 for cseIeee projects 2012 for cse
Ieee projects 2012 for cseSBGC
 
Ieee projects 2012 for cse
Ieee projects 2012 for cseIeee projects 2012 for cse
Ieee projects 2012 for cseSBGC
 
Clustering heterogeneous categorical data using enhanced mini batch K-means ...
Clustering heterogeneous categorical data using enhanced mini  batch K-means ...Clustering heterogeneous categorical data using enhanced mini  batch K-means ...
Clustering heterogeneous categorical data using enhanced mini batch K-means ...IJECEIAES
 
Instance Matching
Instance Matching Instance Matching
Instance Matching Robert Isele
 
Survey on Location Based Recommendation System Using POI
Survey on Location Based Recommendation System Using POISurvey on Location Based Recommendation System Using POI
Survey on Location Based Recommendation System Using POIIRJET Journal
 

Semelhante a Assessing Impact of Links in Linked Data Using Network Measures (20)

J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
 
Literature review of attribute level and
Literature review of attribute level andLiterature review of attribute level and
Literature review of attribute level and
 
Record matching
Record matchingRecord matching
Record matching
 
Cross Domain Data Fusion
Cross Domain Data FusionCross Domain Data Fusion
Cross Domain Data Fusion
 
LPCNN: convolutional neural network for link prediction based on network stru...
LPCNN: convolutional neural network for link prediction based on network stru...LPCNN: convolutional neural network for link prediction based on network stru...
LPCNN: convolutional neural network for link prediction based on network stru...
 
Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...
Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...
Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...
 
Survey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction TechniquesSurvey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction Techniques
 
SEBD2015_PresentationVitali
SEBD2015_PresentationVitaliSEBD2015_PresentationVitali
SEBD2015_PresentationVitali
 
Mca projects in gagner, chennai slideshare
Mca projects in gagner, chennai   slideshareMca projects in gagner, chennai   slideshare
Mca projects in gagner, chennai slideshare
 
Mca projects in gagner
Mca projects in gagnerMca projects in gagner
Mca projects in gagner
 
Data Management.pptx
Data Management.pptxData Management.pptx
Data Management.pptx
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIESENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
 
Enhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologiesEnhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologies
 
Algorithm Solved IEEE Projects 2012 2013 Java @ Seabirdssolutions
Algorithm Solved IEEE Projects 2012 2013 Java @ SeabirdssolutionsAlgorithm Solved IEEE Projects 2012 2013 Java @ Seabirdssolutions
Algorithm Solved IEEE Projects 2012 2013 Java @ Seabirdssolutions
 
Ieee projects 2012 for cse
Ieee projects 2012 for cseIeee projects 2012 for cse
Ieee projects 2012 for cse
 
Ieee projects 2012 for cse
Ieee projects 2012 for cseIeee projects 2012 for cse
Ieee projects 2012 for cse
 
Clustering heterogeneous categorical data using enhanced mini batch K-means ...
Clustering heterogeneous categorical data using enhanced mini  batch K-means ...Clustering heterogeneous categorical data using enhanced mini  batch K-means ...
Clustering heterogeneous categorical data using enhanced mini batch K-means ...
 
Instance Matching
Instance Matching Instance Matching
Instance Matching
 
Survey on Location Based Recommendation System Using POI
Survey on Location Based Recommendation System Using POISurvey on Location Based Recommendation System Using POI
Survey on Location Based Recommendation System Using POI
 

Mais de Christophe Guéret

HHAI June 2022 - KGs and Hybrid Intelligence
HHAI June 2022 - KGs and Hybrid IntelligenceHHAI June 2022 - KGs and Hybrid Intelligence
HHAI June 2022 - KGs and Hybrid IntelligenceChristophe Guéret
 
Informal presentation about RES
Informal presentation about RESInformal presentation about RES
Informal presentation about RESChristophe Guéret
 
Stop making tools! Nobody likes them anyway...
Stop making tools! Nobody likes them anyway...Stop making tools! Nobody likes them anyway...
Stop making tools! Nobody likes them anyway...Christophe Guéret
 
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...Christophe Guéret
 
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"Christophe Guéret
 
The Entity Registry System (ERS)
The Entity Registry System (ERS)The Entity Registry System (ERS)
The Entity Registry System (ERS)Christophe Guéret
 
Let's downscale the semantic web !
Let's downscale the semantic web !Let's downscale the semantic web !
Let's downscale the semantic web !Christophe Guéret
 
Your next data viz gear should be a Wii-U
Your next data viz gear should be a Wii-UYour next data viz gear should be a Wii-U
Your next data viz gear should be a Wii-UChristophe Guéret
 
The road towards a Web-based data ecosystem
The road towards a Web-based data ecosystemThe road towards a Web-based data ecosystem
The road towards a Web-based data ecosystemChristophe Guéret
 
Linked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesLinked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesChristophe Guéret
 
Downscaling information systems for education
Downscaling information systems for educationDownscaling information systems for education
Downscaling information systems for educationChristophe Guéret
 
ICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructureICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructureChristophe Guéret
 
ICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deploymentsICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deploymentsChristophe Guéret
 
Exposing the data from NARCIS with VIVO
Exposing the data from NARCIS with VIVOExposing the data from NARCIS with VIVO
Exposing the data from NARCIS with VIVOChristophe Guéret
 
Clarifier le sens de vos données publiques avec le Web de données
Clarifier le sens de vos données publiques avec le Web de donnéesClarifier le sens de vos données publiques avec le Web de données
Clarifier le sens de vos données publiques avec le Web de donnéesChristophe Guéret
 
Embedding young learners into the information society
Embedding young learners into the information societyEmbedding young learners into the information society
Embedding young learners into the information societyChristophe Guéret
 

Mais de Christophe Guéret (20)

HHAI June 2022 - KGs and Hybrid Intelligence
HHAI June 2022 - KGs and Hybrid IntelligenceHHAI June 2022 - KGs and Hybrid Intelligence
HHAI June 2022 - KGs and Hybrid Intelligence
 
Informal presentation about RES
Informal presentation about RESInformal presentation about RES
Informal presentation about RES
 
Stop making tools! Nobody likes them anyway...
Stop making tools! Nobody likes them anyway...Stop making tools! Nobody likes them anyway...
Stop making tools! Nobody likes them anyway...
 
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
 
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
 
The Entity Registry System (ERS)
The Entity Registry System (ERS)The Entity Registry System (ERS)
The Entity Registry System (ERS)
 
Let's downscale the semantic web !
Let's downscale the semantic web !Let's downscale the semantic web !
Let's downscale the semantic web !
 
Your next data viz gear should be a Wii-U
Your next data viz gear should be a Wii-UYour next data viz gear should be a Wii-U
Your next data viz gear should be a Wii-U
 
Linking knowledge spaces
Linking knowledge spacesLinking knowledge spaces
Linking knowledge spaces
 
The data behind the HuisKluis
The data behind the HuisKluisThe data behind the HuisKluis
The data behind the HuisKluis
 
Digital archiving 3.0
Digital archiving 3.0Digital archiving 3.0
Digital archiving 3.0
 
The road towards a Web-based data ecosystem
The road towards a Web-based data ecosystemThe road towards a Web-based data ecosystem
The road towards a Web-based data ecosystem
 
Linked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesLinked Open Data for Digital Humanities
Linked Open Data for Digital Humanities
 
Downscaling information systems for education
Downscaling information systems for educationDownscaling information systems for education
Downscaling information systems for education
 
ICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructureICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructure
 
ICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deploymentsICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deployments
 
ICT4D course 2013 - Sugar
ICT4D course 2013 - SugarICT4D course 2013 - Sugar
ICT4D course 2013 - Sugar
 
Exposing the data from NARCIS with VIVO
Exposing the data from NARCIS with VIVOExposing the data from NARCIS with VIVO
Exposing the data from NARCIS with VIVO
 
Clarifier le sens de vos données publiques avec le Web de données
Clarifier le sens de vos données publiques avec le Web de donnéesClarifier le sens de vos données publiques avec le Web de données
Clarifier le sens de vos données publiques avec le Web de données
 
Embedding young learners into the information society
Embedding young learners into the information societyEmbedding young learners into the information society
Embedding young learners into the information society
 

Último

Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 

Último (20)

Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

Assessing Impact of Links in Linked Data Using Network Measures

  • 1. Assessing Linked Data Mappings using Network Measures Christophe Guéret, Paul Groth, Claus Stadler, Jens Lehmann 9th Extended Semantic Web Conference (ESWC) May 29, 2012 http://latc-project.eu ESWC - May 2012 http://aksw.org Assessing Linked Data mappings http://www.vu.nl 1/25
  • 2. The next 25+5 minutes The impact of links in the Web of Data Main questions What is the impact of link creation? Can we detect “bad” links based on their impact? Is adding links always a good thing? Contributions A framework to assess the impact of links Results for 5 metrics ESWC - May 2012 Assessing Linked Data mappings 2/25
  • 3. Is this a good or a bad link ? ESWC - May 2012 Assessing Linked Data mappings 3/25
  • 4. Measuring the Web of Data Look at the topology using network analysis tools Impossible to get the complete graph Sampling of the graph focusing on specific nodes See the bigger picture through aggregation Build the local network around a resource Repeat the process a sufficient number of time ESWC - May 2012 Assessing Linked Data mappings 4/25
  • 5. Network sampling process Use SPARQL end point or de-reference the resources to get the descriptions ESWC - May 2012 Assessing Linked Data mappings 5/25
  • 6. Aggregation of local results Observed Target … ESWC - May 2012 Assessing Linked Data mappings 6/25
  • 7. Metrics Compute local scores for a resource Criteria Use only the local network Representative of a global property Not sensitive to change of observation scale 5 metrics currently available in LinkQA ESWC - May 2012 Assessing Linked Data mappings 7/25
  • 8. What do we want to see? Increase of connectivity within topical groups Increase chances of finding related information More bridges between topical groups Improve browsing capabilities More connectivity around hubs Decrease the dependency upon the hubs ESWC - May 2012 Assessing Linked Data mappings 8/25
  • 9. Metric 1 – Degree Metric Number of edges around the target node Target Power-law distribution of values Intuition Presence of hubs ESWC - May 2012 Assessing Linked Data mappings 9/25
  • 10. Metric 2 – Clustering coefficient Metric Density of links around the target node Target Increase clustering around nodes Intuition Topical clusters ESWC - May 2012 Assessing Linked Data mappings 10/25
  • 11. Metric 3 – Centrality Metric Ratio between outgoing and incoming links Target Lower the discrepancy between the values Intuition Hubs are sensitive ESWC - May 2012 Assessing Linked Data mappings 11/25
  • 12. Metric 4 – SameAs chains Metric Number of “open” sameAs chains Target No open sameAs Intuition Peer agreement ESWC - May 2012 Assessing Linked Data mappings 12/25
  • 13. Metric 5 – Description enrichment Metric Richness of resource description Target Increase as possible Intuition “SameAsed” resources are complementary ESWC - May 2012 Assessing Linked Data mappings 13/25
  • 14. Under the hood of LinkQA ESWC - May 2012 Assessing Linked Data mappings 14/25 http://www.flickr.com/photos/cradlehall/5747161514
  • 15. Workflow of an analysis ESWC - May 2012 Assessing Linked Data mappings 15/25
  • 16. Output of an analysis Results on the node and aggregated scale Per metric: Indication of change with respect to the target Sorted list of outlier nodes, sorted by their distance to the target Plus, a global ranking of nodes => Input for manual inspection by an expert ESWC - May 2012 Assessing Linked Data mappings 16/25
  • 17. Experimental results ESWC - May 2012 Assessing Linked Data mappings 17/25
  • 18. Global impact of links Observe the distributions to detect bad links ESWC - May 2012 Assessing Linked Data mappings 18/25
  • 19. First evaluation 160 linking specifications for Silk, developed in the context of LATC 6 linking specifications with manual verification of results 50 positive links 50 negative links Execute LinkQA with 10 samples of 50 links ESWC - May 2012 Assessing Linked Data mappings 19/25
  • 20. Results of the detection “C” if change detected in > 50% of runs ESWC - May 2012 Assessing Linked Data mappings 20/25
  • 21. Some explanations Low sensitivity of metrics: Lack of data Stable change 50/50 accuracy of detection: Targets may not be the right ones Sample may not be big enough Semantics agnostic measures are less performant ESWC - May 2012 Assessing Linked Data mappings 21/25
  • 22. A closer look at the outliers See if the outliers are necessarily bad links ESWC - May 2012 Assessing Linked Data mappings 22/25
  • 23. Second evaluation Linking specifications for Silk, developed in the context of LATC All linking specifications sampled to have 45 positive links 5 negative links Execute LinkQA five time, on five samples ESWC - May 2012 Assessing Linked Data mappings 23/25
  • 24. Rank of positive and negative links ESWC - May 2012 Assessing Linked Data mappings 24/25
  • 25. Take home message LinkQA is a node centric approach to measure the impact of links in the WoD network Scalable, can be distributed Current results show that The 5 metrics defines are to be improved Metrics considering Semantics perform better The network sample seems too small Outliers detection improves with the number of metrics ESWC - May 2012 Assessing Linked Data mappings 25/25