SlideShare uma empresa Scribd logo
1 de 25
Baixar para ler offline
Assessing Linked Data Mappings using
                   Network Measures

   Christophe Guéret, Paul Groth, Claus Stadler, Jens Lehmann


                9th Extended Semantic Web Conference (ESWC)
                                May 29, 2012




   http://latc-project.eu
ESWC - May 2012                     http://aksw.org
                            Assessing Linked Data mappings   http://www.vu.nl   1/25
The next 25+5 minutes
     The impact of links in the Web of Data


     Main questions
         What is the impact of link creation?
         Can we detect “bad” links based on their impact?
         Is adding links always a good thing?


     Contributions
         A framework to assess the impact of links
         Results for 5 metrics
ESWC - May 2012          Assessing Linked Data mappings     2/25
Is this a good or a bad link ?




ESWC - May 2012      Assessing Linked Data mappings   3/25
Measuring the Web of Data
     Look at the topology using network analysis tools


     Impossible to get the complete graph
         Sampling of the graph focusing on specific nodes
         See the bigger picture through aggregation


     Build the local network around a resource


     Repeat the process a sufficient number of time

ESWC - May 2012         Assessing Linked Data mappings      4/25
Network sampling process
     Use SPARQL end point or de-reference the
     resources to get the descriptions




ESWC - May 2012    Assessing Linked Data mappings   5/25
Aggregation of local results


                                                   Observed
                                                   Target




                    …




ESWC - May 2012   Assessing Linked Data mappings         6/25
Metrics
     Compute local scores for a resource


     Criteria
         Use only the local network
         Representative of a global property
         Not sensitive to change of observation scale


     5 metrics currently available in LinkQA


ESWC - May 2012         Assessing Linked Data mappings   7/25
What do we want to see?
     Increase of connectivity within topical groups
         Increase chances of finding related information


     More bridges between topical groups
         Improve browsing capabilities


     More connectivity around hubs
         Decrease the dependency upon the hubs



ESWC - May 2012         Assessing Linked Data mappings     8/25
Metric 1 – Degree
                                      Metric
                                           Number of edges
                                           around the target node


                                      Target
                                           Power-law distribution
                                           of values


                                      Intuition
                                           Presence of hubs

ESWC - May 2012   Assessing Linked Data mappings                    9/25
Metric 2 – Clustering coefficient
                                      Metric
                                           Density of links around
                                           the target node


                                      Target
                                           Increase clustering
                                           around nodes


                                      Intuition
                                           Topical clusters

ESWC - May 2012   Assessing Linked Data mappings                 10/25
Metric 3 – Centrality
                                      Metric
                                           Ratio between outgoing
                                           and incoming links


                                      Target
                                           Lower the discrepancy
                                           between the values


                                      Intuition
                                           Hubs are sensitive

ESWC - May 2012   Assessing Linked Data mappings                11/25
Metric 4 – SameAs chains
                                      Metric
                                           Number of “open”
                                           sameAs chains


                                      Target
                                           No open sameAs


                                      Intuition
                                           Peer agreement


ESWC - May 2012   Assessing Linked Data mappings              12/25
Metric 5 – Description enrichment
                                      Metric
                                           Richness of resource
                                           description


                                      Target
                                            Increase as possible


                                      Intuition
                                           “SameAsed” resources
                                           are complementary

ESWC - May 2012   Assessing Linked Data mappings                   13/25
Under the hood of LinkQA




ESWC - May 2012         Assessing Linked Data mappings                           14/25
                                    http://www.flickr.com/photos/cradlehall/5747161514
Workflow of an analysis




ESWC - May 2012   Assessing Linked Data mappings   15/25
Output of an analysis
     Results on the node and aggregated scale


     Per metric:
         Indication of change with respect to the target
         Sorted list of outlier nodes, sorted by their distance to
         the target


     Plus, a global ranking of nodes


     => Input for manual inspection by an expert
ESWC - May 2012          Assessing Linked Data mappings              16/25
Experimental results




ESWC - May 2012       Assessing Linked Data mappings   17/25
Global impact of links
     Observe the distributions to detect bad links




ESWC - May 2012      Assessing Linked Data mappings   18/25
First evaluation
     160 linking specifications for Silk, developed in
     the context of LATC


     6 linking specifications with manual verification of
     results
         50 positive links
         50 negative links


     Execute LinkQA with 10 samples of 50 links

ESWC - May 2012          Assessing Linked Data mappings   19/25
Results of the detection




     “C” if change detected in > 50% of runs

ESWC - May 2012     Assessing Linked Data mappings   20/25
Some explanations
     Low sensitivity of metrics:
         Lack of data
         Stable change


     50/50 accuracy of detection:
         Targets may not be the right ones
         Sample may not be big enough
         Semantics agnostic measures are less performant



ESWC - May 2012          Assessing Linked Data mappings    21/25
A closer look at the outliers
     See if the outliers are necessarily bad links




ESWC - May 2012      Assessing Linked Data mappings   22/25
Second evaluation
     Linking specifications for Silk, developed in the
     context of LATC


     All linking specifications sampled to have
         45 positive links
         5 negative links


     Execute LinkQA five time, on five samples



ESWC - May 2012          Assessing Linked Data mappings   23/25
Rank of positive and negative links




ESWC - May 2012   Assessing Linked Data mappings   24/25
Take home message
     LinkQA is a node centric approach to measure the
     impact of links in the WoD network
         Scalable, can be distributed


     Current results show that
         The 5 metrics defines are to be improved
         Metrics considering Semantics perform better
         The network sample seems too small
         Outliers detection improves with the number of metrics


ESWC - May 2012         Assessing Linked Data mappings       25/25

Mais conteúdo relacionado

Semelhante a Assessing Impact of Links in Linked Data Using Network Measures

J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...SBGC
 
Literature review of attribute level and
Literature review of attribute level andLiterature review of attribute level and
Literature review of attribute level andIJDKP
 
Record matching
Record matchingRecord matching
Record matchingNishna Ma
 
Cross Domain Data Fusion
Cross Domain Data FusionCross Domain Data Fusion
Cross Domain Data FusionIRJET Journal
 
LPCNN: convolutional neural network for link prediction based on network stru...
LPCNN: convolutional neural network for link prediction based on network stru...LPCNN: convolutional neural network for link prediction based on network stru...
LPCNN: convolutional neural network for link prediction based on network stru...TELKOMNIKA JOURNAL
 
Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...
Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...
Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...AAP PreK-12 Learning Group
 
Survey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction TechniquesSurvey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction TechniquesIRJET Journal
 
SEBD2015_PresentationVitali
SEBD2015_PresentationVitaliSEBD2015_PresentationVitali
SEBD2015_PresentationVitaliMonica Vitali
 
Mca projects in gagner, chennai slideshare
Mca projects in gagner, chennai   slideshareMca projects in gagner, chennai   slideshare
Mca projects in gagner, chennai slideshareGagnertech
 
Mca projects in gagner
Mca projects in gagnerMca projects in gagner
Mca projects in gagnerGagnertech
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIESENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIEScsandit
 
Enhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologiesEnhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologiescsandit
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES cscpconf
 
Algorithm Solved IEEE Projects 2012 2013 Java @ Seabirdssolutions
Algorithm Solved IEEE Projects 2012 2013 Java @ SeabirdssolutionsAlgorithm Solved IEEE Projects 2012 2013 Java @ Seabirdssolutions
Algorithm Solved IEEE Projects 2012 2013 Java @ SeabirdssolutionsSBGC
 
Ieee projects 2012 for cse
Ieee projects 2012 for cseIeee projects 2012 for cse
Ieee projects 2012 for cseSBGC
 
Ieee projects 2012 for cse
Ieee projects 2012 for cseIeee projects 2012 for cse
Ieee projects 2012 for cseSBGC
 
Clustering heterogeneous categorical data using enhanced mini batch K-means ...
Clustering heterogeneous categorical data using enhanced mini  batch K-means ...Clustering heterogeneous categorical data using enhanced mini  batch K-means ...
Clustering heterogeneous categorical data using enhanced mini batch K-means ...IJECEIAES
 
Instance Matching
Instance Matching Instance Matching
Instance Matching Robert Isele
 
Survey on Location Based Recommendation System Using POI
Survey on Location Based Recommendation System Using POISurvey on Location Based Recommendation System Using POI
Survey on Location Based Recommendation System Using POIIRJET Journal
 

Semelhante a Assessing Impact of Links in Linked Data Using Network Measures (20)

J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
 
Literature review of attribute level and
Literature review of attribute level andLiterature review of attribute level and
Literature review of attribute level and
 
Record matching
Record matchingRecord matching
Record matching
 
Cross Domain Data Fusion
Cross Domain Data FusionCross Domain Data Fusion
Cross Domain Data Fusion
 
LPCNN: convolutional neural network for link prediction based on network stru...
LPCNN: convolutional neural network for link prediction based on network stru...LPCNN: convolutional neural network for link prediction based on network stru...
LPCNN: convolutional neural network for link prediction based on network stru...
 
Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...
Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...
Common Education Data Standards, Statewide Longitudinal Data Systems, and EDF...
 
Survey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction TechniquesSurvey on Feature Selection and Dimensionality Reduction Techniques
Survey on Feature Selection and Dimensionality Reduction Techniques
 
SEBD2015_PresentationVitali
SEBD2015_PresentationVitaliSEBD2015_PresentationVitali
SEBD2015_PresentationVitali
 
Mca projects in gagner, chennai slideshare
Mca projects in gagner, chennai   slideshareMca projects in gagner, chennai   slideshare
Mca projects in gagner, chennai slideshare
 
Mca projects in gagner
Mca projects in gagnerMca projects in gagner
Mca projects in gagner
 
Data Management.pptx
Data Management.pptxData Management.pptx
Data Management.pptx
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIESENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
 
Enhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologiesEnhancing keyword search over relational databases using ontologies
Enhancing keyword search over relational databases using ontologies
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
 
Algorithm Solved IEEE Projects 2012 2013 Java @ Seabirdssolutions
Algorithm Solved IEEE Projects 2012 2013 Java @ SeabirdssolutionsAlgorithm Solved IEEE Projects 2012 2013 Java @ Seabirdssolutions
Algorithm Solved IEEE Projects 2012 2013 Java @ Seabirdssolutions
 
Ieee projects 2012 for cse
Ieee projects 2012 for cseIeee projects 2012 for cse
Ieee projects 2012 for cse
 
Ieee projects 2012 for cse
Ieee projects 2012 for cseIeee projects 2012 for cse
Ieee projects 2012 for cse
 
Clustering heterogeneous categorical data using enhanced mini batch K-means ...
Clustering heterogeneous categorical data using enhanced mini  batch K-means ...Clustering heterogeneous categorical data using enhanced mini  batch K-means ...
Clustering heterogeneous categorical data using enhanced mini batch K-means ...
 
Instance Matching
Instance Matching Instance Matching
Instance Matching
 
Survey on Location Based Recommendation System Using POI
Survey on Location Based Recommendation System Using POISurvey on Location Based Recommendation System Using POI
Survey on Location Based Recommendation System Using POI
 

Mais de Christophe Guéret

HHAI June 2022 - KGs and Hybrid Intelligence
HHAI June 2022 - KGs and Hybrid IntelligenceHHAI June 2022 - KGs and Hybrid Intelligence
HHAI June 2022 - KGs and Hybrid IntelligenceChristophe Guéret
 
Informal presentation about RES
Informal presentation about RESInformal presentation about RES
Informal presentation about RESChristophe Guéret
 
Stop making tools! Nobody likes them anyway...
Stop making tools! Nobody likes them anyway...Stop making tools! Nobody likes them anyway...
Stop making tools! Nobody likes them anyway...Christophe Guéret
 
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...Christophe Guéret
 
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"Christophe Guéret
 
The Entity Registry System (ERS)
The Entity Registry System (ERS)The Entity Registry System (ERS)
The Entity Registry System (ERS)Christophe Guéret
 
Let's downscale the semantic web !
Let's downscale the semantic web !Let's downscale the semantic web !
Let's downscale the semantic web !Christophe Guéret
 
Your next data viz gear should be a Wii-U
Your next data viz gear should be a Wii-UYour next data viz gear should be a Wii-U
Your next data viz gear should be a Wii-UChristophe Guéret
 
The road towards a Web-based data ecosystem
The road towards a Web-based data ecosystemThe road towards a Web-based data ecosystem
The road towards a Web-based data ecosystemChristophe Guéret
 
Linked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesLinked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesChristophe Guéret
 
Downscaling information systems for education
Downscaling information systems for educationDownscaling information systems for education
Downscaling information systems for educationChristophe Guéret
 
ICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructureICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructureChristophe Guéret
 
ICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deploymentsICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deploymentsChristophe Guéret
 
Exposing the data from NARCIS with VIVO
Exposing the data from NARCIS with VIVOExposing the data from NARCIS with VIVO
Exposing the data from NARCIS with VIVOChristophe Guéret
 
Clarifier le sens de vos données publiques avec le Web de données
Clarifier le sens de vos données publiques avec le Web de donnéesClarifier le sens de vos données publiques avec le Web de données
Clarifier le sens de vos données publiques avec le Web de donnéesChristophe Guéret
 
Embedding young learners into the information society
Embedding young learners into the information societyEmbedding young learners into the information society
Embedding young learners into the information societyChristophe Guéret
 

Mais de Christophe Guéret (20)

HHAI June 2022 - KGs and Hybrid Intelligence
HHAI June 2022 - KGs and Hybrid IntelligenceHHAI June 2022 - KGs and Hybrid Intelligence
HHAI June 2022 - KGs and Hybrid Intelligence
 
Informal presentation about RES
Informal presentation about RESInformal presentation about RES
Informal presentation about RES
 
Stop making tools! Nobody likes them anyway...
Stop making tools! Nobody likes them anyway...Stop making tools! Nobody likes them anyway...
Stop making tools! Nobody likes them anyway...
 
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
 
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
 
The Entity Registry System (ERS)
The Entity Registry System (ERS)The Entity Registry System (ERS)
The Entity Registry System (ERS)
 
Let's downscale the semantic web !
Let's downscale the semantic web !Let's downscale the semantic web !
Let's downscale the semantic web !
 
Your next data viz gear should be a Wii-U
Your next data viz gear should be a Wii-UYour next data viz gear should be a Wii-U
Your next data viz gear should be a Wii-U
 
Linking knowledge spaces
Linking knowledge spacesLinking knowledge spaces
Linking knowledge spaces
 
The data behind the HuisKluis
The data behind the HuisKluisThe data behind the HuisKluis
The data behind the HuisKluis
 
Digital archiving 3.0
Digital archiving 3.0Digital archiving 3.0
Digital archiving 3.0
 
The road towards a Web-based data ecosystem
The road towards a Web-based data ecosystemThe road towards a Web-based data ecosystem
The road towards a Web-based data ecosystem
 
Linked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesLinked Open Data for Digital Humanities
Linked Open Data for Digital Humanities
 
Downscaling information systems for education
Downscaling information systems for educationDownscaling information systems for education
Downscaling information systems for education
 
ICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructureICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructure
 
ICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deploymentsICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deployments
 
ICT4D course 2013 - Sugar
ICT4D course 2013 - SugarICT4D course 2013 - Sugar
ICT4D course 2013 - Sugar
 
Exposing the data from NARCIS with VIVO
Exposing the data from NARCIS with VIVOExposing the data from NARCIS with VIVO
Exposing the data from NARCIS with VIVO
 
Clarifier le sens de vos données publiques avec le Web de données
Clarifier le sens de vos données publiques avec le Web de donnéesClarifier le sens de vos données publiques avec le Web de données
Clarifier le sens de vos données publiques avec le Web de données
 
Embedding young learners into the information society
Embedding young learners into the information societyEmbedding young learners into the information society
Embedding young learners into the information society
 

Último

DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate AgentsRyan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate AgentsRyan Mahoney
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 

Último (20)

DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate AgentsRyan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 

Assessing Impact of Links in Linked Data Using Network Measures

  • 1. Assessing Linked Data Mappings using Network Measures Christophe Guéret, Paul Groth, Claus Stadler, Jens Lehmann 9th Extended Semantic Web Conference (ESWC) May 29, 2012 http://latc-project.eu ESWC - May 2012 http://aksw.org Assessing Linked Data mappings http://www.vu.nl 1/25
  • 2. The next 25+5 minutes The impact of links in the Web of Data Main questions What is the impact of link creation? Can we detect “bad” links based on their impact? Is adding links always a good thing? Contributions A framework to assess the impact of links Results for 5 metrics ESWC - May 2012 Assessing Linked Data mappings 2/25
  • 3. Is this a good or a bad link ? ESWC - May 2012 Assessing Linked Data mappings 3/25
  • 4. Measuring the Web of Data Look at the topology using network analysis tools Impossible to get the complete graph Sampling of the graph focusing on specific nodes See the bigger picture through aggregation Build the local network around a resource Repeat the process a sufficient number of time ESWC - May 2012 Assessing Linked Data mappings 4/25
  • 5. Network sampling process Use SPARQL end point or de-reference the resources to get the descriptions ESWC - May 2012 Assessing Linked Data mappings 5/25
  • 6. Aggregation of local results Observed Target … ESWC - May 2012 Assessing Linked Data mappings 6/25
  • 7. Metrics Compute local scores for a resource Criteria Use only the local network Representative of a global property Not sensitive to change of observation scale 5 metrics currently available in LinkQA ESWC - May 2012 Assessing Linked Data mappings 7/25
  • 8. What do we want to see? Increase of connectivity within topical groups Increase chances of finding related information More bridges between topical groups Improve browsing capabilities More connectivity around hubs Decrease the dependency upon the hubs ESWC - May 2012 Assessing Linked Data mappings 8/25
  • 9. Metric 1 – Degree Metric Number of edges around the target node Target Power-law distribution of values Intuition Presence of hubs ESWC - May 2012 Assessing Linked Data mappings 9/25
  • 10. Metric 2 – Clustering coefficient Metric Density of links around the target node Target Increase clustering around nodes Intuition Topical clusters ESWC - May 2012 Assessing Linked Data mappings 10/25
  • 11. Metric 3 – Centrality Metric Ratio between outgoing and incoming links Target Lower the discrepancy between the values Intuition Hubs are sensitive ESWC - May 2012 Assessing Linked Data mappings 11/25
  • 12. Metric 4 – SameAs chains Metric Number of “open” sameAs chains Target No open sameAs Intuition Peer agreement ESWC - May 2012 Assessing Linked Data mappings 12/25
  • 13. Metric 5 – Description enrichment Metric Richness of resource description Target Increase as possible Intuition “SameAsed” resources are complementary ESWC - May 2012 Assessing Linked Data mappings 13/25
  • 14. Under the hood of LinkQA ESWC - May 2012 Assessing Linked Data mappings 14/25 http://www.flickr.com/photos/cradlehall/5747161514
  • 15. Workflow of an analysis ESWC - May 2012 Assessing Linked Data mappings 15/25
  • 16. Output of an analysis Results on the node and aggregated scale Per metric: Indication of change with respect to the target Sorted list of outlier nodes, sorted by their distance to the target Plus, a global ranking of nodes => Input for manual inspection by an expert ESWC - May 2012 Assessing Linked Data mappings 16/25
  • 17. Experimental results ESWC - May 2012 Assessing Linked Data mappings 17/25
  • 18. Global impact of links Observe the distributions to detect bad links ESWC - May 2012 Assessing Linked Data mappings 18/25
  • 19. First evaluation 160 linking specifications for Silk, developed in the context of LATC 6 linking specifications with manual verification of results 50 positive links 50 negative links Execute LinkQA with 10 samples of 50 links ESWC - May 2012 Assessing Linked Data mappings 19/25
  • 20. Results of the detection “C” if change detected in > 50% of runs ESWC - May 2012 Assessing Linked Data mappings 20/25
  • 21. Some explanations Low sensitivity of metrics: Lack of data Stable change 50/50 accuracy of detection: Targets may not be the right ones Sample may not be big enough Semantics agnostic measures are less performant ESWC - May 2012 Assessing Linked Data mappings 21/25
  • 22. A closer look at the outliers See if the outliers are necessarily bad links ESWC - May 2012 Assessing Linked Data mappings 22/25
  • 23. Second evaluation Linking specifications for Silk, developed in the context of LATC All linking specifications sampled to have 45 positive links 5 negative links Execute LinkQA five time, on five samples ESWC - May 2012 Assessing Linked Data mappings 23/25
  • 24. Rank of positive and negative links ESWC - May 2012 Assessing Linked Data mappings 24/25
  • 25. Take home message LinkQA is a node centric approach to measure the impact of links in the WoD network Scalable, can be distributed Current results show that The 5 metrics defines are to be improved Metrics considering Semantics perform better The network sample seems too small Outliers detection improves with the number of metrics ESWC - May 2012 Assessing Linked Data mappings 25/25