Ilaria Tiddi, Mathieu d’Aquin, Enrico Motta
Learning to Assess Linked Data Relationships Using Genetic Programming
@IlaTiddi
20.10.2016
15th International Semantic Web Conference (ISWC 2016)
Research Problem
Automatically discover what makes a strong relationship
between two entities in (the Web of) Linked Data.
• relationship : a semantic path between two entities
[Figure: ASongOfIceAndFire(novel) and GoT are both linked by dc:subject to ASongOfIceAndFire(topic)]
• automatically : through graph search techniques
[Figure: example graph connecting ASongOfIceAndFire(novel) and GoT through three paths: via GeorgeRRMartin and UnitedStates (:author, :born, :airedIn), via Fantasy (dc:subject, dc:subject), and via ASongOfIceAndFire(topic) (dc:subject, dc:subject)]
Research Problem
Problem
• Entities/properties in a path might come from a number
of different, unknown data sources
Solution (the easy one)
• indexing & preprocessing of a portion of Linked Data
• requires a priori knowledge and computational resources
Research Problem
Solution
• Find paths between entities through Link Traversal
• Incremental and agnostic graph exploration
• Perform uninformed (or blind) search over Linked Data
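The traversal described above can be sketched as a breadth-first search that only ever asks for the links of the entity it is currently visiting. This is a minimal illustration, not the authors' implementation: `TRIPLES`, `dereference`, and `blind_search` are hypothetical names, the edge directions are assumed, and a real traversal would fetch each entity's description over HTTP instead of scanning a local list.

```python
from collections import deque

# Toy stand-in for the Web of Linked Data, mirroring the slide example.
# Each triple is (subject, property, object); edge directions are an assumption.
TRIPLES = [
    ("ASongOfIceAndFire(novel)", ":author", "GeorgeRRMartin"),
    ("GeorgeRRMartin", ":born", "UnitedStates"),
    ("GoT", ":airedIn", "UnitedStates"),
    ("ASongOfIceAndFire(novel)", "dc:subject", "Fantasy"),
    ("GoT", "dc:subject", "Fantasy"),
    ("ASongOfIceAndFire(novel)", "dc:subject", "ASongOfIceAndFire(topic)"),
    ("GoT", "dc:subject", "ASongOfIceAndFire(topic)"),
]


def dereference(entity):
    """Hypothetical lookup: return (property, neighbour) pairs for one entity.

    Links are followed in both directions, so the graph is treated as undirected.
    """
    links = []
    for s, p, o in TRIPLES:
        if s == entity:
            links.append((p, o))
        elif o == entity:
            links.append((p, s))
    return links


def blind_search(start, goal, max_hops=4):
    """Breadth-first link traversal with no prior knowledge of the graph.

    Paths are lists alternating entities and properties:
    [entity, property, entity, ...].
    """
    paths = []
    queue = deque([[start]])
    while queue:
        path = queue.popleft()
        node = path[-1]
        if node == goal:
            paths.append(path)
            continue
        hops = (len(path) - 1) // 2
        if hops >= max_hops:
            continue
        for prop, neighbour in dereference(node):
            if neighbour not in path:  # never revisit an entity on this path
                queue.append(path + [prop, neighbour])
    return paths
```

On this toy graph, `blind_search("ASongOfIceAndFire(novel)", "GoT")` returns the three paths of the example, shortest first. Nothing tells the search which of them is the strongest, which is the gap the rest of the talk addresses.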
[Figure: step-by-step link traversal starting from ASongOfIceAndFire(novel) and GoT. The search first reaches GeorgeRRMartin (:author), Fantasy (dc:subject), and ASongOfIceAndFire(topic) (dc:subject), then UnitedStates (:born), and finally connects to GoT through dc:subject and :airedIn.]
Research Hypothesis
Problem
Uninformed searches require a cost-function to explore the
graph following the most promising paths
Hypothesis
Linked Data information can drive a cost-function that
detects strong relationships between entities
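To make the role of the cost-function concrete, the sketch below expands partial paths in order of cost using a priority queue. It is an assumption-laden illustration: `GRAPH`, `best_first_paths`, and `hub_penalty` are hypothetical, and `hub_penalty` is only a placeholder cost, not the function learnt in this work.

```python
import heapq
from itertools import count

# Toy undirected graph based on the slide example (structure is an assumption).
GRAPH = {
    "novel": ["GeorgeRRMartin", "Fantasy", "topic"],
    "GeorgeRRMartin": ["novel", "UnitedStates"],
    "UnitedStates": ["GeorgeRRMartin", "GoT"],
    "Fantasy": ["novel", "GoT"],
    "topic": ["novel", "GoT"],
    "GoT": ["UnitedStates", "Fantasy", "topic"],
}


def best_first_paths(graph, start, goal, cost):
    """Yield paths from start to goal, always expanding the cheapest partial path."""
    tie = count()  # tie-breaker so the heap never has to compare two paths
    frontier = [(cost([start]), next(tie), [start])]
    while frontier:
        _, _, path = heapq.heappop(frontier)
        node = path[-1]
        if node == goal:
            yield path
            continue
        for neighbour in graph[node]:
            if neighbour not in path:
                new_path = path + [neighbour]
                heapq.heappush(frontier, (cost(new_path), next(tie), new_path))


def hub_penalty(path):
    """Placeholder cost: sum of node degrees, so paths through hubs cost more."""
    return sum(len(GRAPH[node]) for node in path)
```

Any callable that maps a path to a number can be passed as `cost`, which is the slot a learnt function would fill.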
Research Questions
What makes a path strong?
• Which topological or semantic features of nodes/edges?
✗ e.g. length of a path?
→ entities of different datasets are connected by many paths of similar length
How can we use Linked Data to assess strong relationships?
• Which information do we need?
• Can we use structural features of the graph?
Challenges
• find topological/semantic features to detect strong relationships
• combine these features in a cost-function
• perform an effective blind search
Proposed Approach
• A set of topological/semantic characteristics of
the Linked Data graph
• a benchmark of human-evaluated relationship
paths
Identify the cost-function for a blind search that
best performs in ranking sets of alternative
relationship paths
Automatically learn a cost-function to detect strong
relationships between Linked Data entities using a
supervised method (Genetic Programming)
Proposed Approach
Genetic Programming: why?
• Flexible learning process
• Suitable for wide search spaces (such as Linked Data)
• Results assessed with a fitness (scores vs. functions)
• Human-understandable results
• Easy to integrate in a graph search
Genetic Programming
Programs (solutions for a problem)
• trees of primitives
• functions : internal nodes (mathematical or logical
operations)
• terminals : leaf nodes (constants or variables)
Fitness function (evaluation)
• how well the program solves the problem
Genetic operations (evolution)
• reproduction
• crossover from two parents
• mutation from one parent
Termination condition
• maximum number of evolutions
• a desired fitness
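As a rough illustration of these building blocks, programs can be stored as nested tuples and the genetic operations written as small tree rewrites. Everything here is a simplified sketch under assumed names (`FUNCTIONS`, `TERMINALS`, `random_tree`, `mutate`, `crossover`); the primitives and operators used in the actual experiments are richer.

```python
import operator
import random

# Function set (internal nodes) and terminal set (leaves); both are illustrative.
FUNCTIONS = {"add": operator.add, "mul": operator.mul}
TERMINALS = ["x", "y", 1.0, 2.0]


def random_tree(rng, depth=2):
    """Grow a random program: a terminal, or a tuple (function, left, right)."""
    if depth <= 0 or rng.random() < 0.3:
        return rng.choice(TERMINALS)
    name = rng.choice(sorted(FUNCTIONS))
    return (name, random_tree(rng, depth - 1), random_tree(rng, depth - 1))


def evaluate(tree, env):
    """Evaluate a program tree, looking variables up in env."""
    if isinstance(tree, tuple):
        name, left, right = tree
        return FUNCTIONS[name](evaluate(left, env), evaluate(right, env))
    return env[tree] if isinstance(tree, str) else tree


def mutate(tree, rng, depth=2):
    """Mutation (one parent): replace a randomly chosen subtree with a new one."""
    if not isinstance(tree, tuple) or rng.random() < 0.3:
        return random_tree(rng, depth)
    name, left, right = tree
    if rng.random() < 0.5:
        return (name, mutate(left, rng, depth - 1), right)
    return (name, left, mutate(right, rng, depth - 1))


def crossover(parent_a, parent_b, rng):
    """Crossover (two parents): graft a random subtree of parent_b into parent_a."""
    donor = parent_b
    while isinstance(donor, tuple) and rng.random() < 0.5:
        donor = rng.choice(donor[1:])
    if not isinstance(parent_a, tuple):
        return donor
    name, left, right = parent_a
    if rng.random() < 0.5:
        return (name, donor, right)
    return (name, left, donor)
```

For example, `evaluate(("add", ("mul", "x", 2.0), "y"), {"x": 3.0, "y": 1.0})` computes `x * 2 + y`. Reproduction is simply copying a program unchanged into the next generation.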
Genetic Programming
Procedure
• Create random population of programs based on the primitives
• Evolve population until an ideal situation is met
[Figure: illustration of an evolving population, with candidates labelled canned spaghetti, meatballs spaghetti, tomato sauced penne, and tomato sauced spaghetti, each marked ✔ or ✗]
Genetic Programming
Given
• a starting population of randomly generated cost-functions
• sets of alternative paths between two Linked Data entities,
ranked by humans
Determine how good each cost-function is in ranking paths
compared to the human evaluators
Genetic Programming
Primitives
Constant terminals
• Z= {0, 1000}
Aggregated terminals
• Topological edge weights
indegree, outdegree, constant weight
• Semantic edge weights
usage of namespaces, taxonomies, vocabularies
• Aggregators along the path
sum, avg, min, max
Functions (combining different information)
• Math operations
addition, multiplication, division, log
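One way to read the aggregated terminals is as every combination of an aggregator and an edge weight, computed over the edges of a single path. The sketch below assumes that reading; `aggregated_terminals` is a hypothetical helper, and the weight values in the usage note are made up.

```python
from statistics import mean

# The four aggregators listed on the slide.
AGGREGATORS = {"sum": sum, "avg": mean, "min": min, "max": max}


def aggregated_terminals(edge_weights):
    """Turn per-edge weights into terminals named '<aggregator>.<weight>'.

    edge_weights maps a weight name (e.g. 'in' for indegree, 'ns' for
    namespaces) to the list of values observed along one path.
    """
    terminals = {}
    for weight_name, values in edge_weights.items():
        for agg_name, aggregate in AGGREGATORS.items():
            terminals[f"{agg_name}.{weight_name}"] = aggregate(values)
    return terminals
```

With the made-up input `{"in": [3, 10, 2], "ns": [4, 1, 2]}`, the result contains entries such as `min.in`, `sum.ns`, and `max.ns`, which a program tree can then use as leaf values.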
Genetic Programming
Fitness
Normalised Discounted Cumulative Gain (nDCG)
• (IR) quality of rankings provided by search engines based on
the graded relevance of the returned documents
• how good a program is at ranking paths, compared with the human ranks
• avg(nDCG) across the dataset
• length penalty
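A minimal sketch of an nDCG-based fitness follows. It assumes the common formulation with gain `rel` and discount `log2(rank + 1)`, and that a higher program score means a stronger path; the slides do not state which nDCG variant was used, and the length penalty is not included. The names `dcg`, `ndcg`, and `fitness` are illustrative.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain of relevance grades listed in ranked order."""
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances, start=1))


def ndcg(predicted_scores, human_grades):
    """nDCG of the ranking induced by predicted_scores, judged by human grades."""
    order = sorted(range(len(human_grades)),
                   key=lambda i: predicted_scores[i], reverse=True)
    ranked = [human_grades[i] for i in order]
    ideal_dcg = dcg(sorted(human_grades, reverse=True))
    return dcg(ranked) / ideal_dcg if ideal_dcg > 0 else 0.0


def fitness(program, dataset):
    """Average nDCG of a scoring function over (paths, grades) pairs."""
    scores = [ndcg([program(path) for path in paths], grades)
              for paths, grades in dataset]
    return sum(scores) / len(scores)
```

A program that orders the paths exactly as the judges did reaches 1.0; any other ordering of differently graded paths scores lower.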
Genetic operations
• Reproduction
• Crossover
• Mutation
Learning
• Training set + test set
• Keep the fittest program of each run on the training set
• Test them on the test set (discard inconsistent ones)
Experiments
Dataset
Entities (random types from different sources)
• 12,630 events from Yago
• 8,185 people from the VIAF dataset
• 999 movies from the LMDB
• 1,174 countries/capitals from Geonames / the UNESCO dataset
Paths (a set of possible paths between them)
• select a random pair
• bidirectional breadth-first search
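For reference, a bidirectional breadth-first search grows one frontier from each entity of the pair and stops when the two meet. The sketch below returns a single connecting path on an undirected adjacency map; the experiments collected a set of paths per pair, so this shows the technique only. `bidirectional_bfs` and `TOY_GRAPH` are hypothetical names.

```python
from collections import deque

# Undirected toy graph: every edge is listed in both directions (an assumption).
TOY_GRAPH = {
    "novel": ["GeorgeRRMartin", "Fantasy"],
    "GeorgeRRMartin": ["novel", "UnitedStates"],
    "UnitedStates": ["GeorgeRRMartin", "GoT"],
    "Fantasy": ["novel", "GoT"],
    "GoT": ["UnitedStates", "Fantasy"],
}


def bidirectional_bfs(graph, source, target):
    """Return one path connecting source and target, or None if there is none."""
    if source == target:
        return [source]
    parents_fwd, parents_bwd = {source: None}, {target: None}
    frontier_fwd, frontier_bwd = deque([source]), deque([target])

    def expand(frontier, parents, other_parents):
        # Expand one full level; return the meeting node if the searches touch.
        for _ in range(len(frontier)):
            node = frontier.popleft()
            for neighbour in graph.get(node, ()):
                if neighbour not in parents:
                    parents[neighbour] = node
                    if neighbour in other_parents:
                        return neighbour
                    frontier.append(neighbour)
        return None

    def trace(node, parents):
        # Walk parent pointers back to the search root.
        path = []
        while node is not None:
            path.append(node)
            node = parents[node]
        return path

    while frontier_fwd and frontier_bwd:
        # Grow the smaller frontier first to keep the two searches balanced.
        if len(frontier_fwd) <= len(frontier_bwd):
            meet = expand(frontier_fwd, parents_fwd, parents_bwd)
        else:
            meet = expand(frontier_bwd, parents_bwd, parents_fwd)
        if meet is not None:
            return trace(meet, parents_fwd)[::-1] + trace(meet, parents_bwd)[1:]
    return None
```

Searching from both ends keeps each frontier shallow, which matters when every expansion means dereferencing entities from remote datasets.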
Assessment
• 100 pairs (~10 possible paths per pair)
• 8 judges
• from (2) highly relevant to (0) not relevant
[Figure: example paths among viaf:Dina-Korzun, db:Dina-Korzun, gn:United-Kingdom, gn:Europe, and lmdb:TheSkinGame, using the properties owl:sameAs, dbo:citizenship, gno:parentFeature, and foaf:based_near]
Experiments
Results
Different runs (fitness on training set/test set)
(T) Topological primitives only
(S) Topological + semantic primitives
(N) Topological + namespaces primitives
Run  Best program                               Fitness TR  Fitness TS
T1   log(log(min.cd × min.cd))/max.cd           0.79        0.79
T2   log(min.cd)/(avg.cd + 87)                  0.77        0.78
T3   min.cd × (min.cd/max.cd)                   0.78        0.72
N1   (log((max.ns/max.cd))/avg.ns) + min.ns     0.82        0.81
N2   ((min.dg/sum.cd)/sum.ou) + min.ns          0.79        0.77
N3   min.ns/(log(max.cd)/avg.ns)                0.83        0.75
S1   min.ns + (sum.ns/log(log(sum.si)))         0.88        0.83
S2   min.ns + (min.cd/log(log(sum.si)))         0.88        0.86
S3   min.ns + (log(max.in)/log(log(sum.si)))    0.87        0.86
Experiments
Results
Lower performance for T-runs and N-runs
Recurrent terminals
• conditional degree (node degree depending on the RDF triple)
• namespace variety
• number of topic properties (dc:subject/skos:broader/foaf:primaryTopic)
Experiments
Comparative evaluation
Best programs
• automatically learnt
vs. literature functions
• RECAP, RelFinder, Everything Is Connected Engine, Moore et al.
• ad-hoc / handcrafted information-theoretical measures
Experiments
Which cost-function?
Interpretation
• pass through nodes with rich node descriptions
higher min_namespaces = higher path score
• not high level entities / few topic categories
few incoming topic categories = higher path score
• more specific entities (not hubs) for paths with few topic categories
ratio conditional_degree / inTopicCategories
→ specific paths are privileged over general paths
min_namespaces + (min_conditionalDegree / log(log(sum_inTopicCategories)))
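The selected cost-function (program S2 in the results table) can be written as a small scoring routine. This is a sketch under assumptions: `path_score` is a hypothetical name, natural logarithms are assumed because the slides do not state the base, and the guard against small totals is an addition of this example.

```python
import math


def path_score(namespaces, conditional_degrees, in_topic_categories):
    """min_namespaces + min_conditionalDegree / log(log(sum_inTopicCategories)).

    Each argument lists one value per step of the path.
    """
    total_topics = sum(in_topic_categories)
    if total_topics <= math.e:
        # Assumption of this sketch: log(log(x)) is positive only for x > e.
        raise ValueError("sum of topic categories must exceed e")
    denominator = math.log(math.log(total_topics))
    return min(namespaces) + min(conditional_degrees) / denominator
```

With the other inputs fixed, a path whose nodes have few incoming topic categories scores higher than one passing through heavily categorised nodes, matching the interpretation that specific paths are preferred over general ones.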
Conclusions
Contributions
A measure to detect strong relationships in Linked Data
✔ can be integrated in uninformed searches over Linked Data
vs. indexing/pre-processing techniques
✔ derived empirically through Genetic Programming
vs. domain-specific / handcrafted measures
✔ what is important in Linked Data
topological features + little knowledge about the edge vocabulary
Future work
• Integrate the measure in the blind-search process
• Explore more characteristics
• Improve the measure
THANK YOU VERY MUCH
(AND DO NOT MESS WITH ITALIAN FOOD)
Questions?
IlaTiddi ilaria.tiddi@open.ac.uk

Mais conteúdo relacionado

Mais procurados

The Network Data Structure in Computing
The Network Data Structure in ComputingThe Network Data Structure in Computing
The Network Data Structure in ComputingMarko Rodriguez
 
Named Entity Recognition from Online News
Named Entity Recognition from Online NewsNamed Entity Recognition from Online News
Named Entity Recognition from Online NewsBernardo Najlis
 
Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...
Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...
Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...Jimmy Lai
 
DB-IR-ranking
DB-IR-rankingDB-IR-ranking
DB-IR-rankingFELIX75
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsEnrico Palumbo
 
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...andimou
 
Assessing and Refining Mappings to RDF to Improve Dataset Quality
Assessing and Refining Mappings to RDF to Improve Dataset QualityAssessing and Refining Mappings to RDF to Improve Dataset Quality
Assessing and Refining Mappings to RDF to Improve Dataset Qualityandimou
 
Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics OverviewTony Fast
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Anubhav Jain
 
ML+Hadoop at NYC Predictive Analytics
ML+Hadoop at NYC Predictive AnalyticsML+Hadoop at NYC Predictive Analytics
ML+Hadoop at NYC Predictive AnalyticsErik Bernhardsson
 
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...Sease
 
Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...
Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...
Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...Agnieszka Ławrynowicz
 
Modular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxologyModular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxologyFrank van Harmelen
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyChris Johnson
 
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...PyData
 
A Generic Language for Integrated RDF Mappings of Heterogeneous Data
A Generic Language for Integrated RDF Mappings of Heterogeneous DataA Generic Language for Integrated RDF Mappings of Heterogeneous Data
A Generic Language for Integrated RDF Mappings of Heterogeneous Dataandimou
 
Detection of Related Semantic Datasets Based on Frequent Subgraph Mining
Detection of Related Semantic Datasets Based on Frequent Subgraph MiningDetection of Related Semantic Datasets Based on Frequent Subgraph Mining
Detection of Related Semantic Datasets Based on Frequent Subgraph MiningMikel Emaldi Manrique
 

Mais procurados (19)

The Network Data Structure in Computing
The Network Data Structure in ComputingThe Network Data Structure in Computing
The Network Data Structure in Computing
 
Named Entity Recognition from Online News
Named Entity Recognition from Online NewsNamed Entity Recognition from Online News
Named Entity Recognition from Online News
 
Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...
Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...
Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...
 
DB-IR-ranking
DB-IR-rankingDB-IR-ranking
DB-IR-ranking
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender Systems
 
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
 
Entity2rec recsys
Entity2rec recsysEntity2rec recsys
Entity2rec recsys
 
Assessing and Refining Mappings to RDF to Improve Dataset Quality
Assessing and Refining Mappings to RDF to Improve Dataset QualityAssessing and Refining Mappings to RDF to Improve Dataset Quality
Assessing and Refining Mappings to RDF to Improve Dataset Quality
 
Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics Overview
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
 
DB and IR Integration
DB and IR IntegrationDB and IR Integration
DB and IR Integration
 
ML+Hadoop at NYC Predictive Analytics
ML+Hadoop at NYC Predictive AnalyticsML+Hadoop at NYC Predictive Analytics
ML+Hadoop at NYC Predictive Analytics
 
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
 
Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...
Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...
Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...
 
Modular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxologyModular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxology
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at Spotify
 
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
 
A Generic Language for Integrated RDF Mappings of Heterogeneous Data
A Generic Language for Integrated RDF Mappings of Heterogeneous DataA Generic Language for Integrated RDF Mappings of Heterogeneous Data
A Generic Language for Integrated RDF Mappings of Heterogeneous Data
 
Detection of Related Semantic Datasets Based on Frequent Subgraph Mining
Detection of Related Semantic Datasets Based on Frequent Subgraph MiningDetection of Related Semantic Datasets Based on Frequent Subgraph Mining
Detection of Related Semantic Datasets Based on Frequent Subgraph Mining
 

Destaque

Publishing and Using Linked Data
Publishing and Using Linked DataPublishing and Using Linked Data
Publishing and Using Linked Dataostephens
 
Linked Open Data Principles, benefits of LOD for sustainable development
Linked Open Data Principles, benefits of LOD for sustainable developmentLinked Open Data Principles, benefits of LOD for sustainable development
Linked Open Data Principles, benefits of LOD for sustainable developmentMartin Kaltenböck
 
An Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF GraphsAn Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF GraphsNikolaos Konstantinou
 
Materializing the Web of Linked Data
Materializing the Web of Linked DataMaterializing the Web of Linked Data
Materializing the Web of Linked DataNikolaos Konstantinou
 
Incremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF GraphsIncremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF GraphsNikolaos Konstantinou
 
Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Nikolaos Konstantinou
 
Deploying Linked Open Data: Methodologies and Software Tools
Deploying Linked Open Data: Methodologies and Software ToolsDeploying Linked Open Data: Methodologies and Software Tools
Deploying Linked Open Data: Methodologies and Software ToolsNikolaos Konstantinou
 
Introduction: Linked Data and the Semantic Web
Introduction: Linked Data and the Semantic WebIntroduction: Linked Data and the Semantic Web
Introduction: Linked Data and the Semantic WebNikolaos Konstantinou
 
Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]Marcia Zeng
 
Entity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and EvaluationEntity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and EvaluationFaegheh Hasibi
 
From Research to Innovation: Linked Open Data and Gamification to Design Inte...
From Research to Innovation: Linked Open Data and Gamification to Design Inte...From Research to Innovation: Linked Open Data and Gamification to Design Inte...
From Research to Innovation: Linked Open Data and Gamification to Design Inte...Ig Bittencourt
 
Linked Data tutorial at Semtech 2012
Linked Data tutorial at Semtech 2012Linked Data tutorial at Semtech 2012
Linked Data tutorial at Semtech 2012Juan Sequeda
 
Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Juan Sequeda
 
Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Juan Sequeda
 
Méthodes et outils pour interrelier le web des données
Méthodes et outils pour interrelier le web des donnéesMéthodes et outils pour interrelier le web des données
Méthodes et outils pour interrelier le web des donnéesFrançois Scharffe
 
RDF Tutorial - SPARQL 20091031
RDF Tutorial - SPARQL 20091031RDF Tutorial - SPARQL 20091031
RDF Tutorial - SPARQL 20091031kwangsub kim
 
Consuming Linked Data SemTech2010
Consuming Linked Data SemTech2010Consuming Linked Data SemTech2010
Consuming Linked Data SemTech2010Juan Sequeda
 

Destaque (20)

Publishing and Using Linked Data
Publishing and Using Linked DataPublishing and Using Linked Data
Publishing and Using Linked Data
 
Linked Open Data Principles, benefits of LOD for sustainable development
Linked Open Data Principles, benefits of LOD for sustainable developmentLinked Open Data Principles, benefits of LOD for sustainable development
Linked Open Data Principles, benefits of LOD for sustainable development
 
An Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF GraphsAn Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF Graphs
 
Technical Background
Technical BackgroundTechnical Background
Technical Background
 
Conclusions: Summary and Outlook
Conclusions: Summary and OutlookConclusions: Summary and Outlook
Conclusions: Summary and Outlook
 
Materializing the Web of Linked Data
Materializing the Web of Linked DataMaterializing the Web of Linked Data
Materializing the Web of Linked Data
 
Incremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF GraphsIncremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF Graphs
 
Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...
 
Deploying Linked Open Data: Methodologies and Software Tools
Deploying Linked Open Data: Methodologies and Software ToolsDeploying Linked Open Data: Methodologies and Software Tools
Deploying Linked Open Data: Methodologies and Software Tools
 
Introduction: Linked Data and the Semantic Web
Introduction: Linked Data and the Semantic WebIntroduction: Linked Data and the Semantic Web
Introduction: Linked Data and the Semantic Web
 
Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]
 
Publishing Linked Data from RDB
Publishing Linked Data from RDBPublishing Linked Data from RDB
Publishing Linked Data from RDB
 
Entity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and EvaluationEntity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and Evaluation
 
From Research to Innovation: Linked Open Data and Gamification to Design Inte...
From Research to Innovation: Linked Open Data and Gamification to Design Inte...From Research to Innovation: Linked Open Data and Gamification to Design Inte...
From Research to Innovation: Linked Open Data and Gamification to Design Inte...
 
Linked Data tutorial at Semtech 2012
Linked Data tutorial at Semtech 2012Linked Data tutorial at Semtech 2012
Linked Data tutorial at Semtech 2012
 
Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Introduction to Linked Data 1/5
Introduction to Linked Data 1/5
 
Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011
 
Méthodes et outils pour interrelier le web des données
Méthodes et outils pour interrelier le web des donnéesMéthodes et outils pour interrelier le web des données
Méthodes et outils pour interrelier le web des données
 
RDF Tutorial - SPARQL 20091031
RDF Tutorial - SPARQL 20091031RDF Tutorial - SPARQL 20091031
RDF Tutorial - SPARQL 20091031
 
Consuming Linked Data SemTech2010
Consuming Linked Data SemTech2010Consuming Linked Data SemTech2010
Consuming Linked Data SemTech2010
 

Semelhante a Learning to assess Linked Data relationships using Genetic Programming

The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?Frank van Harmelen
 
Neo4j MeetUp - Graph Exploration with MetaExp
Neo4j MeetUp - Graph Exploration with MetaExpNeo4j MeetUp - Graph Exploration with MetaExp
Neo4j MeetUp - Graph Exploration with MetaExpAdrian Ziegler
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273Abutest
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273Abutest
 
Synthetic Data Generation using exponential random Graph modeling
Synthetic Data Generation using exponential random Graph modelingSynthetic Data Generation using exponential random Graph modeling
Synthetic Data Generation using exponential random Graph modelingGraph-TA
 
Machine Learning ebook.pdf
Machine Learning ebook.pdfMachine Learning ebook.pdf
Machine Learning ebook.pdfHODIT12
 
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 11_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1MostafaHazemMostafaa
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273Abutest
 
R, Data Wrangling & Kaggle Data Science Competitions
R, Data Wrangling & Kaggle Data Science CompetitionsR, Data Wrangling & Kaggle Data Science Competitions
R, Data Wrangling & Kaggle Data Science CompetitionsKrishna Sankar
 
Follow the money with graphs
Follow the money with graphsFollow the money with graphs
Follow the money with graphsStanka Dalekova
 
04-Data-Analysis-Overview.pptx
04-Data-Analysis-Overview.pptx04-Data-Analysis-Overview.pptx
04-Data-Analysis-Overview.pptxShree Shree
 
Open Analytics Environment
Open Analytics EnvironmentOpen Analytics Environment
Open Analytics EnvironmentIan Foster
 
know Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfknow Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfhemangppatel
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Jonathan Stray
 
Spark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scaleSpark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scaleAndy Petrella
 
Geneticalgorithms 100403002207-phpapp02
Geneticalgorithms 100403002207-phpapp02Geneticalgorithms 100403002207-phpapp02
Geneticalgorithms 100403002207-phpapp02Amna Saeed
 

Semelhante a Learning to assess Linked Data relationships using Genetic Programming (20)

The Web of Data: do we actually understand what we built?

Learning to assess Linked Data relationships using Genetic Programming

  • 1. Ilaria Tiddi, Mathieu d’Aquin, Enrico Motta Learning to Assess Linked Data Relationships Using Genetic Programming @IlaTiddi 20.10.2016 15th International Semantic Web Conference (ISWC 2016)
  • 2. Research Problem: Automatically discover what makes a strong relationship between two entities in (the Web of) Linked Data. • relationship: a semantic path between two entities. [Figure: nodes ASongOfIceAndFire(novel), GoT, ASongOfIceAndFire(topic); edges dc:subject]
  • 3. Research Problem: Automatically discover what makes a strong relationship between two entities in (the Web of) Linked Data. • relationship: a semantic path between two entities • automatically: through graph search techniques. [Figure: nodes ASongOfIceAndFire(novel), GoT, GeorgeRRMartin, UnitedStates, ASongOfIceAndFire(topic), Fantasy; edges :author, :born, :airedIn, dc:subject]
  • 4. Research Problem. Problem: • Entities/properties in a path might come from a number of different, unknown data sources. Solution (the easy one): • indexing & preprocessing of a portion of Linked Data • requires a priori knowledge and computational resources. [Figure: same graph as slide 3]
  • 5-12. Research Problem. Solution: • Find paths between entities through Link Traversal • Incremental and agnostic graph exploration • Perform uninformed (or blind) search over Linked Data. [Figure, built up over eight slides: the graph is explored step by step starting from ASongOfIceAndFire(novel) and GoT, adding the nodes GeorgeRRMartin, ASongOfIceAndFire(topic), Fantasy and UnitedStates with the edges :author, dc:subject, :born and :airedIn]
  • 13. Research Hypothesis. Problem: uninformed searches require a cost-function to explore the graph following the most promising paths. Hypothesis: Linked Data information can drive a cost-function that detects strong relationships between entities. [Figure: graph linking ASongOfIceAndFire(novel) and GoT through GeorgeRRMartin, UnitedStates, ASongOfIceAndFire(topic) and Fantasy; edges :author, :born, :airedIn, dc:subject]
  • 14. Research Questions. What makes a path strong? • Which topological or semantic features of nodes/edges? ✗ e.g. length of a path? → entities of different datasets are connected by many paths of similar length. How can we use Linked Data to assess strong relationships? • Which information do we need? • Can we use structural features of the graph? Challenges: • find topological/semantic features to detect strong relationships • combine these features in a cost-function • perform an effective blind search
  • 15. Proposed Approach • A set of topological/semantic characteristics of the Linked Data graph • a benchmark of human-evaluated relationship paths Identify the cost-function for a blind search that best performs in ranking sets of alternative relationship paths Automatically learn a cost-function to detect strong relationships between Linked Data entities using a supervised method (Genetic Programming)
  • 16. Proposed Approach. Genetic Programming: why? • Flexible learning process • Suitable for wide search spaces (such as Linked Data) • Results assessed with a fitness (scores vs. functions) • Human-understandable results • Easy to integrate in a graph search. Automatically learn a cost-function to detect strong relationships between Linked Data entities using a supervised method (Genetic Programming)
  • 17. Genetic Programming Programs (solutions for a problem) • trees of primitives • functions : internal nodes (mathematical or logical operations) • terminals : leaf nodes (constants or variables) Fitness function (evaluation) • how well the program solves the problem Genetic operations (evolution) • reproduction • crossover from two parents • mutation from one parent Termination condition • maximum number of evolutions • a desired fitness
  • 18. Genetic Programming: Procedure. • Create random population of programs based on the primitives • Evolve population until an ideal situation is met. [Figure: candidate dishes marked ✗ or ✔: canned spaghetti, meatballs spaghetti, tomato sauced penne, tomato sauced spaghetti]
  • 19. Genetic Programming. Given: • a starting population of randomly generated cost-functions • sets of alternative paths between two Linked Data entities, ranked by humans. Determine how good each cost-function is in ranking paths compared to the human evaluators. [Figure: same candidate dishes as slide 18]
  • 20. Genetic Programming: Primitives. Constant terminals: • Z= {0, 1000}. Aggregated terminals: • Topological edge weights: indegree, outdegree, constant weight • Semantic edge weights: usage of namespaces, taxonomies, vocabularies • Aggregators along the path: sum, avg, min, max. Functions (combining different information): • Math operations: addition, multiplication, division, log
  • 21. Genetic Programming: Fitness. Normalised Discounted Cumulative Gain (nDCG): • (IR) quality of rankings provided by search engines based on the graded relevance of the returned documents • how good a program is at ranking paths based on human ranks • avg(nDCG) across the dataset • length penalty. Genetic operations: • Reproduction • Crossover • Mutation. Learning: • Training set + test set • Keep the fittest program for each run on the training set • Test them (discard inconsistent)
  • 22. Experiments: Dataset. Entities (random types from different sources): • 12,630 events from Yago • 8,185 people from the VIAF dataset • 999 movies from the LMDB • 1,174 countries/capitals from Geonames / the UNESCO dataset. Paths (a set of possible paths between them): • select a random pair • bidirectional breadth-first search. Assessment: • 100 pairs (~10 possible paths per pair) • 8 judges • from (2) highly relevant to (0) not relevant. [Figure: example path with nodes db:Dina-Korzun, viaf:Dina-Korzun, gn:Europe, gn:United-Kingdom, lmdb:TheSkinGame; edges owl:sameAs, dbo:citizenship, gno:parentFeature, foaf:based_near]
  • 23. Experiments: Results. Different runs (fitness on training set TR / test set TS): (T) topological primitives only; (S) topological + semantic primitives; (N) topological + namespaces primitives.
      Run | Best program | Fitness TR | Fitness TS
      T1 | log(log(min.cd × min.cd))/max.cd | 0.79 | 0.79
      T2 | log(min.cd)/(avg.cd + 87) | 0.77 | 0.78
      T3 | min.cd × (min.cd/max.cd) | 0.78 | 0.72
      N1 | (log((max.ns/max.cd))/avg.ns) + min.ns | 0.82 | 0.81
      N2 | ((min.dg/sum.cd)/sum.ou) + min.ns | 0.79 | 0.77
      N3 | min.ns/(log(max.cd)/avg.ns) | 0.83 | 0.75
      S1 | min.ns + (sum.ns/log(log(sum.si))) | 0.88 | 0.83
      S2 | min.ns + (min.cd/log(log(sum.si))) | 0.88 | 0.86
      S3 | min.ns + (log(max.in)/log(log(sum.si))) | 0.87 | 0.86
  • 24. Experiments: Results. Lower performance for T-runs and N-runs. Recurrent terminals: • conditional degree (node degree depending on the RDF triple) • namespace variety • number of topic properties (dc:subject/skos:broader/foaf:primaryTopic). [Table: same results table as slide 23]
  • 25. Experiments: Comparative evaluation. Best programs: • automatically learnt vs. literature functions • RECAP, RelFinder, Everything Is Connected Engine, Moore et al. • ad-hoc / handcrafted information-theoretical measures
  • 26. Experiments: Which cost-function? Cost-function: min_namespaces + min_conditionalDegree / log(log(sum_inTopicCategories)). Interpretation: • pass through nodes with rich node descriptions: higher min_namespaces = higher path score • not high-level entities / few topic categories: few incoming topic categories = higher path score • more specific entities (not hubs) for paths with few topic categories: ratio conditional_degree / inTopicCategories → specific paths are privileged over general paths
  • 27. Conclusions. Contributions: a measure to detect strong relationships in Linked Data → can be integrated in uninformed searches over Linked Data (vs. indexing/pre-processing techniques) → derived empirically through Genetic Programming (vs. domain-specific / handcrafted measures) → what is important in Linked Data: topological features + little knowledge about the edge vocabulary. Future work: • Integrate the measure in the blind-search process • Explore more characteristics • Improve the measure
  • 28. THANK YOU VERY MUCH (AND DO NOT MESS UP WITH ITALIAN FOOD) Questions? IlaTiddi ilaria.tiddi@open.ac.uk
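
The fitness described on slide 21 can be illustrated with a minimal Python sketch. It is not the authors' implementation: the linear-gain, log2-discount form of DCG is one common variant chosen here as an assumption, the length penalty mentioned on the slide is left out, and the names `dcg`, `ndcg`, `fitness` and the toy data are hypothetical.

```python
import math

def dcg(grades):
    """Discounted cumulative gain of graded relevances given in ranked order.
    Assumption: linear gain with a log2 position discount."""
    return sum(g / math.log2(i + 2) for i, g in enumerate(grades))

def ndcg(grades_in_predicted_order):
    """DCG of the predicted order divided by the DCG of the ideal order."""
    ideal = dcg(sorted(grades_in_predicted_order, reverse=True))
    return dcg(grades_in_predicted_order) / ideal if ideal > 0 else 0.0

def fitness(program, dataset):
    """Average nDCG of a candidate cost-function over all entity pairs.

    dataset: one list per entity pair, holding (features, human_grade) tuples.
    program: callable mapping a feature dict to a score (higher = stronger).
    The length penalty from the slides is not modelled here.
    """
    scores = []
    for paths in dataset:
        ranked = sorted(paths, key=lambda p: program(p[0]), reverse=True)
        scores.append(ndcg([grade for _, grade in ranked]))
    return sum(scores) / len(scores)

# Hypothetical toy data: one entity pair, three paths graded 0-2 by judges.
toy = [[({"min_ns": 3}, 2), ({"min_ns": 1}, 0), ({"min_ns": 2}, 1)]]
good = fitness(lambda f: f["min_ns"], toy)    # ranks paths like the judges
bad = fitness(lambda f: -f["min_ns"], toy)    # ranks paths in reverse
```

A program that orders the paths as the judges did reaches the maximum fitness of 1, while the reversed ordering scores lower, which is the signal the evolution step selects on.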

Editor's Notes

  1. You need to know these datasets; computational efforts that are not necessarily required.
  2. LT which allows this is equivalent to performing
  3. to avoid inconclusive searches
  4. There is a series of questions to be answered and, if so, the challenges are effective = com
  5. A set of possible topological or semantic features of the nodes and edges in LD
  6. A set of possible topological or semantic features of the nodes and edges in LD
  7. A set of possible topological or semantic features of the nodes and edges in LD
  8. A set of possible topological or semantic features of the nodes and edges in LD
  9. fitting GP to our problem
  10. combination of edge weighting functions
  11. Given this dataset: unweighted fitness on trainset/testset
  12. Unweighted fitness on trainset/testset
  13. RelFinder, Recap, Everything is connected Engine, Moore et al.
  14. Paths representing the strongest relationships. In very simple words, the function prioritises specific paths (e.g. a movie and a person are based in the same region) over more general paths (e.g. a movie and a person are based in the same country). Only specific entities (not hubs) for paths with a small number of topic categories (the ratio between min.cd and log(log(sum.si)) is negative if sum.si is lower than 10). Dataset stability: when removing entities from one data source at a time, the S-run programs remain consistent.
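
The cost-function from slide 26 (program S2 in the results table) can be evaluated with a short Python sketch. This is an illustration under stated assumptions rather than the authors' code: base-10 logarithms are assumed because note 14 says the ratio turns negative when sum.si is lower than 10, and the function name `path_score`, its parameter names and the input guards are hypothetical.

```python
import math

def path_score(min_namespaces, min_conditional_degree, sum_in_topic_categories):
    """Score a path as min.ns + min.cd / log(log(sum.si)).

    Assumption: log is base 10, which matches the note that the ratio is
    negative when sum.si is lower than 10.
    """
    if sum_in_topic_categories <= 1:
        # log10(x) <= 0 here, so the outer logarithm is undefined.
        raise ValueError("sum_in_topic_categories must be greater than 1")
    denominator = math.log10(math.log10(sum_in_topic_categories))
    if denominator == 0:
        # Happens exactly at sum.si == 10; the division is undefined.
        raise ValueError("sum_in_topic_categories must not equal 10")
    return min_namespaces + min_conditional_degree / denominator

# Hypothetical feature values for two paths with equal min.ns and min.cd.
many_topics = path_score(2, 3, 100)   # ratio is positive (sum.si > 10)
few_topics = path_score(2, 3, 5)      # ratio is negative (sum.si < 10)
```

The two guards mark the inputs for which the double logarithm is undefined; how the original system treated those cases is not stated in the slides.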