SlideShare uma empresa Scribd logo
1 de 22
A Domain-Agnostic Tool for Scalable Ontology
Population and Enrichment from Diverse Linked
Data Sources
Efstratios Kontopoulos, Panagiotis Mitzias, Marina Riga, Ioannis Kompatsiaris
Aim of this Work
 Issues:
 Increasing interest in building ontologies
 Various sources of Linked Open Data (LOD)
 Manual ontology population is time-consuming and error-prone
 Aims:
 Facilitate re-use of knowledge
 Automate instance extraction and population
 Support most LOD sources
Outline
 PROPheT
 Key features
 Architecture & core components
 Use case
 User evaluation
 Conclusions and future work
PROPheT – PERICLES Ontology Population Tool
 GUI-equipped instance extraction and population engine
About PERICLES
 PERICLES: Promoting and Enhancing Reuse of Information
throughout the Content Lifecycle taking account of Evolving
Semantics
 Funding: FP7, ICT Call 9, objective ICT-2011.4.3 Digital
Preservation, €10M budget
 Duration: 2013-2017
 Website: http://pericles-project.eu/
PROPheT – Key Features
 Three modes of instance population:
 class-based populating
 instance-based populating
 instance enrichment
 User-driven mapping of classes and data properties
 Exporting populated ontology in popular formats (.owl, .rdf,
.ttl, .nt and .n3.)
 Flexibility to work with any domain ontology and any
SPARQL endpoint
PROPheT – Architecture
PROPheT – Core Components
 My Model (MM) – e.g. my_ontology.owl
 External Model (EM) – e.g. DBpedia
 Extraction Module (search mechanisms)
 Mapping Module
 Storage Module (database)
 Export Module
PROPheT – Search Mechanisms
 Search by class
e.g. find instances of class dbo:Artist
 Search by instance label
e.g. find instances with label Picasso
 Search by existing instance
e.g. find instances similar to Picasso instance
 Enrich existing instance
e.g. find data properties for Picasso instance
PROPheT – Mapping Module
 Define My Model class to import new instances
 Map External Model properties to My Model properties
 Store new mappings to database for future use
PROPheT – Use Case Scenario
Scenario: “I want to populate my Cities domain ontology with instances of cities and
towns from ENVO and LinkedGeoData”
Step 1: Load my ontology to PROPheT
Step 2: Search for Instances
Step 3: Select Desired Results
Step 4: Select Class to Import Instance
Step 5: Perform Property Mapping
Step 6: Review Results
Step 7: My Populated Model
Use Case Population Times
Class Instances Population time (sec)
LinkedGeoData City 10.000 120
ENVO City 10.000 204
LinkedGeoData Town 10.000 158
Factors:
 Endpoint response speed
 Number of datatype properties/values per instance
Instance Retrieval and Population Times
Ontology Instances Retrieval time (sec) Population time (sec)
DBPedia 10.000 648 250
OpenData
Communities
10.000 510 210
DBLP 10.000 316 192
Nobel Prize 10.000 270 170
Eurostat 10.000 440 225
PROPheT - User Evaluation
 15 participants with Computer Science background
 80% of them familiar with ontologies
Conclusions & Future Work
 Conclusions:
 It is user-friendly
 It makes ontology population simple
 No SPARQL experience is required
 Future work:
 Optimize overall speed
 Handle object properties
 Make it a Protégé plugin
 Add more search options (e.g. by class label)

Mais conteúdo relacionado

Semelhante a A Domain-Agnostic Tool for Scalable Ontology Population and Enrichment from Diverse Linked Data Sources

Laurence Sigler (2023) Content management, ecommerce and interoperability fra...
Laurence Sigler (2023) Content management, ecommerce and interoperability fra...Laurence Sigler (2023) Content management, ecommerce and interoperability fra...
Laurence Sigler (2023) Content management, ecommerce and interoperability fra...
Francisco Javier Mora Serrano
 

Semelhante a A Domain-Agnostic Tool for Scalable Ontology Population and Enrichment from Diverse Linked Data Sources (20)

Building a modern in-house analytics pipeline
Building a modern in-house analytics pipelineBuilding a modern in-house analytics pipeline
Building a modern in-house analytics pipeline
 
PERICLES Workflow for the automated updating of Digital Ecosystem Models with...
PERICLES Workflow for the automated updating of Digital Ecosystem Models with...PERICLES Workflow for the automated updating of Digital Ecosystem Models with...
PERICLES Workflow for the automated updating of Digital Ecosystem Models with...
 
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
 
44rd CEN WS/LT meeting PT social data
44rd CEN WS/LT meeting PT social data44rd CEN WS/LT meeting PT social data
44rd CEN WS/LT meeting PT social data
 
Data models for preserving and publishing digital research material beyond th...
Data models for preserving and publishing digital research material beyond th...Data models for preserving and publishing digital research material beyond th...
Data models for preserving and publishing digital research material beyond th...
 
MLOps pipelines using MLFlow - From training to production
MLOps pipelines using MLFlow - From training to productionMLOps pipelines using MLFlow - From training to production
MLOps pipelines using MLFlow - From training to production
 
dishank CV
dishank CVdishank CV
dishank CV
 
Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708
 
UCIAD overview
UCIAD overviewUCIAD overview
UCIAD overview
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchange
 
Laurence Sigler (2023) Content management, ecommerce and interoperability fra...
Laurence Sigler (2023) Content management, ecommerce and interoperability fra...Laurence Sigler (2023) Content management, ecommerce and interoperability fra...
Laurence Sigler (2023) Content management, ecommerce and interoperability fra...
 
Cytoscape: Now and Future
Cytoscape: Now and FutureCytoscape: Now and Future
Cytoscape: Now and Future
 
Planetdata simpda
Planetdata simpdaPlanetdata simpda
Planetdata simpda
 
PlanetData: Consuming Structured Data at Web Scale
PlanetData: Consuming Structured Data at Web ScalePlanetData: Consuming Structured Data at Web Scale
PlanetData: Consuming Structured Data at Web Scale
 
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
 
Deep Learning with CNTK
Deep Learning with CNTKDeep Learning with CNTK
Deep Learning with CNTK
 
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSABetter Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
 
A Cultural Heritage Repository as Source for Learning Materials
A Cultural Heritage Repository as Source for Learning MaterialsA Cultural Heritage Repository as Source for Learning Materials
A Cultural Heritage Repository as Source for Learning Materials
 
Proof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics InteroperabilityProof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics Interoperability
 
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
 

Último

%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
masabamasaba
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 

Último (20)

%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
 
%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 

A Domain-Agnostic Tool for Scalable Ontology Population and Enrichment from Diverse Linked Data Sources

  • 1. A Domain-Agnostic Tool for Scalable Ontology Population and Enrichment from Diverse Linked Data Sources Efstratios Kontopoulos, Panagiotis Mitzias, Marina Riga, Ioannis Kompatsiaris
  • 2. Aim of this Work  Issues:  Increasing interest in building ontologies  Various sources of Linked Open Data (LOD)  Manual ontology population is time-consuming and error-prone  Aims:  Facilitate re-use of knowledge  Automate instance extraction and population  Support most LOD sources
  • 3. Outline  PROPheT  Key features  Architecture & core components  Use case  User evaluation  Conclusions and future work
  • 4. PROPheT – PERICLES Ontology Population Tool  GUI-equipped instance extraction and population engine
  • 5. About PERICLES  PERICLES: Promoting and Enhancing Reuse of Information throughout the Content Lifecycle taking account of Evolving Semantics  Funding: FP7, ICT Call 9, objective ICT-2011.4.3 Digital Preservation, €10M budget  Duration: 2013-2017  Website: http://pericles-project.eu/
  • 6. PROPheT – Key Features  Three modes of instance population:  class-based populating  instance-based populating  instance enrichment  User-driven mapping of classes and data properties  Exporting populated ontology in popular formats (.owl, .rdf, .ttl, .nt and .n3.)  Flexibility to work with any domain ontology and any SPARQL endpoint
  • 8. PROPheT – Core Components  My Model (MM) – e.g. my_ontology.owl  External Model (EM) – e.g. DBpedia  Extraction Module (search mechanisms)  Mapping Module  Storage Module (database)  Export Module
  • 9. PROPheT – Search Mechanisms  Search by class e.g. find instances of class dbo:Artist  Search by instance label e.g. find instances with label Picasso  Search by existing instance e.g. find instances similar to Picasso instance  Enrich existing instance e.g. find data properties for Picasso instance
  • 10. PROPheT – Mapping Module  Define My Model class to import new instances  Map External Model properties to My Model properties  Store new mappings to database for future use
  • 11. PROPheT – Use Case Scenario Scenario: “I want to populate my Cities domain ontology with instances of cities and towns from ENVO and LinkedGeoData”
  • 12. Step 1: Load my ontology to PROPheT
  • 13. Step 2: Search for Instances
  • 14. Step 3: Select Desired Results
  • 15. Step 4: Select Class to Import Instance
  • 16. Step 5: Perform Property Mapping
  • 17. Step 6: Review Results
  • 18. Step 7: My Populated Model
  • 19. Use Case Population Times Class Instances Population time (sec) LinkedGeoData City 10.000 120 ENVO City 10.000 204 LinkedGeoData Town 10.000 158 Factors:  Endpoint response speed  Number of datatype properties/values per instance
  • 20. Instance Retrieval and Population Times Ontology Instances Retrieval time (sec) Population time (sec) DBPedia 10.000 648 250 OpenData Communities 10.000 510 210 DBLP 10.000 316 192 Nobel Prize 10.000 270 170 Eurostat 10.000 440 225
  • 21. PROPheT - User Evaluation  15 participants with Computer Science background  80% of them familiar with ontologies
  • 22. Conclusions & Future Work  Conclusions:  It is user-friendly  It makes ontology population simple  No SPARQL experience is required  Future work:  Optimize overall speed  Handle object properties  Make it a Protégé plugin  Add more search options (e.g. by class label)