SlideShare a Scribd company logo
1 of 36
Download to read offline
Mapping the Central LOD
Ontologies to PROTON Upper-
Level Ontology
Mariana Damova, Atanas Kiryakov, Kiril
Simov, Svetoslav Petrov
ISWC’2010
Outline
• Introduction
• Problem and Conceptual Solution
• Approaches to Matching Ontologies
• Mapping Methods• Mapping Methods
• Proton Extension
• Statistics
• Experimentation
• Future Work
Linking Open Data (LOD)
FactForge (http://factforge.net)
• a reason-able view of the web of data
• the biggest and most heterogeneous body of factual knowledge on
which inference is performed
• 8 datasets from the LOD cloud (DBPedia, Freebase, UMBEL, CIA
World Factbook, MusicBrainz, Wordnet, Lingvoj, Geonames)
• an overall of 1,4 billion loaded statements
• 2,2 billion stored statements (indexed)
• 9,8 billion distinct retrievable statements
Access to FactForge
Forest Interface
- keyword search (for molecules in the RDF graph)
- auto-suggestion of URI
- SPARQL queries using facts from different datasets
(formulating SPARQL queries requires in depth knowledge about the datasets and the
schemata in FactForge)
- exploration
Outline
• Introduction
• Problem and Conceptual Solution
• Approaches to Matching Ontologies
• Mapping Methods• Mapping Methods
• Proton Extension
• Statistics
• Experimentation
• Future Work
Current State
Target State
SELECT * WHERE {
?Person dbp-ont:birthPlace ?BirthPlace ;
rdf:type opencyc:Entertainer ;
?BirthPlace geo-ont:parentFeature dbpedia:Germany .
}
Target State
SELECT * WHERE {
?Person prot:birthPlace ?BirthPlace ;
rdf:type prot:Entertainer ;
?BirthPlace prot:subRegionOf dbpedia:Germany .
}
Benefit
easier and simpler access to the wealth of data
(one needs to know a single ontology instead of learning the
vocabularies of multiple datasets)
higher degree of interoperability offering only
one single schema
better integration of the datasets in FactForge
(so the schemata of the different datasets are mapped through
a single ontology)
information from many datasets via a unified
vocabulary
(a constraint which uses a single predicate from the ontology
can match data from different datasets)
Applications
semantic search and annotation using
the entities from FactForge
semantic browsing and navigationsemantic browsing and navigation
querying FactForge in natural language
many others
PROTON - a modular, lightweight, upper-level ontology defining
about 300 classes and 100 properties
Solution
PROTON
DBPedia Freebase Geonames
about 300 classes and 100 properties
DBPedia - the RDFized version of Wikipedia, data-driven ontology,
an overall of 592 classes and 720 properties, and about 1 478 000
instances
Freebase - a social database, no ontology, hidden class hierarchy
in structured predicate names, 19632 predicates
Geonames – a geographical database that covers 6 million
geographical features with ontology of 5 classes, and 26
properties, and about 645 location denotators
Outline
• Introduction
• Problem and Conceptual Solution
• Approaches to Matching Ontologies
• Mapping Methods• Mapping Methods
• Proton Extension
• Statistics
• Experimentation
• Future Work
Approaches to Matching Ontologies
• syntactic vs. semantic mapping
• harmonizing semantics (interlingua ontology)
• bidirectional vs. unidirectional
• automated vs. manual• automated vs. manual
• OAEI - ontology matching competitions
– benchmark with best F-measure result of 80%
2009
Ontological Heterogeneity
• Semantic matching approaches cannot cope with
ontological heterogeneity
• The classes and the properties may be described in
different unrelated ontologies
• The algorithms cannot discover hidden relationships• The algorithms cannot discover hidden relationships
that hold between unrelated entities.
FactForge presents exactly such a reality of
heterogeneous facts. This makes their automated
processing inconvenient.
Our Approach
unidirectional semantic manual alignment between
PROTONPROTON
and
the schema ontologies of the selected datasets of
FactForge
Outline
• Introduction
• Problem and Conceptual Solution
• Approaches to Matching Ontologies
• Mapping Methods• Mapping Methods
• Proton Extension
• Statistics
• Experimentation
• Future Work
Mapping Methods
• A series of iterations of enrichments at conceptual and
at data levels
(b) Extending PROTON with classes and
properties
(a) Subsumption relations for classes
and properties
FactForge entity PROTON entity
subClassOf
(c) Using OWL class and property construction capabilities to
represent classes and properties from FactForge and map
them to PROTON classes
Class restricted from
FactForge predicates
PROTON entity
subClassOf
(d) Extending FactForge with instances to account for the
conceptual representation of the matching
FactForge Instance 1
FactForge Instance 2 (new)
FactForge Instance 3 (new)
Mapping rules
(a) dbp:Place
owl:subClassOf proton:Location .
(b) dbp-prop:location
rdfs:subPropertyOf proton:locatedIn .
(c) pfb:Location(c) pfb:Location
rdf:type owl:Restriction ;
owl:onProperty <http://rdf.freebase.com/ns/type.object.type> ;
owl:hasValue <http://rdf.freebase.com/ns/location.location> ;
rdfs:subClassOf proton:Location .
(d) p <rdf:type> <dbp-ont:PrimeMinister>
----------------------------------------------------
p <proton:hasPosition> j
j <proton:hasTitle> <proton:PrimeMinister>
Conceptual mismatches
dbp:Architect
rdfs:subClassOf
[ rdf:type owl:Restriction ;
owl:onProperty
proton:hasProfession ;
owl:hasValue proton:Architect
] .
Expression matching
Multiple Matching
DBPedia predicate
DBPedia ontology
predicate
PROTON predicate
dbp:place
rdfs:subPropertyOf proton:locatedIn .
dbp-prop:location
rdfs:subPropertyOf proton:locatedIn .
<http://rdf.freebase.com/ns/time.event.locations>
rdfs:subPropertyOf proton:locatedIn .
Freebase predicate
Outline
• Introduction
• Problem and Conceptual Solution
• Approaches to Matching Ontologies
• Mapping Methods• Mapping Methods
• Proton Extension
• Statistics
• Experimentation
• Future Work
PROTON Extension
• Preserve the OntoClean approach in PROTON design
• Obtain coverage of the rich data in FactForge
• Keep an optimal degree of granularity of the concept hierarchy
Proton was split into modules :
- 19 modules reflecting the conceptual divisions which surfaced
during the analysis of the data, e.g. proton event, proton social
abstraction, proton location, proton biological substance, etc.
Outline
• Introduction
• Problem and Conceptual Solution
• Approaches to Matching Ontologies
• Mapping Methods• Mapping Methods
• Proton Extension
• Statistics
• Experimentation
• Future Work
Statistics
• PROTON Extension
– 141 new Classes
– 3 new Datatype properties
– 12 new Object properties
• Mapping PROTON to FactForge
– 536 subClassOf relations
– 36 subPropertyOf relations
Outline
• Introduction
• Problem and Conceptual Solution
• Approaches to Matching Ontologies
• Mapping Methods• Mapping Methods
• Proton Extension
• Statistics
• Experimentation
• Future Work
Experiment
Data Loading
BigOWLIM – the most scalable OWL Engine
http://www.ontotext.com/owlim/
FactForge Standard
June 2010
FactForge with
mappings June
2010
NumberOfStatements 1,782,541,506 2,630,453,334
NumberOfExplicitStatements 1,143,317,531 1,942,349,578
NumberOfEntities 354,635,159 404,798,593
FactForge Extension Statistics
FactForge FactForge with
Inference Rules
FactForge with IR
and PROTON
FactForge with IR,
PROTON and DBP
mappings
FactForge with
IR, PROTON,
DBP mappings
and Freebase
mappings
NumberOfStatements
October 2010
2,237,550,385 2,237,550,617 2,237,578,643 2,255,543,166 2,375,287,183
NumberOfExplicitStatements
1,357,013,227 2,027,992,627 2,027,995,363 2,027,995,651 2,027,995,750
NumberOfEntities
524,120,454 524,120,465 524,121,955 524,121,996 524,122,009
FactForge Extension Statistics
Difference
between FactForge and Each Adding Iteration
FactForgeFactForge and
FactForge with
FactForge and
FactForge with
factForge and
FactForge with
factForge and
FactForge with
October 2010
FactForge with
Inference Rules
FactForge with
IR and
PROTON
FactForge with
IR, PROTON
and DBP
mappings
FactForge with
IR, PROTON,
DBP mappings
and Freebase
mappings
NumberOfStatements
0 232 28,258 17,992,781 137,736,798
NumberOfExplicitStatem
ents 0 670,979,400 670,982,136 670,982,424 670,982,523
NumberOfEntities
0 11 1,501 1,542 1,555
FactForge Extension Statistics
Difference
between Each Adding Iteration
FactForge FactForge and
FactForge with
Inference Rules
FactForge with
Inference Rules
and FactForge
with IR and
FactForge with IR
and PROTON and
FactForge with
IR, PROTON and
FactForge with
IR, PROTON and
DBP mappings
and FactForge
October 2010
with IR and
PROTON
IR, PROTON and
DBP mappings
and FactForge
with IR, PROTON,
DBP mappings
and Freebase
mappings
NumberOfStatements
0 232 28,026 17,964,523 119,744,017
NumberOfExplicitStatements
0 670,979,400 2,736 288 99
NumberOfEntities
0 11 1490 41 13
Experimentation: PROTON query
US non-profit organizations founded after 1950
PREFIX p-ext: <http://proton.semanticweb.org/protonue#>
PREFIX ptop: <http://proton.semanticweb.org/protont#>
PREFIX dbpedia: <http://dbpedia.org/resource/>
SELECT distinct ?s ?date ?l WHERE {
?s a p-ext:Non-ProfitOrganisation .
?s ptop:establishmentDate ?date .
?s ptop:locatedIn ?l .
?l ptop:subRegionOf dbpedia:United_States .
FILTER (?date > "1950")
}
Query: Prime Ministers born in the United Kingdom
PREFIX dbp-ont: <http://dbpedia.org/ontology/>
PREFIX dbpedia: <http://dbpedia.org/resource/>
SELECT ?PrimeMinister WHERE {
?PrimeMinister rdf:type dbp-ont:PrimeMinister .
?PrimeMinister dbp-ont:birthPlace dbpedia:United_Kingdom .
}
PREFIX pupp: <http://proton.semanticweb.org/protonu#>
PREFIX p-ext: <http://proton.semanticweb.org/protonue#>
PREFIX dbpedia: <http://dbpedia.org/resource/>
PREFIX ptop: <http://proton.semanticweb.org/protont#>
SELECT ?PrimeMinister WHERE {
?PrimeMinister ptop:hasPosition ?pos .
?pos pupp:hasTitle dbpedia:British_prime_minister .
?PrimeMinister p-ext:birthPlace dbpedia:United_Kingdom .
}
Query: Cities around the world which have Modigliani art work
PREFIX fb: <http://rdf.freebase.com/ns/>
PREFIX dbpedia: <http://dbpedia.org/resource/>
PREFIX dbp-prop: <http://dbpedia.org/property/>
PREFIX dbp-ont: <http://dbpedia.org/ontology/>
PREFIX umbel-sc: <http://umbel.org/umbel/sc/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX ot: <http://www.ontotext.com/>
SELECT DISTINCT ?painting_l ?owner_l ?city_fb_con ?city_db_loc ?city_db_cit
WHERE {WHERE {
?p fb:visual_art.artwork.artist
dbpedia:Amedeo_Modigliani ;
fb:visual_art.artwork.owners [ fb:visual_art.artwork_owner_relationship.owner ?ow ] ;
ot:preferredLabel ?painting_l.
?ow ot:preferredLabel ?owner_l .
OPTIONAL { ?ow fb:location.location.containedby [ ot:preferredLabel ?city_fb_con ] }
OPTIONAL { ?ow dbp-prop:location ?loc.
?loc rdf:type umbel-sc:City ;
ot:preferredLabel ?city_db_loc }
OPTIONAL { ?ow dbp-ont:city [ ot:preferredLabel ?city_db_cit ] }
}
Query: Cities around the world which have Modigliani art work
PREFIX dbpedia: <http://dbpedia.org/resource/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX ot: <http://www.ontotext.com/>
PREFIX ptop: <http://proton.semanticweb.org/protont#>
PREFIX p-ext: <http://proton.semanticweb.org/protonue#>
SELECT DISTINCT ?painting ?owner ?city
WHERE {
?p p-ext:author dbpedia:Amedeo_Modigliani ;
p-ext:ownership [ ptop:isOwnedBy ?ow ] ;
ot:preferredLabel ?painting .
?ow ot:preferredLabel ?owner .
?ow ptop:locatedIn [ rdf:type pupp:City ;
ot:preferredLabel ?city].
}
Outline
• Introduction
• Problem
• Conceptual Solution
• Approaches to Matching Ontologies• Approaches to Matching Ontologies
• Mapping Methods
• Proton Extension
• Statistics
• Experimentation
• Future Work
Future Work
• Two-level intermediary layer to access FactForge
mapping PROTON to UMBEL
Dataset1 Dataset2 Dataset3 Dataset4Datasets
Upper Level Ontology 2Two level
Ontology 1 Ontology 2 Ontology 3 Ontology 4
Upper Mapping and Binding Exchange Layer
A lightweight, subject concept reference structure for the Web
Upper Mapping and Binding Exchange Layer
A lightweight, subject concept reference structure for the Web
Query
triples
?S P O
?S P1 ?O1
?O1 P2 O2
Upper Level Ontology 2Two level
Intermediate
layer Upper Level Ontology 1
http://www.ontotext.com/news.html#umb_25oct10
UMBEL - Upper Mapping and Binding Exchange Layer
A lightweight, subject concept reference structure for the Web
20 000 concepts mapped to OpenCyc
Strategic partnership Ontotext – Structured Dynamics
Future Work
• Official FactForge release with the presented
mapping
• Publish Proton and mapping as LOD
• Cover more datasets from the LOD cloud• Cover more datasets from the LOD cloud
• Experiment with the balance between the
datasets and the ontologies describing them
• Extend the property mapping
http://factforge.net
(a version with Proton mapping is currently available at
http://ldsr4.ontotext.com)
Service available at
Thank you for your attention!
Questions?
Contact:
mariana.damova@ontotext.com

More Related Content

What's hot

Statistical Analysis of Web of Data Usage
Statistical Analysis of Web of Data UsageStatistical Analysis of Web of Data Usage
Statistical Analysis of Web of Data UsageMarkus Luczak-Rösch
 
Distributed Algorithm for Frequent Pattern Mining using HadoopMap Reduce Fram...
Distributed Algorithm for Frequent Pattern Mining using HadoopMap Reduce Fram...Distributed Algorithm for Frequent Pattern Mining using HadoopMap Reduce Fram...
Distributed Algorithm for Frequent Pattern Mining using HadoopMap Reduce Fram...idescitation
 
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...NAVER Engineering
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...BrianDeCost
 
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...Dominic Suciu
 
Querying Linked Geospatial Data with Incomplete Information
Querying Linked Geospatial Data with  Incomplete InformationQuerying Linked Geospatial Data with  Incomplete Information
Querying Linked Geospatial Data with Incomplete InformationCharalampos (Babis) Nikolaou
 
Accessing R from Python using RPy2
Accessing R from Python using RPy2Accessing R from Python using RPy2
Accessing R from Python using RPy2Ryan Rosario
 
Identifying Relevant Sources for Data Linking using a Semantic Web Index
Identifying Relevant Sources for Data Linking using a Semantic Web IndexIdentifying Relevant Sources for Data Linking using a Semantic Web Index
Identifying Relevant Sources for Data Linking using a Semantic Web IndexAndriy Nikolov
 
DBpedia Mappings Quality Assessment
DBpedia Mappings Quality AssessmentDBpedia Mappings Quality Assessment
DBpedia Mappings Quality Assessmentandimou
 
Open Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsOpen Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsAnubhav Jain
 
Focused Exploration of Geospatial Context on Linked Open Data
Focused Exploration of Geospatial Context on Linked Open DataFocused Exploration of Geospatial Context on Linked Open Data
Focused Exploration of Geospatial Context on Linked Open DataThomas Gottron
 

What's hot (14)

Statistical Analysis of Web of Data Usage
Statistical Analysis of Web of Data UsageStatistical Analysis of Web of Data Usage
Statistical Analysis of Web of Data Usage
 
Distributed Algorithm for Frequent Pattern Mining using HadoopMap Reduce Fram...
Distributed Algorithm for Frequent Pattern Mining using HadoopMap Reduce Fram...Distributed Algorithm for Frequent Pattern Mining using HadoopMap Reduce Fram...
Distributed Algorithm for Frequent Pattern Mining using HadoopMap Reduce Fram...
 
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
 
2017 biological databases_part1_vupload
2017 biological databases_part1_vupload2017 biological databases_part1_vupload
2017 biological databases_part1_vupload
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...
 
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
 
Querying Linked Geospatial Data with Incomplete Information
Querying Linked Geospatial Data with  Incomplete InformationQuerying Linked Geospatial Data with  Incomplete Information
Querying Linked Geospatial Data with Incomplete Information
 
Accessing R from Python using RPy2
Accessing R from Python using RPy2Accessing R from Python using RPy2
Accessing R from Python using RPy2
 
Identifying Relevant Sources for Data Linking using a Semantic Web Index
Identifying Relevant Sources for Data Linking using a Semantic Web IndexIdentifying Relevant Sources for Data Linking using a Semantic Web Index
Identifying Relevant Sources for Data Linking using a Semantic Web Index
 
DBpedia Mappings Quality Assessment
DBpedia Mappings Quality AssessmentDBpedia Mappings Quality Assessment
DBpedia Mappings Quality Assessment
 
Stoltzfus_EvoIO_2010
Stoltzfus_EvoIO_2010Stoltzfus_EvoIO_2010
Stoltzfus_EvoIO_2010
 
Open Source Tools for Materials Informatics
Open Source Tools for Materials InformaticsOpen Source Tools for Materials Informatics
Open Source Tools for Materials Informatics
 
Stoltzfus_EvoIO_2010
Stoltzfus_EvoIO_2010Stoltzfus_EvoIO_2010
Stoltzfus_EvoIO_2010
 
Focused Exploration of Geospatial Context on Linked Open Data
Focused Exploration of Geospatial Context on Linked Open DataFocused Exploration of Geospatial Context on Linked Open Data
Focused Exploration of Geospatial Context on Linked Open Data
 

Viewers also liked

Semantics, Sensors, and the Social Web
Semantics, Sensors, and the Social WebSemantics, Sensors, and the Social Web
Semantics, Sensors, and the Social WebThe Open University
 
Programming with LOD
Programming with LODProgramming with LOD
Programming with LODFumihiro Kato
 
ESWC SS 2013 - Thursday Keynote Vassilis Christophides: Preserving linked data
ESWC SS 2013 - Thursday Keynote Vassilis Christophides: Preserving linked dataESWC SS 2013 - Thursday Keynote Vassilis Christophides: Preserving linked data
ESWC SS 2013 - Thursday Keynote Vassilis Christophides: Preserving linked dataeswcsummerschool
 
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorial
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data TutorialESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorial
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorialeswcsummerschool
 
DAA Keynote- 99 Amazing Big Data Experiences
DAA Keynote- 99 Amazing Big Data ExperiencesDAA Keynote- 99 Amazing Big Data Experiences
DAA Keynote- 99 Amazing Big Data ExperiencesOxford Tech + UX
 
ESWC SS 2012 - Monday Tutorial 2 Barry Norton: Introduction to SPARQL
ESWC SS 2012 - Monday Tutorial 2 Barry Norton: Introduction to SPARQLESWC SS 2012 - Monday Tutorial 2 Barry Norton: Introduction to SPARQL
ESWC SS 2012 - Monday Tutorial 2 Barry Norton: Introduction to SPARQLeswcsummerschool
 
Matching Domain Ontologies A Comparative Study [Mode De Compatibilité]
Matching Domain Ontologies A Comparative Study [Mode De Compatibilité]Matching Domain Ontologies A Comparative Study [Mode De Compatibilité]
Matching Domain Ontologies A Comparative Study [Mode De Compatibilité]Leila Zemmouchi-Ghomari
 
ESWC-2011: S-Match received 7 years award
ESWC-2011: S-Match received 7 years award ESWC-2011: S-Match received 7 years award
ESWC-2011: S-Match received 7 years award Pavel Shvaiko
 
Thesis DESIGN AND IMPLEMENTATION OF AN ONTOLOGY FOR MODELING USERS PROFILE I...
Thesis DESIGN AND IMPLEMENTATION OF AN ONTOLOGY FOR MODELING  USERS PROFILE I...Thesis DESIGN AND IMPLEMENTATION OF AN ONTOLOGY FOR MODELING  USERS PROFILE I...
Thesis DESIGN AND IMPLEMENTATION OF AN ONTOLOGY FOR MODELING USERS PROFILE I...Aggelos Ser
 
Java ontology matching state of the art and future challenges
Java  ontology matching state of the art and future challengesJava  ontology matching state of the art and future challenges
Java ontology matching state of the art and future challengesecwayerode
 
Ontology alignment representation
Ontology alignment representationOntology alignment representation
Ontology alignment representationFrançois Scharffe
 
Instance-based Ontology Matching by Instance Enrichment
Instance-based Ontology Matching by Instance EnrichmentInstance-based Ontology Matching by Instance Enrichment
Instance-based Ontology Matching by Instance EnrichmentBalthasar Schopman
 
ontology meets big data: immutability
ontology meets big data:immutabilityontology meets big data:immutability
ontology meets big data: immutabilityChris Partridge
 
Preparing for your viva voce dissertation defence.
Preparing for your viva voce dissertation defence.Preparing for your viva voce dissertation defence.
Preparing for your viva voce dissertation defence.The Free School
 
M.Phil. Thesis Defense Template_ Inst. of Geology_PU
M.Phil. Thesis Defense Template_ Inst. of Geology_PUM.Phil. Thesis Defense Template_ Inst. of Geology_PU
M.Phil. Thesis Defense Template_ Inst. of Geology_PUAtiqa khan
 
Data Integration Ontology Mapping
Data Integration Ontology MappingData Integration Ontology Mapping
Data Integration Ontology MappingPradeep B Pillai
 
Thesis Defense Presentation
Thesis Defense PresentationThesis Defense Presentation
Thesis Defense Presentationosideloc
 
My Thesis Defense Presentation
My Thesis Defense PresentationMy Thesis Defense Presentation
My Thesis Defense PresentationDavid Onoue
 

Viewers also liked (20)

Semantics, Sensors, and the Social Web
Semantics, Sensors, and the Social WebSemantics, Sensors, and the Social Web
Semantics, Sensors, and the Social Web
 
Programming with LOD
Programming with LODProgramming with LOD
Programming with LOD
 
ESWC SS 2013 - Thursday Keynote Vassilis Christophides: Preserving linked data
ESWC SS 2013 - Thursday Keynote Vassilis Christophides: Preserving linked dataESWC SS 2013 - Thursday Keynote Vassilis Christophides: Preserving linked data
ESWC SS 2013 - Thursday Keynote Vassilis Christophides: Preserving linked data
 
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorial
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data TutorialESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorial
ESWC SS 2012 - Friday Keynote Marko Grobelnik: Big Data Tutorial
 
IndustryInform Service of Mozaika
IndustryInform Service of MozaikaIndustryInform Service of Mozaika
IndustryInform Service of Mozaika
 
DAA Keynote- 99 Amazing Big Data Experiences
DAA Keynote- 99 Amazing Big Data ExperiencesDAA Keynote- 99 Amazing Big Data Experiences
DAA Keynote- 99 Amazing Big Data Experiences
 
ESWC SS 2012 - Monday Tutorial 2 Barry Norton: Introduction to SPARQL
ESWC SS 2012 - Monday Tutorial 2 Barry Norton: Introduction to SPARQLESWC SS 2012 - Monday Tutorial 2 Barry Norton: Introduction to SPARQL
ESWC SS 2012 - Monday Tutorial 2 Barry Norton: Introduction to SPARQL
 
Matching Domain Ontologies A Comparative Study [Mode De Compatibilité]
Matching Domain Ontologies A Comparative Study [Mode De Compatibilité]Matching Domain Ontologies A Comparative Study [Mode De Compatibilité]
Matching Domain Ontologies A Comparative Study [Mode De Compatibilité]
 
ESWC-2011: S-Match received 7 years award
ESWC-2011: S-Match received 7 years award ESWC-2011: S-Match received 7 years award
ESWC-2011: S-Match received 7 years award
 
Thesis DESIGN AND IMPLEMENTATION OF AN ONTOLOGY FOR MODELING USERS PROFILE I...
Thesis DESIGN AND IMPLEMENTATION OF AN ONTOLOGY FOR MODELING  USERS PROFILE I...Thesis DESIGN AND IMPLEMENTATION OF AN ONTOLOGY FOR MODELING  USERS PROFILE I...
Thesis DESIGN AND IMPLEMENTATION OF AN ONTOLOGY FOR MODELING USERS PROFILE I...
 
Java ontology matching state of the art and future challenges
Java  ontology matching state of the art and future challengesJava  ontology matching state of the art and future challenges
Java ontology matching state of the art and future challenges
 
Ontology alignment representation
Ontology alignment representationOntology alignment representation
Ontology alignment representation
 
Instance-based Ontology Matching by Instance Enrichment
Instance-based Ontology Matching by Instance EnrichmentInstance-based Ontology Matching by Instance Enrichment
Instance-based Ontology Matching by Instance Enrichment
 
ontology meets big data: immutability
ontology meets big data:immutabilityontology meets big data:immutability
ontology meets big data: immutability
 
Preparing for your viva voce dissertation defence.
Preparing for your viva voce dissertation defence.Preparing for your viva voce dissertation defence.
Preparing for your viva voce dissertation defence.
 
M.Phil. Thesis Defense Template_ Inst. of Geology_PU
M.Phil. Thesis Defense Template_ Inst. of Geology_PUM.Phil. Thesis Defense Template_ Inst. of Geology_PU
M.Phil. Thesis Defense Template_ Inst. of Geology_PU
 
Data Integration Ontology Mapping
Data Integration Ontology MappingData Integration Ontology Mapping
Data Integration Ontology Mapping
 
Thesis Defense Presentation
Thesis Defense PresentationThesis Defense Presentation
Thesis Defense Presentation
 
Thesis powerpoint
Thesis powerpointThesis powerpoint
Thesis powerpoint
 
My Thesis Defense Presentation
My Thesis Defense PresentationMy Thesis Defense Presentation
My Thesis Defense Presentation
 

Similar to Mapping Lo Dto Proton Revised [Compatibility Mode]

2011linked science4mccuskermcguinnessfinal
2011linked science4mccuskermcguinnessfinal2011linked science4mccuskermcguinnessfinal
2011linked science4mccuskermcguinnessfinalDeborah McGuinness
 
History and Background of the USEWOD Data Challenge
History and Background of the  USEWOD Data ChallengeHistory and Background of the  USEWOD Data Challenge
History and Background of the USEWOD Data ChallengeKnud Möller
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout Carole Goble
 
Integrating a Domain Ontology Development Environment and an Ontology Search ...
Integrating a Domain Ontology Development Environment and an Ontology Search ...Integrating a Domain Ontology Development Environment and an Ontology Search ...
Integrating a Domain Ontology Development Environment and an Ontology Search ...Takeshi Morita
 
Case Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human GenomeCase Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human GenomeDavid Portnoy
 
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...The HDF-EOS Tools and Information Center
 
Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...Maulik Kamdar
 
Finding knowledge, data and answers on the Semantic Web
Finding knowledge, data and answers on the Semantic WebFinding knowledge, data and answers on the Semantic Web
Finding knowledge, data and answers on the Semantic Webebiquity
 
Toward Next Generation of Gazetteer: Utilizing GeoSPARQL For Developing Link...
Toward Next Generation of Gazetteer:  Utilizing GeoSPARQL For Developing Link...Toward Next Generation of Gazetteer:  Utilizing GeoSPARQL For Developing Link...
Toward Next Generation of Gazetteer: Utilizing GeoSPARQL For Developing Link...Dongpo Deng
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod GmodJun Zhao
 
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...ICZN
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataGiorgos Santipantakis
 
Brief State of the Art - Semantic Web technologies for geospatial data - Mode...
Brief State of the Art - Semantic Web technologies for geospatial data - Mode...Brief State of the Art - Semantic Web technologies for geospatial data - Mode...
Brief State of the Art - Semantic Web technologies for geospatial data - Mode...Ana Roxin
 
On the diversity and availability of temporal information in linked open data
On the diversity and availability of temporal information in linked open dataOn the diversity and availability of temporal information in linked open data
On the diversity and availability of temporal information in linked open dataAnisa Rula
 
Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Enayat Rajabi
 
Sharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsSharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsGaignard Alban
 
SemFacet paper
SemFacet paperSemFacet paper
SemFacet paperDBOnto
 
Sem facet paper
Sem facet paperSem facet paper
Sem facet paperDBOnto
 

Similar to Mapping Lo Dto Proton Revised [Compatibility Mode] (20)

2011linked science4mccuskermcguinnessfinal
2011linked science4mccuskermcguinnessfinal2011linked science4mccuskermcguinnessfinal
2011linked science4mccuskermcguinnessfinal
 
History and Background of the USEWOD Data Challenge
History and Background of the  USEWOD Data ChallengeHistory and Background of the  USEWOD Data Challenge
History and Background of the USEWOD Data Challenge
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
Integrating a Domain Ontology Development Environment and an Ontology Search ...
Integrating a Domain Ontology Development Environment and an Ontology Search ...Integrating a Domain Ontology Development Environment and an Ontology Search ...
Integrating a Domain Ontology Development Environment and an Ontology Search ...
 
Case Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human GenomeCase Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human Genome
 
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
 
Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...Current advances to bridge the usability-expressivity gap in biomedical seman...
Current advances to bridge the usability-expressivity gap in biomedical seman...
 
Finding knowledge, data and answers on the Semantic Web
Finding knowledge, data and answers on the Semantic WebFinding knowledge, data and answers on the Semantic Web
Finding knowledge, data and answers on the Semantic Web
 
Toward Next Generation of Gazetteer: Utilizing GeoSPARQL For Developing Link...
Toward Next Generation of Gazetteer:  Utilizing GeoSPARQL For Developing Link...Toward Next Generation of Gazetteer:  Utilizing GeoSPARQL For Developing Link...
Toward Next Generation of Gazetteer: Utilizing GeoSPARQL For Developing Link...
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival data
 
COPO kick-off meeting
COPO kick-off meetingCOPO kick-off meeting
COPO kick-off meeting
 
Brief State of the Art - Semantic Web technologies for geospatial data - Mode...
Brief State of the Art - Semantic Web technologies for geospatial data - Mode...Brief State of the Art - Semantic Web technologies for geospatial data - Mode...
Brief State of the Art - Semantic Web technologies for geospatial data - Mode...
 
On the diversity and availability of temporal information in linked open data
On the diversity and availability of temporal information in linked open dataOn the diversity and availability of temporal information in linked open data
On the diversity and availability of temporal information in linked open data
 
UKON 2014
UKON 2014UKON 2014
UKON 2014
 
Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)
 
Sharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsSharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reports
 
SemFacet paper
SemFacet paperSemFacet paper
SemFacet paper
 
Sem facet paper
Sem facet paperSem facet paper
Sem facet paper
 

More from Mariana Damova, Ph.D

ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваMariana Damova, Ph.D
 
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryGeography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryMariana Damova, Ph.D
 
Startup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionStartup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionMariana Damova, Ph.D
 
Семантични технологии основи
Семантични технологии   основи Семантични технологии   основи
Семантични технологии основи Mariana Damova, Ph.D
 
Startup Europe Week Sofia introduction
Startup Europe Week Sofia introductionStartup Europe Week Sofia introduction
Startup Europe Week Sofia introductionMariana Damova, Ph.D
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Mariana Damova, Ph.D
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Mariana Damova, Ph.D
 
Communication channels for the european single digital market
Communication channels for the european single digital marketCommunication channels for the european single digital market
Communication channels for the european single digital marketMariana Damova, Ph.D
 
Bulgariana europeana27112013 ним
Bulgariana europeana27112013 нимBulgariana europeana27112013 ним
Bulgariana europeana27112013 нимMariana Damova, Ph.D
 
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...Mariana Damova, Ph.D
 
проектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологиипроектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологииMariana Damova, Ph.D
 
семантични технологии основи
семантични технологии   основисемантични технологии   основи
семантични технологии основиMariana Damova, Ph.D
 
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Mariana Damova, Ph.D
 
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)Mariana Damova, Ph.D
 

More from Mariana Damova, Ph.D (20)

ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
 
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryGeography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic Memory
 
Startup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionStartup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - Introduction
 
Семантични технологии основи
Семантични технологии   основи Семантични технологии   основи
Семантични технологии основи
 
IndustryInform Demo March 2016
IndustryInform Demo March 2016IndustryInform Demo March 2016
IndustryInform Demo March 2016
 
Startup Europe Week Sofia introduction
Startup Europe Week Sofia introductionStartup Europe Week Sofia introduction
Startup Europe Week Sofia introduction
 
Mozaika-Jan2016a
Mozaika-Jan2016aMozaika-Jan2016a
Mozaika-Jan2016a
 
Concordia july2015
Concordia july2015Concordia july2015
Concordia july2015
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23
 
Communication channels for the european single digital market
Communication channels for the european single digital marketCommunication channels for the european single digital market
Communication channels for the european single digital market
 
Bulgariana europeana27112013 ним
Bulgariana europeana27112013 нимBulgariana europeana27112013 ним
Bulgariana europeana27112013 ним
 
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
 
Mozaika june2014
Mozaika june2014Mozaika june2014
Mozaika june2014
 
Europeana in Bulgaria
Europeana in BulgariaEuropeana in Bulgaria
Europeana in Bulgaria
 
Bulgariana europeana02112013
Bulgariana europeana02112013Bulgariana europeana02112013
Bulgariana europeana02112013
 
проектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологиипроектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологии
 
семантични технологии основи
семантични технологии   основисемантични технологии   основи
семантични технологии основи
 
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
 
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
 

Mapping Lo Dto Proton Revised [Compatibility Mode]

  • 1. Mapping the Central LOD Ontologies to PROTON Upper- Level Ontology Mariana Damova, Atanas Kiryakov, Kiril Simov, Svetoslav Petrov ISWC’2010
  • 2. Outline • Introduction • Problem and Conceptual Solution • Approaches to Matching Ontologies • Mapping Methods• Mapping Methods • Proton Extension • Statistics • Experimentation • Future Work
  • 3. Linking Open Data (LOD) FactForge (http://factforge.net) • a reason-able view of the web of data • the biggest and most heterogeneous body of factual knowledge on which inference is performed • 8 datasets from the LOD cloud (DBPedia, Freebase, UMBEL, CIA World Factbook, MusicBrainz, Wordnet, Lingvoj, Geonames) • an overall of 1,4 billion loaded statements • 2,2 billion stored statements (indexed) • 9,8 billion distinct retrievable statements
  • 4.
  • 5. Access to FactForge Forest Interface - keyword search (for molecules in the RDF graph) - auto-suggestion of URI - SPARQL queries using facts from different datasets (formulating SPARQL queries requires in depth knowledge about the datasets and the schemata in FactForge) - exploration
  • 6. Outline • Introduction • Problem and Conceptual Solution • Approaches to Matching Ontologies • Mapping Methods• Mapping Methods • Proton Extension • Statistics • Experimentation • Future Work
  • 7. Current State Target State SELECT * WHERE { ?Person dbp-ont:birthPlace ?BirthPlace ; rdf:type opencyc:Entertainer ; ?BirthPlace geo-ont:parentFeature dbpedia:Germany . } Target State SELECT * WHERE { ?Person prot:birthPlace ?BirthPlace ; rdf:type prot:Entertainer ; ?BirthPlace prot:subRegionOf dbpedia:Germany . }
  • 8. Benefit easier and simpler access to the wealth of data (one needs to know a single ontology instead of learning the vocabularies of multiple datasets) higher degree of interoperability offering only one single schema better integration of the datasets in FactForge (so the schemata of the different datasets are mapped through a single ontology) information from many datasets via a unified vocabulary (a constraint which uses a single predicate from the ontology can match data from different datasets)
  • 9. Applications semantic search and annotation using the entities from FactForge semantic browsing and navigationsemantic browsing and navigation querying FactForge in natural language many others
  • 10. PROTON - a modular, lightweight, upper-level ontology defining about 300 classes and 100 properties Solution PROTON DBPedia Freebase Geonames about 300 classes and 100 properties DBPedia - the RDFized version of Wikipedia, data-driven ontology, an overall of 592 classes and 720 properties, and about 1 478 000 instances Freebase - a social database, no ontology, hidden class hierarchy in structured predicate names, 19632 predicates Geonames – a geographical database that covers 6 million geographical features with ontology of 5 classes, and 26 properties, and about 645 location denotators
  • 11. Outline • Introduction • Problem and Conceptual Solution • Approaches to Matching Ontologies • Mapping Methods• Mapping Methods • Proton Extension • Statistics • Experimentation • Future Work
  • 12. Approaches to Matching Ontologies • syntactic vs. semantic mapping • harmonizing semantics (interlingua ontology) • bidirectional vs. unidirectional • automated vs. manual• automated vs. manual • OAEI - ontology matching competitions – benchmark with best F-measure result of 80% 2009
  • 13. Ontological Heterogeneity • Semantic matching approaches cannot cope with ontological heterogeneity • The classes and the properties may be described in different unrelated ontologies • The algorithms cannot discover hidden relationships• The algorithms cannot discover hidden relationships that hold between unrelated entities. FactForge presents exactly such a reality of heterogeneous facts. This makes their automated processing inconvenient.
  • 14. Our Approach unidirectional semantic manual alignment between PROTONPROTON and the schema ontologies of the selected datasets of FactForge
  • 15. Outline • Introduction • Problem and Conceptual Solution • Approaches to Matching Ontologies • Mapping Methods• Mapping Methods • Proton Extension • Statistics • Experimentation • Future Work
  • 16. Mapping Methods • A series of iterations of enrichments at conceptual and at data levels (b) Extending PROTON with classes and properties (a) Subsumption relations for classes and properties FactForge entity PROTON entity subClassOf (c) Using OWL class and property construction capabilities to represent classes and properties from FactForge and map them to PROTON classes Class restricted from FactForge predicates PROTON entity subClassOf (d) Extending FactForge with instances to account for the conceptual representation of the matching FactForge Instance 1 FactForge Instance 2 (new) FactForge Instance 3 (new)
  • 17. Mapping rules (a) dbp:Place owl:subClassOf proton:Location . (b) dbp-prop:location rdfs:subPropertyOf proton:locatedIn . (c) pfb:Location(c) pfb:Location rdf:type owl:Restriction ; owl:onProperty <http://rdf.freebase.com/ns/type.object.type> ; owl:hasValue <http://rdf.freebase.com/ns/location.location> ; rdfs:subClassOf proton:Location . (d) p <rdf:type> <dbp-ont:PrimeMinister> ---------------------------------------------------- p <proton:hasPosition> j j <proton:hasTitle> <proton:PrimeMinister>
  • 18. Conceptual mismatches dbp:Architect rdfs:subClassOf [ rdf:type owl:Restriction ; owl:onProperty proton:hasProfession ; owl:hasValue proton:Architect ] . Expression matching
  • 19. Multiple Matching DBPedia predicate DBPedia ontology predicate PROTON predicate dbp:place rdfs:subPropertyOf proton:locatedIn . dbp-prop:location rdfs:subPropertyOf proton:locatedIn . <http://rdf.freebase.com/ns/time.event.locations> rdfs:subPropertyOf proton:locatedIn . Freebase predicate
  • 20. Outline • Introduction • Problem and Conceptual Solution • Approaches to Matching Ontologies • Mapping Methods• Mapping Methods • Proton Extension • Statistics • Experimentation • Future Work
  • 21. PROTON Extension • Preserve the OntoClean approach in PROTON design • Obtain coverage of the rich data in FactForge • Keep an optimal degree of granularity of the concept hierarchy Proton was split into modules : - 19 modules reflecting the conceptual divisions which surfaced during the analysis of the data, e.g. proton event, proton social abstraction, proton location, proton biological substance, etc.
  • 22. Outline • Introduction • Problem and Conceptual Solution • Approaches to Matching Ontologies • Mapping Methods• Mapping Methods • Proton Extension • Statistics • Experimentation • Future Work
  • 23. Statistics • PROTON Extension – 141 new Classes – 3 new Datatype properties – 12 new Object properties • Mapping PROTON to FactForge – 536 subClassOf relations – 36 subPropertyOf relations
  • 24. Outline • Introduction • Problem and Conceptual Solution • Approaches to Matching Ontologies • Mapping Methods• Mapping Methods • Proton Extension • Statistics • Experimentation • Future Work
  • 25. Experiment Data Loading BigOWLIM – the most scalable OWL Engine http://www.ontotext.com/owlim/ FactForge Standard June 2010 FactForge with mappings June 2010 NumberOfStatements 1,782,541,506 2,630,453,334 NumberOfExplicitStatements 1,143,317,531 1,942,349,578 NumberOfEntities 354,635,159 404,798,593
  • 26. FactForge Extension Statistics FactForge FactForge with Inference Rules FactForge with IR and PROTON FactForge with IR, PROTON and DBP mappings FactForge with IR, PROTON, DBP mappings and Freebase mappings NumberOfStatements October 2010 2,237,550,385 2,237,550,617 2,237,578,643 2,255,543,166 2,375,287,183 NumberOfExplicitStatements 1,357,013,227 2,027,992,627 2,027,995,363 2,027,995,651 2,027,995,750 NumberOfEntities 524,120,454 524,120,465 524,121,955 524,121,996 524,122,009
  • 27. FactForge Extension Statistics Difference between FactForge and Each Adding Iteration FactForgeFactForge and FactForge with FactForge and FactForge with factForge and FactForge with factForge and FactForge with October 2010 FactForge with Inference Rules FactForge with IR and PROTON FactForge with IR, PROTON and DBP mappings FactForge with IR, PROTON, DBP mappings and Freebase mappings NumberOfStatements 0 232 28,258 17,992,781 137,736,798 NumberOfExplicitStatem ents 0 670,979,400 670,982,136 670,982,424 670,982,523 NumberOfEntities 0 11 1,501 1,542 1,555
  • 28. FactForge Extension Statistics Difference between Each Adding Iteration FactForge FactForge and FactForge with Inference Rules FactForge with Inference Rules and FactForge with IR and FactForge with IR and PROTON and FactForge with IR, PROTON and FactForge with IR, PROTON and DBP mappings and FactForge October 2010 with IR and PROTON IR, PROTON and DBP mappings and FactForge with IR, PROTON, DBP mappings and Freebase mappings NumberOfStatements 0 232 28,026 17,964,523 119,744,017 NumberOfExplicitStatements 0 670,979,400 2,736 288 99 NumberOfEntities 0 11 1490 41 13
  • 29. Experimentation: PROTON query US non-profit organizations founded after 1950 PREFIX p-ext: <http://proton.semanticweb.org/protonue#> PREFIX ptop: <http://proton.semanticweb.org/protont#> PREFIX dbpedia: <http://dbpedia.org/resource/> SELECT distinct ?s ?date ?l WHERE { ?s a p-ext:Non-ProfitOrganisation . ?s ptop:establishmentDate ?date . ?s ptop:locatedIn ?l . ?l ptop:subRegionOf dbpedia:United_States . FILTER (?date > "1950") }
  • 30. Query: Prime Ministers born in the United Kingdom PREFIX dbp-ont: <http://dbpedia.org/ontology/> PREFIX dbpedia: <http://dbpedia.org/resource/> SELECT ?PrimeMinister WHERE { ?PrimeMinister rdf:type dbp-ont:PrimeMinister . ?PrimeMinister dbp-ont:birthPlace dbpedia:United_Kingdom . } PREFIX pupp: <http://proton.semanticweb.org/protonu#> PREFIX p-ext: <http://proton.semanticweb.org/protonue#> PREFIX dbpedia: <http://dbpedia.org/resource/> PREFIX ptop: <http://proton.semanticweb.org/protont#> SELECT ?PrimeMinister WHERE { ?PrimeMinister ptop:hasPosition ?pos . ?pos pupp:hasTitle dbpedia:British_prime_minister . ?PrimeMinister p-ext:birthPlace dbpedia:United_Kingdom . }
  • 31. Query: Cities around the world which have Modigliani art work PREFIX fb: <http://rdf.freebase.com/ns/> PREFIX dbpedia: <http://dbpedia.org/resource/> PREFIX dbp-prop: <http://dbpedia.org/property/> PREFIX dbp-ont: <http://dbpedia.org/ontology/> PREFIX umbel-sc: <http://umbel.org/umbel/sc/> PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX ot: <http://www.ontotext.com/> SELECT DISTINCT ?painting_l ?owner_l ?city_fb_con ?city_db_loc ?city_db_cit WHERE {WHERE { ?p fb:visual_art.artwork.artist dbpedia:Amedeo_Modigliani ; fb:visual_art.artwork.owners [ fb:visual_art.artwork_owner_relationship.owner ?ow ] ; ot:preferredLabel ?painting_l. ?ow ot:preferredLabel ?owner_l . OPTIONAL { ?ow fb:location.location.containedby [ ot:preferredLabel ?city_fb_con ] } OPTIONAL { ?ow dbp-prop:location ?loc. ?loc rdf:type umbel-sc:City ; ot:preferredLabel ?city_db_loc } OPTIONAL { ?ow dbp-ont:city [ ot:preferredLabel ?city_db_cit ] } }
  • 32. Query: Cities around the world which have Modigliani art work PREFIX dbpedia: <http://dbpedia.org/resource/> PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX ot: <http://www.ontotext.com/> PREFIX ptop: <http://proton.semanticweb.org/protont#> PREFIX p-ext: <http://proton.semanticweb.org/protonue#> SELECT DISTINCT ?painting ?owner ?city WHERE { ?p p-ext:author dbpedia:Amedeo_Modigliani ; p-ext:ownership [ ptop:isOwnedBy ?ow ] ; ot:preferredLabel ?painting . ?ow ot:preferredLabel ?owner . ?ow ptop:locatedIn [ rdf:type pupp:City ; ot:preferredLabel ?city]. }
  • 33. Outline • Introduction • Problem • Conceptual Solution • Approaches to Matching Ontologies• Approaches to Matching Ontologies • Mapping Methods • Proton Extension • Statistics • Experimentation • Future Work
  • 34. Future Work • Two-level intermediary layer to access FactForge mapping PROTON to UMBEL Dataset1 Dataset2 Dataset3 Dataset4Datasets Upper Level Ontology 2Two level Ontology 1 Ontology 2 Ontology 3 Ontology 4 Upper Mapping and Binding Exchange Layer A lightweight, subject concept reference structure for the Web Upper Mapping and Binding Exchange Layer A lightweight, subject concept reference structure for the Web Query triples ?S P O ?S P1 ?O1 ?O1 P2 O2 Upper Level Ontology 2Two level Intermediate layer Upper Level Ontology 1 http://www.ontotext.com/news.html#umb_25oct10 UMBEL - Upper Mapping and Binding Exchange Layer A lightweight, subject concept reference structure for the Web 20 000 concepts mapped to OpenCyc Strategic partnership Ontotext – Structured Dynamics
  • 35. Future Work • Official FactForge release with the presented mapping • Publish Proton and mapping as LOD • Cover more datasets from the LOD cloud• Cover more datasets from the LOD cloud • Experiment with the balance between the datasets and the ontologies describing them • Extend the property mapping
  • 36. http://factforge.net (a version with Proton mapping is currently available at http://ldsr4.ontotext.com) Service available at Thank you for your attention! Questions? Contact: mariana.damova@ontotext.com