SlideShare a Scribd company logo
1 of 23
Download to read offline
FactForge
Data Service and the Value of
Inferred Knowledge over LOD
           Mariana Damova, PhD


         European Open Data Forum
                 June 2012
Ontotext
   – Top-5 provider of core Semantic Technology
   – Established in year 2000; offices in Bulgaria, UK, USA
   – Active both in research and commercial projects (FP7 funding for 10 years)

• 360° semantic technology – unique portfolio:
   – Semantic Databases: high-performance RDF DBMS, scalable reasoning
   – Semantic Search: text-mining (IE), metadata generation, Information Retrieval (IR)
   – Web Mining: focused crawling, screen scraping, data fusion
   – Linked Data Management and Data Integration

   Good recognition in the SemTech community
   – Ontotext pages are ranked #1 for “semantic annotation” and “semantic repository” at
     GYM, #3 for “linked data management” at Google

   Several joint ventures and subsidiaries
   – Innovantage: leading online recruitment intelligence provider in UK
Ontotext Clients (selected)

          British Broadcasting Corporation (BBC)
                – Run its World Cup 2010 sites on top of OWLIM
                – Since Mar’12 BBC Sports and 2012 Olympics sections are driven
                  by OWLIM and a Concept Extraction service developed by Ontotext

          Press Association (UK)
                – Analysis of Sports news
                – Concept extraction
                – Linked data generation

          Top-3 USA media (not allowed to name)
          The National Archives (UK) contracted Ontotext to implement
          semantic KB and semantic search for the Government Web Archive

          British Museum (UK) Ontotext leads the development of Phase 3 of
          ResearchSpace project on collaborative research in cultural heritage;
          British Museum’s public SPARQL end-point is powered by OWLIM

          de Bibliothek (Holland) aggregation of data from 150 library databases
Linked Open Data is maturing
  LOD cloud grows by billions of triples yearly
Technologies and guidelines about
  how to produce linked data fast
  how to assure their quality
  how to provide vertical oriented data services
                                               LOD2, LATC, baseKB



                         European Data Forum           June2012     #4
This talk is about
       reasoning
               and
                     coping with diversity of the data on the web of data




                                 European Data Forum         June 2012      #5
Outline

• FactForge (beta)
• Reference Layer
• Access Modes
• Querying
   – Airports around London
   – US city – a subject of a Novel
   – US city – contactInformation

• Challenges
• Conclusion



                               European Data Forum
FactForge (beta)




the largest body of heterogeneous general knowledge on which inference has been performed

– powered by OWLIM 5.0                                           – supporting SPARQL 1.1
                                   European Data Forum
Datasets

                         REASON-ABLE VIEW
                           of LOD datasets
                  Number of explicit statements: 1,796,673,630
                            Implicit statements: 1,3
                    Retrievable statements: 14,928,925,039


                                                      CIA FactBook
   DBpedia 3.7
                     Freebase
                                     NY Times
                                                                     Lexvo



    Wordnet 3.0          Geonames                                Lingvoj
                                           MusicBrainz




materialization is performed with respect to the semantics of OWL-Horst optimized

                                 European Data Forum
Reference Layer




                                                                   PROTON – light weight upper level ontology
                                                                             ~500 classes, ~150 properties
                                                                   http://www.ontotext.com/proton-ontology

Linking at schema level:
(1) using rdfs:subClassOf and rdfs:subPropertyOf statements;
(2) using OWL expressions where there is a difference in the conceptualization
(3) using inference rules if additional individuals are necessary in the repository to support the mapping

                                               European Data Forum                          June 2012        #9
Access modes

RDF Search - retrieve ranked list of URIs related to literals, which contain specific keywords




                                      European Data Forum                    June 2012       #10
Access modes (condt)

 Exploration - traversing the data, one resource at a time
Access modes (condt)

   Exploration - traversing the data, one resource at a time,
                  inspecting inferred knowledge


- locatedIn – Denmark, Northern Europe
- Geonames types/FearureCodes (dc:type P.PPL)
- parentFeature – Denmeark, Europe
…




                                       European Data Forum      June 2012   #12
Access modes (condt)
   Exploration - traversing the data, one resource at a time,
                 inspecting inferred knowledge




- locatedIn - Europe
- subRegionOf - Europe
- hasContactInfo –
       website via Freebase
- containsLocation
  …




                                     European Data Forum        June 2012   #13
Access modes (condt)

SPARQL endpoint




                   European Data Forum   June 2012   #14
Access modes (condt)

RelFinder




                       European Data Forum   June 2012   #15
Querying
Using LOD concepts




 SELECT * WHERE {
  ?Person dbp-ont:birthPlace ?BirthPlace ;
       rdf:type dbp-ont:Politician ;
 ?BirthPlace geo-ont:parentFeature dbpedia:Germany .
 }




Using the intermediary layer




 SELECT * WHERE {
   ?Person prot:birthPlace ?BirthPlace ;
        rdf:type prot:Politicianr ;
   ?BirthPlace prot:subRegionOf dbpedia:Germany .
 }




                                                       European Data Forum
                                                                             June 2012
Find Airports near London

                                         Standard LOD vs. PROTON query
                                         13 vs. 20 results
                                         DBpedia vs. DBpedia and Geonames




                   European Data Forum                 June 2012    #17
Find airports near London - Results comparison




 Using Geospatial index of OWLIM



                                   European Data Forum   June 2012   #18
City – a subject of a science fiction author




                      European Data Forum      June 2012   #19
OWLIM 5.0 and SPARQL 1.1

Exemplary queries :
GROUP BY, min
   — Minimal and maximal population counts of European countries
Federated Query between FactForge and LinkedLifeData
    — Drugs that cure the disease from which died Alexandre Graham Bell
Literal index over dates
     – World governors in office between 1980 and 2005
Literal index over digits
     ― European countries with population above 20 MLN
Geospatial index
    — Show the distance from London of airports located at most 50 miles away from it




                                European Data Forum                 June 2012     #20
Challenges and usage

• Clean data
   – Clean up input data

• At model level
   – Contradiction detection
   – Consistency checking

• Curation and upgrading methodology



         FactForge has been used as data layer infrastructure in FP7 projects, like RENDER
         FactForge has been used in tasks of
                   linked data generation from unstructured data,
                   metadata enrichment of structured data
                             providing linkage to the entire LOD cloud
                                     for example The National Archive of UK
                                                  EDAMAM - food recommendation app
                                 European Data Forum                    June 2012      #21
Acknowledgements

  Partial funding




Colleagues
Atanas Kiryakov, CEO of Ontotext
Zdravko Tashev, Ontotext
Ivan Peikov, Ontotext
Rouslan Velkov, Ontotext
Kiril Simov, Ontotext
Barry Bishop, Ontotext
Barry Norton, Ontotext
Marin Dimitrov, Ontotext
Alex Simov, Ontotext
Jordan Dichev, Ontotext                              Links
Konstantin Penchev, Ontotext                         http://ff-dev.ontotext.com
                                                     http://www.ontotext.com/owlim
                                                     http://www.ontotext.com/factforge
                                                     Email:
                                                     info@factforge.net
                                   European Data Forum                 June 2012     #22
Thank you for your attention!




mariana.damova@ontotext.com

More Related Content

What's hot

Neo4j GraphTour New YorkOntologies and Knowledge Graphs
Neo4j GraphTour New YorkOntologies and Knowledge GraphsNeo4j GraphTour New YorkOntologies and Knowledge Graphs
Neo4j GraphTour New YorkOntologies and Knowledge GraphsNeo4j
 
Redundancy analysis on linked data #cold2014 #ISWC2014
Redundancy analysis on linked data #cold2014 #ISWC2014Redundancy analysis on linked data #cold2014 #ISWC2014
Redundancy analysis on linked data #cold2014 #ISWC2014honghan2013
 
LDL 2012 - Linking to ISOcat Data Categories
LDL 2012 - Linking to ISOcat Data CategoriesLDL 2012 - Linking to ISOcat Data Categories
LDL 2012 - Linking to ISOcat Data CategoriesMenzo Windhouwer
 
Ontology languages and OWL
Ontology languages and OWLOntology languages and OWL
Ontology languages and OWLFulvio Corno
 
A Unified Approach for Representing Metametadata
A Unified Approach for Representing MetametadataA Unified Approach for Representing Metametadata
A Unified Approach for Representing MetametadataKai Eckert
 
RDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachRDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachhorvadam
 
Matching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesMatching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesIJwest
 
FOAF for Social Network Portability
FOAF for Social Network PortabilityFOAF for Social Network Portability
FOAF for Social Network PortabilityUldis Bojars
 
Enhancing Interoperability of FRBR-Based Metadata
Enhancing Interoperability of FRBR-Based MetadataEnhancing Interoperability of FRBR-Based Metadata
Enhancing Interoperability of FRBR-Based MetadataJenn Riley
 
Linked Open Vocabularies
Linked Open VocabulariesLinked Open Vocabularies
Linked Open VocabulariesGiorgia Lodi
 
DLF 2015 Presentation, "RDF in the Real World."
DLF 2015 Presentation, "RDF in the Real World." DLF 2015 Presentation, "RDF in the Real World."
DLF 2015 Presentation, "RDF in the Real World." Avalon Media System
 
The Dublin Core 1:1 Principle in the Age of Linked Data
The Dublin Core 1:1 Principle in the Age of Linked DataThe Dublin Core 1:1 Principle in the Age of Linked Data
The Dublin Core 1:1 Principle in the Age of Linked DataRichard Urban
 
Chicago LOMRDF update 2003-06-19
Chicago LOMRDF update 2003-06-19 Chicago LOMRDF update 2003-06-19
Chicago LOMRDF update 2003-06-19 Mikael Nilsson
 

What's hot (20)

Neo4j GraphTour New YorkOntologies and Knowledge Graphs
Neo4j GraphTour New YorkOntologies and Knowledge GraphsNeo4j GraphTour New YorkOntologies and Knowledge Graphs
Neo4j GraphTour New YorkOntologies and Knowledge Graphs
 
Redundancy analysis on linked data #cold2014 #ISWC2014
Redundancy analysis on linked data #cold2014 #ISWC2014Redundancy analysis on linked data #cold2014 #ISWC2014
Redundancy analysis on linked data #cold2014 #ISWC2014
 
LDL 2012 - Linking to ISOcat Data Categories
LDL 2012 - Linking to ISOcat Data CategoriesLDL 2012 - Linking to ISOcat Data Categories
LDL 2012 - Linking to ISOcat Data Categories
 
NIF - NLP Interchange Format
NIF - NLP Interchange FormatNIF - NLP Interchange Format
NIF - NLP Interchange Format
 
Ontology languages and OWL
Ontology languages and OWLOntology languages and OWL
Ontology languages and OWL
 
A Unified Approach for Representing Metametadata
A Unified Approach for Representing MetametadataA Unified Approach for Representing Metametadata
A Unified Approach for Representing Metametadata
 
Introduction to LDL 2012
Introduction to LDL 2012Introduction to LDL 2012
Introduction to LDL 2012
 
RDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachRDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approach
 
Scaling the (evolving) web data –at low cost-
Scaling the (evolving) web data –at low cost-Scaling the (evolving) web data –at low cost-
Scaling the (evolving) web data –at low cost-
 
Efficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data StreamsEfficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data Streams
 
Matching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesMatching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sources
 
FOAF for Social Network Portability
FOAF for Social Network PortabilityFOAF for Social Network Portability
FOAF for Social Network Portability
 
Linked Data and Sevices
Linked Data and SevicesLinked Data and Sevices
Linked Data and Sevices
 
Enhancing Interoperability of FRBR-Based Metadata
Enhancing Interoperability of FRBR-Based MetadataEnhancing Interoperability of FRBR-Based Metadata
Enhancing Interoperability of FRBR-Based Metadata
 
Linked Open Vocabularies
Linked Open VocabulariesLinked Open Vocabularies
Linked Open Vocabularies
 
DLF 2015 Presentation, "RDF in the Real World."
DLF 2015 Presentation, "RDF in the Real World." DLF 2015 Presentation, "RDF in the Real World."
DLF 2015 Presentation, "RDF in the Real World."
 
Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
The Dublin Core 1:1 Principle in the Age of Linked Data
The Dublin Core 1:1 Principle in the Age of Linked DataThe Dublin Core 1:1 Principle in the Age of Linked Data
The Dublin Core 1:1 Principle in the Age of Linked Data
 
Fedora Migration Considerations
Fedora Migration ConsiderationsFedora Migration Considerations
Fedora Migration Considerations
 
Chicago LOMRDF update 2003-06-19
Chicago LOMRDF update 2003-06-19 Chicago LOMRDF update 2003-06-19
Chicago LOMRDF update 2003-06-19
 

Viewers also liked (11)

LOD2 - Creating Knowledge out of Interlinked Data - General Presentation
LOD2 - Creating Knowledge out of Interlinked Data - General PresentationLOD2 - Creating Knowledge out of Interlinked Data - General Presentation
LOD2 - Creating Knowledge out of Interlinked Data - General Presentation
 
Webinar@AIMS: LODE-BD
Webinar@AIMS: LODE-BDWebinar@AIMS: LODE-BD
Webinar@AIMS: LODE-BD
 
Linked Open Data, Some Business Cases
Linked Open Data,  Some Business CasesLinked Open Data,  Some Business Cases
Linked Open Data, Some Business Cases
 
Data visualization from Finland
Data visualization from FinlandData visualization from Finland
Data visualization from Finland
 
Metadata : Concentrating on the data, not on the scheme
Metadata : Concentrating on the data, not on the schemeMetadata : Concentrating on the data, not on the scheme
Metadata : Concentrating on the data, not on the scheme
 
David van Zeggeren van Sparked - Pecha Kucha presentatie
David van Zeggeren van Sparked - Pecha Kucha presentatieDavid van Zeggeren van Sparked - Pecha Kucha presentatie
David van Zeggeren van Sparked - Pecha Kucha presentatie
 
Semantic Mapping in CLARIN Component Metadata.
Semantic Mapping in CLARIN Component Metadata.Semantic Mapping in CLARIN Component Metadata.
Semantic Mapping in CLARIN Component Metadata.
 
An introduction to Linked (Open) Data
An introduction to Linked (Open) DataAn introduction to Linked (Open) Data
An introduction to Linked (Open) Data
 
Remodelling of a Database of Bavarian Dialects into TEI XML and LOD
Remodelling of a Database of Bavarian Dialects into TEI XML and LODRemodelling of a Database of Bavarian Dialects into TEI XML and LOD
Remodelling of a Database of Bavarian Dialects into TEI XML and LOD
 
Grade@cnr
Grade@cnrGrade@cnr
Grade@cnr
 
Linked Data Management
Linked Data ManagementLinked Data Management
Linked Data Management
 

Similar to Fact forge20 edf

Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesReasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesOntotext
 
Omitola birmingham cityuniv
Omitola birmingham cityunivOmitola birmingham cityuniv
Omitola birmingham cityunivTope Omitola
 
Datalift lod2-paris-24032011
Datalift lod2-paris-24032011Datalift lod2-paris-24032011
Datalift lod2-paris-24032011Datalift
 
Linked Open Data (LOD) part 3
Linked Open Data (LOD)  part 3Linked Open Data (LOD)  part 3
Linked Open Data (LOD) part 3IPLODProject
 
Linking Open Data with Drupal
Linking Open Data with DrupalLinking Open Data with Drupal
Linking Open Data with Drupalemmanuel_jamin
 
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataOpen City Foundation
 
Linked Data Management
Linked Data ManagementLinked Data Management
Linked Data ManagementMarin Dimitrov
 
121004 linking open_data_with_drupal_v1
121004 linking open_data_with_drupal_v1121004 linking open_data_with_drupal_v1
121004 linking open_data_with_drupal_v1manujam
 
Web Data Management in RDF Age
Web Data Management in RDF AgeWeb Data Management in RDF Age
Web Data Management in RDF AgeINRIA-OAK
 
IC-SDV 2018: Martin Kracker (EPO) Linked Open EP data – a new Product from th...
IC-SDV 2018: Martin Kracker (EPO) Linked Open EP data – a new Product from th...IC-SDV 2018: Martin Kracker (EPO) Linked Open EP data – a new Product from th...
IC-SDV 2018: Martin Kracker (EPO) Linked Open EP data – a new Product from th...Dr. Haxel Consult
 
Information Extraction and Linked Data Cloud
Information Extraction and Linked Data CloudInformation Extraction and Linked Data Cloud
Information Extraction and Linked Data CloudDhaval Thakker
 
Approximation and Self-Organisation on the Web of Data
Approximation and Self-Organisation on the Web of DataApproximation and Self-Organisation on the Web of Data
Approximation and Self-Organisation on the Web of DataKathrin Dentler
 

Similar to Fact forge20 edf (20)

20140521 sem-tech-biz-guest-lecture
20140521 sem-tech-biz-guest-lecture20140521 sem-tech-biz-guest-lecture
20140521 sem-tech-biz-guest-lecture
 
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesReasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
 
Omitola birmingham cityuniv
Omitola birmingham cityunivOmitola birmingham cityuniv
Omitola birmingham cityuniv
 
Europeana datainaction nov2012
Europeana datainaction nov2012Europeana datainaction nov2012
Europeana datainaction nov2012
 
Linking Open Data
Linking Open DataLinking Open Data
Linking Open Data
 
OpenAIRE schirrwagen
OpenAIRE schirrwagenOpenAIRE schirrwagen
OpenAIRE schirrwagen
 
LOD2: Guest presentation: French datalift project
LOD2: Guest presentation: French datalift projectLOD2: Guest presentation: French datalift project
LOD2: Guest presentation: French datalift project
 
Datalift lod2-paris-24032011
Datalift lod2-paris-24032011Datalift lod2-paris-24032011
Datalift lod2-paris-24032011
 
Linked Open Data (LOD) part 3
Linked Open Data (LOD)  part 3Linked Open Data (LOD)  part 3
Linked Open Data (LOD) part 3
 
Linking Open Data with Drupal
Linking Open Data with DrupalLinking Open Data with Drupal
Linking Open Data with Drupal
 
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORELOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
 
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
 
Linked Data Management
Linked Data ManagementLinked Data Management
Linked Data Management
 
121004 linking open_data_with_drupal_v1
121004 linking open_data_with_drupal_v1121004 linking open_data_with_drupal_v1
121004 linking open_data_with_drupal_v1
 
Lod2
Lod2Lod2
Lod2
 
Web Data Management in RDF Age
Web Data Management in RDF AgeWeb Data Management in RDF Age
Web Data Management in RDF Age
 
Lod2
Lod2Lod2
Lod2
 
IC-SDV 2018: Martin Kracker (EPO) Linked Open EP data – a new Product from th...
IC-SDV 2018: Martin Kracker (EPO) Linked Open EP data – a new Product from th...IC-SDV 2018: Martin Kracker (EPO) Linked Open EP data – a new Product from th...
IC-SDV 2018: Martin Kracker (EPO) Linked Open EP data – a new Product from th...
 
Information Extraction and Linked Data Cloud
Information Extraction and Linked Data CloudInformation Extraction and Linked Data Cloud
Information Extraction and Linked Data Cloud
 
Approximation and Self-Organisation on the Web of Data
Approximation and Self-Organisation on the Web of DataApproximation and Self-Organisation on the Web of Data
Approximation and Self-Organisation on the Web of Data
 

More from Mariana Damova, Ph.D

ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваMariana Damova, Ph.D
 
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryGeography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryMariana Damova, Ph.D
 
Startup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionStartup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionMariana Damova, Ph.D
 
Семантични технологии основи
Семантични технологии   основи Семантични технологии   основи
Семантични технологии основи Mariana Damova, Ph.D
 
Startup Europe Week Sofia introduction
Startup Europe Week Sofia introductionStartup Europe Week Sofia introduction
Startup Europe Week Sofia introductionMariana Damova, Ph.D
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Mariana Damova, Ph.D
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Mariana Damova, Ph.D
 
Communication channels for the european single digital market
Communication channels for the european single digital marketCommunication channels for the european single digital market
Communication channels for the european single digital marketMariana Damova, Ph.D
 
Bulgariana europeana27112013 ним
Bulgariana europeana27112013 нимBulgariana europeana27112013 ним
Bulgariana europeana27112013 нимMariana Damova, Ph.D
 
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...Mariana Damova, Ph.D
 
проектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологиипроектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологииMariana Damova, Ph.D
 
семантични технологии основи
семантични технологии   основисемантични технологии   основи
семантични технологии основиMariana Damova, Ph.D
 
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Mariana Damova, Ph.D
 

More from Mariana Damova, Ph.D (20)

ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
 
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryGeography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic Memory
 
Startup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionStartup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - Introduction
 
IndustryInform Service of Mozaika
IndustryInform Service of MozaikaIndustryInform Service of Mozaika
IndustryInform Service of Mozaika
 
Семантични технологии основи
Семантични технологии   основи Семантични технологии   основи
Семантични технологии основи
 
IndustryInform Demo March 2016
IndustryInform Demo March 2016IndustryInform Demo March 2016
IndustryInform Demo March 2016
 
Startup Europe Week Sofia introduction
Startup Europe Week Sofia introductionStartup Europe Week Sofia introduction
Startup Europe Week Sofia introduction
 
Mozaika-Jan2016a
Mozaika-Jan2016aMozaika-Jan2016a
Mozaika-Jan2016a
 
Concordia july2015
Concordia july2015Concordia july2015
Concordia july2015
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23
 
Communication channels for the european single digital market
Communication channels for the european single digital marketCommunication channels for the european single digital market
Communication channels for the european single digital market
 
Bulgariana europeana27112013 ним
Bulgariana europeana27112013 нимBulgariana europeana27112013 ним
Bulgariana europeana27112013 ним
 
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
 
Mozaika june2014
Mozaika june2014Mozaika june2014
Mozaika june2014
 
Europeana in Bulgaria
Europeana in BulgariaEuropeana in Bulgaria
Europeana in Bulgaria
 
Bulgariana europeana02112013
Bulgariana europeana02112013Bulgariana europeana02112013
Bulgariana europeana02112013
 
проектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологиипроектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологии
 
семантични технологии основи
семантични технологии   основисемантични технологии   основи
семантични технологии основи
 
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
 

Recently uploaded

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 

Recently uploaded (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 

Fact forge20 edf

  • 1. FactForge Data Service and the Value of Inferred Knowledge over LOD Mariana Damova, PhD European Open Data Forum June 2012
  • 2. Ontotext – Top-5 provider of core Semantic Technology – Established in year 2000; offices in Bulgaria, UK, USA – Active both in research and commercial projects (FP7 funding for 10 years) • 360° semantic technology – unique portfolio: – Semantic Databases: high-performance RDF DBMS, scalable reasoning – Semantic Search: text-mining (IE), metadata generation, Information Retrieval (IR) – Web Mining: focused crawling, screen scraping, data fusion – Linked Data Management and Data Integration Good recognition in the SemTech community – Ontotext pages are ranked #1 for “semantic annotation” and “semantic repository” at GYM, #3 for “linked data management” at Google Several joint ventures and subsidiaries – Innovantage: leading online recruitment intelligence provider in UK
  • 3. Ontotext Clients (selected) British Broadcasting Corporation (BBC) – Run its World Cup 2010 sites on top of OWLIM – Since Mar’12 BBC Sports and 2012 Olympics sections are driven by OWLIM and a Concept Extraction service developed by Ontotext Press Association (UK) – Analysis of Sports news – Concept extraction – Linked data generation Top-3 USA media (not allowed to name) The National Archives (UK) contracted Ontotext to implement semantic KB and semantic search for the Government Web Archive British Museum (UK) Ontotext leads the development of Phase 3 of ResearchSpace project on collaborative research in cultural heritage; British Museum’s public SPARQL end-point is powered by OWLIM de Bibliothek (Holland) aggregation of data from 150 library databases
  • 4. Linked Open Data is maturing LOD cloud grows by billions of triples yearly Technologies and guidelines about how to produce linked data fast how to assure their quality how to provide vertical oriented data services LOD2, LATC, baseKB European Data Forum June2012 #4
  • 5. This talk is about reasoning and coping with diversity of the data on the web of data European Data Forum June 2012 #5
  • 6. Outline • FactForge (beta) • Reference Layer • Access Modes • Querying – Airports around London – US city – a subject of a Novel – US city – contactInformation • Challenges • Conclusion European Data Forum
  • 7. FactForge (beta) the largest body of heterogeneous general knowledge on which inference has been performed – powered by OWLIM 5.0 – supporting SPARQL 1.1 European Data Forum
  • 8. Datasets REASON-ABLE VIEW of LOD datasets Number of explicit statements: 1,796,673,630 Implicit statements: 1,3 Retrievable statements: 14,928,925,039 CIA FactBook DBpedia 3.7 Freebase NY Times Lexvo Wordnet 3.0 Geonames Lingvoj MusicBrainz materialization is performed with respect to the semantics of OWL-Horst optimized European Data Forum
  • 9. Reference Layer PROTON – light weight upper level ontology ~500 classes, ~150 properties http://www.ontotext.com/proton-ontology Linking at schema level: (1) using rdfs:subClassOf and rdfs:subPropertyOf statements; (2) using OWL expressions where there is a difference in the conceptualization (3) using inference rules if additional individuals are necessary in the repository to support the mapping European Data Forum June 2012 #9
  • 10. Access modes RDF Search - retrieve ranked list of URIs related to literals, which contain specific keywords European Data Forum June 2012 #10
  • 11. Access modes (condt) Exploration - traversing the data, one resource at a time
  • 12. Access modes (condt) Exploration - traversing the data, one resource at a time, inspecting inferred knowledge - locatedIn – Denmark, Northern Europe - Geonames types/FearureCodes (dc:type P.PPL) - parentFeature – Denmeark, Europe … European Data Forum June 2012 #12
  • 13. Access modes (condt) Exploration - traversing the data, one resource at a time, inspecting inferred knowledge - locatedIn - Europe - subRegionOf - Europe - hasContactInfo – website via Freebase - containsLocation … European Data Forum June 2012 #13
  • 14. Access modes (condt) SPARQL endpoint European Data Forum June 2012 #14
  • 15. Access modes (condt) RelFinder European Data Forum June 2012 #15
  • 16. Querying Using LOD concepts SELECT * WHERE { ?Person dbp-ont:birthPlace ?BirthPlace ; rdf:type dbp-ont:Politician ; ?BirthPlace geo-ont:parentFeature dbpedia:Germany . } Using the intermediary layer SELECT * WHERE { ?Person prot:birthPlace ?BirthPlace ; rdf:type prot:Politicianr ; ?BirthPlace prot:subRegionOf dbpedia:Germany . } European Data Forum June 2012
  • 17. Find Airports near London Standard LOD vs. PROTON query 13 vs. 20 results DBpedia vs. DBpedia and Geonames European Data Forum June 2012 #17
  • 18. Find airports near London - Results comparison Using Geospatial index of OWLIM European Data Forum June 2012 #18
  • 19. City – a subject of a science fiction author European Data Forum June 2012 #19
  • 20. OWLIM 5.0 and SPARQL 1.1 Exemplary queries : GROUP BY, min — Minimal and maximal population counts of European countries Federated Query between FactForge and LinkedLifeData — Drugs that cure the disease from which died Alexandre Graham Bell Literal index over dates – World governors in office between 1980 and 2005 Literal index over digits ― European countries with population above 20 MLN Geospatial index — Show the distance from London of airports located at most 50 miles away from it European Data Forum June 2012 #20
  • 21. Challenges and usage • Clean data – Clean up input data • At model level – Contradiction detection – Consistency checking • Curation and upgrading methodology FactForge has been used as data layer infrastructure in FP7 projects, like RENDER FactForge has been used in tasks of linked data generation from unstructured data, metadata enrichment of structured data providing linkage to the entire LOD cloud for example The National Archive of UK EDAMAM - food recommendation app European Data Forum June 2012 #21
  • 22. Acknowledgements Partial funding Colleagues Atanas Kiryakov, CEO of Ontotext Zdravko Tashev, Ontotext Ivan Peikov, Ontotext Rouslan Velkov, Ontotext Kiril Simov, Ontotext Barry Bishop, Ontotext Barry Norton, Ontotext Marin Dimitrov, Ontotext Alex Simov, Ontotext Jordan Dichev, Ontotext Links Konstantin Penchev, Ontotext http://ff-dev.ontotext.com http://www.ontotext.com/owlim http://www.ontotext.com/factforge Email: info@factforge.net European Data Forum June 2012 #22
  • 23. Thank you for your attention! mariana.damova@ontotext.com