Exploiting RDFS and OWL for Integrating Heterogeneous, Large-Scale, Linked Data Corpora Aidan Hogan PhD Viva
Cold Open
Figure 1: Web of Data (explicit data, implicit data)
Topic of thesis: How can consumers tap into the implicit data?
… the micro … Linked Data Principles
… the macro … A Web of Data
Images from: http://richard.cyganiak.de/2007/10/lod/ ; Cyganiak, Jentzsch
(snapshots: August 2007, November 2007, February 2008, March 2008, September 2008, March 2009, July 2009, September 2010)
Take Query Answering …
Gimme webpages relating to Tim Berners-Lee:
  timbl:i foaf:page ?pages .
Heterogeneity in terminology …
webpage properties: foaf:page, foaf:homepage, foaf:isPrimaryTopicOf, foaf:weblog, doap:homepage, foaf:topic, foaf:primaryTopic, mo:musicBrainz, mo:myspace, …
(diagram legend: edges denote rdfs:subPropertyOf and owl:inverseOf)
Linked Data, RDFS and OWL: Linked Vocabularies
Image from: http://blog.dbtune.org/public/.081005_lod_constellation_m.jpg ; Giasson, Bergman
Heterogeneity in naming …
Tim Berners-Lee URIs: timbl:i, dblp:100007, identica:45563, adv:timbl, fb:en.tim_berners-lee, db:Tim-Berners_Lee, …
(diagram legend: edges denote owl:sameAs)
Returning to our Query …
Gimme webpages relating to Tim Berners-Lee:
  timbl:i foaf:page ?pages .
  ...
7 x 6 = 42 possible patterns
properties (7): foaf:page, foaf:homepage, foaf:isPrimaryTopicOf, doap:homepage, foaf:topic, foaf:primaryTopic, mo:myspace
URIs (6): timbl:i, dblp:100007, identica:45563, adv:timbl, fb:en.tim_berners-lee, db:Tim-Berners_Lee
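The "7 x 6 = 42" count on this slide can be reproduced with a short sketch. The property and URI lists are taken from the slide; the function name `expand_patterns` and the split into forward and inverse properties are illustrative assumptions. The split relies on FOAF defining foaf:topic and foaf:primaryTopic as pointing from the page to the thing, so their patterns swap subject and object.

```python
from itertools import product

# Properties from the slide that connect a resource to a webpage.
FORWARD = ["foaf:page", "foaf:homepage", "foaf:isPrimaryTopicOf",
           "doap:homepage", "mo:myspace"]
# Assumption made explicit: these two FOAF properties run page -> thing,
# so the variable goes in the subject position.
INVERSE = ["foaf:topic", "foaf:primaryTopic"]

# The six URIs the slide lists for Tim Berners-Lee.
URIS = ["timbl:i", "dblp:100007", "identica:45563", "adv:timbl",
        "fb:en.tim_berners-lee", "db:Tim-Berners_Lee"]


def expand_patterns(uris, forward, inverse, var="?pages"):
    """Enumerate one triple pattern per (URI, property) combination."""
    patterns = []
    for uri, prop in product(uris, forward + inverse):
        if prop in inverse:
            patterns.append(f"{var} {prop} {uri} .")
        else:
            patterns.append(f"{uri} {prop} {var} .")
    return patterns


patterns = expand_patterns(URIS, FORWARD, INVERSE)
print(len(patterns))
```

Without reasoning or consolidation, a consumer would have to issue all of these patterns (and know every property and alias in advance) to get complete answers.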
Linked Data Reasoning: Challenges
Scalable, Expressive, Robust, Domain-Agnostic
Web Reasoning: Forward Chaining! One size does not fit all! Don't materialise too much!
Our Approach
Scalable Reasoning: In-mem T-Box
Scalable Reasoning: Two Scans
Scalable Reasoning: No A-Box Joins
ON-DISK A-BOX: ... ex:me foaf:homepage ex:hp . ...
IN-MEM T-BOX
ON-DISK OUTPUT: ... ex:hp rdf:type foaf:Document . ex:me foaf:page ex:hp . ex:hp foaf:topic ex:me . ...
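The idea on this slide (terminology held in memory, instance triples streamed one at a time, no joins between instance triples) can be sketched as follows. This is a minimal illustration, not the thesis implementation: the three T-Box dictionaries and the function names `infer` and `reason` are assumptions, with the axioms chosen so that the slide's input triple yields the slide's three output triples.

```python
# Hypothetical in-memory T-Box (illustrative axioms only).
SUB_PROPERTY = {"foaf:homepage": ["foaf:page"]}          # rdfs:subPropertyOf
INVERSE_OF = {"foaf:page": ["foaf:topic"],
              "foaf:topic": ["foaf:page"]}               # owl:inverseOf
RANGE = {"foaf:homepage": ["foaf:Document"]}             # rdfs:range


def infer(triple):
    """Return the triples entailed by ONE A-Box triple plus the T-Box.

    Every rule has a single A-Box premise, so no other instance triple
    is ever consulted (no A-Box joins).
    """
    seen = {triple}
    todo = [triple]
    while todo:
        s, p, o = todo.pop()
        new = [(s, q, o) for q in SUB_PROPERTY.get(p, [])]
        new += [(o, q, s) for q in INVERSE_OF.get(p, [])]
        new += [(o, "rdf:type", c) for c in RANGE.get(p, [])]
        for t in new:
            if t not in seen:
                seen.add(t)
                todo.append(t)
    seen.discard(triple)
    return seen


def reason(abox):
    """Stream over the A-Box, emitting inferences triple by triple."""
    for triple in abox:
        yield from sorted(infer(triple))


abox = [("ex:me", "foaf:homepage", "ex:hp")]
output = list(reason(abox))
for s, p, o in output:
    print(s, p, o, ".")
```

Because each input triple is processed independently, the A-Box can be read from disk as a stream (here a plain list stands in for it) and the output written straight back to disk.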
Scalable Reasoning: A-Box joins?
Authoritative Reasoning
Survey of terminology: counts
Survey of terminology: ranks
Scalable Distributed Reasoning
[Diagram: five machines, each holding a different A-Box segment (DIFF. A-BOX). Stages shown: EXTRACT T-BOX on each machine, COLLECT T-BOX, then every machine reasons with the SAME T-BOX over its own segment and writes LOCAL OUTPUT. Sample input triple: ex:me ex:presented ex:ThisTalk . Sample output triple: ex:me rdf:type ex:Awesome .]
  Reasoning Performance (1 machine)
Reasoning Performance: Distrib.
  Reasoning: Results 962 million  unique/novel triples 947 million unique triples
Annotated Reasoning
Annotated Reasoning: ranks
Why?
Inconsistencies: aka. Contradictions
Cannot compute…
Fixing inconsistencies
Inconsistencies found
Performance
Consolidation for Linked Data
Consolidation: Baseline
Canonicalisation
Before:
  timbl:i rdf:type foaf:Person .
  identica:48404 foaf:knows identica:45563 .
  dbpedia:Berners-Lee dpo:birthDate "1955-06-08"^^xsd:date .
After:
  dbpedia:Berners-Lee rdf:type foaf:Person .
  identica:48404 foaf:knows dbpedia:Berners-Lee .
  dbpedia:Berners-Lee dpo:birthDate "1955-06-08"^^xsd:date .
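The canonicalisation step shown on this slide can be sketched with a union-find over owl:sameAs pairs. Several things here are assumptions for illustration: the two sameAs pairs, the function names, the choice of the lexicographically smallest identifier as canonical (which happens to pick dbpedia:Berners-Lee, as on the slide), and the decision to rewrite only subject and object positions.

```python
def build_canonical_map(same_as_pairs):
    """Group identifiers linked by owl:sameAs and pick one per group.

    Assumption: the canonical identifier is the lexicographically
    smallest member of each equivalence class.
    """
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in same_as_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            low, high = sorted((root_a, root_b))
            parent[high] = low  # smaller identifier stays the root
    return {x: find(x) for x in list(parent)}


def canonicalise(triples, canon):
    """Rewrite subjects and objects to their canonical identifier."""
    return [(canon.get(s, s), p, canon.get(o, o)) for s, p, o in triples]


# Hypothetical owl:sameAs pairs linking the three identifiers.
same_as = [("timbl:i", "identica:45563"),
           ("identica:45563", "dbpedia:Berners-Lee")]

data = [
    ("timbl:i", "rdf:type", "foaf:Person"),
    ("identica:48404", "foaf:knows", "identica:45563"),
    ("dbpedia:Berners-Lee", "dpo:birthDate", '"1955-06-08"^^xsd:date'),
]

canon = build_canonical_map(same_as)
consolidated = canonicalise(data, canon)
for s, p, o in consolidated:
    print(s, p, o, ".")
```

After this rewrite, a single pattern over the canonical identifier matches data originally published under any of the aliases.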
Baseline Consolidation: Performance
Baseline Consolidation: Results
Extended Consolidation
Derive owl:sameAs on-disk
Extended Consolidation: Performance
Extended Consolidation: Results
  timbl:i foaf:page ?pages .
  dbpedia:Berners-Lee foaf:page ?pages .
Conclusions
