SlideShare uma empresa Scribd logo
1 de 19
Baixar para ler offline
The Semantic Web –
A Vision Come True, or Giving Up
the Great Plan?
Martin Hepp, @mfhepp
mfhepp@gmail.com
Semantic Web: A Decade of Achievement?
•  Linked Open Data Cloud
•  Schema.org
•  Google Knowledge Graph
•  Bing Sartori
•  Linked Data in Libraries
•  Linked Data in Public Data Initiatives
•  Etc.
Semantic Web and Linked Data Success Stories
http://www.heppresearch.com2
The LOD Cloud
A hard-wired, small-scale data integration project with no quality of service
guarantees.
http://www.heppresearch.com3
Linking Open Data cloud diagram 2014, by Max Schmachtenberg, Christian Bizer, Anja Jentzsch and Richard Cyganiak. http://lod-cloud.net/
Web Data Commons
A pretty outdated RDF representation of information extracted from a biased
sample of popular Web pages, missing a lot of data in deep detail pages.
http://www.heppresearch.com4
2015-04-02: RDFa, Microdata, and Microformat data sets extracted from the
December 2014 Common Crawl corpus available for download.
The Old Testament of the Semantic Web
http://www.heppresearch.com5
Mostly WHAT a better Web should allow
§  Computers should be able to help us
process information from the Web
The New Testament of the Semantic Web
http://www.heppresearch.com6
Detailed technical assumptions about the HOW
§  Widely driven by applying principles
from small-scale, controlled settings
to the Web.
§  Need for extensions of old paradigms
acknowledged.
§  But fundamental question of match
between paradigms and ecosystem
largely unchallenged.
The Modern Sects and their Cults
http://www.heppresearch.com7
Turned assumptions and drafts into laws
§  Linked Data Principles
–  URIs over strings
§  Entity identifiers
§  Qualitative values
(enumerations)
–  Page vs. Entity / Conneg /
Redirects
–  Open Licenses
§  SPARQL endpoints
§  Reuse visible content in RDFa and
Microdata Berner-Lee, Tim: Linked Data,
http://www.w3.org/DesignIssues/LinkedData.html
An now they fight a useless war over the details of their
interpretation…
http://www.heppresearch.com8
3rd Commandment: Thou shalt not make unto thee any graven image
§  Exodus 20:4-6
§  Minimal ontological commitment, folks!
§  Occam's razor
§  Ludwig Wittgenstein: Tractatus Logico-Philosophicus:
–  “Occam's Razor is, of course, not an arbitrary rule nor one justified by its practical success. It
simply says that unnecessary elements in a symbolism mean nothing. Signs which serve one
purpose are logically equivalent; signs which serve no purpose are logically meaningless.” (*)
Image Credit: PD, https://en.wikipedia.org/?title=Crusades#/media/File:Albigensian_Crusade_01.jpg
(*) Taken from https://en.wikipedia.org/wiki/Occam's_razor#Ludwig_Wittgenstein
What is schema.org? What is GoodRelations?
1. Official Characterization
2. Purpose:
§  Focus on information extraction on the Web
§  Other uses as a by-product
3. Knowledge Representation Perspective
§  Entity Types
§  Relationship Types
§  Weak Domain / Range Semantics
§  Syntax-independent Meta-Model
And how are they related?
Questions? Suggestions? Contact me at @mfhepp!9
Official Characterization from http://schema.org
Questions? Suggestions? Contact me at @mfhepp!10
This site provides a collection of schemas that webmasters can use
to markup HTML pages in ways recognized by major
search providers, and that can also be used for
structured data interoperability (e.g. in JSON). Search
engines including Bing, Google, Yahoo! and Yandex rely on this markup to improve the
display of search results, making it easier for people to find the right Web pages.
Many sites are generated from structured data, which is often stored in
databases. When this data is formatted into HTML, it becomes very difficult to
recover the original structured data. Many applications, especially
search engines, can benefit greatly from direct access to this structured
data. On-page markup enables search engines to understand the information on web
pages and provide richer search results in order to make it easier for users to find
relevant information on the web. Markup can also enable new tools
and applications that make use of the structure.
Overview and Motivation: There is REAL Momentum
Questions? Suggestions? Contact me at @mfhepp!11
A lot of data
§  Since 2011, schema.org has been added to >25% of top-ranked e-
commerce sites product detail pages.
§  RDF-based representations are specified.
Table: Random sample of n=73 product detail pages from high-ranking Google results.
Note that these numbers have a strong bias towards popular, professionally operated sites.
Schema.org: A Data Publication Ontology
Questions? Suggestions? Contact me at @mfhepp!12
Not designed for raw data consumption (only as a by-product)
§  Historically, ontologies in computer
science aimed at harmonizing the
conceptualization and representation
of data for publishers and consumers
of the data.
§  Implicit goal of the traditional Semantic
Web stack: More or less, consumption
of raw data.
§  This requires detailed consensus on
the level of data granularity and data
semantics at scale, and high data
quality.
§  Schema.org does not make this
assumption, since its sponsors have
the power to work on semi-structured
data at Web scale.
Ontologyschema.org
Schema.org: The Semantic Web Vision Come True?
1. No OWL. Not even an ontology in the narrow sense.
2. Direct consumption difficult
§  Crawling
§  Cleansing
§  Lifting
3. No broad use of Linked Data principles
§  Mostly no global entity identifiers
§  Page = Entity (vs. httpRange-14)
§  No vocabulary reuse (*)
Likely not what the Semantic Web community had hoped for.
Questions? Suggestions? Contact me at @mfhepp!13
Web Ontology Engineering Patterns
1. Dynamic Degree of Disambiguation
2. Dynamic Data Granularity
3. Sweet Spots Rule
§  Distinctions that can be populated reliably and with little
effort
§  Distinctions that are hard to reconstruct by the recipient
Hepp (2015, forthcoming)
http://www.heppresearch.com14
The Fallacy of Raw Consumption of Web Data
http://www.heppresearch.com15
Naïve Type Membership Interpretation: SPARQL
# Find former STI members who are professors
PREFIX dbpedia-owl: <http://dbpedia.org/ontology/>
SELECT * {?s a dbpedia-owl:Professor} LIMIT 100
Naïve Type Membership Interpretation: SPARQL
http://www.heppresearch.com17
Find all professors from Web markup
<html prefix="schema: http://schema.org/!
dbpedia: http://dbpedia.org/ontology/">!
<!-- .. -->!
<div typeOf="schema:Person dbpedia:Professor" about="#person">!
<span property="schema:honorificPrefix">Prof. Dr.</span>&nbsp;!
<span property="schema:givenName">Zaphod</span>!
<span property="schema:familyname">Beeblebrox</span>!
</div>!
</html>
Type Membership as a Machine Learning Problem
http://www.heppresearch.com18
Supervised Learning: Logistic Regression
§  Input:
–  Entity e
–  Type t
–  Origin (Graph / Domain / URI) o
–  Optional: Properties and property values [(p1,v1), (p2,v2),…]
§  Output
–  t’(e) = f(e, t, o)
–  p(t(e) == True)
Example data:
(http://www.acme.org/, …#person, http://schema.org/EducationEvent)
(http://munich.eventful.com/, …#event1, http://schema.org/MusicEvent)
Hepp (2015b, forthcoming)
Let’s Do Science, not Cult!
http://www.heppresearch.com19
§  Challenge paradigms and
approaches
§  Use hard data, not beliefs and
assumptions (neither your own ones
nor the ones inherited from the old
folks)
CC BY-SA 3.0 / Nicor / https://en.wikipedia.org/wiki/North_Korea's_cult_of_personality#/
media/File:Mansudae_Grand_Monument_08.JPG
Thank you.
http://www.heppresearch.com20
HEPP RESEARCH GmbH
Prof. Dr. Martin Hepp, CEO
Contact us!
Kuppelnaustrasse 5
88212 Ravensburg, Germany
Phone +49 751 2708 5256-0
Fax +49 751 2708 5256-9
www.heppresearch.com
contact@heppresearch.com

Mais conteúdo relacionado

Mais procurados

Web Site Visibility in the Giant Graph of Commerce Data
Web Site Visibility in the Giant Graph of Commerce DataWeb Site Visibility in the Giant Graph of Commerce Data
Web Site Visibility in the Giant Graph of Commerce DataMartin Hepp
 
Open hpi semweb-06-part4
Open hpi semweb-06-part4Open hpi semweb-06-part4
Open hpi semweb-06-part4Nadine Ludwig
 
Semantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataSemantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataMatthew Rowe
 
GoodRelations Tutorial Part 1
GoodRelations Tutorial Part 1GoodRelations Tutorial Part 1
GoodRelations Tutorial Part 1guestecacad2
 
Searching the Web of Data (Tutorial)
Searching the Web of Data (Tutorial)Searching the Web of Data (Tutorial)
Searching the Web of Data (Tutorial)Gerard de Melo
 
Linked Data Integration and semantic web
Linked Data Integration and semantic webLinked Data Integration and semantic web
Linked Data Integration and semantic webDiego Pessoa
 
Open hpi semweb-06-part5
Open hpi semweb-06-part5Open hpi semweb-06-part5
Open hpi semweb-06-part5Nadine Ludwig
 
Introduction to data mining
Introduction to data miningIntroduction to data mining
Introduction to data miningTaha Mokfi
 
Brief Introduction to Linked Data
Brief Introduction to Linked DataBrief Introduction to Linked Data
Brief Introduction to Linked DataRobert Sanderson
 
Semantic search Bill Slawski DEEP SEA Con
Semantic search Bill Slawski DEEP SEA ConSemantic search Bill Slawski DEEP SEA Con
Semantic search Bill Slawski DEEP SEA ConBill Slawski
 
Tutorial@BDA 2017 -- Knowledge Graph Expansion and Enrichment
Tutorial@BDA 2017 -- Knowledge Graph Expansion and Enrichment Tutorial@BDA 2017 -- Knowledge Graph Expansion and Enrichment
Tutorial@BDA 2017 -- Knowledge Graph Expansion and Enrichment Paris Sud University
 
GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2guestecacad2
 
Wimmics Overview 2021
Wimmics Overview 2021Wimmics Overview 2021
Wimmics Overview 2021Fabien Gandon
 
Creating knowledge out of interlinked data
Creating knowledge out of interlinked dataCreating knowledge out of interlinked data
Creating knowledge out of interlinked dataSören Auer
 
Open hpi semweb-06-part8
Open hpi semweb-06-part8Open hpi semweb-06-part8
Open hpi semweb-06-part8Nadine Ludwig
 
Knowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based SearchKnowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based SearchNeo4j
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudOntotext
 

Mais procurados (19)

Web Site Visibility in the Giant Graph of Commerce Data
Web Site Visibility in the Giant Graph of Commerce DataWeb Site Visibility in the Giant Graph of Commerce Data
Web Site Visibility in the Giant Graph of Commerce Data
 
Introduction to Big Data Technologies
Introduction to Big Data TechnologiesIntroduction to Big Data Technologies
Introduction to Big Data Technologies
 
Open hpi semweb-06-part4
Open hpi semweb-06-part4Open hpi semweb-06-part4
Open hpi semweb-06-part4
 
Semantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataSemantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic Data
 
GoodRelations Tutorial Part 1
GoodRelations Tutorial Part 1GoodRelations Tutorial Part 1
GoodRelations Tutorial Part 1
 
Searching the Web of Data (Tutorial)
Searching the Web of Data (Tutorial)Searching the Web of Data (Tutorial)
Searching the Web of Data (Tutorial)
 
Linked Data Integration and semantic web
Linked Data Integration and semantic webLinked Data Integration and semantic web
Linked Data Integration and semantic web
 
Open hpi semweb-06-part5
Open hpi semweb-06-part5Open hpi semweb-06-part5
Open hpi semweb-06-part5
 
Introduction to data mining
Introduction to data miningIntroduction to data mining
Introduction to data mining
 
Brief Introduction to Linked Data
Brief Introduction to Linked DataBrief Introduction to Linked Data
Brief Introduction to Linked Data
 
Semantic search Bill Slawski DEEP SEA Con
Semantic search Bill Slawski DEEP SEA ConSemantic search Bill Slawski DEEP SEA Con
Semantic search Bill Slawski DEEP SEA Con
 
Tutorial@BDA 2017 -- Knowledge Graph Expansion and Enrichment
Tutorial@BDA 2017 -- Knowledge Graph Expansion and Enrichment Tutorial@BDA 2017 -- Knowledge Graph Expansion and Enrichment
Tutorial@BDA 2017 -- Knowledge Graph Expansion and Enrichment
 
GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2
 
Wimmics Overview 2021
Wimmics Overview 2021Wimmics Overview 2021
Wimmics Overview 2021
 
Creating knowledge out of interlinked data
Creating knowledge out of interlinked dataCreating knowledge out of interlinked data
Creating knowledge out of interlinked data
 
Open hpi semweb-06-part8
Open hpi semweb-06-part8Open hpi semweb-06-part8
Open hpi semweb-06-part8
 
Knowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based SearchKnowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based Search
 
Web 3.0 Emerging
Web 3.0 EmergingWeb 3.0 Emerging
Web 3.0 Emerging
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
 

Destaque

Scottish Public Opinion Monitor - May 2013
Scottish Public Opinion Monitor - May 2013Scottish Public Opinion Monitor - May 2013
Scottish Public Opinion Monitor - May 2013Ipsos UK
 
Design Thinking for Advanced Manufacturing _ Industry Recommendations_Dec 2014
Design Thinking for Advanced Manufacturing _ Industry Recommendations_Dec 2014 Design Thinking for Advanced Manufacturing _ Industry Recommendations_Dec 2014
Design Thinking for Advanced Manufacturing _ Industry Recommendations_Dec 2014 Jane Cockburn
 
De Empreendedor à Empresário
De Empreendedor à EmpresárioDe Empreendedor à Empresário
De Empreendedor à EmpresárioFernando Tomé
 
Big Data World presentation - Sep. 2014
Big Data World presentation - Sep. 2014Big Data World presentation - Sep. 2014
Big Data World presentation - Sep. 2014Wing Yuen Loon
 
Open Rubrics and The Semantic Web: Open Ed 2010
Open Rubrics and The Semantic Web: Open Ed 2010Open Rubrics and The Semantic Web: Open Ed 2010
Open Rubrics and The Semantic Web: Open Ed 2010bpanulla
 
Ux och design som konverterar del 1
Ux och design som konverterar   del 1Ux och design som konverterar   del 1
Ux och design som konverterar del 1Wipcore
 
An ontologyforopenrubricexchangeontheweb
An ontologyforopenrubricexchangeonthewebAn ontologyforopenrubricexchangeontheweb
An ontologyforopenrubricexchangeonthewebbpanulla
 
Actielijst 201304
Actielijst 201304Actielijst 201304
Actielijst 201304Kees Dekker
 
淺談HTTP發展趨勢與SPDY
淺談HTTP發展趨勢與SPDY淺談HTTP發展趨勢與SPDY
淺談HTTP發展趨勢與SPDYBilly Yang
 
The Tablet Project: Charts
The Tablet Project: ChartsThe Tablet Project: Charts
The Tablet Project: ChartsNewsworks
 
Tutorial avanzado wordle
Tutorial avanzado wordleTutorial avanzado wordle
Tutorial avanzado wordleAníbal Rossi
 

Destaque (16)

1 spattern matching using biometric techniques
1 spattern matching using biometric techniques1 spattern matching using biometric techniques
1 spattern matching using biometric techniques
 
Scottish Public Opinion Monitor - May 2013
Scottish Public Opinion Monitor - May 2013Scottish Public Opinion Monitor - May 2013
Scottish Public Opinion Monitor - May 2013
 
Design Thinking for Advanced Manufacturing _ Industry Recommendations_Dec 2014
Design Thinking for Advanced Manufacturing _ Industry Recommendations_Dec 2014 Design Thinking for Advanced Manufacturing _ Industry Recommendations_Dec 2014
Design Thinking for Advanced Manufacturing _ Industry Recommendations_Dec 2014
 
De Empreendedor à Empresário
De Empreendedor à EmpresárioDe Empreendedor à Empresário
De Empreendedor à Empresário
 
RASPBIAN JESSIE WITH PIXEL
RASPBIAN JESSIE WITH PIXELRASPBIAN JESSIE WITH PIXEL
RASPBIAN JESSIE WITH PIXEL
 
Big Data World presentation - Sep. 2014
Big Data World presentation - Sep. 2014Big Data World presentation - Sep. 2014
Big Data World presentation - Sep. 2014
 
Open Rubrics and The Semantic Web: Open Ed 2010
Open Rubrics and The Semantic Web: Open Ed 2010Open Rubrics and The Semantic Web: Open Ed 2010
Open Rubrics and The Semantic Web: Open Ed 2010
 
Ux och design som konverterar del 1
Ux och design som konverterar   del 1Ux och design som konverterar   del 1
Ux och design som konverterar del 1
 
An ontologyforopenrubricexchangeontheweb
An ontologyforopenrubricexchangeonthewebAn ontologyforopenrubricexchangeontheweb
An ontologyforopenrubricexchangeontheweb
 
Actielijst 201304
Actielijst 201304Actielijst 201304
Actielijst 201304
 
Van Godtsenhoven Karen
Van Godtsenhoven KarenVan Godtsenhoven Karen
Van Godtsenhoven Karen
 
淺談HTTP發展趨勢與SPDY
淺談HTTP發展趨勢與SPDY淺談HTTP發展趨勢與SPDY
淺談HTTP發展趨勢與SPDY
 
The Tablet Project: Charts
The Tablet Project: ChartsThe Tablet Project: Charts
The Tablet Project: Charts
 
CUE one pager
CUE one pagerCUE one pager
CUE one pager
 
Tutorial avanzado wordle
Tutorial avanzado wordleTutorial avanzado wordle
Tutorial avanzado wordle
 
Top 7 Features in an LMS
Top 7 Features in an LMSTop 7 Features in an LMS
Top 7 Features in an LMS
 

Semelhante a The Semantic Web – A Vision Come True, or Giving Up the Great Plan?

Introduction to APIs and Linked Data
Introduction to APIs and Linked DataIntroduction to APIs and Linked Data
Introduction to APIs and Linked DataAdrian Stevenson
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commonsJesse Wang
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudPeter Haase
 
Building Satori: Web Data Extraction On Hadoop
Building Satori: Web Data Extraction On HadoopBuilding Satori: Web Data Extraction On Hadoop
Building Satori: Web Data Extraction On HadoopNikolai Avteniev
 
DataEngConf: Building Satori, a Hadoop toll for Data Extraction at LinkedIn
DataEngConf: Building Satori, a Hadoop toll for Data Extraction at LinkedInDataEngConf: Building Satori, a Hadoop toll for Data Extraction at LinkedIn
DataEngConf: Building Satori, a Hadoop toll for Data Extraction at LinkedInHakka Labs
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madnesssemanticsconference
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data GenerationFilip Radulovic
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareIMC Technologies
 
Linked Data and Libraries: What? Why? How?
Linked Data and Libraries: What? Why? How?Linked Data and Libraries: What? Why? How?
Linked Data and Libraries: What? Why? How?Emily Nimsakont
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13Kristi Holmes
 
Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...
Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...
Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...IOSR Journals
 
How links can make your open data even greater
How links can make your open data even greaterHow links can make your open data even greater
How links can make your open data even greaterCristina Sarasua
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph IntroductionSören Auer
 
FAIR data: LOUD for all audiences
FAIR data: LOUD for all audiencesFAIR data: LOUD for all audiences
FAIR data: LOUD for all audiencesAlessandro Adamou
 
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...giuseppe_futia
 

Semelhante a The Semantic Web – A Vision Come True, or Giving Up the Great Plan? (20)

Linked Data and Semantic Web Application Development by Peter Haase
Linked Data and Semantic Web Application Development by Peter HaaseLinked Data and Semantic Web Application Development by Peter Haase
Linked Data and Semantic Web Application Development by Peter Haase
 
Introduction to APIs and Linked Data
Introduction to APIs and Linked DataIntroduction to APIs and Linked Data
Introduction to APIs and Linked Data
 
The Future of LOD
The Future of LODThe Future of LOD
The Future of LOD
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commons
 
Lawless-3-jun15
Lawless-3-jun15Lawless-3-jun15
Lawless-3-jun15
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
 
Building Satori: Web Data Extraction On Hadoop
Building Satori: Web Data Extraction On HadoopBuilding Satori: Web Data Extraction On Hadoop
Building Satori: Web Data Extraction On Hadoop
 
DataEngConf: Building Satori, a Hadoop toll for Data Extraction at LinkedIn
DataEngConf: Building Satori, a Hadoop toll for Data Extraction at LinkedInDataEngConf: Building Satori, a Hadoop toll for Data Extraction at LinkedIn
DataEngConf: Building Satori, a Hadoop toll for Data Extraction at LinkedIn
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madness
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data Generation
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
Linked Data
Linked DataLinked Data
Linked Data
 
Linked Data and Libraries: What? Why? How?
Linked Data and Libraries: What? Why? How?Linked Data and Libraries: What? Why? How?
Linked Data and Libraries: What? Why? How?
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13
 
Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...
Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...
Advance Frameworks for Hidden Web Retrieval Using Innovative Vision-Based Pag...
 
Cognitive data
Cognitive dataCognitive data
Cognitive data
 
How links can make your open data even greater
How links can make your open data even greaterHow links can make your open data even greater
How links can make your open data even greater
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph Introduction
 
FAIR data: LOUD for all audiences
FAIR data: LOUD for all audiencesFAIR data: LOUD for all audiences
FAIR data: LOUD for all audiences
 
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
 

Mais de Martin Hepp

Extending schema.org with GoodRelations and www.productontology.org
Extending schema.org with GoodRelations and www.productontology.orgExtending schema.org with GoodRelations and www.productontology.org
Extending schema.org with GoodRelations and www.productontology.orgMartin Hepp
 
The Semantic Web and its Impact on International Websites
The Semantic Web and its Impact on International WebsitesThe Semantic Web and its Impact on International Websites
The Semantic Web and its Impact on International WebsitesMartin Hepp
 
KRDB2010-GoodRelations
KRDB2010-GoodRelationsKRDB2010-GoodRelations
KRDB2010-GoodRelationsMartin Hepp
 
ISKO 2010: Linked Data in E-Commerce – The GoodRelations Ontology
ISKO 2010: Linked Data in E-Commerce – The GoodRelations OntologyISKO 2010: Linked Data in E-Commerce – The GoodRelations Ontology
ISKO 2010: Linked Data in E-Commerce – The GoodRelations OntologyMartin Hepp
 
ISKO2010: Linked Data in E-Commerce – The GoodRelations Ontology
ISKO2010: Linked Data in E-Commerce – The GoodRelations OntologyISKO2010: Linked Data in E-Commerce – The GoodRelations Ontology
ISKO2010: Linked Data in E-Commerce – The GoodRelations OntologyMartin Hepp
 
Goodrelations semtech2010
Goodrelations semtech2010Goodrelations semtech2010
Goodrelations semtech2010Martin Hepp
 
SEO, RDFa, and GoodRelations: An Implementation by a Major Online Retailer
SEO, RDFa, and GoodRelations: An Implementation by a Major Online RetailerSEO, RDFa, and GoodRelations: An Implementation by a Major Online Retailer
SEO, RDFa, and GoodRelations: An Implementation by a Major Online RetailerMartin Hepp
 
SEO, RDFa, and GoodRelations - An Implementation by a Major Online Retailer
SEO, RDFa, and GoodRelations - An Implementation by a Major Online RetailerSEO, RDFa, and GoodRelations - An Implementation by a Major Online Retailer
SEO, RDFa, and GoodRelations - An Implementation by a Major Online RetailerMartin Hepp
 
Goodrelations Presentation from SemTech 2010
Goodrelations Presentation from SemTech 2010Goodrelations Presentation from SemTech 2010
Goodrelations Presentation from SemTech 2010Martin Hepp
 
Web Page Optimization for Facebook
Web Page Optimization for FacebookWeb Page Optimization for Facebook
Web Page Optimization for FacebookMartin Hepp
 
ISWC GoodRelations Tutorial Part 1
ISWC GoodRelations Tutorial Part 1ISWC GoodRelations Tutorial Part 1
ISWC GoodRelations Tutorial Part 1Martin Hepp
 
ISWC GoodRelations Tutorial Part 3
ISWC GoodRelations Tutorial Part 3ISWC GoodRelations Tutorial Part 3
ISWC GoodRelations Tutorial Part 3Martin Hepp
 
ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2Martin Hepp
 
ISWC GoodRelations Tutorial Part 4
ISWC GoodRelations Tutorial Part 4ISWC GoodRelations Tutorial Part 4
ISWC GoodRelations Tutorial Part 4Martin Hepp
 
Web 3.0. für Spezialversender
Web 3.0. für Spezialversender Web 3.0. für Spezialversender
Web 3.0. für Spezialversender Martin Hepp
 
eCl@ss im Web: Mehr Kunden und bessere Stammdaten für jeden eCl@ss-Anwender
eCl@ss im Web: Mehr Kunden und bessere Stammdaten für jeden eCl@ss-AnwendereCl@ss im Web: Mehr Kunden und bessere Stammdaten für jeden eCl@ss-Anwender
eCl@ss im Web: Mehr Kunden und bessere Stammdaten für jeden eCl@ss-AnwenderMartin Hepp
 
Product Variety, Consumer Preferences, and Web Technology: Can the Web of Dat...
Product Variety, Consumer Preferences, and Web Technology: Can the Web of Dat...Product Variety, Consumer Preferences, and Web Technology: Can the Web of Dat...
Product Variety, Consumer Preferences, and Web Technology: Can the Web of Dat...Martin Hepp
 
Deep Comparison Shopping
Deep Comparison ShoppingDeep Comparison Shopping
Deep Comparison ShoppingMartin Hepp
 
Semantic Web-based E-Commerce: The GoodRelations Ontology
Semantic Web-based E-Commerce: The GoodRelations OntologySemantic Web-based E-Commerce: The GoodRelations Ontology
Semantic Web-based E-Commerce: The GoodRelations OntologyMartin Hepp
 
myOntology: Community-driven Vocabulary Design and Maintenance for E-Commerce
myOntology: Community-driven Vocabulary Design and Maintenance for E-CommercemyOntology: Community-driven Vocabulary Design and Maintenance for E-Commerce
myOntology: Community-driven Vocabulary Design and Maintenance for E-CommerceMartin Hepp
 

Mais de Martin Hepp (20)

Extending schema.org with GoodRelations and www.productontology.org
Extending schema.org with GoodRelations and www.productontology.orgExtending schema.org with GoodRelations and www.productontology.org
Extending schema.org with GoodRelations and www.productontology.org
 
The Semantic Web and its Impact on International Websites
The Semantic Web and its Impact on International WebsitesThe Semantic Web and its Impact on International Websites
The Semantic Web and its Impact on International Websites
 
KRDB2010-GoodRelations
KRDB2010-GoodRelationsKRDB2010-GoodRelations
KRDB2010-GoodRelations
 
ISKO 2010: Linked Data in E-Commerce – The GoodRelations Ontology
ISKO 2010: Linked Data in E-Commerce – The GoodRelations OntologyISKO 2010: Linked Data in E-Commerce – The GoodRelations Ontology
ISKO 2010: Linked Data in E-Commerce – The GoodRelations Ontology
 
ISKO2010: Linked Data in E-Commerce – The GoodRelations Ontology
ISKO2010: Linked Data in E-Commerce – The GoodRelations OntologyISKO2010: Linked Data in E-Commerce – The GoodRelations Ontology
ISKO2010: Linked Data in E-Commerce – The GoodRelations Ontology
 
Goodrelations semtech2010
Goodrelations semtech2010Goodrelations semtech2010
Goodrelations semtech2010
 
SEO, RDFa, and GoodRelations: An Implementation by a Major Online Retailer
SEO, RDFa, and GoodRelations: An Implementation by a Major Online RetailerSEO, RDFa, and GoodRelations: An Implementation by a Major Online Retailer
SEO, RDFa, and GoodRelations: An Implementation by a Major Online Retailer
 
SEO, RDFa, and GoodRelations - An Implementation by a Major Online Retailer
SEO, RDFa, and GoodRelations - An Implementation by a Major Online RetailerSEO, RDFa, and GoodRelations - An Implementation by a Major Online Retailer
SEO, RDFa, and GoodRelations - An Implementation by a Major Online Retailer
 
Goodrelations Presentation from SemTech 2010
Goodrelations Presentation from SemTech 2010Goodrelations Presentation from SemTech 2010
Goodrelations Presentation from SemTech 2010
 
Web Page Optimization for Facebook
Web Page Optimization for FacebookWeb Page Optimization for Facebook
Web Page Optimization for Facebook
 
ISWC GoodRelations Tutorial Part 1
ISWC GoodRelations Tutorial Part 1ISWC GoodRelations Tutorial Part 1
ISWC GoodRelations Tutorial Part 1
 
ISWC GoodRelations Tutorial Part 3
ISWC GoodRelations Tutorial Part 3ISWC GoodRelations Tutorial Part 3
ISWC GoodRelations Tutorial Part 3
 
ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2
 
ISWC GoodRelations Tutorial Part 4
ISWC GoodRelations Tutorial Part 4ISWC GoodRelations Tutorial Part 4
ISWC GoodRelations Tutorial Part 4
 
Web 3.0. für Spezialversender
Web 3.0. für Spezialversender Web 3.0. für Spezialversender
Web 3.0. für Spezialversender
 
eCl@ss im Web: Mehr Kunden und bessere Stammdaten für jeden eCl@ss-Anwender
eCl@ss im Web: Mehr Kunden und bessere Stammdaten für jeden eCl@ss-AnwendereCl@ss im Web: Mehr Kunden und bessere Stammdaten für jeden eCl@ss-Anwender
eCl@ss im Web: Mehr Kunden und bessere Stammdaten für jeden eCl@ss-Anwender
 
Product Variety, Consumer Preferences, and Web Technology: Can the Web of Dat...
Product Variety, Consumer Preferences, and Web Technology: Can the Web of Dat...Product Variety, Consumer Preferences, and Web Technology: Can the Web of Dat...
Product Variety, Consumer Preferences, and Web Technology: Can the Web of Dat...
 
Deep Comparison Shopping
Deep Comparison ShoppingDeep Comparison Shopping
Deep Comparison Shopping
 
Semantic Web-based E-Commerce: The GoodRelations Ontology
Semantic Web-based E-Commerce: The GoodRelations OntologySemantic Web-based E-Commerce: The GoodRelations Ontology
Semantic Web-based E-Commerce: The GoodRelations Ontology
 
myOntology: Community-driven Vocabulary Design and Maintenance for E-Commerce
myOntology: Community-driven Vocabulary Design and Maintenance for E-CommercemyOntology: Community-driven Vocabulary Design and Maintenance for E-Commerce
myOntology: Community-driven Vocabulary Design and Maintenance for E-Commerce
 

The Semantic Web – A Vision Come True, or Giving Up the Great Plan?

  • 1. The Semantic Web – A Vision Come True, or Giving Up the Great Plan? Martin Hepp, @mfhepp mfhepp@gmail.com
  • 2. Semantic Web: A Decade of Achievement? •  Linked Open Data Cloud •  Schema.org •  Google Knowledge Graph •  Bing Sartori •  Linked Data in Libraries •  Linked Data in Public Data Initiatives •  Etc. Semantic Web and Linked Data Success Stories http://www.heppresearch.com2
  • 3. The LOD Cloud A hard-wired, small-scale data integration project with no quality of service guarantees. http://www.heppresearch.com3 Linking Open Data cloud diagram 2014, by Max Schmachtenberg, Christian Bizer, Anja Jentzsch and Richard Cyganiak. http://lod-cloud.net/
  • 4. Web Data Commons A pretty outdated RDF representation of information extracted from a biased sample of popular Web pages, missing a lot of data in deep detail pages. http://www.heppresearch.com4 2015-04-02: RDFa, Microdata, and Microformat data sets extracted from the December 2014 Common Crawl corpus available for download.
  • 5. The Old Testament of the Semantic Web http://www.heppresearch.com5 Mostly WHAT a better Web should allow §  Computers should be able to help us process information from the Web
  • 6. The New Testament of the Semantic Web http://www.heppresearch.com6 Detailed technical assumptions about the HOW §  Widely driven by applying principles from small-scale, controlled settings to the Web. §  Need for extensions of old paradigms acknowledged. §  But fundamental question of match between paradigms and ecosystem largely unchallenged.
  • 7. The Modern Sects and their Cults http://www.heppresearch.com7 Turned assumptions and drafts into laws §  Linked Data Principles –  URIs over strings §  Entity identifiers §  Qualitative values (enumerations) –  Page vs. Entity / Conneg / Redirects –  Open Licenses §  SPARQL endpoints §  Reuse visible content in RDFa and Microdata Berner-Lee, Tim: Linked Data, http://www.w3.org/DesignIssues/LinkedData.html
  • 8. An now they fight a useless war over the details of their interpretation… http://www.heppresearch.com8 3rd Commandment: Thou shalt not make unto thee any graven image §  Exodus 20:4-6 §  Minimal ontological commitment, folks! §  Occam's razor §  Ludwig Wittgenstein: Tractatus Logico-Philosophicus: –  “Occam's Razor is, of course, not an arbitrary rule nor one justified by its practical success. It simply says that unnecessary elements in a symbolism mean nothing. Signs which serve one purpose are logically equivalent; signs which serve no purpose are logically meaningless.” (*) Image Credit: PD, https://en.wikipedia.org/?title=Crusades#/media/File:Albigensian_Crusade_01.jpg (*) Taken from https://en.wikipedia.org/wiki/Occam's_razor#Ludwig_Wittgenstein
  • 9. What is schema.org? What is GoodRelations? 1. Official Characterization 2. Purpose: §  Focus on information extraction on the Web §  Other uses as a by-product 3. Knowledge Representation Perspective §  Entity Types §  Relationship Types §  Weak Domain / Range Semantics §  Syntax-independent Meta-Model And how are they related? Questions? Suggestions? Contact me at @mfhepp!9
  • 10. Official Characterization from http://schema.org Questions? Suggestions? Contact me at @mfhepp!10 This site provides a collection of schemas that webmasters can use to markup HTML pages in ways recognized by major search providers, and that can also be used for structured data interoperability (e.g. in JSON). Search engines including Bing, Google, Yahoo! and Yandex rely on this markup to improve the display of search results, making it easier for people to find the right Web pages. Many sites are generated from structured data, which is often stored in databases. When this data is formatted into HTML, it becomes very difficult to recover the original structured data. Many applications, especially search engines, can benefit greatly from direct access to this structured data. On-page markup enables search engines to understand the information on web pages and provide richer search results in order to make it easier for users to find relevant information on the web. Markup can also enable new tools and applications that make use of the structure.
  • 11. Overview and Motivation: There is REAL Momentum Questions? Suggestions? Contact me at @mfhepp!11 A lot of data §  Since 2011, schema.org has been added to >25% of top-ranked e- commerce sites product detail pages. §  RDF-based representations are specified. Table: Random sample of n=73 product detail pages from high-ranking Google results. Note that these numbers have a strong bias towards popular, professionally operated sites.
  • 12. Schema.org: A Data Publication Ontology Questions? Suggestions? Contact me at @mfhepp!12 Not designed for raw data consumption (only as a by-product) §  Historically, ontologies in computer science aimed at harmonizing the conceptualization and representation of data for publishers and consumers of the data. §  Implicit goal of the traditional Semantic Web stack: More or less, consumption of raw data. §  This requires detailed consensus on the level of data granularity and data semantics at scale, and high data quality. §  Schema.org does not make this assumption, since its sponsors have the power to work on semi-structured data at Web scale. Ontologyschema.org
  • 13. Schema.org: The Semantic Web Vision Come True? 1. No OWL. Not even an ontology in the narrow sense. 2. Direct consumption difficult §  Crawling §  Cleansing §  Lifting 3. No broad use of Linked Data principles §  Mostly no global entity identifiers §  Page = Entity (vs. httpRange-14) §  No vocabulary reuse (*) Likely not what the Semantic Web community had hoped for. Questions? Suggestions? Contact me at @mfhepp!13
  • 14. Web Ontology Engineering Patterns 1. Dynamic Degree of Disambiguation 2. Dynamic Data Granularity 3. Sweet Spots Rule §  Distinctions that can be populated reliably and with little effort §  Distinctions that are hard to reconstruct by the recipient Hepp (2015, forthcoming) http://www.heppresearch.com14
  • 15. The Fallacy of Raw Consumption of Web Data http://www.heppresearch.com15 Naïve Type Membership Interpretation: SPARQL # Find former STI members who are professors PREFIX dbpedia-owl: <http://dbpedia.org/ontology/> SELECT * {?s a dbpedia-owl:Professor} LIMIT 100
  • 16. Naïve Type Membership Interpretation: SPARQL http://www.heppresearch.com17 Find all professors from Web markup <html prefix="schema: http://schema.org/! dbpedia: http://dbpedia.org/ontology/">! <!-- .. -->! <div typeOf="schema:Person dbpedia:Professor" about="#person">! <span property="schema:honorificPrefix">Prof. Dr.</span>&nbsp;! <span property="schema:givenName">Zaphod</span>! <span property="schema:familyname">Beeblebrox</span>! </div>! </html>
  • 17. Type Membership as a Machine Learning Problem http://www.heppresearch.com18 Supervised Learning: Logistic Regression §  Input: –  Entity e –  Type t –  Origin (Graph / Domain / URI) o –  Optional: Properties and property values [(p1,v1), (p2,v2),…] §  Output –  t’(e) = f(e, t, o) –  p(t(e) == True) Example data: (http://www.acme.org/, …#person, http://schema.org/EducationEvent) (http://munich.eventful.com/, …#event1, http://schema.org/MusicEvent) Hepp (2015b, forthcoming)
  • 18. Let’s Do Science, not Cult! http://www.heppresearch.com19 §  Challenge paradigms and approaches §  Use hard data, not beliefs and assumptions (neither your own ones nor the ones inherited from the old folks) CC BY-SA 3.0 / Nicor / https://en.wikipedia.org/wiki/North_Korea's_cult_of_personality#/ media/File:Mansudae_Grand_Monument_08.JPG
  • 19. Thank you. http://www.heppresearch.com20 HEPP RESEARCH GmbH Prof. Dr. Martin Hepp, CEO Contact us! Kuppelnaustrasse 5 88212 Ravensburg, Germany Phone +49 751 2708 5256-0 Fax +49 751 2708 5256-9 www.heppresearch.com contact@heppresearch.com