SlideShare uma empresa Scribd logo
1 de 46
Tetherless World Constellation
Wither OWL
Jim Hendler
@jahendler
Tetherless World Professor of Computer, Web and Cognitive Science
Director, Rensselaer Institute for Data Exploration and Applications
Rensselaer Polytechnic Institute
Tetherless World Constellation
About the title
• Whither:
– To what place or state
• Wither:
1.Become dry and shriveled
2.To lose the freshness of youth, as from
age
Tetherless World Constellation
Semantic Web and its contribution
Slide ca. 2001. Were we right?
Tetherless World Constellation
Were we right?
Sort of? Mostly?
Tetherless World Constellation
Example: Semantic Search
IEEE Computer, Jan 2010)
Tetherless World Constellation
Contenders ca. 2010
Tetherless World Constellation
Contenders ca. 2014
Google Sem Webbers include: R. Guha, Dan Brickley, Denny Vrandečić
Natasha Noy, Chris Welty, …
Tetherless World Constellation
Google 2014
Google finds embedded metadata on >20% of its crawl – Guha, 2014
Tetherless World Constellation
Other success stories
Facebook: 2011 Oracle: 2012
Tetherless World Constellation
Some others
Tetherless World Constellation
Watson used Semantic Web
IBM
Tetherless World Constellation
All cool, but…
• Which of these use OWL in any
significant way?
(This part of the slide intentionally left blank)
Tetherless World Constellation
Why not OWL?
• This is NOT to say OWL isn’t being used
– but it’s not much by comparison
– but it’s not much on the Web
• with the possible exception of the misuse of
owl:sameAs, but let’s not go there…
• Semantic markup on the Web exceeds
anything we predicted in 2001…
– … but OWL use on the Web as a proportion of
this lags behind the expectations many of us
had
• The rest of this talk is speculation as to why…
Tetherless World Constellation
What I used to think the problem is
Ontology: the OWL DL view
• Ontology as Barad-
Dur (Sauron's
tower):
– Extremely powerful!
– Patrolled by Orcs
• Let one little hobbit
in, and the whole
thing could come
crashing down
inconsistency
Decidable Logic basis
ontology: the RDFS view
• ontology and the
tower of Babel
– We will build a tower
to reach the sky
– We only need a little
ontological
agreement
• Who cares if we all
speak different
languages?
Genesis 11:7 Let us go down, and there confound their
language, that they may not understand one another's speech.
So the Lord scattered them abroad from thence upon the face
of all the earth: and they left off to build the city.
ROI: Reasoning over
(Enterprise) data
• This "big O" Ontology finds use cases in
verticals and enterprises
– Where the vocabulary can be controlled
– Where finding things in the data is important
• Example
– Drug discovery from data
• Model the molecule (site, chemical properties, etc) as
faithfully and expressively as possible
• Use "Realization" to categorize data assets against the
ontology
– Bad or missed answers are money down the drain
BUT VERY EXPENSIVE
ROI: Web 3.0
• The "small o" ontology finds use cases in
Web Applications (at Web scales)
– A lot of data, a little semantics
– Finding anything in the mess can be a win!
• Example
– Declare simple inferable relationships and apply,
at scale, to large, heterogeneous data collections
• These are "heuristics" not every answer must be
right (qua Google)
– But remember time = money!
WHAT I USED TO BELIEVE
ROI: Web 3.0
• The "small o" ontology finds use cases in
Web Applications (at Web scales)
– A lot of data, a little semantics
– Finding anything in the mess can be a win!
• Example
– Declare simple inferable relationships and apply,
at scale, to large, heterogeneous data collections
• These are "heuristics" not every answer must be
right (qua Google)
– But remember time = money!
WHAT I NOW BELIEVE
Tetherless World Constellation
BUT, not quite the same way
• That explains why the uptake of
RDFS/SPARQL is going on
– But primarthings like schema.org
• But still doesn’t explain why OWL
isn’t used as much as it should be
– especially on the Web
– especially when the need for ontologies
is growing rapidly and many kinds of
“ontology like things” are being used
heavily
Tetherless World Constellation
What I am coming to believe
• My previous view was blaming OWL’s
problems on too much expressivity
– as opposed to RDFS (or really RDFS+)
• I now believe the problem is lack of
expressivity (in an interoperable
way)
– for tasks where people really need Web
semantics
Tetherless World Constellation
We were right:
The Web is increasingly about LINKING DATA
http://linkeddata.org/ cloud is a tiny fraction of what is out there
Tetherless World Constellation
Current linked data approaches are also broken!
• Data is converted into RDF and SPARQL’ed
– creates huge graph DBs less efficient than the
original DB
• Data is converted from DB into SPARQL
return on demand
– much better, but you must know the mapping
• owl:sameAs is (ab)used to map data to
data
– but that only lets you map equals – which is an
easy mapping to express in many ways
• defining equality right in a model theory is much
harder, and thus the abuse, but let’s leave that for
another talk
Tetherless World Constellation
DataData
MetaData
What we need
Linked Semantic Web “metadata” documents that can be used to link very large databases in
distributed data systems. This could lead to orders of magnitude reduction in information flow
for large-scale distributed data problems.
Tetherless World Constellation
What’s the problem
• We want a knowledge representation
that can do things like:
– help us find the right data for a problem
• The “Date” field in some DB could be lots of
different things
– consider “Database of 1957 NYC births” vs
“Database of 1957 NYC deaths”
– help us map between different
databases
• The problem isn’t primarily translating
ontologies to each other, it is tying the
ontologies to the data (for the mappings)
Tetherless World Constellation
Example: Mereology (and meronymy)
• OWL doesn’t have a part-whole relation
– left out of design because we couldn’t reach
consensus (and there’s 2000 yrs of argument behind
that)
• but also because most require transitive closure of
parts in many cases and that had complexity
issues
– but one of the most
used relations in
the gene ontology
and many medical
ontologies is
part of
Tetherless World Constellation
But, what? Wait!
Example: We use the fact that the brain has 5 lobes as
a driving example for qualified cardinality
- but to say that in OWL, you need to INVENT the
“hasdirectpart” relation (i.e. no interoperability on the
important part for the data!)
Tetherless World Constellation
Database mapping
• Many things in database schemas
also map to parts/wholes
– Who is in what organization?
– What components comprise an
assembly?
– Where did something occur (that was
part of another event)?
– When…
Tetherless World Constellation
When?
• Temporal reasoning is missing from
OWL
– Knowing [this talk occurred during
“ESWC 2016” AND “ESWC 2016”
occurred in May of 2016, THEREFORE
this talk occurred in May of 2016] would
be huge
• Whole books of temporal logics out there
– picking is hard
– but it is also what OWL has to do to be a standard
for this kind of mapping
Tetherless World Constellation
Geophysical reasoning
• Add math to part-wholes
– OY!
– but GDBs are widely used for mappings
of information in big databases
• especially in science (more OWL use cases)
• Talking about “math”
– lots of discussion of adding probability
to OWL
• and someone should some day
– but what about
Tetherless World Constellation
SIMPLE relationship reasoning
• Discovery Informatics and data mining
– Huge industries, and growing
– Web data, Internet of things, data
interoperability (the startup holy grails)
• Example: supposing some scientific theory
“implies” that if X increases then Y should
also increase
– Which databases would help confirm my
theory? Which would argue against it?
– Easy to check in many new database systems
– How do we express that aspect of the theory in
OWL?
Tetherless World Constellation
Digression…
(and shameless self promotion)
• That will be the topic of my keynote
at IJCAI 2016 entitled “Knowledge
Representation in the Era of Deep
Learning, Watson and the Semantic
Web”
– I’d welcome your thoughts and good
ideas I next couple of days!
Tetherless World Constellation
Back to our regularly scheduled talk:
Procedural Attachment?
• In fact, what about procedural
attachment?
– lots of literature on preserving
completeness w/respect to procedures
• used in prolog, etc.
– Why isn’t this in OWL?
• patent issues w/respect to standard
• general concern about whether procedural
attachment was possible in OWL DL
framework
Tetherless World Constellation
Procedural Attachment
No longer a patent issue …
35
Exploring knowledge graphs
• Agent reasons about a network of connected entities and chooses the
best next one to discuss based on a combination of factors
– Consistency, Novelty and Ontological relationships
36
Integrating
• Automate Agent Creation combining:
Information Extraction: Using anew
"living information extraction" technique, we
will be able to create a "never-ending
extractor" which will be pulling from web
documents information about entities and
events, and the relationships between
them.
The new system can work in a dynamic
node, and does not need human annotated
samples for training, but it works best if
there are a number of known relationships
between pages to build off of.
info extraction uses ontologies
37
Integrate
• Automate Agent Creation combining:
Information Extraction: Using anew
"living information extraction" technique, we
will be able to create a "never-ending
extractor" which will be pulling from web
documents information about entities and
events, and the relationships between
them.
The new system can work in a dynamic
node, and does not need human annotated
samples for training, but it works best if
there are a number of known relationships
between pages to build off of.
Semantic Web: The Semantic Web
provides a number of known relationships
between pages on the Web in a number of
domains. Using general knowledge
sources, like dbpedia and Yago, and
specialized knowledge sources, like the
data from musicbrainz, the reviews from
Yelp (which have semantic annotations)
and even the Open Graph of Facebook
(which is available in a semantic web
format), provides a jumpstart for the
language extraction.
However, the Semantic Web relates pages,
but doesn't have any sort of
"understanding" of what is on the pages.
ontologies
again!
38
Our Approach
• Automate Agent Creation combining:
Information Extraction: Using anew
"living information extraction" technique, we
will be able to create a "never-ending
extractor" which will be pulling from web
documents information about entities and
events, and the relationships between
them.
The new system can work in a dynamic
node, and does not need human annotated
samples for training, but it works best if
there are a number of known relationships
between pages to build off of.
Semantic Web: The Semantic Web
provides a number of known relationships
between pages on the Web in a number of
domains. Using general knowledge
sources, like dbpedia and Yago, and
specialized knowledge sources, like the
data from musicbrainz, the reviews from
Yelp (which have semantic annotations)
and even the Open Graph of Facebook
(which is available in a semantic web
format), provides a jumpstart for the
language extraction.
However, the Semantic Web relates pages,
but doesn't have any sort of
"understanding" of what is on the pages.
Cognitive Computing : Cognitive
Computing, can allow us to have a better
way of accessing information about the
entities found on the Web and finding other
information about the same entities using
various kinds of search and language
heuristics. This allows us to have more
organized information, rapidly generated,
about the entities being explored.
However, given a large graph of entities
(even the organized linked-open data cloud
has information about billions of things),
how do we choose what to display next? If
the best we can do is provide links, all of
the above isn't much better than choosing
a page and clicking from there.
and here!
39
Our Approach
• Automate Agent Creation combining:
Information Extraction: Using anew
"living information extraction" technique, we
will be able to create a "never-ending
extractor" which will be pulling from web
documents information about entities and
events, and the relationships between
them.
The new system can work in a dynamic
node, and does not need human annotated
samples for training, but it works best if
there are a number of known relationships
between pages to build off of.
Semantic Web: The Semantic Web
provides a number of known relationships
between pages on the Web in a number of
domains. Using general knowledge
sources, like dbpedia and Yago, and
specialized knowledge sources, like the
data from musicbrainz, the reviews from
Yelp (which have semantic annotations)
and even the Open Graph of Facebook
(which is available in a semantic web
format), provides a jumpstart for the
language extraction.
However, the Semantic Web relates pages,
but doesn't have any sort of
"understanding" of what is on the pages.
Cognitive Story-telling Technology: Interactive
storytelling techniques are being explored
to take information in the kind of
"knowledge graph" resulting from the
above, and tailoring the presentation to a
user using storytelling techniques. It is
aimed at presenting the information as an
interesting and meaningful story by taking
into consideration a combination of factors
ranging from topic consistency and novelty,
to learned user interests and even a user’s
emotional reactions. The system can
essentially determine "where to go next"
and what to do there in the organized
information as processed above..
“User models” (which we are encoding using an ontology)
Tetherless World Constellation
I could go on
• There are many other such examples
– Describing how data in one set could be joined
to data in another is incredibly powerful,
timely, and important
• it’s just not really what people have typically used
traditional reasoners for
– Providing the ontological glue among different
AI technologies: Priceless
• Why aren’t we doing more to understand
these issues and bring into the OWL
“family?”
– de facto standardization leads to adoption
Tetherless World Constellation
My challenge
• Is the problem that we cannot have
these powerful things in a decidable
(sound and complete) language?
– then maybe we have to give up
decidability?
• or even better, define new maths of
expressivity that have different kinds of
“sound and complete” behavior
– i.e. approximately sound and complete
» (is modern computational theory weakened by
“within epsilon” optimality?)
– i.e. “anytime” algorithms for reasoners
» (conjecture) sound and complete at infinity
Tetherless World Constellation
Challenge
• We may need to rethink how we are
defining ontologies with respect to
the necessary properties for use
on the
Web
Tetherless World Constellation
I’m not “anti-formalism”
• A sufficient formalism for Semantic Web
applications must
– Provide a model that accounts for linked data
• What is the equivalent of a DB calculus?
– Provide a means for evaluating different kinds
of completeness for reasoners
• In practice we must be able to model A-box effects
as formally as T-box technologies
– example Weaver 2012 showed what restrictions were needed
to maximize parallelism of some OWL subsets
– Think about other processors than formal
reasoners that will use the ontologies
• ontologies used in many other ways
– i.e. why does an IE system w/F1=.84 need a DL reasoner?
Tetherless World Constellation
Summary
• The growing world of (semantic web,
linked) data needs ontologies more than
ever
– OWL has some of the important things
– But is missing many of the really important
things
• The problem isn’t (formal) expressivity
– it’s the need to express other things (esp
relationships between properties, events, etc.)
– We need more research into how to formalize
these kinds of relations
Tetherless World Constellation
Acknowledgments
• I get funded by lots of folks – this talk may or
may not represent anything anyone there
believes:
– ARL, DARPA, Microsoft, Mitre, Optum Laboratories
• The Rensselaer Institute for Data Exploration and
Application pays my salary
– Thanks!!
• Many of my best ideas, including a lot in this talk
come from listening to smart people
– Frank van Harmelen and Peter Mika have influenced my
thinking about many of the ideas in this talk
• The students and my colleagues at the RPI
Tetherless World Constellation (tw.rpi.edu)
Tetherless World Constellation
ANY QUESTIONS?

Mais conteúdo relacionado

Mais procurados

Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
Caserta
 
In-memory Caching in HDFS: Lower Latency, Same Great Taste
In-memory Caching in HDFS: Lower Latency, Same Great TasteIn-memory Caching in HDFS: Lower Latency, Same Great Taste
In-memory Caching in HDFS: Lower Latency, Same Great Taste
DataWorks Summit
 
온톨로지 개념 및 표현언어
온톨로지 개념 및 표현언어온톨로지 개념 및 표현언어
온톨로지 개념 및 표현언어
Dongbum Kim
 
Managing Nonimmune hydrops fetalis
  Managing Nonimmune hydrops fetalis   Managing Nonimmune hydrops fetalis
Managing Nonimmune hydrops fetalis
Vidya Thobbi
 

Mais procurados (20)

White Paper: EMC Greenplum Data Computing Appliance Enhances EMC IT's Global ...
White Paper: EMC Greenplum Data Computing Appliance Enhances EMC IT's Global ...White Paper: EMC Greenplum Data Computing Appliance Enhances EMC IT's Global ...
White Paper: EMC Greenplum Data Computing Appliance Enhances EMC IT's Global ...
 
Oracle Cloud Infrastructure – Compute
Oracle Cloud Infrastructure – ComputeOracle Cloud Infrastructure – Compute
Oracle Cloud Infrastructure – Compute
 
FIBO in Neo4j: Applying Knowledge Graphs in the Financial Industry
FIBO in Neo4j: Applying Knowledge Graphs in the Financial IndustryFIBO in Neo4j: Applying Knowledge Graphs in the Financial Industry
FIBO in Neo4j: Applying Knowledge Graphs in the Financial Industry
 
Introduction to PolyBase
Introduction to PolyBaseIntroduction to PolyBase
Introduction to PolyBase
 
Janus RTP forwarders @ FOSDEM 2020
Janus RTP forwarders @ FOSDEM 2020Janus RTP forwarders @ FOSDEM 2020
Janus RTP forwarders @ FOSDEM 2020
 
Designing Scalable Data Warehouse Using MySQL
Designing Scalable Data Warehouse Using MySQLDesigning Scalable Data Warehouse Using MySQL
Designing Scalable Data Warehouse Using MySQL
 
Drag and Drop Open Source GeoTools ETL with Apache NiFi
Drag and Drop Open Source GeoTools ETL with Apache NiFiDrag and Drop Open Source GeoTools ETL with Apache NiFi
Drag and Drop Open Source GeoTools ETL with Apache NiFi
 
Near real-time anomaly detection at Lyft
Near real-time anomaly detection at LyftNear real-time anomaly detection at Lyft
Near real-time anomaly detection at Lyft
 
Oracle Management Cloud, OMC architecture
Oracle Management Cloud, OMC architecture Oracle Management Cloud, OMC architecture
Oracle Management Cloud, OMC architecture
 
Kettle – Etl Tool
Kettle – Etl ToolKettle – Etl Tool
Kettle – Etl Tool
 
Python 생태계의 이해
Python 생태계의 이해Python 생태계의 이해
Python 생태계의 이해
 
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
 
In-memory Caching in HDFS: Lower Latency, Same Great Taste
In-memory Caching in HDFS: Lower Latency, Same Great TasteIn-memory Caching in HDFS: Lower Latency, Same Great Taste
In-memory Caching in HDFS: Lower Latency, Same Great Taste
 
GeoMesa on Apache Spark SQL with Anthony Fox
GeoMesa on Apache Spark SQL with Anthony FoxGeoMesa on Apache Spark SQL with Anthony Fox
GeoMesa on Apache Spark SQL with Anthony Fox
 
온톨로지 개념 및 표현언어
온톨로지 개념 및 표현언어온톨로지 개념 및 표현언어
온톨로지 개념 및 표현언어
 
Introduction to Apache NiFi 1.11.4
Introduction to Apache NiFi 1.11.4Introduction to Apache NiFi 1.11.4
Introduction to Apache NiFi 1.11.4
 
Managing Nonimmune hydrops fetalis
  Managing Nonimmune hydrops fetalis   Managing Nonimmune hydrops fetalis
Managing Nonimmune hydrops fetalis
 
Oracle Endeca 101 Developer Introduction High Level Overview
Oracle Endeca 101 Developer Introduction High Level OverviewOracle Endeca 101 Developer Introduction High Level Overview
Oracle Endeca 101 Developer Introduction High Level Overview
 
JSON-LD and MongoDB
JSON-LD and MongoDBJSON-LD and MongoDB
JSON-LD and MongoDB
 

Destaque

Modular Ontologies - A Formal Investigation of Semantics and Expressivity
Modular Ontologies - A Formal Investigation of Semantics and ExpressivityModular Ontologies - A Formal Investigation of Semantics and Expressivity
Modular Ontologies - A Formal Investigation of Semantics and Expressivity
Jie Bao
 

Destaque (20)

Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
 
Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)
 
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
 
On Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the WebOn Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the Web
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)
 
Watson: An Academic's Perspective
Watson: An Academic's PerspectiveWatson: An Academic's Perspective
Watson: An Academic's Perspective
 
Modular Ontologies - A Formal Investigation of Semantics and Expressivity
Modular Ontologies - A Formal Investigation of Semantics and ExpressivityModular Ontologies - A Formal Investigation of Semantics and Expressivity
Modular Ontologies - A Formal Investigation of Semantics and Expressivity
 
Deliverable_5.1.2
Deliverable_5.1.2Deliverable_5.1.2
Deliverable_5.1.2
 
ABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
ABSTAT: Ontology-driven Linked Data Summaries with Pattern MinimalizationABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
ABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
 
The General Ontology Evaluation Framework (GOEF) & the I-Choose Use Case A ...
The General Ontology Evaluation Framework (GOEF) & the I-Choose Use CaseA ...The General Ontology Evaluation Framework (GOEF) & the I-Choose Use CaseA ...
The General Ontology Evaluation Framework (GOEF) & the I-Choose Use Case A ...
 
IAS 16 Ontology Dojo
IAS 16 Ontology DojoIAS 16 Ontology Dojo
IAS 16 Ontology Dojo
 
OWL Web Ontology Language Overview
OWL Web Ontology Language OverviewOWL Web Ontology Language Overview
OWL Web Ontology Language Overview
 
Schema.org: Where did that come from!
Schema.org: Where did that come from!Schema.org: Where did that come from!
Schema.org: Where did that come from!
 
Semantic Web: The Inside Story
Semantic Web: The Inside StorySemantic Web: The Inside Story
Semantic Web: The Inside Story
 
cyclades eswc2016
cyclades eswc2016cyclades eswc2016
cyclades eswc2016
 
VOLT - ESWC 2016
VOLT - ESWC 2016VOLT - ESWC 2016
VOLT - ESWC 2016
 
Human-Aware Sensor Network Ontology: Semantic Support for Empirical Data Coll...
Human-Aware Sensor Network Ontology: Semantic Support for Empirical Data Coll...Human-Aware Sensor Network Ontology: Semantic Support for Empirical Data Coll...
Human-Aware Sensor Network Ontology: Semantic Support for Empirical Data Coll...
 
Assessing the performance of RDF Engines: Discussing RDF Benchmarks
Assessing the performance of RDF Engines: Discussing RDF Benchmarks Assessing the performance of RDF Engines: Discussing RDF Benchmarks
Assessing the performance of RDF Engines: Discussing RDF Benchmarks
 
Contextual Data Collection for Smart Cities
Contextual Data Collection for Smart CitiesContextual Data Collection for Smart Cities
Contextual Data Collection for Smart Cities
 
Semantically enhanced quality assurance in the jurion business use case
Semantically enhanced quality assurance in the jurion  business use caseSemantically enhanced quality assurance in the jurion  business use case
Semantically enhanced quality assurance in the jurion business use case
 

Semelhante a Wither OWL

MARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesMARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archives
Dorothea Salo
 
20111120 warsaw learning curve by b hyland notes
20111120 warsaw   learning curve by b hyland notes20111120 warsaw   learning curve by b hyland notes
20111120 warsaw learning curve by b hyland notes
Bernadette Hyland-Wood
 
INSPIRE Hackathon Webinar Intro to Linked Data and Semantics
INSPIRE Hackathon Webinar   Intro to Linked Data and SemanticsINSPIRE Hackathon Webinar   Intro to Linked Data and Semantics
INSPIRE Hackathon Webinar Intro to Linked Data and Semantics
plan4all
 
Choices, modelling and Frankenstein Ontologies
Choices, modelling and Frankenstein OntologiesChoices, modelling and Frankenstein Ontologies
Choices, modelling and Frankenstein Ontologies
benosteen
 

Semelhante a Wither OWL (20)

The Semantic Web: 2010 Update
The Semantic Web: 2010 UpdateThe Semantic Web: 2010 Update
The Semantic Web: 2010 Update
 
The Semantic Web: 2010 Update
The Semantic Web: 2010 Update The Semantic Web: 2010 Update
The Semantic Web: 2010 Update
 
Semantic Web: "ten year" update
Semantic Web: "ten year" updateSemantic Web: "ten year" update
Semantic Web: "ten year" update
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of Metadata
 
Library Linked Data
Library Linked DataLibrary Linked Data
Library Linked Data
 
Data Big and Broad (Oxford, 2012)
Data Big and Broad (Oxford, 2012)Data Big and Broad (Oxford, 2012)
Data Big and Broad (Oxford, 2012)
 
MARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesMARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archives
 
Digital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AIDigital Archiving, The Semantic Web, and Modern AI
Digital Archiving, The Semantic Web, and Modern AI
 
Linking American Art to the Cloud
Linking American Art to the CloudLinking American Art to the Cloud
Linking American Art to the Cloud
 
Semantic Web: introduction & overview
Semantic Web: introduction & overviewSemantic Web: introduction & overview
Semantic Web: introduction & overview
 
The importance of the Web for the Semantic Web
The importance of the Web for the Semantic WebThe importance of the Web for the Semantic Web
The importance of the Web for the Semantic Web
 
KR in the age of Deep Learning
KR in the age of Deep LearningKR in the age of Deep Learning
KR in the age of Deep Learning
 
Core Methods In Educational Data Mining
Core Methods In Educational Data MiningCore Methods In Educational Data Mining
Core Methods In Educational Data Mining
 
20111120 warsaw learning curve by b hyland notes
20111120 warsaw   learning curve by b hyland notes20111120 warsaw   learning curve by b hyland notes
20111120 warsaw learning curve by b hyland notes
 
INSPIRE Hackathon Webinar Intro to Linked Data and Semantics
INSPIRE Hackathon Webinar   Intro to Linked Data and SemanticsINSPIRE Hackathon Webinar   Intro to Linked Data and Semantics
INSPIRE Hackathon Webinar Intro to Linked Data and Semantics
 
Choices, modelling and Frankenstein Ontologies
Choices, modelling and Frankenstein OntologiesChoices, modelling and Frankenstein Ontologies
Choices, modelling and Frankenstein Ontologies
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
 
Open IE tutorial 2018
Open IE tutorial 2018Open IE tutorial 2018
Open IE tutorial 2018
 
Nosql public
Nosql publicNosql public
Nosql public
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP Systems
 

Mais de James Hendler

Mais de James Hendler (17)

Knowing what AI Systems Don't know and Why it matters
Knowing what AI  Systems Don't know and Why it mattersKnowing what AI  Systems Don't know and Why it matters
Knowing what AI Systems Don't know and Why it matters
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
 
Knowledge Graph Semantics/Interoperability
Knowledge Graph Semantics/InteroperabilityKnowledge Graph Semantics/Interoperability
Knowledge Graph Semantics/Interoperability
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide Web
 
Enhancing Precision Wellness with Personal Health Knowledge Graphs
Enhancing Precision Wellness with Personal Health Knowledge Graphs Enhancing Precision Wellness with Personal Health Knowledge Graphs
Enhancing Precision Wellness with Personal Health Knowledge Graphs
 
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
The Future of AI: Going BeyondDeep Learning, Watson, and the Semantic WebThe Future of AI: Going BeyondDeep Learning, Watson, and the Semantic Web
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
 
Capacity Building: Data Science in the University At Rensselaer Polytechnic ...
Capacity Building: Data Science in the University  At Rensselaer Polytechnic ...Capacity Building: Data Science in the University  At Rensselaer Polytechnic ...
Capacity Building: Data Science in the University At Rensselaer Polytechnic ...
 
Enhancing Precision Wellness with Knowledge Graphs and Semantic Analytics: O...
Enhancing Precision Wellness with  Knowledge Graphs and Semantic Analytics: O...Enhancing Precision Wellness with  Knowledge Graphs and Semantic Analytics: O...
Enhancing Precision Wellness with Knowledge Graphs and Semantic Analytics: O...
 
Artificial Intelligence: Existential Threat or Our Best Hope for the Future?
Artificial Intelligence: Existential Threat or Our Best Hope for the Future?Artificial Intelligence: Existential Threat or Our Best Hope for the Future?
Artificial Intelligence: Existential Threat or Our Best Hope for the Future?
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science
 
Facilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic MarkupFacilitating Web Science Collaboration through Semantic Markup
Facilitating Web Science Collaboration through Semantic Markup
 
Big Data and Computer Science Education
Big Data and Computer Science EducationBig Data and Computer Science Education
Big Data and Computer Science Education
 
Why Watson Won: A cognitive perspective
Why Watson Won: A cognitive perspectiveWhy Watson Won: A cognitive perspective
Why Watson Won: A cognitive perspective
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration
 
Watson at RPI - Summer 2013
Watson at RPI - Summer 2013Watson at RPI - Summer 2013
Watson at RPI - Summer 2013
 
The Semantic Web: It's for Real
The Semantic Web: It's for RealThe Semantic Web: It's for Real
The Semantic Web: It's for Real
 
Future of the World WIde Web (India)
Future of the World WIde Web (India)Future of the World WIde Web (India)
Future of the World WIde Web (India)
 

Último

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

Wither OWL

  • 1. Tetherless World Constellation Wither OWL Jim Hendler @jahendler Tetherless World Professor of Computer, Web and Cognitive Science Director, Rensselaer Institute for Data Exploration and Applications Rensselaer Polytechnic Institute
  • 2. Tetherless World Constellation About the title • Whither: – To what place or state • Wither: 1.Become dry and shriveled 2.To lose the freshness of youth, as from age
  • 3. Tetherless World Constellation Semantic Web and its contribution Slide ca. 2001. Were we right?
  • 4. Tetherless World Constellation Were we right? Sort of? Mostly?
  • 5. Tetherless World Constellation Example: Semantic Search IEEE Computer, Jan 2010)
  • 7. Tetherless World Constellation Contenders ca. 2014 Google Sem Webbers include: R. Guha, Dan Brickley, Denny Vrandečić Natasha Noy, Chris Welty, …
  • 8. Tetherless World Constellation Google 2014 Google finds embedded metadata on >20% of its crawl – Guha, 2014
  • 9. Tetherless World Constellation Other success stories Facebook: 2011 Oracle: 2012
  • 11. Tetherless World Constellation Watson used Semantic Web IBM
  • 12. Tetherless World Constellation All cool, but… • Which of these use OWL in any significant way? (This part of the slide intentionally left blank)
  • 13. Tetherless World Constellation Why not OWL? • This is NOT to say OWL isn’t being used – but it’s not much by comparison – but it’s not much on the Web • with the possible exception of the misuse of owl:sameAs, but let’s not go there… • Semantic markup on the Web exceeds anything we predicted in 2001… – … but OWL use on the Web as a proportion of this lags behind the expectations many of us had • The rest of this talk is speculation as to why…
  • 14. Tetherless World Constellation What I used to think the problem is
  • 15. Ontology: the OWL DL view • Ontology as Barad- Dur (Sauron's tower): – Extremely powerful! – Patrolled by Orcs • Let one little hobbit in, and the whole thing could come crashing down inconsistency Decidable Logic basis
  • 16. ontology: the RDFS view • ontology and the tower of Babel – We will build a tower to reach the sky – We only need a little ontological agreement • Who cares if we all speak different languages? Genesis 11:7 Let us go down, and there confound their language, that they may not understand one another's speech. So the Lord scattered them abroad from thence upon the face of all the earth: and they left off to build the city.
  • 17. ROI: Reasoning over (Enterprise) data • This "big O" Ontology finds use cases in verticals and enterprises – Where the vocabulary can be controlled – Where finding things in the data is important • Example – Drug discovery from data • Model the molecule (site, chemical properties, etc) as faithfully and expressively as possible • Use "Realization" to categorize data assets against the ontology – Bad or missed answers are money down the drain BUT VERY EXPENSIVE
  • 18. ROI: Web 3.0 • The "small o" ontology finds use cases in Web Applications (at Web scales) – A lot of data, a little semantics – Finding anything in the mess can be a win! • Example – Declare simple inferable relationships and apply, at scale, to large, heterogeneous data collections • These are "heuristics" not every answer must be right (qua Google) – But remember time = money! WHAT I USED TO BELIEVE
  • 19. ROI: Web 3.0 • The "small o" ontology finds use cases in Web Applications (at Web scales) – A lot of data, a little semantics – Finding anything in the mess can be a win! • Example – Declare simple inferable relationships and apply, at scale, to large, heterogeneous data collections • These are "heuristics" not every answer must be right (qua Google) – But remember time = money! WHAT I NOW BELIEVE
  • 20. Tetherless World Constellation BUT, not quite the same way • That explains why the uptake of RDFS/SPARQL is going on – But primarthings like schema.org • But still doesn’t explain why OWL isn’t used as much as it should be – especially on the Web – especially when the need for ontologies is growing rapidly and many kinds of “ontology like things” are being used heavily
  • 21. Tetherless World Constellation What I am coming to believe • My previous view was blaming OWL’s problems on too much expressivity – as opposed to RDFS (or really RDFS+) • I now believe the problem is lack of expressivity (in an interoperable way) – for tasks where people really need Web semantics
  • 22. Tetherless World Constellation We were right: The Web is increasingly about LINKING DATA http://linkeddata.org/ cloud is a tiny fraction of what is out there
  • 23. Tetherless World Constellation Current linked data approaches are also broken! • Data is converted into RDF and SPARQL’ed – creates huge graph DBs less efficient than the original DB • Data is converted from DB into SPARQL return on demand – much better, but you must know the mapping • owl:sameAs is (ab)used to map data to data – but that only lets you map equals – which is an easy mapping to express in many ways • defining equality right in a model theory is much harder, and thus the abuse, but let’s leave that for another talk
  • 24. Tetherless World Constellation DataData MetaData What we need Linked Semantic Web “metadata” documents that can be used to link very large databases in distributed data systems. This could lead to orders of magnitude reduction in information flow for large-scale distributed data problems.
  • 25. Tetherless World Constellation What’s the problem • We want a knowledge representation that can do things like: – help us find the right data for a problem • The “Date” field in some DB could be lots of different things – consider “Database of 1957 NYC births” vs “Database of 1957 NYC deaths” – help us map between different databases • The problem isn’t primarily translating ontologies to each other, it is tying the ontologies to the data (for the mappings)
  • 26. Tetherless World Constellation Example: Mereology (and meronymy) • OWL doesn’t have a part-whole relation – left out of design because we couldn’t reach consensus (and there’s 2000 yrs of argument behind that) • but also because most require transitive closure of parts in many cases and that had complexity issues – but one of the most used relations in the gene ontology and many medical ontologies is part of
  • 27. Tetherless World Constellation But, what? Wait! Example: We use the fact that the brain has 5 lobes as a driving example for qualified cardinality - but to say that in OWL, you need to INVENT the “hasdirectpart” relation (i.e. no interoperability on the important part for the data!)
  • 28. Tetherless World Constellation Database mapping • Many things in database schemas also map to parts/wholes – Who is in what organization? – What components comprise an assembly? – Where did something occur (that was part of another event)? – When…
  • 29. Tetherless World Constellation When? • Temporal reasoning is missing from OWL – Knowing [this talk occurred during “ESWC 2016” AND “ESWC 2016” occurred in May of 2016, THEREFORE this talk occurred in May of 2016] would be huge • Whole books of temporal logics out there – picking is hard – but it is also what OWL has to do to be a standard for this kind of mapping
  • 30. Tetherless World Constellation Geophysical reasoning • Add math to part-wholes – OY! – but GDBs are widely used for mappings of information in big databases • especially in science (more OWL use cases) • Talking about “math” – lots of discussion of adding probability to OWL • and someone should some day – but what about
  • 31. Tetherless World Constellation SIMPLE relationship reasoning • Discovery Informatics and data mining – Huge industries, and growing – Web data, Internet of things, data interoperability (the startup holy grails) • Example: supposing some scientific theory “implies” that if X increases then Y should also increase – Which databases would help confirm my theory? Which would argue against it? – Easy to check in many new database systems – How do we express that aspect of the theory in OWL?
  • 32. Tetherless World Constellation Digression… (and shameless self promotion) • That will be the topic of my keynote at IJCAI 2016 entitled “Knowledge Representation in the Era of Deep Learning, Watson and the Semantic Web” – I’d welcome your thoughts and good ideas I next couple of days!
  • 33. Tetherless World Constellation Back to our regularly scheduled talk: Procedural Attachment? • In fact, what about procedural attachment? – lots of literature on preserving completeness w/respect to procedures • used in prolog, etc. – Why isn’t this in OWL? • patent issues w/respect to standard • general concern about whether procedural attachment was possible in OWL DL framework
  • 34. Tetherless World Constellation Procedural Attachment No longer a patent issue …
  • 35. 35 Exploring knowledge graphs • Agent reasons about a network of connected entities and chooses the best next one to discuss based on a combination of factors – Consistency, Novelty and Ontological relationships
  • 36. 36 Integrating • Automate Agent Creation combining: Information Extraction: Using anew "living information extraction" technique, we will be able to create a "never-ending extractor" which will be pulling from web documents information about entities and events, and the relationships between them. The new system can work in a dynamic node, and does not need human annotated samples for training, but it works best if there are a number of known relationships between pages to build off of. info extraction uses ontologies
  • 37. 37 Integrate • Automate Agent Creation combining: Information Extraction: Using anew "living information extraction" technique, we will be able to create a "never-ending extractor" which will be pulling from web documents information about entities and events, and the relationships between them. The new system can work in a dynamic node, and does not need human annotated samples for training, but it works best if there are a number of known relationships between pages to build off of. Semantic Web: The Semantic Web provides a number of known relationships between pages on the Web in a number of domains. Using general knowledge sources, like dbpedia and Yago, and specialized knowledge sources, like the data from musicbrainz, the reviews from Yelp (which have semantic annotations) and even the Open Graph of Facebook (which is available in a semantic web format), provides a jumpstart for the language extraction. However, the Semantic Web relates pages, but doesn't have any sort of "understanding" of what is on the pages. ontologies again!
  • 38. 38 Our Approach • Automate Agent Creation combining: Information Extraction: Using anew "living information extraction" technique, we will be able to create a "never-ending extractor" which will be pulling from web documents information about entities and events, and the relationships between them. The new system can work in a dynamic node, and does not need human annotated samples for training, but it works best if there are a number of known relationships between pages to build off of. Semantic Web: The Semantic Web provides a number of known relationships between pages on the Web in a number of domains. Using general knowledge sources, like dbpedia and Yago, and specialized knowledge sources, like the data from musicbrainz, the reviews from Yelp (which have semantic annotations) and even the Open Graph of Facebook (which is available in a semantic web format), provides a jumpstart for the language extraction. However, the Semantic Web relates pages, but doesn't have any sort of "understanding" of what is on the pages. Cognitive Computing : Cognitive Computing, can allow us to have a better way of accessing information about the entities found on the Web and finding other information about the same entities using various kinds of search and language heuristics. This allows us to have more organized information, rapidly generated, about the entities being explored. However, given a large graph of entities (even the organized linked-open data cloud has information about billions of things), how do we choose what to display next? If the best we can do is provide links, all of the above isn't much better than choosing a page and clicking from there. and here!
  • 39. 39 Our Approach • Automate Agent Creation combining: Information Extraction: Using anew "living information extraction" technique, we will be able to create a "never-ending extractor" which will be pulling from web documents information about entities and events, and the relationships between them. The new system can work in a dynamic node, and does not need human annotated samples for training, but it works best if there are a number of known relationships between pages to build off of. Semantic Web: The Semantic Web provides a number of known relationships between pages on the Web in a number of domains. Using general knowledge sources, like dbpedia and Yago, and specialized knowledge sources, like the data from musicbrainz, the reviews from Yelp (which have semantic annotations) and even the Open Graph of Facebook (which is available in a semantic web format), provides a jumpstart for the language extraction. However, the Semantic Web relates pages, but doesn't have any sort of "understanding" of what is on the pages. Cognitive Story-telling Technology: Interactive storytelling techniques are being explored to take information in the kind of "knowledge graph" resulting from the above, and tailoring the presentation to a user using storytelling techniques. It is aimed at presenting the information as an interesting and meaningful story by taking into consideration a combination of factors ranging from topic consistency and novelty, to learned user interests and even a user’s emotional reactions. The system can essentially determine "where to go next" and what to do there in the organized information as processed above.. “User models” (which we are encoding using an ontology)
  • 40. Tetherless World Constellation I could go on • There are many other such examples – Describing how data in one set could be joined to data in another is incredibly powerful, timely, and important • it’s just not really what people have typically used traditional reasoners for – Providing the ontological glue among different AI technologies: Priceless • Why aren’t we doing more to understand these issues and bring into the OWL “family?” – de facto standardization leads to adoption
  • 41. Tetherless World Constellation My challenge • Is the problem that we cannot have these powerful things in a decidable (sound and complete) language? – then maybe we have to give up decidability? • or even better, define new maths of expressivity that have different kinds of “sound and complete” behavior – i.e. approximately sound and complete » (is modern computational theory weakened by “within epsilon” optimality?) – i.e. “anytime” algorithms for reasoners » (conjecture) sound and complete at infinity
  • 42. Tetherless World Constellation Challenge • We may need to rethink how we are defining ontologies with respect to the necessary properties for use on the Web
  • 43. Tetherless World Constellation I’m not “anti-formalism” • A sufficient formalism for Semantic Web applications must – Provide a model that accounts for linked data • What is the equivalent of a DB calculus? – Provide a means for evaluating different kinds of completeness for reasoners • In practice we must be able to model A-box effects as formally as T-box technologies – example Weaver 2012 showed what restrictions were needed to maximize parallelism of some OWL subsets – Think about other processors than formal reasoners that will use the ontologies • ontologies used in many other ways – i.e. why does an IE system w/F1=.84 need a DL reasoner?
  • 44. Tetherless World Constellation Summary • The growing world of (semantic web, linked) data needs ontologies more than ever – OWL has some of the important things – But is missing many of the really important things • The problem isn’t (formal) expressivity – it’s the need to express other things (esp relationships between properties, events, etc.) – We need more research into how to formalize these kinds of relations
  • 45. Tetherless World Constellation Acknowledgments • I get funded by lots of folks – this talk may or may not represent anything anyone there believes: – ARL, DARPA, Microsoft, Mitre, Optum Laboratories • The Rensselaer Institute for Data Exploration and Application pays my salary – Thanks!! • Many of my best ideas, including a lot in this talk come from listening to smart people – Frank van Harmelen and Peter Mika have influenced my thinking about many of the ideas in this talk • The students and my colleagues at the RPI Tetherless World Constellation (tw.rpi.edu)