SlideShare uma empresa Scribd logo
1 de 53
Baixar para ler offline
Ontology-based Classification and
Faceted Search Interface for APIs
Knarig Arabshian, PhD
knarig@gmail.com
Overview
n  Motivation
n  Background
n  Problem
n  Solution
n  Related Work
n  Conclusion & Future Work
Motivation
n  Most of today’s Web content is suitable
for human consumption
n  Humans are left with the work of gathering
information from various websites
n  Web content is heterogeneous with little or no
structure
n  Data is not easily shared between web
content providers
Travel Example
Use services to
manually search for
airfares, car rentals
and hotels
Or search with
aggregating services
Use services to help plan travel
itinerary and provide information
on local sites such as weather,
events, or attractions
Use services that also
provide you with helpful
customer reviews
Semantic Web Vision
n  Web information can be processed by computers
n  Computers can integrate information from the web
“A web of data that can be processed directly
and indirectly by computers”
~Tim Berners-Lee (Inventor of WWW)
Quest for Semantics
Three main goals of the Semantic Web:
1. Building models: describe the world in abstract terms to
allow for an easier understanding of complex reality
2. Computing with knowledge: constructing reasoning
machines that can draw meaningful conclusions from
encoded knowledge
3. Exchanging Information: distribute, interlink, and
reconcile knowledge on a global scale
Planning Booking Reviews
Travel
Airline Tickets Car Rental Hotels
Using structured data, computers
can aggregate information and
customize it for the user
Travel
ontology
describes and
classifies
travel
services
Motivation
§  We can see similar problems when it comes to
API discovery on the Web
§  Discovering an API requires searching through a
large number of services on the Internet
§  Reading pages of documentation to figure out how
to use the ones that may match your application
§  Example: ProgrammableWeb (PW)
§  De facto API directory with over 14,000 APIs
§  Contains over 50 categories of services
§  API providers register their APIs in PW
§  Each API is manually categorized in a single category
by PW team
Exponential PW API Growth
Current state of PW
Current classification is
a flat categorization of
high-level service
classes without any
refinement between
common attributes
Needs a better method
for API discovery
Example: Search for Social
Advertising APIs in PW
Example: Search for Social
Advertising APIs in PW
Example: Search for Social Advertising
APIs in the Advertising Category
Search for
‘social’ and
‘advertising’
keywords in
Advertising
Category
Results in 7
APIs
Example: Search for Social Advertising
APIs in the Social Category
Search for
‘social’ and
‘advertising’
keywords in
Social
Category
Results in 2
APIs
What is needed?
A common data model has to be provided such as
an ontology in order to classify terms and
represent knowledge
Definition:
A formal, explicit specification of a shared
conceptualization ~ Tom Gruber
Overview
n  Motivation
n  Background
n  Problem
n  Solution
n  Related Work
n  Conclusion & Future Work
Ontology
§  OWL (Web Ontology Language): Approved
standard by W3C
§  Characteristics of ontologies
§  Classes: set of resources
§  Instances: ground level objects
§  Properties: relationships between classes
§  First order logic axioms
§  Class relationships such as disjointness, equivalence,
subsumption
§  Restrictions on properties such as existential, universal,
cardinality
Ontology Benefits
n  Standard way of describing the world both in terms
of language and meaning
n  Easily sharable across domains
n  Machine readable
n  Reasoning
n  Provide complex class relationships such as disjointness,
union, intersection besides pure hierarchy
n  Description logic reasoners automatically derive new
information and classify data
n  Automated classification can be very useful for dynamic data
that is continually updated
Ontology vs Relational Database
n  Similarities
n  Both use a model to identify common classes and
properties
n  ER model can be seen as a simple hierarchical
ontology
n  Differences
n  Ontologies are broader in scope (rules, incomplete
knowledge)
n  Ontologies provide a way for automated reasoning
to occur in order to discover new relationships
between entities
Example: Reasoning with a
Restaurant Ontology
Import class Cuisine
Create a restaurant
classification based on
cuisine by setting a
restriction on the
hasCuisine property
Example: Reasoning with a
Restaurant Ontology
Since ChineseCuisine has
non-disjoint siblings
JapaneseCuisine and
KoreanCuisine then also
conclude that these are
similar to ChineseCuisine
Example: Reasoning with a
Restaurant Ontology
Example: Reasoning with a
Restaurant Ontology
Run Reasoner for
Automated Classification
Conclude that
NewClass is
equivalent to
ChineseRestaurant
EQUIVALENT
Overview
n  Motivation
n  Background
n  Problem
n  Solution
n  Related Work
n  Conclusion & Future Work
Problem
§  Problem:
§  Improve API discovery and classification in Programmable
Web by providing a common data model such as an
ontology in order to automatically classify terms and perform
semantic API searches
§  Main Challenges:
§  Define high-level semantic descriptions of Programmable
Web services
§  Combine manual and automated data mining techniques to
create an ontology description of existing Programmable
Web services
§  Implement system that makes use of the ontology, such as
front-end user interface
What will improve?
§  Given a PW ontology, the system will:
§  Automatically classify existing API instances
within this ontology
§  Create an ontology-based user-interface for
automatic registration and querying
§  API providers will be able to register their services via this
interface
§  Users will be able to discover services with semantic queries
§  Example:
§  Find me an advertising service for social networks
§  Find me a social networking service for book
sharing
What do we need?
PW Service Classes Properties Feature Classes
API Individuals
AutomatedClassification
PW Service Classes
hasFeature
<140Proof, hasFeature, Advertising_Feature>
<140Proof, hasFeature, Social_Feature>
Advertising_Service
Social_Service
Advertising_Feature
Social_Feature
Advertising_Service
Social_Service
<BadgeVille, hasFeature, Advertising_Feature>
<BadgeVille, hasFeature, Social_Feature>
Refinement properties for a given PW Category to
enable automatic classification
Example: Ontology for Feature Class
Current PW Classification
VideoAdvertising Social Photo
PW Services
Travel
Video
Improved PW Classification using an
OWL Ontology
Advertising Social Photo
PW Services
VideoSocial PhotoSocial TravelSocial
Travel
AdvertisingSocial
APIs that have attributes
belonging in more than one
category will automatically be
classified
Current PW Search Interface
APIBrowse: Improved Faceted Search
Interface
Given the PW ontology, automatically generate a faceted search
interface by integrating it with a search platform such as SOLR
APIBrowse: Improved Faceted Search
Interface
Overlapping API Instances
Overlapping API Instances
Overview
n  Motivation
n  Background
n  Problem
n  Solution
n  Related Work
n  Conclusion & Future Work
LexOnt: A semi-automatic
ontology creation tool
§  A semi-automatic ontology creation tool that uses the
Programmable Web as its corpus
§  Suggest high-level property terms for a given service class
which distinguish it from the rest of the categories
§  Implemented as a Protege plugin, de facto ontology
editor, to aid in semi-automated ontology creation
§  Contributions:
§  Novel algorithm ranks terms and phrases within a PW category as
candidate property assignments by comparing them to external
domain knowledge within Wikipedia, Wordnet and the current
state of the ontology
§  Can be used even if the ontology engineer is not necessarily an
expert of a certain domain
LexOnt Algorithms
Well-known NLP algorithms used to find terms and phrases
§  TF-IDF: Text frequency-inverse document frequency
§  Score of a word in the document shows how important the word is
§  Importance of a word depends on how frequently the word has been
used in the document vs. all the documents in the corpus
§  Significant Phrases:
§  Chi-square test used to calculate the significance of collocated words
§  Two-phase process:
§  Determine collocations and terms that appear together
§  Filter out unique collocations from the list
§  Gave a very good indication of high-level property descriptions
LexOnt Algorithms
Novel algorithm uses external resources like Wikipedia, Wordnet
and the constructed ontology to highlight the important terms
even more
§  Useful for those who are not domain experts but want to
understand what the relevant terms of a domain are
§  Algorithm for using the External Knowledge Base
§  Extract Wikipedia page for each category and rank top words with TF-IDF
§  If a word or phrase in the API contains any of the top Wikipedia words, label it
§  Find synonymous or related terms to the list of generated terms using Wordnet
§  If a word or phrase in the API contains any of the related terms label them
§  If any of the generated terms lexically match terms in the ontology label them
using a color code
Top N TF-IDF from Wiki Advertising, marketing, brand, television, semiotics,
advertisement, billboard, radio, product, bowl,
sponsor, consumer, advertise, placement, super,
logo, commercial, infomercial
Top N TF-IDF from Wordnet Ad, advertisement, advertizement, advertising,
advertizing, advert, promotion, direct-mail, prview,
advertorial, mailer, newspaper-ad, commercial,
circular, teaser, top-billing
Top N TF-IDF from PW
Category
Proof, persona, stream, replies, authors, say, hello,
ad, brands, social, consumers, advertisers,
audience, ads
Top N TF-IDF Ranked based
on external KB
Advertisers (wiki), Consumers(wiki),
Social(wiki) Brands(wiki), Ads (related), Ad
(related), proof, persona, stream, replies, authors,
say, hello, audience
Top N Significant Phrases
ranked based on external KB
Stream advertising (wiki), social
stream(wiki), say hello, author, replies, google
groups, ober, michaels, proof, erik, michaels,
persona targeting
Example of Property Selection from a
Social Advertising API
LexOnt: A semi-automatic
ontology creation tool
LexOnt Implementation
§  LexOnt is implemented as a Protege plugin to
enhance the user experience of semi-
automated ontology creation
§  Four different Java APIs used for the
implementation
§  Lingpipe API used for the NLP algorithms to
generate TF-IDF terms and Significant Phrases
§  Lucene used for indexing and searching for terms
§  Protege API used for implementing the Protege
plugin GUI
§  OWL-API used for ontology generation code
LexOnt Results
§  Used PW Corpus of ~3000 APIs equalling 250MB data
§  Constructed ontology for 5 categories with following features:
§  Domain specificity
§  A priori knowledge of domain
§  Number of APIs within the domain
§  Tested for four things when evaluating LexOnt
1. The precision/recall of the TF-IDF term and Significant Phrase
generation without external KB information
2. How helpful the external KB was when choosing terms by finding
the percentage of terms used in ontology
3. Whether or not the terms were used in their exact form, similar
form or different forms
4. How quickly an ontology API was constructed by the user
LexOnt Results
1. Precision/Recall tests for terms without taking external KB
into account
n  4% precision
n  28% recall
Results:
Using only TF-IDF/Sig Phrases alone is not good enough to
determine how terms should be used
2. For categories with well-defined Wikipedia pages,
percentage of terms used from external KB was >50%
Results:
Well-defined external KBs made it much easier to quickly assess
distinguishing features of a category
LexOnt Results
Domain Number of
APIs
Specifically
Defined
External KB
A priori
knowledge
of Domain
% terms
used from
External KB
Advertising <100 √ X 50%
Travel <100 √ √ 80%
Real Estate <100 √ X 100%
Social >100 X √ 20%
LexOnt Results
3. Tested to see how these terms were actually assigned
within the instances
n  Compared matches that were exact, similar or completely
different
n  Example: if LexOnt produced a term “mobile” but the actual
ontology assignment was “mobile advertising,” this would count
as a similar match
n  Percentage of equal and similar matches for API instances
averaged over 80%
Results:
n  External KB terms were used over 80% of the time
n  Percentage of different matches was higher when category was
not well-defined such as the Utility category
LexOnt Results
4. Speed of ontology construction
n  Before we had the LexOnt tool, and only worked with generated
TF-IDF/Sig Phrase terms, it took around 15 minutes to construct an
API instance and related feature
n  After the completion of LexOnt, this dropped to 2 minutes.
Results:
n  LexOnt’s user interface and external knowledge base ranking
reduced the time for ontology construction by a factor of 7
Overview
n  Motivation
n  Background
n  Problem
n  Solution
n  Related Work
n  Conclusion & Future Work
Related Work
§  Most related work involves semi-automated ontology
creation for
§  Pure hierarchical ontologies
§  Domains that already have some kind of structural description
§  Machine learning and NLP techniques used
§  On text corpora
§  Alongside existing structured or annotated external knowledge
base
§  The work closest to LexOnt’s
§  Find property relationships between concepts
§  Use unstructured external knowledge bases
Related Work
System Corpus Ontology
Suggestions
External
Knowledge
Text2Onto annotated Probabilistic
Ontology
Models
None
OntoLT rule-based Classes and
properties
None
OntoLearn unstructured Hierarchical
classification
Definitions,
Synonyms
LexOnt unstructured Properties Wikipedia,
Wordnet,
Generated
Ontology
Conclusion
n  LexOnt has shown to be an effective tool for
semi-automated ontology creation
n  From our initial results, we have determined that
using an external knowledge base to filter out
generated terms and phrases
n  Increases the accuracy of the feature selection
n  Helps in understanding the common terms within a
corpus
LexOnt Publications
n  Knarig Arabshian and Peter Danielsen, Ontology-based Faceted Search
Interface for APIs (In Journal Submission).
n  Peter Danielsen and Knarig Arabshian, User Interface Design in Semi-
Automated Ontology Construction, International Conference on Web
Services (ICWS 2013), Santa Clara, CA, June 2013.
n  Knarig Arabshian, Peter Danielsen and Sadia Afroz, LexOnt: Semi-
Automatic ontology Creation Tool for Programmable Web, AAAI 2012
Spring Symposium on Intelligent Web Services Meet Social Computing,
Palo Alto, CA, March 2012.
n  Knarig Arabshian and Peter Danielsen, Semi-automated Ontology
Creation for High-level Service Classification 7th International
Conference on Semantics, Knowledge and Grids (SKG 2011), Beijing,
China, Oct 2011.

Mais conteúdo relacionado

Mais procurados

Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mappingsamhati27
 
Ontologies and semantic web
Ontologies and semantic webOntologies and semantic web
Ontologies and semantic webStanley Wang
 
Ontology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغهOntology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغهsadegh salehi
 
Introduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and TerminologyIntroduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and TerminologySteven Miller
 
The Standardization of Semantic Web Ontology
The Standardization of Semantic Web OntologyThe Standardization of Semantic Web Ontology
The Standardization of Semantic Web OntologyMyungjin Lee
 
Eswc2012 ss ontologies
Eswc2012 ss ontologiesEswc2012 ss ontologies
Eswc2012 ss ontologiesElena Simperl
 
Ontology mapping for the semantic web
Ontology mapping for the semantic webOntology mapping for the semantic web
Ontology mapping for the semantic webWorawith Sangkatip
 
Ontology and its various aspects
Ontology and its various aspectsOntology and its various aspects
Ontology and its various aspectssamhati27
 
Using Text Comprehension Model for Learning Concepts, Context, and Topic of...
Using Text Comprehension Model for  Learning Concepts, Context, and Topic  of...Using Text Comprehension Model for  Learning Concepts, Context, and Topic  of...
Using Text Comprehension Model for Learning Concepts, Context, and Topic of...Kent State University
 
Ontology Engineering: Introduction
Ontology Engineering: IntroductionOntology Engineering: Introduction
Ontology Engineering: IntroductionGuus Schreiber
 
ONTOLOGY BASED DATA ACCESS
ONTOLOGY BASED DATA ACCESSONTOLOGY BASED DATA ACCESS
ONTOLOGY BASED DATA ACCESSKishan Patel
 
Ontology and Ontology Libraries: a Critical Study
Ontology and Ontology Libraries: a Critical StudyOntology and Ontology Libraries: a Critical Study
Ontology and Ontology Libraries: a Critical StudyDebashisnaskar
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISrathnaarul
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Webliddy
 
Accessibility Issues
Accessibility IssuesAccessibility Issues
Accessibility Issuesliddy
 
Ontology integration - Heterogeneity, Techniques and more
Ontology integration - Heterogeneity, Techniques and moreOntology integration - Heterogeneity, Techniques and more
Ontology integration - Heterogeneity, Techniques and moreAdriel Café
 

Mais procurados (20)

Ontology
OntologyOntology
Ontology
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
 
Ontologies and semantic web
Ontologies and semantic webOntologies and semantic web
Ontologies and semantic web
 
Ontology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغهOntology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغه
 
Ontology engineering
Ontology engineering Ontology engineering
Ontology engineering
 
Introduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and TerminologyIntroduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and Terminology
 
The Standardization of Semantic Web Ontology
The Standardization of Semantic Web OntologyThe Standardization of Semantic Web Ontology
The Standardization of Semantic Web Ontology
 
Eswc2012 ss ontologies
Eswc2012 ss ontologiesEswc2012 ss ontologies
Eswc2012 ss ontologies
 
Ontology mapping for the semantic web
Ontology mapping for the semantic webOntology mapping for the semantic web
Ontology mapping for the semantic web
 
Ontology and its various aspects
Ontology and its various aspectsOntology and its various aspects
Ontology and its various aspects
 
Ontologies
OntologiesOntologies
Ontologies
 
Using Text Comprehension Model for Learning Concepts, Context, and Topic of...
Using Text Comprehension Model for  Learning Concepts, Context, and Topic  of...Using Text Comprehension Model for  Learning Concepts, Context, and Topic  of...
Using Text Comprehension Model for Learning Concepts, Context, and Topic of...
 
Ontology Engineering: Introduction
Ontology Engineering: IntroductionOntology Engineering: Introduction
Ontology Engineering: Introduction
 
ONTOLOGY BASED DATA ACCESS
ONTOLOGY BASED DATA ACCESSONTOLOGY BASED DATA ACCESS
ONTOLOGY BASED DATA ACCESS
 
Ontology and Ontology Libraries: a Critical Study
Ontology and Ontology Libraries: a Critical StudyOntology and Ontology Libraries: a Critical Study
Ontology and Ontology Libraries: a Critical Study
 
Information Extraction
Information ExtractionInformation Extraction
Information Extraction
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Accessibility Issues
Accessibility IssuesAccessibility Issues
Accessibility Issues
 
Ontology integration - Heterogeneity, Techniques and more
Ontology integration - Heterogeneity, Techniques and moreOntology integration - Heterogeneity, Techniques and more
Ontology integration - Heterogeneity, Techniques and more
 

Destaque

Faceted Navigation for SEO: Technical Tricks to Boost Your Visibility
Faceted Navigation for SEO: Technical Tricks to Boost Your VisibilityFaceted Navigation for SEO: Technical Tricks to Boost Your Visibility
Faceted Navigation for SEO: Technical Tricks to Boost Your VisibilityAllotment Digital Marketing
 
Faceted navigation webinar
Faceted navigation webinarFaceted navigation webinar
Faceted navigation webinarAnna Morrison
 
Semantic Search Engines
Semantic Search EnginesSemantic Search Engines
Semantic Search EnginesAtul Shridhar
 
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog TextTwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog TextLeon Derczynski
 
Adding Semantic Edge to Your Content – From Authoring to Delivery
Adding Semantic Edge to Your Content – From Authoring to DeliveryAdding Semantic Edge to Your Content – From Authoring to Delivery
Adding Semantic Edge to Your Content – From Authoring to DeliveryOntotext
 
Intriduction to Ontotext's KIM platform
Intriduction to Ontotext's KIM platformIntriduction to Ontotext's KIM platform
Intriduction to Ontotext's KIM platformtoncho11
 
Ontological approach for improving semantic web search results
Ontological approach for improving semantic web search resultsOntological approach for improving semantic web search results
Ontological approach for improving semantic web search resultseSAT Journals
 
Extending facet search to the general web
Extending facet search to the general webExtending facet search to the general web
Extending facet search to the general web祺傑 林
 
In Search of a Semantic Book Search Engine: Are We There Yet?
In Search of a Semantic Book Search Engine: Are We There Yet?In Search of a Semantic Book Search Engine: Are We There Yet?
In Search of a Semantic Book Search Engine: Are We There Yet?Irfan Ullah
 
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information RetrievalKeystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information RetrievalMauro Dragoni
 
Semantics And Search
Semantics And SearchSemantics And Search
Semantics And SearchVestforsk.no
 
Semantic data mining: an ontology based approach
Semantic data mining: an ontology based approachSemantic data mining: an ontology based approach
Semantic data mining: an ontology based approachAgnieszka Ławrynowicz
 
Text Analysis and Semantic Search with GATE
Text Analysis and Semantic Search with GATEText Analysis and Semantic Search with GATE
Text Analysis and Semantic Search with GATEDiana Maynard
 
Semantic security framework and context-aware role-based access control ontol...
Semantic security framework and context-aware role-based access control ontol...Semantic security framework and context-aware role-based access control ontol...
Semantic security framework and context-aware role-based access control ontol...Natalia Díaz Rodríguez
 
Semantic Search at Yahoo
Semantic Search at YahooSemantic Search at Yahoo
Semantic Search at YahooPeter Mika
 

Destaque (20)

Faceted Navigation for SEO: Technical Tricks to Boost Your Visibility
Faceted Navigation for SEO: Technical Tricks to Boost Your VisibilityFaceted Navigation for SEO: Technical Tricks to Boost Your Visibility
Faceted Navigation for SEO: Technical Tricks to Boost Your Visibility
 
Faceted navigation webinar
Faceted navigation webinarFaceted navigation webinar
Faceted navigation webinar
 
Android Apps the Right Way
Android Apps the Right WayAndroid Apps the Right Way
Android Apps the Right Way
 
Test Dependencies and the Future of Build Acceleration
Test Dependencies and the Future of Build AccelerationTest Dependencies and the Future of Build Acceleration
Test Dependencies and the Future of Build Acceleration
 
Static Analysis and Verification of C Programs
Static Analysis and Verification of C ProgramsStatic Analysis and Verification of C Programs
Static Analysis and Verification of C Programs
 
Semantic Search Engines
Semantic Search EnginesSemantic Search Engines
Semantic Search Engines
 
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog TextTwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
 
Adding Semantic Edge to Your Content – From Authoring to Delivery
Adding Semantic Edge to Your Content – From Authoring to DeliveryAdding Semantic Edge to Your Content – From Authoring to Delivery
Adding Semantic Edge to Your Content – From Authoring to Delivery
 
Intriduction to Ontotext's KIM platform
Intriduction to Ontotext's KIM platformIntriduction to Ontotext's KIM platform
Intriduction to Ontotext's KIM platform
 
Ontological approach for improving semantic web search results
Ontological approach for improving semantic web search resultsOntological approach for improving semantic web search results
Ontological approach for improving semantic web search results
 
A Taxonomy of Semantic Web data Retrieval Techniques
A Taxonomy of Semantic Web data Retrieval TechniquesA Taxonomy of Semantic Web data Retrieval Techniques
A Taxonomy of Semantic Web data Retrieval Techniques
 
Extending facet search to the general web
Extending facet search to the general webExtending facet search to the general web
Extending facet search to the general web
 
Faceted Navigation
Faceted NavigationFaceted Navigation
Faceted Navigation
 
In Search of a Semantic Book Search Engine: Are We There Yet?
In Search of a Semantic Book Search Engine: Are We There Yet?In Search of a Semantic Book Search Engine: Are We There Yet?
In Search of a Semantic Book Search Engine: Are We There Yet?
 
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information RetrievalKeystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
 
Semantics And Search
Semantics And SearchSemantics And Search
Semantics And Search
 
Semantic data mining: an ontology based approach
Semantic data mining: an ontology based approachSemantic data mining: an ontology based approach
Semantic data mining: an ontology based approach
 
Text Analysis and Semantic Search with GATE
Text Analysis and Semantic Search with GATEText Analysis and Semantic Search with GATE
Text Analysis and Semantic Search with GATE
 
Semantic security framework and context-aware role-based access control ontol...
Semantic security framework and context-aware role-based access control ontol...Semantic security framework and context-aware role-based access control ontol...
Semantic security framework and context-aware role-based access control ontol...
 
Semantic Search at Yahoo
Semantic Search at YahooSemantic Search at Yahoo
Semantic Search at Yahoo
 

Semelhante a Ontology-based Classification and Faceted Search Interface for APIs

Resource Discovery Landscape
Resource Discovery LandscapeResource Discovery Landscape
Resource Discovery LandscapeAndy Powell
 
O'Reilly SACon San Jose, CA - 2019 - API design tutorial
O'Reilly SACon San Jose, CA - 2019 - API design tutorialO'Reilly SACon San Jose, CA - 2019 - API design tutorial
O'Reilly SACon San Jose, CA - 2019 - API design tutorialTom Hofte
 
From Watson to Ontology Repositories - Ontolog OOR panel
From Watson to Ontology Repositories - Ontolog OOR panelFrom Watson to Ontology Repositories - Ontolog OOR panel
From Watson to Ontology Repositories - Ontolog OOR panelMathieu d'Aquin
 
Mind The Gap - Mapping a domain model to a RESTful API - OReilly SACon 2018, ...
Mind The Gap - Mapping a domain model to a RESTful API - OReilly SACon 2018, ...Mind The Gap - Mapping a domain model to a RESTful API - OReilly SACon 2018, ...
Mind The Gap - Mapping a domain model to a RESTful API - OReilly SACon 2018, ...Tom Hofte
 
Understanding Seo At A Glance
Understanding Seo At A GlanceUnderstanding Seo At A Glance
Understanding Seo At A Glancepoojagupta267
 
Greenlight Blogging For Seo 25-04-08
Greenlight   Blogging For Seo 25-04-08Greenlight   Blogging For Seo 25-04-08
Greenlight Blogging For Seo 25-04-08InBlackandWhite
 
X api chinese cop monthly meeting feb.2016
X api chinese cop monthly meeting   feb.2016X api chinese cop monthly meeting   feb.2016
X api chinese cop monthly meeting feb.2016Jessie Chuang
 
How to generate >100k visits with 300 articles?
How to generate >100k visits with 300 articles?How to generate >100k visits with 300 articles?
How to generate >100k visits with 300 articles?Senuto
 
Beginners Guide To Search Engine Optimization
Beginners Guide To Search Engine OptimizationBeginners Guide To Search Engine Optimization
Beginners Guide To Search Engine Optimizationspritemathews
 
API (Application program interface)
API (Application program interface)API (Application program interface)
API (Application program interface)Muhammad Jahanzaib
 
Create a Smooth & Satisfying Reader Experience using Metadata-Based Links...
Create a Smooth & Satisfying Reader Experience using Metadata-Based Links...Create a Smooth & Satisfying Reader Experience using Metadata-Based Links...
Create a Smooth & Satisfying Reader Experience using Metadata-Based Links...Joe Pairman
 
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...Sease
 
Aula 2 - Planning for Web Engineering by Roger Pressman
Aula 2 -  Planning for Web Engineering by Roger PressmanAula 2 -  Planning for Web Engineering by Roger Pressman
Aula 2 - Planning for Web Engineering by Roger PressmanRogerio P C do Nascimento
 
SEO Social Blog: SEO Training 2010 From SEOmoz
SEO Social Blog:  SEO Training 2010 From SEOmoz SEO Social Blog:  SEO Training 2010 From SEOmoz
SEO Social Blog: SEO Training 2010 From SEOmoz SEO Social Blog
 
Introduction to SEO
Introduction to SEOIntroduction to SEO
Introduction to SEORand Fishkin
 
Seo training-2010-100818134052-phpapp02 (1)
Seo training-2010-100818134052-phpapp02 (1)Seo training-2010-100818134052-phpapp02 (1)
Seo training-2010-100818134052-phpapp02 (1)Dharmendra Patel
 
Search Enginge Optimization: SEOmoz
Search Enginge Optimization: SEOmozSearch Enginge Optimization: SEOmoz
Search Enginge Optimization: SEOmozmbragi
 

Semelhante a Ontology-based Classification and Faceted Search Interface for APIs (20)

Resource Discovery Landscape
Resource Discovery LandscapeResource Discovery Landscape
Resource Discovery Landscape
 
O'Reilly SACon San Jose, CA - 2019 - API design tutorial
O'Reilly SACon San Jose, CA - 2019 - API design tutorialO'Reilly SACon San Jose, CA - 2019 - API design tutorial
O'Reilly SACon San Jose, CA - 2019 - API design tutorial
 
From Watson to Ontology Repositories - Ontolog OOR panel
From Watson to Ontology Repositories - Ontolog OOR panelFrom Watson to Ontology Repositories - Ontolog OOR panel
From Watson to Ontology Repositories - Ontolog OOR panel
 
Mind The Gap - Mapping a domain model to a RESTful API - OReilly SACon 2018, ...
Mind The Gap - Mapping a domain model to a RESTful API - OReilly SACon 2018, ...Mind The Gap - Mapping a domain model to a RESTful API - OReilly SACon 2018, ...
Mind The Gap - Mapping a domain model to a RESTful API - OReilly SACon 2018, ...
 
Understanding Seo At A Glance
Understanding Seo At A GlanceUnderstanding Seo At A Glance
Understanding Seo At A Glance
 
Greenlight Blogging For Seo 25-04-08
Greenlight   Blogging For Seo 25-04-08Greenlight   Blogging For Seo 25-04-08
Greenlight Blogging For Seo 25-04-08
 
X api chinese cop monthly meeting feb.2016
X api chinese cop monthly meeting   feb.2016X api chinese cop monthly meeting   feb.2016
X api chinese cop monthly meeting feb.2016
 
How to generate >100k visits with 300 articles?
How to generate >100k visits with 300 articles?How to generate >100k visits with 300 articles?
How to generate >100k visits with 300 articles?
 
Beginners Guide To Search Engine Optimization
Beginners Guide To Search Engine OptimizationBeginners Guide To Search Engine Optimization
Beginners Guide To Search Engine Optimization
 
API (Application program interface)
API (Application program interface)API (Application program interface)
API (Application program interface)
 
Create a Smooth & Satisfying Reader Experience using Metadata-Based Links...
Create a Smooth & Satisfying Reader Experience using Metadata-Based Links...Create a Smooth & Satisfying Reader Experience using Metadata-Based Links...
Create a Smooth & Satisfying Reader Experience using Metadata-Based Links...
 
AAN TrafficPresentation
AAN TrafficPresentationAAN TrafficPresentation
AAN TrafficPresentation
 
SEO
SEOSEO
SEO
 
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...
Rated Ranking Evaluator Enterprise: the next generation of free Search Qualit...
 
Aula 2 - Planning for Web Engineering by Roger Pressman
Aula 2 -  Planning for Web Engineering by Roger PressmanAula 2 -  Planning for Web Engineering by Roger Pressman
Aula 2 - Planning for Web Engineering by Roger Pressman
 
SEO Social Blog: SEO Training 2010 From SEOmoz
SEO Social Blog:  SEO Training 2010 From SEOmoz SEO Social Blog:  SEO Training 2010 From SEOmoz
SEO Social Blog: SEO Training 2010 From SEOmoz
 
Introduction to SEO
Introduction to SEOIntroduction to SEO
Introduction to SEO
 
Seo training-2010-100818134052-phpapp02 (1)
Seo training-2010-100818134052-phpapp02 (1)Seo training-2010-100818134052-phpapp02 (1)
Seo training-2010-100818134052-phpapp02 (1)
 
Search Enginge Optimization: SEOmoz
Search Enginge Optimization: SEOmozSearch Enginge Optimization: SEOmoz
Search Enginge Optimization: SEOmoz
 
SEO MARKETING TRAINING
SEO MARKETING TRAININGSEO MARKETING TRAINING
SEO MARKETING TRAINING
 

Mais de New York City College of Technology Computer Systems Technology Colloquium

Mais de New York City College of Technology Computer Systems Technology Colloquium (9)

Towards Improving Interface Modularity in Legacy Java Software Through Automa...
Towards Improving Interface Modularity in Legacy Java Software Through Automa...Towards Improving Interface Modularity in Legacy Java Software Through Automa...
Towards Improving Interface Modularity in Legacy Java Software Through Automa...
 
Data-driven, Interactive Scientific Articles in a Collaborative Environment w...
Data-driven, Interactive Scientific Articles in a Collaborative Environment w...Data-driven, Interactive Scientific Articles in a Collaborative Environment w...
Data-driven, Interactive Scientific Articles in a Collaborative Environment w...
 
Cloud Technology: Virtualization
Cloud Technology: VirtualizationCloud Technology: Virtualization
Cloud Technology: Virtualization
 
Google BigTable
Google BigTableGoogle BigTable
Google BigTable
 
Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...
Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...
Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...
 
How We Use Functional Programming to Find the Bad Guys
How We Use Functional Programming to Find the Bad GuysHow We Use Functional Programming to Find the Bad Guys
How We Use Functional Programming to Find the Bad Guys
 
Big Data Challenges and Solutions
Big Data Challenges and SolutionsBig Data Challenges and Solutions
Big Data Challenges and Solutions
 
Introduction to new features in java 8
Introduction to new features in java 8Introduction to new features in java 8
Introduction to new features in java 8
 
More than Words: Advancing Prosodic Analysis
More than Words: Advancing Prosodic AnalysisMore than Words: Advancing Prosodic Analysis
More than Words: Advancing Prosodic Analysis
 

Último

SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendArshad QA
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfCionsystems
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 

Último (20)

SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdf
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 

Ontology-based Classification and Faceted Search Interface for APIs

  • 1. Ontology-based Classification and Faceted Search Interface for APIs Knarig Arabshian, PhD knarig@gmail.com
  • 2. Overview n  Motivation n  Background n  Problem n  Solution n  Related Work n  Conclusion & Future Work
  • 3. Motivation n  Most of today’s Web content is suitable for human consumption n  Humans are left with the work of gathering information from various websites n  Web content is heterogeneous with little or no structure n  Data is not easily shared between web content providers
  • 4. Travel Example Use services to manually search for airfares, car rentals and hotels Or search with aggregating services Use services to help plan travel itinerary and provide information on local sites such as weather, events, or attractions Use services that also provide you with helpful customer reviews
  • 5. Semantic Web Vision n  Web information can be processed by computers n  Computers can integrate information from the web “A web of data that can be processed directly and indirectly by computers” ~Tim Berners-Lee (Inventor of WWW)
  • 6. Quest for Semantics Three main goals of the Semantic Web: 1. Building models: describe the world in abstract terms to allow for an easier understanding of complex reality 2. Computing with knowledge: constructing reasoning machines that can draw meaningful conclusions from encoded knowledge 3. Exchanging Information: distribute, interlink, and reconcile knowledge on a global scale
  • 7. Planning Booking Reviews Travel Airline Tickets Car Rental Hotels Using structured data, computers can aggregate information and customize it for the user Travel ontology describes and classifies travel services
  • 8. Motivation §  We can see similar problems when it comes to API discovery on the Web §  Discovering an API requires searching through a large number of services on the Internet §  Reading pages of documentation to figure out how to use the ones that may match your application §  Example: ProgrammableWeb (PW) §  De facto API directory with over 14,000 APIs §  Contains over 50 categories of services §  API providers register their APIs in PW §  Each API is manually categorized in a single category by PW team
  • 10. Current state of PW Current classification is a flat categorization of high-level service classes without any refinement between common attributes Needs a better method for API discovery
  • 11. Example: Search for Social Advertising APIs in PW
  • 12. Example: Search for Social Advertising APIs in PW
  • 13. Example: Search for Social Advertising APIs in the Advertising Category Search for ‘social’ and ‘advertising’ keywords in Advertising Category Results in 7 APIs
  • 14. Example: Search for Social Advertising APIs in the Social Category Search for ‘social’ and ‘advertising’ keywords in Social Category Results in 2 APIs
  • 15. What is needed? A common data model has to be provided such as an ontology in order to classify terms and represent knowledge Definition: A formal, explicit specification of a shared conceptualization ~ Tom Gruber
  • 16. Overview n  Motivation n  Background n  Problem n  Solution n  Related Work n  Conclusion & Future Work
  • 17. Ontology §  OWL (Web Ontology Language): Approved standard by W3C §  Characteristics of ontologies §  Classes: set of resources §  Instances: ground level objects §  Properties: relationships between classes §  First order logic axioms §  Class relationships such as disjointness, equivalence, subsumption §  Restrictions on properties such as existential, universal, cardinality
  • 18. Ontology Benefits n  Standard way of describing the world both in terms of language and meaning n  Easily sharable across domains n  Machine readable n  Reasoning n  Provide complex class relationships such as disjointness, union, intersection besides pure hierarchy n  Description logic reasoners automatically derive new information and classify data n  Automated classification can be very useful for dynamic data that is continually updated
  • 19. Ontology vs Relational Database n  Similarities n  Both use a model to identify common classes and properties n  ER model can be seen as a simple hierarchical ontology n  Differences n  Ontologies are broader in scope (rules, incomplete knowledge) n  Ontologies provide a way for automated reasoning to occur in order to discover new relationships between entities
  • 20. Example: Reasoning with a Restaurant Ontology Import class Cuisine Create a restaurant classification based on cuisine by setting a restriction on the hasCuisine property
  • 21. Example: Reasoning with a Restaurant Ontology Since ChineseCuisine has non-disjoint siblings JapaneseCuisine and KoreanCuisine then also conclude that these are similar to ChineseCuisine
  • 22. Example: Reasoning with a Restaurant Ontology
  • 23. Example: Reasoning with a Restaurant Ontology Run Reasoner for Automated Classification Conclude that NewClass is equivalent to ChineseRestaurant EQUIVALENT
  • 24. Overview n  Motivation n  Background n  Problem n  Solution n  Related Work n  Conclusion & Future Work
  • 25. Problem §  Problem: §  Improve API discovery and classification in Programmable Web by providing a common data model such as an ontology in order to automatically classify terms and perform semantic API searches §  Main Challenges: §  Define high-level semantic descriptions of Programmable Web services §  Combine manual and automated data mining techniques to create an ontology description of existing Programmable Web services §  Implement system that makes use of the ontology, such as front-end user interface
  • 26. What will improve? §  Given a PW ontology, the system will: §  Automatically classify existing API instances within this ontology §  Create an ontology-based user-interface for automatic registration and querying §  API providers will be able to register their services via this interface §  Users will be able to discover services with semantic queries §  Example: §  Find me an advertising service for social networks §  Find me a social networking service for book sharing
  • 27. What do we need? PW Service Classes Properties Feature Classes API Individuals AutomatedClassification PW Service Classes hasFeature <140Proof, hasFeature, Advertising_Feature> <140Proof, hasFeature, Social_Feature> Advertising_Service Social_Service Advertising_Feature Social_Feature Advertising_Service Social_Service <BadgeVille, hasFeature, Advertising_Feature> <BadgeVille, hasFeature, Social_Feature> Refinement properties for a given PW Category to enable automatic classification
  • 28. Example: Ontology for Feature Class
  • 29.
  • 30. Current PW Classification VideoAdvertising Social Photo PW Services Travel
  • 31. Video Improved PW Classification using an OWL Ontology Advertising Social Photo PW Services VideoSocial PhotoSocial TravelSocial Travel AdvertisingSocial APIs that have attributes belonging in more than one category will automatically be classified
  • 32. Current PW Search Interface
  • 33. APIBrowse: Improved Faceted Search Interface Given the PW ontology, automatically generate a faceted search interface by integrating it with a search platform such as SOLR
  • 34. APIBrowse: Improved Faceted Search Interface
  • 37. Overview n  Motivation n  Background n  Problem n  Solution n  Related Work n  Conclusion & Future Work
  • 38. LexOnt: A semi-automatic ontology creation tool §  A semi-automatic ontology creation tool that uses the Programmable Web as its corpus §  Suggest high-level property terms for a given service class which distinguish it from the rest of the categories §  Implemented as a Protege plugin, de facto ontology editor, to aid in semi-automated ontology creation §  Contributions: §  Novel algorithm ranks terms and phrases within a PW category as candidate property assignments by comparing them to external domain knowledge within Wikipedia, Wordnet and the current state of the ontology §  Can be used even if the ontology engineer is not necessarily an expert of a certain domain
  • 39. LexOnt Algorithms Well-known NLP algorithms used to find terms and phrases §  TF-IDF: Text frequency-inverse document frequency §  Score of a word in the document shows how important the word is §  Importance of a word depends on how frequently the word has been used in the document vs. all the documents in the corpus §  Significant Phrases: §  Chi-square test used to calculate the significance of collocated words §  Two-phase process: §  Determine collocations and terms that appear together §  Filter out unique collocations from the list §  Gave a very good indication of high-level property descriptions
  • 40. LexOnt Algorithms Novel algorithm uses external resources like Wikipedia, Wordnet and the constructed ontology to highlight the important terms even more §  Useful for those who are not domain experts but want to understand what the relevant terms of a domain are §  Algorithm for using the External Knowledge Base §  Extract Wikipedia page for each category and rank top words with TF-IDF §  If a word or phrase in the API contains any of the top Wikipedia words, label it §  Find synonymous or related terms to the list of generated terms using Wordnet §  If a word or phrase in the API contains any of the related terms label them §  If any of the generated terms lexically match terms in the ontology label them using a color code
  • 41. Top N TF-IDF from Wiki Advertising, marketing, brand, television, semiotics, advertisement, billboard, radio, product, bowl, sponsor, consumer, advertise, placement, super, logo, commercial, infomercial Top N TF-IDF from Wordnet Ad, advertisement, advertizement, advertising, advertizing, advert, promotion, direct-mail, prview, advertorial, mailer, newspaper-ad, commercial, circular, teaser, top-billing Top N TF-IDF from PW Category Proof, persona, stream, replies, authors, say, hello, ad, brands, social, consumers, advertisers, audience, ads Top N TF-IDF Ranked based on external KB Advertisers (wiki), Consumers(wiki), Social(wiki) Brands(wiki), Ads (related), Ad (related), proof, persona, stream, replies, authors, say, hello, audience Top N Significant Phrases ranked based on external KB Stream advertising (wiki), social stream(wiki), say hello, author, replies, google groups, ober, michaels, proof, erik, michaels, persona targeting Example of Property Selection from a Social Advertising API
  • 43. LexOnt Implementation §  LexOnt is implemented as a Protege plugin to enhance the user experience of semi- automated ontology creation §  Four different Java APIs used for the implementation §  Lingpipe API used for the NLP algorithms to generate TF-IDF terms and Significant Phrases §  Lucene used for indexing and searching for terms §  Protege API used for implementing the Protege plugin GUI §  OWL-API used for ontology generation code
  • 44. LexOnt Results §  Used PW Corpus of ~3000 APIs equalling 250MB data §  Constructed ontology for 5 categories with following features: §  Domain specificity §  A priori knowledge of domain §  Number of APIs within the domain §  Tested for four things when evaluating LexOnt 1. The precision/recall of the TF-IDF term and Significant Phrase generation without external KB information 2. How helpful the external KB was when choosing terms by finding the percentage of terms used in ontology 3. Whether or not the terms were used in their exact form, similar form or different forms 4. How quickly an ontology API was constructed by the user
  • 45. LexOnt Results 1. Precision/Recall tests for terms without taking external KB into account n  4% precision n  28% recall Results: Using only TF-IDF/Sig Phrases alone is not good enough to determine how terms should be used 2. For categories with well-defined Wikipedia pages, percentage of terms used from external KB was >50% Results: Well-defined external KBs made it much easier to quickly assess distinguishing features of a category
  • 46. LexOnt Results Domain Number of APIs Specifically Defined External KB A priori knowledge of Domain % terms used from External KB Advertising <100 √ X 50% Travel <100 √ √ 80% Real Estate <100 √ X 100% Social >100 X √ 20%
  • 47. LexOnt Results 3. Tested to see how these terms were actually assigned within the instances n  Compared matches that were exact, similar or completely different n  Example: if LexOnt produced a term “mobile” but the actual ontology assignment was “mobile advertising,” this would count as a similar match n  Percentage of equal and similar matches for API instances averaged over 80% Results: n  External KB terms were used over 80% of the time n  Percentage of different matches was higher when category was not well-defined such as the Utility category
  • 48. LexOnt Results 4. Speed of ontology construction n  Before we had the LexOnt tool, and only worked with generated TF-IDF/Sig Phrase terms, it took around 15 minutes to construct an API instance and related feature n  After the completion of LexOnt, this dropped to 2 minutes. Results: n  LexOnt’s user interface and external knowledge base ranking reduced the time for ontology construction by a factor of 7
  • 49. Overview n  Motivation n  Background n  Problem n  Solution n  Related Work n  Conclusion & Future Work
  • 50. Related Work §  Most related work involves semi-automated ontology creation for §  Pure hierarchical ontologies §  Domains that already have some kind of structural description §  Machine learning and NLP techniques used §  On text corpora §  Alongside existing structured or annotated external knowledge base §  The work closest to LexOnt’s §  Find property relationships between concepts §  Use unstructured external knowledge bases
  • 51. Related Work System Corpus Ontology Suggestions External Knowledge Text2Onto annotated Probabilistic Ontology Models None OntoLT rule-based Classes and properties None OntoLearn unstructured Hierarchical classification Definitions, Synonyms LexOnt unstructured Properties Wikipedia, Wordnet, Generated Ontology
  • 52. Conclusion n  LexOnt has shown to be an effective tool for semi-automated ontology creation n  From our initial results, we have determined that using an external knowledge base to filter out generated terms and phrases n  Increases the accuracy of the feature selection n  Helps in understanding the common terms within a corpus
  • 53. LexOnt Publications n  Knarig Arabshian and Peter Danielsen, Ontology-based Faceted Search Interface for APIs (In Journal Submission). n  Peter Danielsen and Knarig Arabshian, User Interface Design in Semi- Automated Ontology Construction, International Conference on Web Services (ICWS 2013), Santa Clara, CA, June 2013. n  Knarig Arabshian, Peter Danielsen and Sadia Afroz, LexOnt: Semi- Automatic ontology Creation Tool for Programmable Web, AAAI 2012 Spring Symposium on Intelligent Web Services Meet Social Computing, Palo Alto, CA, March 2012. n  Knarig Arabshian and Peter Danielsen, Semi-automated Ontology Creation for High-level Service Classification 7th International Conference on Semantics, Knowledge and Grids (SKG 2011), Beijing, China, Oct 2011.