SlideShare a Scribd company logo
1 of 28
Download to read offline
Semantic Faceted Search 
with SemFacet 
Evgeny Kharlamov 
Information Systems Group 
Department of Computer Science 
University of Oxford
Finding Data w/ Keywords is Hard 
§ Keyword search is the paradigm 
to access data on the Web, 
company websites, etc 
§ Limitations of keyword search 
§ Too many docs contain keywords 
§ Meaning is not built in keywords 
§ Becomes the art of 
“finding the best combination” 
§ Limited control on search
How to Improve Search Experience? 
§ Improve the search paradigm 
§ End-user oriented query formulation interfaces 
§ Faceted search 
§ Improve the data model 
§ Semantic Web models 
§ Our proposal: 
§ do both and combine 
§ Faceted search 
§ Semantic Web model
Enhancing Keyword Search with Facets 
§ A facet = control mechanism 
§ Name 
§ Set of values
Enhancing Keyword Search with Facets 
§ A facet = control mechanism 
§ Name 
§ Set of values 
§ Facets in action 
§ Choose a value
Enhancing Keyword Search with Facets 
§ A facet = control mechanism 
§ Name 
§ Set of values 
§ Facets in action 
§ Choose a value 
§ Restrict search result 
§ Advantages of facets 
§ Allow to say what you 
really mean 
§ Give control over 
search
Faceted Search in the Nutshell 
stars 
3-stars 
restaurant 
§ Search over 
one set of items 
§ Items annotated with 
§ Strings 
§ Search result: 
subset of items 
Asian 
Italian 
4-stars 5-stars 
French 
Find 4-star hotels with French restaurants
Faceted Search in the Nutshell 
stars 
3-stars 
restaurant 
§ Search over 
one set of items 
§ Items annotated with 
§ Strings 
§ Search result: 
subset of items 
Asian 
Italian 
4-stars 5-stars 
French 
Find 4-star hotels with French restaurants
Faceted Search in the Nutshell 
stars 
3-stars 
restaurant 
§ Search over 
one set of items 
§ Items annotated with 
§ Strings 
§ Search result: 
subset of items 
Asian 
Italian 
4-stars 5-stars 
French 
Find 4-star hotels with French restaurants
Faceted Search in the Nutshell 
stars 
3-stars 
restaurant 
§ Search over 
one set of items 
§ Items annotated with 
§ Strings 
§ Search result: 
subset of items 
Asian 
Italian 
4-stars 5-stars 
French 
output 
Find 4-star hotels with French restaurants
F-Search is the De Facto Standard
Semantic Web Models 
§ RDF data model 
§ objects annotated with strings and objects 
§ OWL 2 ontologies 
§ structure vocabularies of annotations 
4-stars French 
stars 
restaurant 
type 
walking 
distance to 
French restaurant is a Restaurant that offers French cuisine. 
FrenchRestaurant ⊑ Restaurant ⊓ ∃ offers.FrenchCuisine
Enhancing Search with SW in Practice
Enhancing Search with SW in Practice
Enhancing Search with SW in Practice 
Hello, my name is John Doe. 
I study at the University if Dreams. 
My daughter is Alice.... 
embedding 
semantic 
annotations 
<section itemscope itemtype = "http://dava-vocabulary.org/Person" 
itemid = "http://myitems/john-doe-1234" > 
Hello, my name is 
<span itemprop="name">John Doe</span>. 
I study at the 
<span itemprop="affiliation">University of Dreams</span> 
My daughter is 
<span itemtype = "http://dava-vocabulary.org/children" 
itemid = "http://myitems/alice-doe-5678" > 
Alice </span> 
....
Semantic Web Models 
§ RDF data model 
§ objects annotated with strings and objects 
§ OWL 2 ontologies 
§ structure vocabularies of annotations 
from 2011 to 2012 the fraction of structured data went from 
3.5% to 13%
Semantic Web Models 
§ RDF data model 
§ objects annotated with strings and objects 
§ OWL 2 ontologies 
§ structure vocabularies of annotations 
from 2011 to 2012 the fraction of structured data went from 
3.5% to 13%
How to Improve Search Experience? 
§ Improve the search paradigm 
§ End-user oriented query formulation interfaces 
§ Faceted Search 
§ Improve the data model 
§ Semantic Web models 
§ RDF Data 
§ OWL 2 ontologies 
§ Our proposal: 
§ Semantic Faceted Search that combines 
§ Faceted search 
§ Semantic Web model
Semantic Faceted Search in the Nutshell 
4-stars 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to
Semantic Faceted Search in the Nutshell 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
4-stars 5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to
Semantic Faceted Search in the Nutshell 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
4-stars 5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to
Semantic Faceted Search in the Nutshell 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
4-stars 5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to
Semantic Faceted Search in the Nutshell 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
4-stars 5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to 
output
Semantic Faceted Search in the Nutshell 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
4-stars 5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to 
output
Semantic Faceted Search in the Nutshell 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
4-stars 5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to 
output
Research Contributions 
§ Solid foundation for Semantic F-Search 
§ Projection of ontologies on 
graph data structures 
§ Allows to incorporate ontologies 
into faceted search 
§ Gives better faceted interfaces 
politicians Search 
More Focus 
type 
USpres 
Country 
More Focus 
More Focus 
Remove 
More Focus 
Remove 
§ Generate more facets / Prune irrelevant facets 
§ Scalable algorithms to 
§ generate and update facets from 
§ Data and Ontologies 
§ Algorithms to evaluate faceted queries over semantic data 
§ Exploits bottom up query evaluation 
http://en.wikipedia.org/wiki/Bill_Clinton 
William Jefferson "Bill" Clinton (born William 
Jefferson Blythe III; August 19, 1946) is an 
American politician who served as the 42nd 
President of the United States from 1993 to 
2001. Inaugurated at age 46, he was the third-youngest 
president. He took office at the end 
of the Cold War, and was the first president of 
the baby boomer generation... 
has child 
ANY 
Remove 
Remove 
is graduated from 
Stanford Uni. 
is graduated from 
Stanford Uni. 
Harvard Uni. 
Georgetown Uni.
SemFacet System 
§ Integration of 
§ Keyword search and 
§ Semantic faceted search 
§ Main features 
§ Automatic generation of f-search interfaces 
over RDF data and OWL 2 ontologies 
§ In memory 
§ Online and offline reasoning 
§ Efficient on millions of triples 
§ Flexible configuration 
§ Interchangeable triple stores 
§ RDFOX, PAGOdA, Hermit, Sesame 
§ Configurable answers (snippets) 
§ Support of Or and And facets 
Faceted Query 
Interface 
Answers as 
Snippets 
Presentation 
Layer 
Application 
Layer 
Data 
Layer 
Facet 
Generator 
Query 
Converter 
Snippet 
Generator 
Triple Store: 
Ontology 
Data 
Keyword 
Based Search 
KBS 
Engine 
Inverted Index 
e.g. DBpedia 
Abstracts 
RDFOX, PAGOdA, Hermit, Sesame
SemFacet Team 
§ Marcelo Arenas 
§ Bernardo Cuenca Grau 
§ Evgeny Kharlamov 
§ Sarunas Marciuska 
§ Dmitriy Zheleznyakov

More Related Content

Viewers also liked

Overview of Dan Olteanu's Research presentation
Overview of Dan Olteanu's Research presentationOverview of Dan Olteanu's Research presentation
Overview of Dan Olteanu's Research presentationDBOnto
 
ROSeAnn Presentation
ROSeAnn PresentationROSeAnn Presentation
ROSeAnn PresentationDBOnto
 
Diadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meetingDiadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meetingDBOnto
 
ArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
ArtForm - Dynamic analysis of JavaScript validation in web forms - PosterArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
ArtForm - Dynamic analysis of JavaScript validation in web forms - PosterDBOnto
 
PAGOdA paper
PAGOdA paperPAGOdA paper
PAGOdA paperDBOnto
 
PDQ: Proof-driven Querying presentation
PDQ: Proof-driven Querying presentationPDQ: Proof-driven Querying presentation
PDQ: Proof-driven Querying presentationDBOnto
 
Aggregating Semantic Annotators Paper
Aggregating Semantic Annotators PaperAggregating Semantic Annotators Paper
Aggregating Semantic Annotators PaperDBOnto
 
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...DBOnto
 
PAGOdA poster
PAGOdA posterPAGOdA poster
PAGOdA posterDBOnto
 
PDQ Poster
PDQ PosterPDQ Poster
PDQ PosterDBOnto
 
RDFox Poster
RDFox PosterRDFox Poster
RDFox PosterDBOnto
 
PAGOdA Presentation
PAGOdA PresentationPAGOdA Presentation
PAGOdA PresentationDBOnto
 
Sem facet paper
Sem facet paperSem facet paper
Sem facet paperDBOnto
 
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...DBOnto
 
Parallel Datalog Reasoning in RDFox Presentation
Parallel Datalog Reasoning in RDFox PresentationParallel Datalog Reasoning in RDFox Presentation
Parallel Datalog Reasoning in RDFox PresentationDBOnto
 
Query Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning PaperQuery Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning PaperDBOnto
 

Viewers also liked (16)

Overview of Dan Olteanu's Research presentation
Overview of Dan Olteanu's Research presentationOverview of Dan Olteanu's Research presentation
Overview of Dan Olteanu's Research presentation
 
ROSeAnn Presentation
ROSeAnn PresentationROSeAnn Presentation
ROSeAnn Presentation
 
Diadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meetingDiadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meeting
 
ArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
ArtForm - Dynamic analysis of JavaScript validation in web forms - PosterArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
ArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
 
PAGOdA paper
PAGOdA paperPAGOdA paper
PAGOdA paper
 
PDQ: Proof-driven Querying presentation
PDQ: Proof-driven Querying presentationPDQ: Proof-driven Querying presentation
PDQ: Proof-driven Querying presentation
 
Aggregating Semantic Annotators Paper
Aggregating Semantic Annotators PaperAggregating Semantic Annotators Paper
Aggregating Semantic Annotators Paper
 
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
 
PAGOdA poster
PAGOdA posterPAGOdA poster
PAGOdA poster
 
PDQ Poster
PDQ PosterPDQ Poster
PDQ Poster
 
RDFox Poster
RDFox PosterRDFox Poster
RDFox Poster
 
PAGOdA Presentation
PAGOdA PresentationPAGOdA Presentation
PAGOdA Presentation
 
Sem facet paper
Sem facet paperSem facet paper
Sem facet paper
 
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
 
Parallel Datalog Reasoning in RDFox Presentation
Parallel Datalog Reasoning in RDFox PresentationParallel Datalog Reasoning in RDFox Presentation
Parallel Datalog Reasoning in RDFox Presentation
 
Query Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning PaperQuery Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning Paper
 

Similar to Semantic Faceted Search with SemFacet presentation

Web Search: Tips and techniques from a Librarian
Web Search: Tips and techniques from a LibrarianWeb Search: Tips and techniques from a Librarian
Web Search: Tips and techniques from a Librarianlerichard
 
Best Practices for Enterprise Search
Best Practices for Enterprise SearchBest Practices for Enterprise Search
Best Practices for Enterprise SearchChris Risner
 
Online Research_How to get the best out of internet searches
Online Research_How to get the best out of internet searchesOnline Research_How to get the best out of internet searches
Online Research_How to get the best out of internet searches211 Check
 
Google search and beyond sasta 25 11-2011
Google search and beyond sasta 25 11-2011Google search and beyond sasta 25 11-2011
Google search and beyond sasta 25 11-2011cyberspaced educator
 
SEO for Ecommerce - an overview
SEO for Ecommerce - an overviewSEO for Ecommerce - an overview
SEO for Ecommerce - an overviewErudite
 
Internet Research Presentation
Internet Research PresentationInternet Research Presentation
Internet Research Presentationadeason
 
Martina Welander - Google is a two pagesite
Martina Welander - Google is a two pagesiteMartina Welander - Google is a two pagesite
Martina Welander - Google is a two pagesiteNordicSitecoreConference
 
Getting Ahead of the Curve: Leveraging Local Search in the U.K.
Getting Ahead of the Curve: Leveraging Local Search in the U.K.Getting Ahead of the Curve: Leveraging Local Search in the U.K.
Getting Ahead of the Curve: Leveraging Local Search in the U.K.George Freitag
 
Spiders, farms, and bubbles: how to become an expert internet searcher
Spiders, farms, and bubbles: how to become an expert internet searcherSpiders, farms, and bubbles: how to become an expert internet searcher
Spiders, farms, and bubbles: how to become an expert internet searcherMegan Heuer
 
Northwest Florida Association of Computer User Groups TECH 17 Better Search ...
Northwest Florida Association of  Computer User Groups TECH 17 Better Search ...Northwest Florida Association of  Computer User Groups TECH 17 Better Search ...
Northwest Florida Association of Computer User Groups TECH 17 Better Search ...hewie
 
JAB2012 Smart Search Presentation
JAB2012 Smart Search PresentationJAB2012 Smart Search Presentation
JAB2012 Smart Search PresentationChris Davenport
 
Advanced Search: WebSearch University 2014
Advanced Search: WebSearch University 2014Advanced Search: WebSearch University 2014
Advanced Search: WebSearch University 2014notess
 
Ipe pp slides google talk 2013
Ipe pp slides google talk 2013Ipe pp slides google talk 2013
Ipe pp slides google talk 2013Elizabeth Holmes
 
RESEARCHING YOUR TOPIC_edit.pptx
RESEARCHING YOUR TOPIC_edit.pptxRESEARCHING YOUR TOPIC_edit.pptx
RESEARCHING YOUR TOPIC_edit.pptxShukurat Bello
 

Similar to Semantic Faceted Search with SemFacet presentation (20)

Web Search: Tips and techniques from a Librarian
Web Search: Tips and techniques from a LibrarianWeb Search: Tips and techniques from a Librarian
Web Search: Tips and techniques from a Librarian
 
Best Practices for Enterprise Search
Best Practices for Enterprise SearchBest Practices for Enterprise Search
Best Practices for Enterprise Search
 
Online Research_How to get the best out of internet searches
Online Research_How to get the best out of internet searchesOnline Research_How to get the best out of internet searches
Online Research_How to get the best out of internet searches
 
Carl 2014 slides_gotime
Carl 2014 slides_gotimeCarl 2014 slides_gotime
Carl 2014 slides_gotime
 
Search Analytics - Comperio
Search Analytics - ComperioSearch Analytics - Comperio
Search Analytics - Comperio
 
Google search and beyond sasta 25 11-2011
Google search and beyond sasta 25 11-2011Google search and beyond sasta 25 11-2011
Google search and beyond sasta 25 11-2011
 
SEO for Ecommerce - an overview
SEO for Ecommerce - an overviewSEO for Ecommerce - an overview
SEO for Ecommerce - an overview
 
Audit
AuditAudit
Audit
 
Internet Research Presentation
Internet Research PresentationInternet Research Presentation
Internet Research Presentation
 
Google Is a Two Page Site
Google Is a Two Page SiteGoogle Is a Two Page Site
Google Is a Two Page Site
 
Martina Welander - Google is a two pagesite
Martina Welander - Google is a two pagesiteMartina Welander - Google is a two pagesite
Martina Welander - Google is a two pagesite
 
Getting Ahead of the Curve: Leveraging Local Search in the U.K.
Getting Ahead of the Curve: Leveraging Local Search in the U.K.Getting Ahead of the Curve: Leveraging Local Search in the U.K.
Getting Ahead of the Curve: Leveraging Local Search in the U.K.
 
Spiders, farms, and bubbles: how to become an expert internet searcher
Spiders, farms, and bubbles: how to become an expert internet searcherSpiders, farms, and bubbles: how to become an expert internet searcher
Spiders, farms, and bubbles: how to become an expert internet searcher
 
Northwest Florida Association of Computer User Groups TECH 17 Better Search ...
Northwest Florida Association of  Computer User Groups TECH 17 Better Search ...Northwest Florida Association of  Computer User Groups TECH 17 Better Search ...
Northwest Florida Association of Computer User Groups TECH 17 Better Search ...
 
JAB2012 Smart Search Presentation
JAB2012 Smart Search PresentationJAB2012 Smart Search Presentation
JAB2012 Smart Search Presentation
 
Hotbot ppt
Hotbot pptHotbot ppt
Hotbot ppt
 
Advanced Search: WebSearch University 2014
Advanced Search: WebSearch University 2014Advanced Search: WebSearch University 2014
Advanced Search: WebSearch University 2014
 
Ipe pp slides google talk 2013
Ipe pp slides google talk 2013Ipe pp slides google talk 2013
Ipe pp slides google talk 2013
 
RESEARCHING YOUR TOPIC_edit.pptx
RESEARCHING YOUR TOPIC_edit.pptxRESEARCHING YOUR TOPIC_edit.pptx
RESEARCHING YOUR TOPIC_edit.pptx
 
Identifying Keywords and Searching Techniques
Identifying Keywords and Searching TechniquesIdentifying Keywords and Searching Techniques
Identifying Keywords and Searching Techniques
 

Recently uploaded

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Semantic Faceted Search with SemFacet presentation

  • 1. Semantic Faceted Search with SemFacet Evgeny Kharlamov Information Systems Group Department of Computer Science University of Oxford
  • 2. Finding Data w/ Keywords is Hard § Keyword search is the paradigm to access data on the Web, company websites, etc § Limitations of keyword search § Too many docs contain keywords § Meaning is not built in keywords § Becomes the art of “finding the best combination” § Limited control on search
  • 3. How to Improve Search Experience? § Improve the search paradigm § End-user oriented query formulation interfaces § Faceted search § Improve the data model § Semantic Web models § Our proposal: § do both and combine § Faceted search § Semantic Web model
  • 4. Enhancing Keyword Search with Facets § A facet = control mechanism § Name § Set of values
  • 5. Enhancing Keyword Search with Facets § A facet = control mechanism § Name § Set of values § Facets in action § Choose a value
  • 6. Enhancing Keyword Search with Facets § A facet = control mechanism § Name § Set of values § Facets in action § Choose a value § Restrict search result § Advantages of facets § Allow to say what you really mean § Give control over search
  • 7. Faceted Search in the Nutshell stars 3-stars restaurant § Search over one set of items § Items annotated with § Strings § Search result: subset of items Asian Italian 4-stars 5-stars French Find 4-star hotels with French restaurants
  • 8. Faceted Search in the Nutshell stars 3-stars restaurant § Search over one set of items § Items annotated with § Strings § Search result: subset of items Asian Italian 4-stars 5-stars French Find 4-star hotels with French restaurants
  • 9. Faceted Search in the Nutshell stars 3-stars restaurant § Search over one set of items § Items annotated with § Strings § Search result: subset of items Asian Italian 4-stars 5-stars French Find 4-star hotels with French restaurants
  • 10. Faceted Search in the Nutshell stars 3-stars restaurant § Search over one set of items § Items annotated with § Strings § Search result: subset of items Asian Italian 4-stars 5-stars French output Find 4-star hotels with French restaurants
  • 11. F-Search is the De Facto Standard
  • 12. Semantic Web Models § RDF data model § objects annotated with strings and objects § OWL 2 ontologies § structure vocabularies of annotations 4-stars French stars restaurant type walking distance to French restaurant is a Restaurant that offers French cuisine. FrenchRestaurant ⊑ Restaurant ⊓ ∃ offers.FrenchCuisine
  • 13. Enhancing Search with SW in Practice
  • 14. Enhancing Search with SW in Practice
  • 15. Enhancing Search with SW in Practice Hello, my name is John Doe. I study at the University if Dreams. My daughter is Alice.... embedding semantic annotations <section itemscope itemtype = "http://dava-vocabulary.org/Person" itemid = "http://myitems/john-doe-1234" > Hello, my name is <span itemprop="name">John Doe</span>. I study at the <span itemprop="affiliation">University of Dreams</span> My daughter is <span itemtype = "http://dava-vocabulary.org/children" itemid = "http://myitems/alice-doe-5678" > Alice </span> ....
  • 16. Semantic Web Models § RDF data model § objects annotated with strings and objects § OWL 2 ontologies § structure vocabularies of annotations from 2011 to 2012 the fraction of structured data went from 3.5% to 13%
  • 17. Semantic Web Models § RDF data model § objects annotated with strings and objects § OWL 2 ontologies § structure vocabularies of annotations from 2011 to 2012 the fraction of structured data went from 3.5% to 13%
  • 18. How to Improve Search Experience? § Improve the search paradigm § End-user oriented query formulation interfaces § Faceted Search § Improve the data model § Semantic Web models § RDF Data § OWL 2 ontologies § Our proposal: § Semantic Faceted Search that combines § Faceted search § Semantic Web model
  • 19. Semantic Faceted Search in the Nutshell 4-stars stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to
  • 20. Semantic Faceted Search in the Nutshell stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 4-stars 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to
  • 21. Semantic Faceted Search in the Nutshell stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 4-stars 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to
  • 22. Semantic Faceted Search in the Nutshell stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 4-stars 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to
  • 23. Semantic Faceted Search in the Nutshell stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 4-stars 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to output
  • 24. Semantic Faceted Search in the Nutshell stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 4-stars 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to output
  • 25. Semantic Faceted Search in the Nutshell stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 4-stars 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to output
  • 26. Research Contributions § Solid foundation for Semantic F-Search § Projection of ontologies on graph data structures § Allows to incorporate ontologies into faceted search § Gives better faceted interfaces politicians Search More Focus type USpres Country More Focus More Focus Remove More Focus Remove § Generate more facets / Prune irrelevant facets § Scalable algorithms to § generate and update facets from § Data and Ontologies § Algorithms to evaluate faceted queries over semantic data § Exploits bottom up query evaluation http://en.wikipedia.org/wiki/Bill_Clinton William Jefferson "Bill" Clinton (born William Jefferson Blythe III; August 19, 1946) is an American politician who served as the 42nd President of the United States from 1993 to 2001. Inaugurated at age 46, he was the third-youngest president. He took office at the end of the Cold War, and was the first president of the baby boomer generation... has child ANY Remove Remove is graduated from Stanford Uni. is graduated from Stanford Uni. Harvard Uni. Georgetown Uni.
  • 27. SemFacet System § Integration of § Keyword search and § Semantic faceted search § Main features § Automatic generation of f-search interfaces over RDF data and OWL 2 ontologies § In memory § Online and offline reasoning § Efficient on millions of triples § Flexible configuration § Interchangeable triple stores § RDFOX, PAGOdA, Hermit, Sesame § Configurable answers (snippets) § Support of Or and And facets Faceted Query Interface Answers as Snippets Presentation Layer Application Layer Data Layer Facet Generator Query Converter Snippet Generator Triple Store: Ontology Data Keyword Based Search KBS Engine Inverted Index e.g. DBpedia Abstracts RDFOX, PAGOdA, Hermit, Sesame
  • 28. SemFacet Team § Marcelo Arenas § Bernardo Cuenca Grau § Evgeny Kharlamov § Sarunas Marciuska § Dmitriy Zheleznyakov