SlideShare uma empresa Scribd logo
1 de 17
LIS developed its own very successful
solutions for the classification and
search of documents.
In KO, search is focused on document
properties (e.g. title, author, subject).
KO tends to fail in situations when
users express their needs in terms of
entity properties (e.g., of the author)
Search by document properties
Give me documents with subject “Michelangelo”
and “marble sculpture”

Search by entity properties
Give me documents about marble sculptures
made by a person born in Italy
Subject: Buonarroti, Michelangelo
Subject: sculpture – Renaissance
In indexes the syntax of the subjects is given
by a subject indexing language grammar.

What about the semantics?

•

•
•

Is Michelangelo the Italian artist? When and
where he was born? What are his most
famous works?

Is sculpture the form of art?
Is Renaissance the historical period? When
and where exactly?
Name: David
Class: statue
Author: Michelangelo
Date of creation: 1504
Matter: marble
Height: 4.10 m

Name: Michelangelo
Surname: Buonarroti
Class: artist
Date of birth: 6th March 1475
Date of death: 18th February 1564
KR
search by any entity property

KO
search by
title, author,
subject
Give me documents about any lake
with depth greater than 100 written by Italians
How to build high quality and scalable
descriptive ontologies?
DERA is faceted as it is inspired to the
principles and canons of the faceted
approach by Ranganathan
DERA is a KR approach as it models entities
of a domain (D) by their entity classes
(E), relations (R) and attributes (A)
Step 1: Identification of the atomic concepts
(E) watercourse, stream: a natural body of
running water flowing on or under the earth

Step 2: Analysis
a body of water
a flowing body of water
no fixed boundary
confined within a bed and stream banks
larger than a brook
Step 3: Synthesis.
(E) Body of water
(is-a) Flowing body of water
(is-a) Stream
(is-a) Brook
(is-a) River
(is-a) Still body of water
(is-a) Pond
(is-a) Lake
Step 4: Standardization.

(E) stream, watercourse: a natural body of
running water flowing on or under the earth
Step 5: Ordering
Terms and concepts in the facets are ordered

Step 6: Formalization
Descriptive ontologies are translated into
Description Logic formal ontologies, e.g.,:
River ⊑ Stream
River (Volga)
Length (Volga, 3692)
• DERA facets have explicit semantics and are
modeled as descriptive ontologies
• DERA facets inherits all the nice properties of
the faceted approach, such as robustness
and scalability
• DERA allows for a very expressive document
search by any entity property
• DERA allows for automated reasoning via the
formalization into Description Logics
ontologies
The usefulness of moving from KO to KR
• KO is methodologically very strong, but subjects are
limited in formality and expressiveness as, by
employing classification ontologies, it only supports
queries by document properties.
• KR, by employing descriptive ontologies, supports
queries by any entity property, but it is
methodologically weaker than KO.

We propose the DERA faceted KR approach
• DERA, being faceted, allows the development of high
quality and scalable descriptive ontologies
• DERA, being a KR approach, allows modeling relevant
entities of the domain and their E/R/A properties and
enables automated reasoning.
• It supports a highly expressive search of documents
exploiting entity properties.
From Knowledge Organization to Knowledge Representation (ISKO UK Conference)

Mais conteúdo relacionado

Destaque

Web Servers, Browsers, Server - Browser Interaction, Web Surfing
Web Servers, Browsers, Server - Browser Interaction, Web SurfingWeb Servers, Browsers, Server - Browser Interaction, Web Surfing
Web Servers, Browsers, Server - Browser Interaction, Web Surfing
webhostingguy
 
Search Engine Powerpoint
Search Engine PowerpointSearch Engine Powerpoint
Search Engine Powerpoint
201014161
 

Destaque (17)

The web is the platform. How to set up a web browser for students
The web is the platform. How to set up a web browser for studentsThe web is the platform. How to set up a web browser for students
The web is the platform. How to set up a web browser for students
 
More Browser Basics, Tips & Tricks 3 Draft 8
More Browser Basics, Tips & Tricks 3 Draft 8More Browser Basics, Tips & Tricks 3 Draft 8
More Browser Basics, Tips & Tricks 3 Draft 8
 
Visualizing big data in the browser using spark
Visualizing big data in the browser using sparkVisualizing big data in the browser using spark
Visualizing big data in the browser using spark
 
CIDOC CRM in Practice
CIDOC CRM in PracticeCIDOC CRM in Practice
CIDOC CRM in Practice
 
Web Servers, Browsers, Server - Browser Interaction, Web Surfing
Web Servers, Browsers, Server - Browser Interaction, Web SurfingWeb Servers, Browsers, Server - Browser Interaction, Web Surfing
Web Servers, Browsers, Server - Browser Interaction, Web Surfing
 
Corel Draw Training in Ambala ! Batra Computer Centre
Corel Draw Training in Ambala ! Batra Computer CentreCorel Draw Training in Ambala ! Batra Computer Centre
Corel Draw Training in Ambala ! Batra Computer Centre
 
Off-Page Search Engine Optimization
Off-Page Search Engine OptimizationOff-Page Search Engine Optimization
Off-Page Search Engine Optimization
 
How to Design Websites for People Who Use Search Engines By Shari Thurow
How to Design Websites for People Who Use Search Engines By Shari ThurowHow to Design Websites for People Who Use Search Engines By Shari Thurow
How to Design Websites for People Who Use Search Engines By Shari Thurow
 
Web browser
Web browserWeb browser
Web browser
 
Web Browsers
Web BrowsersWeb Browsers
Web Browsers
 
Web Browsers
Web BrowsersWeb Browsers
Web Browsers
 
Digital Marketing for Medical Technology Companies: Search Engine Optimization
Digital Marketing for Medical Technology Companies: Search Engine OptimizationDigital Marketing for Medical Technology Companies: Search Engine Optimization
Digital Marketing for Medical Technology Companies: Search Engine Optimization
 
Search engines
Search enginesSearch engines
Search engines
 
Web Browser And Search Engine ! Batra Computer Centre
Web Browser And Search Engine ! Batra Computer CentreWeb Browser And Search Engine ! Batra Computer Centre
Web Browser And Search Engine ! Batra Computer Centre
 
Search Engine Powerpoint
Search Engine PowerpointSearch Engine Powerpoint
Search Engine Powerpoint
 
Search Engine Demystified
Search Engine DemystifiedSearch Engine Demystified
Search Engine Demystified
 
Search Engine Optimization (SEO)
Search Engine Optimization (SEO)Search Engine Optimization (SEO)
Search Engine Optimization (SEO)
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 

From Knowledge Organization to Knowledge Representation (ISKO UK Conference)

  • 1.
  • 2. LIS developed its own very successful solutions for the classification and search of documents. In KO, search is focused on document properties (e.g. title, author, subject). KO tends to fail in situations when users express their needs in terms of entity properties (e.g., of the author)
  • 3. Search by document properties Give me documents with subject “Michelangelo” and “marble sculpture” Search by entity properties Give me documents about marble sculptures made by a person born in Italy
  • 4. Subject: Buonarroti, Michelangelo Subject: sculpture – Renaissance In indexes the syntax of the subjects is given by a subject indexing language grammar. What about the semantics? • • • Is Michelangelo the Italian artist? When and where he was born? What are his most famous works? Is sculpture the form of art? Is Renaissance the historical period? When and where exactly?
  • 5. Name: David Class: statue Author: Michelangelo Date of creation: 1504 Matter: marble Height: 4.10 m Name: Michelangelo Surname: Buonarroti Class: artist Date of birth: 6th March 1475 Date of death: 18th February 1564
  • 6. KR search by any entity property KO search by title, author, subject
  • 7.
  • 8.
  • 9.
  • 10. Give me documents about any lake with depth greater than 100 written by Italians
  • 11. How to build high quality and scalable descriptive ontologies? DERA is faceted as it is inspired to the principles and canons of the faceted approach by Ranganathan DERA is a KR approach as it models entities of a domain (D) by their entity classes (E), relations (R) and attributes (A)
  • 12. Step 1: Identification of the atomic concepts (E) watercourse, stream: a natural body of running water flowing on or under the earth Step 2: Analysis a body of water a flowing body of water no fixed boundary confined within a bed and stream banks larger than a brook
  • 13. Step 3: Synthesis. (E) Body of water (is-a) Flowing body of water (is-a) Stream (is-a) Brook (is-a) River (is-a) Still body of water (is-a) Pond (is-a) Lake Step 4: Standardization. (E) stream, watercourse: a natural body of running water flowing on or under the earth
  • 14. Step 5: Ordering Terms and concepts in the facets are ordered Step 6: Formalization Descriptive ontologies are translated into Description Logic formal ontologies, e.g.,: River ⊑ Stream River (Volga) Length (Volga, 3692)
  • 15. • DERA facets have explicit semantics and are modeled as descriptive ontologies • DERA facets inherits all the nice properties of the faceted approach, such as robustness and scalability • DERA allows for a very expressive document search by any entity property • DERA allows for automated reasoning via the formalization into Description Logics ontologies
  • 16. The usefulness of moving from KO to KR • KO is methodologically very strong, but subjects are limited in formality and expressiveness as, by employing classification ontologies, it only supports queries by document properties. • KR, by employing descriptive ontologies, supports queries by any entity property, but it is methodologically weaker than KO. We propose the DERA faceted KR approach • DERA, being faceted, allows the development of high quality and scalable descriptive ontologies • DERA, being a KR approach, allows modeling relevant entities of the domain and their E/R/A properties and enables automated reasoning. • It supports a highly expressive search of documents exploiting entity properties.

Notas do Editor

  1. KO has developed its own solution for the classification and search of documents in libraries and digital archives
  2. However, users may think by entity properties, instead of document properties.
  3. Subjects are informal natural language strings, or more precisely their syntax is often formal (because of the subject indexing language grammar), but their semantics (the meaning) is left implicit.
  4. Thinking in terms of entities, we can describe works and people in terms of entities. This is done in LIS, but informally.One immediate consequence is that it seems there are not systematic ways to reason about these objects.As a matter of fact, information is scattered in many resources.
  5. In fact, documents are just on particular case of entities, with title, author and subject very important properties.
  6. (1) We identify atomic concepts, synonyms and natural language definitions. Each concept is marked as E, R or A.(2) For each concept we identify its characteristics
  7. (3) Facets are built as descriptive ontologies (e.g., we make use of explicit is-a and part-of relations).(4) The preferred term is elected