3. Evolution of Knowledge Society
Hunter gathering Society Pastoral Society
Agricultural Society
Industrial SocietyInformation SocietyKnowledge Society
4. Background of Classification
● Aristotle may have been the last person to know everything there was to
be known in his own time.
● All philosophy is “a series of footnotes of Pluto”- then all theory and
practice of KR are surely a series of footnotes of Aristotle.
- Hierarchycal structure (taxonomy)
- Dichotomy
- Decachotomy (from thee point of the nature and growth of
the development of knowledge it is unrealistic to bind the universe of
subjects to a dechacotomy, because it growes in different direction and
different stages)
- Polychotomy
(these hierarchical struc. are used in knowedge organisation)
5. Species of Scheme
● Enumerative Scheme (Unidimensional)
- listing or enumerating the subject.
● Faceted Scheme : to organise the multidisciplinery /
interdisciplinary subject.
- subject formation
- phase relation
- Inter subject
- Intra facet
- Intra array
- a multidimensional approach. ( Cockroach – recipie,
cooking, pest , medicine, insect etc.)
6.
7. Person Team Award
-Player - Cricket - Arjuna
-Sachin Tendulkar - Pakistan
Test debute against
Sachin Tendulkar Pakistan Cricket Team
Arjuna award
received
8. ● Web directory use hierarchical structure (like DDC)
Ex. - dmoz, Yahoo directory etc.
● Library mainly deals with – Subjects
Web deals with both - subjects and objects
- faceted metadata
- facet term and facet value
(a homogenious group or category derived according
to the principle of facet analysis – SRR).
9.
10.
11. References
● Kumar, Krishan(2008), Theory of classification,
Vikas Publication,New Delhi.
● Patel, Dimple (2002), Organizing the web: a
faceted approach, DRTC, ISI bangalore.
● http://en.wikipedia.org/wiki/Society
12. TOPICS SPEAKERS
Faceted Approach Sandip Das
Faceted knowledge organisation
framework
Anwesha Bhattacharya
Linking of information: Semantic
annotation
Mohit Garg
Searching the web Samhati Soor
Query processing Manash Kumar
User interface Jayanta Kr. Nayek
Project: FATKS-HUM Manasa Rath
Evaluation Criteria Shiv Shakti Ghosh
Conclusion Tanmoy Mondal
14. Facet
A generic term to denote any component-be it basic
subject or an isolate idea-of a compound subject.
A generic term to denote facet idea, facet term and facet
number.
According to A. G. Taylor “Clearly defined, mutually
exclusive, and collectively exhaustive aspects, properties or
characteristics of a class or a specific subject.”
e.g- Injuries to corn in france in 1978.
16. Faceted Approach
According to Dr. S. R. Ranganathan “faceted
approach involves breaking down of a subject
into a basic subject and isolate ideas.”
e.g-Treatment of Lung Cancer in India during 80's.
17. Enumerative Classification
An enumerative scheme for classification consists
essentially of a single schedule enumerating all
subjects –of the past, the present, and the
anticipatable future.
e.g- French poetry, Italian drama,
18. Pros and Cons of Faceted Classification
Pros -
● Offers flexibility for interdisciplinary subjects.
● Ease of accommodating new concept.
● Citation order is not essential.
● Extra facets can be added.
Cons -
● Complicated class number.
● They are very complex and are difficult to use without
experience.
19. Pros and Cons of Enumerative Classification
Pros-
● Easy to construct class number of compound subjects.
● Generaly accepted and widely used throughout the world
● A fairly short and uncomplicated notation can be used.
● “Notationally” it is easier to display the stucture of the scheme.
Cons-
● Difficult to accommadate new subjects.
● There can be lack of accommodation for even simple subject..
21. Faceted Classifications on the Web
● Tower Records
(http://www.towerrecords.com )
●
Recipes
( http://www.epicurious.com )
● Annotated Wordnet
(http://www.siderean.com/wordnet17.jsp )
● Te Kete Ipurangi: the Online Learning Centre
(http://www.tki.org.nz/e/search)
25. References
● Chowdhury, GC and Chowdhury, Sudatta.(2007), Organizing
Information : from the shelf to the web, facet publisher,
london
● Ranganathan, S.R. (2006), Prolegomena to library
classification, ess ess publications, bangalore
● Kumar, Krishan(2008), Theory of classification, vikas
publication,new delhi
● Kumari, Nitu (2009), Evaluation of facted portal, DRTC, ISI
Bangalore
● S, Suman and karmakar, Debanshu (2002), The role library
classification in organizing the web, DRTC, ISI bangalore
● Patel, Dimple (2002), Organizing the web: a faceted
approach, DRTC, ISI bangalore
30. Problems of Keyword-based search
For answers to be relevant, a user must ask the appropriate
query in order to retrieve the desired information and fulfill the
information need (IN).
For keyword-based search a high number of keywords is
necessary to narrow down the search according to the
information need
The semantic ambiguity of querying languages are:
– Built upon natural language
– The query length
The only solution is query refinement.
31. DEPA
DEPA stands for Discipline, Entity, Property & Action
It is a query refinement method that enables the extraction of the
'DEPA facets' of a user query for search and retrieval purpose
The method uses the general principles of facet analysis
The method provides a user with additional and essential
contextual information, in form of list of new facets
The user can narrow down the search by expanding the original
query with the suggested facets
32. Example
● The content is represented as a string of fundamental categories DEPA (Discipline,
Entity, Property and Action) that are conceptually equivalent to ‘facets’ (Bhattacharya)
e.g “Treating Apple trees for bacterial disease in Trentino” is represented by a
concept description C1 in a context C
The facet repository FR(C) contains:
⟨ C1 : Agriculture; AppleTrees;Disease; Treating ⟩
33. Lacuna of DEPA
● Does not have explicit representation of relations
between entities
● Lacks semantics and is more syntactic
● Works better in traditional library scenario than the
web environment
● Need to be improved by formalizing the description
using classification ontology.
This is achieved in Light weight Ontology.
34. Light weight Ontology
● Lightweight ontologies are ontologies with a tree structure
where each node is associated with a natural language label.
The labels of nodes are organized according to certain
predefined patterns which capture different aspects of meaning,
i.e., facets.
35. Faceted lightweight ontology
Faceted lightweight ontology is a lightweight ontology
whose terms are extracted from a background knowledge
organized in terms of facets. Using facets allows us to build
ontologies and organize them, which in general exploits the
structure and terms of the four basic DEPA categories
37. Advantages
● The main advantage of the faceted approach is that it makes explicit the
logical relations among the concepts and concept groups and removes
the limitations of traditional hierarchies.
● Each time, by providing the context, the faceted approach allows for the
representation of different concepts.
For example, a cow can be described as an animal, as a pet, as a food
item, as a commodity, as a God for a particular community, and so on,
depending on the domain
● Faceted lightweight ontologies have a well defined structure and, as such,
they are easier to create, to share among users, and they also provide more
organized input to faceted search and navigation.
38. A new approach: DERA
How to build high quality and scalable ontologies?
DERA is faceted as it is inspired to the principles and canons of the faceted
approach by Ranganathan
DERA is a KR approach as it models entities of a domain (D) by their entity
classes (E), relations (R) and attributes (A)
It is a more general framework to encode the knowledge about the
entities e.g school, cloth etc. , not limited to documents.
39. Classification Ontologies
Classification ontologies mainly used to describe, classify and search for documents.
In these ontologies, terms denote sets of documents.
Hierarchical BT/NT relations between the terms denote superset/subset relations.
40. Descriptive ontologies in different domains
Ontologies built with the purpose of describing about real world entities.
Hierarchical is-a relations indicate subset relations.
41. Steps in DERA(I)
Step 1: Identification of the atomic concepts
(E) watercourse, stream: a natural body of running water flowing on or under the earth
Step 2: Analysis
a body of water
a flowing body of water
no fixed boundary
confined within a bed and stream banks
larger than a brook
42. Steps in DERA(II)
Step 3: Synthesis.
(E) Body of water
(is-a) Flowing body of water
(is-a) Stream
(is-a) Brook
(is-a) River
(is-a) Still body of water
(is-a) Pond
(is-a) Lake
Step 4: Standardization.
(E) stream, watercourse: a natural body of running water flowing on or under the earth
43. Steps in DERA(III)
Step 5: Ordering
Terms and concepts in the facets are ordered
Step 6: Formalization
Descriptive ontologies are translated into Description Logic formal ontologies, e.g.,:
River Stream⊑
River (Volga)
Length (Volga, 3692)
44. Properties of DERA
DERA facets have explicit semantics and are modeled as descriptive ontologies
DERA facets inherits all the nice properties of the faceted approach, such as robustness
and scalability
It allows:
Very expressive document search by any entity property
Automated reasoning via the formalization into Description Logics ontologies
Modeling of relevant entities of the domain and their E/R/A properties.
45. References
● Agostini A., Madalli, D.P., Prasad, A.R.D.(2011).
Faceted Approach To Diverse Query Processing.
● Giunchiglia et al. (2009b). Giunchiglia, F. Dutta, B.
Maltese, V. (2009). Faceted lightweight ontologies.
● Giunchiglia, F., Dutta, B., Maltese, V.(2013). From
Knowledge Organization to Knowledge
Representation.
47. Semantic Annotation
●
For the Semantic Web, we need information in
a hierarchical structure.
●
Idea is that we attach semantic metadata to the
documents,pointing to concepts.
●
An annotation, is a form of meta-data attached
to a particular section of document content
●
SA is an annotation indicating the presence of a
(semantic) entity in a particular place in a text
48. What SA does
●
Attaches metadata to documents, which makes
them more useful and more easily processable
●
They can then be used for searching and
hyperlinking,categorising, and monitoring
●
Adds value to content of libraries, enabling user
interaction with content.
●
Enhanced capability for cross-referencing and
dynamic document classification
49. Metadata
● Metadata of a document is an important source
of knowledge about various properties of the
document.
● Subject metadata is of vital significance,
expressing the thought content of the
document.
50.
51. ● The usual practice is to list the names of
concepts dealt within the document using
knowledge organisation tools like a thesaurus,
taxonomy,subject heading list, etc
● But in an automatic information processing
environment, the metadata is meant to be
processed by the machine. For machine
processing of subject metadata, concept names
do not give enough indication of their subject
matter.
52. ● The subject metadata is supposed to represent
the specific subject of the document.
● The specific subject of the document is that
division of knowledge whose extension and
intention are equal to those of its thought
content
54. ● In faceted approach to subject analysis, any
compound and complex subjects are made up
of the combination of one or more facets.
● Faceted classification schemes and subject
indexing languages label each concept with a
facet type depending on its role in a given
specific subject.
56. (1) Concept. A generic term to denote a piece of
idea.
(2) Facet. The concepts are grouped under a
class according to a single characteristic. This
group/class is termed a facet. Facet is also
defined as a class having subclasses by
applying a “train of characteristics”
(Ranganathan).
57. ● (3) Category. Denotes a basic group/class of
concepts which can be commonly identified in
all subjects of the universe of knowledge.
Ranganathan proposed these elementary
categories: personality, matter, energy, space
and time.
58. Operations in SDA
Faceted semantic annotation is a process for
preparings subject-propositions and consists
primarily of:-
●
concept identification
●
facet analysis
●
facet coordination
●
RDF representation of the facets
59. Step 1: concept identification
●
In this step the key terms are extracted from the
given sentence. Concepts which are implicit in
the sentence (such as medicine, human body,
disease, etc.) are also identified.
60. ● In medical radiology, determination of depth
dose in Roentgen Rotation Therapy using
Ionization pocket chamber
● Medicine, human body, disease, treatment,
radiation therapy, X-ray therapy, treatment
using rotation technique, determination of depth
dose, using ionized packet chamber.
61. Step 2: facet analysis
●
The process of facet analysis starts with
labelling extracted concepts as facets
according to their characteristics/role in the
specific subject.
●
For ex.“human body” has been labelled as the
entity facet as it is the key component around
which all other concepts, such as “disease” and
“treatment”, are concentrated.
62.
63. Step 3: facet coordination
●
Discipline followed by entity (with or without
modifiers) appropriately interpolated or
extrapolated by action and property (with or
without modifiers) is a logical sequence of the
elements of a basic chain manifesting in a
compound subject-proposition. Any action or
property may have further sets of action(s)
and/or property(ies) directly related to it. Their
positions are always after the action or property
to which they relate. Ex. Document annotation
in POPSI
66. ● The main application of the SDA is for
automatic categorisation of web resources in
web directories. A resource can have more than
one subject proposition.
● This will allow expression of the multiple
dimensions of the specific subject of the
resource. Simultaneously, this will provide an
equal number of access points to the resource
at the time of the user’s query.
67. References
●
Broughton, V. (2004), “Faceted classification: a
tool for subject access in the twenty first
century”,Signum, Vol. 8, pp. 5-18.
●
Broughton, V. (2006), “The need for a faceted
classification as the basis of all methods of
information retrieval”, Aslib Proceedings, Vol.
58 No. 1, pp. 49-72.
●
Cimiano, P. and Handschuh, S. (2003),
“Ontology-based linguistic annotation”,
Proceedings of the ACL 2003 Workshop on
Linguistic Annotation: Getting the Model Right,
Vol. 19, pp. 14-21.
69. Searching Information in Web
There are a number of basic ways
to access information on the
Web:
● Going directly to a site if we
have the address
● Browsing
● Conducting a search using a
Web search engine
● Exploring a subject directory
● Exploring the information
stored in live databases on the
Web, known as the "deep
Web"
● Joining an e-mail discussion
group
70. These attempts broadly fall into two categories:
Webpages in hierarchical categories (taxonomy-
based) - directory structure - human-empowered
directories - Faceted Searching
Yahoo!
Open Directory Project
dmoz
Web Search Engines – Keyword Searching
Excite
AltaVista
Google
71. Indexing in Web
● Human Assisted Indexing/
Manual Indexing
- Human-empowered
directories/ Faceted
Approach
● Automatic Indexing/ Full-
text indexing
- Full-text index Search
Engines
73. Human-assisted Indexing on Web
● Depending on humans
for its listings.
● Submitting a short
description to the
directory for entire site or
writing by Editors
● A search looks for
matches only in the
descriptions submitted.
● Changing our web pages
has no effect on our
listing.
●
Following pre-
cordinating
indexing system,
because here the
coordinatiion is done
before giving the query.
74. Automatic Indexing
CHOOSING ALL WORDS IN A DOCUMENT
ELIMINATING COMMON FUNCTION WORDS
BY CONSULTING A STOP-WORD LIST
COMPUTING THE FREQUENCY OCCURENCE
OF ALL THE WORDS IN EACH DOCUMENT
ASSIGNING MOST
FREQUENTLY OCCURRING TERMS
AS THE INDEX TERMS
75. Automatic Indexing on Web
Three elements are here :
1. Crawler or Spider
2. Index (Post-coordinate Indexing)
3. Search Engine Software
76.
77. Natural Language Search
● Natural language words or keywords
we use everyday.
● Flexible and can be combined in
different ways
● A lot of irrelvant searches
● The search engine has to look
everywhere for that word, whether in
the full text of the document or the title
of it.
●
We get more hits, recall is high, but
results are necessarily better.
● This type of searching deals with
automatic Indexing.
78. Controlled Vocabulary Search
● Preselected
● Used by the indexers of a database or a
catalog to describe a subject so that it can
be easily found.
● We may use a lot of words for the same
topic and that makes searching difficult.
● Helping to bring together under a single
word or phrase, all the material that is
available on a particular topic
● The results are usually more relevant to our
topic, so precision is high.
● The trick is to find the correct subject
headings which is sometimes not so easy.
● This type of search deals with Manual
indexing.
79. Faceted Searching vs Keyword Searching
FACETED KEYWORD
COST EXPENSIVE INEXPENSIVE
TIME MORE LESS
EXTENT OF
INDEXABLE MATTER
SUMMARIZATION COMPLETE TEXT
EXHAUSTIVITY MORE SELECTIVE VERY LESS SELECTIVE
SPECIFICITY GENERIC
TERMINOLOGY
VERY SPECIFIC
TERMINOLOGY
HEADING STRUCTURE MULTI-TERM CONTEXT LIMITED TERM-
COMBONATION
SEARCHING WIDE-RANGE MORE-SOPHISTICATED
VOCABULARY SMALLER LARGER
SURROGATION NOT OFTEN USED FREQUENTLY USED
RECALL LESS MORE
PRECISION MORE LESS
81. References
Indexing The World Wide Web:The Journey So Far
Abhishek Das
Google Inc., USA
Ankit Jain
Google Inc., USA
Towards a Framework for Adaptive FacetedSearch on Twitter
Ilknur Celik, Fabian Abel, Patrick Siehndel
Web Information Systems, Delft University of Technologyfcelik,abelg@tudelft.nl
L3S Research Center, Leibniz University Hannover, Germanysiehndel@l3s.de
Finding Information on the Internet
Session 1:
Searching the World Wide Web
Dr. Hesham Azmi
Program of Information Science
Dept. of Mass Comm.& Information Science
http://www.tourolib.org/services/students/subject-headings
Index and Indexing: Amitabha Chatterjee
83. Query Processing
● The user can make three kind of query
(Alessandro Agostini,Devika P. Madalli and
A.R.D. Prasad. Faceted Approach To Diverse
Query Processing):
– keyword-based
– by focus
– on subject
84. The Facets Repository
● The facets from facet analysis can be use to
build the facet repository available to a user to
refine a query.
● The facets repository is organized around two
main notions of the DEPA paradigm for facet
analysis
– subjetcs and facets
85. Concept terms
● A cluster’s name in representation language is
referred to as concept term.
● Two kinds of semantics are provided to a concept
term:
– an extensional semantics, defined over the documents
in the cluster named by the concept term.
– and an intensional semantics, defined by the unique
position of the concept term in a given ‘focus’.
86. Focused Terms
● A focus consists of an ordered set of related
concept terms.
● A focus is a path of concept terms corresponding to a
path in a given context.
Cx:Fruit
Orange Trentino
Apple
Cx:Fruit
Trentino
Apple
X.doc X.doc
Figure: An example of focus (right)
87. Facet Engine
● A facet engine is that computes the matching
between the focused terms of a input context and
the predefined set of facets stored in the facet
repository for a number of concepts
● The facet engine looks at all keywords generated
for each concept name in a focus for all focuses of
the hierarchy, and browse through the focus from
the root to the leaf to identify what keywords are
DEPA facets stored in facet repository.
88. Keyword-based querying
● The user types one or more keywords in the search box.
● Each keyword is mapped to zero or more concept terms in the
context . It is done using an exact string match of the keyword
to the concept term or one of its alias names, namely, its
focused terms.
● If no concept term and its alias names match any keyword, no
concept description is available to the facet engine, and as a
consequence no facets for query refinement are shown to the
user.
89. Keyword-based querying
● If one concept term or its alias names match
some keywords, then the concept description C
of the concept term is generated and processed
for query expansion.
● The facets that occur in the query expansion
are shown to the user
● When selecting one of the new facets, the user
will narrow down the search by expanding the
original query with the suggested facet.
90. Keyword-based querying
● If multiple concept terms match some
keywords, then the concept description of
each term is generated and processed for
query expansion
● The user is given the option to refine their
query to indicate which concept term, namely,
keyword they meant the most.
91. Querying by Focus
● Sometimes user knows at least something about the
subject to search, and the user’s knowledge comes from
documents stored and polyhierarchically organized in the
user’s document collection.
● In this case, it would always be desiderable for the user
to get better and better understanding of the hidden
content of the query, as it is automatically generated by a
suitable method, so as to discover new facets of the
original query that the user was not aware of before.
93. Querying by Focus
● For example, suppose the query is ‘apple’ as contextualized.
The user clicks on a concept term in a context C , ie. the user
selects a focus in C.
● Alternatively, the user types some keywords as in keyword-
based querying, but in a specific order to mean a focus in C.
● For example, the user may click on (an appropriate graphic-
version of) ‘Apple’ in context or either type keywords ‘fruit’,
‘trentino’, ‘apple’ in this order, as to mean
Cx:Fruit>Trentino>Apple
94. Querying by Focus
● By selecting the facet ‘Fruit’ the user would
narrow down the search space by excluding
all subjects about Apple computer and related
subjects as search results.
● By selecting facet ‘Trentino’ the user would be
able to narrow down the search space by
excluding all subjects about fruits that are not
related toTrentino’s production of apples.
95. Querying on Subject
● ‘Subject’ refers to the topical intent of a query.
● In faceted approach to representation of
documents in collection ,‘subjects’ are broken
down into distinct divisions, the facets of
subject.
● A typical ‘query-on-subject’ is deemed to relate
to a specific subject of a pre-existing faceted
classification.
96. Querying on Subject
For example:
A subject-based query is-”What are the documents on the
effects of nitrogen fertilizers on rice plants?”
The subject of the concept subsumed by this query is one of
possibly many focuses.
– Cx1:Rice plants>nitrogen fertilizers>effects.
– Cx2: Agriculture>rice plants>nitrogen fertilizers>effects.
– Cx3:Agriculture> effects of nitrogen> fertilizers>rice plants.
97. Querying on Subject
● A number of different but equivalent focuses
could exists for a given subject-based query.
● If multiple focuses are computed from the
query’s subject, the user is given the option to
refine the original query to indicate which
focus they meant for the searched subject.
98. Reference
● Agostini A., Madalli, D.P., Prasad, A.R.D.
(2011). Faceted Approach To Diverse Query
Processing.
100. User Interface
● A user interface is the system by which people (users) interact with
a machine.
● The goal of this interaction is effective operation and control of
the machine on the user's end, and feedback from the machine,
which aids the operator in making operational decisions.
● The prototype system provides three interfaces for general users to
search for relevant information:
Browser interface: It allows all current and perspective users to browse and navigate
information
Advanced search interface: Here users can combine their searching through multiple
selection
Basic search interface: It allows entering any number of keywords or phrases. It
supports Boolean queries and present th results with relevance ranking.
101. Faceted Search
● Facets refer to categories used to characterise information items in a collection.
● Faceted search is a technique for accessing information organized according to a
faceted classification system, allowing users to explore a collection of information by
applying multiple filters.
● It allows the user to narrow down search results with multiple filters,so user quickly
see which results are most relevant for him.
● Eg.Suppose a user want to filter a flight on price, number of stops and the total
flight duration and this query he search on Google,it will not give a relevant result but
if he will search on a faceted portal which are related to Flights and hotels like(Home
and Abroad,fare cost,Inside trip etc.) he will find relevant result in a short period.
103. Faceted Navigation
● Faceted navigation allows the user to elaborate a query progressively,
seeing the effect of each choice in one facet on the available choices in
other facets.
● Faceted navigation can also be seen as an alternative for Advanced
search where users can 'search' on the information 'facets' rather than
seen the facet come back in the 'normal' navigation.
● Advantages:
More intuitive : Easy to guess what's behind each door.
Dynamic selection of categories allowed : Supports multiple
perspectives
Systematic advantages : Fewer elements-Its ability to handle
compound search.
104.
105. List of portals on web use faceted approach
● Flameno :http://flamenco.berkeley.edu/index.html
● Wine.com: http://www.wine.com/
● Amazon.com:http://www.amazon.com/
● Ebay.com:http://www.ebay.in/
● Flipkart:http://www.flipkart.com/
● Epicurious.com://www.epicurious.com/
● Etoys.com:http://www.etoys.com/home/index.jsp
106.
107. The Flamenco Search Interface Project
Search Interfaces that Flow
● FLAMENCO stands for FLexible information Access using MEtadata in Novel
COmbinations.
● The Flamenco search interface framework has the primary design goal of allowing users to
move through large information spaces in a flexible manner without feeling lost.
● A key property of the interface is the explicit exposure of category metadata, to guide
the user toward possible choices, and to organize the results of keyword searches.
● The interface uses hierarchical faceted metadata in a manner that allows users to both
refine and expand the current query, while maintaining a consistent representation of the
collection's structure.
● This use of metadata is integrated with free-text search, allowing the user to follow links,
then add search terms, then follow more links, without interrupting the interaction
flow.
108. Flamenco Documentation
Preparing Your Data
● A Flamenco collection is a set of items that are all the same kind (for
example, all items are books, or all items are songs, and so on).
● The metadata about any given item consists of its facet values and attribute
values. The first step in preparing your collection is to decide which
information will be in facets and which will be in attributes.
● Facet values are used to organize items into categories, whereas
attribute values are only displayed with individual items.
● In the sample collection, for instance, prize is a facet indicating the type of
Nobel Prize won, whereas name is an attribute for the name of the
winner. That's because it makes sense to group Nobel Prize winners into
categories by the type of prize, but not by their names.
109. Flamenco Documentation
Preparing Your Data
● For Flamenco to load your collection, the metadata about the collection has to be
provided in tab-delimited files (also known as TSV files, with a ".tsv" extension).
● The TSV files you need to provide are:
●
attrs.tsv: gives the list of attributes. Each line in this file represents one attribute.
● facets.tsv: gives the list of facets. Each line in this file represents one facet.
● items.tsv : gives the IDs and attribute values for all the items. Each line of the file
represents one item.
● facet_terms.tsv (for each facet):gives the tree of category terms in the facet.
● facet_map.tsv (for each facet): assigns items to the category terms for that facet.
Each line in this file has two fields.
● sortkeys.tsv (optional): indicates which facets or attributes are to be used for
sorting result lists.
● text.tsv (optional) : supports the text search feature of Flamenco.
110.
111.
112.
113.
114.
115. Faceted Serch in Wine.com
Selection of Facet in Wine.com:
1. Religion: California
2. Type: Red wine, White, Bubbly
3. Style: Red- light & fruity
4. Price: $20 and below
5. Publications: International wine cellar
120. FATKS-HUM (Background)
●To investigate the feasibility of using FAT to develop a
knowledge structure suitable for the digital environment
●To develop and evaluate a prototype implementation in
collaboration with the Arts and Humanities Data Service
(AHDS) and the Humbul Humanities Hub
121. FAT-HUM
➢serves to test and demonstrate facet analysis in
humanities
➢consists of the three distinct but closely interconnected
classifications of concepts:
-broad classification representing UoK
-more detailed faceted classification tested in two areas of humanities: religion
and visual arts
-classification of generally applicable concepts
122.
123. MODEL of FAT-HUM
●stems from three faceted/analytic synthetic
classificatory schemes
● Bliss Classification scheme
● Universal Decimal Classification
● Broad System Ordering
124. Broad Classification Schedule
✔ broad general knowledge classification containing around 300 classes
✔ serves as a basic structure of disciplines to which faceted classification
for humanities is linked
125. Applicable Concepts
applicable throughout the classification are structured as
'external' vocabulary facets
kept as separate classification schedules called Common
auxiliaries
faceted classification for humanities needs to relate to the
concepts that are not particular to humanities only (e.g. place,
time, persons etc.)
purpose of this research project a test vocabulary compiled from
the BC2 and Universal Decimal Classification was created
126. MICRO (Inner classification
structure)
-FAT-HUM is a faceted classification developed by organizing
concepts of a specific field of knowledge according to the
following fundamental concept categories (facets) :
Thing - Kind - Part - Property - Material - Process - Operation - Patient -
Product - Byproduct- Agent - Place – Time
- facets of products, by product and material, for instance, may
be less relevant for humanities while some others need to be
introduced, such as 'theory and philosophy'
127. Syntax FATHUM's analytico-synthetic
feature
-Citation Order-sequence of facets in a
combined class symbol is not accidental
a) Combining concepts within the same facet
b) Combining concepts between facets
c) Combining concepts between humanities and common
auxiliaries
d)Combining concepts between different disciplines
128. Notation
Notation in the broad classification of knowledge
Notation in the facets of common auxiliaries
(B4932) Critical reviews
Notation in the humanities
590 Religion. Theology
590A Theory and philosophy of religion
Subject Relationships using symbols
Bias phase showed by 420<<590 'Religion for educational
purposes'
Filling order
540+941 stands for Religion and Art
132. Aim
• To know the intended design, implementation,
outputs, outcomes and popularity due to ease
of use.
• To provide information to project designers on
how to improve their design, the extent their
project achieved.
• To know how much a user is benefited because
of faceted portal.
133. Criteria
Content Organization
• It should show a clear and logical structure to
typical users.
• It includes putting critical information near the
top of the site, grouping related elements.
Faceted Navigation
• It should help users locate and link to
destination pages.
• Site maps, feedback on the user’s location
within the site, clickable list of page contents,
glosses should be provided.
134. Faceted Metadata
• For a huge and organized collection metadata
should be provided.
Facet exposure
• For maximum exposure facets should be placed
across the top.
Layout of Labels within Facets
• For facets having large labels column-oriented
layout should be preferred.
135.
136. Faceted Category Interface
• The retrieve result can be selected by keyword
search by pre assigned metadata terms or by a
combination of both.
Facet Hierarchy Navigation
• Faceted portals should provide faceted hierarchy
navigation.
• In this user selects anyone facet and refines within
its hierarchy.
137.
138. Multifaceted Navigation
• Users should be able to filter out large sets of
products or content by a variety of product
attributes like (size, color, feature, price range,
specifications).
Directed Navigation
• Combining information found within structured
fields(product name, size or manufacturer) with
unstructured content(product description).
139.
140.
141. Breadcrumb navigation
• When a user clicks through the site’s hierarchy,
each successive link should be indicated as a
text link.
• When a user ends up with a string of section
and subsection names where he is and where
he had been should be shown exactly.
142.
143. Browser Interface
• Should help users to search after previewing
summary
Advanced Search Interface
• Users should be able to combine their searching
within categories of facets through multiple
selection boxes
Basic Search Interface
• Should allow users to enter any number of
keywords or phrases and should also support
Boolean queries.
144. Customization & Personalization
• Skins of the same layout can be used to
express different themes, fonts, and colors can
be used.
• User should be able to save and develop own
user interface templates.
• Facilities for creating user account, profile
should be provided so that user can organize
his/her information.
147. Present Pattern
User types in a search
Search engine gives back matching results
User reads the results and picks the best one
Users' have to find out relevancy
148. Why Need?
●Users generally do not adopt new search
interfaces
●How to show a lot more information without
overwhelming or confusing
●Most users prefer simplicity unless complexity really
makes a difference
●Small details information are provided
149. Advantages
Lets the user decide how to start, and how to explore
After refinement, categories that are not relevant to the current
results disappear.
Very easy to build up complex queries
Reduces mental work & Provide Simple Error Handling
Suggests alternatives
150. Advantages
Easier to explore the collection
Helps users infer what kinds of things are in the collection
Seamless to add new facets and subcategories
Use the metadata to show where to go next
More flexible than canned hyperlinks
151. “The more successful the Web, the greater the problem of
information and resource discovery”
(Wallis and Burden).
Factors :
Expert
Time and Economy
Service providers
152. Summary
●Conversion: Customers can’t buy what they can’t find
●Efficiency: Good navigation increases productivity
●Confidence: Faceted navigation increases information scent
●Aboutness: Facets show semantic make-up of a collection
●Reduced Uncertainty: Not required to specify precise queries
●Guided Experience: Browsing categories provides a different
experience than keyword search
Fig. 2. For instance, the
term horses denotes documents about horses (animals), while the fact that it is placed under
transportation means indicates that documents about horses are also documents about transportation means.
KR employs descriptive ontologies, i.e., ontologies built at the purpose of describing
and reason about real world entities. In these ontologies, terms denote sets of real world
entities, hierarchical is-a relations provide the backbone structure to these ontologies and
indicate a subset relation, while the individuals include any real world entity. For instance,
the relation horse is-a animal indicates that horses are a subset of all animals. This is called
real world semantics in (Giunchiglia et al. 2009b). Descriptive ontologies provide
knowledge about entities in terms of classes, attributes and relations.
940 THE ARTS: AN OUTLINE
940C Art period in general sense. Art in a certain time
940D Art by place in general sense, art in certain place
940E Agents in art. Ethnic groups. Persons. Tools and equipment
590-RELIGION
590A THEORY AND PHILOSOPHY OF RELIGION
590A1 Schools of theology characterised by various attributes
590A2 Concepts in religion. Religious ideas. Theology
590A3 The Holy. The sacred. The supernatural. Object(s) of religion/worship
590A4 God. Gods (Personalised god(s) as distinct from immanent spirits)
In religion, for instance, the following 9 facets are identified to be suitable:
Thing (entity) - i.e. the main facet of Religions. Faiths
Part
Property
Processes
Operations
Patient
Agent
Time
Theory and philosophy of Religion
2&gt; users looking for information within a specific school, can select ‘Research’ from “purpose’ facet as well as ‘school’ from ‘area’ facet.