Realizing the Full Potential of
Taxonomies
Content Strategy Workshops
Vancouver, BC, July 12, 2013
Branka Kosovac, dotWit Consulting
Branka.kosovac@dotwit.com
<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:skos="http://www.w3.org/2004/02/skos/core#">
  <skos:Concept rdf:about="http://www.my.com/#canals">
    <skos:definition>A feature type category for places such as the Erie Canal</skos:definition>
    <skos:prefLabel>canals</skos:prefLabel>
    <skos:altLabel>canal bends</skos:altLabel>
    <skos:altLabel>canalized streams</skos:altLabel>
    <skos:altLabel>ditch mouths</skos:altLabel>
    <skos:altLabel>ditches</skos:altLabel>
    <skos:altLabel>drainage canals</skos:altLabel>
    <skos:broader rdf:resource="http://www.my.com/#hydrographic%20structures"/>
    <skos:related rdf:resource="http://www.my.com/#channels"/>
    <skos:related rdf:resource="http://www.my.com/#transportation%20features"/>
    <skos:related rdf:resource="http://www.my.com/#tunnels"/>
    <skos:scopeNote>Manmade waterway used by watercraft or for drainage, irrigation, mining, or water power</skos:scopeNote>
  </skos:Concept>
</rdf:RDF>
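As a rough illustration of how software might consume a concept record like the one above, the following Python sketch parses a trimmed copy of the SKOS example with the standard library and builds a lookup from every label (preferred or alternative) to the preferred label. The function name `build_label_index` and the trimmed sample are assumptions made for illustration; they are not part of the original slides.

```python
import xml.etree.ElementTree as ET

SKOS_NS = "http://www.w3.org/2004/02/skos/core#"

# Trimmed copy of the SKOS record shown above (illustrative subset).
SKOS_XML = """<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:skos="http://www.w3.org/2004/02/skos/core#">
  <skos:Concept rdf:about="http://www.my.com/#canals">
    <skos:prefLabel>canals</skos:prefLabel>
    <skos:altLabel>canal bends</skos:altLabel>
    <skos:altLabel>ditches</skos:altLabel>
    <skos:altLabel>drainage canals</skos:altLabel>
    <skos:broader rdf:resource="http://www.my.com/#hydrographic%20structures"/>
  </skos:Concept>
</rdf:RDF>"""


def build_label_index(xml_text):
    """Map every preferred and alternative label to its concept's preferred label."""
    root = ET.fromstring(xml_text)
    index = {}
    for concept in root.findall(f"{{{SKOS_NS}}}Concept"):
        pref = concept.findtext(f"{{{SKOS_NS}}}prefLabel")
        if pref is None:
            continue  # skip concepts without a preferred label
        index[pref.lower()] = pref
        for alt in concept.findall(f"{{{SKOS_NS}}}altLabel"):
            index[alt.text.lower()] = pref
    return index


label_index = build_label_index(SKOS_XML)
print(label_index["ditches"])  # canals
```

A search box could use such an index to treat "ditches" and "canals" as the same concept, which is one way alternative labels support retrieval.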
<!-- &food;, &vin; and &xsd; are entity references; they must be declared in the enclosing document's DOCTYPE -->
<owl:Class rdf:ID="Wine">
  <rdfs:subClassOf rdf:resource="&food;PotableLiquid"/>
  <rdfs:subClassOf>
    <owl:Restriction>
      <owl:onProperty rdf:resource="#hasMaker"/>
      <owl:cardinality rdf:datatype="&xsd;nonNegativeInteger">1</owl:cardinality>
    </owl:Restriction>
  </rdfs:subClassOf>
  <rdfs:subClassOf>
    <owl:Restriction>
      <owl:onProperty rdf:resource="#hasMaker"/>
      <owl:allValuesFrom rdf:resource="#Winery"/>
    </owl:Restriction>
  </rdfs:subClassOf>
  <rdfs:subClassOf>
    <owl:Restriction>
      <owl:onProperty rdf:resource="#madeFromGrape"/>
      <owl:minCardinality rdf:datatype="&xsd;nonNegativeInteger">1</owl:minCardinality>
    </owl:Restriction>
  </rdfs:subClassOf>
  <rdfs:subClassOf>
    <owl:Restriction>
      <owl:onProperty rdf:resource="#hasBody"/>
      <owl:cardinality rdf:datatype="&xsd;nonNegativeInteger">1</owl:cardinality>
    </owl:Restriction>
  </rdfs:subClassOf>
  <rdfs:subClassOf>
    <owl:Restriction>
      <owl:onProperty rdf:resource="#hasColor"/>
      <owl:cardinality rdf:datatype="&xsd;nonNegativeInteger">1</owl:cardinality>
    </owl:Restriction>
  </rdfs:subClassOf>
  <rdfs:subClassOf>
    <owl:Restriction>
      <owl:onProperty rdf:resource="#locatedIn"/>
      <owl:someValuesFrom rdf:resource="&vin;Region"/>
    </owl:Restriction>
  </rdfs:subClassOf>
  <rdfs:label xml:lang="en">wine</rdfs:label>
  <rdfs:label xml:lang="fr">vin</rdfs:label>
</owl:Class>
Continuum
from enumerations to ontologies
Enumeration Classification (Scheme)
Subject Headings
Controlled Vocabulary
Semantic Network
Term Base
Light Ontology
Thesaurus
Ontology
Contextual Taxonomy
Enterprise Taxonomy
Business Taxonomy
Tagging Taxonomy
Navigation Taxonomy
Profiling Taxonomy
Uses
• Accessing information
  – Browsing
    • Hierarchy
    • Filtering
    • Cross-navigation
  – Search
    • Full-text search
    • Advanced search
    • Faceted search
• Matching
  – Personalization/Targeting
  – Contextual advertising
  – Contextualization
  – Security
  – Content to person
  – Product to product
  – Person to person…
• Information management
  – Managing access
  – Managing display
  – Managing currency
  – …
• Integration & interoperability
• Analytics & visualization
• Mining & intelligence
• Natural language processing
• Terminology management
• eDiscovery
• …
How
Infrastructure
Taxonomy; Schemas; Mappings; Standards
Magic
Description/tagging, classification/filing, matching, search engine configuration…
Automated, manual, semi-automated
UI
Navigation, search UI, search results, personalized/targeted/contextualized delivery…
Objects
• Documents
• Webpages
• Content components
• Digital assets
• Knowledge assets
• Marketing assets/resources
• Records
• Social content
• Products
• People profiles
• …
Scopes
• Subject domain
• Enterprise
• Intranet
• Website
• World Wide Web
• Catalogue
  – Single channel
  – Multi-channel
• Application
• …
Elements
Categories
• Descriptions
• Scope notes
• Formal definitions (for computer inference)
• Associated vocabulary (for auto-classification)
Labels
• Codes (language independent)
• Preferred
• Alternative
• Synonym rings
• Multilingual
• User-added keywords, hashtags (for social content)
Relationships
• Hierarchy
  – Designed or organic
  – Generic (is a kind of)
  – Partitive (is a part of)
  – Instance of (is an instance of)
• Typed
• Named
• Formally defined
• Equivalence relationships
• Typed associative
• Transitivity
• Reflexivity
• Symmetry
• Those that belong to the emperor
• Embalmed ones
• Those that are trained
• Suckling pigs
• Mermaids (or Sirens)
• Fabulous ones
• Stray dogs
• Those that are included in this classification
• Those that tremble as if they were mad
• Innumerable ones
• Those drawn with a very fine camel hair brush
• Et cetera
• Those that have just broken the flower vase
• Those that, at a distance, resemble flies
Taxonomy of animals in the "Celestial Emporium of Benevolent Knowledge",
from Jorge Luis Borges's essay "The Analytical Language of John Wilkins", 1942
KINGDOM | STRUCTURAL ORGANIZATION | METHOD OF NUTRITION
Monera | small, simple single prokaryotic cell (nucleus is not enclosed by a membrane); some form chains or mats | absorb food and/or photosynthesize
Protista | large, single eukaryotic cell (nucleus is enclosed by a membrane); some form chains or colonies | absorb, ingest, and/or photosynthesize food
Fungi | multicellular filamentous form with specialized eukaryotic cells | absorb food
Plantae | multicellular form with specialized eukaryotic cells; do not have their own means of locomotion | photosynthesize food
Animalia | multicellular form with specialized eukaryotic cells; have their own means of locomotion | ingest food
Definitions of Kingdom categories in the Linnaean Classification of Living Things
Linnaean Classification of Living Things: hierarchy for Homo sapiens
Images taken from: Encyclopaedia Britannica
LEVEL | TAXON | CHARACTERISTICS
KINGDOM | Animalia | eukaryotic cells having cell membrane but lacking a cell wall, multicellular, heterotrophic
PHYLUM | Chordata | animals with a notochord, dorsal nerve cord, and pharyngeal gill slits, which may be vestigial
CLASS | Mammalia | warm-blooded vertebrates with hair and mammary glands which, in females, secrete milk to feed young
ORDER | Primates | collar bone, eyes face forward, grasping hands with fingers, and two types of teeth: incisors and molars
FAMILY | Hominidae | upright posture, large brain, stereoscopic vision, flat face, hands and feet have different specializations
GENUS | Homo | s-curved spine
SPECIES | sapiens (shown alongside habilis and erectus) | high forehead, well-developed chin, skull bones thin
Classification theories
Aristotle’s categories
• Class definitions
• Membership based on shared characteristics: necessary and sufficient conditions
• Strong influence on Western thinking
• Not how the real world works, but what Western audiences expect
Prototype theory
• Categories based on prototypes
• Membership decided based on family
resemblances
Sometimes it’s easy
• when there is a single clear
distinguishing feature
• when there are well established
categories (someone of authority
created them, e.g. state/province,
zodiac sign, blood type, …)
• when you work at a “basic category”
level
• when the collection is not too large
and diverse
• when it’s single use
• when the audience is homogeneous
Sometimes it’s easy
Select (drop-down list): circle, square, triangle
Sometimes a bit less easy
Color
Blue
Red
Yellow
Shape
Circle
Square
Triangle
Size
Small
Medium
Big
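The three facets above can be combined freely, which is what makes a faceted approach more flexible than a single hierarchy. A minimal sketch of faceted filtering follows, assuming a hypothetical in-memory catalogue; `ITEMS`, `facet_filter`, and `facet_counts` are illustrative names, not anything from the slides.

```python
# Hypothetical catalogue: each item carries one value per facet.
ITEMS = [
    {"name": "item-1", "color": "Blue", "shape": "Circle", "size": "Small"},
    {"name": "item-2", "color": "Red", "shape": "Circle", "size": "Big"},
    {"name": "item-3", "color": "Blue", "shape": "Square", "size": "Medium"},
    {"name": "item-4", "color": "Yellow", "shape": "Triangle", "size": "Small"},
]


def facet_filter(items, **selected):
    """Return the items that match every selected facet value."""
    return [
        item for item in items
        if all(item.get(facet) == value for facet, value in selected.items())
    ]


def facet_counts(items, facet):
    """Count how many items fall under each value of one facet."""
    counts = {}
    for item in items:
        counts[item[facet]] = counts.get(item[facet], 0) + 1
    return counts


blue = facet_filter(ITEMS, color="Blue")
print([item["name"] for item in blue])  # ['item-1', 'item-3']
print(facet_counts(blue, "shape"))      # {'Circle': 1, 'Square': 1}
```

The counts per facet value are what a faceted search UI typically shows next to each remaining filter option.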
But what if…
• Your technology does not support
faceted approach or polyhierarchy?
• These are physical objects:
• Table linen you have to put into
your drawer?
• Earrings?
And sometimes…
When it gets complicated
• large and diverse collections
• multiple uses
• diverse user groups
• cultural differences
• cultural/political sensitivities
• no formal agreement/authoritative source
• emerging and volatile domains
• far from “basic categories”
• ….
What to do then?
• There are some general (but not universal) rules
• and some tricks of the trade
• but above all: context, context, context…
– external users vs. internal audience
– human use vs. computer inference
– impact of error
– use scenarios
– display constraints
– supporting technology
– costs…
Categories
• mutually exclusive
• collectively exhaustive
• clear grouping principle
• relevant grouping principle
• homogeneous peer categories
• pre-coordination vs. post-coordination
• compound concepts
(“first aid” vs. “coal extraction”)
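The first two criteria, mutually exclusive and collectively exhaustive, can be checked mechanically once items have been assigned to categories. The sketch below is one possible check under the assumption that assignments are available as plain sets; the function name `check_categories` and the sample data are hypothetical.

```python
def check_categories(items, categories):
    """Report items filed under several categories (overlap) or under none (gap).

    `categories` maps a category name to the set of items assigned to it.
    """
    overlaps = {}
    for item in items:
        homes = sorted(name for name, members in categories.items() if item in members)
        if len(homes) > 1:
            overlaps[item] = homes  # violates "mutually exclusive"
    assigned = set().union(*categories.values()) if categories else set()
    gaps = sorted(set(items) - assigned)  # violates "collectively exhaustive"
    return overlaps, gaps


# Hypothetical sample data.
items = ["apple", "carrot", "tomato", "salt"]
categories = {
    "Fruit": {"apple", "tomato"},
    "Vegetables": {"carrot", "tomato"},
}
overlaps, gaps = check_categories(items, categories)
print(overlaps)  # {'tomato': ['Fruit', 'Vegetables']}
print(gaps)      # ['salt']
```

Whether an overlap is a defect depends on context: in a polyhierarchy or faceted scheme, multiple homes for one item may be intentional.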
Labels
• clear
• unambiguous
• informative
• brief
• suitable for audience
• consistently formatted
• grammatically parallel
• no abbreviations, jargon, concatenation
Hierarchy
• consistent or varied depth?
• defined levels, typed relationships, or organic?
• polyhierarchy?
• lots of top level categories or deep hierarchy?
• transitive or not transitive?
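To make the polyhierarchy and transitivity questions concrete, here is a small sketch that stores direct "broader" links and derives all ancestors by following them transitively. The data and the names `BROADER` and `ancestors` are hypothetical; whether transitive inference is appropriate depends on how the hierarchy was designed.

```python
# Hypothetical "broader" links: each term maps to its direct parents.
# Two parents for one term make the structure a polyhierarchy.
BROADER = {
    "earrings": ["jewellery", "gifts"],
    "jewellery": ["accessories"],
    "gifts": [],
    "accessories": [],
}


def ancestors(term, broader):
    """Collect all direct and indirect parents, treating 'broader' as transitive."""
    seen = set()
    stack = list(broader.get(term, []))
    while stack:
        parent = stack.pop()
        if parent not in seen:
            seen.add(parent)
            stack.extend(broader.get(parent, []))
    return seen


print(sorted(ancestors("earrings", BROADER)))
# ['accessories', 'gifts', 'jewellery']
print(len(BROADER["earrings"]) > 1)  # True: 'earrings' has more than one parent
```

If the hierarchy is not meant to be transitive, only the direct parents in `BROADER` should be used, for example when expanding a search.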
Overall structure
• logical
• consistent
• well-balanced
• extensible
• fit for purpose (scenarios, business goals…)
• ordering logical and consistent
• top levels convey the scope
• no single-child categories
• no Other/Miscellaneous/General
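Two of the rules above, no single-child categories and no Other/Miscellaneous/General, lend themselves to an automated check. The following is a sketch only, assuming the taxonomy is held as a mapping from each category to its children; `lint_taxonomy`, `CATCH_ALL`, and `TREE` are illustrative names.

```python
CATCH_ALL = {"other", "miscellaneous", "general"}


def lint_taxonomy(children):
    """Flag single-child categories and catch-all labels.

    `children` maps each category label to the list of its child labels.
    """
    issues = []
    for parent, kids in children.items():
        if len(kids) == 1:
            issues.append(f"single child: {parent} > {kids[0]}")
    # Gather every label that appears as a parent or as a child.
    labels = set(children)
    for kids in children.values():
        labels.update(kids)
    for label in sorted(labels):
        if label.lower() in CATCH_ALL:
            issues.append(f"catch-all label: {label}")
    return issues


# Hypothetical taxonomy fragment.
TREE = {
    "Products": ["Furniture", "Lighting", "Other"],
    "Furniture": ["Chairs"],
    "Lighting": ["Lamps", "Bulbs"],
}
for issue in lint_taxonomy(TREE):
    print(issue)
# single child: Furniture > Chairs
# catch-all label: Other
```

The remaining criteria (logical, well-balanced, fit for purpose) call for human judgment and are not covered by a check like this.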
Some techniques
• Standardize, but not more than necessary
• Consensus vs. mapping vs. standardized core
and general rules
• Derivative local taxonomies—mix & match
• Scoped labels and/or relationships
• If future use is not known, follow general rules,
define and document as much as possible
How to begin
• make sure you know what your taxonomy needs to do, now
and in the future
– user research, business requirements, vision, scenarios
• make sure you know all the constraints
– tools, costs (including long-term maintenance), available expertise,
organizational culture…
• promote and obtain high-level management support
• gather sources:
– user warrant (search logs, social content, user research/feedback logs)
– content warrant (your content, global content, your competitors’…)
– existing metadata, folksonomies, glossaries, formal or informal
taxonomies…
– publicly available taxonomies—reuse, adapt, start from scratch
(e.g. Linked Data, Taxonomy Warehouse)
How to develop
• Combination of:
– Top down (domain modelling)
– Bottom up (terminology clustering, open card sort)
• Design & Strategy
– Metadata element set, associated facets/branches
– Category/term properties, relationship types, hierarchy levels…
– Sustainable maintenance strategy
– Metrics
– Roadmap
• Development
– Know where to stop
• Validation & Testing
– Throughout development and beyond
How to complete
• Documentation
– Scope
– Design
– Maintenance guidelines
– Implementation guidance
– Use guidelines
• Deployment
– Work with developers, UX designers, and taggers, and don’t give up until
the taxonomy is properly implemented
• Governance
– Roles and responsibilities
– Procedures
Exercises
• Exercise groups/topics
• Exercise tasks
– Describe vision (add context details as needed)
– Develop domain model
– High-level taxonomy design and strategy
– Develop key facet
– Record your considerations, sources, thought
process
Ask Me Anything
Branka.kosovac@dotwit.com

Mais conteúdo relacionado

Semelhante a Realizing the Full Potential of Taxonomies by Branka Kosovac

Knowledge engineering and the Web
Knowledge engineering and the WebKnowledge engineering and the Web
Knowledge engineering and the WebGuus Schreiber
 
Free the Patterns! The Vital Challenge to the Pattern Community
Free the Patterns! The Vital Challenge to the Pattern CommunityFree the Patterns! The Vital Challenge to the Pattern Community
Free the Patterns! The Vital Challenge to the Pattern CommunityDouglas Schuler
 
Ontology dojo presentation eia 18 workshop take away
Ontology dojo presentation eia 18 workshop take awayOntology dojo presentation eia 18 workshop take away
Ontology dojo presentation eia 18 workshop take awayRen Pope
 
Know Your Library And Become Information Literate 2
Know Your Library And Become Information Literate 2Know Your Library And Become Information Literate 2
Know Your Library And Become Information Literate 23nrico
 
Taxonomy, ontology, folksonomies & SKOS.
Taxonomy, ontology, folksonomies & SKOS.Taxonomy, ontology, folksonomies & SKOS.
Taxonomy, ontology, folksonomies & SKOS.Janet Leu
 
Finding library resources soci 3680
Finding library resources soci 3680Finding library resources soci 3680
Finding library resources soci 3680ljackso2
 
Threshold concepts in higher ed (STLHE2015)
Threshold concepts in higher ed (STLHE2015)Threshold concepts in higher ed (STLHE2015)
Threshold concepts in higher ed (STLHE2015)Ashley Shaw
 
Taxo for km chicago 20121009
Taxo for km chicago 20121009Taxo for km chicago 20121009
Taxo for km chicago 20121009KM Chicago
 
Taxonomies & folksonomies
Taxonomies  & folksonomiesTaxonomies  & folksonomies
Taxonomies & folksonomiesAparna Sane
 
VOA Learning English with Dr. Jill on Academic English
VOA Learning English with Dr. Jill on Academic EnglishVOA Learning English with Dr. Jill on Academic English
VOA Learning English with Dr. Jill on Academic EnglishJill Robbins
 
Library Language: Vocabulary for the Modern Librarian
Library Language: Vocabulary for the Modern LibrarianLibrary Language: Vocabulary for the Modern Librarian
Library Language: Vocabulary for the Modern LibrarianLibraries Thriving
 
Bio 150 Information Sources in Biology
Bio 150 Information Sources in BiologyBio 150 Information Sources in Biology
Bio 150 Information Sources in BiologyAlyssa Young
 
Scholarly Skills for Classics & Ancient History undergraduates
Scholarly Skills for Classics & Ancient History undergraduatesScholarly Skills for Classics & Ancient History undergraduates
Scholarly Skills for Classics & Ancient History undergraduatesRichard Holmes
 

Semelhante a Realizing the Full Potential of Taxonomies by Branka Kosovac (20)

Knowledge engineering and the Web
Knowledge engineering and the WebKnowledge engineering and the Web
Knowledge engineering and the Web
 
Free the Patterns! The Vital Challenge to the Pattern Community
Free the Patterns! The Vital Challenge to the Pattern CommunityFree the Patterns! The Vital Challenge to the Pattern Community
Free the Patterns! The Vital Challenge to the Pattern Community
 
Ontology dojo presentation eia 18 workshop take away
Ontology dojo presentation eia 18 workshop take awayOntology dojo presentation eia 18 workshop take away
Ontology dojo presentation eia 18 workshop take away
 
Know Your Library And Become Information Literate 2
Know Your Library And Become Information Literate 2Know Your Library And Become Information Literate 2
Know Your Library And Become Information Literate 2
 
Taxonomy, ontology, folksonomies & SKOS.
Taxonomy, ontology, folksonomies & SKOS.Taxonomy, ontology, folksonomies & SKOS.
Taxonomy, ontology, folksonomies & SKOS.
 
Folksonomies & social tagging
Folksonomies & social taggingFolksonomies & social tagging
Folksonomies & social tagging
 
Finding library resources soci 3680
Finding library resources soci 3680Finding library resources soci 3680
Finding library resources soci 3680
 
Threshold concepts in higher ed (STLHE2015)
Threshold concepts in higher ed (STLHE2015)Threshold concepts in higher ed (STLHE2015)
Threshold concepts in higher ed (STLHE2015)
 
Ontologies Fmi 042010
Ontologies Fmi 042010Ontologies Fmi 042010
Ontologies Fmi 042010
 
Metadata
MetadataMetadata
Metadata
 
Taxo for km chicago 20121009
Taxo for km chicago 20121009Taxo for km chicago 20121009
Taxo for km chicago 20121009
 
Taxonomies & folksonomies
Taxonomies  & folksonomiesTaxonomies  & folksonomies
Taxonomies & folksonomies
 
VOA Learning English with Dr. Jill on Academic English
VOA Learning English with Dr. Jill on Academic EnglishVOA Learning English with Dr. Jill on Academic English
VOA Learning English with Dr. Jill on Academic English
 
Information Literacy Award - English
Information Literacy Award - EnglishInformation Literacy Award - English
Information Literacy Award - English
 
Library Language: Vocabulary for the Modern Librarian
Library Language: Vocabulary for the Modern LibrarianLibrary Language: Vocabulary for the Modern Librarian
Library Language: Vocabulary for the Modern Librarian
 
Data Mining Dissertations and Adventures and Experiences in the World of Chem...
Data Mining Dissertations and Adventures and Experiences in the World of Chem...Data Mining Dissertations and Adventures and Experiences in the World of Chem...
Data Mining Dissertations and Adventures and Experiences in the World of Chem...
 
Bio 150 Information Sources in Biology
Bio 150 Information Sources in BiologyBio 150 Information Sources in Biology
Bio 150 Information Sources in Biology
 
PSLD 602-606
PSLD 602-606PSLD 602-606
PSLD 602-606
 
Information Literacy Award - Drama, Theatre & Dance
Information Literacy Award - Drama, Theatre & DanceInformation Literacy Award - Drama, Theatre & Dance
Information Literacy Award - Drama, Theatre & Dance
 
Scholarly Skills for Classics & Ancient History undergraduates
Scholarly Skills for Classics & Ancient History undergraduatesScholarly Skills for Classics & Ancient History undergraduates
Scholarly Skills for Classics & Ancient History undergraduates
 

Mais de Content Strategy Workshops

Personalization, Customer Journey, Omnichannel: A How-to Approach with Kevin ...
Personalization, Customer Journey, Omnichannel: A How-to Approach with Kevin ...Personalization, Customer Journey, Omnichannel: A How-to Approach with Kevin ...
Personalization, Customer Journey, Omnichannel: A How-to Approach with Kevin ...Content Strategy Workshops
 
How to Future-proof Your Content by Sarah Beckley
How to Future-proof Your Content by Sarah BeckleyHow to Future-proof Your Content by Sarah Beckley
How to Future-proof Your Content by Sarah BeckleyContent Strategy Workshops
 
Leveraging Social Content for Business Value by Selma Zafar
Leveraging Social Content for Business Value by Selma ZafarLeveraging Social Content for Business Value by Selma Zafar
Leveraging Social Content for Business Value by Selma ZafarContent Strategy Workshops
 
Content Typing, Flows, Models by Rahel Anne Bailie
Content Typing, Flows, Models by Rahel Anne BailieContent Typing, Flows, Models by Rahel Anne Bailie
Content Typing, Flows, Models by Rahel Anne BailieContent Strategy Workshops
 
Global Content Strategy: Preparing the Content Banquet by James V. Romano
Global Content Strategy: Preparing the Content Banquet by James V. RomanoGlobal Content Strategy: Preparing the Content Banquet by James V. Romano
Global Content Strategy: Preparing the Content Banquet by James V. RomanoContent Strategy Workshops
 
The City is not a Site Map (with Apologies to Christoper Alexander) by Gordon...
The City is not a Site Map (with Apologies to Christoper Alexander) by Gordon...The City is not a Site Map (with Apologies to Christoper Alexander) by Gordon...
The City is not a Site Map (with Apologies to Christoper Alexander) by Gordon...Content Strategy Workshops
 
Inventory to Insight to Action with Paula Land
Inventory to Insight to Action with Paula LandInventory to Insight to Action with Paula Land
Inventory to Insight to Action with Paula LandContent Strategy Workshops
 

Mais de Content Strategy Workshops (7)

Personalization, Customer Journey, Omnichannel: A How-to Approach with Kevin ...
Personalization, Customer Journey, Omnichannel: A How-to Approach with Kevin ...Personalization, Customer Journey, Omnichannel: A How-to Approach with Kevin ...
Personalization, Customer Journey, Omnichannel: A How-to Approach with Kevin ...
 
How to Future-proof Your Content by Sarah Beckley
How to Future-proof Your Content by Sarah BeckleyHow to Future-proof Your Content by Sarah Beckley
How to Future-proof Your Content by Sarah Beckley
 
Leveraging Social Content for Business Value by Selma Zafar
Leveraging Social Content for Business Value by Selma ZafarLeveraging Social Content for Business Value by Selma Zafar
Leveraging Social Content for Business Value by Selma Zafar
 
Content Typing, Flows, Models by Rahel Anne Bailie
Content Typing, Flows, Models by Rahel Anne BailieContent Typing, Flows, Models by Rahel Anne Bailie
Content Typing, Flows, Models by Rahel Anne Bailie
 
Global Content Strategy: Preparing the Content Banquet by James V. Romano
Global Content Strategy: Preparing the Content Banquet by James V. RomanoGlobal Content Strategy: Preparing the Content Banquet by James V. Romano
Global Content Strategy: Preparing the Content Banquet by James V. Romano
 
The City is not a Site Map (with Apologies to Christoper Alexander) by Gordon...
The City is not a Site Map (with Apologies to Christoper Alexander) by Gordon...The City is not a Site Map (with Apologies to Christoper Alexander) by Gordon...
The City is not a Site Map (with Apologies to Christoper Alexander) by Gordon...
 
Inventory to Insight to Action with Paula Land
Inventory to Insight to Action with Paula LandInventory to Insight to Action with Paula Land
Inventory to Insight to Action with Paula Land
 

Último

Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 

Último (20)

Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 

Realizing the Full Potential of Taxonomies by Branka Kosovac

  • 1. Realizing the Full Potential of Taxonomies Content Strategy Workshops Vancouver, BC, July 12, 2013 Branka Kosovac, dotWit Consulting Branka.kosovac@dotwit.com
  • 6. <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:skos="http://www.w3.org/2004/02/skos/core#"> <skos:Concept rdf:about="http://www.my.com/#canals"> <skos:definition>A feature type category for places such as the Erie Canal</skos:definition> <skos:prefLabel>canals</skos:prefLabel> <skos:altLabel>canal bends</skos:altLabel> <skos:altLabel>canalized streams</skos:altLabel> <skos:altLabel>ditch mouths</skos:altLabel> <skos:altLabel>ditches</skos:altLabel> <skos:altLabel>drainage canals</skos:altLabel> <skos:broader rdf:resource="http://www.my.com/#hydrographic%20structures"/> <skos:related rdf:resource="http://www.my.com/#channels"/> <skos:related rdf:resource="http://www.my.com/#transportation%20features"/> <skos:related rdf:resource="http://www.my.com/#tunnels"/> <skos:scopeNote>Manmade waterway used by watercraft or for drainage, irrigation, mining, or water power</skos:scopeNote> </skos:Concept> </rdf:RDF> 14
  • 7. <owl:Class rdf:ID="Wine">
         <rdfs:subClassOf rdf:resource="&food;PotableLiquid"/>
         <rdfs:subClassOf>
           <owl:Restriction>
             <owl:onProperty rdf:resource="#hasMaker" />
             <owl:cardinality rdf:datatype="&xsd;nonNegativeInteger">1</owl:cardinality>
           </owl:Restriction>
         </rdfs:subClassOf>
         <rdfs:subClassOf>
           <owl:Restriction>
             <owl:onProperty rdf:resource="#hasMaker" />
             <owl:allValuesFrom rdf:resource="#Winery" />
           </owl:Restriction>
         </rdfs:subClassOf>
         <rdfs:subClassOf>
           <owl:Restriction>
             <owl:onProperty rdf:resource="#madeFromGrape" />
             <owl:minCardinality rdf:datatype="&xsd;nonNegativeInteger">1</owl:minCardinality>
           </owl:Restriction>
         </rdfs:subClassOf>
         <rdfs:subClassOf>
           <owl:Restriction>
             <owl:onProperty rdf:resource="#hasBody" />
             <owl:cardinality rdf:datatype="&xsd;nonNegativeInteger">1</owl:cardinality>
           </owl:Restriction>
         </rdfs:subClassOf>
         <rdfs:subClassOf>
           <owl:Restriction>
             <owl:onProperty rdf:resource="#hasColor" />
             <owl:cardinality rdf:datatype="&xsd;nonNegativeInteger">1</owl:cardinality>
           </owl:Restriction>
         </rdfs:subClassOf>
         <rdfs:subClassOf>
           <owl:Restriction>
             <owl:onProperty rdf:resource="#locatedIn"/>
             <owl:someValuesFrom rdf:resource="&vin;Region"/>
           </owl:Restriction>
         </rdfs:subClassOf>
         <rdfs:label xml:lang="en">wine</rdfs:label>
         <rdfs:label xml:lang="fr">vin</rdfs:label>
       </owl:Class>
  • 8. Continuum from enumerations to ontologies • Enumeration • Classification (Scheme) • Subject Headings • Controlled Vocabulary • Semantic Network • Term Base • Light Ontology • Thesaurus • Ontology • Contextual Taxonomy • Enterprise Taxonomy • Business Taxonomy • Tagging Taxonomy • Navigation Taxonomy • Profiling Taxonomy
  • 9. Uses • Accessing information – Browsing • Hierarchy • Filtering • Cross-navigation – Search • Full-text search • Advanced search • Faceted search • Matching – Personalization/Targeting – Contextual advertising – Contextualization – Security – Content to person – Product to product – Person to person…. • Information management – Managing access – Managing display – Managing currency – … • Integration & interoperability • Analytics & visualization • Mining & intelligence • Natural language processing • Terminology management • eDiscovery • ….
  • 10. How • Infrastructure: taxonomy; schemas; mappings; standards • Magic: description/tagging, classification/filing, matching, search engine configuration… (automated, manual, semi-automated) • UI: navigation, search UI, search results, personalized/targeted/contextualized delivery…
  • 11. Objects • Documents • Webpages • Content components • Digital assets • Knowledge assets • Marketing assets/resources • Records • Social content • Products • People profiles • … Scopes • Subject domain • Enterprise • Intranet • Website • World Wide Web • Catalogue – Single channel – Multi-channel • Application • …
  • 12. Elements Categories Labels Relationships Descriptions Codes (language independent) Hierarchy Designed organic Scope notes Preferred Typed Named Formally defined Formal definitions (for computer inference) Alternative Synonym rings Equivalence relationships Generic (is a kind of) Partitive (is a part of) Instance of (is an instance of) Typed Associative Multilingual Transitivity Reflexivity Symmetry Associated vocabulary (for auto-classification) user-added keywords, hashtags (for social content)
  • 13. Taxonomy of Animals in the Celestial Emporium of Benevolent Knowledge, from Jorge Luis Borges's essay "The Analytical Language of John Wilkins", 1942 • Those that belong to the emperor • Embalmed ones • Those that are trained • Suckling pigs • Mermaids (or Sirens) • Fabulous ones • Stray dogs • Those that are included in this classification • Those that tremble as if they were mad • Innumerable ones • Those drawn with a very fine camel hair brush • Et cetera • Those that have just broken the flower vase • Those that, at a distance, resemble flies
  • 14. Definitions of Kingdom categories in the Linnaean Classification of Living Things
      – Monera. Structural organization: small, simple single prokaryotic cell (nucleus is not enclosed by a membrane); some form chains or mats. Method of nutrition: absorb food and/or photosynthesize
      – Protista. Structural organization: large, single eukaryotic cell (nucleus is enclosed by a membrane); some form chains or colonies. Method of nutrition: absorb, ingest, and/or photosynthesize food
      – Fungi. Structural organization: multicellular filamentous form with specialized eukaryotic cells. Method of nutrition: absorb food
      – Plantae. Structural organization: multicellular form with specialized eukaryotic cells; do not have their own means of locomotion. Method of nutrition: photosynthesize food
      – Animalia. Structural organization: multicellular form with specialized eukaryotic cells; have their own means of locomotion. Method of nutrition: ingest food
  • 15. Linnaean Classification of Living Things: hierarchy for Homo sapiens (images taken from Encyclopaedia Britannica)
      – Kingdom Animalia: eukaryotic cells having cell membrane but lacking a cell wall, multicellular, heterotrophic
      – Phylum Chordata: animals with a notochord, dorsal nerve cord, and pharyngeal gill slits, which may be vestigial
      – Class Mammalia: warm-blooded vertebrates with hair and mammary glands which, in females, secrete milk to feed young
      – Order Primates: collar bone, eyes face forward, grasping hands with fingers, and two types of teeth: incisors and molars
      – Family Hominidae: upright posture, large brain, stereoscopic vision, flat face, hands and feet have different specializations
      – Genus Homo: s-curved spine
      – Species sapiens (shown alongside habilis and erectus): high forehead, well-developed chin, skull bones thin
  • 16. Classification theories Aristotle’s categories • Class definitions • Membership based on shared characteristics (necessary and sufficient conditions) • Strong influence on Western thinking • Not how the real world works, but it is what Western audiences expect Prototype theory • Categories based on prototypes • Membership decided based on family resemblances
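The contrast between the two theories can be sketched in a few lines of Python. This is an illustration only: the feature sets, the "bird" example, and the use of Jaccard overlap as a stand-in for family resemblance are all assumptions made here, not something taken from the slides.

```python
# Hypothetical feature sets, invented for illustration.
BIRD_DEFINITION = {"has feathers", "lays eggs", "flies"}  # necessary and sufficient
BIRD_PROTOTYPE = {"has feathers", "lays eggs", "flies", "sings", "small"}


def classical_member(features, definition):
    """Aristotelian test: a member must show every defining feature."""
    return definition <= features


def resemblance(features, prototype):
    """Graded similarity to the prototype (Jaccard overlap, an assumed measure)."""
    return len(features & prototype) / len(features | prototype)


robin = {"has feathers", "lays eggs", "flies", "sings", "small"}
penguin = {"has feathers", "lays eggs", "swims"}

for name, features in [("robin", robin), ("penguin", penguin)]:
    print(name,
          classical_member(features, BIRD_DEFINITION),
          round(resemblance(features, BIRD_PROTOTYPE), 2))
```

Under the classical test the penguin is simply excluded, while the prototype view treats it as a less typical member with a lower score.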
  • 18. Sometimes it’s easy • when there is a single clear distinguishing feature • when there are well-established categories (someone with authority created them, e.g. state/province, zodiac sign, blood type, …) • when you work at a “basic category” level • when the collection is not too large and diverse • when the taxonomy has a single use • when the audience is homogeneous. Example: a select list offering circle, square, triangle
  • 19. Sometimes a bit less easy
  • 20. Sometimes a bit less easy • Color: Blue, Red, Yellow • Shape: Circle, Square, Triangle • Size: Small, Medium, Big. But what if… • Your technology does not support a faceted approach or polyhierarchy? • These are physical objects: – Table linen you have to put into your drawer? – Earrings?
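A minimal sketch of the faceted approach behind the Color/Shape/Size example, assuming each object carries exactly one value per facet. The item names and the helpers `filter_by_facets` and `facet_counts` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    name: str
    color: str
    shape: str
    size: str


# Invented sample collection using the slide's facet values.
ITEMS = [
    Item("button", "Blue", "Circle", "Small"),
    Item("tile", "Red", "Square", "Medium"),
    Item("sign", "Yellow", "Triangle", "Big"),
    Item("plate", "Blue", "Circle", "Big"),
]


def filter_by_facets(items, **selected):
    """Keep items that match every selected facet value (logical AND)."""
    return [
        item for item in items
        if all(getattr(item, facet) == value for facet, value in selected.items())
    ]


def facet_counts(items, facet):
    """Count how many items carry each value of one facet."""
    counts = {}
    for item in items:
        value = getattr(item, facet)
        counts[value] = counts.get(value, 0) + 1
    return counts


blue_circles = filter_by_facets(ITEMS, color="Blue", shape="Circle")
print([item.name for item in blue_circles])
print(facet_counts(blue_circles, "size"))
```

Because the facets are independent, no single hierarchy has to decide whether color, shape, or size comes first. A physical drawer does force that choice, which is the slide's point.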
  • 22. When it gets complicated • large and diverse collections • multiple uses • diverse user groups • cultural differences • cultural/political sensitivities • no formal agreement/authoritative source • emerging and volatile domains • far from “basic categories” • ….
  • 23. What to do then? • There are some general (but not universal) rules • and some tricks of the trade • but above all: context, context, context… – external users vs. internal audience – human use vs. computer inference – impact of error – use scenarios – display constraints – supporting technology – costs…
  • 24. Categories • mutually exclusive • collectively exhaustive • clear grouping principle • relevant grouping principle • homogeneous peer categories • pre-coordination vs. post-coordination • compound concepts (“first aid” vs. “coal extraction”)
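The first two rules (mutually exclusive, collectively exhaustive) can be checked mechanically once category membership is expressed as a test. A rough sketch follows; the age bands are a hypothetical example with a deliberate overlap and a deliberate gap, and `check_partition` is an illustrative name.

```python
def check_partition(items, categories):
    """Report items that fall into several categories (overlap) or none (gap).

    categories maps a category label to a membership predicate.
    """
    overlaps, gaps = {}, []
    for item in items:
        hits = [label for label, belongs in categories.items() if belongs(item)]
        if len(hits) > 1:
            overlaps[item] = hits   # violates mutual exclusivity
        elif not hits:
            gaps.append(item)       # violates collective exhaustiveness
    return overlaps, gaps


# Hypothetical categories: 40 belongs to two bands, and nothing covers 66+.
AGE_BANDS = {
    "Under 18": lambda age: age < 18,
    "18 to 40": lambda age: 18 <= age <= 40,
    "40 to 65": lambda age: 40 <= age <= 65,
}

overlaps, gaps = check_partition([10, 18, 40, 65, 70], AGE_BANDS)
print(overlaps)
print(gaps)
```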
  • 25. Labels • clear • unambiguous • informative • brief • suitable for audience • consistently formatted • grammatically parallel • no abbreviations, jargon, concatenation
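Some of these label rules lend themselves to a simple automated check. The sketch below is one possible reading of them: the three-word limit for "brief", the capital-first convention for "consistently formatted", and the patterns used to detect concatenation and abbreviations are all assumptions chosen for illustration.

```python
import re

MAX_WORDS = 3  # assumed threshold for "brief"


def lint_labels(labels):
    """Return (label, problem) pairs for a list of sibling labels."""
    problems = []
    seen = set()
    for label in labels:
        key = label.casefold().strip()
        if key in seen:
            problems.append((label, "duplicate label"))
        seen.add(key)
        if len(label.split()) > MAX_WORDS:
            problems.append((label, "not brief"))
        # "&", "/" or the word "and" suggests two concepts in one label.
        if re.search(r"[&/]|\band\b", label):
            problems.append((label, "concatenated concepts"))
        # A run of two or more capitals is treated as a possible abbreviation.
        if re.search(r"\b[A-Z]{2,}\b", label):
            problems.append((label, "possible abbreviation"))
        if not label[:1].isupper():
            problems.append((label, "does not start with a capital"))
    return problems


LABELS = [
    "Travel",
    "travel",
    "HR policies",
    "Forms and templates",
    "Expense claims for international travel",
]
for label, problem in lint_labels(LABELS):
    print(f"{label}: {problem}")
```

Rules such as "clear", "informative", and "suitable for audience" still need human judgement and user testing.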
  • 26. Hierarchy • consistent or varied depth? • defined levels, typed relationships, or organic? • polyhierarchy? • lots of top level categories or deep hierarchy? • transitive or not transitive?
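Several of these questions become concrete once the hierarchy is stored as data. The sketch below assumes an acyclic "broader" mapping in which a term may have more than one parent (polyhierarchy). The sample terms reuse labels from the canals record, but the parent links shown here are hypothetical.

```python
# term -> list of broader terms (hypothetical links, for illustration only).
BROADER = {
    "drainage canals": ["canals"],
    "canals": ["hydrographic structures", "transportation features"],
    "hydrographic structures": [],
    "transportation features": [],
}


def ancestors(term, broader):
    """All broader terms reachable from term, treating 'broader' as transitive."""
    found = set()
    stack = list(broader.get(term, []))
    while stack:
        parent = stack.pop()
        if parent not in found:
            found.add(parent)
            stack.extend(broader.get(parent, []))
    return found


def depth(term, broader):
    """Longest path up to a top-level term (0 for a top-level term).

    Assumes the hierarchy has no cycles.
    """
    parents = broader.get(term, [])
    return 0 if not parents else 1 + max(depth(p, broader) for p in parents)


def polyhierarchical_terms(broader):
    """Terms with more than one broader term."""
    return sorted(t for t, parents in broader.items() if len(parents) > 1)


print(sorted(ancestors("drainage canals", BROADER)))
print(depth("drainage canals", BROADER))
print(polyhierarchical_terms(BROADER))
```

If the relationship is declared non-transitive, only the direct parents in `BROADER` would apply, and a search for "transportation features" would not automatically include "drainage canals".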
  • 27. Overall structure • logical • consistent • well-balanced • extensible • fit for purpose (scenarios, business goals…) • ordering logical and consistent • top levels convey the scope • no single-child categories • no Other/Miscellaneous/General
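The last two rules are easy to check automatically. A minimal sketch, assuming the taxonomy is held as a parent-to-children mapping; the sample tree and the helper name `lint_structure` are hypothetical.

```python
CATCH_ALL = {"other", "miscellaneous", "general"}


def lint_structure(tree):
    """tree maps each category to its list of child categories.

    Flags single-child categories and catch-all child labels.
    """
    issues = []
    for parent, children in tree.items():
        if len(children) == 1:
            issues.append((parent, "single-child category"))
        for child in children:
            if child.casefold() in CATCH_ALL:
                issues.append((child, "catch-all category"))
    return issues


# Invented sample tree with one catch-all and one single-child category.
TREE = {
    "Products": ["Furniture", "Lighting", "Other"],
    "Furniture": ["Chairs", "Tables"],
    "Lighting": ["Lamps"],
}

for category, issue in lint_structure(TREE):
    print(f"{category}: {issue}")
```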
  • 28. Some techniques • Standardize, but not more than necessary • Consensus vs. mapping vs. standardized core and general rules • Derivative local taxonomies: mix & match • Scoped labels and/or relationships • If future use is not known, follow general rules, define and document as much as possible
  • 29. How to begin • make sure you know what your taxonomy needs to do, now and in the future – user research, business requirements, vision, scenarios • make sure you know all the constraints – tools, costs (including long-term maintenance), available expertise, organizational culture… • promote and obtain high-level management support • gather sources: – user warrant (search logs, social content, user research/feedback logs) – content warrant (your content, global content, your competitors’…) – existing metadata, folksonomies, glossaries, formal or informal taxonomies… – publicly available taxonomies (e.g. Linked Data, Taxonomy Warehouse): reuse, adapt, or start from scratch
  • 30. How to develop • Combination of: – Top down (domain modelling) – Bottom up (terminology clustering, open card sort) • Design & Strategy – Metadata element set, associated facets/branches – Category/term properties, relationship types, hierarchy levels… – Sustainable maintenance strategy – Metrics – Roadmap • Development – Know where to stop • Validation & Testing – Throughout development and beyond
  • 31. How to complete • Documentation – Scope – Design – Maintenance guidelines – Implementation guidance – Use guidelines • Deployment – Work with developers, UX designers, taggers and don’t give up until it is properly implemented • Governance – Roles and responsibilities – Procedures
  • 32. Exercises • Exercise groups/topics • Exercise tasks – Describe vision (add context details as needed) – Develop domain model – High-level taxonomy design and strategy – Develop key facet – Record your considerations, sources, thought process