SlideShare uma empresa Scribd logo
1 de 40
Taxonomy:
Do I need one?


                     Leigh White
            ElementalSource, LLC
Yes
What I’ll talk about

•   What happens without a taxonomy
•   What a taxonomy is and does
•   Why a taxonomy is important
•   A few first development steps
What I won’t talk about

• All the different kinds of taxonomies
• Details about development
• Tools for development
  – except DITA subjectScheme (briefly!)
A little history
What the he** IS that???
Oh, let’s call it a…

• Use the native name
• Name it after something familiar
  that it’s kind of “like”
• “Like” is murky; you have to define
  “like”
  – How it looks? Shape? Color? Size?
  – How it tastes?
  – How it acts?
Earth apples, anyone?

• aardappel (Dutch)
• pomme de terre (French)
*not apples
We know this because

• We have a taxonomy (Linnean
  classification) that specifies degrees
  of relationship between living things
Distant cousins, at best

          apple          potato

Kingdom   Plantae        Plantae

Phylum    Anthophyta     Anthophyta

Class     Eudicots       Eudicots

Order     Rosales        Solanales

Family    Rosaceae       Solanaceae

Genus     Malus          Solanum

Species   M. domestica   S. tuberosum
So, a taxonomy is

• A way of defining “like”
• A way of expressing relationships
  between things
  – We might already be instinctively
    aware of these relationships but need
    to formalize them
• A way of discovering relationships
  between things
• An information model
Taxonomies are

• typically organized by parent-child
  relationships
• typically indicated by the phrase 'is
  a kind of' or 'is a subtype of'
• the subtype has the same
  properties, behaviors, and
  constraints as the supertype plus
  one or more additional properties,
  behaviors, or constraints
Uhh…what?

• For example: car is a kind of
  vehicle, so any car is also a vehicle,
  but not every vehicle is a car
• The level “car” is more constrained
  than the level “vehicle”
• A car has all the properties of a
  vehicle plus some other properties
  specific to a car
Taxonomies are all around us

• It’s our nature to classify
• Many of these taxonomies are
  internal, arbitrary and personal
• A true taxonomy must be uniform
  and unambiguous
Other familiar taxonomies

• Dewey Decimal System
• Library of Congress System
• ICD-9/10 codes
• computer folder system
  – probably most common
    taxonomy in tech comm
And one I especially dig

• A taxonomy of wrongness!
  – www.fallacyfiles.org/taxonomy.html
We have metadata…why do we need
a taxonomy too?

• Where did that metadata come
  from?
  – You must have had some idea of how
    your content should be classified
  – If so, then you already have the
    beginnings of a taxonomy, at least in
    your head
  – So take it a step further
Metadata compliments taxonomy
and vice-versa
• Metadata describes an individual piece of
  content but doesn’t capture relationships
  very well.
• Metadata is part of content so updates
  can be unwieldy; better to maintain the
  model outside the content
• A taxonomy serves as a roadmap…it both
  describes current content and predicts
  future content
• A taxonomy highlights similarities (and
  differences) across products
• Metadata can pick up where taxonomy
  leaves off
What else are taxonomies good for?

• Controlled vocabularies
  – indexing
  – keywords
  – glossaries

• Searching/browsing/filtering
  – Faceted search
  – Filtering for custom doc publishing

• Content reuse
Amazon.com
So far…

• we’ve looked at hierarchical
  taxonomies
When hierarchy isn’t enough

 A Cockapoo is a kind of dog. It’s the
  product of a poodle and a Cocker
  Spaniel. A hierarchy cannot capture
  all these relationships.
There’s an alternative (polyarchical)
Purists might say…

• that you need different notations to
  express different kinds of
  relationships
• or that you must express the
  relationships uniformly
Maybe, maybe not

• You need what you need to capture
  the relationships you need to
  express
• No more, no less - KISS
• The relationships already exist; you
  are just using the taxonomy to
  express them
Decisions to make

• What kind of taxonomy:
  – hierarchical, polyarchical, something
    else?
• If hierarchical, how many levels?
• If polyarchical, what kinds of
  relationships and how designated?
• Tool to use? (meh)
• How to associate content with
  taxonomy?
Questions to ask
• What will the taxonomy be used for?
  – indexing, search, etc.
• Who are the users?
  – content creators, clients, SMEs, support, etc.
• What content will the taxonomy cover?
  – topics, images, demos, videos, etc.
• What are the scope and limits?
  – handling off-topic content—what to
    include/exclude
• What are the resources and constraints?
  – skills/expertise, timing, technology, funding,
    stakeholder roles, etc.
More questions to ask

• Who is responsible for development?
• What are secondary/contributor
  roles?
• How does taxonomy fit in with other
  metadata?
• How to handle ongoing support and
  maintenance?
Some first steps
• Start small—maybe just one small product
• Do content audit of everything the
  taxonomy will categorize
• Compare TOCs of existing deliverables
  – Find commonalities, differences
• Compare indexes of existing deliverables
  – Discover terms already in use
• Use folder structure
More first steps
• Assemble starting list of categories
  that cover existing content based on
  TOC, index and content audit
• Place existing content within
  taxonomy (on paper)
• Create taxonomy task force to
  review and refine
  – Avoid too many cooks
DITA Classification and Subject
Scheme
• Subject scheme
  – Defines controlled values (“buckets”)
    for classifying content
  – Defines relationships between those
    buckets
• Classification
  – Groups content into appropriate
    buckets
Subject classification scheme
subjectScheme map
<subjectScheme>
   <hasInstance>
      <subjectdef keys="product">
         <subjectdef keys="Widget"/>
            <subjectdef keys="module">
               <subjectdef keys="Meds"/>
               <subjectdef keys="AdminW"/>
            </subjectdef>
         </subjectdef>
         <subjectdef keys="Gadget"/>
            <subjectdef keys="module">
               <subjectdef keys="AdminG"/>
               <subjectdef keys="Labs"/>
            </subjectdef>
         </subjectdef>
      </subjectdef>
   </hasInstance>
</subjectScheme>
Associate topics with subjects
<map>
   <topicref href="t_configure_med.xml">
      <topicsubject>
         <subjectref keys="Meds"/>
         <subjectref keys="AdminW"/>
         <subjectref keys="AdminG"/>
      </topicsubject>
   </topicref>
</map>
Recommended reading/viewing

• The Accidental Taxonomist, Heather
  Hedden
• Organising Knowledge: Taxonomies,
  Knowledge, and Organisational
  Effectiveness, Patrick Lambe
• Joe Gelb’s presentation on
  subjectScheme:
  http://svdig.ditamap.com/videos/sv
  dig-2011-05-11.htm
Contact me



               Leigh White
      ElementalSource, LLC

elementalsource@gmail.com
              678.467.7706

Mais conteúdo relacionado

Mais procurados

RDA: Resource Description and Access
RDA: Resource Description and AccessRDA: Resource Description and Access
RDA: Resource Description and Access
Rieta Drinkwine
 
Introduction to Dublin Core Metadata
Introduction to Dublin Core MetadataIntroduction to Dublin Core Metadata
Introduction to Dublin Core Metadata
Hannes Ebner
 
LIS 653, Session 3: Principles and Standards
LIS 653, Session 3: Principles and Standards LIS 653, Session 3: Principles and Standards
LIS 653, Session 3: Principles and Standards
Dr. Starr Hoffman
 

Mais procurados (20)

Building and using ontologies
Building and using ontologies Building and using ontologies
Building and using ontologies
 
Taxonomy 101
Taxonomy 101Taxonomy 101
Taxonomy 101
 
RDA: Resource Description and Access
RDA: Resource Description and AccessRDA: Resource Description and Access
RDA: Resource Description and Access
 
Metadata standards
Metadata standardsMetadata standards
Metadata standards
 
Automatic classification
Automatic classificationAutomatic classification
Automatic classification
 
Introduction to Dublin Core Metadata
Introduction to Dublin Core MetadataIntroduction to Dublin Core Metadata
Introduction to Dublin Core Metadata
 
Html intro
Html introHtml intro
Html intro
 
Understanding SharePoint site structure what's inside
Understanding SharePoint site structure  what's insideUnderstanding SharePoint site structure  what's inside
Understanding SharePoint site structure what's inside
 
Encoded Archival Description (EAD)
Encoded Archival Description (EAD) Encoded Archival Description (EAD)
Encoded Archival Description (EAD)
 
Bill Slawski SEO and the New Search Results
Bill Slawski   SEO and the New Search ResultsBill Slawski   SEO and the New Search Results
Bill Slawski SEO and the New Search Results
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to Metadata
 
LIS 653, Session 10: Controlled Vocabulary
LIS 653, Session 10: Controlled VocabularyLIS 653, Session 10: Controlled Vocabulary
LIS 653, Session 10: Controlled Vocabulary
 
RDA for Original Catalogers
RDA for Original CatalogersRDA for Original Catalogers
RDA for Original Catalogers
 
FRBR presentation by Bwsrang Basumatary
FRBR presentation by Bwsrang BasumataryFRBR presentation by Bwsrang Basumatary
FRBR presentation by Bwsrang Basumatary
 
DITA Interoperability
DITA InteroperabilityDITA Interoperability
DITA Interoperability
 
SharePoint Site IA Architecture Design Considerations - Innovate Vancouver.pdf
SharePoint Site IA Architecture Design Considerations - Innovate Vancouver.pdfSharePoint Site IA Architecture Design Considerations - Innovate Vancouver.pdf
SharePoint Site IA Architecture Design Considerations - Innovate Vancouver.pdf
 
Beyond MARC: BIBFRAME and the Future of Bibliographic Data
Beyond MARC: BIBFRAME and the Future of Bibliographic DataBeyond MARC: BIBFRAME and the Future of Bibliographic Data
Beyond MARC: BIBFRAME and the Future of Bibliographic Data
 
LIS 653, Session 3: Principles and Standards
LIS 653, Session 3: Principles and Standards LIS 653, Session 3: Principles and Standards
LIS 653, Session 3: Principles and Standards
 
RDA
RDA RDA
RDA
 
Subject gateway knowledge organisation
Subject gateway knowledge organisationSubject gateway knowledge organisation
Subject gateway knowledge organisation
 

Destaque

Using DITA's Subject Scheme Support for Educational Assessment Content
Using DITA's Subject Scheme Support for Educational Assessment ContentUsing DITA's Subject Scheme Support for Educational Assessment Content
Using DITA's Subject Scheme Support for Educational Assessment Content
Edwina Lui
 
Understanding Information Architecture
Understanding Information ArchitectureUnderstanding Information Architecture
Understanding Information Architecture
Scott Abel
 
Multiplying the Power of Taxonomy with Granular, Structured Content
Multiplying the Power of Taxonomy with Granular, Structured ContentMultiplying the Power of Taxonomy with Granular, Structured Content
Multiplying the Power of Taxonomy with Granular, Structured Content
Joe Pairman
 

Destaque (19)

Taxonomy
TaxonomyTaxonomy
Taxonomy
 
Using DITA's Subject Scheme Support for Educational Assessment Content
Using DITA's Subject Scheme Support for Educational Assessment ContentUsing DITA's Subject Scheme Support for Educational Assessment Content
Using DITA's Subject Scheme Support for Educational Assessment Content
 
Localization and DITA: What you Need to Know - LocWorld32
Localization and DITA: What you Need to Know - LocWorld32Localization and DITA: What you Need to Know - LocWorld32
Localization and DITA: What you Need to Know - LocWorld32
 
Pat Farrell, Migrating Legacy Documentation to XML and DITA
Pat Farrell, Migrating Legacy Documentation to XML and DITAPat Farrell, Migrating Legacy Documentation to XML and DITA
Pat Farrell, Migrating Legacy Documentation to XML and DITA
 
Surviving the Transition to DITA: Trusted Partners can Ease the Pain
Surviving the Transition to DITA: Trusted Partners can Ease the PainSurviving the Transition to DITA: Trusted Partners can Ease the Pain
Surviving the Transition to DITA: Trusted Partners can Ease the Pain
 
Converting Unstructured Docs to XML/DITA/ePub
Converting Unstructured Docs to XML/DITA/ePubConverting Unstructured Docs to XML/DITA/ePub
Converting Unstructured Docs to XML/DITA/ePub
 
Metadata: Queen to King Content?
Metadata: Queen to King Content?Metadata: Queen to King Content?
Metadata: Queen to King Content?
 
Joe Gelb: Taxonomy and Delivery
Joe Gelb: Taxonomy and DeliveryJoe Gelb: Taxonomy and Delivery
Joe Gelb: Taxonomy and Delivery
 
The Elusive Promise of Reuse
The Elusive Promise of ReuseThe Elusive Promise of Reuse
The Elusive Promise of Reuse
 
Easy steps to convert your content to structured (frame maker and xml)
Easy steps to convert your content to structured (frame maker and xml)Easy steps to convert your content to structured (frame maker and xml)
Easy steps to convert your content to structured (frame maker and xml)
 
How to Optimize Your Metadata and Taxonomy
How to Optimize Your Metadata and TaxonomyHow to Optimize Your Metadata and Taxonomy
How to Optimize Your Metadata and Taxonomy
 
Reports and DITA Metrics IXIASOFT User Conference 2016
Reports and DITA Metrics IXIASOFT User Conference 2016Reports and DITA Metrics IXIASOFT User Conference 2016
Reports and DITA Metrics IXIASOFT User Conference 2016
 
Optimizing Content Reuse with DITA
Optimizing Content Reuse with DITAOptimizing Content Reuse with DITA
Optimizing Content Reuse with DITA
 
Developing training websites in multiple languages with (mostly) open-source ...
Developing training websites in multiple languages with (mostly) open-source ...Developing training websites in multiple languages with (mostly) open-source ...
Developing training websites in multiple languages with (mostly) open-source ...
 
Blurring the Lines between ECM and CCMS
Blurring the Lines between ECM and CCMSBlurring the Lines between ECM and CCMS
Blurring the Lines between ECM and CCMS
 
Understanding Information Architecture
Understanding Information ArchitectureUnderstanding Information Architecture
Understanding Information Architecture
 
Increasing Findability with Subject Schemes (Advanced DITA Webinar)
Increasing Findability with Subject Schemes (Advanced DITA Webinar)Increasing Findability with Subject Schemes (Advanced DITA Webinar)
Increasing Findability with Subject Schemes (Advanced DITA Webinar)
 
Multiplying the Power of Taxonomy with Granular, Structured Content
Multiplying the Power of Taxonomy with Granular, Structured ContentMultiplying the Power of Taxonomy with Granular, Structured Content
Multiplying the Power of Taxonomy with Granular, Structured Content
 
Wireframing, Mockups, and Prototyping Made Easy
Wireframing, Mockups, and Prototyping Made EasyWireframing, Mockups, and Prototyping Made Easy
Wireframing, Mockups, and Prototyping Made Easy
 

Semelhante a Taxonomy: Do I Need One

Realizing the Full Potential of Taxonomies by Branka Kosovac
Realizing the Full Potential of Taxonomies by Branka KosovacRealizing the Full Potential of Taxonomies by Branka Kosovac
Realizing the Full Potential of Taxonomies by Branka Kosovac
Content Strategy Workshops
 
The Elusive Promise of Reuse
The Elusive Promise of ReuseThe Elusive Promise of Reuse
The Elusive Promise of Reuse
IXIASOFT
 
Designing an effective information architecture (
Designing an effective information architecture (Designing an effective information architecture (
Designing an effective information architecture (
Vickey Bird
 
Library Research for Human Rights Guide
Library Research for Human Rights GuideLibrary Research for Human Rights Guide
Library Research for Human Rights Guide
Annelise Sklar
 

Semelhante a Taxonomy: Do I Need One (20)

Realizing the Full Potential of Taxonomies by Branka Kosovac
Realizing the Full Potential of Taxonomies by Branka KosovacRealizing the Full Potential of Taxonomies by Branka Kosovac
Realizing the Full Potential of Taxonomies by Branka Kosovac
 
Metadata
MetadataMetadata
Metadata
 
Folksonomies & social tagging
Folksonomies & social taggingFolksonomies & social tagging
Folksonomies & social tagging
 
Taxonomy 101: What do rockets and arugula have in common?
Taxonomy 101: What do rockets and arugula have in common?Taxonomy 101: What do rockets and arugula have in common?
Taxonomy 101: What do rockets and arugula have in common?
 
What Is Taxonomy and Why Is It Useful?
What Is Taxonomy and Why Is It Useful?What Is Taxonomy and Why Is It Useful?
What Is Taxonomy and Why Is It Useful?
 
Some thoughts on social tagging
Some thoughts on social taggingSome thoughts on social tagging
Some thoughts on social tagging
 
[AIIM17] Data Categorization You Can Live With - Monica Crocker
[AIIM17]  Data Categorization You Can Live With - Monica Crocker [AIIM17]  Data Categorization You Can Live With - Monica Crocker
[AIIM17] Data Categorization You Can Live With - Monica Crocker
 
System Concepts for Object Modelling.pptx
System Concepts for Object Modelling.pptxSystem Concepts for Object Modelling.pptx
System Concepts for Object Modelling.pptx
 
The Elusive Promise of Reuse
The Elusive Promise of ReuseThe Elusive Promise of Reuse
The Elusive Promise of Reuse
 
How To Go About Researching
How To Go About ResearchingHow To Go About Researching
How To Go About Researching
 
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARYINFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
 
Card Sorting Your Way to Meaningful Metadata
Card Sorting Your Way to Meaningful MetadataCard Sorting Your Way to Meaningful Metadata
Card Sorting Your Way to Meaningful Metadata
 
Designing an effective information architecture (
Designing an effective information architecture (Designing an effective information architecture (
Designing an effective information architecture (
 
Library Research for Human Rights Guide
Library Research for Human Rights GuideLibrary Research for Human Rights Guide
Library Research for Human Rights Guide
 
Analysing Qualitative Data
Analysing Qualitative DataAnalysing Qualitative Data
Analysing Qualitative Data
 
Taxonomy Fundamentals Workshop
Taxonomy Fundamentals WorkshopTaxonomy Fundamentals Workshop
Taxonomy Fundamentals Workshop
 
Michael Bolton - Heuristics: Solving Problems Rapidly
Michael Bolton - Heuristics: Solving Problems RapidlyMichael Bolton - Heuristics: Solving Problems Rapidly
Michael Bolton - Heuristics: Solving Problems Rapidly
 
Library research for Environmental Studies at UCSD
Library research for Environmental Studies at UCSDLibrary research for Environmental Studies at UCSD
Library research for Environmental Studies at UCSD
 
xAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics Hackathon
xAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics HackathonxAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics Hackathon
xAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics Hackathon
 
Thematic content analysis in psychology
Thematic content analysis in psychologyThematic content analysis in psychology
Thematic content analysis in psychology
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 

Taxonomy: Do I Need One

  • 1. Taxonomy: Do I need one? Leigh White ElementalSource, LLC
  • 2. Yes
  • 3. What I’ll talk about • What happens without a taxonomy • What a taxonomy is and does • Why a taxonomy is important • A few first development steps
  • 4. What I won’t talk about • All the different kinds of taxonomies • Details about development • Tools for development – except DITA subjectScheme (briefly!)
  • 6. What the he** IS that???
  • 7. Oh, let’s call it a… • Use the native name • Name it after something familiar that it’s kind of “like” • “Like” is murky; you have to define “like” – How it looks? Shape? Color? Size? – How it tastes? – How it acts?
  • 8. Earth apples, anyone? • aardappel (Dutch) • pomme de terre (French)
  • 10. We know this because • We have a taxonomy (Linnean classification) that specifies degrees of relationship between living things
  • 11. Distant cousins, at best apple potato Kingdom Plantae Plantae Phylum Anthophyta Anthophyta Class Eudicots Eudicots Order Rosales Solanales Family Rosaceae Solanaceae Genus Malus Solanum Species M. domestica S. tuberosum
  • 12. So, a taxonomy is • A way of defining “like” • A way of expressing relationships between things – We might already be instinctively aware of these relationships but need to formalize them • A way of discovering relationships between things • An information model
  • 13. Taxonomies are • typically organized by parent-child relationships • typically indicated by the phrase 'is a kind of' or 'is a subtype of' • the subtype has the same properties, behaviors, and constraints as the supertype plus one or more additional properties, behaviors, or constraints
  • 14. Uhh…what? • For example: car is a kind of vehicle, so any car is also a vehicle, but not every vehicle is a car • The level “car” is more constrained than the level “vehicle” • A car has all the properties of a vehicle plus some other properties specific to a car
  • 15. Taxonomies are all around us • It’s our nature to classify • Many of these taxonomies are internal, arbitrary and personal • A true taxonomy must be uniform and unambiguous
  • 16. Other familiar taxonomies • Dewey Decimal System • Library of Congress System • ICD-9/10 codes • computer folder system – probably most common taxonomy in tech comm
  • 17. And one I especially dig • A taxonomy of wrongness! – www.fallacyfiles.org/taxonomy.html
  • 18. We have metadata…why do we need a taxonomy too? • Where did that metadata come from? – You must have had some idea of how your content should be classified – If so, then you already have the beginnings of a taxonomy, at least in your head – So take it a step further
  • 19. Metadata compliments taxonomy and vice-versa • Metadata describes an individual piece of content but doesn’t capture relationships very well. • Metadata is part of content so updates can be unwieldy; better to maintain the model outside the content • A taxonomy serves as a roadmap…it both describes current content and predicts future content • A taxonomy highlights similarities (and differences) across products • Metadata can pick up where taxonomy leaves off
  • 20. What else are taxonomies good for? • Controlled vocabularies – indexing – keywords – glossaries • Searching/browsing/filtering – Faceted search – Filtering for custom doc publishing • Content reuse
  • 22.
  • 23.
  • 24.
  • 25. So far… • we’ve looked at hierarchical taxonomies
  • 26. When hierarchy isn’t enough  A Cockapoo is a kind of dog. It’s the product of a poodle and a Cocker Spaniel. A hierarchy cannot capture all these relationships.
  • 27. There’s an alternative (polyarchical)
  • 28. Purists might say… • that you need different notations to express different kinds of relationships • or that you must express the relationships uniformly
  • 29. Maybe, maybe not • You need what you need to capture the relationships you need to express • No more, no less - KISS • The relationships already exist; you are just using the taxonomy to express them
  • 30. Decisions to make • What kind of taxonomy: – hierarchical, polyarchical, something else? • If hierarchical, how many levels? • If polyarchical, what kinds of relationships and how designated? • Tool to use? (meh) • How to associate content with taxonomy?
  • 31. Questions to ask • What will the taxonomy be used for? – indexing, search, etc. • Who are the users? – content creators, clients, SMEs, support, etc. • What content will the taxonomy cover? – topics, images, demos, videos, etc. • What are the scope and limits? – handling off-topic content—what to include/exclude • What are the resources and constraints? – skills/expertise, timing, technology, funding, stakeholder roles, etc.
  • 32. More questions to ask • Who is responsible for development? • What are secondary/contributor roles? • How does taxonomy fit in with other metadata? • How to handle ongoing support and maintenance?
  • 33. Some first steps • Start small—maybe just one small product • Do content audit of everything the taxonomy will categorize • Compare TOCs of existing deliverables – Find commonalities, differences • Compare indexes of existing deliverables – Discover terms already in use • Use folder structure
  • 34. More first steps • Assemble starting list of categories that cover existing content based on TOC, index and content audit • Place existing content within taxonomy (on paper) • Create taxonomy task force to review and refine – Avoid too many cooks
  • 35. DITA Classification and Subject Scheme • Subject scheme – Defines controlled values (“buckets”) for classifying content – Defines relationships between those buckets • Classification – Groups content into appropriate buckets
  • 37. subjectScheme map <subjectScheme> <hasInstance> <subjectdef keys="product"> <subjectdef keys="Widget"/> <subjectdef keys="module"> <subjectdef keys="Meds"/> <subjectdef keys="AdminW"/> </subjectdef> </subjectdef> <subjectdef keys="Gadget"/> <subjectdef keys="module"> <subjectdef keys="AdminG"/> <subjectdef keys="Labs"/> </subjectdef> </subjectdef> </subjectdef> </hasInstance> </subjectScheme>
  • 38. Associate topics with subjects <map> <topicref href="t_configure_med.xml"> <topicsubject> <subjectref keys="Meds"/> <subjectref keys="AdminW"/> <subjectref keys="AdminG"/> </topicsubject> </topicref> </map>
  • 39. Recommended reading/viewing • The Accidental Taxonomist, Heather Hedden • Organising Knowledge: Taxonomies, Knowledge, and Organisational Effectiveness, Patrick Lambe • Joe Gelb’s presentation on subjectScheme: http://svdig.ditamap.com/videos/sv dig-2011-05-11.htm
  • 40. Contact me Leigh White ElementalSource, LLC elementalsource@gmail.com 678.467.7706