SlideShare uma empresa Scribd logo
1 de 20
Baixar para ler offline
Indexing & Tagging:
Taxonomies & Folksonomies
National Federation of Advanced Information Services (NFAIS)
Improving the User Search Experience
October 13, 2010, Philadelphia, Pennsylvania

Heather Hedden, Taxonomy Consultant
Heather Hedden’s Background
 Periodical database indexer for Information Access
 Company (IAC): Trade & Industry and PROMT
 Controlled vocabulary editor, IAC/Gale
 Freelance indexer and taxonomy consultant
 Taxonomy manager, Viziant and First Wind
 Continuing education instructor, Simmons GSLIS
 Author, The Accidental Taxonomist




                     © 2010 Heather Hedden            2
What’s the difference?

 Cataloging   with a CV
 Indexing     with a CV
 Keywording
 Tagging




              © 2010 Heather Hedden   3
Definitions
 Controlled Vocabulary – A controlled list of terms for
 concepts, usually with nonpreferred terms (“synonyms”).
 May or may not have structure and relationships
 between terms.
    Broader and includes taxonomies.
 Taxonomy – A hierarchical structure of terms, which
 may or many not include nonpreferred terms.
    Has popularly replaced “controlled vocabulary” as a
    broader concept, which may or may not be
    hierarchical.

                     © 2010 Heather Hedden                 4
Indexing

 Closed does not use CV; open does
 Trained, dedicated indexers
 Indexing guidelines and policies used
 Controlled vocabularies designed to support
   Indexer-focused scope notes
   Nonpreferred terms, displayed
   Extensive hierarchical & associative relationships
   Browsable, alphabetical display, type-ahead, etc.

                     © 2010 Heather Hedden              5
Indexers and Index Terms

 Indexers are closest to the content
 They see new concepts and terminology
 They need methods of:
   suggesting additional nonpreferred terms and
   relationships between terms
   suggesting new terms to add to the controlled
   vocabulary
   adding terms that supplement the controlled
   vocabulary

                    © 2010 Heather Hedden          6
Levels of Vocabulary Management

1.   Only controlled vocabulary used
      Indexer suggests terms for approval prior to use.
2.   Controlled vocabulary plus unapproved terms
      Indexer may use terms prior to approval.
3.   Controlled vocabulary plus keywords
      Indexer supplements with terms never reviewed.
4.   Controlled vocabulary plus shared keywords
      Indexer supplements with terms never reviewed, but
      re-usable.
                        © 2010 Heather Hedden              7
Levels of Vocabulary Management

1.   Only controlled vocabulary used
       Simple to manage and implement
       Indexers email taxonomist with suggestions
       For small controlled vocabularies
       For limited scope, relatively static content
       For small indexing operations




                       © 2010 Heather Hedden          8
Levels of Vocabulary Management

2.   Controlled vocabulary plus unapproved terms
       Indexer may create “candidate”, “unapproved,” or
       “override terms” for immediate use, but also entered into
       the system for taxonomist review
       Unapproved indexed terms could convert to nonpreferred
       terms.
       Can be restricted to only certain types of terms (usually
       named entities) or all terms
       For large indexing operations, large controlled
       vocabularies, extensive content
       More technological complex to implement
                         © 2010 Heather Hedden                 9
Levels of Vocabulary Management

3.   Controlled vocabulary plus keywords
       Indexer supplements controlled vocabulary indexing with
       any keywords entered into a separate field
       Keywords may be for new concepts, but are often for
       more specific concepts and names
       Keywords do not become part of the controlled
       vocabulary. Taxonomist may or may not look at them.
       Varies: (a) more indexing is with controlled vocabulary or
       (b) more with keywords and CV is only broad categories
       Less indexing technology, but more complex end-user
       retrieval options (3 types: taxonomy, keywords, freetext)
       Variants/synonyms are not controlled, redundant
                         © 2010 Heather Hedden                  10
Levels of Vocabulary Management

4.   Controlled vocabulary and shared keywords
       Indexer supplements controlled vocabulary indexing with
       any keywords entered into a separate field
       Keywords do not become part of the controlled
       vocabulary, but are stored in another database.
       All keywords become immediately available for all
       indexers to use and reuse
       Indexers can browse the list of previously used
       keywords; redundancies are reduced, not eliminated



                        © 2010 Heather Hedden                11
Folksonomy
 Shared keywords, created and shared by those “indexing”
 Indexing is not by indexers, but by end-users, consumers
 of the content.
     Common people, the “folk.”
     “Tagging”
 Users might also contribute structure, creating
 (hierarchical) relationships
 Originally defined as:
 “user-created bottom-up categorical structure development
 with an emergent thesaurus”
 --Thomas Vander Wal, July 2004
                     © 2010 Heather Hedden             12
Folksonomy: Who Contributes to it?

 Indexers
 Editors
 Employees
 End-user subscribers
 Any end-users



               © 2010 Heather Hedden   13
Folksonomy: Where used?
 Public web sites of high volume content and users:
    Delicious (http://delicious.com)
    Connotea (www.connotea.org)
     Diigo (www.diigo.com)
    Flickr (www.flickr.com)
    LibraryThing (www.librarything.com)
 Large enterprises that want to foster collaboration and
 innovation
 Subscription content providers?



                        © 2010 Heather Hedden              14
Social Tagging
 Also called collaborative tagging, social classification,
 social indexing
 Folksonomies plus Web 2.0/social networking features
 Social communities can be built around shared sets of
 popular content or popular tags
 Tags for popularity ratings
 Now moving into enterprises




                      © 2010 Heather Hedden                  15
Folksonomies/Social tagging

 Advantages
   Reflects trends, up-to-date, can monitor
   change and popularity. Dynamic.
   Cheaper and quicker than building and
   maintaining a taxonomy
   Facilitates workplace democracy and the
   distribution of management tasks
   Responsive to user needs

                  © 2010 Heather Hedden       16
Folksonomies/Social tagging

 Disadvantages
   Inconsistent – precision & recall deficiencies
   Biased
   Requires critical mass of involvement to be
   useful
   Does not scale well to a large volume of
   content



                   © 2010 Heather Hedden            17
Folksonomies/Social tagging
 Solutions/trends:
   Some degree of vocabulary control
   Applicable to certain areas of content, not all

 Partial vocabulary control:
   Dual taxonomy & folksonomy system
       Requires policy of taxonomy first
   Single vocabulary system with some terms
   managed/edited, and some not (yet)
       like a wiki with editors

                      © 2010 Heather Hedden          18
Conclusions
 Different people often apply taxonomies or
 folksonomies.
 Taxonomies and Folksonomies may supplement
 each other.
 Technology facilitates the application of both.
 Users understand the distinction and can use
 both.



                  © 2010 Heather Hedden        19
Questions/Contact
Heather Hedden
978-467-5195
heather@hedden.net
www.hedden-information.com
www.accidental-taxonomist.com




                     © 2010 Heather Hedden   20

Mais conteúdo relacionado

Mais procurados

Special libraries Presentation
Special libraries PresentationSpecial libraries Presentation
Special libraries Presentation
Muhammad Kashif
 
Chain indexing
Chain indexingChain indexing
Chain indexing
silambu111
 
METS(Metadata Encoding and Transmission Standard )
METS(Metadata Encoding and Transmission Standard )METS(Metadata Encoding and Transmission Standard )
METS(Metadata Encoding and Transmission Standard )
Manu K M
 
Indexing languages
Indexing languagesIndexing languages
Indexing languages
yhen06
 
Staff manual,lib.survey,statistics,standards.
Staff manual,lib.survey,statistics,standards.Staff manual,lib.survey,statistics,standards.
Staff manual,lib.survey,statistics,standards.
ghulamsamdani
 

Mais procurados (20)

Introduction to Cataloging and Classification
Introduction to Cataloging and ClassificationIntroduction to Cataloging and Classification
Introduction to Cataloging and Classification
 
Metadata Standards
Metadata StandardsMetadata Standards
Metadata Standards
 
RDA (Resource Description & Access)
RDA (Resource Description & Access)RDA (Resource Description & Access)
RDA (Resource Description & Access)
 
FRBR model by Gaurav Boudh
FRBR model by Gaurav BoudhFRBR model by Gaurav Boudh
FRBR model by Gaurav Boudh
 
Special libraries Presentation
Special libraries PresentationSpecial libraries Presentation
Special libraries Presentation
 
Chain indexing
Chain indexingChain indexing
Chain indexing
 
METS(Metadata Encoding and Transmission Standard )
METS(Metadata Encoding and Transmission Standard )METS(Metadata Encoding and Transmission Standard )
METS(Metadata Encoding and Transmission Standard )
 
Techniques for Electronic Resource Management: Crowdsourcing for Best Practices
Techniques for Electronic Resource Management: Crowdsourcing for Best PracticesTechniques for Electronic Resource Management: Crowdsourcing for Best Practices
Techniques for Electronic Resource Management: Crowdsourcing for Best Practices
 
Indexing language concept types and characteristics
Indexing language concept types and characteristicsIndexing language concept types and characteristics
Indexing language concept types and characteristics
 
Descriptive catalogue
Descriptive catalogueDescriptive catalogue
Descriptive catalogue
 
Subject cataloging
Subject catalogingSubject cataloging
Subject cataloging
 
Indexing languages
Indexing languagesIndexing languages
Indexing languages
 
Library congress subject headings
Library congress subject headings Library congress subject headings
Library congress subject headings
 
Staff manual,lib.survey,statistics,standards.
Staff manual,lib.survey,statistics,standards.Staff manual,lib.survey,statistics,standards.
Staff manual,lib.survey,statistics,standards.
 
Comparative study of major classification schemes
Comparative study of major classification schemesComparative study of major classification schemes
Comparative study of major classification schemes
 
Lcsh
LcshLcsh
Lcsh
 
Subject cataloging
Subject catalogingSubject cataloging
Subject cataloging
 
SLSH ppt
SLSH pptSLSH ppt
SLSH ppt
 
Indexing and abstracting preservation and conservation
Indexing and abstracting preservation and conservationIndexing and abstracting preservation and conservation
Indexing and abstracting preservation and conservation
 
International Standard Bibliographic Description: background and recent devel...
International Standard Bibliographic Description: background and recent devel...International Standard Bibliographic Description: background and recent devel...
International Standard Bibliographic Description: background and recent devel...
 

Semelhante a Taxonomies and Folksonomies

Standards (or lack thereof) in Institutional Repositories. Presentation for t...
Standards (or lack thereof) in Institutional Repositories. Presentation for t...Standards (or lack thereof) in Institutional Repositories. Presentation for t...
Standards (or lack thereof) in Institutional Repositories. Presentation for t...
Sarah Shreeves
 
Putting Controlled Vocabulary To Work I Davis 2008
Putting Controlled Vocabulary To Work I Davis 2008Putting Controlled Vocabulary To Work I Davis 2008
Putting Controlled Vocabulary To Work I Davis 2008
Ian Davis
 

Semelhante a Taxonomies and Folksonomies (20)

Taxonomy Displays: Bridging UX & Taxonomy Design
Taxonomy Displays: Bridging UX & Taxonomy DesignTaxonomy Displays: Bridging UX & Taxonomy Design
Taxonomy Displays: Bridging UX & Taxonomy Design
 
Taxonomies for Human vs Auto-Indexing
Taxonomies for Human vs Auto-IndexingTaxonomies for Human vs Auto-Indexing
Taxonomies for Human vs Auto-Indexing
 
Mapping, Merging, and Multilingual Taxonomies
Mapping, Merging, and Multilingual TaxonomiesMapping, Merging, and Multilingual Taxonomies
Mapping, Merging, and Multilingual Taxonomies
 
Selecting Software for Taxonomy, Thesaurus and Ontology Management
Selecting Software for Taxonomy, Thesaurus and Ontology ManagementSelecting Software for Taxonomy, Thesaurus and Ontology Management
Selecting Software for Taxonomy, Thesaurus and Ontology Management
 
Controlled Vocabulary.pptx
Controlled Vocabulary.pptxControlled Vocabulary.pptx
Controlled Vocabulary.pptx
 
Taxonomies and Metadata in Information Architecture
Taxonomies and Metadata in Information ArchitectureTaxonomies and Metadata in Information Architecture
Taxonomies and Metadata in Information Architecture
 
Benefits of Taxonomies
Benefits of TaxonomiesBenefits of Taxonomies
Benefits of Taxonomies
 
Standards (or lack thereof) in Institutional Repositories. Presentation for t...
Standards (or lack thereof) in Institutional Repositories. Presentation for t...Standards (or lack thereof) in Institutional Repositories. Presentation for t...
Standards (or lack thereof) in Institutional Repositories. Presentation for t...
 
Taxonomy Design for SharePoint
Taxonomy Design for SharePointTaxonomy Design for SharePoint
Taxonomy Design for SharePoint
 
Putting Controlled Vocabulary To Work I Davis 2008
Putting Controlled Vocabulary To Work I Davis 2008Putting Controlled Vocabulary To Work I Davis 2008
Putting Controlled Vocabulary To Work I Davis 2008
 
Developing a draft Information Literacy thesaurus
Developing a draft Information Literacy thesaurusDeveloping a draft Information Literacy thesaurus
Developing a draft Information Literacy thesaurus
 
Testing Taxonomies
Testing TaxonomiesTesting Taxonomies
Testing Taxonomies
 
SharePoint 2010 Managed Metadata
SharePoint 2010 Managed MetadataSharePoint 2010 Managed Metadata
SharePoint 2010 Managed Metadata
 
Synonyms, Alternative Labels, and Nonpreferred Terms
Synonyms, Alternative Labels, and Nonpreferred TermsSynonyms, Alternative Labels, and Nonpreferred Terms
Synonyms, Alternative Labels, and Nonpreferred Terms
 
User-Driven Taxonomies
User-Driven TaxonomiesUser-Driven Taxonomies
User-Driven Taxonomies
 
Indexing
IndexingIndexing
Indexing
 
Controlled Vocabulary.pptx
Controlled Vocabulary.pptxControlled Vocabulary.pptx
Controlled Vocabulary.pptx
 
Policy implication of the e2.0 study D.Osimo Tech4i2
Policy implication of the e2.0 study D.Osimo Tech4i2Policy implication of the e2.0 study D.Osimo Tech4i2
Policy implication of the e2.0 study D.Osimo Tech4i2
 
Terminology Management
Terminology ManagementTerminology Management
Terminology Management
 
Successful Content Management Through Taxonomy And Metadata Design
Successful Content Management Through Taxonomy And Metadata DesignSuccessful Content Management Through Taxonomy And Metadata Design
Successful Content Management Through Taxonomy And Metadata Design
 

Mais de Heather Hedden

Mais de Heather Hedden (14)

Introduction to Knowledge Graphs for Information Architects.pdf
Introduction to Knowledge Graphs for Information Architects.pdfIntroduction to Knowledge Graphs for Information Architects.pdf
Introduction to Knowledge Graphs for Information Architects.pdf
 
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
Thesauri for Indexing Support / Thesauri zur Unterstützung der Registererstel...
 
Taxonomies in Support of Search
Taxonomies in Support of SearchTaxonomies in Support of Search
Taxonomies in Support of Search
 
A Brief Introduction to SKOS
A Brief Introduction to SKOSA Brief Introduction to SKOS
A Brief Introduction to SKOS
 
Mapping Taxonomies, Thesauri, and Ontologies
Mapping Taxonomies, Thesauri, and OntologiesMapping Taxonomies, Thesauri, and Ontologies
Mapping Taxonomies, Thesauri, and Ontologies
 
A Brief Introduction to Knowledge Graphs
A Brief Introduction to Knowledge GraphsA Brief Introduction to Knowledge Graphs
A Brief Introduction to Knowledge Graphs
 
Managing Taxonomy Tagging
Managing Taxonomy TaggingManaging Taxonomy Tagging
Managing Taxonomy Tagging
 
Taxonomies for Users
Taxonomies for UsersTaxonomies for Users
Taxonomies for Users
 
Taxonomies, Categories, and Tags in WordPress
Taxonomies, Categories, and Tags in WordPressTaxonomies, Categories, and Tags in WordPress
Taxonomies, Categories, and Tags in WordPress
 
Customer-Focused Thesauri
Customer-Focused ThesauriCustomer-Focused Thesauri
Customer-Focused Thesauri
 
Managing Mature Taxonomies: Resolving Orphan Terms
Managing Mature Taxonomies: Resolving Orphan TermsManaging Mature Taxonomies: Resolving Orphan Terms
Managing Mature Taxonomies: Resolving Orphan Terms
 
Taxonomies for E-commerce
Taxonomies for E-commerceTaxonomies for E-commerce
Taxonomies for E-commerce
 
Taxonomies for Text Analytics and Auto-indexing
Taxonomies for Text Analytics and Auto-indexingTaxonomies for Text Analytics and Auto-indexing
Taxonomies for Text Analytics and Auto-indexing
 
Making Decisions in Creating Taxonomies
Making Decisions in Creating TaxonomiesMaking Decisions in Creating Taxonomies
Making Decisions in Creating Taxonomies
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 

Taxonomies and Folksonomies

  • 1. Indexing & Tagging: Taxonomies & Folksonomies National Federation of Advanced Information Services (NFAIS) Improving the User Search Experience October 13, 2010, Philadelphia, Pennsylvania Heather Hedden, Taxonomy Consultant
  • 2. Heather Hedden’s Background Periodical database indexer for Information Access Company (IAC): Trade & Industry and PROMT Controlled vocabulary editor, IAC/Gale Freelance indexer and taxonomy consultant Taxonomy manager, Viziant and First Wind Continuing education instructor, Simmons GSLIS Author, The Accidental Taxonomist © 2010 Heather Hedden 2
  • 3. What’s the difference? Cataloging with a CV Indexing with a CV Keywording Tagging © 2010 Heather Hedden 3
  • 4. Definitions Controlled Vocabulary – A controlled list of terms for concepts, usually with nonpreferred terms (“synonyms”). May or may not have structure and relationships between terms. Broader and includes taxonomies. Taxonomy – A hierarchical structure of terms, which may or many not include nonpreferred terms. Has popularly replaced “controlled vocabulary” as a broader concept, which may or may not be hierarchical. © 2010 Heather Hedden 4
  • 5. Indexing Closed does not use CV; open does Trained, dedicated indexers Indexing guidelines and policies used Controlled vocabularies designed to support Indexer-focused scope notes Nonpreferred terms, displayed Extensive hierarchical & associative relationships Browsable, alphabetical display, type-ahead, etc. © 2010 Heather Hedden 5
  • 6. Indexers and Index Terms Indexers are closest to the content They see new concepts and terminology They need methods of: suggesting additional nonpreferred terms and relationships between terms suggesting new terms to add to the controlled vocabulary adding terms that supplement the controlled vocabulary © 2010 Heather Hedden 6
  • 7. Levels of Vocabulary Management 1. Only controlled vocabulary used Indexer suggests terms for approval prior to use. 2. Controlled vocabulary plus unapproved terms Indexer may use terms prior to approval. 3. Controlled vocabulary plus keywords Indexer supplements with terms never reviewed. 4. Controlled vocabulary plus shared keywords Indexer supplements with terms never reviewed, but re-usable. © 2010 Heather Hedden 7
  • 8. Levels of Vocabulary Management 1. Only controlled vocabulary used Simple to manage and implement Indexers email taxonomist with suggestions For small controlled vocabularies For limited scope, relatively static content For small indexing operations © 2010 Heather Hedden 8
  • 9. Levels of Vocabulary Management 2. Controlled vocabulary plus unapproved terms Indexer may create “candidate”, “unapproved,” or “override terms” for immediate use, but also entered into the system for taxonomist review Unapproved indexed terms could convert to nonpreferred terms. Can be restricted to only certain types of terms (usually named entities) or all terms For large indexing operations, large controlled vocabularies, extensive content More technological complex to implement © 2010 Heather Hedden 9
  • 10. Levels of Vocabulary Management 3. Controlled vocabulary plus keywords Indexer supplements controlled vocabulary indexing with any keywords entered into a separate field Keywords may be for new concepts, but are often for more specific concepts and names Keywords do not become part of the controlled vocabulary. Taxonomist may or may not look at them. Varies: (a) more indexing is with controlled vocabulary or (b) more with keywords and CV is only broad categories Less indexing technology, but more complex end-user retrieval options (3 types: taxonomy, keywords, freetext) Variants/synonyms are not controlled, redundant © 2010 Heather Hedden 10
  • 11. Levels of Vocabulary Management 4. Controlled vocabulary and shared keywords Indexer supplements controlled vocabulary indexing with any keywords entered into a separate field Keywords do not become part of the controlled vocabulary, but are stored in another database. All keywords become immediately available for all indexers to use and reuse Indexers can browse the list of previously used keywords; redundancies are reduced, not eliminated © 2010 Heather Hedden 11
  • 12. Folksonomy Shared keywords, created and shared by those “indexing” Indexing is not by indexers, but by end-users, consumers of the content. Common people, the “folk.” “Tagging” Users might also contribute structure, creating (hierarchical) relationships Originally defined as: “user-created bottom-up categorical structure development with an emergent thesaurus” --Thomas Vander Wal, July 2004 © 2010 Heather Hedden 12
  • 13. Folksonomy: Who Contributes to it? Indexers Editors Employees End-user subscribers Any end-users © 2010 Heather Hedden 13
  • 14. Folksonomy: Where used? Public web sites of high volume content and users: Delicious (http://delicious.com) Connotea (www.connotea.org) Diigo (www.diigo.com) Flickr (www.flickr.com) LibraryThing (www.librarything.com) Large enterprises that want to foster collaboration and innovation Subscription content providers? © 2010 Heather Hedden 14
  • 15. Social Tagging Also called collaborative tagging, social classification, social indexing Folksonomies plus Web 2.0/social networking features Social communities can be built around shared sets of popular content or popular tags Tags for popularity ratings Now moving into enterprises © 2010 Heather Hedden 15
  • 16. Folksonomies/Social tagging Advantages Reflects trends, up-to-date, can monitor change and popularity. Dynamic. Cheaper and quicker than building and maintaining a taxonomy Facilitates workplace democracy and the distribution of management tasks Responsive to user needs © 2010 Heather Hedden 16
  • 17. Folksonomies/Social tagging Disadvantages Inconsistent – precision & recall deficiencies Biased Requires critical mass of involvement to be useful Does not scale well to a large volume of content © 2010 Heather Hedden 17
  • 18. Folksonomies/Social tagging Solutions/trends: Some degree of vocabulary control Applicable to certain areas of content, not all Partial vocabulary control: Dual taxonomy & folksonomy system Requires policy of taxonomy first Single vocabulary system with some terms managed/edited, and some not (yet) like a wiki with editors © 2010 Heather Hedden 18
  • 19. Conclusions Different people often apply taxonomies or folksonomies. Taxonomies and Folksonomies may supplement each other. Technology facilitates the application of both. Users understand the distinction and can use both. © 2010 Heather Hedden 19