SlideShare a Scribd company logo
1 of 20
Download to read offline
Why Digitization Increases the Value
       of Print Collections


           Baden Hughes
       The University of Melbourne
      baden.hughes@unimelb.edu.au

           HUGHES - EDUCAUSE/CARM - May 2007   1
Overview


• The properties of mass digitization initiatives
• Impact analysis
   – Coverage
   – Integration
   – Local digitization efforts
• Digitization, Libraries and the Long Tail
• Increasing Value of Print Collections

                       HUGHES - EDUCAUSE/CARM - May 2007   2
Background


• Observe the intersection of collection
  management issues (space, access) with the
  emergence of a mass digitization agenda driven
  by Google et al
• Consider the changing nature of the
  “competition” across multiple dimensions
     • Information discovery mechanism
     • Information delivery platform
     • Information repository

                     HUGHES - EDUCAUSE/CARM - May 2007   3
Mass Digitization – the
                        30,000 foot view

•   What are Google et al doing ?
    – Partnering with libraries and publishers to digitize print materials
        • Google et al bring scaling experience
        • Libraries and publishers bring content experience
    – Blending digitized materials with generic web content
    – Delivering digitized material online in a number of ways largely
      determined by copyright etc status
        • Full content, limited (a few pages), snippet (search in context), bibliographic
          reference only
    – Providing links from digitized objects to fulfillment channels such as
      libraries, book stores
    – Wrapping delivery of digitized materials within the Web 2.0 delivery
      model, allowing for richer interactions around digitized materials similar
      to those elsewhere on the web

                               HUGHES - EDUCAUSE/CARM - May 2007                        4
Exclusions

• Mass digitization efforts are not homogeneously
  addressing all print content
   – Materials which are typically reference-oriented rather than
     “popular”
   – Special collections typically excluded
   – Differentiation around delivery mechanisms based on legal
     constraints (eg full text vs bibliographic record)
   – High quality but non-archival standard scanning (compromise
     between quality and scalability)
   – Typically not interested in serial collections
   – Not exhaustive: initial targets are around 12-15% of the
     estimated 65M titles in print

                        HUGHES - EDUCAUSE/CARM - May 2007           5
Assumptions & Options


• Assuming
  – Libraries will typically not be able to compete with the
    mass digitization initiatives in terms of resource
    scaling
  – Libraries are generally interested in digitization as a
    mechanism to increase information access
• What options do we have ?
  – Join a mass digitization effort
  – Focus local digitization around distinguishing content
                     HUGHES - EDUCAUSE/CARM - May 2007         6
Impact: Coverage

• Which objects in our print collections are going to be
  digitized under broad coverage initiatives ?
   – Which libraries are currently being engaged by mass digitization
     efforts ?
   – What public statements have been made about the scope of
     mass digitization with those libraries ?
   – What are the timeframes for implementation ?
   – How many of the print holdings are likely to fall into which
     access category once digitized ?
   – How are the print collection holdings of these other libraries
     similar to, or different from, our local library collections ?

                        HUGHES - EDUCAUSE/CARM - May 2007               7
Outcomes: Coverage

•   Different types of libraries are likely to be impacted differently by
    mass digitization owing to considerably different collections
•   Access constraints will act as a high level classification – inclusion
    of an print item within a mass digitization initiative does not imply the
    same level of access online as in print
     – Which types of access are being offered for items in your print collection
       ? Take N items and search for them in a mass digitization repository,
       and note the type of access available …
•   Understanding the distinguishing aspects of your local collections
    becomes important, because anything that is approximately unique
    won’t be covered
     – What proportion of your print collection is not available anywhere else,
       or available in restricted quantities or locations ? WorldCat is a starting
       point …

                              HUGHES - EDUCAUSE/CARM - May 2007                      8
Impact: Integration

• How can we integrate these remotely held digitized
  objects into our local catalogues as primary access
  objects ?
   – Which print items within our collections are available from mass
     digitization initiatives ?
       • Take N items and search for them in a mass digitization repository,
         noting presence or absence
   – How can we create robust linkages between our catalogue
     descriptions and remotely delivered digitized print materials ?
   – What levels of integration support are going to be offered by
     mass digitization initiatives on along term basis vs being
     experimental ?

                          HUGHES - EDUCAUSE/CARM - May 2007                    9
Outcomes: Integration

• ILMS already support the core functional requirements
  for integration
   – a separation of description from actual items
   – Multiple instantiations per item, with unique identifiers
   – These are the basic requirements for integrating digitized
     materials anyway, so major system changes are an unlikely
     prerequisite
• All digitization projects have a unique identifier per item,
  but there’s no robust public APIs for resolving between
  identifier schemes (eg ISBNs to system identifiers)
   – Earlier experiences with Google et al show that APIs are
     typically a second generation feature

                        HUGHES - EDUCAUSE/CARM - May 2007         10
Impact: Local Digitization
                    Efforts

• Is there a place for local digitization efforts ?
   – What are the underlying business drivers for local
     digization ?
   – What are the expected outcomes of local digitization
     efforts ?
   – Which types of resources are targeted and how do
     these differ from those under mass digitization ?
   – What types of interactions are going to be supported
     ?
   – What resources are required to be “competitive” ?

                     HUGHES - EDUCAUSE/CARM - May 2007      11
Outcomes: Local
                     Digitization Efforts

• Where is the competitive advantage of local digitization
  schemes ?
   –   Greater focus on locally distinct collections
   –   Higher quality digitization practices
   –   Leveraging deeper linking options between resources
   –   Heterogeneous treatment – customising approach and delivery
       mechanism on a per item or per collection segment basis
• If you can identify a competitive advantage in any of
  these areas, then there’s probably a positive ROI
  proposition for local digitization, but only of items which
  are not covered by more general programs

                         HUGHES - EDUCAUSE/CARM - May 2007           12
The Long Tail Revisited

•   “The Long Tail” is the popular
    name for a commonly recognised
    feature of statistical distributions
    (Zipf, power laws, Pareto
    distributions, general Levy
    distributions)
•   In such distributions, a high
    frequency population increment is
    followed by progressively lower
    frequency population increments
•   It is not uncommon for the lower
    frequency items in total to form the
    majority of the overall



                              HUGHES - EDUCAUSE/CARM - May 2007   13
Long Tail Economics

•   Brynjolfsson, Hu and Smith (2003)
     – observed Long Tail distribution in the sales figures of Amazon.com
       exploring the relationship between number of sales and sales rankings
     – for Amazon.com, the majority of sales came from obscure items not
       widely available
•   Anderson (2004, 2006)
     – In terms of sales, products in low demand or having low sales volume
       can effectively make up a market share that rivals or exceeds the
       relatively few current best selling items assuming a distribution channel
       is large enough
•   Brynjolfsson, Hu and Simester (2006)
     – Impact of information technology in the form of internet search
       technologies which reduce the cost of item discovery can affect the
       overall distribution of product sales
•   Do these observations hold for print collections in libraries as well ?
                             HUGHES - EDUCAUSE/CARM - May 2007                 14
The Long Tail and Collection
                       Management

•   From the supply side, the factor which determines a distribution has
    a long tail is the cost of inventory and storage
     – We already know this is a problem for managing print collections in
       libraries
•   Where inventory and storage costs are insignificant it becomes
    economically viable to deliver relatively unpopular items
     – Is this a common driver for digitization at the global and local levels ?
•   On the demand side, tools such as search engines, recommender
    software, sampling tools allow customers to locate products outside
    their local geographic area
     – Consider changing usage patterns of OPACs and convergence of
       OPAC and other search technologies


                              HUGHES - EDUCAUSE/CARM - May 2007                    15
The Effect of Digitization on
                   the Long Tail

• Brynjolfsson, Hu and Simester (2006) further showed
  that the application of information technology in the form
  of internet search technologies which reduce the cost of
  item discovery can affect the overall distribution of
  product sales, in effect creating an even longer tail
  compared to traditional product delivery channels
• Digitization makes content discovery and access easier
• Digitization reduces the opportunity cost of every
  additional item in terms of inventory maintenance and
  storage
                      HUGHES - EDUCAUSE/CARM - May 2007    16
Digitization and the Long
                          Tail

• Digitization facilities the rediscovery of long tail
  conditions in the information need fulfillment space
   – At a certain scaling point, the opportunity cost of an storing and
     delivering an additional electronic item is negligible
       • Many libraries will find that mass digitization approaches offer
         considerably better economies of scale than they can realise
         themselves
   – Access is facilitated through multiple services
       • Many libraries will discover a shift to discovery mechanisms
         provided by third parties over standards-based virtual collecions
         rather than OPACss


                           HUGHES - EDUCAUSE/CARM - May 2007                 17
The Long Tail and Libraries


• Libraries are traditionally in the market of
  information need fulfillment with little
  consideration for end-user cost recovery
• It has been repeatedly recognised that the
  economics of library services are affected
  by long tail distributions
  – Consider: low circulation items and storage
    impacts      HUGHES - EDUCAUSE/CARM - May 2007   18

• Digitization extends the long tail paradigm
The Increasing Value of Print


• Assuming that mass digitization is successful, then some
  level of access to a much larger number digitized print
  resources will be realised
   – Flow on effect from digitization in terms of print access
• The level of access is the important differentiator for
  retaining and revaluing print collections
   – The general availability of print collections with unrestricted
     access via library collections will continue to offer competitive
     advantage to libraries despite digitization efforts
       • Analogous to the situation with uncatalogued holdings – “if it can’t
         be found, then it might as well not exist” becomes “if we can find it
         but not access it, then is this a tangible improvement”

                           HUGHES - EDUCAUSE/CARM - May 2007                     19
Questions, Comments ?




    HUGHES - EDUCAUSE/CARM - May 2007   20

More Related Content

Viewers also liked

Analysis of social computing applications in the EU
Analysis of social computing applications in the EUAnalysis of social computing applications in the EU
Analysis of social computing applications in the EUSanoma Netherlands
 
IA Parent Presentation - March 27th, 2014
IA Parent Presentation - March 27th, 2014IA Parent Presentation - March 27th, 2014
IA Parent Presentation - March 27th, 2014Corey Topf
 
Iad1 0809Q2 Hoorcollege2
Iad1 0809Q2 Hoorcollege2Iad1 0809Q2 Hoorcollege2
Iad1 0809Q2 Hoorcollege2Hans Kemp
 
0708 Iad2 Q4 Hoorcollege1
0708 Iad2 Q4 Hoorcollege10708 Iad2 Q4 Hoorcollege1
0708 Iad2 Q4 Hoorcollege1Hans Kemp
 
Multimodal, Crossmedia, Multi Platform
Multimodal, Crossmedia, Multi PlatformMultimodal, Crossmedia, Multi Platform
Multimodal, Crossmedia, Multi PlatformHans Kemp
 
Week 9 Sponges
Week 9 SpongesWeek 9 Sponges
Week 9 SpongesCorey Topf
 
Writers Workshop
Writers WorkshopWriters Workshop
Writers WorkshopCorey Topf
 
C D L A S E R E N A 2008
C D  L A  S E R E N A 2008C D  L A  S E R E N A 2008
C D L A S E R E N A 2008cdlaserena
 
Iad2 0809 Q3 Hoorcollege 1 Typen Navigatie En Patronen
Iad2 0809 Q3 Hoorcollege 1   Typen Navigatie En PatronenIad2 0809 Q3 Hoorcollege 1   Typen Navigatie En Patronen
Iad2 0809 Q3 Hoorcollege 1 Typen Navigatie En PatronenHans Kemp
 
Vocabulary MYP9
Vocabulary MYP9Vocabulary MYP9
Vocabulary MYP9Corey Topf
 
Techo Club Seniors
Techo Club SeniorsTecho Club Seniors
Techo Club SeniorsCorey Topf
 
0809 q2 minor assignment teams
0809 q2 minor assignment teams0809 q2 minor assignment teams
0809 q2 minor assignment teamsHans Kemp
 
0708 De gebruiker heeft altijd gelijk - user-centered design
0708 De gebruiker heeft altijd gelijk - user-centered design0708 De gebruiker heeft altijd gelijk - user-centered design
0708 De gebruiker heeft altijd gelijk - user-centered designHans Kemp
 
Vocabulary ofmiceandmen
Vocabulary ofmiceandmenVocabulary ofmiceandmen
Vocabulary ofmiceandmenCorey Topf
 
Roosevelt innovation academy
Roosevelt innovation academyRoosevelt innovation academy
Roosevelt innovation academyCorey Topf
 
Week 6 Sponges
Week 6 SpongesWeek 6 Sponges
Week 6 SpongesCorey Topf
 
Durftevragen
DurftevragenDurftevragen
Durftevragenroemen
 
Week 29 Sponges
Week 29 SpongesWeek 29 Sponges
Week 29 SpongesCorey Topf
 

Viewers also liked (20)

Analysis of social computing applications in the EU
Analysis of social computing applications in the EUAnalysis of social computing applications in the EU
Analysis of social computing applications in the EU
 
IA Parent Presentation - March 27th, 2014
IA Parent Presentation - March 27th, 2014IA Parent Presentation - March 27th, 2014
IA Parent Presentation - March 27th, 2014
 
Iad1 0809Q2 Hoorcollege2
Iad1 0809Q2 Hoorcollege2Iad1 0809Q2 Hoorcollege2
Iad1 0809Q2 Hoorcollege2
 
0708 Iad2 Q4 Hoorcollege1
0708 Iad2 Q4 Hoorcollege10708 Iad2 Q4 Hoorcollege1
0708 Iad2 Q4 Hoorcollege1
 
Multimodal, Crossmedia, Multi Platform
Multimodal, Crossmedia, Multi PlatformMultimodal, Crossmedia, Multi Platform
Multimodal, Crossmedia, Multi Platform
 
Week 9 Sponges
Week 9 SpongesWeek 9 Sponges
Week 9 Sponges
 
Incroyable
IncroyableIncroyable
Incroyable
 
Writers Workshop
Writers WorkshopWriters Workshop
Writers Workshop
 
Ppt
PptPpt
Ppt
 
C D L A S E R E N A 2008
C D  L A  S E R E N A 2008C D  L A  S E R E N A 2008
C D L A S E R E N A 2008
 
Iad2 0809 Q3 Hoorcollege 1 Typen Navigatie En Patronen
Iad2 0809 Q3 Hoorcollege 1   Typen Navigatie En PatronenIad2 0809 Q3 Hoorcollege 1   Typen Navigatie En Patronen
Iad2 0809 Q3 Hoorcollege 1 Typen Navigatie En Patronen
 
Vocabulary MYP9
Vocabulary MYP9Vocabulary MYP9
Vocabulary MYP9
 
Techo Club Seniors
Techo Club SeniorsTecho Club Seniors
Techo Club Seniors
 
0809 q2 minor assignment teams
0809 q2 minor assignment teams0809 q2 minor assignment teams
0809 q2 minor assignment teams
 
0708 De gebruiker heeft altijd gelijk - user-centered design
0708 De gebruiker heeft altijd gelijk - user-centered design0708 De gebruiker heeft altijd gelijk - user-centered design
0708 De gebruiker heeft altijd gelijk - user-centered design
 
Vocabulary ofmiceandmen
Vocabulary ofmiceandmenVocabulary ofmiceandmen
Vocabulary ofmiceandmen
 
Roosevelt innovation academy
Roosevelt innovation academyRoosevelt innovation academy
Roosevelt innovation academy
 
Week 6 Sponges
Week 6 SpongesWeek 6 Sponges
Week 6 Sponges
 
Durftevragen
DurftevragenDurftevragen
Durftevragen
 
Week 29 Sponges
Week 29 SpongesWeek 29 Sponges
Week 29 Sponges
 

Similar to Why Digitization Increases the Value of Print Collections

Why Digization Increases the Value of Print Collections"
Why Digization Increases the Value of Print Collections"Why Digization Increases the Value of Print Collections"
Why Digization Increases the Value of Print Collections"guest6ce11f
 
Collective Collections: Right-scaling Cooperative Stewardship”
Collective Collections: Right-scaling Cooperative Stewardship”Collective Collections: Right-scaling Cooperative Stewardship”
Collective Collections: Right-scaling Cooperative Stewardship”Cathal McCauley
 
Collective collections: rightscaling cooperative stewardship
Collective collections: rightscaling cooperative stewardshipCollective collections: rightscaling cooperative stewardship
Collective collections: rightscaling cooperative stewardshipConstance Malpas
 
Taiga4lightningtalks
Taiga4lightningtalksTaiga4lightningtalks
Taiga4lightningtalkskantelman
 
The Effects of Cross-Pollination : How non-library mass market services are c...
The Effects of Cross-Pollination : How non-library mass market services are c...The Effects of Cross-Pollination : How non-library mass market services are c...
The Effects of Cross-Pollination : How non-library mass market services are c...Baden Hughes
 
An approach to knowledge management, learning and communication - Case Study ...
An approach to knowledge management, learning and communication - Case Study ...An approach to knowledge management, learning and communication - Case Study ...
An approach to knowledge management, learning and communication - Case Study ...Richard Vines
 
Moving Shared Print to the Network Level
Moving Shared Print to the Network LevelMoving Shared Print to the Network Level
Moving Shared Print to the Network LevelMaine_SharedCollections
 
OpenAIRE-COAR conference 2014: Aligning Repository Networks - RED CLARA/LaRef...
OpenAIRE-COAR conference 2014: Aligning Repository Networks - RED CLARA/LaRef...OpenAIRE-COAR conference 2014: Aligning Repository Networks - RED CLARA/LaRef...
OpenAIRE-COAR conference 2014: Aligning Repository Networks - RED CLARA/LaRef...OpenAIRE
 
Notes from breakout session group 2
Notes from breakout session  group 2Notes from breakout session  group 2
Notes from breakout session group 2SarahFahmy
 
Digital library services and the changing environment
Digital library services and the changing environmentDigital library services and the changing environment
Digital library services and the changing environmentJohn MacColl
 
Connecting Users to Collections
Connecting Users to CollectionsConnecting Users to Collections
Connecting Users to CollectionsFSU Libraries
 
OpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation RepositoriesOpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation RepositoriesOpenAIRE
 
Introduction to knowledge sharing systems: considerations for the conceptual ...
Introduction to knowledge sharing systems: considerations for the conceptual ...Introduction to knowledge sharing systems: considerations for the conceptual ...
Introduction to knowledge sharing systems: considerations for the conceptual ...Nikos Manouselis
 
Technology and Education: The emergency of Openness
Technology and Education: The emergency of OpennessTechnology and Education: The emergency of Openness
Technology and Education: The emergency of OpennessREA Brasil
 
OER Policy and Developing Countries
OER Policy and Developing CountriesOER Policy and Developing Countries
OER Policy and Developing CountriesCarolina Rossini
 
collection development policy for e-resources
collection development policy for e-resourcescollection development policy for e-resources
collection development policy for e-resourcesaditi bhandarkar
 
The powers of consortia: scaling capacity, learning, innovation and influence
The powers of consortia: scaling capacity, learning, innovation and influenceThe powers of consortia: scaling capacity, learning, innovation and influence
The powers of consortia: scaling capacity, learning, innovation and influencelisld
 
5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides
5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides
5.17.18 "The 2.5% Commitment: Investing in Open" presentation slidesDuraSpace
 
GI2010 symposium-charvat (sise)
GI2010 symposium-charvat (sise)GI2010 symposium-charvat (sise)
GI2010 symposium-charvat (sise)IGN Vorstand
 

Similar to Why Digitization Increases the Value of Print Collections (20)

Why Digization Increases the Value of Print Collections"
Why Digization Increases the Value of Print Collections"Why Digization Increases the Value of Print Collections"
Why Digization Increases the Value of Print Collections"
 
Collective Collections: Right-scaling Cooperative Stewardship”
Collective Collections: Right-scaling Cooperative Stewardship”Collective Collections: Right-scaling Cooperative Stewardship”
Collective Collections: Right-scaling Cooperative Stewardship”
 
Collective collections: rightscaling cooperative stewardship
Collective collections: rightscaling cooperative stewardshipCollective collections: rightscaling cooperative stewardship
Collective collections: rightscaling cooperative stewardship
 
Taiga4lightningtalks
Taiga4lightningtalksTaiga4lightningtalks
Taiga4lightningtalks
 
The Effects of Cross-Pollination : How non-library mass market services are c...
The Effects of Cross-Pollination : How non-library mass market services are c...The Effects of Cross-Pollination : How non-library mass market services are c...
The Effects of Cross-Pollination : How non-library mass market services are c...
 
An approach to knowledge management, learning and communication - Case Study ...
An approach to knowledge management, learning and communication - Case Study ...An approach to knowledge management, learning and communication - Case Study ...
An approach to knowledge management, learning and communication - Case Study ...
 
Moving Shared Print to the Network Level
Moving Shared Print to the Network LevelMoving Shared Print to the Network Level
Moving Shared Print to the Network Level
 
OpenAIRE-COAR conference 2014: Aligning Repository Networks - RED CLARA/LaRef...
OpenAIRE-COAR conference 2014: Aligning Repository Networks - RED CLARA/LaRef...OpenAIRE-COAR conference 2014: Aligning Repository Networks - RED CLARA/LaRef...
OpenAIRE-COAR conference 2014: Aligning Repository Networks - RED CLARA/LaRef...
 
Digital Library
Digital LibraryDigital Library
Digital Library
 
Notes from breakout session group 2
Notes from breakout session  group 2Notes from breakout session  group 2
Notes from breakout session group 2
 
Digital library services and the changing environment
Digital library services and the changing environmentDigital library services and the changing environment
Digital library services and the changing environment
 
Connecting Users to Collections
Connecting Users to CollectionsConnecting Users to Collections
Connecting Users to Collections
 
OpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation RepositoriesOpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation Repositories
 
Introduction to knowledge sharing systems: considerations for the conceptual ...
Introduction to knowledge sharing systems: considerations for the conceptual ...Introduction to knowledge sharing systems: considerations for the conceptual ...
Introduction to knowledge sharing systems: considerations for the conceptual ...
 
Technology and Education: The emergency of Openness
Technology and Education: The emergency of OpennessTechnology and Education: The emergency of Openness
Technology and Education: The emergency of Openness
 
OER Policy and Developing Countries
OER Policy and Developing CountriesOER Policy and Developing Countries
OER Policy and Developing Countries
 
collection development policy for e-resources
collection development policy for e-resourcescollection development policy for e-resources
collection development policy for e-resources
 
The powers of consortia: scaling capacity, learning, innovation and influence
The powers of consortia: scaling capacity, learning, innovation and influenceThe powers of consortia: scaling capacity, learning, innovation and influence
The powers of consortia: scaling capacity, learning, innovation and influence
 
5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides
5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides
5.17.18 "The 2.5% Commitment: Investing in Open" presentation slides
 
GI2010 symposium-charvat (sise)
GI2010 symposium-charvat (sise)GI2010 symposium-charvat (sise)
GI2010 symposium-charvat (sise)
 

More from Baden Hughes

Closing the Gap: Data Models for Documentary Linguistics
Closing the Gap: Data Models for Documentary LinguisticsClosing the Gap: Data Models for Documentary Linguistics
Closing the Gap: Data Models for Documentary LinguisticsBaden Hughes
 
Managing Perl Installations: A SysAdmin's View
Managing Perl Installations: A SysAdmin's ViewManaging Perl Installations: A SysAdmin's View
Managing Perl Installations: A SysAdmin's ViewBaden Hughes
 
If We're Not There Yet, How Far Do We Have To Go ? Web Metadata at The Univer...
If We're Not There Yet, How Far Do We Have To Go ? Web Metadata at The Univer...If We're Not There Yet, How Far Do We Have To Go ? Web Metadata at The Univer...
If We're Not There Yet, How Far Do We Have To Go ? Web Metadata at The Univer...Baden Hughes
 
Building Computational Grids with Apple’s Xgrid Middleware
Building Computational Grids with Apple’s Xgrid MiddlewareBuilding Computational Grids with Apple’s Xgrid Middleware
Building Computational Grids with Apple’s Xgrid MiddlewareBaden Hughes
 
Functional Requirements for an Interlinear Text Editor
Functional Requirements for an Interlinear Text EditorFunctional Requirements for an Interlinear Text Editor
Functional Requirements for an Interlinear Text EditorBaden Hughes
 
Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Pro...
Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Pro...Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Pro...
Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Pro...Baden Hughes
 
Disambiguating Advanced Computing for Humanities Researchers
Disambiguating Advanced Computing for Humanities ResearchersDisambiguating Advanced Computing for Humanities Researchers
Disambiguating Advanced Computing for Humanities ResearchersBaden Hughes
 
Metadata Quality Evaluation: Experience from the Open Language Archives Commu...
Metadata Quality Evaluation: Experience from the Open Language Archives Commu...Metadata Quality Evaluation: Experience from the Open Language Archives Commu...
Metadata Quality Evaluation: Experience from the Open Language Archives Commu...Baden Hughes
 
Encoding and Presenting Interlinear Text Using XML Technologies
Encoding and Presenting Interlinear Text Using XML TechnologiesEncoding and Presenting Interlinear Text Using XML Technologies
Encoding and Presenting Interlinear Text Using XML TechnologiesBaden Hughes
 
Refactoring Metadata:
Refactoring Metadata:Refactoring Metadata:
Refactoring Metadata:Baden Hughes
 
Towards a Web Search Service for Minority Language Communities
Towards a Web Search Service for Minority Language CommunitiesTowards a Web Search Service for Minority Language Communities
Towards a Web Search Service for Minority Language CommunitiesBaden Hughes
 
Change Management and Versioning in Ontologies
Change Management and Versioning in OntologiesChange Management and Versioning in Ontologies
Change Management and Versioning in OntologiesBaden Hughes
 
Object Reuse and Exchange (ORE) : Experience in the Open Language Archives Co...
Object Reuse and Exchange (ORE) : Experience in the Open Language Archives Co...Object Reuse and Exchange (ORE) : Experience in the Open Language Archives Co...
Object Reuse and Exchange (ORE) : Experience in the Open Language Archives Co...Baden Hughes
 

More from Baden Hughes (13)

Closing the Gap: Data Models for Documentary Linguistics
Closing the Gap: Data Models for Documentary LinguisticsClosing the Gap: Data Models for Documentary Linguistics
Closing the Gap: Data Models for Documentary Linguistics
 
Managing Perl Installations: A SysAdmin's View
Managing Perl Installations: A SysAdmin's ViewManaging Perl Installations: A SysAdmin's View
Managing Perl Installations: A SysAdmin's View
 
If We're Not There Yet, How Far Do We Have To Go ? Web Metadata at The Univer...
If We're Not There Yet, How Far Do We Have To Go ? Web Metadata at The Univer...If We're Not There Yet, How Far Do We Have To Go ? Web Metadata at The Univer...
If We're Not There Yet, How Far Do We Have To Go ? Web Metadata at The Univer...
 
Building Computational Grids with Apple’s Xgrid Middleware
Building Computational Grids with Apple’s Xgrid MiddlewareBuilding Computational Grids with Apple’s Xgrid Middleware
Building Computational Grids with Apple’s Xgrid Middleware
 
Functional Requirements for an Interlinear Text Editor
Functional Requirements for an Interlinear Text EditorFunctional Requirements for an Interlinear Text Editor
Functional Requirements for an Interlinear Text Editor
 
Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Pro...
Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Pro...Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Pro...
Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Pro...
 
Disambiguating Advanced Computing for Humanities Researchers
Disambiguating Advanced Computing for Humanities ResearchersDisambiguating Advanced Computing for Humanities Researchers
Disambiguating Advanced Computing for Humanities Researchers
 
Metadata Quality Evaluation: Experience from the Open Language Archives Commu...
Metadata Quality Evaluation: Experience from the Open Language Archives Commu...Metadata Quality Evaluation: Experience from the Open Language Archives Commu...
Metadata Quality Evaluation: Experience from the Open Language Archives Commu...
 
Encoding and Presenting Interlinear Text Using XML Technologies
Encoding and Presenting Interlinear Text Using XML TechnologiesEncoding and Presenting Interlinear Text Using XML Technologies
Encoding and Presenting Interlinear Text Using XML Technologies
 
Refactoring Metadata:
Refactoring Metadata:Refactoring Metadata:
Refactoring Metadata:
 
Towards a Web Search Service for Minority Language Communities
Towards a Web Search Service for Minority Language CommunitiesTowards a Web Search Service for Minority Language Communities
Towards a Web Search Service for Minority Language Communities
 
Change Management and Versioning in Ontologies
Change Management and Versioning in OntologiesChange Management and Versioning in Ontologies
Change Management and Versioning in Ontologies
 
Object Reuse and Exchange (ORE) : Experience in the Open Language Archives Co...
Object Reuse and Exchange (ORE) : Experience in the Open Language Archives Co...Object Reuse and Exchange (ORE) : Experience in the Open Language Archives Co...
Object Reuse and Exchange (ORE) : Experience in the Open Language Archives Co...
 

Recently uploaded

Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...RKavithamani
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 

Recently uploaded (20)

Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 

Why Digitization Increases the Value of Print Collections

  • 1. Why Digitization Increases the Value of Print Collections Baden Hughes The University of Melbourne baden.hughes@unimelb.edu.au HUGHES - EDUCAUSE/CARM - May 2007 1
  • 2. Overview • The properties of mass digitization initiatives • Impact analysis – Coverage – Integration – Local digitization efforts • Digitization, Libraries and the Long Tail • Increasing Value of Print Collections HUGHES - EDUCAUSE/CARM - May 2007 2
  • 3. Background • Observe the intersection of collection management issues (space, access) with the emergence of a mass digitization agenda driven by Google et al • Consider the changing nature of the “competition” across multiple dimensions • Information discovery mechanism • Information delivery platform • Information repository HUGHES - EDUCAUSE/CARM - May 2007 3
  • 4. Mass Digitization – the 30,000 foot view • What are Google et al doing ? – Partnering with libraries and publishers to digitize print materials • Google et al bring scaling experience • Libraries and publishers bring content experience – Blending digitized materials with generic web content – Delivering digitized material online in a number of ways largely determined by copyright etc status • Full content, limited (a few pages), snippet (search in context), bibliographic reference only – Providing links from digitized objects to fulfillment channels such as libraries, book stores – Wrapping delivery of digitized materials within the Web 2.0 delivery model, allowing for richer interactions around digitized materials similar to those elsewhere on the web HUGHES - EDUCAUSE/CARM - May 2007 4
  • 5. Exclusions • Mass digitization efforts are not homogeneously addressing all print content – Materials which are typically reference-oriented rather than “popular” – Special collections typically excluded – Differentiation around delivery mechanisms based on legal constraints (eg full text vs bibliographic record) – High quality but non-archival standard scanning (compromise between quality and scalability) – Typically not interested in serial collections – Not exhaustive: initial targets are around 12-15% of the estimated 65M titles in print HUGHES - EDUCAUSE/CARM - May 2007 5
  • 6. Assumptions & Options • Assuming – Libraries will typically not be able to compete with the mass digitization initiatives in terms of resource scaling – Libraries are generally interested in digitization as a mechanism to increase information access • What options do we have ? – Join a mass digitization effort – Focus local digitization around distinguishing content HUGHES - EDUCAUSE/CARM - May 2007 6
  • 7. Impact: Coverage • Which objects in our print collections are going to be digitized under broad coverage initiatives ? – Which libraries are currently being engaged by mass digitization efforts ? – What public statements have been made about the scope of mass digitization with those libraries ? – What are the timeframes for implementation ? – How many of the print holdings are likely to fall into which access category once digitized ? – How are the print collection holdings of these other libraries similar to, or different from, our local library collections ? HUGHES - EDUCAUSE/CARM - May 2007 7
  • 8. Outcomes: Coverage • Different types of libraries are likely to be impacted differently by mass digitization owing to considerably different collections • Access constraints will act as a high level classification – inclusion of an print item within a mass digitization initiative does not imply the same level of access online as in print – Which types of access are being offered for items in your print collection ? Take N items and search for them in a mass digitization repository, and note the type of access available … • Understanding the distinguishing aspects of your local collections becomes important, because anything that is approximately unique won’t be covered – What proportion of your print collection is not available anywhere else, or available in restricted quantities or locations ? WorldCat is a starting point … HUGHES - EDUCAUSE/CARM - May 2007 8
  • 9. Impact: Integration • How can we integrate these remotely held digitized objects into our local catalogues as primary access objects ? – Which print items within our collections are available from mass digitization initiatives ? • Take N items and search for them in a mass digitization repository, noting presence or absence – How can we create robust linkages between our catalogue descriptions and remotely delivered digitized print materials ? – What levels of integration support are going to be offered by mass digitization initiatives on along term basis vs being experimental ? HUGHES - EDUCAUSE/CARM - May 2007 9
  • 10. Outcomes: Integration • ILMS already support the core functional requirements for integration – a separation of description from actual items – Multiple instantiations per item, with unique identifiers – These are the basic requirements for integrating digitized materials anyway, so major system changes are an unlikely prerequisite • All digitization projects have a unique identifier per item, but there’s no robust public APIs for resolving between identifier schemes (eg ISBNs to system identifiers) – Earlier experiences with Google et al show that APIs are typically a second generation feature HUGHES - EDUCAUSE/CARM - May 2007 10
  • 11. Impact: Local Digitization Efforts • Is there a place for local digitization efforts ? – What are the underlying business drivers for local digization ? – What are the expected outcomes of local digitization efforts ? – Which types of resources are targeted and how do these differ from those under mass digitization ? – What types of interactions are going to be supported ? – What resources are required to be “competitive” ? HUGHES - EDUCAUSE/CARM - May 2007 11
  • 12. Outcomes: Local Digitization Efforts • Where is the competitive advantage of local digitization schemes ? – Greater focus on locally distinct collections – Higher quality digitization practices – Leveraging deeper linking options between resources – Heterogeneous treatment – customising approach and delivery mechanism on a per item or per collection segment basis • If you can identify a competitive advantage in any of these areas, then there’s probably a positive ROI proposition for local digitization, but only of items which are not covered by more general programs HUGHES - EDUCAUSE/CARM - May 2007 12
  • 13. The Long Tail Revisited • “The Long Tail” is the popular name for a commonly recognised feature of statistical distributions (Zipf, power laws, Pareto distributions, general Levy distributions) • In such distributions, a high frequency population increment is followed by progressively lower frequency population increments • It is not uncommon for the lower frequency items in total to form the majority of the overall HUGHES - EDUCAUSE/CARM - May 2007 13
  • 14. Long Tail Economics • Brynjolfsson, Hu and Smith (2003) – observed Long Tail distribution in the sales figures of Amazon.com exploring the relationship between number of sales and sales rankings – for Amazon.com, the majority of sales came from obscure items not widely available • Anderson (2004, 2006) – In terms of sales, products in low demand or having low sales volume can effectively make up a market share that rivals or exceeds the relatively few current best selling items assuming a distribution channel is large enough • Brynjolfsson, Hu and Simester (2006) – Impact of information technology in the form of internet search technologies which reduce the cost of item discovery can affect the overall distribution of product sales • Do these observations hold for print collections in libraries as well ? HUGHES - EDUCAUSE/CARM - May 2007 14
  • 15. The Long Tail and Collection Management • From the supply side, the factor which determines a distribution has a long tail is the cost of inventory and storage – We already know this is a problem for managing print collections in libraries • Where inventory and storage costs are insignificant it becomes economically viable to deliver relatively unpopular items – Is this a common driver for digitization at the global and local levels ? • On the demand side, tools such as search engines, recommender software, sampling tools allow customers to locate products outside their local geographic area – Consider changing usage patterns of OPACs and convergence of OPAC and other search technologies HUGHES - EDUCAUSE/CARM - May 2007 15
  • 16. The Effect of Digitization on the Long Tail • Brynjolfsson, Hu and Simester (2006) further showed that the application of information technology in the form of internet search technologies which reduce the cost of item discovery can affect the overall distribution of product sales, in effect creating an even longer tail compared to traditional product delivery channels • Digitization makes content discovery and access easier • Digitization reduces the opportunity cost of every additional item in terms of inventory maintenance and storage HUGHES - EDUCAUSE/CARM - May 2007 16
  • 17. Digitization and the Long Tail • Digitization facilities the rediscovery of long tail conditions in the information need fulfillment space – At a certain scaling point, the opportunity cost of an storing and delivering an additional electronic item is negligible • Many libraries will find that mass digitization approaches offer considerably better economies of scale than they can realise themselves – Access is facilitated through multiple services • Many libraries will discover a shift to discovery mechanisms provided by third parties over standards-based virtual collecions rather than OPACss HUGHES - EDUCAUSE/CARM - May 2007 17
  • 18. The Long Tail and Libraries • Libraries are traditionally in the market of information need fulfillment with little consideration for end-user cost recovery • It has been repeatedly recognised that the economics of library services are affected by long tail distributions – Consider: low circulation items and storage impacts HUGHES - EDUCAUSE/CARM - May 2007 18 • Digitization extends the long tail paradigm
  • 19. The Increasing Value of Print • Assuming that mass digitization is successful, then some level of access to a much larger number digitized print resources will be realised – Flow on effect from digitization in terms of print access • The level of access is the important differentiator for retaining and revaluing print collections – The general availability of print collections with unrestricted access via library collections will continue to offer competitive advantage to libraries despite digitization efforts • Analogous to the situation with uncatalogued holdings – “if it can’t be found, then it might as well not exist” becomes “if we can find it but not access it, then is this a tangible improvement” HUGHES - EDUCAUSE/CARM - May 2007 19
  • 20. Questions, Comments ? HUGHES - EDUCAUSE/CARM - May 2007 20