SlideShare a Scribd company logo
1 of 73
Download to read offline
http://www.niso.org/news/events/2012/nisowebinars/ebooks_preservation/



  Understanding Critical Elements of E-
    books: Acquiring, Sharing, and
              Preserving

         Part 2: Heritage Lost?
  Ensuring the Preservation of E-books

                  May 23, 2012

Speakers: Jeremy York and Sheila Morrissey
HATHITRUST!
                          A Shared Digital Repository!




We’re	
  Preserving	
  the	
  Past,	
  
What	
  About	
  the	
  Present?	
  
    NISO	
  Webinar:	
  Ensuring	
  the	
  Preserva;on	
  of	
  E-­‐Books	
  
                               May	
  23,	
  2012	
  
            Jeremy	
  York,	
  Project	
  Librarian,	
  HathiTrust	
  
Outline	
  
•  About	
  HathiTrust	
  
•  Preserva;on	
  and	
  Access	
  Strategies	
  
•  What	
  about	
  the	
  present?	
  
Partnership	
  
Arizona State University     North Carolina State        University of Connecticut
Baylor University                 University             University of Florida
Boston College               Northwestern University     University of Illinois
Boston University            The Ohio State University   University of Illinois at Chicago
California Digital Library   The Pennsylvania State
                                                         The University of Iowa
Columbia University               University
                             Princeton University        University of Maryland
Cornell University
Dartmouth College            Purdue University           University of Miami
Duke University              Stanford University         University of Michigan
Emory University             Texas A&M University        University of Minnesota
Florida State University     Universidad Complutense     University of Missouri
Getty Research Institute          de Madrid              University of Nebraska-Lincoln
Harvard University Library   University of Arizona       The University of North
Indiana University           University of Calgary                    Carolina at Chapel
Johns Hopkins University     University of California
                                                         Hill
Lafayette College                 Berkeley
                                  Davis                  University of Notre Dame
Library of Congress
Massachusetts Institute of        Irvine                 University of Pennsylvania
     Technology                   Los Angeles            University of Pittsburgh
McGill University`                Merced                 University of Utah
Michigan State University         Riverside              University of Virginia
New York Public Library           San Diego              University of Washington
New York University               San Francisco          University of Wisconsin-
North Carolina Central            Santa Barbara                       Madison
     University                   Santa Cruz	
  
                                                         Utah State University
                             The University of Chicago
                                                         Washington University
                                                         Yale University Library
The	
  Name	
  
•  The	
  meaning	
  behind	
  the	
  name	
  
   –  Hathi	
  (hah-­‐tee)-­‐-­‐Hindi	
  for	
  elephant	
  
   –  Big,	
  strong	
  
   –  Never	
  forgets,	
  wise	
  
   –  Secure	
  
   –  Trustworthy	
  
Strategic	
  
                          Advisory	
  
                           Board	
  

                        Guidance	
  on	
          •  12-­‐member	
  Board	
  of	
  
                        Policy,	
  Planning	
        Governors	
  
   Execu;ve	
  
  CommiVee	
                                      •  Execu;ve	
  CommiVee	
  
                                                  •  Execu;ve	
  Director	
  

Budget/Finances	
  
Decision-­‐making	
        HathiTrust	
  
Digital	
  Repository	
  
•  Launched	
  2008	
  
•  Ini;al	
  focus	
  on	
  digi;zed	
  book	
  and	
  journal	
  
   content	
  
    –  10,309,742	
  total	
  volumes	
  	
  
    –  5,464,306	
  book	
  ;tles	
  
    –  271,119	
  serial	
  ;tles	
  
    –  3,001,018	
  public	
  domain	
  (~29%)	
  
•  “Light”	
  archive	
  
Collec;ons	
  and	
  Collabora;on	
  
•  Comprehensive	
  collec;on	
  
    -  Preserva;on…with	
  Access	
  
•  Shared	
  strategies	
  
    –  Copyright	
  
    –  Collec;on	
  management,	
  development	
  
    –  Preserva;on	
  
    –  Discovery	
  /	
  Use	
  
    –  Bibliographic	
  Indeterminacy	
  
    –  Efficient	
  user	
  services	
  
•  Public	
  Good	
  
Preserva;on	
  and	
  
     Access	
  
Repository	
  Philosophy/Design	
  
•  OAIS/TRAC	
  
•  Consistency	
  
•  Standardiza;on	
  
•  Simplicity	
  (in	
  design,	
  not	
  func;on)	
  
•  Prac;cality	
  
•  Sustainability	
  
What	
  about	
  the	
  
  Present?	
  
Dates	
                                                               Collec;ons	
  




Languages	
  
                        La;n	
       Remaining	
  
       Arabic	
          1%	
        Languages	
  
        2%	
                            14%	
  
                    Italian	
  
   Japanese	
  
                      3%	
  
      3%	
  
         Russian	
                                       English	
  
           4%	
                                           48%	
  

          Chinese	
  
            4%	
  
              Spanish	
  
                5%	
   French	
  
                            7%	
            German	
  
                                              9%	
  
To	
  contribute	
  to	
  the	
  common	
  good	
  by	
  collec;ng,	
  
organizing,	
  preserving,	
  communica(ng,	
  and	
  sharing	
  
the	
  record	
  of	
  human	
  knowledge	
  
•  Rights	
  holders	
  open	
  access	
  	
  
•  Publishers	
  deposit	
  master	
  files	
  
•  Publish	
  directly	
  into	
  the	
  repository	
  
jPach:	
  Journal	
  Publishing	
  in	
  HathiTrust	
  
•  hVp://lib.umich.edu/jpach	
  
•  Package	
  of	
  tools	
  to	
  enable	
  publica;on	
  of	
  open	
  
   access	
  journals	
  
•  Includes	
  modifica;ons	
  to	
  exis;ng	
  code	
  base;	
  
   new	
  components	
  to	
  facilitate	
  ingest,	
  display,	
  
   and	
  discoverability	
  of	
  born-­‐digital	
  open-­‐access	
  
   journal	
  literature	
  
•  Allow	
  integra;on	
  with	
  popular	
  journal	
  
   publishing	
  tools	
  such	
  as	
  Open	
  Journal	
  Systems	
  
   (OJS)	
  
Key	
  Elements	
  
•  Openness	
  
    –  Content	
  must	
  be	
  licensed	
  for	
  perpetual	
  open	
  access	
  
•  Addi;onal	
  formats	
  
    –  Fixity	
  of	
  bitstream	
  guaranteed	
  where	
  preserva;on	
  
       specifica;ons	
  cannot	
  be	
  developed	
  
•  Allow	
  download	
  of	
  content	
  not	
  rendered	
  in	
  the	
  
   interface	
  
•  Support	
  ar;cles	
  and	
  contextual	
  informa;on	
  (lists	
  
   of	
  editors,	
  submission	
  requirements)	
  
•  Support	
  for	
  revisions	
  to	
  content	
  
Publishing	
  into	
  the	
  
   Repository	
  
Higher	
  Educa;on	
  

                     Source	
  /	
  
Editorial	
                              Market	
  
                     Archive	
  
Publishing	
  into	
  the	
  Repository	
  
•  Openness	
  
   –  Con;nual	
  stewardship	
  and	
  access	
  
•  Sustainability	
  
   –  Library	
  as	
  engine	
  of	
  communica;on	
  
How	
  to	
  find	
  out	
  more	
  
•    About:	
  hVp://www.hathitrust.org/about	
  
•    TwiVer:	
  hVp://twiVer.com/hathitrust	
  
•    Facebook:	
  hVp://www.facebook.com/hathitrust	
  
•    Monthly	
  newsleVer:	
  	
  
     –  hVp:www.hathitrust.org/updates	
  
     –  RSS	
  hVp://www.hathitrust.org/updates_rss	
  
•  Contact	
  us:	
  feedback@issues.hathitrust.org	
  
•  Blogs:	
  hVp://www.hathitrust.org/blogs	
  
     –  Large-­‐scale	
  Search	
  
     –  Perspec;ves	
  from	
  HathiTrust	
  
Thank	
  you	
  very	
  much!	
  
File Format Considerations in
 the Preservation of e-Books


              Sheila Morrissey
      Senior Research Developer, Portico
    NISO Webinar: Heritage Lost? Ensuring
         the Preservation of E-books
                May 23, 1012
Portico - Third Party Preservation


                           Portico is among the largest community-
                            supported digital archives in the world.




                        Working with libraries, publishers,
                           and funders, we preserve e-
                           journals, e-books, and other
                          electronic scholarly content to
                        ensure researchers and students
                        will have access to it in the future.
Portico - Participating Content


                          Over 2,000 societies, and associations have
                           committed content to Portico through 147
                                   publishers agreements.

                                       Committed Content




                            »     E-journal titles          13,675
                            »     E-book titles            129,781
                            »     D-collections                 46
Portico – Preserved Content


                                       Preserved Content

                       »    E-journal titles                   9,568
                       »    E-book titles                     16,861
                       »    D-collections                         12



                       »    Archival Units                 19,433,869
                       »    Preserved Files                319,737,011
Portico - Audit and Certification


   In 2010, Portico became
   the first digital
   preservation service to be
   independently audited by
   the Center for Research
   Libraries (CRL) and
   subsequently certified as a
   trusted, reliable digital
   preservation solution that
   serves the needs of the
   library community.
Portico - History

                           2006                     2009
   2002                   Portico                  Portico
Launch of                 ingests                  ingests
Electronic               initial e-               initial e-                     2009
Archiving                 journal                   book                         CRL
 Initiative              content                  content                       audit of
     by                  into the                 into the                      Portico
  JSTOR                  archive                  archive                       begins




                2005                    2007                      2009                        2010
               Portico                 Portico                   Portico                     Portico
              Launched                 makes                   fulfills first                ingests
                                         first                    PCA                       initial d-
                                       trigger                    claim                    collection
                                         title                                              content
                                      available
Digital Preservation



   Digital preservation is the series of management policies and activities
   necessary to ensure the enduring usability, authenticity, discoverability,
   and accessibility of content over the very long-term. The key goals of
   digital preservation include:


        Usability             Authenticity            Discoverability             Accessibility
   •  the intellectual      •  the provenance of      •  the content must      •  the content must be
      content of the item      the content must be       have logical             available for use to
      must remain usable       proven and the            bibliographic            the appropriate
      via the delivery         content an authentic      metadata so that it      community
      mechanism of             replica of the            can be found by end
      current technology       original                  users through time
Preservation: Legal aspects




   Legal right to preserve content
      »    Not always the same as access rights
      »    Specified in contracts
      »    Includes embedded or supplemental files, such as images
      »    DRM removed
Usability - Preserve Intellectual Content
Usability - Preserve Intellectual Content
Usability: Rendition and Delivery



    Content is rendered to support current delivery
      platform, i.e. web browser.


                       … rendered & delivered …




    Rendition engine can be modified to meet new
      technology requirements.
Portico – Another Look at the History
                                                    2009                 2011
                           2006                                         iPad 2
                                                   Portico
   2002                   Portico                  ingests               Kindle
Launch of                 ingests                 initial e-             Fire
Electronic               initial e-                 book                 Nook
Archiving                 journal                 content               Simple
 Initiative              content                                        Touch
     by                  into the                 Kindle 2
  JSTOR                  archive                    Nook                ePub3




                2005                     2007                   2010                  2012
               Portico                 Portico                 iPad 1               Portico
              Launched                  makes                   Nook                ingests
                                          first                 Color              initial d-
                                        trigger                                   collection
                                          title                                    content
                                      available                                      iPad 3
                                       iPhone
                                      Kindle 1
Usability: Anticipated usage …
Usability: … and new usage
Authenticity, Discoverability:
Preservation Context
Context
Context
Context
Context
Context
Context
.
.
.
Formats: Packages
Formats: Packages
Formats: Packages
E-Book Packages in Portico Submissions




  Flat directory
     »  ONIX xml file with bibliographic metadata, one PDF file per book
           Front Cover image JPG files
E-Book Packages in Portico Submissions



  TAR file (multiple books per file)
     »  XML manifest file
     »  One directory for each book,
           Proprietary XML file (3 possible versions of XML) with bibliographic
            metadata,
           Subdirectory with files for front matter “chapters” (XML. PDF, OCR of
            PDF)
           Subdirectory with files for regular “chapters” (XML. PDF, OCR of PDF)
            front
           Subdirectory with files for back matter “chapters” (XML. PDF, OCR of
            PDF)
           Subdirectory with TIFF file for cover image of book
E-Book Packages in Portico Submissions




  ZIP file (sometimes one book per file, sometime multiple
      books)
     »  Sometimes flat (all books at one level)
     »  Sometimes one directory for each book,
           Sometimes cover images (JPG or TIFF)
           Sometimes one PDF for entire book in addition to PDF for each chapter
     »  Sometimes a manifest
Formats: Text Content




               Hello,	
  World!!	
  
Formats: Text Content

  BT
  /H2 <</MCID 0 >>BDC      Hello,	
  World!!	
  
  /CS0 cs 0.31 0.506
  0.741 scn
  /TT0 1 Tf
  -0.004 Tc 0.006 Tw
  12.96 0 0 12.96 72
  697.68 Tm
  [(H)-4(e)-1(l)-1(l)-11
  (o,)-3( W)-15(or)-6
  (l)-11(d!)-12(!)]TJ
  0 Tc 0 Tw 6.481 0 Td
  ( )Tj
  EMC
  ET
Formats: Text Content

  <html>
  <head>                    Hello,	
  World!!	
  
  <style type="text/css">
  <!--
    p { color: #4F81BD;
  font-family: serif;
  font-weight: bold;
  font-size: 13pt; }
    -->
  </style>
  </head>
  <body><p>Hello, World!!
  </p></body>
  </html>
Trade-offs: Expressiveness vs. Simplicity

                                   Hello,	
  World!!	
  
Formats: Rich Content

             Hello,	
  World!!	
  
Formats: Rich Content

  BT


                                              Hello,	
  World!!	
  
  /H2 <</MCID 0 >>BDC
  /CS0 cs 0.31 0.506 0.741 scn
  /TT0 1 Tf
  -0.004 Tc 0.006 Tw 12.96 0 0 12.96 264
  697.68 Tm
  [(H)-4(e)-1(l)-2(l)-11(o,)-3( W)-15(or)-6
  (l)-11(d!)-12(!)]TJ
  0 Tc 0 Tw 6.481 0 Td
  ( )Tj
  EMC
  /P <</MCID 1 >>BDC
  /CS1 cs 0 scn
  /TT1 1 Tf
  11.04 0 0 11.04 72 682.08 Tm
  ( )Tj
  EMC
  /P <</MCID 2 >>BDC
  36.478 -24.185 Td
  ( )Tj
  EMC
  ET
  /Figure <</MCID 3 >>BDC
  q
  /GS0 gs
  336 0 0 252 139.1000061 414.6812744 cm
  /Im0 Do
  Q
  EMC
Formats: Rich Content


                          Hello,	
  World!!	
  




           (iText RUPS)
Formats: Rich Content

  <html>
  <head>
  <style type="text/css">
                             Hello,	
  World!!	
  
  <!--
    p { color: #4F81BD;
  font-family: serif;
  font-weight: bold; font-
  size: 13pt; }-->
  </style>
  </head>
  <body><p>Hello, World!!
  <br/><span><IMG
  width="447" height="336"
  src=“images/
  Image_001.jpg"/></
  span></p></body>
  </html>
Trade-offs: Encapsulation vs. Articulation


            mydir/
                     myFile.pdf



            mydir/
                     myFile.html
                     images/
                            Image01.jpg
E-book formats in Portico Submissions


       PDF
          »  One file per chapter
          »  One file per book
       TIFF
          »  One file per page
       JPEG
          »  One file per page
       XML
          »    For bibliographic metadata
          »    Proprietary
          »    ONIX variants
          »    NLM variants
Looking ahead: EPUB 3


       EPUB 3 (http://idpf.org/epub/30 )



           »  “EPUB defines a means of representing,
              packaging and encoding structured and
              semantically enhanced Web content--
              including HTML5, CSS, SVG, images,
              and other resources-- for distribution in a
              single-file format.”
Looking ahead: EPUB 3


       EPUB 3

          »  Web standards for key component
             technologies
          »  Free and open specification
          »  Must work in at least some appliance
                Outside publisher’s own workflow
EPUB3 Packaging
EPUB3 Formats




  “Profiles” of standard formats for authoring content
     »  XHTML5, SVG 1.1, CSS 2.1, CSS 3
           Constraints (extensions to HTML5, constraints on SVG)
           Specs a “moving target”


  Conforming readers must support rendition of certain formats
     »  Image, audio, video
           Defined fallbacks


  Globalization, Encoding, Fonts
Complications: The New “Browser Wars”




  Amazon
     »  Announces it is replacing MOBI with K8

  iBooks
     »    Different mimetype
     »    Proprietary extension of CSS Media Queries
     »    Proprietary XML namespace
     »    Etc.
Complications: "More What You’d Call ‘Guidelines’
Than Actual Rules”




                  Pirates of the Caribbean: The Black Pearl. The Walt Disney
                  Company (2003)
Questions or
  Comments?

     Sheila Morrissey
sheila.morrissey@ithaka.org
       @sheilaMorr
     www.portico.org

More Related Content

What's hot

A theory of digital library metadata : enrich then filter
A theory of digital library metadata : enrich then filter A theory of digital library metadata : enrich then filter
A theory of digital library metadata : enrich then filter Getaneh Alemu
 
Open access resources in LIS education
Open access resources in LIS educationOpen access resources in LIS education
Open access resources in LIS educationSarika Sawant
 
Open Access, Journal, Institutional Repository and Beyond
Open Access, Journal, Institutional Repository and BeyondOpen Access, Journal, Institutional Repository and Beyond
Open Access, Journal, Institutional Repository and BeyondLeslie Chan
 
Institutional Repositories
Institutional RepositoriesInstitutional Repositories
Institutional RepositoriesNIFT
 
Services for Publishing and Digital products
Services for Publishing  and Digital productsServices for Publishing  and Digital products
Services for Publishing and Digital productsCamille Thomas
 
The library in the life of the user
The library in the life of the userThe library in the life of the user
The library in the life of the userlisld
 
Manage it locally to share it globally: RDM and Wikimedia Commons
Manage it locally to share it globally: RDM and Wikimedia CommonsManage it locally to share it globally: RDM and Wikimedia Commons
Manage it locally to share it globally: RDM and Wikimedia CommonsNick Sheppard
 
A librarian's road map to open access
A librarian's road map to open accessA librarian's road map to open access
A librarian's road map to open accessNick Sheppard
 
Institutional Repositories
Institutional RepositoriesInstitutional Repositories
Institutional RepositoriesSarika Sawant
 
Linked Open Data: Identifying Opportunities
Linked Open Data: Identifying OpportunitiesLinked Open Data: Identifying Opportunities
Linked Open Data: Identifying OpportunitiesLibrary_Connect
 
Ensuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEnsuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEDINA, University of Edinburgh
 

What's hot (20)

Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...
Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...
Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...
 
The Future of Research Communications and e-Scholarship: Are we there yet?
The Future of Research Communications and e-Scholarship: Are we there yet?The Future of Research Communications and e-Scholarship: Are we there yet?
The Future of Research Communications and e-Scholarship: Are we there yet?
 
A theory of digital library metadata : enrich then filter
A theory of digital library metadata : enrich then filter A theory of digital library metadata : enrich then filter
A theory of digital library metadata : enrich then filter
 
Open access resources in LIS education
Open access resources in LIS educationOpen access resources in LIS education
Open access resources in LIS education
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Arlitsch may4-3
Arlitsch may4-3Arlitsch may4-3
Arlitsch may4-3
 
Open Access, Journal, Institutional Repository and Beyond
Open Access, Journal, Institutional Repository and BeyondOpen Access, Journal, Institutional Repository and Beyond
Open Access, Journal, Institutional Repository and Beyond
 
ALA 2016 NISO Standards Update Hillman Bibliographic Roadmap
ALA 2016 NISO Standards Update Hillman Bibliographic RoadmapALA 2016 NISO Standards Update Hillman Bibliographic Roadmap
ALA 2016 NISO Standards Update Hillman Bibliographic Roadmap
 
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
March 18 NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? P...
 
ResourceSync - NISO Update Jan 2014
ResourceSync - NISO Update Jan 2014ResourceSync - NISO Update Jan 2014
ResourceSync - NISO Update Jan 2014
 
Institutional Repositories
Institutional RepositoriesInstitutional Repositories
Institutional Repositories
 
Services for Publishing and Digital products
Services for Publishing  and Digital productsServices for Publishing  and Digital products
Services for Publishing and Digital products
 
The library in the life of the user
The library in the life of the userThe library in the life of the user
The library in the life of the user
 
Manage it locally to share it globally: RDM and Wikimedia Commons
Manage it locally to share it globally: RDM and Wikimedia CommonsManage it locally to share it globally: RDM and Wikimedia Commons
Manage it locally to share it globally: RDM and Wikimedia Commons
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
A librarian's road map to open access
A librarian's road map to open accessA librarian's road map to open access
A librarian's road map to open access
 
Institutional Repositories
Institutional RepositoriesInstitutional Repositories
Institutional Repositories
 
Linked Open Data: Identifying Opportunities
Linked Open Data: Identifying OpportunitiesLinked Open Data: Identifying Opportunities
Linked Open Data: Identifying Opportunities
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Ensuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEnsuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly Resources
 

Similar to NISO Webinar: Understanding Critical Elements of E-books: Part 2: Heritage Lost? Ensuring the Preservation of E-books

Towards a Cloud Library
Towards a Cloud LibraryTowards a Cloud Library
Towards a Cloud LibraryRachel Frick
 
Getting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessGetting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessAbby Clobridge
 
Wikipedia and Libraries: Increasing your Library’s Visibilityi
Wikipedia and Libraries: Increasing your Library’s VisibilityiWikipedia and Libraries: Increasing your Library’s Visibilityi
Wikipedia and Libraries: Increasing your Library’s VisibilityiJake Orlowitz
 
DiFiore: JSTOR & Portico: Committed to preserving the scholarly record , Bing...
DiFiore: JSTOR & Portico: Committed to preserving the scholarly record , Bing...DiFiore: JSTOR & Portico: Committed to preserving the scholarly record , Bing...
DiFiore: JSTOR & Portico: Committed to preserving the scholarly record , Bing...Elizabeth Brown
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management EcosystemJohn Kunze
 
Preserving Our Digital Heritage: Community Action via UK LOCKSS
Preserving Our Digital Heritage: Community Action via UK LOCKSSPreserving Our Digital Heritage: Community Action via UK LOCKSS
Preserving Our Digital Heritage: Community Action via UK LOCKSSEDINA, University of Edinburgh
 
Digital collections and humanities research
Digital collections and humanities researchDigital collections and humanities research
Digital collections and humanities researchHarriett Green
 
How Can Digital Collections Support Shared Print Initiatives?
How Can Digital Collections Support Shared Print Initiatives?How Can Digital Collections Support Shared Print Initiatives?
How Can Digital Collections Support Shared Print Initiatives?Maine_SharedCollections
 
Sarah Michalak, HathiTrust #RLUK14
Sarah Michalak, HathiTrust #RLUK14Sarah Michalak, HathiTrust #RLUK14
Sarah Michalak, HathiTrust #RLUK14ResearchLibrariesUK
 
Change Management for Libraries
Change Management for LibrariesChange Management for Libraries
Change Management for LibrariesThomas King
 
RDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemRDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemASIS&T
 
Lo and Behold: Reveries of a Connected Campus
Lo and Behold: Reveries of a Connected CampusLo and Behold: Reveries of a Connected Campus
Lo and Behold: Reveries of a Connected CampusEwan McAndrew
 
Research methodology workshop may 2012
Research methodology workshop may 2012Research methodology workshop may 2012
Research methodology workshop may 2012Sarika Sawant
 
Contributing to the global commons: Repositories and Wikimedia
Contributing to the global commons: Repositories and WikimediaContributing to the global commons: Repositories and Wikimedia
Contributing to the global commons: Repositories and WikimediaNick Sheppard
 
Open Access and Libraries
Open Access and LibrariesOpen Access and Libraries
Open Access and LibrariesEllyssa Kroski
 

Similar to NISO Webinar: Understanding Critical Elements of E-books: Part 2: Heritage Lost? Ensuring the Preservation of E-books (20)

Towards a Cloud Library
Towards a Cloud LibraryTowards a Cloud Library
Towards a Cloud Library
 
Getting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessGetting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open Access
 
Wikipedia and Libraries: Increasing your Library’s Visibilityi
Wikipedia and Libraries: Increasing your Library’s VisibilityiWikipedia and Libraries: Increasing your Library’s Visibilityi
Wikipedia and Libraries: Increasing your Library’s Visibilityi
 
DiFiore: JSTOR & Portico: Committed to preserving the scholarly record , Bing...
DiFiore: JSTOR & Portico: Committed to preserving the scholarly record , Bing...DiFiore: JSTOR & Portico: Committed to preserving the scholarly record , Bing...
DiFiore: JSTOR & Portico: Committed to preserving the scholarly record , Bing...
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
 
Preserving Our Digital Heritage: Community Action via UK LOCKSS
Preserving Our Digital Heritage: Community Action via UK LOCKSSPreserving Our Digital Heritage: Community Action via UK LOCKSS
Preserving Our Digital Heritage: Community Action via UK LOCKSS
 
Digital collections and humanities research
Digital collections and humanities researchDigital collections and humanities research
Digital collections and humanities research
 
How Can Digital Collections Support Shared Print Initiatives?
How Can Digital Collections Support Shared Print Initiatives?How Can Digital Collections Support Shared Print Initiatives?
How Can Digital Collections Support Shared Print Initiatives?
 
Sarah Michalak, HathiTrust #RLUK14
Sarah Michalak, HathiTrust #RLUK14Sarah Michalak, HathiTrust #RLUK14
Sarah Michalak, HathiTrust #RLUK14
 
Change Management for Libraries
Change Management for LibrariesChange Management for Libraries
Change Management for Libraries
 
Institutional Uses of HathiTrust
Institutional Uses of HathiTrustInstitutional Uses of HathiTrust
Institutional Uses of HathiTrust
 
RDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemRDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management Ecosystem
 
NISO Standards Update @ ALA Midwinter, January 27, 2013 in Seattle, WA
NISO Standards Update @ ALA Midwinter, January 27, 2013 in Seattle, WANISO Standards Update @ ALA Midwinter, January 27, 2013 in Seattle, WA
NISO Standards Update @ ALA Midwinter, January 27, 2013 in Seattle, WA
 
Lo and Behold: Reveries of a Connected Campus
Lo and Behold: Reveries of a Connected CampusLo and Behold: Reveries of a Connected Campus
Lo and Behold: Reveries of a Connected Campus
 
Research methodology workshop may 2012
Research methodology workshop may 2012Research methodology workshop may 2012
Research methodology workshop may 2012
 
Cbhl apr2014
Cbhl apr2014Cbhl apr2014
Cbhl apr2014
 
E journals indest
E journals indestE journals indest
E journals indest
 
Contributing to the global commons: Repositories and Wikimedia
Contributing to the global commons: Repositories and WikimediaContributing to the global commons: Repositories and Wikimedia
Contributing to the global commons: Repositories and Wikimedia
 
Open Access and Libraries
Open Access and LibrariesOpen Access and Libraries
Open Access and Libraries
 
EDINA Serials UKLA SafeNet
EDINA Serials UKLA SafeNetEDINA Serials UKLA SafeNet
EDINA Serials UKLA SafeNet
 

More from National Information Standards Organization (NISO)

More from National Information Standards Organization (NISO) (20)

Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
 
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
 
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
 
Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"
 
Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"
 
Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"
 
Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"
Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"
Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"
 
Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"
Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"
Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"
 

Recently uploaded

Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxnelietumpap1
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 

Recently uploaded (20)

Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptx
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 

NISO Webinar: Understanding Critical Elements of E-books: Part 2: Heritage Lost? Ensuring the Preservation of E-books

  • 1. http://www.niso.org/news/events/2012/nisowebinars/ebooks_preservation/ Understanding Critical Elements of E- books: Acquiring, Sharing, and Preserving Part 2: Heritage Lost? Ensuring the Preservation of E-books May 23, 2012 Speakers: Jeremy York and Sheila Morrissey
  • 2. HATHITRUST! A Shared Digital Repository! We’re  Preserving  the  Past,   What  About  the  Present?   NISO  Webinar:  Ensuring  the  Preserva;on  of  E-­‐Books   May  23,  2012   Jeremy  York,  Project  Librarian,  HathiTrust  
  • 3. Outline   •  About  HathiTrust   •  Preserva;on  and  Access  Strategies   •  What  about  the  present?  
  • 4. Partnership   Arizona State University North Carolina State University of Connecticut Baylor University University University of Florida Boston College Northwestern University University of Illinois Boston University The Ohio State University University of Illinois at Chicago California Digital Library The Pennsylvania State The University of Iowa Columbia University University Princeton University University of Maryland Cornell University Dartmouth College Purdue University University of Miami Duke University Stanford University University of Michigan Emory University Texas A&M University University of Minnesota Florida State University Universidad Complutense University of Missouri Getty Research Institute de Madrid University of Nebraska-Lincoln Harvard University Library University of Arizona The University of North Indiana University University of Calgary Carolina at Chapel Johns Hopkins University University of California Hill Lafayette College Berkeley Davis University of Notre Dame Library of Congress Massachusetts Institute of Irvine University of Pennsylvania Technology Los Angeles University of Pittsburgh McGill University` Merced University of Utah Michigan State University Riverside University of Virginia New York Public Library San Diego University of Washington New York University San Francisco University of Wisconsin- North Carolina Central Santa Barbara Madison University Santa Cruz   Utah State University The University of Chicago Washington University Yale University Library
  • 5. The  Name   •  The  meaning  behind  the  name   –  Hathi  (hah-­‐tee)-­‐-­‐Hindi  for  elephant   –  Big,  strong   –  Never  forgets,  wise   –  Secure   –  Trustworthy  
  • 6. Strategic   Advisory   Board   Guidance  on   •  12-­‐member  Board  of   Policy,  Planning   Governors   Execu;ve   CommiVee   •  Execu;ve  CommiVee   •  Execu;ve  Director   Budget/Finances   Decision-­‐making   HathiTrust  
  • 7. Digital  Repository   •  Launched  2008   •  Ini;al  focus  on  digi;zed  book  and  journal   content   –  10,309,742  total  volumes     –  5,464,306  book  ;tles   –  271,119  serial  ;tles   –  3,001,018  public  domain  (~29%)   •  “Light”  archive  
  • 8. Collec;ons  and  Collabora;on   •  Comprehensive  collec;on   -  Preserva;on…with  Access   •  Shared  strategies   –  Copyright   –  Collec;on  management,  development   –  Preserva;on   –  Discovery  /  Use   –  Bibliographic  Indeterminacy   –  Efficient  user  services   •  Public  Good  
  • 10. Repository  Philosophy/Design   •  OAIS/TRAC   •  Consistency   •  Standardiza;on   •  Simplicity  (in  design,  not  func;on)   •  Prac;cality   •  Sustainability  
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16. What  about  the   Present?  
  • 17. Dates   Collec;ons   Languages   La;n   Remaining   Arabic   1%   Languages   2%   14%   Italian   Japanese   3%   3%   Russian   English   4%   48%   Chinese   4%   Spanish   5%   French   7%   German   9%  
  • 18. To  contribute  to  the  common  good  by  collec;ng,   organizing,  preserving,  communica(ng,  and  sharing   the  record  of  human  knowledge  
  • 19. •  Rights  holders  open  access     •  Publishers  deposit  master  files   •  Publish  directly  into  the  repository  
  • 20. jPach:  Journal  Publishing  in  HathiTrust   •  hVp://lib.umich.edu/jpach   •  Package  of  tools  to  enable  publica;on  of  open   access  journals   •  Includes  modifica;ons  to  exis;ng  code  base;   new  components  to  facilitate  ingest,  display,   and  discoverability  of  born-­‐digital  open-­‐access   journal  literature   •  Allow  integra;on  with  popular  journal   publishing  tools  such  as  Open  Journal  Systems   (OJS)  
  • 21. Key  Elements   •  Openness   –  Content  must  be  licensed  for  perpetual  open  access   •  Addi;onal  formats   –  Fixity  of  bitstream  guaranteed  where  preserva;on   specifica;ons  cannot  be  developed   •  Allow  download  of  content  not  rendered  in  the   interface   •  Support  ar;cles  and  contextual  informa;on  (lists   of  editors,  submission  requirements)   •  Support  for  revisions  to  content  
  • 22.
  • 23.
  • 24. Publishing  into  the   Repository  
  • 25. Higher  Educa;on   Source  /   Editorial   Market   Archive  
  • 26. Publishing  into  the  Repository   •  Openness   –  Con;nual  stewardship  and  access   •  Sustainability   –  Library  as  engine  of  communica;on  
  • 27. How  to  find  out  more   •  About:  hVp://www.hathitrust.org/about   •  TwiVer:  hVp://twiVer.com/hathitrust   •  Facebook:  hVp://www.facebook.com/hathitrust   •  Monthly  newsleVer:     –  hVp:www.hathitrust.org/updates   –  RSS  hVp://www.hathitrust.org/updates_rss   •  Contact  us:  feedback@issues.hathitrust.org   •  Blogs:  hVp://www.hathitrust.org/blogs   –  Large-­‐scale  Search   –  Perspec;ves  from  HathiTrust  
  • 28. Thank  you  very  much!  
  • 29. File Format Considerations in the Preservation of e-Books Sheila Morrissey Senior Research Developer, Portico NISO Webinar: Heritage Lost? Ensuring the Preservation of E-books May 23, 1012
  • 30. Portico - Third Party Preservation Portico is among the largest community- supported digital archives in the world. Working with libraries, publishers, and funders, we preserve e- journals, e-books, and other electronic scholarly content to ensure researchers and students will have access to it in the future.
  • 31. Portico - Participating Content Over 2,000 societies, and associations have committed content to Portico through 147 publishers agreements. Committed Content »  E-journal titles 13,675 »  E-book titles 129,781 »  D-collections 46
  • 32. Portico – Preserved Content Preserved Content »  E-journal titles 9,568 »  E-book titles 16,861 »  D-collections 12 »  Archival Units 19,433,869 »  Preserved Files 319,737,011
  • 33. Portico - Audit and Certification In 2010, Portico became the first digital preservation service to be independently audited by the Center for Research Libraries (CRL) and subsequently certified as a trusted, reliable digital preservation solution that serves the needs of the library community.
  • 34. Portico - History 2006 2009 2002 Portico Portico Launch of ingests ingests Electronic initial e- initial e- 2009 Archiving journal book CRL Initiative content content audit of by into the into the Portico JSTOR archive archive begins 2005 2007 2009 2010 Portico Portico Portico Portico Launched makes fulfills first ingests first PCA initial d- trigger claim collection title content available
  • 35. Digital Preservation Digital preservation is the series of management policies and activities necessary to ensure the enduring usability, authenticity, discoverability, and accessibility of content over the very long-term. The key goals of digital preservation include: Usability Authenticity Discoverability Accessibility •  the intellectual •  the provenance of •  the content must •  the content must be content of the item the content must be have logical available for use to must remain usable proven and the bibliographic the appropriate via the delivery content an authentic metadata so that it community mechanism of replica of the can be found by end current technology original users through time
  • 36. Preservation: Legal aspects Legal right to preserve content »  Not always the same as access rights »  Specified in contracts »  Includes embedded or supplemental files, such as images »  DRM removed
  • 37. Usability - Preserve Intellectual Content
  • 38. Usability - Preserve Intellectual Content
  • 39. Usability: Rendition and Delivery Content is rendered to support current delivery platform, i.e. web browser. … rendered & delivered … Rendition engine can be modified to meet new technology requirements.
  • 40. Portico – Another Look at the History 2009 2011 2006 iPad 2 Portico 2002 Portico ingests Kindle Launch of ingests initial e- Fire Electronic initial e- book Nook Archiving journal content Simple Initiative content Touch by into the Kindle 2 JSTOR archive Nook ePub3 2005 2007 2010 2012 Portico Portico iPad 1 Portico Launched makes Nook ingests first Color initial d- trigger collection title content available iPad 3 iPhone Kindle 1
  • 42. Usability: … and new usage
  • 50. . . .
  • 54. E-Book Packages in Portico Submissions Flat directory »  ONIX xml file with bibliographic metadata, one PDF file per book   Front Cover image JPG files
  • 55. E-Book Packages in Portico Submissions TAR file (multiple books per file) »  XML manifest file »  One directory for each book,   Proprietary XML file (3 possible versions of XML) with bibliographic metadata,   Subdirectory with files for front matter “chapters” (XML. PDF, OCR of PDF)   Subdirectory with files for regular “chapters” (XML. PDF, OCR of PDF) front   Subdirectory with files for back matter “chapters” (XML. PDF, OCR of PDF)   Subdirectory with TIFF file for cover image of book
  • 56. E-Book Packages in Portico Submissions ZIP file (sometimes one book per file, sometime multiple books) »  Sometimes flat (all books at one level) »  Sometimes one directory for each book,   Sometimes cover images (JPG or TIFF)   Sometimes one PDF for entire book in addition to PDF for each chapter »  Sometimes a manifest
  • 57. Formats: Text Content Hello,  World!!  
  • 58. Formats: Text Content BT /H2 <</MCID 0 >>BDC Hello,  World!!   /CS0 cs 0.31 0.506 0.741 scn /TT0 1 Tf -0.004 Tc 0.006 Tw 12.96 0 0 12.96 72 697.68 Tm [(H)-4(e)-1(l)-1(l)-11 (o,)-3( W)-15(or)-6 (l)-11(d!)-12(!)]TJ 0 Tc 0 Tw 6.481 0 Td ( )Tj EMC ET
  • 59. Formats: Text Content <html> <head> Hello,  World!!   <style type="text/css"> <!-- p { color: #4F81BD; font-family: serif; font-weight: bold; font-size: 13pt; } --> </style> </head> <body><p>Hello, World!! </p></body> </html>
  • 60. Trade-offs: Expressiveness vs. Simplicity Hello,  World!!  
  • 61. Formats: Rich Content Hello,  World!!  
  • 62. Formats: Rich Content BT Hello,  World!!   /H2 <</MCID 0 >>BDC /CS0 cs 0.31 0.506 0.741 scn /TT0 1 Tf -0.004 Tc 0.006 Tw 12.96 0 0 12.96 264 697.68 Tm [(H)-4(e)-1(l)-2(l)-11(o,)-3( W)-15(or)-6 (l)-11(d!)-12(!)]TJ 0 Tc 0 Tw 6.481 0 Td ( )Tj EMC /P <</MCID 1 >>BDC /CS1 cs 0 scn /TT1 1 Tf 11.04 0 0 11.04 72 682.08 Tm ( )Tj EMC /P <</MCID 2 >>BDC 36.478 -24.185 Td ( )Tj EMC ET /Figure <</MCID 3 >>BDC q /GS0 gs 336 0 0 252 139.1000061 414.6812744 cm /Im0 Do Q EMC
  • 63. Formats: Rich Content Hello,  World!!   (iText RUPS)
  • 64. Formats: Rich Content <html> <head> <style type="text/css"> Hello,  World!!   <!-- p { color: #4F81BD; font-family: serif; font-weight: bold; font- size: 13pt; }--> </style> </head> <body><p>Hello, World!! <br/><span><IMG width="447" height="336" src=“images/ Image_001.jpg"/></ span></p></body> </html>
  • 65. Trade-offs: Encapsulation vs. Articulation mydir/ myFile.pdf mydir/ myFile.html images/ Image01.jpg
  • 66. E-book formats in Portico Submissions PDF »  One file per chapter »  One file per book TIFF »  One file per page JPEG »  One file per page XML »  For bibliographic metadata »  Proprietary »  ONIX variants »  NLM variants
  • 67. Looking ahead: EPUB 3 EPUB 3 (http://idpf.org/epub/30 ) »  “EPUB defines a means of representing, packaging and encoding structured and semantically enhanced Web content-- including HTML5, CSS, SVG, images, and other resources-- for distribution in a single-file format.”
  • 68. Looking ahead: EPUB 3 EPUB 3 »  Web standards for key component technologies »  Free and open specification »  Must work in at least some appliance   Outside publisher’s own workflow
  • 70. EPUB3 Formats “Profiles” of standard formats for authoring content »  XHTML5, SVG 1.1, CSS 2.1, CSS 3   Constraints (extensions to HTML5, constraints on SVG)   Specs a “moving target” Conforming readers must support rendition of certain formats »  Image, audio, video   Defined fallbacks Globalization, Encoding, Fonts
  • 71. Complications: The New “Browser Wars” Amazon »  Announces it is replacing MOBI with K8 iBooks »  Different mimetype »  Proprietary extension of CSS Media Queries »  Proprietary XML namespace »  Etc.
  • 72. Complications: "More What You’d Call ‘Guidelines’ Than Actual Rules” Pirates of the Caribbean: The Black Pearl. The Walt Disney Company (2003)
  • 73. Questions or Comments? Sheila Morrissey sheila.morrissey@ithaka.org @sheilaMorr www.portico.org