State Library of North Carolina
  • Part of the North Carolina Department of Cultural Resources
  • Work closely/pool resources with the State Archives
  • Digital Information Management Program
CONTENT
  • State Publications
  • Genealogy Research
  • North Caroliniana
STAFF
  • ~ 4.75 FTE
SYSTEMS
  • CONTENTdm
  • Connexion Digital Import
STORAGE
  • Local server (state-supported)
  • Offsite storage (vendor)
Born-Digital     3.25    CONTENTdm Connexion
Digitized         .75    CONTENTdm Project Client
Local Storage     .75
Remote Storage
CONTENT
• We preserve access and master copies
• 1.27 TB, 162,000+ files
• Mostly .tif, .pdf, .jpg, .txt
CONTENT
File structure by “project”
   admindocs
   fulltext
   images_access
   images_master
   images_processed
   metadata

Naming convention
  pubs_serial_annualreportclean2005.pdf
  gen_statefair_lifecharacterthomasruffin1871_0001.tif
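The two example names suggest underscore-delimited parts with an optional numeric page suffix. A minimal sketch of splitting such names, assuming that layout; the field names (collection, group, title, sequence) are hypothetical labels, not the library's own terms:

```python
from pathlib import PurePosixPath


def parse_preservation_name(filename: str) -> dict:
    """Split a file name into underscore-delimited parts (assumed layout)."""
    path = PurePosixPath(filename)
    parts = path.stem.split("_")
    if len(parts) < 3:
        raise ValueError(f"expected at least 3 underscore-separated parts: {filename}")
    # Assumption: a trailing all-digit part is a page/sequence number.
    sequence = None
    if len(parts) > 3 and parts[-1].isdigit():
        sequence = int(parts.pop())
    return {
        "collection": parts[0],   # e.g. "pubs", "gen"
        "group": parts[1],        # e.g. "serial", "statefair"
        "title": "_".join(parts[2:]),
        "sequence": sequence,
        "extension": path.suffix.lstrip("."),
    }


if __name__ == "__main__":
    print(parse_preservation_name("pubs_serial_annualreportclean2005.pdf"))
    print(parse_preservation_name("gen_statefair_lifecharacterthomasruffin1871_0001.tif"))
```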
STORAGE
Local storage
  • managed by department-wide IT
  • includes working & preservation content
  • server is shared, but our directory is restricted
  • daily incremental backups
OCLC’s Digital Archive
  • Began using in 2008
  • Web interface for access
  • FTP or automatic uploads
  • Integrated with CONTENTdm
  • Detailed reporting, broken out by CONTENTdm collection
  • Fixity checks, virus checks
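For readers unfamiliar with the term, a fixity check recomputes a file's checksum and compares it with a previously recorded value. A minimal local sketch, assuming MD5 as the default algorithm; this illustrates the idea only and is not how either vendor implements its checks:

```python
import hashlib
from pathlib import Path


def checksum_of(path: Path, algorithm: str = "md5", chunk_size: int = 1 << 20) -> str:
    """Return the hex digest of a file, read in chunks to keep memory use flat."""
    digest = hashlib.new(algorithm)
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_ok(path: Path, expected: str, algorithm: str = "md5") -> bool:
    """True when the file's current checksum matches the recorded one."""
    return checksum_of(path, algorithm) == expected.strip().lower()
```

Reading in chunks matters for large master .tif files, which may not fit comfortably in memory.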
+
  • Integration with CONTENTdm
  • Fixity checks and virus scans
  • Responsive support
  • Extensive reports
-
  • Integration with CONTENTdm
  • Finding and retrieving items
  • Manifest/batch upload requirement
  • Vendor-side error reporting
  • Verifying storage contents
DuraSpace’s DuraCloud
  • Began using in 2012
  • Web interface for access
  • Web interface or client-side tools for upload
  • Content Management System-agnostic
  • Fixity checks
+
  • Presentation is like a traditional GUI file manager
  • Can designate spaces, permissions
  • Can make a space public
  • Powerful upload tools
  • Fixity scans
  • Robust reporting
  • Easy to get content out
  • Choice of storage services
  • VERY collaborative support
  • Non-profit
-
  • Searching
  • Sorting
  • Verifying storage contents
  • Overwriting isn’t hard to do
  • Batch delete
  • MD5
PREPARATION
[Diagram: CONTENTdm, OCLC DA, and Local server, with a “?” between them]
[Diagram: CONTENTdm, Local server]
1. Exported metadata from CONTENTdm
2. Exported file names from local server
3. Bashed preservation file names, checksums
4. Identified and recovered missing files
1. Exported metadata from CONTENTdm             Onerous to impossible
2. Exported file names from local server        Easy
3. Bashed preservation file names, checksums    Easy but time consuming
4. Identified and recovered missing files       Easy-ish
1. Exported metadata from CONTENTdm             Onerous to impossible
• OCLC had to provide export for largest & most critical collection
• 363 MB tar file -> 18 x 100+ MB csv files
• Added frustration: metadata for compound objects v. multi-page pdfs
2. Exported file names from local server        Easy
3. Bashed preservation file names, checksums    Easy but time consuming
• Spreadsheet gymnastics
• Manual review for filename/checksum inconsistencies
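The filename/checksum review was done by hand in spreadsheets. As a sketch of how the same comparison could be scripted, assume each source has been reduced to a mapping of file name to checksum; the function and key names here are hypothetical:

```python
def find_inconsistencies(local: dict[str, str], remote: dict[str, str]) -> dict[str, list]:
    """Compare two {file name: checksum} manifests.

    checksum_mismatch: same name on both sides, different checksum.
    name_mismatch: same checksum on both sides, filed under different names.
    """
    shared = local.keys() & remote.keys()
    checksum_mismatch = sorted(name for name in shared if local[name] != remote[name])

    # Index the remote manifest by checksum to spot renamed copies.
    remote_by_checksum = {checksum: name for name, checksum in remote.items()}
    name_mismatch = sorted(
        (name, remote_by_checksum[checksum])
        for name, checksum in local.items()
        if name not in remote and checksum in remote_by_checksum
    )
    return {"checksum_mismatch": checksum_mismatch, "name_mismatch": name_mismatch}
```

Both lists would still need human review; the script only narrows down where to look.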
4. Identified and recovered missing files       Easy-ish
• Missing from CONTENTdm? Added by librarians
• Missing from local server? Request to OCLC or re-download from CONTENTdm
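Finding what is missing on each side amounts to two set differences over the exported file names. A small sketch, assuming the names have already been normalized to a common form; the result keys are illustrative:

```python
def missing_files(contentdm_names: set[str], local_names: set[str]) -> dict[str, list[str]]:
    """Report names present in one export but absent from the other."""
    return {
        # On the local server but not in CONTENTdm: candidates to be added.
        "missing_from_contentdm": sorted(local_names - contentdm_names),
        # In CONTENTdm but not on the local server: candidates to request or re-download.
        "missing_from_local": sorted(contentdm_names - local_names),
    }
```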
THE MOVE
[Diagram: Local server, DuraCloud]
1. Tested sync and upload tools
2. Discussed spaces
3. Ran sync tool on local preservation storage
4. Ongoing maintenance: upload tool
1. Tested sync and upload tools                 Easy
• Helped determine flags to manage computer resources during sync
• Verified logging output, permissions
• Helped flesh out local workflow
2. Discussed spaces                             Easy, and Interesting
• Many spaces or few, to accommodate different workflows?
• Assignment of permissions
3. Ran sync tool on local preservation storage  Easy
• Ran continuously for 5 2/3 days
• 94,177 items
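The two figures above imply an average sync rate, worked out below. This is plain arithmetic on the slide's numbers; the slide does not give the size of the synced content, so no bytes-per-second figure is attempted:

```python
from fractions import Fraction


def sync_rate(items: int, days: Fraction) -> tuple[float, float, float]:
    """Return (hours elapsed, items per hour, seconds per item)."""
    hours = float(days * 24)
    return hours, items / hours, hours * 3600 / items


if __name__ == "__main__":
    hours, per_hour, seconds_each = sync_rate(94_177, Fraction(17, 3))  # 5 2/3 days
    print(f"{hours:.0f} hours")                # 136 hours
    print(f"{per_hour:.1f} items/hour")        # 692.5 items/hour
    print(f"{seconds_each:.1f} seconds/item")  # 5.2 seconds/item
```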
4. Ongoing maintenance: upload tool             Easy-ish
• Uploads done weekly and monthly
• Upload tool used to avoid accidental overwriting
• Have to create “mock” file structure
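The slides do not say what the “mock” file structure involves. One plausible reading, stated here as an assumption, is that new files are arranged in a staging directory that mirrors their intended paths in preservation storage before being uploaded. A sketch of that staging step; the function name and the example path are hypothetical, and nothing here calls the actual upload tool:

```python
import shutil
from pathlib import Path


def stage_for_upload(new_files: dict[Path, str], staging_root: Path) -> list[Path]:
    """Copy each new file into staging_root under its intended relative path.

    new_files maps a source file to a relative target such as
    "someproject/images_master/scan_0001.tif" (illustrative only).
    """
    staged = []
    for source, relative_target in new_files.items():
        target = staging_root / relative_target
        # Refuse to replace anything already staged, in the spirit of
        # avoiding accidental overwrites.
        if target.exists():
            raise FileExistsError(f"already staged: {target}")
        target.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(source, target)
        staged.append(target)
    return staged
```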
[Diagram: three Working directories; Staging – Limited Access; Staging; Local server; DuraCloud]
Insights
  • Room for preservation metadata improvement
  • Working with full metadata dumps is problematic
  • Need for more automated monitoring for local storage
  • Integration with CMS not helpful unless FULL integration
  in other words:
  • Streamlined ingest = streamlined preservation
Still more thoughts
  • No, really: manual management and auditing is getting less feasible
  • What is acceptable content loss?
  • What is acceptable preservation metadata error rate?
  • Responsiveness to enhancement requests should be figured into vendor choice
  • At 5 years out, PREMIS lite is just fine
Migrating from OCLC's Digital Archive to DuraCloud
