This document summarizes best practices for acquiring, preserving, and providing access to born-digital materials. It outlines workflows for the acquisition, accession, arrangement and description, and discovery and access of digital archives. Key steps include donor surveys, disk imaging, file analysis, and producing archival information packages using open-source tools like BitCurator, fiwalk, Bulk Extractor, and Archivematica. The benefits of an open-source approach are collaborative development and standards compliance, while issues include specific technical requirements and changing tools.
PISA-VET launch_El Iza Mohamedou_19 March 2024.pptx
Getting Bits off Disks: Using open source tools to stabilize and prepare born-digital materials for long-term preservation
1. Getting Bits off Disks:
Using open source tools to stabilize and prepare
born-digital materials
for long-term preservation
Sam Meister
University of Montana
Best Practices Exchange 2013 November
13, 2013
15. Disk Imaging
“A single file or storage device containing the
complete contents and structure representing
a data storage medium or device, such as a
hard drive, tape drive, floppy
disk, CD/DVD/BD, or USB flash drive”
42. “an effort to build, test, and analyze systems and software
for incorporating digital forensics methods
into the workflows of a variety of collecting institutions”
56. “a free and open-source digital preservation system
that is designed to maintain standards-based,
long-term access
to collections of digital objects”
66. A&D
Current
•
Integrate Born Digital
materials into existing
A&D process / tools (mix
of Excel, Word, XMetal
XML editor)
Future
•
Determine tools needed
for reviewing content
(data visualization)
•
Integrate Born Digital
materials into collection
management system
68. Lessons Learned
• Embrace iterative approach (use what you have and
get what you need when you need it)
• Capture as much metadata as possible
(descriptive, structural, administrative)
• Start with workflow requirements (what needs to be
done) then test tools (what things will get it done)
• Build flexibility into system (may not always be ideal
scenarios)
69. Open Source - Issues
• May require specific IT environment (Linux)
• Tools likely to change quickly
• User interfaces / experience may be simple
• Will need ongoing support from IT / Systems staff
70. Open Source - Benefits
• Limited initial resources needed to install and test
• Provides opportunity to engage systems / IT in new
areas
• Designed and developed in collaboration with archival
community
• Direct communication channels to contribute to /
modify development roadmap
• Quickly build initial standards-compliant workflow
Sam – discuss donor survey / site visit Types of information captured / purpose of collecting information for appraisal / selection decision-making Each potential acquisition is a new case to be investigated High potential for various types of content and format types Donor survey is tool to capture initial information to assist in determining feasibility of acquiring materials Caveat / Disclamer: Not all acquisition scenarios will allow for use of donor survey tool before acquisition decision made
Current = Word document
FutureWeb Form – exports XML or tabular data To allow for integration / interoperability with collection management / descriptive system
Sam – discuss feasibility assessment process Series of questions to assist in acquisition decision-making Analyzing sample set of files / data may be required to determine answers Ultimate question is resource-based – cost/ / benefit analysis New content / media / format types may require new software / hardware to acquire / accession materials Jenny: briefly review content; introduce customer to accession process; frequent repeat or reluctant customers
Sam – discuss transfer processTwo basic transfersPhysical media and/or Network transfer / agreement / forms Jenny: Transferring within Windows environment (using a server share to isolate files); calculating and comparing checksums; transfer agreement completed. Financial issues.
Current = Word documentDigital Materials Transfer document functions as appendix to deed of gift Documents details of transfer / acquisition process
FutureWeb Form – exports XML or tabular data To allow for integration / interoperability with collection management / descriptive system
Sam – overview of accession steps
Sam – provide overview of current born digital workstation Media drives Use of digital forensics hardware and software Born Digital Log – record / document accession process in Access database discuss disk imaging purpose / function
Sam – provide overview of current born digital workstation Media drives Use of digital forensics hardware and software Born Digital Log – record / document accession process in Access database discuss disk imaging purpose / function Jenny : When we do this, why we mostly don’t
Sam – born digital workstation version 2
Sam – born digital workstation version 2
Sam – 3.5 floppy drive
Sam – 5.25 floppy drive
Sam – zip drive
Sam – write blockers
Sam – overview of disk imaging steps
Sam – give overview of Born digital Log to document accession process
Sam – discuss purpose of Photograph media Documenting label text and artifact characteristics May / may not continue this step / practice in the future
Sam – 5.25 floppy drive
Kryoflux hardware and software as option to capture raw bitstream from unrecognized / unknown filesystems
Sam – overview of analysis steps
Sam – discuss tools used for initial analysis BitCurator
Sam – discuss use of fiwalk to extract / generate filesystem metadata for disk images
Sam – discuss use of fiwalk to extract / generate filesystem metadata for disk images
Sam – discuss use of fiwalk to extract / generate filesystem metadata for disk images
Sam – discuss processing / preparing of data / files and metadata for storageJenny: Currently, AIP is produced manually and stored on Windows drive. Will need to revise process with TRIM. Could make use of Archivematica, but waiting until after ERMS implementation.
Sam – discuss processing / preparing of data / files and metadata for storageArchivematica
Sam – archivematica transfer steps
Sam – overview of archivematica ingest steps
Sam – archivematica storage of AIP
Sam – discuss current and potential future uses of ArchivematicaDescribe continued used in relation to overall digital preservation program development
Sam – describe general A&D strategy Basic steps are same for analog and digital materials
Sam – describe current and future A&D process Current = in development Future = dependent on decision to implement an ACMS