Shows the production workflow currently in use for a bare-metal installation of Archivematica 0.8, without direct access to the AIPstore or DIPstore. Screenshots show the different Archivematica configurations available.
4. Transfers go in, not SIPs
• Verify transfer compliance
• Rename with transfer UUID
• Assign file UUIDs to objects
• Assign checksums and file sizes to objects
• Verify metadata directory checksums
• Generate METS.xml document
• Extract packages
• Scan for viruses
• Sanitize object's file and directory names
• Sanitize transfer name
• Characterize and extract metadata
• Create SIP(s)
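Two of the micro-services listed above, assigning file UUIDs and assigning checksums and file sizes, can be illustrated with a short Python sketch. This is not Archivematica's own code: the function names and the choice of MD5 are assumptions made for illustration only.

```python
import hashlib
import uuid
from pathlib import Path


def md5_and_size(path, chunk_size=1024 * 1024):
    """Return (md5 hex digest, size in bytes), reading the file in chunks."""
    digest = hashlib.md5()
    size = 0
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
            size += len(chunk)
    return digest.hexdigest(), size


def describe_objects(objects_dir):
    """Build one record per file: relative path, new UUID, checksum, size."""
    root = Path(objects_dir)
    records = []
    for path in sorted(root.rglob("*")):
        if not path.is_file():
            continue
        checksum, size = md5_and_size(path)
        records.append({
            "path": path.relative_to(root).as_posix(),
            "uuid": str(uuid.uuid4()),  # random UUID, hypothetical stand-in
            "md5": checksum,
            "size": size,
        })
    return records
```

Reading in chunks keeps memory use flat, which matters for large audio and video masters.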
5. Transfer directory structure (0.8)
Transfer_Name
/logs (only for logs created by Archivematica)
/metadata
checksum.md5 (must have this name)
/submissionDocumentation (optional)
info_about_digitization.xls
/objects
example.MKV
transfer_name.csv
/access (optional)
example.MP4 (must have same name)
transfer_name.csv
processingMCP.xml
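The naming rules noted in this structure can be checked before a transfer is started. The sketch below is a hypothetical pre-flight check, not part of Archivematica. It assumes that checksum files live in metadata, that the objects directory sits directly under the transfer, and that "same name" means the access file matches an object apart from its extension. The location of the access directory is passed in as a parameter because it is an assumption too.

```python
from pathlib import Path


def check_transfer_layout(transfer_dir, access_dir="access"):
    """Return a list of human-readable problems (empty if none found)."""
    root = Path(transfer_dir)
    problems = []

    objects = root / "objects"
    if not objects.is_dir():
        problems.append("no objects directory")
        return problems

    # A checksum file in metadata must be named exactly checksum.md5.
    metadata = root / "metadata"
    if metadata.is_dir():
        for path in sorted(metadata.glob("*.md5")):
            if path.name != "checksum.md5":
                problems.append(f"{path.name} should be named checksum.md5")

    # Each access copy should share its base name with an object.
    access = root / access_dir  # assumed location, adjust as needed
    if access.is_dir():
        object_stems = {p.stem for p in objects.iterdir() if p.is_file()}
        for path in sorted(access.iterdir()):
            if path.is_file() and path.stem not in object_stems:
                problems.append(f"access file {path.name} has no matching object")

    return problems
```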
7. Ingest Planning: organization
• Decide how to organize transfer contents: logical or arbitrary groupings?
• Don’t make AIP too big!
• Organize files into single transfers – we
did this on a network drive
8. Ingest Planning: organization
• TIFFs: arbitrary blocks of ~1,000 for migration; probably logical groups
• Audio & video: fewer files, fewer errors, easier to organize by fonds or series
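Splitting a large set of TIFFs into arbitrary blocks of roughly 1,000 is simple to automate. A minimal sketch follows; the function name and the sorted ordering are assumptions, not the workflow actually used.

```python
def split_into_blocks(filenames, block_size=1000):
    """Split a list of filenames into consecutive blocks of block_size."""
    ordered = sorted(filenames)
    return [
        ordered[start:start + block_size]
        for start in range(0, len(ordered), block_size)
    ]
```

Each block could then be moved into its own transfer directory; the last block simply holds whatever remains.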
9. Ingest Planning: Quality
• Need time for quality control of digital
objects and metadata before ingest
• Set aside problem or “do later” files, such
as those requiring rescanning
10. Ingest Planning: Quality
• Check metadata completeness &
accuracy, image quality: no garbage in!
• Started with an export from the current database: the current workflow is to ingest only items already described to item level
• One item #, one file (and no more)
• Filenames must match the item # and the ingested csv exactly, including spaces, capitalization and extensions
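The filename rule above lends itself to an automated check before ingest. The sketch below compares the files in a directory against a csv using exact, case-sensitive matching. The column name "filename" is hypothetical; substitute whatever header the exported csv actually uses.

```python
import csv
from pathlib import Path


def compare_filenames(objects_dir, csv_path, column="filename"):
    """Return (listed but missing on disk, on disk but not listed)."""
    csv_file = Path(csv_path).resolve()
    with open(csv_file, newline="", encoding="utf-8") as handle:
        listed = {row[column] for row in csv.DictReader(handle)}

    present = {
        path.name
        for path in Path(objects_dir).iterdir()
        # Skip the csv itself in case it sits next to the objects.
        if path.is_file() and path.resolve() != csv_file
    }

    # Plain set comparison: "Item 1.TIF" and "item 1.tif" do not match.
    return sorted(listed - present), sorted(present - listed)
```

Both lists should be empty before the transfer goes in; anything in either list is a mismatch in spelling, spacing, capitalization or extension.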
11. Ingest Planning: Quality
• Using custom MS Access form, volunteers
inspected master & derivative images next
to their descriptions to ensure correct
image
• Image sizes double-checked in case any were too small
12. Ingest Planning: TIFFs
• Separate files not to be
ingested and those to be
ingested as sub-items
• Not for ingest: files we do not own,
files that should never have been
digitized, or files that still need
descriptions
• Sub-items: multi-page,
need a procedure first
16. Configure Archivematica
• Workflow(s): configure for each transfer unless
using the default
• AIP compression: will be the same for all
transfers processing at the same time
• Normalization choices
17. Workflow: processingMCP.xml
• Overrides default processing xml files
used to process born-digital materials
• Can customize to make the processing
faster
• Must always have exactly this name, even
if contents vary
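Because the file name is fixed while the contents vary, one way to manage several workflows is to keep each configuration under a descriptive name and copy it into the transfer under the required name. A minimal sketch, assuming the file belongs at the top level of the transfer directory; the function name and the example source file name are hypothetical.

```python
import shutil
from pathlib import Path

REQUIRED_NAME = "processingMCP.xml"


def install_processing_config(config_file, transfer_dir):
    """Copy a saved configuration into the transfer under the required name."""
    target = Path(transfer_dir) / REQUIRED_NAME
    shutil.copyfile(config_file, target)
    return target
```

For example, install_processing_config("tiff_workflow.xml", "Transfer_Name") would place a copy of tiff_workflow.xml at Transfer_Name/processingMCP.xml.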