The document provides an overview of digital activities at the British Library, including their digitization policy and process, partnerships, digital preservation efforts, and web archiving program. The British Library aims to digitize a critical mass of content from their collection, including 20 million pages of literature, 1 million newspaper pages, and 4,000 hours of audio recordings in the next 3 years. They prioritize unique, culturally significant content and work with various partners on digitization projects and digital preservation.
26. Thank you. e [email_address] t +44 (0) 20 7412 7182 01101101010101011001110100110110101010101100111010011011010101010110011101001101101010101011001110100110110101010101100111010110
Notas do Editor
Such as TTP and Mss items, like the Lindisfarne Gospels
Top Priority - digitisation of newspapers 20 million pages of 19 th Century literature 1 million pages of historic newspapers in addition to the 3m already digitised; 4,000 hours of Archival Sound Recordings in addition to the 4,000 hours already digitised; 100,000 pages of Greek manuscripts
projects in partnership with publishers, external funding bodies and other sponsors and donors, technology providers, user communities and other content holders. Full costs of the project cost of access, delivery and sustainability. Assess each proposal in terms of: Our drivers for digitisation; Business models, IP and copyright issues, and access to content (through the Library’s Digitisation and Web Monitoring Group); Operational viability, technical standards, outputs and preservation issues
163,992 images in IDP Database (Dec 2008) X books available in reading rooms for Microsoft Books
Microsoft Digitisation – 25 million pages in TIFF format = 1.2 PB Standards for Master files in large scale digitisation projects OCR Accuracy - Burney Newspapers 40% to 50% word
Burney: 1,100 volumes of the earliest known newspapers 1,000,000 pages Purchased in 1818 for £13,500 Digitised from the microfilm This is a common scenario for many Libraries and means that the long-term management of the surrogates takes on a more important aspect. Both of these surrogates have their own problems in terms of deterioration and risk and lend themselves to further analysis.
20 million pages, or roughly 80,000 books
Risk Analysis Build an accurate view of digital assets, provide evidence to prioritise activities Cover physical media, software environments, disk- and file-formats Highlight: 3% of optical media have unreadable files DLP Digital Preservation Project Ensure that DLP exploits new methods and technology Provide expertise DP Change Manager Ensure that Library processes support long-term access to digital content Review and adjust content handling processes Content Stabilisation Project Stabliise legacy content on handheld media; make it ready for DLP Address media failure; damaged items, poor or inconsistent packaging Technology watch Keep up-to-date on file formats, preservation tools and technology E.g. Disk imaging, checksum
Digital Lives AHRC funded project, focusing on the preservation of e-manuscripts PRESERV2 JISC funded project developing demonstrator preservation services, primarily to UK HE. BL involved in an advisory role Partners include TNA, Oxford and Southampton Universities ROAR Demonstrator: http://roar.eprints.org/ (see Preserv profile options) INSPECT Developing methodology to identify and categorise the significant properties of digital objects JISC funded BL involved in an advisory role Workshop at BL PLANETS: € 15m budget, 16 partners, 4 years
The UK Web Archiving Consortium The British Library The National Archives National Library of Wales National Library of Scotland JISC The Wellcome Trust