The National Library of the Netherlands (KB) is mass-digitizing all Dutch publications issued since 1470. This article outlines KB's strategy for making this output publicly available.
Over the next 20 years, the Dutch national library (KB) will mass-digitize all Dutch printed books, newspapers and magazines published since 1470, a total of 730 million pages. Until recently, this work was paid for by public funding alone. To speed things up in a climate of ongoing budget cuts, KB entered into public-private partnerships with both Google and Proquest to digitize 42 million pages by 2013. Besides the availability of funding, digitization priority is determined by a mix of client and institutional needs such as copyright status, uniqueness, institutional capability and user demand.
At the same time, KB is answering user demand for centralized access and content distribution by streamlining its scattered online services portfolio. To this end, KB is developing two strategic lines of action.
* The first is on metadata (searching FOR publications): in 2013, KB will unify metadata searching across all its paper and digital collections via OCLC's WorldCat Local.
* The second is on full-text (searching IN publications): for searching in full-text historic publications (i.e. mass digitization output) KB is currently developing its Platform for Digital Publications. Besides a search engine, it is also a:
  * Presentation environment, associating each full-text object with a standardized webpage and persistent URL, offering a uniform look and feel and a unique reference for all of KB's full-texts. This landing page enables third-party services (e.g. WorldCat Local, Europeana, Google) to refer to objects in a persistent way.
  * Delivery platform, enabling KB to deliver content into users' workflows via APIs and expose it to research communities.
  * Aggregator, enabling KB to set up a network of partners to bring together all Dutch digital books, newspapers and magazines, at the same time supporting Europeana's content aggregation strategy.
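The landing-page idea described above can be illustrated with a small sketch. It assumes a resolver that maps a stable identifier to whatever internal location currently holds the object, so that third-party links keep working when storage changes. The base URL, the identifier format and all class and method names are hypothetical and are not taken from KB's platform.

```python
from dataclasses import dataclass

# Hypothetical resolver address; not a real KB endpoint.
RESOLVER_BASE = "https://resolver.example.org/"


@dataclass(frozen=True)
class LandingPage:
    persistent_id: str     # stable identifier handed out to third parties
    title: str
    current_location: str  # internal address, free to change over time


class Resolver:
    """Maps persistent IDs to landing pages so external links stay valid."""

    def __init__(self):
        self._pages = {}  # persistent_id -> LandingPage

    def register(self, page):
        """Store a landing page and return its persistent URL."""
        self._pages[page.persistent_id] = page
        return RESOLVER_BASE + page.persistent_id

    def relocate(self, persistent_id, new_location):
        """Move an object internally without changing its persistent URL."""
        old = self._pages[persistent_id]
        self._pages[persistent_id] = LandingPage(
            old.persistent_id, old.title, new_location
        )

    def resolve(self, persistent_url):
        """Look up the landing page behind a persistent URL."""
        if not persistent_url.startswith(RESOLVER_BASE):
            raise ValueError("not a persistent URL of this resolver")
        return self._pages[persistent_url[len(RESOLVER_BASE):]]


resolver = Resolver()
url = resolver.register(LandingPage("demo:0001", "Example book", "/store/a/0001"))
resolver.relocate("demo:0001", "/store/b/0001")
print(url, "->", resolver.resolve(url).current_location)
```

The point of the design is that a service such as a catalogue or aggregator only ever stores the persistent URL; the internal location can change without any external link breaking.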
The Big Dutch 20 Year 730 Million Page Digitisation Challenge
1. The Big Dutch 20 Year 730 Million Page Digitisation Challenge
w.alumni.ubc.ca/wp/wp-content/uploads/Gohagan_RLDutchWW_2012_01_HollandWindmillTulips.jpg
10th International Conference on the Book
30th June 2012, Barcelona, Spain
Olaf Janssen, National Library of the Netherlands – olaf.janssen@kb.nl / @ookgezellig / slideshare.net/OlafJanssenNL
11. since 1470
http://marksayers.files.wordpress.com/2011/05/charles-darwin-2.png
12. A whopping 730 pages
http://www.portlandmonthlymag.com/assets/0004/4122/surprised-woman.jpg
http://morristrust.com/wp-content/uploads/2012/04/Surprised-Face.jpg
13. A whopping 730,000 pages
http://www.portlandmonthlymag.com/assets/0004/4122/surprised-woman.jpg
http://morristrust.com/wp-content/uploads/2012/04/Surprised-Face.jpg
14. A whopping 730,000,000 pages
http://www.portlandmonthlymag.com/assets/0004/4122/surprised-woman.jpg
http://morristrust.com/wp-content/uploads/2012/04/Surprised-Face.jpg
15. For the next 7,300 days (approx.)
http://www.picturesfromourpast.com/gallery/RightsManaged/20110817/CKSA011_YC019.jpg
http://imgc.allpostersimages.com/images/P-473-488-90/56/5641/2RYMG00Z/posters/george-marks-surprised-woman-posing-portrait.jpg
16. That’s 100,000 pages every single day!!
http://www.allposters.co.uk/-sp/Man-Wiping-Forehead-Posters_i8018953_.htm
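The daily figure follows directly from the totals on the preceding slides. As a quick check, a minimal sketch, assuming 365-day years (leap days ignored, which is why the slides say "approx."):

```python
TOTAL_PAGES = 730_000_000  # pages to digitise, from the slides
YEARS = 20                 # project duration, from the slides
DAYS_PER_YEAR = 365        # assumption: leap days ignored

total_days = YEARS * DAYS_PER_YEAR
pages_per_day = TOTAL_PAGES // total_days

print(total_days)     # 7300
print(pages_per_day)  # 100000
```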
17. And of course, after digitisation, we want to make many people happy with our content.
http://annakrentz.blogspot.nl/2011/05/dutch-liberation.html
18. I work on this because I believe…
http://annakrentz.blogspot.nl/2011/05/dutch-liberation.html
19. ultimately people want to know who they are.
For that they explore their histories & origins.
I want to help them in exploring these worlds.
http://annakrentz.blogspot.nl/2011/05/dutch-liberation.html
22. The key idea I’d like to share with you today:
How KB goes about tackling these grand challenges…
http://mestadelsbilder.files.wordpress.com/2011/06/dali.jpg
23. First, we are creating digital content ...
http://www.bjp-online.com/IMG/001/110001/getty-archive-collection.jpg?1282665196
34. 2010-today
Mass scale, private & public funding
Proquest partnership
(12M pages, 1450-1700)
http://www.kb.nl/nieuws/2011/proquest-en.html
35. 2010-today
Mass scale, private & public funding
Google partnership
(35M pages, full-text, 1701-1871)
http://www.kb.nl/nieuws/2010/google-en.html
36. OK, so we’re very busy creating loads of digital content …
http://www.bjp-online.com/IMG/001/110001/getty-archive-collection.jpg?1282665196
37. Houston, we’ve a problem!!
http://2.bp.blogspot.com/_BWzuYwiS6-I/TMgeRsFd3mI/AAAAAAAAElw/3cvgbZSPWcs/s1600/doctor+macro+judy+scared.jpg
38. Although we create & store our digital content in a strictly standardized process … (JP2, JPG, XML-OCR, MPEG21, ALTO, PDF … *)
* http://kb.nl/hrd/digitalisering/index-en.html
http://www.electrohype.org/press/pionjar/IBM_System360_Mod_50.jpg
39. … this back-end standardisation is not reflected in the front-end
http://www.electrohype.org/press/pionjar/IBM_System360_Mod_50.jpg
42. KB Treasures
Memory of the Netherlands
KB full-text Historic books
KB full-text Historic newspapers
43. KB Treasures
Memory of the Netherlands
KB full-text Historic books
KB full-text Historic newspapers
Google full-text Historic books
Proquest Historic books
44. To many people KB’s website portfolio feels something like this
http://berichtenuithetverleden.files.wordpress.com/2011/03/escher.jpg
45. Current KB websites are inconsistent in
Images: http://www.corbisimages.com/Search#pg=h+armstrong+roberts
URL-logic
Search logic
Design
Branding
Object presentation
Display of result set
User experience
46. Current KB websites are inconsistent in
Images: http://www.corbisimages.com/Search#pg=h+armstrong+roberts
URL-logic
Search logic
Design
Branding
Object presentation
Display of result set
User experience
Scattered & unrelated collections
47. Current KB websites are inconsistent in
Images: http://www.corbisimages.com/Search#pg=h+armstrong+roberts
URL-logic
Search logic
Design
Branding
Object presentation
Display of result set
User experience
Scattered & unrelated collections
Non-interoperability
48. In short:
Current KB websites don’t meet expectations of modern & future generations
http://www.corbisimages.com/stock-photo/rights-managed/NT3707756/depressed-cheerleader?popup=1
http://www.corbisimages.com/stock-photo/rights-managed/42-20036948/1960s-1970s-seated-baby-in-diaper-with?popup=1
51. KB is implementing 3 lines of action
http://simplehomeschool.net/wp-content/uploads/2011/06/woman_walking_between_bookshelf-e1308357752773.jpg
http://www.corbisimages.com/stock-photo/rights-managed/NT3765115/an-old-hat-trick?popup=1
52. KB is implementing 3 lines of action
1. Unified searching for publications
http://simplehomeschool.net/wp-content/uploads/2011/06/woman_walking_between_bookshelf-e1308357752773.jpg
http://www.corbisimages.com/stock-photo/rights-managed/NT3765115/an-old-hat-trick?popup=1
53. KB is implementing 3 lines of action
1. Unified searching for publications
2. Unified searching in publications
http://simplehomeschool.net/wp-content/uploads/2011/06/woman_walking_between_bookshelf-e1308357752773.jpg
http://www.corbisimages.com/stock-photo/rights-managed/NT3765115/an-old-hat-trick?popup=1
54. KB is implementing 3 lines of action
1. Unified searching for publications
2. Unified searching in publications
3. Unified object presentation
http://simplehomeschool.net/wp-content/uploads/2011/06/woman_walking_between_bookshelf-e1308357752773.jpg
http://www.corbisimages.com/stock-photo/rights-managed/NT3765115/an-old-hat-trick?popup=1
56. 1. Unified searching for publications
Metadata
KB General Catalogue
searching for
• (e-)books
• (e-)magazines
• (e-)newspapers
57. 1. Unified searching for publications
Metadata
KB General Catalogue
searching for
• (e-)books
• (e-)magazines
• (e-)newspapers
MetaLib
searching for
• scholarly e-journals
• licensed 3rd party databases
58. 1. Unified searching for publications
Metadata
KB General Catalogue
searching for
• (e-)books
• (e-)magazines
• (e-)newspapers
MetaLib
searching for
• scholarly e-journals
• licensed 3rd party databases
WorldCat Local
KB’s single starting point for searching for publications
60. 1. Unified searching for publications
Downsides of WorldCat Local
1. No full-text searching
61. 1. Unified searching for publications
Downsides of WorldCat Local
http://www.corbisimages.com/Search#pg=h+armstrong+roberts&p=1&ColorFormat=2&q=sad
1. No full-text searching
2. No object presentation
76. 3. Unified object presentation
Landing page + persistent ID
Landing page
(within Platform for Digital Publications)
77. 3. Unified object presentation
Landing page + persistent ID
persistent ID
Landing page
(within Platform for Digital Publications)
78. 3. Unified object presentation
Landing page + persistent ID
KB metadata search
(via WCLocal)
Landing page
(within Platform for Digital Publications)
79. 3. Unified object presentation
Landing page + persistent ID
KB metadata search
(via WCLocal)
KB full-text search
(via Platform for Digital Publications)
Landing page
(within Platform for Digital Publications)
80. 3. Unified object presentation
Landing page + persistent ID
KB metadata search
(via WCLocal)
KB full-text search
(via Platform for Digital Publications)
Landing page
(within Platform for Digital Publications)
Scientist, student etc.
84. So… we’re very busy creating
loads of digital content …
http://www.bjp-online.com/IMG/001/110001/getty-archive-collection.jpg?1282665196
85. So… we’re very busy creating
loads of digital content …
and we’re also creating
unified discovery & presentation …
http://www.bjp-online.com/IMG/001/110001/getty-archive-collection.jpg?1282665196