SlideShare uma empresa Scribd logo
1 de 14
Baixar para ler offline
What libraries can learn from Google –
and what they can do better
Günter Mühlberger
University Innsbruck Library
Agenda
• Introduction
• A story about digitisation
• The continuation of the story
• Some conclusions
Introduction
• Department for Digitisation and Digital Preservation
– Founded in 2002, 14 FTE, R&D and Digitisation Services
– Since 1998 coordinated several R&D EU projects in the digital library
domain
– Currently involved in several projects, e.g.: IMPACT (mass-digitisation
of textual material, text recognition and language technologies),
Prestoprime (long term preservation of audio-visual material), both
projects will set up a CoC
– Coordinator of the library network eBooks on Demand (EOD) with 30
member libraries in 13 countries: Digitisation on Demand service
• Several medium and large scale digitisation projects + respective
applications for searching, browsing, archiving
– Catalogue cards
– Newspapers and newspaper clippings
– Books and journals
• Our mission
– To make a valuable contribution to an up to date digital library
A short story
• January 2007
– Collection of 30.000 books from a monastery
“Servitenbibliothek” as present to the library
– No spare shelves at the library for such a collection since
a collection of German dissertations occupies the best
magazines
– Suggestion to get rid of the dissertations
– Decision to digitize first and than to throw them away
• During 2007
– Several experiments with document scanners, cutting of
the documents, workflows, etc.
Digitisation of dissertations
• 2008 – mid 2010
– Real production process with two parallel document scanners and
up to 70.000 pages per day, 50.000 pages as average
– Average of 2’ per dissertation (110 pages) including ALL steps in
the workflow
– Convincing scan quality: Tests show that OCR will be nearly perfect
– All extra pages (supplements, tables, etc.) are treated extra
– Single cutting of documents too time consuming
– Change of paper quality
• Summer 2010
– We have processed 216.000 dissertations with 24 mill. pages,
1800 shelf meters
– 400 GB image data (TIFF IV bitonal)
– Overall time invested: 8000 hours or 5 person years
– High quality industrial equipment for less than 50.000 EUR
– Tests for OCR processing the 24 mill. pages are encouraging
Continuation of the story
• How can we give access to this large collection?
– Copyright comes in
• Investigations on Austrian copyright
– We are allowed to scan for preservation purposes. O.k!
– We are allowed to store for preservation. O.k!
– We are allowed to print out a copy and use it instead of what we
had before we digitised everything. Hm!
– We are allowed to use this copy for interlibrary loan – but need to
get it back. Uups!
– We are not allowed to make them available to the public. O.k!
– We are not allowed to make them available to our researchers and
students at the university. Uups!
– We are not allowed to make them available to other libraries
owning the same dissertations. Pff!
– We are allowed to provide access on a handful of dedicated
computers at the library. Mmh!
Some more considerations
• “Making available” is a new kind of use
– Copying, distribution, translation, exhibiting, etc. are traditional use
forms and publisher contracts cover this kind of use
– In 2003 (following the EU Directive on Copyright from 2001) a new
kind of use was introduced: “making available”
– Since this is a new right “old” contracts (usually) do not cover this
right.
– The author is therefore the right holder, not the publisher.
– In some countries it is more complicated (e.g. Germany) but as a rule
of thumb most authors in Europe still have the right to decide by
whom, when and how their digitised work will be made available to
the public
• Dissertations
– Even simpler since no publishers or RROs are involved
– Dissertations were printed on behalf of the authors, never distributed
via the book market
Our approach to copyright
• Let’s the social Internet work for us
– Dissertations will be made available online, but only title page,
table of contents and abstract/introduction will be shown to
everyone
– Under discussion: Maybe also some more pages and search
snippets
– Readers will get the chance to write a short “Request”: I would
need this book for my scientific work, etc.
– Readers will be encouraged to contact potential right holders (“Do
the diligent search for us”)
• Registration mechanism
– A big displayer will appear: If you are the author or if you know the
author/right holder – please help us!
– Authors will need to register (personal coordinates), set some
options and confirm their statement
Authorisation
• Copyright options
– They may want to make a general statement: Open Access,
Creative Commons, All rights reserved
– A cooperation with authors organisation (RRO) will make sense
– Or they may want to make a specific statement: This library is
allowed to do that and that. Than it is a simple bilateral, non-
exclusive contract.
• How to identify the right holder?
– Digital signatures or eCards would make life much easier.
• Current plan:
– Author provides address.
– He receives a letter with a list of TAN codes which will be needed
for any action within the system.
– If he chooses to “reserve all rights” the data are transferred to the
RRO(s)
– Minimal risk remains but can be neglected
Our dream
• We hope
– That it becomes a “self-runner” where those who need the
information will convince those who have the rights to
provide free access – or at least provide some access rights
for libraries
– That authors will understand why it is so important that
libraries digitise current material and provide access to
everyone
– That users will understand that authors have rights
(copyright and personal rights) which need to be respected
– That RROs and publishers will understand that not everyone
is interested in “making money with books written 30 years
ago” but that many are also willing to support the idea of
open access
– That thousands and ten-thousands of authors and readers
will take part
What we can learn from Google
• Mission of Google is to organise the information universe based on
technological innovation
– Therefore books are highly important (they contain much better information
than websites)
– Digitisation of books was just one step towards the overall objective
• If you have a mission, do the first step first and afterwards sort out the
problems
– Organise the cheapest way to scan, build your own machines, workflow, etc.
– Make a reasonable compromise between quantity and quality
– Be innovative (take what is here but put it together in a new way)
• Convert problems into chances
– Rather sure that Google underestimated the impact of copyright
– Settlement was probably not foreseen from the very beginning, but now it is a
great business opportunity for them
– If it comes, it will allow them to make a lot of money
• Battle on books is won /lost in the 20th century not in the public
domain
– Who reads books from 19th, 18th or 17th century?
What libraries can do better
• Libraries also need to follow their mission: to preserve the
intellectual heritage of mankind and to provide free access
to everyone
– Google is not a library
– It does many things as if it were a library (and better), but it never
will become a library
– Preservation comprises analogue AND digital preservation (go hand
in hand)
• to digitise (collect) everything
– Libraries are collection holders, not Google or anyone else
– Digitisation (and everything what is connected) has to be part of
the daily business and not only of projects
– Digitisation should be twofold: on-demand AND via mass
digitisation (including cutting of documents and 20th century
material)
– A natural consequence is to also collect modern material in digital
format (right from the beginning, pre-press files)
What libraries can do better
• to cooperate among each other (nationally and internationally)
– Most libraries have the same books, even duplicates within an
institution
– Swedish books in Austria, German books in Sweden, etc.
– Open access material will no longer belong to one library, but to
everyone!
– Therefore it makes definitely sense to cut one book and store the
pages digitally and analogue (acid free box)
• to involve readers (and right holders)
– Libraries have a “natural authority” which needs to be exploited as a
market advantage
– Libraries are much nearer to authors and readers than anyone else, but
they need to give them the chance to express themselves
– They may be slow, old-fashioned and technologically not on the fore-
front but they are trustful organisations and are able to mobilise
thousands or even hundred thousands of users
Let’s go to work!

Mais conteúdo relacionado

Mais procurados

Can you save the web? Web Archiving!
Can you save the web? Web Archiving!Can you save the web? Web Archiving!
Can you save the web? Web Archiving!Vangelis Banos
 
Danish library association and the danish digital library
Danish library association and the danish digital libraryDanish library association and the danish digital library
Danish library association and the danish digital libraryMichel Steen-Hansen
 
What a difference 10 years makes | But where to from here?
What a difference 10 years makes | But where to from here?What a difference 10 years makes | But where to from here?
What a difference 10 years makes | But where to from here?Adrian Kingston
 
Hohmann liber2006text
Hohmann liber2006textHohmann liber2006text
Hohmann liber2006textTina Hohmann
 
BlogForever Project presentation at MTSR2013
BlogForever Project presentation at MTSR2013BlogForever Project presentation at MTSR2013
BlogForever Project presentation at MTSR2013eimgreece
 

Mais procurados (6)

Danish Library Association - voorstelling door Hellen Niegaard
Danish Library Association - voorstelling door Hellen NiegaardDanish Library Association - voorstelling door Hellen Niegaard
Danish Library Association - voorstelling door Hellen Niegaard
 
Can you save the web? Web Archiving!
Can you save the web? Web Archiving!Can you save the web? Web Archiving!
Can you save the web? Web Archiving!
 
Danish library association and the danish digital library
Danish library association and the danish digital libraryDanish library association and the danish digital library
Danish library association and the danish digital library
 
What a difference 10 years makes | But where to from here?
What a difference 10 years makes | But where to from here?What a difference 10 years makes | But where to from here?
What a difference 10 years makes | But where to from here?
 
Hohmann liber2006text
Hohmann liber2006textHohmann liber2006text
Hohmann liber2006text
 
BlogForever Project presentation at MTSR2013
BlogForever Project presentation at MTSR2013BlogForever Project presentation at MTSR2013
BlogForever Project presentation at MTSR2013
 

Destaque

Slidecast final draft
Slidecast final draftSlidecast final draft
Slidecast final drafthmzucker
 
Irregular verbs
Irregular verbsIrregular verbs
Irregular verbsIsra1976
 
Aperiodic crystal workshop 2013: TEM
Aperiodic crystal workshop 2013: TEMAperiodic crystal workshop 2013: TEM
Aperiodic crystal workshop 2013: TEMJoke Hadermann
 
Tem for incommensurately modulated materials
Tem for incommensurately modulated materialsTem for incommensurately modulated materials
Tem for incommensurately modulated materialsJoke Hadermann
 
Direct space structure solution from precession electron diffraction data: Re...
Direct space structure solution from precession electron diffraction data: Re...Direct space structure solution from precession electron diffraction data: Re...
Direct space structure solution from precession electron diffraction data: Re...Joke Hadermann
 
Complementarity of advanced TEM to bulk diffraction techniques
Complementarity of advanced TEM to bulk diffraction techniquesComplementarity of advanced TEM to bulk diffraction techniques
Complementarity of advanced TEM to bulk diffraction techniquesJoke Hadermann
 
Determining a structure with electron crystallography - Overview of the paper...
Determining a structure with electron crystallography - Overview of the paper...Determining a structure with electron crystallography - Overview of the paper...
Determining a structure with electron crystallography - Overview of the paper...Joke Hadermann
 
Mapping of chemical order in inorganic compounds
Mapping of chemical order in inorganic compoundsMapping of chemical order in inorganic compounds
Mapping of chemical order in inorganic compoundsJoke Hadermann
 
Solving the Structure of Li Ion Battery Materials with Precession Electron Di...
Solving the Structure of Li Ion Battery Materials with Precession Electron Di...Solving the Structure of Li Ion Battery Materials with Precession Electron Di...
Solving the Structure of Li Ion Battery Materials with Precession Electron Di...Joke Hadermann
 
Irregular verbs
Irregular verbsIrregular verbs
Irregular verbsIsra1976
 
New oxide structures using lone pairs cations as "chemical scissors"
New oxide structures using lone pairs cations as "chemical scissors"New oxide structures using lone pairs cations as "chemical scissors"
New oxide structures using lone pairs cations as "chemical scissors"Joke Hadermann
 
TEM Winterworkshop 2011: electron diffraction
TEM Winterworkshop 2011: electron diffractionTEM Winterworkshop 2011: electron diffraction
TEM Winterworkshop 2011: electron diffractionJoke Hadermann
 
Scheelite CGEW/MO for luminescence - Summary of the paper
Scheelite CGEW/MO for luminescence - Summary of the paperScheelite CGEW/MO for luminescence - Summary of the paper
Scheelite CGEW/MO for luminescence - Summary of the paperJoke Hadermann
 
TEM workshop 2013: Electron diffraction
TEM workshop 2013: Electron diffractionTEM workshop 2013: Electron diffraction
TEM workshop 2013: Electron diffractionJoke Hadermann
 

Destaque (16)

Slidecast final draft
Slidecast final draftSlidecast final draft
Slidecast final draft
 
Irregular verbs
Irregular verbsIrregular verbs
Irregular verbs
 
Slide
SlideSlide
Slide
 
Slidecast
SlidecastSlidecast
Slidecast
 
Aperiodic crystal workshop 2013: TEM
Aperiodic crystal workshop 2013: TEMAperiodic crystal workshop 2013: TEM
Aperiodic crystal workshop 2013: TEM
 
Tem for incommensurately modulated materials
Tem for incommensurately modulated materialsTem for incommensurately modulated materials
Tem for incommensurately modulated materials
 
Direct space structure solution from precession electron diffraction data: Re...
Direct space structure solution from precession electron diffraction data: Re...Direct space structure solution from precession electron diffraction data: Re...
Direct space structure solution from precession electron diffraction data: Re...
 
Complementarity of advanced TEM to bulk diffraction techniques
Complementarity of advanced TEM to bulk diffraction techniquesComplementarity of advanced TEM to bulk diffraction techniques
Complementarity of advanced TEM to bulk diffraction techniques
 
Determining a structure with electron crystallography - Overview of the paper...
Determining a structure with electron crystallography - Overview of the paper...Determining a structure with electron crystallography - Overview of the paper...
Determining a structure with electron crystallography - Overview of the paper...
 
Mapping of chemical order in inorganic compounds
Mapping of chemical order in inorganic compoundsMapping of chemical order in inorganic compounds
Mapping of chemical order in inorganic compounds
 
Solving the Structure of Li Ion Battery Materials with Precession Electron Di...
Solving the Structure of Li Ion Battery Materials with Precession Electron Di...Solving the Structure of Li Ion Battery Materials with Precession Electron Di...
Solving the Structure of Li Ion Battery Materials with Precession Electron Di...
 
Irregular verbs
Irregular verbsIrregular verbs
Irregular verbs
 
New oxide structures using lone pairs cations as "chemical scissors"
New oxide structures using lone pairs cations as "chemical scissors"New oxide structures using lone pairs cations as "chemical scissors"
New oxide structures using lone pairs cations as "chemical scissors"
 
TEM Winterworkshop 2011: electron diffraction
TEM Winterworkshop 2011: electron diffractionTEM Winterworkshop 2011: electron diffraction
TEM Winterworkshop 2011: electron diffraction
 
Scheelite CGEW/MO for luminescence - Summary of the paper
Scheelite CGEW/MO for luminescence - Summary of the paperScheelite CGEW/MO for luminescence - Summary of the paper
Scheelite CGEW/MO for luminescence - Summary of the paper
 
TEM workshop 2013: Electron diffraction
TEM workshop 2013: Electron diffractionTEM workshop 2013: Electron diffraction
TEM workshop 2013: Electron diffraction
 

Semelhante a Muehlberger umea google

You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?The European Library
 
Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02The European Library
 
53 million objects! Now what?
53 million objects! Now what?53 million objects! Now what?
53 million objects! Now what?David Haskiya
 
The Public Domain Charter
The Public Domain CharterThe Public Domain Charter
The Public Domain CharterPaul Keller
 
Save this book: posterity’s challenge
Save this book: posterity’s challengeSave this book: posterity’s challenge
Save this book: posterity’s challengeBookrepublic
 
Naple presentation danish digital library
Naple presentation danish digital libraryNaple presentation danish digital library
Naple presentation danish digital libraryJakobheide
 
eReading talk SJSU 2012 01
eReading talk SJSU 2012 01eReading talk SJSU 2012 01
eReading talk SJSU 2012 01TAPintoIT
 
Copyright challenges and policy choices in European heritage projects Tools, ...
Copyright challenges and policy choices in European heritage projects Tools, ...Copyright challenges and policy choices in European heritage projects Tools, ...
Copyright challenges and policy choices in European heritage projects Tools, ...Phonothèque MMSH
 
Bryssel 22. 23.10.2009 Mb
Bryssel 22. 23.10.2009 MbBryssel 22. 23.10.2009 Mb
Bryssel 22. 23.10.2009 Mbguestd67478
 
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...The European Library
 

Semelhante a Muehlberger umea google (20)

You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?You’ve Digitised Your Collection. What Next ?
You’ve Digitised Your Collection. What Next ?
 
You've Digitised. What Next ?
You've Digitised. What Next ?You've Digitised. What Next ?
You've Digitised. What Next ?
 
Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02Dunning seedi-2013-130517083015-phpapp02
Dunning seedi-2013-130517083015-phpapp02
 
Proyecto Arrow. Ana Manchado Mangas
Proyecto Arrow. Ana Manchado MangasProyecto Arrow. Ana Manchado Mangas
Proyecto Arrow. Ana Manchado Mangas
 
Save This Book
Save This BookSave This Book
Save This Book
 
53 million objects! Now what?
53 million objects! Now what?53 million objects! Now what?
53 million objects! Now what?
 
Digital Library
Digital LibraryDigital Library
Digital Library
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
 
Europeana en CARARE
Europeana en CARAREEuropeana en CARARE
Europeana en CARARE
 
31 36
31 3631 36
31 36
 
Sistema Compartit a l'ICOLC
Sistema Compartit a l'ICOLCSistema Compartit a l'ICOLC
Sistema Compartit a l'ICOLC
 
Luca Martinelli Europeana
Luca Martinelli EuropeanaLuca Martinelli Europeana
Luca Martinelli Europeana
 
Digitallibrary
DigitallibraryDigitallibrary
Digitallibrary
 
The Public Domain Charter
The Public Domain CharterThe Public Domain Charter
The Public Domain Charter
 
Save this book: posterity’s challenge
Save this book: posterity’s challengeSave this book: posterity’s challenge
Save this book: posterity’s challenge
 
Naple presentation danish digital library
Naple presentation danish digital libraryNaple presentation danish digital library
Naple presentation danish digital library
 
eReading talk SJSU 2012 01
eReading talk SJSU 2012 01eReading talk SJSU 2012 01
eReading talk SJSU 2012 01
 
Copyright challenges and policy choices in European heritage projects Tools, ...
Copyright challenges and policy choices in European heritage projects Tools, ...Copyright challenges and policy choices in European heritage projects Tools, ...
Copyright challenges and policy choices in European heritage projects Tools, ...
 
Bryssel 22. 23.10.2009 Mb
Bryssel 22. 23.10.2009 MbBryssel 22. 23.10.2009 Mb
Bryssel 22. 23.10.2009 Mb
 
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
British Library Labs, Aly Conteh, Digitisation Programme Manager at British L...
 

Último

SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
Basic Intentional Injuries Health Education
Basic Intentional Injuries Health EducationBasic Intentional Injuries Health Education
Basic Intentional Injuries Health EducationNeilDeclaro1
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxCeline George
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxannathomasp01
 
latest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answerslatest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answersdalebeck957
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxPooja Bhuva
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Pooja Bhuva
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxPooja Bhuva
 

Último (20)

SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Basic Intentional Injuries Health Education
Basic Intentional Injuries Health EducationBasic Intentional Injuries Health Education
Basic Intentional Injuries Health Education
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
latest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answerslatest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answers
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 

Muehlberger umea google

  • 1. What libraries can learn from Google – and what they can do better Günter Mühlberger University Innsbruck Library
  • 2. Agenda • Introduction • A story about digitisation • The continuation of the story • Some conclusions
  • 3. Introduction • Department for Digitisation and Digital Preservation – Founded in 2002, 14 FTE, R&D and Digitisation Services – Since 1998 coordinated several R&D EU projects in the digital library domain – Currently involved in several projects, e.g.: IMPACT (mass-digitisation of textual material, text recognition and language technologies), Prestoprime (long term preservation of audio-visual material), both projects will set up a CoC – Coordinator of the library network eBooks on Demand (EOD) with 30 member libraries in 13 countries: Digitisation on Demand service • Several medium and large scale digitisation projects + respective applications for searching, browsing, archiving – Catalogue cards – Newspapers and newspaper clippings – Books and journals • Our mission – To make a valuable contribution to an up to date digital library
  • 4. A short story • January 2007 – Collection of 30.000 books from a monastery “Servitenbibliothek” as present to the library – No spare shelves at the library for such a collection since a collection of German dissertations occupies the best magazines – Suggestion to get rid of the dissertations – Decision to digitize first and than to throw them away • During 2007 – Several experiments with document scanners, cutting of the documents, workflows, etc.
  • 5. Digitisation of dissertations • 2008 – mid 2010 – Real production process with two parallel document scanners and up to 70.000 pages per day, 50.000 pages as average – Average of 2’ per dissertation (110 pages) including ALL steps in the workflow – Convincing scan quality: Tests show that OCR will be nearly perfect – All extra pages (supplements, tables, etc.) are treated extra – Single cutting of documents too time consuming – Change of paper quality • Summer 2010 – We have processed 216.000 dissertations with 24 mill. pages, 1800 shelf meters – 400 GB image data (TIFF IV bitonal) – Overall time invested: 8000 hours or 5 person years – High quality industrial equipment for less than 50.000 EUR – Tests for OCR processing the 24 mill. pages are encouraging
  • 6. Continuation of the story • How can we give access to this large collection? – Copyright comes in • Investigations on Austrian copyright – We are allowed to scan for preservation purposes. O.k! – We are allowed to store for preservation. O.k! – We are allowed to print out a copy and use it instead of what we had before we digitised everything. Hm! – We are allowed to use this copy for interlibrary loan – but need to get it back. Uups! – We are not allowed to make them available to the public. O.k! – We are not allowed to make them available to our researchers and students at the university. Uups! – We are not allowed to make them available to other libraries owning the same dissertations. Pff! – We are allowed to provide access on a handful of dedicated computers at the library. Mmh!
  • 7. Some more considerations • “Making available” is a new kind of use – Copying, distribution, translation, exhibiting, etc. are traditional use forms and publisher contracts cover this kind of use – In 2003 (following the EU Directive on Copyright from 2001) a new kind of use was introduced: “making available” – Since this is a new right “old” contracts (usually) do not cover this right. – The author is therefore the right holder, not the publisher. – In some countries it is more complicated (e.g. Germany) but as a rule of thumb most authors in Europe still have the right to decide by whom, when and how their digitised work will be made available to the public • Dissertations – Even simpler since no publishers or RROs are involved – Dissertations were printed on behalf of the authors, never distributed via the book market
  • 8. Our approach to copyright • Let’s the social Internet work for us – Dissertations will be made available online, but only title page, table of contents and abstract/introduction will be shown to everyone – Under discussion: Maybe also some more pages and search snippets – Readers will get the chance to write a short “Request”: I would need this book for my scientific work, etc. – Readers will be encouraged to contact potential right holders (“Do the diligent search for us”) • Registration mechanism – A big displayer will appear: If you are the author or if you know the author/right holder – please help us! – Authors will need to register (personal coordinates), set some options and confirm their statement
  • 9. Authorisation • Copyright options – They may want to make a general statement: Open Access, Creative Commons, All rights reserved – A cooperation with authors organisation (RRO) will make sense – Or they may want to make a specific statement: This library is allowed to do that and that. Than it is a simple bilateral, non- exclusive contract. • How to identify the right holder? – Digital signatures or eCards would make life much easier. • Current plan: – Author provides address. – He receives a letter with a list of TAN codes which will be needed for any action within the system. – If he chooses to “reserve all rights” the data are transferred to the RRO(s) – Minimal risk remains but can be neglected
  • 10. Our dream • We hope – That it becomes a “self-runner” where those who need the information will convince those who have the rights to provide free access – or at least provide some access rights for libraries – That authors will understand why it is so important that libraries digitise current material and provide access to everyone – That users will understand that authors have rights (copyright and personal rights) which need to be respected – That RROs and publishers will understand that not everyone is interested in “making money with books written 30 years ago” but that many are also willing to support the idea of open access – That thousands and ten-thousands of authors and readers will take part
  • 11. What we can learn from Google • Mission of Google is to organise the information universe based on technological innovation – Therefore books are highly important (they contain much better information than websites) – Digitisation of books was just one step towards the overall objective • If you have a mission, do the first step first and afterwards sort out the problems – Organise the cheapest way to scan, build your own machines, workflow, etc. – Make a reasonable compromise between quantity and quality – Be innovative (take what is here but put it together in a new way) • Convert problems into chances – Rather sure that Google underestimated the impact of copyright – Settlement was probably not foreseen from the very beginning, but now it is a great business opportunity for them – If it comes, it will allow them to make a lot of money • Battle on books is won /lost in the 20th century not in the public domain – Who reads books from 19th, 18th or 17th century?
  • 12. What libraries can do better • Libraries also need to follow their mission: to preserve the intellectual heritage of mankind and to provide free access to everyone – Google is not a library – It does many things as if it were a library (and better), but it never will become a library – Preservation comprises analogue AND digital preservation (go hand in hand) • to digitise (collect) everything – Libraries are collection holders, not Google or anyone else – Digitisation (and everything what is connected) has to be part of the daily business and not only of projects – Digitisation should be twofold: on-demand AND via mass digitisation (including cutting of documents and 20th century material) – A natural consequence is to also collect modern material in digital format (right from the beginning, pre-press files)
  • 13. What libraries can do better • to cooperate among each other (nationally and internationally) – Most libraries have the same books, even duplicates within an institution – Swedish books in Austria, German books in Sweden, etc. – Open access material will no longer belong to one library, but to everyone! – Therefore it makes definitely sense to cut one book and store the pages digitally and analogue (acid free box) • to involve readers (and right holders) – Libraries have a “natural authority” which needs to be exploited as a market advantage – Libraries are much nearer to authors and readers than anyone else, but they need to give them the chance to express themselves – They may be slow, old-fashioned and technologically not on the fore- front but they are trustful organisations and are able to mobilise thousands or even hundred thousands of users
  • 14. Let’s go to work!