HathiTrust--a GovDocs Repository?
Brian Vetruba, Catalog Librarian/Germanic Studies Librarian
Washington University in St. Louis
bvetruba@wustl.edu

Leveraging Your Strengths: Regional Government Documents Conference | Federal Reserve Bank St. Louis
May 4, 2012
Overview
• Began in 2008
• Over 10.2 million volumes
• Over 2.9 million public domain
  (PD) volumes (“full view”)
• Over 60 partners

hāthī (हाथी) (pronounced HAH-tee) is the Hindi word for elephant.
Content
[Figures: HathiTrust partners; content by call number, language, date, and more]
http://www.hathitrust.org/statistics_visualizations
US Gov Docs in HathiTrust
• Ca. 300,000 = 4% of all
  titles in HathiTrust

• 80% of gov docs in
  HathiTrust in public
  domain
Percentage of Docs in HathiTrust (est.), 1895-2009
[Chart] Source: Brown (2011)
HathiTrust Compared to Google Books
More titles found in Google but HathiTrust provides more full-text

Total docs = 385 (1940s)

                 Titles Found     Full-text
  HathiTrust     98               90
  Google         181              4

HathiTrust better for searching serials
• One record for all issues of a title
• Title changes noted

Source: Sare (2012)
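
The counts in the comparison above are easier to weigh as rates. The sketch below is only an illustration: it uses the four figures and the sample size of 385 from the slide, and the variable and function names are hypothetical, not taken from Sare (2012).

```python
# Counts copied from the slide (Sare 2012); names here are illustrative only.
TOTAL_DOCS = 385

results = {
    "HathiTrust": {"found": 98, "full_text": 90},
    "Google": {"found": 181, "full_text": 4},
}


def rates(found: int, full_text: int, total: int = TOTAL_DOCS) -> tuple[float, float]:
    """Return (percent of sample found, percent of found titles in full text)."""
    return round(100 * found / total, 1), round(100 * full_text / found, 1)


for source, counts in results.items():
    found_pct, full_pct = rates(counts["found"], counts["full_text"])
    print(f"{source}: {found_pct}% of sample found, {full_pct}% of those in full text")
```

On these numbers Google finds roughly twice as many of the sampled titles, while nearly all of the titles HathiTrust finds are available in full text.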
Who can do what
Everyone
• View PD content
• Search PD and copyright materials
• View public collections
• Download single PD pages
• Download MARC records
HathiTrust Partners
• Download entire volumes of PD materials
• Create private or public collections
• Have a voice in the future of HathiTrust
Advanced Search in HathiTrust
Searching & Discovering Content
HT Catalog http://www.hathitrust.org/

HT WorldCat Local prototype
http://hathitrust.worldcat.org/


Resource Discovery Tools
Searching & Discovering Content
Loading records into local ILS
 http://www.hathitrust.org/data


Bibliographic and Data APIs
 http://www.hathitrust.org/data


Widgets
 http://www.hathitrust.org/widgets
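
As a rough illustration of how the Bibliographic API mentioned above might be called, the sketch below only builds a request URL and makes no network call. The base URL, the brief/full path segment, and the list of identifier types are assumptions made for this example and should be checked against the documentation linked from http://www.hathitrust.org/data; the function name is hypothetical.

```python
from urllib.parse import quote

# Assumed base URL and path layout for the Bibliographic API; verify against
# the documentation linked from http://www.hathitrust.org/data before use.
API_BASE = "https://catalog.hathitrust.org/api/volumes"
ID_TYPES = {"oclc", "lccn", "issn", "isbn", "htid", "recordnumber"}  # assumed set
LEVELS = {"brief", "full"}  # assumed response detail levels


def bib_api_url(id_type: str, value: str, level: str = "brief") -> str:
    """Build a lookup URL for one identifier (hypothetical helper)."""
    if id_type not in ID_TYPES:
        raise ValueError(f"unsupported identifier type: {id_type}")
    if level not in LEVELS:
        raise ValueError(f"unsupported level: {level}")
    # Percent-encode the identifier so characters such as "/" stay in one path segment.
    return f"{API_BASE}/{level}/{id_type}/{quote(value, safe='')}.json"


print(bib_api_url("oclc", "424023"))
```

A local catalog could use a URL like this to check whether a title it holds has a matching HathiTrust record before adding a link.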
Searching & Discovering Content
Embed links

 Public collections
     http://babel.hathitrust.org/cgi/mb

 Individual items


From a LibGuide for a German literature course
Final Thoughts
CHALLENGES
• Searching/retrieval obstacles (e.g. no SuDoc search)
• Inaccurate copyright statuses impeding access
• Inaccurate linkages and bibliographic info
PROGRESS
• Commitment to expand and enhance access to gov docs
• Research study examining how to improve access
• Coordination with Committee on Institutional Cooperation
  and others to create a digital corpus of 1+ million print
  docs
Questions about HathiTrust
  http://www.hathitrust.org/help
  feedback@issues.hathitrust.org
           @hathitrust

           More info:
http://libguides.wustl.edu/hathitrust
More info on loading records into ILS
Kent State Univ.:
http://techserv.lib.muohio.edu/ovgtsl11/presentations/Panchyshyn.pptx
Univ. of Denver:
http://www.slideserve.com/holleb/harvesting-hathitrust-documents-a-new-model-for-online-access
Univ. of Colorado-Denver:
Beall, Jeffrey. 2009. “Free Books: Loading Brief MARC Records for
Open-Access Books in an Academic Library Catalog.” Cataloging & Classification
Quarterly 47 (5) (January 4): 452–463. doi:10.1080/01639370902870215.
Bibliography

Malpas, Constance. 2011. Cloud-sourcing Research Collections: Managing
Print in the Mass-digitized Library Environment. Dublin, Ohio: OCLC
Research. Accessed May 2, 2012
http://www.oclc.org/research/publications/library/2011/2011-01.pdf

York, Jeremy. 2012. HathiTrust: Issues and Challenges in Preserving the
Published Record [PowerPoint slides]. Accessed April 30, 2012
http://www.hathitrust.org/documents/HathiTrust-Amigos-201202.pptx

Brown, Christopher C. 2011. Harvesting HathiTrust Documents: A New Model
for Online Access [PowerPoint slides]. Accessed April 30, 2012
http://www.slideserve.com/holleb/harvesting-hathitrust-documents-a-new-model-for-online-access

York, Jeremy. 2012. “HathiTrust: The Elephant in the Library.” Library Issues:
Briefings for Faculty and Administrators 32 (3) (January). Accessed May 2,
2012 http://www.libraryissues.com/sub/LI320003.asp

Sare, Laura. 2012. “A Comparison of HathiTrust and Google Books Using
Federal Publications.” Practical Academic Librarianship: The International
Journal of the SLA Academic Division 2 (1): 1–25. Accessed May 2, 2012
http://journals.tdl.org/pal/article/viewFile/5880/5922
Thank you!
bvetruba@wustl.edu
     @bvetruba

HathiTrust--a GovDocs Repository?

  • 1. HathiTrust--a GovDocs Repository? Brian Vetruba, Catalog Librarian/Germanic Studies Librarian Washington University in St. Louis bvetruba@wustl.edu Leveraging Your Strengths: Regional Government Documents Conference | Federal Reserve Bank St. Louis May 4, 2012
  • 2. Overview • Began in 2008 • Over 10.2 million volumes • Over 2.9 million public domain (PD) volumes (“full view”) • Over 60 partners hāthī ( ) (pronounced HAH-tee) is the Hindi word for elephant.
  • 3. Content HathiTrust Partners: Content by call number, language, date And more... http://www.hathitrust.org/statistics_visualizations
  • 4. US Gov Docs in HathiTrust • Ca. 300,000 = 4% of all titles in HathiTrust • 80% of gov docs in HathiTrust in public domain
  • 5. Percentage of Docs in HathiTrust (est.) 1895-2009 Brown (2011)
  • 6. HathiTrust Compared to Google Books More titles found in Google but HathiTrust provides more full-text Total docs = 385 Titles Found Full-text (1940s) HathiTrust 98 90 Google 181 4 HathiTrust better for searching serials One record for all issues of a title Title changes noted Sare (2012)
  • 7. Who can do what Everyone HathiTrust Partners • View PD content • Download entire volumes • Search PD and copyright of PD materials materials • Create private or public • View public collections collections • Download single PD pages • Have a voice in the future • Download MARC records of HathiTrust
  • 8. Advanced Search in HathiTrust
  • 9. Searching & Discovering Content HT Catalog http://www.hathitrust.org/ HT WorldCat Local prototype http://hathitrust.worldcat.org/ Resource Discovery Tools
  • 10. Searching & Discovering Content Loading records into local ILS http://www.hathitrust.org/data Bibliographic and Data APIs http://www.hathitrust.org/data Widgets http://www.hathitrust.org/widgets
  • 11.
  • 12. Searching & Discovering Content Embed links  Public collections  http://babel.hathitrust.org/cgi/mb  Individual items From a LibGuide for a German literature course
  • 13. Final Thoughts CHALLENGES • Searching/retrieval obstacles (e.g. no SUDoc search) • Inaccurate copyright statuses impeding access • Inaccurate linkages and bibliographic info PROGRESS • Commitment to expand and enhance access to gov docs • Research study examining how to improve access • Coordination with Committee on Institutional Cooperation and others to create a digital corpus of 1+ million print docs
  • 14. Questions about HathiTrust http://www.hathitrust.org/help feedback@issues.hathitrust.org @hathitrust More info: http://libguides.wustl.edu/hathitrust
  • 15. More info on loading records into ILS. Kent State Univ.: http://techserv.lib.muohio.edu/ovgtsl11/presentations/Panchyshyn.pptx Univ. of Denver: http://www.slideserve.com/holleb/harvesting-hathitrust-documents-a-new-model-for-online-access Univ. of Colorado-Denver: Beall, Jeffrey. 2009. “Free Books: Loading Brief MARC Records for Open-Access Books in an Academic Library Catalog.” Cataloging & Classification Quarterly 47 (5) (January 4): 452–463. doi:10.1080/01639370902870215.
  • 16. Bibliography. Malpas, Constance. 2011. Cloud-sourcing Research Collections: Managing Print in the Mass-digitized Library Environment. Dublin, Ohio: OCLC Research. Accessed May 2, 2012. http://www.oclc.org/research/publications/library/2011/2011-01.pdf York, Jeremy. 2012. HathiTrust: Issues and Challenges in Preserving the Published Record [PowerPoint slides]. Accessed April 30, 2012. http://www.hathitrust.org/documents/HathiTrust-Amigos-201202.pptx Brown, Christopher C. 2011. Harvesting HathiTrust Documents: A New Model for Online Access [PowerPoint slides]. Accessed April 30, 2012. http://www.slideserve.com/holleb/harvesting-hathitrust-documents-a-new-model-for-online-access York, Jeremy. 2012. “HathiTrust: The Elephant in the Library.” Library Issues: Briefings for Faculty and Administrators 32 (3) (January). Accessed May 2, 2012. http://www.libraryissues.com/sub/LI320003.asp Sare, Laura. 2012. “A Comparison of HathiTrust and Google Books Using Federal Publications.” Practical Academic Librarianship: The International Journal of the SLA Academic Division 2 (1): 1–25. Accessed May 2, 2012. http://journals.tdl.org/pal/article/viewFile/5880/5922
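
The counts on slide 6 are easier to compare as rates. A minimal sketch of that arithmetic, using only the figures quoted from Sare (2012) for the 1940s sample; the class and function names are illustrative and not part of any HathiTrust or Google tool:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Coverage:
    """Counts reported for one service in the 1940s sample (Sare 2012)."""
    name: str
    found: int       # titles located in the service
    full_text: int   # of those, titles viewable as full text

    def found_rate(self, sample_size: int) -> float:
        # Share of the sampled documents that the service lists at all.
        return self.found / sample_size

    def full_text_rate(self) -> float:
        # Share of the *found* titles that are readable in full.
        return self.full_text / self.found if self.found else 0.0


SAMPLE_SIZE = 385  # documents from the 1940s in the sample
SERVICES = [
    Coverage("HathiTrust", found=98, full_text=90),
    Coverage("Google", found=181, full_text=4),
]


def summarize(services, sample_size):
    """Return {name: (percent found, percent of found that is full text)}."""
    return {
        s.name: (
            round(100 * s.found_rate(sample_size), 1),
            round(100 * s.full_text_rate(), 1),
        )
        for s in services
    }


if __name__ == "__main__":
    for name, (found_pct, full_pct) in summarize(SERVICES, SAMPLE_SIZE).items():
        print(f"{name}: {found_pct}% of sample found, {full_pct}% of found in full text")
```

Read this way, Google lists nearly twice as many of the sampled titles, but almost all of the titles HathiTrust lists can be read in full, which is the point the slide makes.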

Editor's Notes

  1. HathiTrust was launched in 2008 by a 12-university consortium known as the Committee on Institutional Cooperation (CIC), along with the University of California system. It has grown to more than 60 partners, including Columbia, Princeton, Yale, Duke, and Johns Hopkins. Also Missou. Unlike other e-book initiatives, both PRESERVATION and ACCESS are main focal points. HathiTrust has stated its intention to preserve digital volumes over the long term. hāthī (हाथी) (pronounced HAH-tee) is the Hindi word for elephant, an animal highly regarded for its memory, wisdom, and strength. 5,422,301 book titles; 269,168 serial titles. Trustworthy Repository Audit and Certification (TRAC).
  2. Much of the current content in HathiTrust was digitized as part of Google Books. Another major source is the Internet Archive. An increasing amount of content comes from digitization by partner libraries. So if most of the content in HT is in Google or IA, how is it different? -- A digital library organized by libraries for libraries and their users -- Catalog structure to facilitate access -- Uses all the metadata -- Fine-tunes search interfaces to fit users' needs -- Open access to data and meta -- Key goal: PRESERVATION over the long haul -- Locally digitized collections from partners increasingly important -- Coordination w/ other digital library initiatives like the Digital Public Library of America as well. CLICK ON HATHI TRUST TO SHOW CONTENT VISUALIZATIONS. Overlap -- Median overlap for ARL libraries is 50%. Higher for smaller college libraries (already 50% in May 2011). Jeremy York, "HathiTrust: Aspiring to Build the Universal Library". UKSG Annual Conference, March 26, 2012.
  3. According to the report on the HT Constitutional Convention, there are an estimated 300,000 US documents in HathiTrust, which according to the report is 1/5 to 1/3 of all printed documents. Others estimate that gov docs constitute about 4% of all titles in HathiTrust. Malpas, in her report Cloud-sourcing, notes that ~80% of gov docs are in the public domain and thus are or should be viewable in their entirety. Gov docs account for a high percentage of public domain materials. Automatic rights determination: conducted on all works at time of ingest and when records are modified. Public domain worldwide: US works published before 1923, US federal government publications, non-US works published prior to 1872. Public domain in the United States: non-US works published prior to 1923.
  4. Using the Monthly Catalog of US Government Publications, 1895-1976 via ProQuest and the Catalog of Government Publications (1976 onward), Christopher Brown at the University of Denver looked at the percentage of government documents in HathiTrust. Best coverage for the 1970s-1980s; worse coverage for the late 19th century. And of course it drops off in the 2000s with GPO's decision to do away with most print documents.
  5. Sare, Laura. 2012. “A Comparison of HathiTrust and Google Books Using Federal Publications.” Practical Academic Librarianship: The International Journal of the SLA Academic Division 2. Using a random sample of 1540 federal documents published between 1943 and 1976, Sare looked at number of titles found, full text, search interface, and quality of bibliographic records. Overall: Sare found more docs listed in Google Books, but more documents were available as full text in HathiTrust. 1940s (385 total): HT found 98 titles, 90 were full text; Google found 181 titles, only 4 were full text. This is due to Google's decision to consider anything published after 1923 as in-copyright material because it falls within the orphan works timeframe. Those non-full-text titles found in Google Books had either “snippet” views or “no preview” available. -- Better bibliographic data in HathiTrust records, especially for serials. Multiple records and multiple links in WorldCat records for Google Books are especially cumbersome. Putting on the cataloging hat: title changes are noted in HT records but not in Google Books. Snippet views are useful, as is the fact that users can see keywords in context.
  6. Everyone can view and read online the books, journals, and docs in the public domain. Single pages can be downloaded. In fact, anyone can download multiple pages from a public domain volume, just one page at a time. The one main difference for users from a HathiTrust partner library is that they can download entire volumes of public domain materials. The other difference is that users from HT partners can save sets of records. This can be done even if not from a partner library, although it is very, very difficult. --------- Now let's go into HathiTrust. Click on icon to go to HT catalog. Catalog search: AU: United states; AU: Commerce; KW: Lumber. Result: Problems of the softwood lumber industry : hearings before the Committee on Commerce, United States Senate, Eighty-seventh Congress, second session, on impact of lumber imports...by United States. Congress. Senate. Committee on Commerce. Published 1962. Full-text search (both public domain and copyright): Cuban, Castro, immigration. Collections: Official gazette of the United States Patent Offic; British Foreign Office; NASA Technical reports. Under About: Our Research Center. Author search* Different than Google's N-gram; also searches in copyright.
  7. HathiTrust was launched in 2008 by a 12-university consortium known as the Committee on Institutional Cooperation (CIC), along with the University of California system. It has grown to more than 60 partners, including Columbia, Princeton, Yale, Duke, and Johns Hopkins. Also Missou. Content includes Google Books, Internet Archive, and digital collections from partners. hāthī (हाथी) (pronounced HAH-tee) is the Hindi word for elephant, an animal highly regarded for its memory, wisdom, and strength.
  8. Loading records into local ILS -- available to any library, regardless of whether it is a partner or not. 1. University of Michigan provides an OAI feed of MARC21 and Dublin Core records for public domain items; an OAI Toolkit assists in harvesting records. 2. Tab-delimited files used to retrieve metadata. 3. Exporting records from WorldCat: public domain materials are not identified; would need to check items individually. Non-partner libraries that have done this: -- Ball State University -- Kent State University (records available via OhioLINK??), end of 2009/beginning of 2010; link to PowerPoint at end -- University of Colorado at Denver: did this in May 2008 but deleted records in 2011 when it started using Summon; article by Jeffrey Beall on this at end. ** Hathi does assist with creating customized data sets. Issues: -- initial labor -- continual maintenance -- would need to regularly download new files -- can your system handle it? Sluggishness -- duplication
  10. Current challenges with the HathiTrust catalog: can't limit to gov docs; can't search by SuDoc number; and the ongoing issue of agency name changes. All are impediments to users finding gov docs. Another thorny issue is inaccurate copyright status. Sometimes gov docs in the public domain are noted as “in copyright” and thus not available in full view. Google locked down all materials published after 1923 regardless of whether they were gov docs or not.
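
Note 8 mentions harvesting MARC21 and Dublin Core records from an OAI feed. A minimal sketch of what one harvesting step could look like under the generic OAI-PMH protocol follows. The base URL (`https://example.org/oai`), the `marc21` metadata prefix, and the sample response are placeholders assumed for illustration; the real endpoint and the prefixes it supports should be taken from the HathiTrust data documentation.

```python
from typing import List, Optional, Tuple
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace defined by the OAI-PMH 2.0 protocol for response elements.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def build_list_records_url(
    base_url: str,
    metadata_prefix: str = "marc21",  # assumed prefix; check the feed's ListMetadataFormats
    resumption_token: Optional[str] = None,
) -> str:
    """Build a ListRecords request URL for the first or a follow-up page."""
    if resumption_token:
        # In OAI-PMH, resumptionToken is an exclusive argument:
        # it replaces metadataPrefix on follow-up requests.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return f"{base_url}?{urlencode(params)}"


def parse_page(xml_text: str) -> Tuple[List[str], Optional[str]]:
    """Return (record identifiers, next resumption token or None) for one page."""
    root = ET.fromstring(xml_text)
    identifiers = [
        el.text for el in root.findall(".//oai:header/oai:identifier", OAI_NS)
    ]
    token_el = root.find(".//oai:resumptionToken", OAI_NS)
    token = token_el.text.strip() if token_el is not None and token_el.text else None
    return identifiers, token


# Hand-written sample response (not real HathiTrust data) for trying the parser offline.
SAMPLE_PAGE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example.org:0001</identifier></header></record>
    <record><header><identifier>oai:example.org:0002</identifier></header></record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

if __name__ == "__main__":
    ids, token = parse_page(SAMPLE_PAGE)
    print(ids, token)
    print(build_list_records_url("https://example.org/oai", resumption_token=token))
```

A full harvest would fetch each URL, save the records, and repeat until no resumption token comes back. That loop, repeated on a schedule, is the "continual maintenance" cost listed under issues in note 8.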