SlideShare uma empresa Scribd logo
1 de 39
1
IIPC, London Web Archiving Week
16 June 2017
Best Practices for Descriptive
Metadata
Recommendations of the OCLC Research Library Partnership
Web Archiving Metadata Working Group
Alexis Antracoli, Karen Stoll Farrell, Jackie Dooley
oc.lc/wam
2
THE PROBLEM
3
3
• Archived websites often are not easily discoverable via
search engines or library and archives catalogs and finding
aid systems, which inhibits use.
• Absence of community best practices for descriptive
metadata was the most widely-shared web archiving
challenge identified in two surveys:
– OCLC Research Library Partnership (2015)
– Weber/Chapman study of users of archived website (2016)
4
4
5
5
6
OCLC RESEARCH LIBRARY PARTNERSHIP
WEB ARCHIVING METADATA
WORKING GROUP
7
7
Objective
• Recommend best practices for web archiving descriptive
metadata that are community-neutral and standards-
neutral
• A set of defined data elements (i.e., a data dictionary)
8
8
Outputs (July 2017)
• Literature review to inform our understanding of
documented user needs and behaviors
• Best practices for descriptive metadata address both
single-site and collection approaches
• Analysis of descriptive metadata functionalities of eleven
harvesting tools [not covered in today’s session]
LITERATURE REVIEWS
Bailey et al. Ben-David & Huurdeman Bernstein Bragg & Hanna Costa
Costa & Gomes Costa & Silva Cruz & Gomes Dougherty & Meyer Galligan
Gatenby Gibbons Goel Goethals Guenther Hartman et al. Hockx-Yu
Jackson Jones & Shankar Lavoie & Gartner Leetaru Mannheimer Masanès
Milligan Murray & Hsieh Neubert Niu O’Dell Peterson Phillips & Koerbin
Pregill Prom & Swain Ras & van Bussel Reynolds Riley & Crookston
Stirling et al. Sweetser Taylor Thomas et al. Thurman & O’Hanlon
Tillinghast Truman Weber&Graham Webster Wuet al. Zhang et al.
Who are the end users of web archives?
Digital humanists
Web scientists
Computer scientists
Data analysts
Journalists
Lawyers
Website owners
Website designers
Government employees
Genealogists
Patent applicants
Instructors
Students
Linguists
Sociologists
Political scientists
Historians
Anthropologists
How are they using web archives?
• Read specific web pages/sites
• Data and text mining
• Technology development
What behaviors do they use?
Costa and Silva (2010) classify needs into three behavioral
groups; much cited by others.
• Navigational
• Informational
• Transactional
Takeaways for end-user needs
• Flexible Formats
• Engagement
• Access and re-use/rights statements
• Archived vs. live
• Subject access
“Provenance” metadata
• “The critical missing piece”
• Provides context
• Why was the content archived?
• Selection criteria
• Scope
Takeaways for metadata practitioners
• Archival and bibliographic approaches
• RDA, MARC, Dublin Core, MODS, finding aids, DACS
• Data elements vary widely
• Same element name, different meanings
• Level of description
– Single site, collection of sites, seed URLs
• Scalability and limited resources
16
DEVELOPING DESCRIPTIVE METADATA
BEST PRACTICES
17
17
Methodology
• Analyze metadata standards & institutional guidelines
– RDA (libraries), DACS (archives), Dublin Core (simplified)
• Evaluate existing metadata records “in the wild”
– WorldCat, ArchiveGrid, Archive-It
• Identify dilemmas specific to web archiving
• Incorporate findings from literature reviews
• Prepare data dictionary and report narrative
18
WEB-SPECIFIC DILEMMAS
19
19
• Is the website creator/owner the … publisher? author? subject?
• Should the title be … transcribed verbatim from the head of the
site? Edited to clarify the nature/scope of the site? Append e.g. "web
archive”?
• Which dates are important/feasible other than capture
dates? Beginning/end of the site's existence? Date of the content?
Copyright?
• How should extent/size be expressed? 1 archived website?
1 online resource? 6.25 Gb? approximately 300 websites?
• Is the host institution that harvests and manages the
archived content the repository? creator? publisher? selector?
20
20
• Is it important to clearly state that the resource is a website?
If so, where? In the title? description? extent statement? all of these?
• Does provenance refer to …the site owner? the repository that
harvests and hosts the site? ways in which the site evolved?
• Does appraisal mean …the reason the site warrants being
archived? a collection of sites named by the repository? the parts of the
site that were harvested?
• Which URLs should be included? Seed? access? landing page?
21
RECOMMENDED BEST PRACTICES
22
22
Setting the context
• Use cases: library, archives, researcher
• Comparisons between …
– Bibliographic and archival approaches to description
– Description of archived and live sites
– Collection, site, and document-level descriptions
23
23
Data dictionary characteristics
• Lean (14 elements); use on its own or with granular library and archives
standards
• Element names and definitions adopted or adapted from standards
• Usage notes explain how to formulate the content of each element
• The same element is used for a concept at all levels of description as per
multilevel principles expressed in archival standards (DACS and EAD).
24
24
Data dictionary inclusion criteria
• Includes common elements used for identification and discovery of all
types of resource (e.g., Creator, Date, Subject, Title)
• Other elements must have clear applicability to archived websites (e.g.
Access Conditions, Description, URL)
• Elements excluded that rarely (if ever) appear in guidelines and/or extant
metadata records and have no web-specific meaning (e.g. audience,
publisher, statement of responsibility)
25
25
WAM data elements
Access/Rights * Extent Title *
Collector Genre/Form URL
Contributor * Language *
Creator * Relation *
Date * Source of Description
Description * Subject *
* = 9 of 14 element names/meanings match Dublin Core
26
26
Access Conditions [to be renamed Rights]
Definition: Circumstances that affect the availability [and/or re-use] of an archived
website or collection.
Use Access Conditions to record whether or not conditions exist that restrict user access to
the archived content. These might include the need to make an appointment for onsite
use or a specified period of time during which the content is embargoed. Such conditions
may be imposed by an archival repository, donor, other agency, or legal statute.
This content is embargoed from public access until 2025.
Due to Twitter's Terms of Service, this data archive is accessible only to the University
of Miami community …
Maps to “Rights” in Dublin Core.
27
27
Access Conditions: Crosswalks
Crosswalks
Dublin Core Rights
EAD <accessrestrict>
<userestrict>
MARC 506
MODS <accessCondition>
schema.org schema:license
schema:isAccessiblrForFree
28
28
Collector
Definition: The organization responsible for curation and stewardship of an
archived website or collection.
Use Collector for the organization that selects the web content for archiving, creates
metadata and performs other activities associated with “ownership” of a resource.
Stated another way, this is the organization that has taken responsibility for the archived
content, although the digital files are not necessarily stored and maintained by this
organization (collections harvested using Archive-It are a prominent example).
No equivalent in Dublin Core.
29
29
Collector: Lifecycle activities
Institutions involved in web archiving engage in a variety of activities during the lifecycle
of archiving web content. We identified four activities performed by the institution that
assumes responsibility for archiving web content:
• Selecting websites for archiving
• Harvesting the content of the designated seed URLs
• Creating and maintaining metadata to describe the content
• Making decisions about other aspects of collections management, including how
the harvested files will be preserved and how will access be provided.
30
30
Collector: Examples
Creator: Seattle (Wash.)
Title: City of Seattle Harvested Websites
Collector: Seattle Municipal Archives
-============
Title: Globalchange.gov
Contributor: U.S. Global Change Research Program
Collector: Federal Depository Library Program
============
Creator: Association for Research into Crimes against Art
Title: ARCAblog : promoting the study and research of art crime and cultural
heritage protection
Collector: New York Art Resources Consortium
31
31
Collector: Crosswalks
Crosswalks
Dublin Core Contributor
EAD <repository>
MARC 524
852 subfield a
852 subfield b
MODS <location>
schema.org schema:OwnershipInfo
32
32
Source of description
Definition: Information about the gathering or creation of the metadata
itself, such as sources of data or the date on which source data was
obtained.
Source of Information is used to identify the source of all or some of the
metadata, particularly for descriptions of single sites. Basic aspects of a
website (creator name, title, etc.) may change significantly, but the
responsible institution is unlikely to have the resources to become aware of
changes, let alone update the metadata. Include the date on which the site
was examined and the location from which the information was taken.
No equivalent in Dublin Core.
33
33
Source of description: Examples
Description based on archived web page captured Sept. 22, 2016; title from
title screen (viewed Oct. 27, 2016)
Title from home page last updated June 21, 2012 (viewed June 22, 2012)
Title from home page (viewed on Oct. 11, 2007)
Title from HTML header (viewed Feb. 16, 2006)
34
34
Source of description: Crosswalks
35
35
WAM data elements (14)
Access/Rights * Extent Title *
Collector Genre/Form URL
Contributor * Language *
Creator * Relation *
Date * Source of Description
Description * Subject *
* = 9 of 14 element names/meanings match Dublin Core
36
PUBLICATION IN LATE JULY
37
37
Three simultaneous reports
● Best practices for descriptive metadata
○ With data dictionary
● User needs
○ With annotated bibliography
● Tools
○ With evaluation grids
38
Q&A
39
SM
Jackie Dooley
Program Officer, OCLC Research
dooleyj@oclc.org
@minniedw
IIPC, Web Archiving Week
16 June 2017
©2016 OCLC. This work is licensed under a Creative Commons Attribution 4.0 International License. Suggested attribution:
“This work uses content from Developing Best Practices for Web Archiving Metadata to Meet User Needs
© OCLC, used under a Creative Commons Attribution 4.0 International License:
http://creativecommons.org/licenses/by/4.0/.”
For more information, please contact:
oc.lc/wam

Mais conteúdo relacionado

Mais procurados

Collection Directions: Some Reflections on Libraries and Stewardship of the ...
 Collection Directions: Some Reflections on Libraries and Stewardship of the ... Collection Directions: Some Reflections on Libraries and Stewardship of the ...
Collection Directions: Some Reflections on Libraries and Stewardship of the ...OCLC
 
The library in the life of the user
The library in the life of the userThe library in the life of the user
The library in the life of the userlisld
 
Social metadata for libraries, archives and museums: Research findings from t...
Social metadata for libraries, archives and museums: Research findings from t...Social metadata for libraries, archives and museums: Research findings from t...
Social metadata for libraries, archives and museums: Research findings from t...Rose Holley
 
Cloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructureCloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructureOCLC Research
 
The OCLC Research Library Partnership
The OCLC Research Library PartnershipThe OCLC Research Library Partnership
The OCLC Research Library PartnershipOCLC
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataShenghui Wang
 
The Evolving Scholarly Record Framing the Landscape
The Evolving Scholarly Record Framing the LandscapeThe Evolving Scholarly Record Framing the Landscape
The Evolving Scholarly Record Framing the LandscapeOCLC
 
The facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environmentThe facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environmentlisld
 
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...Charleston Conference
 
Working collaboratively: scaling infrastructure, services, learning and innov...
Working collaboratively: scaling infrastructure, services, learning and innov...Working collaboratively: scaling infrastructure, services, learning and innov...
Working collaboratively: scaling infrastructure, services, learning and innov...lisld
 
Linked Data Implementations—Who, What and Why?
Linked Data Implementations—Who, What and Why?Linked Data Implementations—Who, What and Why?
Linked Data Implementations—Who, What and Why?OCLC
 
Library collections and the emerging scholarly record
Library collections and the emerging scholarly recordLibrary collections and the emerging scholarly record
Library collections and the emerging scholarly recordlisld
 
Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...
Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...
Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...Megan Hurst
 
Library discovery: past, present and some futures
Library discovery: past, present and some futuresLibrary discovery: past, present and some futures
Library discovery: past, present and some futureslisld
 
The SHARES Partnership, Plus Tracking Trends in ILL Cost and Transaction Data
The SHARES Partnership, Plus Tracking Trends in ILL Cost and Transaction DataThe SHARES Partnership, Plus Tracking Trends in ILL Cost and Transaction Data
The SHARES Partnership, Plus Tracking Trends in ILL Cost and Transaction DataOCLC
 
BIBFRAME and OCLC Works: Defining Models and Discovering Evidence
BIBFRAME and OCLC Works: Defining Models and Discovering EvidenceBIBFRAME and OCLC Works: Defining Models and Discovering Evidence
BIBFRAME and OCLC Works: Defining Models and Discovering EvidenceOCLC
 
Towards collaboration at scale: Libraries, the social and the technical
Towards collaboration at scale:  Libraries, the social and the technicalTowards collaboration at scale:  Libraries, the social and the technical
Towards collaboration at scale: Libraries, the social and the technicallisld
 

Mais procurados (20)

Collection Directions: Some Reflections on Libraries and Stewardship of the ...
 Collection Directions: Some Reflections on Libraries and Stewardship of the ... Collection Directions: Some Reflections on Libraries and Stewardship of the ...
Collection Directions: Some Reflections on Libraries and Stewardship of the ...
 
The library in the life of the user
The library in the life of the userThe library in the life of the user
The library in the life of the user
 
Social metadata for libraries, archives and museums: Research findings from t...
Social metadata for libraries, archives and museums: Research findings from t...Social metadata for libraries, archives and museums: Research findings from t...
Social metadata for libraries, archives and museums: Research findings from t...
 
Cloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructureCloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructure
 
The OCLC Research Library Partnership
The OCLC Research Library PartnershipThe OCLC Research Library Partnership
The OCLC Research Library Partnership
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadata
 
The Evolving Scholarly Record Framing the Landscape
The Evolving Scholarly Record Framing the LandscapeThe Evolving Scholarly Record Framing the Landscape
The Evolving Scholarly Record Framing the Landscape
 
The facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environmentThe facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environment
 
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
 
Working collaboratively: scaling infrastructure, services, learning and innov...
Working collaboratively: scaling infrastructure, services, learning and innov...Working collaboratively: scaling infrastructure, services, learning and innov...
Working collaboratively: scaling infrastructure, services, learning and innov...
 
Linked Data Implementations—Who, What and Why?
Linked Data Implementations—Who, What and Why?Linked Data Implementations—Who, What and Why?
Linked Data Implementations—Who, What and Why?
 
Library collections and the emerging scholarly record
Library collections and the emerging scholarly recordLibrary collections and the emerging scholarly record
Library collections and the emerging scholarly record
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...
Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...
Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...
 
NISO Webinar: The Future of Integrated Library Systems PART 2: User Interaction
NISO Webinar: The Future of Integrated Library Systems PART 2: User InteractionNISO Webinar: The Future of Integrated Library Systems PART 2: User Interaction
NISO Webinar: The Future of Integrated Library Systems PART 2: User Interaction
 
Library discovery: past, present and some futures
Library discovery: past, present and some futuresLibrary discovery: past, present and some futures
Library discovery: past, present and some futures
 
The SHARES Partnership, Plus Tracking Trends in ILL Cost and Transaction Data
The SHARES Partnership, Plus Tracking Trends in ILL Cost and Transaction DataThe SHARES Partnership, Plus Tracking Trends in ILL Cost and Transaction Data
The SHARES Partnership, Plus Tracking Trends in ILL Cost and Transaction Data
 
NISO Standards Update @ ALA Midwinter, January 27, 2013 in Seattle, WA
NISO Standards Update @ ALA Midwinter, January 27, 2013 in Seattle, WANISO Standards Update @ ALA Midwinter, January 27, 2013 in Seattle, WA
NISO Standards Update @ ALA Midwinter, January 27, 2013 in Seattle, WA
 
BIBFRAME and OCLC Works: Defining Models and Discovering Evidence
BIBFRAME and OCLC Works: Defining Models and Discovering EvidenceBIBFRAME and OCLC Works: Defining Models and Discovering Evidence
BIBFRAME and OCLC Works: Defining Models and Discovering Evidence
 
Towards collaboration at scale: Libraries, the social and the technical
Towards collaboration at scale:  Libraries, the social and the technicalTowards collaboration at scale:  Libraries, the social and the technical
Towards collaboration at scale: Libraries, the social and the technical
 

Semelhante a Best Practices for Descriptive Metadata

Practical Metadata Where Do I Start For a Digital Project
Practical Metadata Where Do I Start For a Digital ProjectPractical Metadata Where Do I Start For a Digital Project
Practical Metadata Where Do I Start For a Digital ProjectJill Strass
 
Digital Repositories: Essential Information for Academic Librarians
Digital Repositories: Essential Information for Academic LibrariansDigital Repositories: Essential Information for Academic Librarians
Digital Repositories: Essential Information for Academic LibrariansJeffrey Beall
 
NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)Christine Stohn
 
Summary of Trends in Cataloging
Summary of Trends in CatalogingSummary of Trends in Cataloging
Summary of Trends in CatalogingWilliam Worford
 
Linked Data, Library Users, and the Discovery Tools of the Future
Linked Data, Library Users, and the Discovery Tools of the FutureLinked Data, Library Users, and the Discovery Tools of the Future
Linked Data, Library Users, and the Discovery Tools of the FutureEmily Nimsakont
 
Lodlam.slideshare
Lodlam.slideshareLodlam.slideshare
Lodlam.slideshareHafabe
 
Next Generation Repositories
Next Generation RepositoriesNext Generation Repositories
Next Generation Repositoriesukcorr
 
Resource discovery and information sharing: reaching the 2.0 turn
Resource discovery and information sharing: reaching the 2.0 turnResource discovery and information sharing: reaching the 2.0 turn
Resource discovery and information sharing: reaching the 2.0 turnBonaria Biancu
 
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...UKSG: connecting the knowledge community
 
A brief overview of metadata for datasets
A brief overview of metadata for datasetsA brief overview of metadata for datasets
A brief overview of metadata for datasetssesrdm
 
The commitment of arabic sites in the field of libraries and information that...
The commitment of arabic sites in the field of libraries and information that...The commitment of arabic sites in the field of libraries and information that...
The commitment of arabic sites in the field of libraries and information that...Alexander Decker
 
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata Matters
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata MattersAlphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata Matters
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata MattersNew York University
 
Aggregation of cultural heritage datasets through the Web of Data
Aggregation of cultural heritage datasets through the Web of DataAggregation of cultural heritage datasets through the Web of Data
Aggregation of cultural heritage datasets through the Web of DataNuno Freire
 
Improving library services with semantic web technology in the realm of repos...
Improving library services with semantic web technology in the realm of repos...Improving library services with semantic web technology in the realm of repos...
Improving library services with semantic web technology in the realm of repos...redsys
 
An Ecosystem for Digital Costume Collections
An Ecosystem for Digital Costume CollectionsAn Ecosystem for Digital Costume Collections
An Ecosystem for Digital Costume CollectionsArden Kirkland
 

Semelhante a Best Practices for Descriptive Metadata (20)

Practical Metadata Where Do I Start For a Digital Project
Practical Metadata Where Do I Start For a Digital ProjectPractical Metadata Where Do I Start For a Digital Project
Practical Metadata Where Do I Start For a Digital Project
 
UAEU_MDL_Slides_rev1.ppt
UAEU_MDL_Slides_rev1.pptUAEU_MDL_Slides_rev1.ppt
UAEU_MDL_Slides_rev1.ppt
 
Digital Repositories: Essential Information for Academic Librarians
Digital Repositories: Essential Information for Academic LibrariansDigital Repositories: Essential Information for Academic Librarians
Digital Repositories: Essential Information for Academic Librarians
 
NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)
 
Summary of Trends in Cataloging
Summary of Trends in CatalogingSummary of Trends in Cataloging
Summary of Trends in Cataloging
 
Linked Data, Library Users, and the Discovery Tools of the Future
Linked Data, Library Users, and the Discovery Tools of the FutureLinked Data, Library Users, and the Discovery Tools of the Future
Linked Data, Library Users, and the Discovery Tools of the Future
 
Lodlam.slideshare
Lodlam.slideshareLodlam.slideshare
Lodlam.slideshare
 
Next Generation Repositories
Next Generation RepositoriesNext Generation Repositories
Next Generation Repositories
 
Resource discovery and information sharing: reaching the 2.0 turn
Resource discovery and information sharing: reaching the 2.0 turnResource discovery and information sharing: reaching the 2.0 turn
Resource discovery and information sharing: reaching the 2.0 turn
 
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
 
A brief overview of metadata for datasets
A brief overview of metadata for datasetsA brief overview of metadata for datasets
A brief overview of metadata for datasets
 
Will's World: Walking Through Shakespeare
Will's World: Walking Through ShakespeareWill's World: Walking Through Shakespeare
Will's World: Walking Through Shakespeare
 
The commitment of arabic sites in the field of libraries and information that...
The commitment of arabic sites in the field of libraries and information that...The commitment of arabic sites in the field of libraries and information that...
The commitment of arabic sites in the field of libraries and information that...
 
Edina cigs-21-september-2012
Edina cigs-21-september-2012Edina cigs-21-september-2012
Edina cigs-21-september-2012
 
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
 
Finding Data Sets
Finding Data SetsFinding Data Sets
Finding Data Sets
 
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata Matters
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata MattersAlphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata Matters
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata Matters
 
Aggregation of cultural heritage datasets through the Web of Data
Aggregation of cultural heritage datasets through the Web of DataAggregation of cultural heritage datasets through the Web of Data
Aggregation of cultural heritage datasets through the Web of Data
 
Improving library services with semantic web technology in the realm of repos...
Improving library services with semantic web technology in the realm of repos...Improving library services with semantic web technology in the realm of repos...
Improving library services with semantic web technology in the realm of repos...
 
An Ecosystem for Digital Costume Collections
An Ecosystem for Digital Costume CollectionsAn Ecosystem for Digital Costume Collections
An Ecosystem for Digital Costume Collections
 

Mais de OCLC

Communicating library impact beyond library walls: Findings from an action-or...
Communicating library impact beyond library walls: Findings from an action-or...Communicating library impact beyond library walls: Findings from an action-or...
Communicating library impact beyond library walls: Findings from an action-or...OCLC
 
"You can just tell whether a website looks reliable or not." People's modes o...
"You can just tell whether a website looks reliable or not." People's modes o..."You can just tell whether a website looks reliable or not." People's modes o...
"You can just tell whether a website looks reliable or not." People's modes o...OCLC
 
Factors influencing research data management programs.
Factors influencing research data management programs.Factors influencing research data management programs.
Factors influencing research data management programs.OCLC
 
Teaching research methods in LIS programs: Approaches, formats, and innovativ...
Teaching research methods in LIS programs: Approaches, formats, and innovativ...Teaching research methods in LIS programs: Approaches, formats, and innovativ...
Teaching research methods in LIS programs: Approaches, formats, and innovativ...OCLC
 
OCLC ALISE Library & Information Science Research Grant Program
OCLC ALISE Library & Information Science Research Grant ProgramOCLC ALISE Library & Information Science Research Grant Program
OCLC ALISE Library & Information Science Research Grant ProgramOCLC
 
Investing in library users and potential users: The Many Faces of Digital Vi...
 Investing in library users and potential users: The Many Faces of Digital Vi... Investing in library users and potential users: The Many Faces of Digital Vi...
Investing in library users and potential users: The Many Faces of Digital Vi...OCLC
 
Academic library impact: Improving practice and essential areas to research
Academic library impact: Improving practice and essential areas to researchAcademic library impact: Improving practice and essential areas to research
Academic library impact: Improving practice and essential areas to researchOCLC
 
Studying information behavior: The Many Faces of Digital Visitors and Residents
Studying information behavior: The Many Faces of Digital Visitors and ResidentsStudying information behavior: The Many Faces of Digital Visitors and Residents
Studying information behavior: The Many Faces of Digital Visitors and ResidentsOCLC
 
Online engagement and information literacy: The Many Face of Digital Visitors...
Online engagement and information literacy: The Many Face of Digital Visitors...Online engagement and information literacy: The Many Face of Digital Visitors...
Online engagement and information literacy: The Many Face of Digital Visitors...OCLC
 
People's mode of online engagement: The Many Faces of Digital Visitors and R...
 People's mode of online engagement: The Many Faces of Digital Visitors and R... People's mode of online engagement: The Many Faces of Digital Visitors and R...
People's mode of online engagement: The Many Faces of Digital Visitors and R...OCLC
 
Applying research methods: Investigating the Many Faces of Digital Visitors &...
Applying research methods: Investigating the Many Faces of Digital Visitors &...Applying research methods: Investigating the Many Faces of Digital Visitors &...
Applying research methods: Investigating the Many Faces of Digital Visitors &...OCLC
 
OCLC RLP @ RLUK
OCLC RLP @ RLUKOCLC RLP @ RLUK
OCLC RLP @ RLUKOCLC
 
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopUsing Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopOCLC
 
Visitors and Residents: The Hows and Whys of Engagement with Technology
Visitors and Residents: The Hows and Whys of Engagement with TechnologyVisitors and Residents: The Hows and Whys of Engagement with Technology
Visitors and Residents: The Hows and Whys of Engagement with TechnologyOCLC
 
Action-Oriented Research Agenda on Library Contributions to Student Learning ...
Action-Oriented Research Agenda on Library Contributions to Student Learning ...Action-Oriented Research Agenda on Library Contributions to Student Learning ...
Action-Oriented Research Agenda on Library Contributions to Student Learning ...OCLC
 
Visitors and Residents: Interactive Mapping Exercise Workshop
Visitors and Residents: Interactive Mapping Exercise WorkshopVisitors and Residents: Interactive Mapping Exercise Workshop
Visitors and Residents: Interactive Mapping Exercise WorkshopOCLC
 
The Library in the Life of the User
The Library in the Life of the UserThe Library in the Life of the User
The Library in the Life of the UserOCLC
 
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...OCLC
 
Changing Tack: A Future-Focused ACRL Research Agenda
Changing Tack: A Future-Focused ACRL Research AgendaChanging Tack: A Future-Focused ACRL Research Agenda
Changing Tack: A Future-Focused ACRL Research AgendaOCLC
 
Qualitative Research Methods in LIS
Qualitative Research Methods in LISQualitative Research Methods in LIS
Qualitative Research Methods in LISOCLC
 

Mais de OCLC (20)

Communicating library impact beyond library walls: Findings from an action-or...
Communicating library impact beyond library walls: Findings from an action-or...Communicating library impact beyond library walls: Findings from an action-or...
Communicating library impact beyond library walls: Findings from an action-or...
 
"You can just tell whether a website looks reliable or not." People's modes o...
"You can just tell whether a website looks reliable or not." People's modes o..."You can just tell whether a website looks reliable or not." People's modes o...
"You can just tell whether a website looks reliable or not." People's modes o...
 
Factors influencing research data management programs.
Factors influencing research data management programs.Factors influencing research data management programs.
Factors influencing research data management programs.
 
Teaching research methods in LIS programs: Approaches, formats, and innovativ...
Teaching research methods in LIS programs: Approaches, formats, and innovativ...Teaching research methods in LIS programs: Approaches, formats, and innovativ...
Teaching research methods in LIS programs: Approaches, formats, and innovativ...
 
OCLC ALISE Library & Information Science Research Grant Program
OCLC ALISE Library & Information Science Research Grant ProgramOCLC ALISE Library & Information Science Research Grant Program
OCLC ALISE Library & Information Science Research Grant Program
 
Investing in library users and potential users: The Many Faces of Digital Vi...
 Investing in library users and potential users: The Many Faces of Digital Vi... Investing in library users and potential users: The Many Faces of Digital Vi...
Investing in library users and potential users: The Many Faces of Digital Vi...
 
Academic library impact: Improving practice and essential areas to research
Academic library impact: Improving practice and essential areas to researchAcademic library impact: Improving practice and essential areas to research
Academic library impact: Improving practice and essential areas to research
 
Studying information behavior: The Many Faces of Digital Visitors and Residents
Studying information behavior: The Many Faces of Digital Visitors and ResidentsStudying information behavior: The Many Faces of Digital Visitors and Residents
Studying information behavior: The Many Faces of Digital Visitors and Residents
 
Online engagement and information literacy: The Many Face of Digital Visitors...
Online engagement and information literacy: The Many Face of Digital Visitors...Online engagement and information literacy: The Many Face of Digital Visitors...
Online engagement and information literacy: The Many Face of Digital Visitors...
 
People's mode of online engagement: The Many Faces of Digital Visitors and R...
 People's mode of online engagement: The Many Faces of Digital Visitors and R... People's mode of online engagement: The Many Faces of Digital Visitors and R...
People's mode of online engagement: The Many Faces of Digital Visitors and R...
 
Applying research methods: Investigating the Many Faces of Digital Visitors &...
Applying research methods: Investigating the Many Faces of Digital Visitors &...Applying research methods: Investigating the Many Faces of Digital Visitors &...
Applying research methods: Investigating the Many Faces of Digital Visitors &...
 
OCLC RLP @ RLUK
OCLC RLP @ RLUKOCLC RLP @ RLUK
OCLC RLP @ RLUK
 
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopUsing Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
 
Visitors and Residents: The Hows and Whys of Engagement with Technology
Visitors and Residents: The Hows and Whys of Engagement with TechnologyVisitors and Residents: The Hows and Whys of Engagement with Technology
Visitors and Residents: The Hows and Whys of Engagement with Technology
 
Action-Oriented Research Agenda on Library Contributions to Student Learning ...
Action-Oriented Research Agenda on Library Contributions to Student Learning ...Action-Oriented Research Agenda on Library Contributions to Student Learning ...
Action-Oriented Research Agenda on Library Contributions to Student Learning ...
 
Visitors and Residents: Interactive Mapping Exercise Workshop
Visitors and Residents: Interactive Mapping Exercise WorkshopVisitors and Residents: Interactive Mapping Exercise Workshop
Visitors and Residents: Interactive Mapping Exercise Workshop
 
The Library in the Life of the User
The Library in the Life of the UserThe Library in the Life of the User
The Library in the Life of the User
 
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
 
Changing Tack: A Future-Focused ACRL Research Agenda
Changing Tack: A Future-Focused ACRL Research AgendaChanging Tack: A Future-Focused ACRL Research Agenda
Changing Tack: A Future-Focused ACRL Research Agenda
 
Qualitative Research Methods in LIS
Qualitative Research Methods in LISQualitative Research Methods in LIS
Qualitative Research Methods in LIS
 

Último

Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxnelietumpap1
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 

Último (20)

Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptx
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 

Best Practices for Descriptive Metadata

  • 1. 1 IIPC, London Web Archiving Week 16 June 2017 Best Practices for Descriptive Metadata Recommendations of the OCLC Research Library Partnership Web Archiving Metadata Working Group Alexis Antracoli, Karen Stoll Farrell, Jackie Dooley oc.lc/wam
  • 3. 3 3 • Archived websites often are not easily discoverable via search engines or library and archives catalogs and finding aid systems, which inhibits use. • Absence of community best practices for descriptive metadata was the most widely-shared web archiving challenge identified in two surveys: – OCLC Research Library Partnership (2015) – Weber/Chapman study of users of archived website (2016)
  • 4. 4 4
  • 5. 5 5
  • 6. 6 OCLC RESEARCH LIBRARY PARTNERSHIP WEB ARCHIVING METADATA WORKING GROUP
  • 7. 7 7 Objective • Recommend best practices for web archiving descriptive metadata that are community-neutral and standards- neutral • A set of defined data elements (i.e., a data dictionary)
  • 8. 8 8 Outputs (July 2017) • Literature review to inform our understanding of documented user needs and behaviors • Best practices for descriptive metadata address both single-site and collection approaches • Analysis of descriptive metadata functionalities of eleven harvesting tools [not covered in today’s session]
  • 9. LITERATURE REVIEWS Bailey et al. Ben-David & Huurdeman Bernstein Bragg & Hanna Costa Costa & Gomes Costa & Silva Cruz & Gomes Dougherty & Meyer Galligan Gatenby Gibbons Goel Goethals Guenther Hartman et al. Hockx-Yu Jackson Jones & Shankar Lavoie & Gartner Leetaru Mannheimer Masanès Milligan Murray & Hsieh Neubert Niu O’Dell Peterson Phillips & Koerbin Pregill Prom & Swain Ras & van Bussel Reynolds Riley & Crookston Stirling et al. Sweetser Taylor Thomas et al. Thurman & O’Hanlon Tillinghast Truman Weber&Graham Webster Wuet al. Zhang et al.
  • 10. Who are the end users of web archives? Digital humanists Web scientists Computer scientists Data analysts Journalists Lawyers Website owners Website designers Government employees Genealogists Patent applicants Instructors Students Linguists Sociologists Political scientists Historians Anthropologists
  • 11. How are they using web archives? • Read specific web pages/sites • Data and text mining • Technology development
  • 12. What behaviors do they use? Costa and Silva (2010) classify needs into three behavioral groups; much cited by others. • Navigational • Informational • Transactional
  • 13. Takeaways for end-user needs • Flexible Formats • Engagement • Access and re-use/rights statements • Archived vs. live • Subject access
  • 14. “Provenance” metadata • “The critical missing piece” • Provides context • Why was the content archived? • Selection criteria • Scope
  • 15. Takeaways for metadata practitioners • Archival and bibliographic approaches • RDA, MARC, Dublin Core, MODS, finding aids, DACS • Data elements vary widely • Same element name, different meanings • Level of description – Single site, collection of sites, seed URLs • Scalability and limited resources
  • 17. 17 17 Methodology • Analyze metadata standards & institutional guidelines – RDA (libraries), DACS (archives), Dublin Core (simplified) • Evaluate existing metadata records “in the wild” – WorldCat, ArchiveGrid, Archive-It • Identify dilemmas specific to web archiving • Incorporate findings from literature reviews • Prepare data dictionary and report narrative
  • 19. 19 19 • Is the website creator/owner the … publisher? author? subject? • Should the title be … transcribed verbatim from the head of the site? Edited to clarify the nature/scope of the site? Append e.g. "web archive”? • Which dates are important/feasible other than capture dates? Beginning/end of the site's existence? Date of the content? Copyright? • How should extent/size be expressed? 1 archived website? 1 online resource? 6.25 Gb? approximately 300 websites? • Is the host institution that harvests and manages the archived content the repository? creator? publisher? selector?
  • 20. 20 20 • Is it important to clearly state that the resource is a website? If so, where? In the title? description? extent statement? all of these? • Does provenance refer to …the site owner? the repository that harvests and hosts the site? ways in which the site evolved? • Does appraisal mean …the reason the site warrants being archived? a collection of sites named by the repository? the parts of the site that were harvested? • Which URLs should be included? Seed? access? landing page?
  • 22. 22 22 Setting the context • Use cases: library, archives, researcher • Comparisons between … – Bibliographic and archival approaches to description – Description of archived and live sites – Collection, site, and document-level descriptions
  • 23. 23 23 Data dictionary characteristics • Lean (14 elements); use on its own or with granular library and archives standards • Element names and definitions adopted or adapted from standards • Usage notes explain how to formulate the content of each element • The same element is used for a concept at all levels of description as per multilevel principles expressed in archival standards (DACS and EAD).
  • 24. 24 24 Data dictionary inclusion criteria • Includes common elements used for identification and discovery of all types of resource (e.g., Creator, Date, Subject, Title) • Other elements must have clear applicability to archived websites (e.g. Access Conditions, Description, URL) • Elements excluded that rarely (if ever) appear in guidelines and/or extant metadata records and have no web-specific meaning (e.g. audience, publisher, statement of responsibility)
  • 25. 25 25 WAM data elements Access/Rights * Extent Title * Collector Genre/Form URL Contributor * Language * Creator * Relation * Date * Source of Description Description * Subject * * = 9 of 14 element names/meanings match Dublin Core
  • 26. 26 26 Access Conditions [to be renamed Rights] Definition: Circumstances that affect the availability [and/or re-use] of an archived website or collection. Use Access Conditions to record whether or not conditions exist that restrict user access to the archived content. These might include the need to make an appointment for onsite use or a specified period of time during which the content is embargoed. Such conditions may be imposed by an archival repository, donor, other agency, or legal statute. This content is embargoed from public access until 2025. Due to Twitter's Terms of Service, this data archive is accessible only to the University of Miami community … Maps to “Rights” in Dublin Core.
  • 27. 27 27 Access Conditions: Crosswalks Crosswalks Dublin Core Rights EAD <accessrestrict> <userestrict> MARC 506 MODS <accessCondition> schema.org schema:license schema:isAccessiblrForFree
  • 28. 28 28 Collector Definition: The organization responsible for curation and stewardship of an archived website or collection. Use Collector for the organization that selects the web content for archiving, creates metadata and performs other activities associated with “ownership” of a resource. Stated another way, this is the organization that has taken responsibility for the archived content, although the digital files are not necessarily stored and maintained by this organization (collections harvested using Archive-It are a prominent example). No equivalent in Dublin Core.
  • 29. 29 29 Collector: Lifecycle activities Institutions involved in web archiving engage in a variety of activities during the lifecycle of archiving web content. We identified four activities performed by the institution that assumes responsibility for archiving web content: • Selecting websites for archiving • Harvesting the content of the designated seed URLs • Creating and maintaining metadata to describe the content • Making decisions about other aspects of collections management, including how the harvested files will be preserved and how will access be provided.
  • 30. 30 30 Collector: Examples Creator: Seattle (Wash.) Title: City of Seattle Harvested Websites Collector: Seattle Municipal Archives -============ Title: Globalchange.gov Contributor: U.S. Global Change Research Program Collector: Federal Depository Library Program ============ Creator: Association for Research into Crimes against Art Title: ARCAblog : promoting the study and research of art crime and cultural heritage protection Collector: New York Art Resources Consortium
  • 31. 31 31 Collector: Crosswalks Crosswalks Dublin Core Contributor EAD <repository> MARC 524 852 subfield a 852 subfield b MODS <location> schema.org schema:OwnershipInfo
  • 32. 32 32 Source of description Definition: Information about the gathering or creation of the metadata itself, such as sources of data or the date on which source data was obtained. Source of Information is used to identify the source of all or some of the metadata, particularly for descriptions of single sites. Basic aspects of a website (creator name, title, etc.) may change significantly, but the responsible institution is unlikely to have the resources to become aware of changes, let alone update the metadata. Include the date on which the site was examined and the location from which the information was taken. No equivalent in Dublin Core.
  • 33. 33 33 Source of description: Examples Description based on archived web page captured Sept. 22, 2016; title from title screen (viewed Oct. 27, 2016) Title from home page last updated June 21, 2012 (viewed June 22, 2012) Title from home page (viewed on Oct. 11, 2007) Title from HTML header (viewed Feb. 16, 2006)
  • 35. 35 35 WAM data elements (14) Access/Rights * Extent Title * Collector Genre/Form URL Contributor * Language * Creator * Relation * Date * Source of Description Description * Subject * * = 9 of 14 element names/meanings match Dublin Core
  • 37. 37 37 Three simultaneous reports ● Best practices for descriptive metadata ○ With data dictionary ● User needs ○ With annotated bibliography ● Tools ○ With evaluation grids
  • 39. 39 SM Jackie Dooley Program Officer, OCLC Research dooleyj@oclc.org @minniedw IIPC, Web Archiving Week 16 June 2017 ©2016 OCLC. This work is licensed under a Creative Commons Attribution 4.0 International License. Suggested attribution: “This work uses content from Developing Best Practices for Web Archiving Metadata to Meet User Needs © OCLC, used under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0/.” For more information, please contact: oc.lc/wam