SlideShare uma empresa Scribd logo
1 de 32
Prototypes of pro-active approaches to support the
archiving of web references for scholarly
communications
Richard Wincewicz1, Peter Burnhill1
& Herbert Van de Sompel2
1EDINA, University of Edinburgh, 2Los Alamos National Laboratory
The Project Team
2013 – 2015, funded by the
Andrew W. Mellon
Foundation
• Los Alamos National Laboratory:
Research Library: Herbert Van de Sompel
Harihar Shankar, [Martin Klein, Rob Sanderson]
• University of Edinburgh:
Language Technology Group: Claire Grover,
Beatrice Alex, Colin Matheson, Richard Tobin, [Ke “Adam” Zhou]
EDINA * : Peter Burnhill, Muriel Mewissen (Project Manager),
Tim Stickland, Richard Wincewicz, [Neil Mayo]
Centre for Service Delivery & Digital Expertise
Overview
1. Introduction
2. Evidence
3. Remedy
1. Introduction
Reference Rot
Links to Web at Large resources are subject to
Reference Rot. This is a combination of two factors:
• Link Rot: Link stops working
• e.g. HTTP 404 “Not Found”
• Content Drift: Linked content changes over time
• Possibly to the extent that it is no longer
representative of the content that was initially
referenced
2. Evidence
Articles that Link to Articles & to Web At Large Resources
(PMC)
Martin Klein et al. (2014) Scholarly context not found
http://dx.doi.org/10.1371/journal.pone.0115253
Articles that Link to Articles & to Web At Large Resources
(Elsevier)
Martin Klein et al. (2014) Scholarly context not found
http://dx.doi.org/10.1371/journal.pone.0115253
Articles with URI References (PMC)
Articles 479,194
with URI references 399,005
with URI references to articles 240,857
with URI references to Web at Large 156,160
Martin Klein et al. (2014) Scholarly context not found
http://dx.doi.org/10.1371/journal.pone.0115253
Link Rot (PMC)
Martin Klein et al. (2014) Scholarly context not found
http://dx.doi.org/10.1371/journal.pone.0115253
Link Rot (Elsevier)
Martin Klein et al. (2014) Scholarly context not found
http://dx.doi.org/10.1371/journal.pone.0115253
Links from arXiv, Elsevier, PMC to TLD Targets
Martin Klein et al. (2014) Scholarly context not found. In: PLOS ONE
http://dx.doi.org/10.1371/journal.pone.0115253
Grey is Link Rot – Referenced Content Not Accessible
Martin Klein et al. (2014) Scholarly context not found. In: PLOS ONE
http://dx.doi.org/10.1371/journal.pone.0115253
Grey is Not Archived - Referenced Content Lost
Martin Klein et al. (2014) Scholarly context not found. In: PLOS ONE
http://dx.doi.org/10.1371/journal.pone.0115253
Content Drift – http://dl00.org
2000 2004
2005 2008
(a) Dynamic content
values on webpage change
over time
(b) Static content
but very different (often
unrelated) web pages
3. Remedy
Create Snapshots of Referenced Resources
Various web archives support on-demand creation of
snapshots of URIs (manual, API):
 archive.today
 Internet Archive
 perma.cc
 webcitation.org
When creating snapshots, maintain:
 Original URI
 Snapshot URI
 Date/Time of snapshot
Create Snapshots of Referenced Resources
Snapshots can be created at various stages. The closer to
the moment of referencing, the better the image captured.
Stage Actor Snapshot Quality
Preparation Author/reference tool best
Submission
/Issue
Editor/manuscript
system
good
Publication
Aggregator/
publisher platform
ok
Post-publication
Librarian/IR,
journal archive
better than nothing
Authoring - Zotero Plugin Demonstrator
Richard Wincewicz (2014) Prototype Hiberlink plugin for Zotero for pro-active
archiving and temporal references
https://www.youtube.com/v/ZYmi_Ydr65M%26vq
Publication - OJS
Publication - OJS
Publication - OJS
Publication - OJS
Publication - HiberActive Service Demonstrator
Martin Klein et al. (2014) HiberActive: Pro-Active Archiving of web references from scholarly
articles
Open Repositories 2014 http://www.slideshare.net/martinklein0815/hiberactive
Reference Resources Robustly
When referencing resources include:
 Original URI – Allows the user to revisit the URI as it
is at the time of reading, if the URI is still operational
 Snapshot URI – Allows the user to visit the snapshot,
if one was created, and if the web archive in which it
was created is still operational
 Date/Time – with the original URI allow the user to
visit any snapshot created around the Date/Time in
any web archive around the world (using Memento
infrastructure)
(2015) Robust Links - Motivation
http://robustlinks.mementoweb.org/about/
Reference Resources Actionably
When referencing resources, use Link Decorations to convey
Original URI, Snapshot URI, Date/Time
<a href=“http://www.stanford.edu”
data-originalurl=“http://archive.is/FAy6o”
data-versiondate=“2014-08-15” >
<a href=“http://www.stanford.edu”
data-versiondate=“2014-08-15” >
Herbert Van de Sompel et al. (2015) Robust Links - Link Decorations
http://robustlinks.mementoweb.org/spec/
<a href=“http://archive.is/FAy6o”
data-versionurl=“http://www.stanford.edu”
data-versiondate=“2014-08-15” >
Robust Links Using Link Decorations, JavaScript,
Memento API
Demo - http://robustlinks.mementoweb.org/demo/uri_references_js.html
robustlinks.js - https://github.com/mementoweb/robustlinks
Activate Robust Links
There are no Link Decorations, currently. But there is an
article publication date:
 Express the article publication date in an actionable
manner (‘datePublished’ or ‘dateModified’
Schema.org properties) in HTML pages that contain
URI references
 Tailor robustlinks.js to exclude links to articles
 Inject robustlinks.js in HTML pages that contain URI
references
Users Follow Robust Links into Web
Archives
The combination of the referenced URI and the article
publication date:
 Leads users to a snapshot in a web archive, created
as close as possible to the article publication date
 Addresses link rot
 Addresses content drift
Create Archive Copies
When ingesting new content into the platform:
 Parse for URI references
 Create snapshots in web archives of select URIs
 For these URIs, use Link Decorations in HTML to
convey:
• original URI
• snapshot URI
• snapshot Date/Time
Users Follow Robust Links into Web
Archives
The Link Decorations:
 Lead users to the created snapshot, if the web
archive is operational
 Lead users to a snapshot in any web archive, created
as close as possible to the snapshot Date/Time
 Addresses link rot
 Addresses content drift
Prototypes of pro-active approaches to support the
archiving of web references for scholarly
communications
Richard Wincewicz1, Peter Burnhill1
& Herbert Van de Sompel2
1EDINA, University of Edinburgh, 2Los Alamos National Laboratory
http://hiberlink.org #hiberlink

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Signposting Overview
Signposting OverviewSignposting Overview
Signposting Overview
 
Achieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed CollectionsAchieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed Collections
 
Linked Data: turning the web into a context graph
Linked Data: turning the web into a context graphLinked Data: turning the web into a context graph
Linked Data: turning the web into a context graph
 
Metadata / Linked Data
Metadata / Linked DataMetadata / Linked Data
Metadata / Linked Data
 
Impact of URI Canonicalization on Memento Count
Impact of URI Canonicalization on Memento Count Impact of URI Canonicalization on Memento Count
Impact of URI Canonicalization on Memento Count
 
Paul Evan Peters Lecture
Paul Evan Peters LecturePaul Evan Peters Lecture
Paul Evan Peters Lecture
 
BIBFRAME as a Library Linked Data Standard
BIBFRAME as a Library Linked Data StandardBIBFRAME as a Library Linked Data Standard
BIBFRAME as a Library Linked Data Standard
 
Linked Data Patterns
Linked Data PatternsLinked Data Patterns
Linked Data Patterns
 
Web Integrated Data
Web Integrated DataWeb Integrated Data
Web Integrated Data
 
Dataincubator
DataincubatorDataincubator
Dataincubator
 
More than just access: scholarship is in need of infrastructure reform
More than just access: scholarship is in need of infrastructure reformMore than just access: scholarship is in need of infrastructure reform
More than just access: scholarship is in need of infrastructure reform
 
Centre for Social Informatics - January 2016
Centre for Social Informatics - January 2016Centre for Social Informatics - January 2016
Centre for Social Informatics - January 2016
 
Welcome to Consuming Linked Data tutorial WWW2010
Welcome to Consuming Linked Data tutorial WWW2010Welcome to Consuming Linked Data tutorial WWW2010
Welcome to Consuming Linked Data tutorial WWW2010
 
Web Archiving Activities of ODU’s Web Science and Digital Library Research G...
Web Archiving Activities of ODU’s Web Science and Digital Library Research G...Web Archiving Activities of ODU’s Web Science and Digital Library Research G...
Web Archiving Activities of ODU’s Web Science and Digital Library Research G...
 
From Bioinformatics Scientist to Entrepreneur
From Bioinformatics Scientist to EntrepreneurFrom Bioinformatics Scientist to Entrepreneur
From Bioinformatics Scientist to Entrepreneur
 
Very Gentle Linked Data Workshop
Very Gentle Linked Data WorkshopVery Gentle Linked Data Workshop
Very Gentle Linked Data Workshop
 
Creating Topical Collections: Web Archives vs. Live Web
Creating Topical Collections:Web Archives vs. Live WebCreating Topical Collections:Web Archives vs. Live Web
Creating Topical Collections: Web Archives vs. Live Web
 
Visualizing Open Access - Open Repositories 2015
Visualizing Open Access - Open Repositories 2015Visualizing Open Access - Open Repositories 2015
Visualizing Open Access - Open Repositories 2015
 
Start Or Home Pages
Start Or Home PagesStart Or Home Pages
Start Or Home Pages
 
Tools for Data Manipulation - UKAD Open Refine Workshop
Tools for Data Manipulation - UKAD Open Refine WorkshopTools for Data Manipulation - UKAD Open Refine Workshop
Tools for Data Manipulation - UKAD Open Refine Workshop
 

Destaque

Semantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataSemantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic Data
Matthew Rowe
 
The costs for going gold in the netherlands
The costs for going gold in the netherlandsThe costs for going gold in the netherlands
The costs for going gold in the netherlands
Wouter Gerritsma
 

Destaque (20)

Metis
MetisMetis
Metis
 
Metis
MetisMetis
Metis
 
Preserving the Integrity of the Scholarly Record
Preserving the Integrity of the Scholarly RecordPreserving the Integrity of the Scholarly Record
Preserving the Integrity of the Scholarly Record
 
Actions to Ensure the Integrity and Continuity of the Scholarly Record
Actions to Ensure the Integrity and Continuity of the Scholarly Record Actions to Ensure the Integrity and Continuity of the Scholarly Record
Actions to Ensure the Integrity and Continuity of the Scholarly Record
 
Access to Digital Back Copy
Access to Digital Back CopyAccess to Digital Back Copy
Access to Digital Back Copy
 
Reminiscing about interoperability
Reminiscing about interoperabilityReminiscing about interoperability
Reminiscing about interoperability
 
Reference Rot and E-Theses: Threat and Remedy
Reference Rot and E-Theses: Threat and RemedyReference Rot and E-Theses: Threat and Remedy
Reference Rot and E-Theses: Threat and Remedy
 
ResourceSync Quick Overview
ResourceSync Quick OverviewResourceSync Quick Overview
ResourceSync Quick Overview
 
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous MappingPersistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
 
Hiberactive: Pro-Active Archiving of Web References from Scholarly Articles
Hiberactive: Pro-Active Archiving of  Web References from Scholarly Articles Hiberactive: Pro-Active Archiving of  Web References from Scholarly Articles
Hiberactive: Pro-Active Archiving of Web References from Scholarly Articles
 
Semantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataSemantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic Data
 
Creating Pockets of Persistence
Creating Pockets of PersistenceCreating Pockets of Persistence
Creating Pockets of Persistence
 
Pimp your content with structured data
Pimp your content with structured dataPimp your content with structured data
Pimp your content with structured data
 
Reference Rot and Linked Data: Threat and Remedy
Reference Rot and Linked Data: Threat and RemedyReference Rot and Linked Data: Threat and Remedy
Reference Rot and Linked Data: Threat and Remedy
 
Tales from the Keepers Registry: Dr Who and the Scholarly Record
Tales from the Keepers Registry: Dr Who and the Scholarly RecordTales from the Keepers Registry: Dr Who and the Scholarly Record
Tales from the Keepers Registry: Dr Who and the Scholarly Record
 
A Semantic Data Model for Web Applications
A Semantic Data Model for Web ApplicationsA Semantic Data Model for Web Applications
A Semantic Data Model for Web Applications
 
The costs for going gold in the netherlands
The costs for going gold in the netherlandsThe costs for going gold in the netherlands
The costs for going gold in the netherlands
 
Reference Rot: Threat and Remedy
Reference Rot: Threat and RemedyReference Rot: Threat and Remedy
Reference Rot: Threat and Remedy
 
MESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataMESUR: Making sense and use of usage data
MESUR: Making sense and use of usage data
 
The aDORe Federation Architecture
The aDORe Federation ArchitectureThe aDORe Federation Architecture
The aDORe Federation Architecture
 

Semelhante a Prototypes of pro-active approaches to support the archiving of web references for scholarly communications

TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22
jodischneider
 
Web of Data Usage Mining
Web of Data Usage MiningWeb of Data Usage Mining
Web of Data Usage Mining
Markus Luczak-Rösch
 
Of Cataloging & Context
Of Cataloging & ContextOf Cataloging & Context
Of Cataloging & Context
charper
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and Techniques
Bernhard Haslhofer
 
Making the Black Hole Gray: Web Archiving Art Resources at New York Art Resou...
Making the Black Hole Gray: Web Archiving Art Resources at New York Art Resou...Making the Black Hole Gray: Web Archiving Art Resources at New York Art Resou...
Making the Black Hole Gray: Web Archiving Art Resources at New York Art Resou...
The Frick Collection
 

Semelhante a Prototypes of pro-active approaches to support the archiving of web references for scholarly communications (20)

HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyHIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
 
Ensuring the Integrity (& Continuity) of Our Record of Scholarship
Ensuring the Integrity (& Continuity) of Our Record of ScholarshipEnsuring the Integrity (& Continuity) of Our Record of Scholarship
Ensuring the Integrity (& Continuity) of Our Record of Scholarship
 
Web Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web contentWeb Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web content
 
Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?
 
TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22
 
IIIF for CNI Spring 2014 Membership Meeting
IIIF for CNI Spring 2014 Membership MeetingIIIF for CNI Spring 2014 Membership Meeting
IIIF for CNI Spring 2014 Membership Meeting
 
Locah Project Show and Tell
Locah Project Show and TellLocah Project Show and Tell
Locah Project Show and Tell
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1
 
Web of Data Usage Mining
Web of Data Usage MiningWeb of Data Usage Mining
Web of Data Usage Mining
 
Of Cataloging & Context
Of Cataloging & ContextOf Cataloging & Context
Of Cataloging & Context
 
Web archiving challenges and opportunities
Web archiving challenges and opportunitiesWeb archiving challenges and opportunities
Web archiving challenges and opportunities
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and Techniques
 
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked DataDo the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
 
Linked Open Data for Cultural Heritage
Linked Open Data for Cultural HeritageLinked Open Data for Cultural Heritage
Linked Open Data for Cultural Heritage
 
Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale
 
The methods and practices of Linked Open Data
The methods and practices of Linked Open DataThe methods and practices of Linked Open Data
The methods and practices of Linked Open Data
 
"In the Early Days of a Better Nation": Enhancing the power of metadata today...
"In the Early Days of a Better Nation": Enhancing the power of metadata today..."In the Early Days of a Better Nation": Enhancing the power of metadata today...
"In the Early Days of a Better Nation": Enhancing the power of metadata today...
 
Making the Black Hole Gray: Web Archiving Art Resources at New York Art Resou...
Making the Black Hole Gray: Web Archiving Art Resources at New York Art Resou...Making the Black Hole Gray: Web Archiving Art Resources at New York Art Resou...
Making the Black Hole Gray: Web Archiving Art Resources at New York Art Resou...
 

Mais de EDINA, University of Edinburgh

Mais de EDINA, University of Edinburgh (20)

The Making of the English Landscape:
The Making of the English Landscape: The Making of the English Landscape:
The Making of the English Landscape:
 
Spatial Data, Spatial Humanities
Spatial Data, Spatial HumanitiesSpatial Data, Spatial Humanities
Spatial Data, Spatial Humanities
 
Land Cover Map 2015
Land Cover Map 2015Land Cover Map 2015
Land Cover Map 2015
 
We have the technology... We have the data... What next?
We have the technology... We have the data... What next?We have the technology... We have the data... What next?
We have the technology... We have the data... What next?
 
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
 
GeoForum EDINA report 2017
GeoForum EDINA report 2017GeoForum EDINA report 2017
GeoForum EDINA report 2017
 
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
 
Moray housemarch2017
Moray housemarch2017Moray housemarch2017
Moray housemarch2017
 
Uniof stirlingmarch2017secondary
Uniof stirlingmarch2017secondaryUniof stirlingmarch2017secondary
Uniof stirlingmarch2017secondary
 
Uniof glasgow jan2017_secondary
Uniof glasgow jan2017_secondaryUniof glasgow jan2017_secondary
Uniof glasgow jan2017_secondary
 
Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...
 
Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...
 
Enhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEnhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola Osborne
 
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneSocial Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
 
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneBest Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
 
SCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison serviceSCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison service
 
Big data in Digimap
Big data in DigimapBig data in Digimap
Big data in Digimap
 
Introduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesIntroduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data services
 
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
 
Digimap Update - Geoforum 2016 - Guy McGarva
Digimap Update - Geoforum 2016 - Guy McGarvaDigimap Update - Geoforum 2016 - Guy McGarva
Digimap Update - Geoforum 2016 - Guy McGarva
 

Último

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Último (20)

2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 

Prototypes of pro-active approaches to support the archiving of web references for scholarly communications

  • 1. Prototypes of pro-active approaches to support the archiving of web references for scholarly communications Richard Wincewicz1, Peter Burnhill1 & Herbert Van de Sompel2 1EDINA, University of Edinburgh, 2Los Alamos National Laboratory
  • 2. The Project Team 2013 – 2015, funded by the Andrew W. Mellon Foundation • Los Alamos National Laboratory: Research Library: Herbert Van de Sompel Harihar Shankar, [Martin Klein, Rob Sanderson] • University of Edinburgh: Language Technology Group: Claire Grover, Beatrice Alex, Colin Matheson, Richard Tobin, [Ke “Adam” Zhou] EDINA * : Peter Burnhill, Muriel Mewissen (Project Manager), Tim Stickland, Richard Wincewicz, [Neil Mayo] Centre for Service Delivery & Digital Expertise
  • 5. Reference Rot Links to Web at Large resources are subject to Reference Rot. This is a combination of two factors: • Link Rot: Link stops working • e.g. HTTP 404 “Not Found” • Content Drift: Linked content changes over time • Possibly to the extent that it is no longer representative of the content that was initially referenced
  • 7. Articles that Link to Articles & to Web At Large Resources (PMC) Martin Klein et al. (2014) Scholarly context not found http://dx.doi.org/10.1371/journal.pone.0115253
  • 8. Articles that Link to Articles & to Web At Large Resources (Elsevier) Martin Klein et al. (2014) Scholarly context not found http://dx.doi.org/10.1371/journal.pone.0115253
  • 9. Articles with URI References (PMC) Articles 479,194 with URI references 399,005 with URI references to articles 240,857 with URI references to Web at Large 156,160 Martin Klein et al. (2014) Scholarly context not found http://dx.doi.org/10.1371/journal.pone.0115253
  • 10. Link Rot (PMC) Martin Klein et al. (2014) Scholarly context not found http://dx.doi.org/10.1371/journal.pone.0115253
  • 11. Link Rot (Elsevier) Martin Klein et al. (2014) Scholarly context not found http://dx.doi.org/10.1371/journal.pone.0115253
  • 12. Links from arXiv, Elsevier, PMC to TLD Targets Martin Klein et al. (2014) Scholarly context not found. In: PLOS ONE http://dx.doi.org/10.1371/journal.pone.0115253
  • 13. Grey is Link Rot – Referenced Content Not Accessible Martin Klein et al. (2014) Scholarly context not found. In: PLOS ONE http://dx.doi.org/10.1371/journal.pone.0115253
  • 14. Grey is Not Archived - Referenced Content Lost Martin Klein et al. (2014) Scholarly context not found. In: PLOS ONE http://dx.doi.org/10.1371/journal.pone.0115253
  • 15. Content Drift – http://dl00.org 2000 2004 2005 2008 (a) Dynamic content values on webpage change over time (b) Static content but very different (often unrelated) web pages
  • 17. Create Snapshots of Referenced Resources Various web archives support on-demand creation of snapshots of URIs (manual, API):  archive.today  Internet Archive  perma.cc  webcitation.org When creating snapshots, maintain:  Original URI  Snapshot URI  Date/Time of snapshot
  • 18. Create Snapshots of Referenced Resources Snapshots can be created at various stages. The closer to the moment of referencing, the better the image captured. Stage Actor Snapshot Quality Preparation Author/reference tool best Submission /Issue Editor/manuscript system good Publication Aggregator/ publisher platform ok Post-publication Librarian/IR, journal archive better than nothing
  • 19. Authoring - Zotero Plugin Demonstrator Richard Wincewicz (2014) Prototype Hiberlink plugin for Zotero for pro-active archiving and temporal references https://www.youtube.com/v/ZYmi_Ydr65M%26vq
  • 24. Publication - HiberActive Service Demonstrator Martin Klein et al. (2014) HiberActive: Pro-Active Archiving of web references from scholarly articles Open Repositories 2014 http://www.slideshare.net/martinklein0815/hiberactive
  • 25. Reference Resources Robustly When referencing resources include:  Original URI – Allows the user to revisit the URI as it is at the time of reading, if the URI is still operational  Snapshot URI – Allows the user to visit the snapshot, if one was created, and if the web archive in which it was created is still operational  Date/Time – with the original URI allow the user to visit any snapshot created around the Date/Time in any web archive around the world (using Memento infrastructure) (2015) Robust Links - Motivation http://robustlinks.mementoweb.org/about/
  • 26. Reference Resources Actionably When referencing resources, use Link Decorations to convey Original URI, Snapshot URI, Date/Time <a href=“http://www.stanford.edu” data-originalurl=“http://archive.is/FAy6o” data-versiondate=“2014-08-15” > <a href=“http://www.stanford.edu” data-versiondate=“2014-08-15” > Herbert Van de Sompel et al. (2015) Robust Links - Link Decorations http://robustlinks.mementoweb.org/spec/ <a href=“http://archive.is/FAy6o” data-versionurl=“http://www.stanford.edu” data-versiondate=“2014-08-15” >
  • 27. Robust Links Using Link Decorations, JavaScript, Memento API Demo - http://robustlinks.mementoweb.org/demo/uri_references_js.html robustlinks.js - https://github.com/mementoweb/robustlinks
  • 28. Activate Robust Links There are no Link Decorations, currently. But there is an article publication date:  Express the article publication date in an actionable manner (‘datePublished’ or ‘dateModified’ Schema.org properties) in HTML pages that contain URI references  Tailor robustlinks.js to exclude links to articles  Inject robustlinks.js in HTML pages that contain URI references
  • 29. Users Follow Robust Links into Web Archives The combination of the referenced URI and the article publication date:  Leads users to a snapshot in a web archive, created as close as possible to the article publication date  Addresses link rot  Addresses content drift
  • 30. Create Archive Copies When ingesting new content into the platform:  Parse for URI references  Create snapshots in web archives of select URIs  For these URIs, use Link Decorations in HTML to convey: • original URI • snapshot URI • snapshot Date/Time
  • 31. Users Follow Robust Links into Web Archives The Link Decorations:  Lead users to the created snapshot, if the web archive is operational  Lead users to a snapshot in any web archive, created as close as possible to the snapshot Date/Time  Addresses link rot  Addresses content drift
  • 32. Prototypes of pro-active approaches to support the archiving of web references for scholarly communications Richard Wincewicz1, Peter Burnhill1 & Herbert Van de Sompel2 1EDINA, University of Edinburgh, 2Los Alamos National Laboratory http://hiberlink.org #hiberlink