Discovering Scholarly Orphans Using ORCID
@mart1nkle1n, @hvdsomp
JCDL 2017, 06/22/2017, Toronto, CA
Discovering Scholarly Orphans
Using ORCID
Martin Klein
@mart1nkle1n
http://orcid.org/0000-0003-0130-2097
Herbert Van de Sompel
@hvdsomp
http://orcid.org/0000-0002-0715-6126
Research Library
Los Alamos National Laboratory
Novel Archival Paradigm
• Current paradigm:
  • Owner of scholarly record submits finalized and atomic record to custodian, who takes care of long-term preservation
  • E.g., publisher uploads journals to Portico, author uploads paper into institutional repository
• Fails, even for traditional journal articles
  • Significant number of journal articles do not make it into archives
  • IRs are under-utilized
• Does not account for web-based scholarship, living things with versions, web resources related to paper
→ Argument for a novel paradigm to capture web-based scholarly resources
Capture Flow
Capture Flow
Algorithmic Discovery of Web Identities
James Powell et al. (2014) EgoSystem: Where are our alumni?
In: code4lib http://journal.code4lib.org/articles/9519
Capture Flow
Discovery of Web Identities via a Registry: ORCID
Ian Milligan
http://orcid.org/0000-0002-1470-7723
Mark Matienzo
http://orcid.org/0000-0003-3270-1306
Mark Matienzo’s ORCID
• Web Identities: 3 (homepage, ScopusID, ResearcherID)
http://orcid.org/0000-0003-3270-1306
Mark Matienzo’s Home Page
• URI to GitHub repository, Twitter
• Could be included in ORCID profile
http://matienzo.org/
Ian Milligan’s ORCID
• Web Identities: 0
http://orcid.org/0000-0002-1470-7723
Discovery of Web Identities via a Registry: ORCID
• Evaluation of ORCID for automatic discovery of Web Identities
  • How well does ORCID represent the global community of active researchers?
    • Adoption rate
    • Subject coverage
    • Geo-location coverage
  • How well does ORCID score when it comes to listing Web Identities?
Discovery of Web Identities via a Registry: ORCID
ORCID data
ORCID - Adoption Rate
• Extract from ORCID records
  • First name
  • Last name
  • Affiliations
  • Works (publications, datasets, etc.)
  • Web identities
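The extraction step above can be sketched as a per-record completeness tally. This is a minimal illustration, not the authors' pipeline: the flat record layout and the field names (`first_name`, `last_name`, `affiliations`, `works`, `web_identities`) are assumptions made for this example and do not mirror the actual ORCID data schema. The sample records are made up.

```python
from collections import Counter

# Hypothetical field names; the real ORCID data is nested differently.
FIELDS = ("first_name", "last_name", "affiliations", "works", "web_identities")


def summarize(record):
    """Report which of the extracted fields are populated in one record."""
    return {field: bool(record.get(field)) for field in FIELDS}


def tally(records):
    """Count how many records have each field populated."""
    counts = Counter()
    for record in records:
        counts["total"] += 1
        for field, present in summarize(record).items():
            if present:
                counts[field] += 1
    return counts


# Made-up sample records for illustration only.
sample = [
    {
        "first_name": "A",
        "last_name": "Example",
        "affiliations": ["Example University"],
        "works": [{"type": "journal-article"}],
        "web_identities": ["https://example.org/~a"],
    },
    {"first_name": "B", "last_name": "", "affiliations": [], "works": [], "web_identities": []},
]

counts = tally(sample)
for name in ("total",) + FIELDS:
    print(name, counts[name])
```

Running such a tally once per yearly data snapshot would give one count per field and year.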
ORCID - Adoption Rate
[Figure: ORCID record counts for 2013 to 2016, on an axis from 0 to 2,500,000. Series: ORCIDs total, ORCIDs with given names, ORCIDs with first names, ORCIDs with works, ORCIDs with affiliations, ORCIDs with web identities]
ORCID - Subject Coverage
• Extract DOIs from works
• Match DOIs against Crossref’s Metadata API
• Obtain subject terms
• Match against descriptive terms from “Classification of Instructional Programs” (CIP) published by the Institute of Education Sciences
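A rough sketch of the first and last steps above, under stated assumptions. The `identifiers` key, the DOI regular expression, and the substring-based matching rule are all choices made for this example; the slides do not say how the matching was done. The Crossref lookup itself is left out, so subject terms are assumed to be already fetched. The DOI and the term lists are made up.

```python
import re

# Loose DOI pattern (an assumption; real-world DOIs can be messier).
DOI_PATTERN = re.compile(r"10\.\d{4,9}/\S+")


def extract_dois(works):
    """Collect unique, lowercased DOIs from the identifier strings of works."""
    dois = set()
    for work in works:
        for value in work.get("identifiers", []):
            match = DOI_PATTERN.search(value)
            if match:
                dois.add(match.group(0).lower().rstrip(".,;"))
    return sorted(dois)


def match_cip(subject_terms, cip_terms):
    """Map each subject term to the CIP terms it contains or is contained in."""
    matches = {}
    for subject in subject_terms:
        needle = subject.lower()
        matches[subject] = [
            cip for cip in cip_terms
            if cip.lower() in needle or needle in cip.lower()
        ]
    return matches


# Made-up inputs for illustration only.
works = [
    {"identifiers": ["https://doi.org/10.1000/ABC123", "doi:10.1000/abc123"]},
    {"identifiers": ["not a doi"]},
]
dois = extract_dois(works)
matched = match_cip(
    ["Computer Science Applications", "History"],
    ["Computer Science", "History", "Education"],
)
print(dois)
print(matched)
```

Substring matching is deliberately crude; it misses synonyms and would need a curated mapping for real use.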
ORCID - Subject Coverage
2013
ORCID - Subject Coverage
Changes from 2013 to 2014
Ranks gained:
• Social Science
• Education
• History
Ranks lost:
• Computer Science
• Legal professions
• Journalism
ORCID - Subject Coverage
Changes from 2014 to 2015
Ranks gained:
• Social Science
• Education
Ranks lost:
• Natural Resources and Conservation
ORCID - Subject Coverage
Changes from 2015 to 2016
Ranks gained:
• Natural Resources and Conservation
Ranks lost:
• Multi/Interdisciplinary Studies
ORCID - Subject Coverage
Comparison of ORCID subjects with:
1. Distribution of researchers’ disciplines
  • Proxy: Ph.D. recipients from U.S. universities
  • Obtained from NSF, 2015 data
2. Distribution of publications’ disciplines
  • Obtained from UNESCO Science Report
  • U.S. data from 2014
Both report disciplines aligned with CIP terms, hence they are easily comparable.
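One simple way to make such a comparison concrete is to turn each set of counts into percentage shares and take per-discipline differences. This is an illustrative sketch only; the slides do not state which measure was used, and the counts below are made up.

```python
def to_shares(counts):
    """Convert raw counts per discipline into percentage shares."""
    total = sum(counts.values())
    if total == 0:
        return {name: 0.0 for name in counts}
    return {name: 100.0 * value / total for name, value in counts.items()}


def share_gaps(observed, reference):
    """Percentage-point gap (observed minus reference) for every discipline."""
    obs, ref = to_shares(observed), to_shares(reference)
    return {
        name: round(obs.get(name, 0.0) - ref.get(name, 0.0), 1)
        for name in sorted(set(obs) | set(ref))
    }


# Made-up counts for illustration only.
orcid_counts = {"Life Sciences": 60, "Engineering": 40}
reference_counts = {"Life Sciences": 25, "Engineering": 25, "Education": 50}

gaps = share_gaps(orcid_counts, reference_counts)
for discipline, gap in gaps.items():
    print(f"{discipline}: {gap:+.1f} percentage points")
```

A positive gap means the discipline is over-represented relative to the reference distribution, a negative gap that it is under-represented.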
ORCID - Subject Coverage
[Figure: comparison across disciplines on a scale from 0 to 60. Disciplines: Other, Life Sciences, Physical Sciences, Mathematics and Computer Sciences, Education, Psychology and Social Sciences, Engineering, Humanities and Arts. Series: ORCID Subjects, Ph.D. Researchers, Publications]
ORCID - Geo-Location Coverage
• Extract affiliations from ORCID records
• Aggregate country code for associated locations
  • Only available in ORCID data since 2015
• Compare against UNESCO data of researcher distribution
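The aggregation step above might look like the following sketch. The record layout (`affiliations` as a list of dicts with a `country` key) is an assumption for this example, as is the choice to count each country at most once per researcher; the slides do not specify either. The sample records are made up.

```python
from collections import Counter


def country_counts(records):
    """Count researchers per country code, once per researcher and country."""
    counts = Counter()
    for record in records:
        countries = {
            affiliation["country"].upper()
            for affiliation in record.get("affiliations", [])
            if affiliation.get("country")
        }
        counts.update(countries)
    return counts


# Made-up sample records for illustration only.
records = [
    {"affiliations": [{"country": "us"}, {"country": "US"}]},
    {"affiliations": [{"country": "CA"}, {"country": "US"}]},
    {"affiliations": []},
]

by_country = country_counts(records)
print(by_country.most_common())
```

The resulting counts can then be set against a reference distribution of researchers per country.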
ORCID - Geo-Location Coverage
ORCID - Geo-Location Coverage
ORCID - Geo-Location Coverage
ORCID - Web Identities
• Analyze distribution of link “Labels”
• Field lacks controlled vocabulary
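Because the label field is free text, a label distribution needs some normalization before counting. The sketch below lowercases labels and collapses punctuation and whitespace; this normalization rule is an assumption for illustration, not the method used in the talk, and the sample labels are made up.

```python
import re
from collections import Counter


def normalize_label(label):
    """Lowercase a label and collapse anything non-alphanumeric to one space."""
    return re.sub(r"[^a-z0-9]+", " ", label.lower()).strip()


def top_labels(labels, n=20):
    """Return the n most frequent normalized labels with their counts."""
    counts = Counter(normalize_label(label) for label in labels if label)
    counts.pop("", None)  # drop labels that were only punctuation or spaces
    return counts.most_common(n)


# Made-up sample labels for illustration only.
labels = [
    "LinkedIn", "linkedin ", "LINKEDIN", "Linked-In",
    "Personal website", "personal  website", "",
]

ranking = top_labels(labels, n=2)
print(ranking)
```

Note that "Linked-In" still ends up as a separate label ("linked in"), which illustrates why a free-text field is hard to analyze without a controlled vocabulary.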
ORCID - Web Identities
Top 20 labels 2016
ORCID - Web Identities
Top 20 labels 2016
Capture Flow
Ian Milligan’s ORCID
• Artifacts?
http://orcid.org/0000-0002-1470-7723
ORCID - Scholarly Orphans
• Analyze distribution of types of “Work”, e.g.,
  • “journal article” – likely not an orphan
  • “data-set” – potential orphan
https://members.orcid.org/api/resources/work-types
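The work-type analysis can be sketched as a count per type plus a filter for types that are not expected to be archived already. Only "journal-article" is treated as a likely non-orphan here, following the example above; which other types belong in that set is left to the caller and is an assumption of this sketch. The sample list is made up.

```python
from collections import Counter

# Types assumed to be covered by existing archives; extend as needed.
LIKELY_ARCHIVED = frozenset({"journal-article"})


def orphan_candidates(work_types, likely_archived=LIKELY_ARCHIVED):
    """Return overall type counts and the subset that may point to orphans."""
    counts = Counter(work_types)
    candidates = {
        work_type: n
        for work_type, n in counts.items()
        if work_type not in likely_archived
    }
    return counts, candidates


# Made-up sample of work types for illustration only.
types = ["journal-article", "journal-article", "data-set", "journal-article"]

type_counts, candidates = orphan_candidates(types)
print(type_counts.most_common())
print(candidates)
```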
ORCID - Work Types
Dominated by types expected not to be orphans!
Take-Aways
• ORCID Adoption rate is increasing
• Subject coverage is focused, does not cover all disciplines equally
• Geo-Location coverage is good but not quite representative
• Web Identity coverage is poor; not usable for our purpose in its current state
• Very few scholarly orphans directly referenced
Mais conteúdo relacionado

Mais procurados

Linked data radical change
Linked data   radical changeLinked data   radical change
Linked data radical change
Richard Wallis
 

Mais procurados (20)

Paul Evan Peters Lecture
Paul Evan Peters LecturePaul Evan Peters Lecture
Paul Evan Peters Lecture
 
Impact of URI Canonicalization on Memento Count
Impact of URI Canonicalization on Memento Count Impact of URI Canonicalization on Memento Count
Impact of URI Canonicalization on Memento Count
 
Quantifying Orphaned Annotations in Hypothes.is
Quantifying Orphaned Annotations in Hypothes.isQuantifying Orphaned Annotations in Hypothes.is
Quantifying Orphaned Annotations in Hypothes.is
 
Hiberlink: Investigating Reference Rot, December 2013
Hiberlink: Investigating Reference Rot, December 2013Hiberlink: Investigating Reference Rot, December 2013
Hiberlink: Investigating Reference Rot, December 2013
 
How much does $1.7 billion buy?
How much does $1.7 billion buy?How much does $1.7 billion buy?
How much does $1.7 billion buy?
 
The web is rotting and what to do about it
The web is rotting and what to do about itThe web is rotting and what to do about it
The web is rotting and what to do about it
 
Persistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DonePersistent Identification: Easier Said than Done
Persistent Identification: Easier Said than Done
 
Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)
 
Reminiscing about interoperability
Reminiscing about interoperabilityReminiscing about interoperability
Reminiscing about interoperability
 
Linked data - A radical change?
Linked data - A radical change?Linked data - A radical change?
Linked data - A radical change?
 
Creating Topical Collections: Web Archives vs. Live Web
Creating Topical Collections:Web Archives vs. Live WebCreating Topical Collections:Web Archives vs. Live Web
Creating Topical Collections: Web Archives vs. Live Web
 
Linked Data Snowball, or Why We Need Reconciliation
Linked Data Snowball, or Why We Need ReconciliationLinked Data Snowball, or Why We Need Reconciliation
Linked Data Snowball, or Why We Need Reconciliation
 
The Web of Data is Our Oyster
The Web of Data is Our OysterThe Web of Data is Our Oyster
The Web of Data is Our Oyster
 
Linked Data - Radical Change?
Linked Data -  Radical Change?Linked Data -  Radical Change?
Linked Data - Radical Change?
 
Linked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerLinked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve Meyer
 
FAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueFAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning Issue
 
Linked data radical change
Linked data   radical changeLinked data   radical change
Linked data radical change
 
Metadata - Linked Data
Metadata - Linked DataMetadata - Linked Data
Metadata - Linked Data
 
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous MappingPersistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
 
Linked Data - Exposing what we have
Linked Data - Exposing what we haveLinked Data - Exposing what we have
Linked Data - Exposing what we have
 

Destaque

MS Thesis Defense, Aug 2012 - Visualizing Digital Collections at Archive-It
MS Thesis Defense, Aug 2012 - Visualizing Digital Collections at Archive-ItMS Thesis Defense, Aug 2012 - Visualizing Digital Collections at Archive-It
MS Thesis Defense, Aug 2012 - Visualizing Digital Collections at Archive-It
Kalpesh Padia
 
Visualizing Digital Collections at Archive-It - Jcdl 2012
Visualizing Digital Collections at Archive-It - Jcdl 2012Visualizing Digital Collections at Archive-It - Jcdl 2012
Visualizing Digital Collections at Archive-It - Jcdl 2012
Kalpesh Padia
 

Destaque (7)

Archive What I See Now: Personal Web Archiving with WARCs
Archive What I See Now: Personal Web Archiving with WARCsArchive What I See Now: Personal Web Archiving with WARCs
Archive What I See Now: Personal Web Archiving with WARCs
 
Local Memory Project
Local Memory ProjectLocal Memory Project
Local Memory Project
 
Introducing Web Archiving and WSDL Research Group
Introducing Web Archiving and WSDL Research GroupIntroducing Web Archiving and WSDL Research Group
Introducing Web Archiving and WSDL Research Group
 
A Collaborative, Secure, and Private InterPlanetary Wayback Web Archiving Sys...
A Collaborative, Secure, and Private InterPlanetary Wayback Web Archiving Sys...A Collaborative, Secure, and Private InterPlanetary Wayback Web Archiving Sys...
A Collaborative, Secure, and Private InterPlanetary Wayback Web Archiving Sys...
 
MS Thesis Defense, Aug 2012 - Visualizing Digital Collections at Archive-It
MS Thesis Defense, Aug 2012 - Visualizing Digital Collections at Archive-ItMS Thesis Defense, Aug 2012 - Visualizing Digital Collections at Archive-It
MS Thesis Defense, Aug 2012 - Visualizing Digital Collections at Archive-It
 
Visualizing Digital Collections at Archive-It - Jcdl 2012
Visualizing Digital Collections at Archive-It - Jcdl 2012Visualizing Digital Collections at Archive-It - Jcdl 2012
Visualizing Digital Collections at Archive-It - Jcdl 2012
 
Dockerize Your Projects - A Brief Introduction to Containerization
Dockerize Your Projects - A Brief Introduction to ContainerizationDockerize Your Projects - A Brief Introduction to Containerization
Dockerize Your Projects - A Brief Introduction to Containerization
 

Semelhante a Discovering Scholarly Orphans Using ORCID

TNC2012 Federated and scholarly identity - match made in heaven?
TNC2012 Federated and scholarly identity - match made in heaven?TNC2012 Federated and scholarly identity - match made in heaven?
TNC2012 Federated and scholarly identity - match made in heaven?
Gudmundur Thorisson
 

Semelhante a Discovering Scholarly Orphans Using ORCID (20)

NC Data4Good: Understanding Childhood Hunger in Our Communities
NC Data4Good: Understanding Childhood Hunger in Our CommunitiesNC Data4Good: Understanding Childhood Hunger in Our Communities
NC Data4Good: Understanding Childhood Hunger in Our Communities
 
Imperial College Science Communication talk
Imperial College Science Communication talkImperial College Science Communication talk
Imperial College Science Communication talk
 
7. ROR
7. ROR7. ROR
7. ROR
 
Enabling information interoperability with identifiers (L. Haak)
Enabling information interoperability with identifiers  (L. Haak)Enabling information interoperability with identifiers  (L. Haak)
Enabling information interoperability with identifiers (L. Haak)
 
Orcid data cite_20130917
Orcid data cite_20130917Orcid data cite_20130917
Orcid data cite_20130917
 
Meadows apr28-1
Meadows apr28-1Meadows apr28-1
Meadows apr28-1
 
ORCID: Connecting Research & Researchers
ORCID: Connecting Research & ResearchersORCID: Connecting Research & Researchers
ORCID: Connecting Research & Researchers
 
ICG-11 - genomic data projects around the world - nov 5 2016
ICG-11 - genomic data projects around the world - nov 5 2016ICG-11 - genomic data projects around the world - nov 5 2016
ICG-11 - genomic data projects around the world - nov 5 2016
 
Enabling information interoperability with identifiers (L. Haak)
 Enabling information interoperability with identifiers (L. Haak) Enabling information interoperability with identifiers (L. Haak)
Enabling information interoperability with identifiers (L. Haak)
 
Disseminating your research: Scientific profiles and tools
Disseminating your research: Scientific profiles and toolsDisseminating your research: Scientific profiles and tools
Disseminating your research: Scientific profiles and tools
 
NASA Johnson Space Center Data Science Day 2.0
NASA Johnson Space Center Data Science Day 2.0NASA Johnson Space Center Data Science Day 2.0
NASA Johnson Space Center Data Science Day 2.0
 
Taking Flight:
Taking Flight:Taking Flight:
Taking Flight:
 
Finding and accessing human genome data with Repositive
Finding and accessing human genome data with RepositiveFinding and accessing human genome data with Repositive
Finding and accessing human genome data with Repositive
 
BioVis Meetup @ IEEE VIS 2015
BioVis Meetup @ IEEE VIS 2015BioVis Meetup @ IEEE VIS 2015
BioVis Meetup @ IEEE VIS 2015
 
TNC2012 Federated and scholarly identity - match made in heaven?
TNC2012 Federated and scholarly identity - match made in heaven?TNC2012 Federated and scholarly identity - match made in heaven?
TNC2012 Federated and scholarly identity - match made in heaven?
 
Mejias "Making it work globally"
Mejias "Making it work globally"Mejias "Making it work globally"
Mejias "Making it work globally"
 
Christine borgman keynote
Christine borgman keynoteChristine borgman keynote
Christine borgman keynote
 
ORCID & other Person iDs
ORCID & other Person iDsORCID & other Person iDs
ORCID & other Person iDs
 
Context as a Motor for Discovery, STM Digital Publishing DCE 05,2017
Context as a Motor for Discovery, STM Digital Publishing DCE 05,2017Context as a Motor for Discovery, STM Digital Publishing DCE 05,2017
Context as a Motor for Discovery, STM Digital Publishing DCE 05,2017
 
Research data ecology
Research data ecologyResearch data ecology
Research data ecology
 

Mais de Martin Klein

Mais de Martin Klein (20)

On the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly WebOn the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly Web
 
On the Persistence of Persistent Identifiers of the Scholarly Web
 On the Persistence of Persistent Identifiers of the Scholarly Web On the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly Web
 
An Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly OrphansAn Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly Orphans
 
Who is Asking - Humans and Machines Experience a Different Scholarly Web
Who is Asking - Humans and Machines  Experience a Different Scholarly WebWho is Asking - Humans and Machines  Experience a Different Scholarly Web
Who is Asking - Humans and Machines Experience a Different Scholarly Web
 
The Memento Tracer Framework: Balancing Quality and Scalability for Web Arch...
The Memento Tracer Framework: Balancing Quality and Scalability  for Web Arch...The Memento Tracer Framework: Balancing Quality and Scalability  for Web Arch...
The Memento Tracer Framework: Balancing Quality and Scalability for Web Arch...
 
Memento Tracer An Innovative Approach Towards Balancing Scale and Fidelity f...
Memento Tracer An Innovative Approach Towards Balancing  Scale and Fidelity f...Memento Tracer An Innovative Approach Towards Balancing  Scale and Fidelity f...
Memento Tracer An Innovative Approach Towards Balancing Scale and Fidelity f...
 
Comparing the Performance of OAI-PMH with ResourceSync
Comparing the Performance of OAI-PMH with ResourceSyncComparing the Performance of OAI-PMH with ResourceSync
Comparing the Performance of OAI-PMH with ResourceSync
 
Evaluating Memento Service Optimizations
Evaluating Memento Service OptimizationsEvaluating Memento Service Optimizations
Evaluating Memento Service Optimizations
 
An Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly OrphansAn Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly Orphans
 
A Vision of the Library’s Role in Archiving Scholarly Artifacts
A Vision of the Library’s Role  in Archiving Scholarly ArtifactsA Vision of the Library’s Role  in Archiving Scholarly Artifacts
A Vision of the Library’s Role in Archiving Scholarly Artifacts
 
First Steps in Research Data Management Under Constraints of a National Secur...
First Steps in Research Data Management Under Constraints of a National Secur...First Steps in Research Data Management Under Constraints of a National Secur...
First Steps in Research Data Management Under Constraints of a National Secur...
 
Smart Routing of Memento Requests
Smart Routing of Memento RequestsSmart Routing of Memento Requests
Smart Routing of Memento Requests
 
Building Event Collections from Crawling Web Archives
Building Event Collections from Crawling Web ArchivesBuilding Event Collections from Crawling Web Archives
Building Event Collections from Crawling Web Archives
 
A Web-Centric Pipeline for Archiving Scholarly Artifacts
A Web-Centric Pipeline for Archiving Scholarly ArtifactsA Web-Centric Pipeline for Archiving Scholarly Artifacts
A Web-Centric Pipeline for Archiving Scholarly Artifacts
 
Focused Crawl of Web Archives to Build Event Collections
Focused Crawl of Web Archives to Build Event CollectionsFocused Crawl of Web Archives to Build Event Collections
Focused Crawl of Web Archives to Build Event Collections
 
Robust Linking to Web Resources
Robust Linking to Web ResourcesRobust Linking to Web Resources
Robust Linking to Web Resources
 
Using the Memento Framework to Assess Content Drift in Scholarly Communication
Using the Memento Framework to Assess Content Drift in Scholarly CommunicationUsing the Memento Framework to Assess Content Drift in Scholarly Communication
Using the Memento Framework to Assess Content Drift in Scholarly Communication
 
Uniform Access to Raw Mementos
Uniform Access to Raw MementosUniform Access to Raw Mementos
Uniform Access to Raw Mementos
 
Robust Links - a proposed solution to reference rot in scholarly communication
Robust Links - a proposed solution to reference rot in scholarly communicationRobust Links - a proposed solution to reference rot in scholarly communication
Robust Links - a proposed solution to reference rot in scholarly communication
 
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
 

Último

Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
 
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Sheetaleventcompany
 
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine ServiceHot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
sexy call girls service in goa
 
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
soniya singh
 

Último (20)

@9999965857 🫦 Sexy Desi Call Girls Laxmi Nagar 💓 High Profile Escorts Delhi 🫶
@9999965857 🫦 Sexy Desi Call Girls Laxmi Nagar 💓 High Profile Escorts Delhi 🫶@9999965857 🫦 Sexy Desi Call Girls Laxmi Nagar 💓 High Profile Escorts Delhi 🫶
@9999965857 🫦 Sexy Desi Call Girls Laxmi Nagar 💓 High Profile Escorts Delhi 🫶
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
 
✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663
✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663
✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)
 
INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.
INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.
INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024
 
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
 
Hot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night Stand
Hot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night StandHot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night Stand
Hot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night Stand
 
Networking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGNetworking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOG
 
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
 
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
 
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
 
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine ServiceHot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
 
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
 
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
 
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
 

Discovering Scholarly Orphans Using ORCID

  • 1. Discovering Scholarly Orphans Using ORCID @mart1nkle1n, @hvdsomp JCDL 2017, 06/22/2017, Toronto, CA Discovering Scholarly Orphans Using ORCID Martin Klein @mart1nkle1n http://orcid.org/0000-0003-0130-2097 Herbert Van de Sompel @hvdsomp http://orcid.org/0000-0002-0715-6126 Research Library Los Alamos National Laboratory
  • 2. Discovering Scholarly Orphans Using ORCID @mart1nkle1n, @hvdsomp JCDL 2017, 06/22/2017, Toronto, CA 2
  • 3. Discovering Scholarly Orphans Using ORCID @mart1nkle1n, @hvdsomp JCDL 2017, 06/22/2017, Toronto, CA 3
  • 4. Discovering Scholarly Orphans Using ORCID @mart1nkle1n, @hvdsomp JCDL 2017, 06/22/2017, Toronto, CA 4
  • 5. Discovering Scholarly Orphans Using ORCID @mart1nkle1n, @hvdsomp JCDL 2017, 06/22/2017, Toronto, CA 5
  • 6. Discovering Scholarly Orphans Using ORCID @mart1nkle1n, @hvdsomp JCDL 2017, 06/22/2017, Toronto, CA 6
  • 7. Discovering Scholarly Orphans Using ORCID @mart1nkle1n, @hvdsomp JCDL 2017, 06/22/2017, Toronto, CA 7
  • 8. Discovering Scholarly Orphans Using ORCID @mart1nkle1n, @hvdsomp JCDL 2017, 06/22/2017, Toronto, CA 8 Novel Archival Paradigm • Current paradigm: • Owner of scholarly record submits finalized and atomic record to custodian, takes care of long-term preservation • E.g., Publisher uploads journals to Portico, author uploads paper into institutional repository • Fails, even for traditional journal articles • Significant number of journal articles do not make it into archives • IRs are under-utilized • Does not account for web-based scholarship, living things with versions, web resources related to paper  Argument for a novel paradigm to capture web-based scholarly resources
  • 9. Capture Flow
  • 10. Capture Flow
  • 11. Algorithmic Discovery of Web Identities
      • James Powell et al. (2014). EgoSystem: Where are our alumni? In: code4lib. http://journal.code4lib.org/articles/9519
  • 12. Capture Flow
  • 13. Discovery of Web Identities via a Registry: ORCID
      • Ian Milligan: http://orcid.org/0000-0002-1470-7723
      • Mark Matienzo: http://orcid.org/0000-0003-3270-1306
  • 14. Mark Matienzo’s ORCID (http://orcid.org/0000-0003-3270-1306)
      • Web Identities: 3 (homepage, ScopusID, ResearcherID)
  • 15. Mark Matienzo’s Home Page (http://matienzo.org/)
      • URIs to GitHub repository and Twitter
      • Could be included in ORCID profile
  • 16. Ian Milligan’s ORCID (http://orcid.org/0000-0002-1470-7723)
      • Web Identities: 0
  • 17. Discovery of Web Identities via a Registry: ORCID
      • Evaluation of ORCID for automatic discovery of Web Identities
      • How well does ORCID represent the global community of active researchers?
          • Adoption rate
          • Subject coverage
          • Geo-location coverage
      • How well does ORCID score when it comes to listing Web Identities?
  • 18. Discovery of Web Identities via a Registry: ORCID
      • ORCID data
  • 19. ORCID - Adoption Rate
      • Extract from ORCID records:
          • First name
          • Last name
          • Affiliations
          • Works (publications, datasets, etc.)
          • Web identities
  • 20. ORCID - Adoption Rate
      • [Chart: ORCID counts for 2013, 2014, 2015, and 2016; axis ticks from 0 to 2,500,000. Series: ORCIDs total, ORCIDs with given names, ORCIDs with first names, ORCIDs with works, ORCIDs with affiliations, ORCIDs with web identities]
  • 21. ORCID - Subject Coverage
      • Extract DOIs from works
      • Match DOIs against CrossRef’s Metadata API
      • Obtain subject terms
      • Match against descriptive terms from the “Classification of Instructional Programs” (CIP) published by the Institute of Education Sciences
  • 22. ORCID - Subject Coverage: 2013
  • 23. ORCID - Subject Coverage: changes from 2013 to 2014
      • Ranks gained: Social Science, Education, History
      • Ranks lost: Computer Science, Legal professions, Journalism
  • 24. ORCID - Subject Coverage: changes from 2014 to 2015
      • Ranks gained: Social Science, Education
      • Ranks lost: Natural Resources and Conservation
  • 25. ORCID - Subject Coverage: changes from 2015 to 2016
      • Ranks gained: Natural Resources and Conservation
      • Ranks lost: Multi/Interdisciplinary Studies
  • 26. ORCID - Subject Coverage
      • Comparison of ORCID subjects with:
          1. Distribution of researchers’ disciplines
              • Proxy: Ph.D. recipients from U.S. universities
              • Obtained from NSF, 2015 data
          2. Distribution of publications’ disciplines
              • Obtained from UNESCO Science Report
              • U.S. data from 2014
      • Both report disciplines aligned with CIP terms, hence they are easily comparable.
  • 27. ORCID - Subject Coverage
      • [Chart: axis ticks from 0 to 60. Categories: Other, Life Sciences, Physical Sciences, Mathematics and Computer Sciences, Education, Psychology and Social Sciences, Engineering, Humanities and Arts. Series: ORCID Subjects, Ph.D. Researchers, Publications]
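The subject-coverage steps on slide 21 could be sketched roughly as follows. This is a minimal offline sketch under stated assumptions, not the authors' pipeline: the DOI pattern, the helper names `extract_dois` and `match_cip`, and the small keyword table are all hypothetical, and subject terms are passed in as plain strings rather than fetched from CrossRef.

```python
import re
from collections import Counter

# Assumed, simplified DOI pattern; real-world DOIs may need a stricter check.
DOI_PATTERN = re.compile(r"10\.\d{4,9}/\S+")


def extract_dois(identifiers):
    """Return lowercased DOIs found in a list of identifier strings."""
    dois = []
    for value in identifiers:
        match = DOI_PATTERN.search(value)
        if match:
            dois.append(match.group(0).lower())
    return dois


def match_cip(subject_terms, cip_keywords):
    """Count, per CIP category, the subject terms containing one of its keywords.

    cip_keywords is a hypothetical mapping {category: [lowercase keywords]};
    the real CIP descriptive terms are not reproduced here.
    """
    counts = Counter()
    for subject in subject_terms:
        text = subject.lower()
        for category, keywords in cip_keywords.items():
            if any(keyword in text for keyword in keywords):
                counts[category] += 1
    return counts


if __name__ == "__main__":
    # Illustrative inputs only.
    print(extract_dois(["https://doi.org/10.1000/ABC123", "no identifier"]))
    print(match_cip(
        ["Computer Science Applications", "History and Philosophy of Science"],
        {"Computer Science": ["computer"], "History": ["history"]},
    ))
```

Substring matching is a deliberately crude stand-in; how the original study aligned subject terms with CIP terms is not stated in the slides.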
  • 28. ORCID - Geo-Location Coverage
      • Extract affiliations from ORCID records
      • Aggregate country code for associated locations
      • Only available in ORCID data since 2015
      • Compare against UNESCO data of researcher distribution
  • 29. ORCID - Geo-Location Coverage
  • 30. ORCID - Geo-Location Coverage
  • 31. ORCID - Geo-Location Coverage
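The country aggregation described on slide 28 might look like the sketch below. The record layout (a dict with an `affiliations` list whose entries carry a `country` code) and the function name `country_shares` are assumptions made for illustration; the actual ORCID data structure is more involved. Counting each country once per record is also a choice made here, not something the slides specify.

```python
from collections import Counter


def country_shares(records):
    """Return {country_code: share of record-country pairs}.

    Each country is counted at most once per record, so a researcher with
    two affiliations in the same country does not count twice.
    """
    counts = Counter()
    for record in records:
        codes = {
            affiliation["country"]
            for affiliation in record.get("affiliations", [])
            if affiliation.get("country")
        }
        counts.update(codes)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {code: count / total for code, count in counts.items()}


if __name__ == "__main__":
    # Hypothetical records for demonstration.
    sample = [
        {"affiliations": [{"country": "US"}, {"country": "US"}]},
        {"affiliations": [{"country": "CA"}]},
        {"affiliations": []},
    ]
    print(country_shares(sample))
```

The resulting shares could then be set side by side with an external distribution such as the UNESCO researcher data mentioned on the slide.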
  • 32. ORCID - Web Identities
      • Analyze distribution of link “Labels”
      • Field lacks controlled vocabulary
  • 33. ORCID - Web Identities: top 20 labels, 2016
  • 34. ORCID - Web Identities: top 20 labels, 2016
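Because the label field is free text (slide 32), a label distribution only makes sense after some normalization. The sketch below assumes one simple scheme (lowercasing and collapsing whitespace); the names `normalize_label` and `top_labels` are hypothetical, and the slides do not say how the authors handled label variants.

```python
from collections import Counter


def normalize_label(label):
    """Lowercase a label and collapse runs of whitespace."""
    return " ".join(label.lower().split())


def top_labels(labels, n=20):
    """Return the n most frequent normalized labels with their counts."""
    normalized = (normalize_label(label) for label in labels if label and label.strip())
    return Counter(normalized).most_common(n)


if __name__ == "__main__":
    # Made-up labels for demonstration.
    print(top_labels(["Twitter", " twitter ", "Personal  Website", "TWITTER"], n=2))
```

Variants that differ by more than case or spacing (for example, abbreviations) would still be counted separately under this scheme.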
  • 35. Capture Flow
  • 36. Ian Milligan’s ORCID (http://orcid.org/0000-0002-1470-7723)
      • Artifacts?
  • 37. ORCID - Scholarly Orphans
      • Analyze distribution of types of “Work”, e.g.,
          • “journal article”: likely not an orphan
          • “data-set”: potential orphan
      • https://members.orcid.org/api/resources/work-types
  • 38. ORCID - Work Types
      • Dominated by types expected not to be orphans!
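The work-type analysis on slide 37 can be illustrated with a small tally. This is a sketch under assumptions: the input is a flat list of work-type strings, the function name `orphan_candidate_share` is hypothetical, and only `data-set` (the example named on the slide) is treated as a potential orphan type; any fuller list would be a judgment call.

```python
from collections import Counter

# Only "data-set" is taken from the slide; extending this set is an assumption.
POTENTIAL_ORPHAN_TYPES = {"data-set"}


def orphan_candidate_share(work_types):
    """Return (counts per work type, share of works with a potential orphan type)."""
    counts = Counter(work_types)
    total = sum(counts.values())
    if total == 0:
        return counts, 0.0
    candidates = sum(
        count for work_type, count in counts.items()
        if work_type in POTENTIAL_ORPHAN_TYPES
    )
    return counts, candidates / total


if __name__ == "__main__":
    # Made-up work types for demonstration.
    counts, share = orphan_candidate_share(
        ["journal-article", "journal-article", "journal-article", "data-set"]
    )
    print(counts.most_common(), share)
```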
  • 39. Take-Aways
      • ORCID adoption rate is increasing
      • Subject coverage is focused, does not cover all disciplines equally
      • Geo-location coverage is good but not quite representative
      • Web Identity coverage is poor; not usable for our purpose in its current state
      • Very few scholarly orphans directly referenced