SlideShare uma empresa Scribd logo
1 de 32
How dinosaurs broke our system

              Challenges in building national researcher
                          identifier services


                          Amanda Hill
                         Names Project


JISC Conference, 2010
Hoping that…

 …Simeon has explained all about the name
  authority problem


 I‟d like to talk about some of the work
  that we‟ve done as part of the Names
  Project recently…


 …and how that fits into today‟s researcher
  identification landscape
Gross generalisation about past
approaches to author identifiers
       Libraries             Publishers


Book-level data        Article-level data

Labour intensive:      Automatically generated:
disambiguation first   disambiguation later

Authors not involved   Authors can edit

Open                   Proprietary
Current international activity
         ISNI                                  ORCID


Library-instigated                 Publisher-instigated


Disambiguation first               Disambiguation later


Authors not involved               Authors can submit/edit


Broad scope                        Current researchers
                       JISC Conference, 2010
Signs of convergence?

 Knowledge Exchange meeting on Digital
  Author Identifiers in March 2012
  encouraged alignment of ISNI and ORCID
  approaches


 ISNI has reserved a block of identifiers for
  use by ORCID

                    JISC Conference, 2010
Sources of information

 Both ORCID and ISNI will use existing pools
  of information to populate their systems

   ISNI: “Leveraging high confidence data from
    different domains”


   “ORCID will link to other name identifier
    systems”
                     JISC Conference, 2010
National author ID systems

 2011: JISC-funded survey and report on
  national author/researcher identifier
  systems around the world

   Report published November 2011
    http://ie-repository.jisc.ac.uk/567/
Maturity of systems (late 2011)
              System            In development since                 Number of identities

Lattes (Brazil)                             1999                              1,600,000

                                                                      31,000 researchers at 160
Frida/Cristin (Norway)                      2003
                                                                             institutions
                                                                      24,400 faculty with profiles
VIVO                                        2003                      150,000 total IDs including
                                                                     undisambiguated co-authors
                                                                           40,000 in the NTA
Digital Author Identifier     2005 (1980s for National Thesaurus
                                                                    15,000 researchers with Digital
(Netherlands)                         of Author Names)
                                                                              Author IDs
Names Project (UK)                          2007                                46,000
New Zealand Electronic Text
                                            2007                                2,000
Centre
Trove People and
Organisations/NLA Party                     2007                   900,000 people and organisations
Infrastructure (Australia)
AuthorClaim                                 2008                                 200

Researcher Name Resolver
                                            2008                               190,000
(Japan)
Populating identifier systems
System                        Records created by   Records imported from Records generated by
                              cataloguers          other systems         data subjects

AuthorClaim

Digital Author Identifier
(Netherlands)

Frida/Cristin (Norway)

Lattes (Brazil)

Names Project (UK)
New Zealand Electronic Text
Centre

Researcher Name Resolver
(Japan)

Trove People and
Organisations/NLA Party
Infrastructure (Australia)

VIVO
Good sources of data for some
          nations

National system      Existing unique identifiers
                     Researcher identifiers from national
Japan
                     researcher databases
                     Number from National Thesaurus of
Netherlands          Author names is converted into
                     Digital Author Identifier
                     Human resources data: social security
Norway
                     numbers

Other national systems assign new
identifiers as new identities are
established.
Features of mature national
          identifier systems

 With more mature systems:
   A national organisation generally has oversight: e.g. in
    Brazil, Norway, Netherlands

   Integration with research funders, reporting agencies
    and institutional repositories


 Individual institutions also have defined roles
  relating to managing information about their own
  staff
SITUATION IN UK

            JISC Conference, 2010
Work to investigate unique IDs
      for UK researchers
 Identified in 2006 as part of the call for
  proposals for the JISC-funded Repositories
  and Preservation Programme

 Mimas and the British Library proposed a two-
  year project to:
   Investigate requirements for a UK name authority
    service
   Build a pilot system to demonstrate potential
The Names Project
       The Chang Project
„From the Annals of the Onomastic
  Society‟


Ian Watson (1990)
Names (not an acronym…)


 Name Authorities Make Everything Simpler


 Names: Ambiguous, Meaningful (or
  Meaningless?), Essential, Symbolic


 …nearly everyone has a name-related
  story
Rhyming couples




     JISC Conference, 2010
Original plan
 Use data from British Library‟s Zetoc service to
  create author IDs
    Journal article information from 1993->
    Last names, initials, paper titles, subject
     classifications


 But…
    International in scope
    Lack of information on affiliations and first names to
     help with making matches
    Huge dataset -> processing issues
Revised plan
 Used 2008 Research Assessment Exercise
  data (as cleaned up by JISC Merit project)
  to pre-populate the Names system
   Identify unique individuals and assign
    identifiers
 Data quality good, included institutional
  information: high accuracy, despite only
  having initials, not full first names
 Except for…
                      JISC Conference, 2010
JISC Conference, 2010
Building on Merit…

 Merit data covers around 20% of active UK
  researchers


 Working to enhance records and create
  new ones with information from other
  sources
   Institutional repositories
   British Library data sets (Zetoc)
   Direct input from researchers
Submission form




     JISC Conference, 2010
http://separatedbyacommonlanguage.blogspot.com/2009/08/initials-and-names.html
Quality matters
 Automatic matching can only achieve so
  much
   Dependent on data source


 British Library team perform manual check of
  results of matching new data sources
   Allows for separation/merging of records


 Plan to allow people to update their own
  information
Ultimate aim
 High-quality set of unique identifiers for UK
  researchers and research institutions

 Available to other systems (national and
  international)
    e.g. Names records exported to ISNI in 2011


 Possible additional services
    Disambiguation of existing data sets
    Identification of external researchers
Access to Names

 API allows for flexible searching of Names
  data


 EPrints plugin released in 2011: allows
  repository users to choose from a list of
  Names identities
   …and to create a Names record if none exists

                     JISC Conference, 2010
JISC Conference, 2010
JISC Conference, 2010
Next steps…

 JISC-convened Researcher ID group – final
  meeting in September > recommendations


 Options Appraisal Report for UK national
  researcher identifier service > December


 Improving data and adding new records

                   JISC Conference, 2010
Summing up

 Names is a hybrid of library/publisher
  approaches
   Automated matching/disambiguation
   Human quality checks
   Data immediately available for re-use in other
    systems
   Researchers can supply information
An evolving area

 Main challenges are cultural and political
  rather than technical


 National author/researcher ID services can be
  important parts of research infrastructure


 Getting agreement and co-ordination at
  national level is vital
Project updates

 Names: http://names.mimas.ac.uk


 Blog: http://namesproject.wordpress.com


 Twitter: @NamesProject



                   JISC Conference, 2010

Mais conteúdo relacionado

Semelhante a How dinosaurs broke our system: challenges in building national researcher identifier services

Riding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessRiding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessdatacite
 
Introduction to Scratchpads & ViBRANT
Introduction to Scratchpads & ViBRANTIntroduction to Scratchpads & ViBRANT
Introduction to Scratchpads & ViBRANTEdward Baker
 
Building the Mother of All Collections: the future of the National Library's ...
Building the Mother of All Collections: the future of the National Library's ...Building the Mother of All Collections: the future of the National Library's ...
Building the Mother of All Collections: the future of the National Library's ...wcathro
 
How Bio Ontologies Enable Open Science
How Bio Ontologies Enable Open ScienceHow Bio Ontologies Enable Open Science
How Bio Ontologies Enable Open Sciencedrnigam
 
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...OpenAIRE
 
Publishing of Scientific Data - Science Foundation Ireland Summit 2010
Publishing of Scientific Data  - Science Foundation Ireland Summit 2010Publishing of Scientific Data  - Science Foundation Ireland Summit 2010
Publishing of Scientific Data - Science Foundation Ireland Summit 2010jodischneider
 
DataCite at APE 2011
DataCite at APE 2011DataCite at APE 2011
DataCite at APE 2011datacite
 
ViBRANT—Virtual Biodiversity Research and Access Network for Taxonomy
ViBRANT—Virtual Biodiversity Research and Access Network for TaxonomyViBRANT—Virtual Biodiversity Research and Access Network for Taxonomy
ViBRANT—Virtual Biodiversity Research and Access Network for TaxonomyVince Smith
 
OCLC Research @ U of Calgary: New directions for metadata workflows across li...
OCLC Research @ U of Calgary: New directions for metadata workflows across li...OCLC Research @ U of Calgary: New directions for metadata workflows across li...
OCLC Research @ U of Calgary: New directions for metadata workflows across li...OCLC Research
 
ICPSR Data Exploration Tools
ICPSR Data Exploration ToolsICPSR Data Exploration Tools
ICPSR Data Exploration ToolsICPSR
 
China: Journal Publishing, DOI and CrossCheck (2011 CrossRef Workshops)
China: Journal Publishing, DOI and CrossCheck (2011 CrossRef Workshops)China: Journal Publishing, DOI and CrossCheck (2011 CrossRef Workshops)
China: Journal Publishing, DOI and CrossCheck (2011 CrossRef Workshops)Crossref
 
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...ORCID, Inc
 
DataCite overview 2014
DataCite overview 2014DataCite overview 2014
DataCite overview 2014datacite
 
Research Data-DOI Experiment in Japanese DOI Registration Agency (Japan Link ...
Research Data-DOI Experiment in Japanese DOI Registration Agency (Japan Link ...Research Data-DOI Experiment in Japanese DOI Registration Agency (Japan Link ...
Research Data-DOI Experiment in Japanese DOI Registration Agency (Japan Link ...National Institute of Informatics (NII)
 
Semantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesSemantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesStefan Dietze
 

Semelhante a How dinosaurs broke our system: challenges in building national researcher identifier services (20)

Riding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessRiding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information access
 
Introduction to Scratchpads & ViBRANT
Introduction to Scratchpads & ViBRANTIntroduction to Scratchpads & ViBRANT
Introduction to Scratchpads & ViBRANT
 
Building the Mother of All Collections: the future of the National Library's ...
Building the Mother of All Collections: the future of the National Library's ...Building the Mother of All Collections: the future of the National Library's ...
Building the Mother of All Collections: the future of the National Library's ...
 
How Bio Ontologies Enable Open Science
How Bio Ontologies Enable Open ScienceHow Bio Ontologies Enable Open Science
How Bio Ontologies Enable Open Science
 
Jan Brase: Data and Libraries - the DataCite consortium
Jan Brase: Data and Libraries - the DataCite consortiumJan Brase: Data and Libraries - the DataCite consortium
Jan Brase: Data and Libraries - the DataCite consortium
 
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
 
Publishing of Scientific Data - Science Foundation Ireland Summit 2010
Publishing of Scientific Data  - Science Foundation Ireland Summit 2010Publishing of Scientific Data  - Science Foundation Ireland Summit 2010
Publishing of Scientific Data - Science Foundation Ireland Summit 2010
 
DataCite at APE 2011
DataCite at APE 2011DataCite at APE 2011
DataCite at APE 2011
 
Metadata for Interoperable Bioscience
Metadata for Interoperable BioscienceMetadata for Interoperable Bioscience
Metadata for Interoperable Bioscience
 
ViBRANT—Virtual Biodiversity Research and Access Network for Taxonomy
ViBRANT—Virtual Biodiversity Research and Access Network for TaxonomyViBRANT—Virtual Biodiversity Research and Access Network for Taxonomy
ViBRANT—Virtual Biodiversity Research and Access Network for Taxonomy
 
Carpenter "The Future of the Scholarly Record"
Carpenter "The Future of the Scholarly Record"Carpenter "The Future of the Scholarly Record"
Carpenter "The Future of the Scholarly Record"
 
OCLC Research @ U of Calgary: New directions for metadata workflows across li...
OCLC Research @ U of Calgary: New directions for metadata workflows across li...OCLC Research @ U of Calgary: New directions for metadata workflows across li...
OCLC Research @ U of Calgary: New directions for metadata workflows across li...
 
ICPSR Data Exploration Tools
ICPSR Data Exploration ToolsICPSR Data Exploration Tools
ICPSR Data Exploration Tools
 
China: Journal Publishing, DOI and CrossCheck (2011 CrossRef Workshops)
China: Journal Publishing, DOI and CrossCheck (2011 CrossRef Workshops)China: Journal Publishing, DOI and CrossCheck (2011 CrossRef Workshops)
China: Journal Publishing, DOI and CrossCheck (2011 CrossRef Workshops)
 
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...
 
NISO Webinar: Identify This! Identify That! New Identifiers and New Uses
NISO Webinar: Identify This! Identify That! New Identifiers and New UsesNISO Webinar: Identify This! Identify That! New Identifiers and New Uses
NISO Webinar: Identify This! Identify That! New Identifiers and New Uses
 
Resources, resources, resources: the three rs of the Web
Resources, resources, resources: the three rs of the WebResources, resources, resources: the three rs of the Web
Resources, resources, resources: the three rs of the Web
 
DataCite overview 2014
DataCite overview 2014DataCite overview 2014
DataCite overview 2014
 
Research Data-DOI Experiment in Japanese DOI Registration Agency (Japan Link ...
Research Data-DOI Experiment in Japanese DOI Registration Agency (Japan Link ...Research Data-DOI Experiment in Japanese DOI Registration Agency (Japan Link ...
Research Data-DOI Experiment in Japanese DOI Registration Agency (Japan Link ...
 
Semantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesSemantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital Libraries
 

Mais de Amanda Hill

Managing small archives
Managing small archivesManaging small archives
Managing small archivesAmanda Hill
 
History as healer
History as healerHistory as healer
History as healerAmanda Hill
 
Beyond the Cenotaph: a 21st century commemoration
Beyond the Cenotaph: a 21st century commemorationBeyond the Cenotaph: a 21st century commemoration
Beyond the Cenotaph: a 21st century commemorationAmanda Hill
 
Getting archives online with Archeion
Getting archives online with ArcheionGetting archives online with Archeion
Getting archives online with ArcheionAmanda Hill
 
Working outside the walls: from gatekeeper to keymaster
Working outside the walls: from gatekeeper to keymasterWorking outside the walls: from gatekeeper to keymaster
Working outside the walls: from gatekeeper to keymasterAmanda Hill
 
Getting your hands on archival gold
Getting your hands on archival goldGetting your hands on archival gold
Getting your hands on archival goldAmanda Hill
 
Breaking barriers without breaking the bank
Breaking barriers without breaking the bankBreaking barriers without breaking the bank
Breaking barriers without breaking the bankAmanda Hill
 
Archeion workshops, March 2012
Archeion workshops, March 2012Archeion workshops, March 2012
Archeion workshops, March 2012Amanda Hill
 
Introduction to arrangement and description (feb 4&5, 2012)
Introduction to arrangement and description (feb 4&5, 2012)Introduction to arrangement and description (feb 4&5, 2012)
Introduction to arrangement and description (feb 4&5, 2012)Amanda Hill
 
Losing control: experiments in outreach
Losing control: experiments in outreachLosing control: experiments in outreach
Losing control: experiments in outreachAmanda Hill
 
Exploring Strange New Worlds: Archives TNG
Exploring Strange New Worlds: Archives TNGExploring Strange New Worlds: Archives TNG
Exploring Strange New Worlds: Archives TNGAmanda Hill
 
Archives 2.0 on a micro scale
Archives 2.0 on a micro scaleArchives 2.0 on a micro scale
Archives 2.0 on a micro scaleAmanda Hill
 
What's in a Name?
What's in a Name?What's in a Name?
What's in a Name?Amanda Hill
 
Names Amanda Hill
Names Amanda HillNames Amanda Hill
Names Amanda HillAmanda Hill
 
Introduction to the Names Project
Introduction to the Names ProjectIntroduction to the Names Project
Introduction to the Names ProjectAmanda Hill
 
A Question Of Interpretation: the role of archivists in an online age
A Question Of  Interpretation: the role of archivists in an online ageA Question Of  Interpretation: the role of archivists in an online age
A Question Of Interpretation: the role of archivists in an online ageAmanda Hill
 
Creating and maintaining a blog: the Archives Hub Blog
Creating and maintaining a blog: the Archives Hub BlogCreating and maintaining a blog: the Archives Hub Blog
Creating and maintaining a blog: the Archives Hub BlogAmanda Hill
 
Interoperability Without Effort
Interoperability Without EffortInteroperability Without Effort
Interoperability Without EffortAmanda Hill
 
Opening up the archives: from basement to browser
Opening up the archives: from basement to browserOpening up the archives: from basement to browser
Opening up the archives: from basement to browserAmanda Hill
 

Mais de Amanda Hill (20)

Managing small archives
Managing small archivesManaging small archives
Managing small archives
 
The past on tap
The past on tapThe past on tap
The past on tap
 
History as healer
History as healerHistory as healer
History as healer
 
Beyond the Cenotaph: a 21st century commemoration
Beyond the Cenotaph: a 21st century commemorationBeyond the Cenotaph: a 21st century commemoration
Beyond the Cenotaph: a 21st century commemoration
 
Getting archives online with Archeion
Getting archives online with ArcheionGetting archives online with Archeion
Getting archives online with Archeion
 
Working outside the walls: from gatekeeper to keymaster
Working outside the walls: from gatekeeper to keymasterWorking outside the walls: from gatekeeper to keymaster
Working outside the walls: from gatekeeper to keymaster
 
Getting your hands on archival gold
Getting your hands on archival goldGetting your hands on archival gold
Getting your hands on archival gold
 
Breaking barriers without breaking the bank
Breaking barriers without breaking the bankBreaking barriers without breaking the bank
Breaking barriers without breaking the bank
 
Archeion workshops, March 2012
Archeion workshops, March 2012Archeion workshops, March 2012
Archeion workshops, March 2012
 
Introduction to arrangement and description (feb 4&5, 2012)
Introduction to arrangement and description (feb 4&5, 2012)Introduction to arrangement and description (feb 4&5, 2012)
Introduction to arrangement and description (feb 4&5, 2012)
 
Losing control: experiments in outreach
Losing control: experiments in outreachLosing control: experiments in outreach
Losing control: experiments in outreach
 
Exploring Strange New Worlds: Archives TNG
Exploring Strange New Worlds: Archives TNGExploring Strange New Worlds: Archives TNG
Exploring Strange New Worlds: Archives TNG
 
Archives 2.0 on a micro scale
Archives 2.0 on a micro scaleArchives 2.0 on a micro scale
Archives 2.0 on a micro scale
 
What's in a Name?
What's in a Name?What's in a Name?
What's in a Name?
 
Names Amanda Hill
Names Amanda HillNames Amanda Hill
Names Amanda Hill
 
Introduction to the Names Project
Introduction to the Names ProjectIntroduction to the Names Project
Introduction to the Names Project
 
A Question Of Interpretation: the role of archivists in an online age
A Question Of  Interpretation: the role of archivists in an online ageA Question Of  Interpretation: the role of archivists in an online age
A Question Of Interpretation: the role of archivists in an online age
 
Creating and maintaining a blog: the Archives Hub Blog
Creating and maintaining a blog: the Archives Hub BlogCreating and maintaining a blog: the Archives Hub Blog
Creating and maintaining a blog: the Archives Hub Blog
 
Interoperability Without Effort
Interoperability Without EffortInteroperability Without Effort
Interoperability Without Effort
 
Opening up the archives: from basement to browser
Opening up the archives: from basement to browserOpening up the archives: from basement to browser
Opening up the archives: from basement to browser
 

Último

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 

Último (20)

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 

How dinosaurs broke our system: challenges in building national researcher identifier services

  • 1. How dinosaurs broke our system Challenges in building national researcher identifier services Amanda Hill Names Project JISC Conference, 2010
  • 2. Hoping that…  …Simeon has explained all about the name authority problem  I‟d like to talk about some of the work that we‟ve done as part of the Names Project recently…  …and how that fits into today‟s researcher identification landscape
  • 3. Gross generalisation about past approaches to author identifiers Libraries Publishers Book-level data Article-level data Labour intensive: Automatically generated: disambiguation first disambiguation later Authors not involved Authors can edit Open Proprietary
  • 4. Current international activity ISNI ORCID Library-instigated Publisher-instigated Disambiguation first Disambiguation later Authors not involved Authors can submit/edit Broad scope Current researchers JISC Conference, 2010
  • 5. Signs of convergence?  Knowledge Exchange meeting on Digital Author Identifiers in March 2012 encouraged alignment of ISNI and ORCID approaches  ISNI has reserved a block of identifiers for use by ORCID JISC Conference, 2010
  • 6. Sources of information  Both ORCID and ISNI will use existing pools of information to populate their systems  ISNI: “Leveraging high confidence data from different domains”  “ORCID will link to other name identifier systems” JISC Conference, 2010
  • 7. National author ID systems  2011: JISC-funded survey and report on national author/researcher identifier systems around the world  Report published November 2011 http://ie-repository.jisc.ac.uk/567/
  • 8. Maturity of systems (late 2011) System In development since Number of identities Lattes (Brazil) 1999 1,600,000 31,000 researchers at 160 Frida/Cristin (Norway) 2003 institutions 24,400 faculty with profiles VIVO 2003 150,000 total IDs including undisambiguated co-authors 40,000 in the NTA Digital Author Identifier 2005 (1980s for National Thesaurus 15,000 researchers with Digital (Netherlands) of Author Names) Author IDs Names Project (UK) 2007 46,000 New Zealand Electronic Text 2007 2,000 Centre Trove People and Organisations/NLA Party 2007 900,000 people and organisations Infrastructure (Australia) AuthorClaim 2008 200 Researcher Name Resolver 2008 190,000 (Japan)
  • 9. Populating identifier systems System Records created by Records imported from Records generated by cataloguers other systems data subjects AuthorClaim Digital Author Identifier (Netherlands) Frida/Cristin (Norway) Lattes (Brazil) Names Project (UK) New Zealand Electronic Text Centre Researcher Name Resolver (Japan) Trove People and Organisations/NLA Party Infrastructure (Australia) VIVO
  • 10. Good sources of data for some nations National system Existing unique identifiers Researcher identifiers from national Japan researcher databases Number from National Thesaurus of Netherlands Author names is converted into Digital Author Identifier Human resources data: social security Norway numbers Other national systems assign new identifiers as new identities are established.
  • 11. Features of mature national identifier systems  With more mature systems:  A national organisation generally has oversight: e.g. in Brazil, Norway, Netherlands  Integration with research funders, reporting agencies and institutional repositories  Individual institutions also have defined roles relating to managing information about their own staff
  • 12. SITUATION IN UK JISC Conference, 2010
  • 13. Work to investigate unique IDs for UK researchers  Identified in 2006 as part of the call for proposals for the JISC-funded Repositories and Preservation Programme  Mimas and the British Library proposed a two- year project to:  Investigate requirements for a UK name authority service  Build a pilot system to demonstrate potential
  • 14. The Names Project The Chang Project „From the Annals of the Onomastic Society‟ Ian Watson (1990)
  • 15. Names (not an acronym…)  Name Authorities Make Everything Simpler  Names: Ambiguous, Meaningful (or Meaningless?), Essential, Symbolic  …nearly everyone has a name-related story
  • 16. Rhyming couples JISC Conference, 2010
  • 17. Original plan  Use data from British Library‟s Zetoc service to create author IDs  Journal article information from 1993->  Last names, initials, paper titles, subject classifications  But…  International in scope  Lack of information on affiliations and first names to help with making matches  Huge dataset -> processing issues
  • 18. Revised plan  Used 2008 Research Assessment Exercise data (as cleaned up by JISC Merit project) to pre-populate the Names system  Identify unique individuals and assign identifiers  Data quality good, included institutional information: high accuracy, despite only having initials, not full first names  Except for… JISC Conference, 2010
  • 19.
  • 21. Building on Merit…  Merit data covers around 20% of active UK researchers  Working to enhance records and create new ones with information from other sources  Institutional repositories  British Library data sets (Zetoc)  Direct input from researchers
  • 22. Submission form JISC Conference, 2010
  • 24. Quality matters  Automatic matching can only achieve so much  Dependent on data source  British Library team perform manual check of results of matching new data sources  Allows for separation/merging of records  Plan to allow people to update their own information
  • 25. Ultimate aim  High-quality set of unique identifiers for UK researchers and research institutions  Available to other systems (national and international)  e.g. Names records exported to ISNI in 2011  Possible additional services  Disambiguation of existing data sets  Identification of external researchers
  • 26. Access to Names  API allows for flexible searching of Names data  EPrints plugin released in 2011: allows repository users to choose from a list of Names identities  …and to create a Names record if none exists JISC Conference, 2010
  • 29. Next steps…  JISC-convened Researcher ID group – final meeting in September > recommendations  Options Appraisal Report for UK national researcher identifier service > December  Improving data and adding new records JISC Conference, 2010
  • 30. Summing up  Names is a hybrid of library/publisher approaches  Automated matching/disambiguation  Human quality checks  Data immediately available for re-use in other systems  Researchers can supply information
  • 31. An evolving area  Main challenges are cultural and political rather than technical  National author/researcher ID services can be important parts of research infrastructure  Getting agreement and co-ordination at national level is vital
  • 32. Project updates  Names: http://names.mimas.ac.uk  Blog: http://namesproject.wordpress.com  Twitter: @NamesProject JISC Conference, 2010

Notas do Editor

  1. Deletable?
  2. …and, I would say, are all very jealous of those countries with ready-made data sources like this…
  3. Namey anecdote here? Dicky Moore & Robin Armstrong Viner?
  4. Known in name authority circles as ‘the Siveter problem’
  5. Every time we add a new data set, the quality of the data within the Names pilot improves – recently added information from the University of the West of England – QA process highlighted a previously unnoticed problem with the original Merit data.