Research Graph: Connecting Identifiers across Research Data Infrastructures
1. Research Graph: Connecting
Identifiers across Research
Data Infrastructures
Dr. Amir Aryani
Project Manager, Australian National Data Service (ANDS), http://ands.org.au
at Australian National University (ANU), http://people.anu.edu.au/amir.aryani
Co-chair of DDRI working group, Research Data Alliance, http://rd-alliance.org
email: amir.aryani@ands.org.au, twitter.com/amir_at_ands
http://orcid.org/0000-0002-4259-9774
This presentation on "Creating a Distributed Graph using RD-Switchboard" by Dr. Amir Aryani is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License
(http://creativecommons.org/licenses/by-sa/4.0/).
2. Agenda
● Challenge of cross-platform discovery
● Research Data Alliance working groups
● Example of Research Graph connections and Neo4j queries
● Impact of identifiers on connections
5. Question: What other researchers or research
activities are connected to this work?”
● What other related datasets has been published by the authors?
● What are the related publications to this dataset?
● What is the research portfolio of the authors related to this work?
● What research projects are related to this dataset?
8. Data Description Registry Interoperability (DDRI) WG
Goal: Enabling Cross-platform Discovery
between research data infrastructures
https://www.rd-alliance.org/groups/data-description-registry-interoperability.html
12. Using Neo4j and RD-Switchboard
● match (n:dataset) where n.doi='10.4225/35/563159f223739' return n
● match (n:dataset)--(r:researcher) where n.doi='10.4225/35/563159f223739'
return r.first_name, r.last_name
● match (r:orcid:researcher) where r.first_name='David' and
r.last_name='Ellsworth' return r.orcid
Find all the publications by 3 degrees of separation
● match (n:dataset)-[*1..3]-(p:publication) where
n.doi='10.4225/35/563159f223739' return n.title limit 50
14. Impact of identifiers: DOI
Case A: match (d:dataset)--(m) where not exists(d.doi) return count(distinct(d))
Case B: match (d:dataset)--(m) where exists(d.doi) return count(distinct(d))
15. Impact of identifiers: ORCID
Case A: match (r:researcher)--(m) where not exists(r.orcid) return count(distinct(r))
Case B: match (r:researcher)--(m) where exists(r.orcid) return count(distinct(r))
16. Summary
● President Identifiers has a crucial role in enabling connections in Research
Graph and enabling functionality of the Switchboard platform.
● Research Graph distributed graph currently more than ten million nodes from
CERN, Dryad, ORCID, figshare, ANDS, NII Japan, and a number of other
international partners. This data can be used for research in collaboration
networks.
● The Research Graph is an open collaborative project. If you are interested to
work with the project partners, please
○ Join DDRI Working Group:
https://www.rd-alliance.org/groups/data-description-registry-interoperability.html
○ and contact amir.aryani@ands.org.au
● More information:
○ http://researchgraph.org/
○ https://github.com/researchgraph
○ https://twitter.com/researchgraph