SlideShare uma empresa Scribd logo
1 de 23
@openaire_euOpenAIRE-Connect Review
23rd of April, 2018 - Brussels
The OpenAIRE Research Graph
Bringing scholarly communication back into the
hands of scientists
PaoloManghi
InstituteofInformationScienceandTechnologies
ConsiglioNazionaledelleRicerche
Materializing the Open Science Graph
Project
communit
y
FunderFunding
Product
Publicatio
n
Researc
h Data
Software
Organizatio
n
Source
Other
res.
products
Mining
Deduplication
End-user feedback
Scientific product
catalogue
Harvesting
GUIDE
LINES
Research Infrastructures Publishing
IT
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
Providing an open metadata
research graph of interlinked
scientific products, with Open
Access information, linked to
funding information and research
communities
The OpenAIRE research graph
Open
Complete
De-duplicated
Transparent
Participatory
Decentralized
Trusted
De-duplicated
More information about the de-duplication framework used by OpenAIRE can be found
searching on Zenodo for :
• “De-duplicating the OpenAIRE Scholarly Communication Big Graph” (poster)
• “GDup: De-Duplication of Scholarly Communication Big Graphs”
Metadata records
corresponding to equivalent
objects are merged
Scientific products
Organizations
Complete: community-trusted sources
Academic Graph
… and more
… and more
… and more
… and more
… and more
… and more
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
• Rely on quality scholarly
communication sources of
different kinds
Participatory
• Include solutions and content
from any interested and known
content provider in scholarly
communication
Institutional repositories
Aggregators
Data archives
Software repositories
Research infrastructure sources
Funder grant databases
Authors & Orgs entity registries
Publishers & journals
• Metadata in the graph includes provenance when harvested
and reliability indicators when obtained from mining
Transparent
• Preservation and ownership beyond OpenAIRE
Exchanged with other graph initiatives
Broker Service: Redistributed via subscription and
notification to contributing data sources
(provide.openaire.eu)
• Openly accessible via APIs
(develop.openaire.eu)
Decentralized
• Authors in the loop to enrich their ORCID record
• Validation of end-user ”claims”
Trusted (November 2019)
Populating the Graph
Harvesting: Revised Classification of Research
Products
Publications
• Article
• Preprint
• Report
• …
Datasets
• Dataset
• Collection
• Clinical Trials
• …
Software
• Research
Software
• …
Other Research
Products
• Service
• Workflow
• Interactive
Resource
• …
Institutional/
publication
repositories
Journals/
publishers
Data
repositories
Other
Products
repositories
Software
repositories
Workshop Técnico OpenAIRE / LA Referencia | 29-30 October, 2019 | Costa Rica
Open Science publishing
Bridging RIs and Scholarly Communication
Transparency and reproducibility
e-Infrastructures and
Research Infrastructures
Scholarly Communication
infrastructure
Dataset
Method Thematic
Service
Dataset
Experiment Publishing
the experiment
Input
Dataset
Input
Method
Output
Dataset
Experiment
product
Thematic Service
Parameters
Experiment
repo
Research data,
Software,
Workflows,
Publications
Data repo
Method repo
Publications
IT
Harvesting
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
• EPOS Research Infrastructure
Reproducibility
Transparency
Seamless publishing
Open Science publishing workflows
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
Pre-processed sources
Article-dataset links
480Mi links
CrossRef enriched
85Mi publication records
DOIBoost
Academic Graph
Published every 6 months
(new versions to be published next week)
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
Context Propagation
Product
Source
Country
Project
Organization
communit
y
Product
Project Source
Product
Project
Product
supplementedBy
fundedBy
hostedBy
(institutional repository)
located
Funder
funds
(National Funder)
fundedBy
jurisdiction
located
ofInterestofInterest
fundedBy
hostedBy
Product
supplementedBy
157K
8Mi 10K
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
Production: Open Access CAPs
BETA: Open Science CAPs
0
10000000
20000000
30000000
40000000
50000000
60000000
70000000
80000000
90000000
100000000
Old CAP New CAP
literature
0
2000000
4000000
6000000
8000000
10000000
12000000
Old CAP New CAP
research data
0
20000
40000
60000
80000
100000
120000
140000
Old CAP New CAP
software
0
500000
1000000
1500000
2000000
2500000
3000000
3500000
4000000
4500000
Old CAP New CAP
other
110Mi
30Mi
1Mi
10Mi
100K
180K
3Mi
7.5Mi
Harvested content
• Data sources
10K +
• Records
~480Mi
• Publication full-texts
~12Mi (Springer N. coming)
• Links (also text-mined)
~960Mi
PROD BETA PROD BETA
PROD BETAPROD BETA
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
Microsoft Research (being drafted)
Unpaywall (ongoing)
ORCID membership (November 2019)
RDA IG Open Science Graphs for FAIR Data
FREYA, ResearchGraph, OpenCitations,
Open Knowledge Research Graph
IG Session at RDA Helsinki 2019 (15th of October 2019)
Liaisons
Academic Graph
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
• October-November 2019:
OpenAIRE Research Graph open for consultation
Collecting feedback via Trello (operational end of September)
• December 2019:
OpenAIRE Research Graph
in production
BETA Graph Open Consultation
http://beta.explore.openaire.eu
OpenAIREAdvance1stReview|Luxembourg|10Oct2019
Trello for for feedback
Thank you!
Paolo Manghi
paolo.manghi@isti.cnr.it
Architecture,
technologies, and
infrastructure
Metadata
records
files
cleaned
records
Full-text
cache
Transform
Clean
Identify
equivelent
products
and
organisation
s
Aggregation subsystem
De-duplication
subsystem
Information Inference subsystem
Data Sources
Populate
Merge equivalent objects
Data provision
subsystem
Collect
Native graph
“slices”
Publishing
subsystem
Data Monitoring
Action Sets
(similarity
rels)
Front-end
Native
graph
Deduped
graph
Extract full-text
Copy of deduped
graph
Enrich graphs with links
Action Set
(inferred
links)
Enriched
graph
Propagation
Text-mining of
the full-texts and
the graph to
derive new
semantic links
Architecture and technologies: today
Task 9.1. System administration -
infrastructure: before Jan 2018
Public
System
20srv
122CPU
320GB
8TB
Mining
System
21srv
406CPU
2TB
385TB
Data provision
System
23srv
154CPU
430GB
23TB
Testing
System
5srv
30CPU
100GB
3TB
Public
System
44srv
274CPU
905GB
20TB
Mining
System
22srv
414CPU
2.2TB
388TB
Data provision
System
23srv
154CPU
430GB
24TB
Testing
System
14srv
86CPU
302GB
9TB
OpenAIREAdvance1stReview|Luxembourg|10Oct2019

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case
A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE caseA Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case
A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case
 
Intact danish workshop_20171001
Intact danish workshop_20171001Intact danish workshop_20171001
Intact danish workshop_20171001
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
 
20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph
 
ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...
ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...
ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...
 
The European Open Science Cloud
The European Open Science CloudThe European Open Science Cloud
The European Open Science Cloud
 
Towards a Linked Data Publishing Methodology
Towards a Linked Data Publishing MethodologyTowards a Linked Data Publishing Methodology
Towards a Linked Data Publishing Methodology
 
Grant Funding Programme
Grant Funding ProgrammeGrant Funding Programme
Grant Funding Programme
 
EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)
EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)
EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)
 
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFLOpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
 
OpenAIRE: Implementing Open Science in EOSC - crosscutting with RDA (Presenta...
OpenAIRE: Implementing Open Science in EOSC - crosscutting with RDA (Presenta...OpenAIRE: Implementing Open Science in EOSC - crosscutting with RDA (Presenta...
OpenAIRE: Implementing Open Science in EOSC - crosscutting with RDA (Presenta...
 
Knowledge Exchange Consensus: Monitoring of Open Access Publications and Cost...
Knowledge Exchange Consensus: Monitoring of Open Access Publications and Cost...Knowledge Exchange Consensus: Monitoring of Open Access Publications and Cost...
Knowledge Exchange Consensus: Monitoring of Open Access Publications and Cost...
 
Using Open Research Data for Public Policy Making: Opportunities of Virtual R...
Using Open Research Data for Public Policy Making: Opportunities of Virtual R...Using Open Research Data for Public Policy Making: Opportunities of Virtual R...
Using Open Research Data for Public Policy Making: Opportunities of Virtual R...
 
7th Content Providers Community Call
7th Content Providers Community Call7th Content Providers Community Call
7th Content Providers Community Call
 
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Wide access to spatial Citizen Science data - ECSA Berlin 2016Wide access to spatial Citizen Science data - ECSA Berlin 2016
Wide access to spatial Citizen Science data - ECSA Berlin 2016
 
Demonstration of the 4C cost comparison tool
Demonstration of the 4C cost comparison toolDemonstration of the 4C cost comparison tool
Demonstration of the 4C cost comparison tool
 
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
 
The Services of the OpenAIREplus Infrastructure for Scholarly Communication –...
The Services of the OpenAIREplus Infrastructure for Scholarly Communication –...The Services of the OpenAIREplus Infrastructure for Scholarly Communication –...
The Services of the OpenAIREplus Infrastructure for Scholarly Communication –...
 
Scaling Usage Statistics across Repositories as an OpenAIRE Analytics Service...
Scaling Usage Statistics across Repositories as an OpenAIRE Analytics Service...Scaling Usage Statistics across Repositories as an OpenAIRE Analytics Service...
Scaling Usage Statistics across Repositories as an OpenAIRE Analytics Service...
 
From Box to Hydra via Archivematica
From Box to Hydra via ArchivematicaFrom Box to Hydra via Archivematica
From Box to Hydra via Archivematica
 

Semelhante a 20191119_The OpenAIRE Research Graph

Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...
Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...
Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...
OpenAIRE
 

Semelhante a 20191119_The OpenAIRE Research Graph (20)

Belgium webinar - openAIRE Research Graph
Belgium webinar - openAIRE Research GraphBelgium webinar - openAIRE Research Graph
Belgium webinar - openAIRE Research Graph
 
Facilitate Research Communities Adoption of Open Science Publishing Principle...
Facilitate Research Communities Adoption of Open Science Publishing Principle...Facilitate Research Communities Adoption of Open Science Publishing Principle...
Facilitate Research Communities Adoption of Open Science Publishing Principle...
 
OpenAIRE Open Science publishing for Research Infrastructures: the EPOS use-c...
OpenAIRE Open Science publishing for Research Infrastructures: the EPOS use-c...OpenAIRE Open Science publishing for Research Infrastructures: the EPOS use-c...
OpenAIRE Open Science publishing for Research Infrastructures: the EPOS use-c...
 
Open sciencerefresher2019
Open sciencerefresher2019Open sciencerefresher2019
Open sciencerefresher2019
 
Introduction to OpenAIRE services and the OpenAIRE Research Graph
Introduction to OpenAIRE services and the OpenAIRE Research GraphIntroduction to OpenAIRE services and the OpenAIRE Research Graph
Introduction to OpenAIRE services and the OpenAIRE Research Graph
 
OpenAIRE-Advance: Advancing Open Scholarship (Presentation at RDA 11th Plenary)
OpenAIRE-Advance: Advancing Open Scholarship (Presentation at RDA 11th Plenary)OpenAIRE-Advance: Advancing Open Scholarship (Presentation at RDA 11th Plenary)
OpenAIRE-Advance: Advancing Open Scholarship (Presentation at RDA 11th Plenary)
 
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
 
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
 
OpenAIRE @ OECD Blue Sky III
OpenAIRE @ OECD Blue Sky IIIOpenAIRE @ OECD Blue Sky III
OpenAIRE @ OECD Blue Sky III
 
Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...
Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...
Infraestructuras, recursos y servicios de OpenAIRE. OpenAIRE Workshop Spain, ...
 
IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020
IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020
IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020
 
Enabling better science: Results and vision of the OpenAIRE infrastructure an...
Enabling better science: Results and vision of the OpenAIRE infrastructure an...Enabling better science: Results and vision of the OpenAIRE infrastructure an...
Enabling better science: Results and vision of the OpenAIRE infrastructure an...
 
Enabling better science - Results and vision of the OpenAIRE infrastructure a...
Enabling better science - Results and vision of the OpenAIRE infrastructure a...Enabling better science - Results and vision of the OpenAIRE infrastructure a...
Enabling better science - Results and vision of the OpenAIRE infrastructure a...
 
Overview of the OA mandate and OpenAIRE infrastructure, Inge Van Nieuwerburgh...
Overview of the OA mandate and OpenAIRE infrastructure, Inge Van Nieuwerburgh...Overview of the OA mandate and OpenAIRE infrastructure, Inge Van Nieuwerburgh...
Overview of the OA mandate and OpenAIRE infrastructure, Inge Van Nieuwerburgh...
 
A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...
 
OpenAIRE content in support of Open Science monitoring (Presentation by Paolo...
OpenAIRE content in support of Open Science monitoring (Presentation by Paolo...OpenAIRE content in support of Open Science monitoring (Presentation by Paolo...
OpenAIRE content in support of Open Science monitoring (Presentation by Paolo...
 
Open Science as-a-Service for research communities: preliminary results and u...
Open Science as-a-Service for research communities: preliminary results and u...Open Science as-a-Service for research communities: preliminary results and u...
Open Science as-a-Service for research communities: preliminary results and u...
 
Moving content across the OpenAIRE infrastructure boundaries (6th RDA Plenary)
Moving content across the OpenAIRE infrastructure boundaries (6th RDA Plenary) Moving content across the OpenAIRE infrastructure boundaries (6th RDA Plenary)
Moving content across the OpenAIRE infrastructure boundaries (6th RDA Plenary)
 
Text Mining: the next data frontier. Beyond Open Access
Text Mining: the next data frontier. Beyond Open AccessText Mining: the next data frontier. Beyond Open Access
Text Mining: the next data frontier. Beyond Open Access
 
OpenAIRE services and tools - presentation at #DI4R2016
OpenAIRE services and tools - presentation at #DI4R2016OpenAIRE services and tools - presentation at #DI4R2016
OpenAIRE services and tools - presentation at #DI4R2016
 

Mais de OpenAIRE

Mais de OpenAIRE (20)

10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call
 
9th Content Providers Community Call\
9th Content Providers Community Call\9th Content Providers Community Call\
9th Content Providers Community Call\
 
OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)
 
8th Content Providers Community Call
8th Content Providers Community Call8th Content Providers Community Call
8th Content Providers Community Call
 
OpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managersOpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managers
 
What will it cost to manage and share my data?
What will it cost to manage and share my data?What will it cost to manage and share my data?
What will it cost to manage and share my data?
 
6th Content Providers Community Call
6th Content Providers Community Call6th Content Providers Community Call
6th Content Providers Community Call
 
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?
 
20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science
 
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)
 
20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science
 
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
COVID-19: Activities, tools, best practice and contact points in Greece
 COVID-19: Activities, tools, best practice and contact points in Greece COVID-19: Activities, tools, best practice and contact points in Greece
COVID-19: Activities, tools, best practice and contact points in Greece
 
5th Content Providers Community Call
5th Content Providers Community Call5th Content Providers Community Call
5th Content Providers Community Call
 
4th Content Providers Community Call
4th Content Providers Community Call4th Content Providers Community Call
4th Content Providers Community Call
 
3rd Content Providers Community Call
3rd Content Providers Community Call3rd Content Providers Community Call
3rd Content Providers Community Call
 
2nd Content Providers Community Call
2nd Content Providers Community Call2nd Content Providers Community Call
2nd Content Providers Community Call
 
1st Content Providers Community Call
1st Content Providers Community Call1st Content Providers Community Call
1st Content Providers Community Call
 
IPR and Exploitation
IPR and Exploitation IPR and Exploitation
IPR and Exploitation
 

Último

Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
Silpa
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Silpa
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
Silpa
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
ANSARKHAN96
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
Silpa
 

Último (20)

Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
 

20191119_The OpenAIRE Research Graph

  • 1. @openaire_euOpenAIRE-Connect Review 23rd of April, 2018 - Brussels The OpenAIRE Research Graph Bringing scholarly communication back into the hands of scientists PaoloManghi InstituteofInformationScienceandTechnologies ConsiglioNazionaledelleRicerche
  • 2. Materializing the Open Science Graph Project communit y FunderFunding Product Publicatio n Researc h Data Software Organizatio n Source Other res. products Mining Deduplication End-user feedback Scientific product catalogue Harvesting GUIDE LINES Research Infrastructures Publishing IT OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 3. Providing an open metadata research graph of interlinked scientific products, with Open Access information, linked to funding information and research communities The OpenAIRE research graph Open Complete De-duplicated Transparent Participatory Decentralized Trusted
  • 4. De-duplicated More information about the de-duplication framework used by OpenAIRE can be found searching on Zenodo for : • “De-duplicating the OpenAIRE Scholarly Communication Big Graph” (poster) • “GDup: De-Duplication of Scholarly Communication Big Graphs” Metadata records corresponding to equivalent objects are merged Scientific products Organizations
  • 5. Complete: community-trusted sources Academic Graph … and more … and more … and more … and more … and more … and more OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 6. • Rely on quality scholarly communication sources of different kinds Participatory • Include solutions and content from any interested and known content provider in scholarly communication Institutional repositories Aggregators Data archives Software repositories Research infrastructure sources Funder grant databases Authors & Orgs entity registries Publishers & journals
  • 7. • Metadata in the graph includes provenance when harvested and reliability indicators when obtained from mining Transparent
  • 8. • Preservation and ownership beyond OpenAIRE Exchanged with other graph initiatives Broker Service: Redistributed via subscription and notification to contributing data sources (provide.openaire.eu) • Openly accessible via APIs (develop.openaire.eu) Decentralized
  • 9. • Authors in the loop to enrich their ORCID record • Validation of end-user ”claims” Trusted (November 2019)
  • 11. Harvesting: Revised Classification of Research Products Publications • Article • Preprint • Report • … Datasets • Dataset • Collection • Clinical Trials • … Software • Research Software • … Other Research Products • Service • Workflow • Interactive Resource • … Institutional/ publication repositories Journals/ publishers Data repositories Other Products repositories Software repositories Workshop Técnico OpenAIRE / LA Referencia | 29-30 October, 2019 | Costa Rica
  • 12. Open Science publishing Bridging RIs and Scholarly Communication Transparency and reproducibility e-Infrastructures and Research Infrastructures Scholarly Communication infrastructure Dataset Method Thematic Service Dataset Experiment Publishing the experiment Input Dataset Input Method Output Dataset Experiment product Thematic Service Parameters Experiment repo Research data, Software, Workflows, Publications Data repo Method repo Publications IT Harvesting OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 13. • EPOS Research Infrastructure Reproducibility Transparency Seamless publishing Open Science publishing workflows OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 14. Pre-processed sources Article-dataset links 480Mi links CrossRef enriched 85Mi publication records DOIBoost Academic Graph Published every 6 months (new versions to be published next week) OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 15. Context Propagation Product Source Country Project Organization communit y Product Project Source Product Project Product supplementedBy fundedBy hostedBy (institutional repository) located Funder funds (National Funder) fundedBy jurisdiction located ofInterestofInterest fundedBy hostedBy Product supplementedBy 157K 8Mi 10K OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 16. Production: Open Access CAPs BETA: Open Science CAPs 0 10000000 20000000 30000000 40000000 50000000 60000000 70000000 80000000 90000000 100000000 Old CAP New CAP literature 0 2000000 4000000 6000000 8000000 10000000 12000000 Old CAP New CAP research data 0 20000 40000 60000 80000 100000 120000 140000 Old CAP New CAP software 0 500000 1000000 1500000 2000000 2500000 3000000 3500000 4000000 4500000 Old CAP New CAP other 110Mi 30Mi 1Mi 10Mi 100K 180K 3Mi 7.5Mi Harvested content • Data sources 10K + • Records ~480Mi • Publication full-texts ~12Mi (Springer N. coming) • Links (also text-mined) ~960Mi PROD BETA PROD BETA PROD BETAPROD BETA OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 17. Microsoft Research (being drafted) Unpaywall (ongoing) ORCID membership (November 2019) RDA IG Open Science Graphs for FAIR Data FREYA, ResearchGraph, OpenCitations, Open Knowledge Research Graph IG Session at RDA Helsinki 2019 (15th of October 2019) Liaisons Academic Graph OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 18. • October-November 2019: OpenAIRE Research Graph open for consultation Collecting feedback via Trello (operational end of September) • December 2019: OpenAIRE Research Graph in production BETA Graph Open Consultation http://beta.explore.openaire.eu OpenAIREAdvance1stReview|Luxembourg|10Oct2019
  • 19. Trello for for feedback
  • 22. Metadata records files cleaned records Full-text cache Transform Clean Identify equivelent products and organisation s Aggregation subsystem De-duplication subsystem Information Inference subsystem Data Sources Populate Merge equivalent objects Data provision subsystem Collect Native graph “slices” Publishing subsystem Data Monitoring Action Sets (similarity rels) Front-end Native graph Deduped graph Extract full-text Copy of deduped graph Enrich graphs with links Action Set (inferred links) Enriched graph Propagation Text-mining of the full-texts and the graph to derive new semantic links Architecture and technologies: today
  • 23. Task 9.1. System administration - infrastructure: before Jan 2018 Public System 20srv 122CPU 320GB 8TB Mining System 21srv 406CPU 2TB 385TB Data provision System 23srv 154CPU 430GB 23TB Testing System 5srv 30CPU 100GB 3TB Public System 44srv 274CPU 905GB 20TB Mining System 22srv 414CPU 2.2TB 388TB Data provision System 23srv 154CPU 430GB 24TB Testing System 14srv 86CPU 302GB 9TB OpenAIREAdvance1stReview|Luxembourg|10Oct2019

Notas do Editor

  1. How does OpenAIRE materializes the graph? Collection records (dedup) Collection full-texts for OA publications Mining full-texts of publications to find links to data, software, other product, projects, research communities and infrastructures and enhance metadata with affiliation information, subjects/keywords: article-data and data-data links are around 120 Mi, article-article similarity links are around 300Mi
  2. GOAL: High quality open graph for Open (because it must be), Complete (all «trusted»/known sources), Deduplicated (must be disambiguated for statistics), transparent (provenance), participatory (not a closed network), decentralised (ownership and redistribution), trusted (manual curation)
  3. Supported entity types People to come with orcid collaboration Algorithm can be improved but some cases can be handled only manually
  4. Any interested content provider can join the network to provide content. Not a closed network. Interoperability guidelines help in the process.
  5. Mining trust: probability of the mining information to be correct
  6. OpenAIRE DOES NOT own the graph
  7. Supported entity types People to come with orcid collaboration Algorithm can be improved but some cases can be handled only manually
  8. In production today we acquire content according to Open Access-driven CAP: 30 mi pubs with links to other objects (e.g. 1Mi datasets, etc) In BETA we acquire content according to Open Science-driven CAP: this means we collect EVERYTHING (that is in a trusted source) menaing also non-OA content