SlideShare uma empresa Scribd logo
1 de 29
eosc-hub.eu
@EOSC_eu
EOSC-hub receives funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 777536.
EUDAT B2SAFE
2
Motivation and driving consideration about the service
Service architecture and interfaces: overview
- How the user can access the service
 E.g.: REST, GUI, CLIs, etc.
- Service options and attributes
Acceptable Usage Policy (AUP)
Use cases
Documentation/tutorial/information
3/5/2019
Content
3
A research community wants to improve the services offered to its users
granting that:
- their data will still be available after years;
- the data are easily accessible from a researcher with just a browser as well as from a
data manager who needs to transfer massive amounts of data;
- the data are easily discoverable through a well defined set of metadata attributes
and tools;
- the data can be moved to computing resources when needed and back;
- those improvements will not disrupt the user workflows because they will be
inplemented in a transparent way through a seamless integration with the current
community services, which will enforce the authorization policies defined by the
community.
3/5/2019
Motivation
Data planning
Long term data preservation
Data curation
Data access
Data discovery
Well defined API and protocols
Data transfer
Distributed architecture 1
EUDAT infrastructure (CDI)
• Different administrative
domains
• We need to federate
them to offer a common
user management
Easy
access
Distributed architecture 2
Storage systems
User interfaces
Federation
Architecture 1
EUDAT has built an additional layer on top of iRODS to
streamline the processes which supports the replication and
long term data archiving.
iRODS +
EUDAT B2SAFE package +
back-end storage =
B2SAFE service
http://eudat.eu/services/userdoc/configure-b2safe
https://github.com/EUDAT-B2SAFE/B2SAFE-core
Architecture 2
EUDAT B2SAFE package
EUDAT B2SAFE package = rules + scripts
93/5/2019
B2SAFE package rules and scripts
103/5/2019
B2SAFE Data Policy Manager
architecture
DPM
Interfaces towards other services
DPM
metadata
PID catalog
123/5/2019
Interfaces towards other services:
data flow 1
thanks to www.vecteezy.com for the pictures
Community data
Policies: data are stored according to
the rules defined by the community
data are identified
data are registereddata are made discoverable
data can be easily retrieved
data can be easily moved
data are secured
1234
1234
1234
133/5/2019
Interfaces towards other services:
data flow 2
Data are stored according to the
rules defined by the community
Data are identified
Data are made
discoverable
Data are registered
Data can be easily retrieved
Data are secured
Data can be easily moved
A set of EUDAT rules is defined: they
implement the most common data flows.
Community specific rules are added when needed
Longtermdatapreservation
Persistent Identifiers (PIDs) are associated to the data and
registered in the B2HANDLE service
Persistent Identifiers (PIDs) are globally resolvable, they can
be used in B2SHARE and B2STAGE services
Data are replicated according to the defined policy across
different nodes of the EUDAT CDI, making them tolerant to
single node failures and single copy corruption
HTTP API and GridFTP allow to download and upload
data using standard protocols.
Data
discovery
Data
transfer
143/5/2019
Interfaces towards other services:
Data Policy Manager 1
DPM
Definition of policies for data
management
Policies life cycle
management
Policies translation
Policies enforcement
User
authentication
?
Data
manager
Resource
provider
Resource provider
feedback
153/5/2019
Interfaces towards other services:
Data Policy Manager 2
DPM relies on B2ACCESS for the authentication
through the Shibboleth protocol
Policies are implemented as XML documents which
can be created through a web portal
The B2SAFE rules are scheduled according to the
policy trigger and executed by the rule engine
The policies, described in high level language, are
translated into B2SAFE rules
The status of the policy is reported back to the data
manager. It can be waiting in a queue, enforced,
rejected by the resource provider or completed
User authentication
Definition of policies for
data management
Policies translation
Policies enforcement
Resource provider
feedback
Datacuration
Policies life cycle
management
Policies are stored in an XML DB and identified through a
unique id. They can be modified and removed
iRODS icommands: it is a set of CLI commands which can be
deployed through RPM or DEB packages.
(https://irods.org/download)
Davrods: it is webDAV interface on top of iRODS.
(https://github.com/UtrechtUniversity/davrods)
The B2STAGE service offers two interfaces for B2SAFE:
- The GridFTP iRODS-DSI to enable fast data transfer through the GridFTP
protocol;
- HTTP API interface to furnish a RESTful interface towards EUDAT
services.
How to access the service 1
The GridFTP iRODS-DSI
● DSI (Data Storage Interface): GridFTP can be extended to
support different underlying storage system
● Implemented making use of the iRODS C API
● Supports the main iRODS operations (get, put, delete, list,
checksum calculation)
UberFTP
Globus Online
globus-url-copy
WebFTS
FTS3 Rest CLI
data
The GridFTP iRODS-DSI allows users to manage
data on EUDAT nodes (B2SAFE) through any
standard GridFTP client
183/5/2019
HTTP API
User is authenticating
with username/password
Upload
Download
Oauth2: HTTP API get a oauth2 token from
B2ACCESS and provides an api token to the
user
data are streamed from the http
client to b2safe, avoiding to cache
them at the HTTP API server
B2SAFE validates the
oauth2 token and gets
user attributes to map
the user on a local
account
HTTP API talks with
B2SAFE on behalf of
the user, using the
oauth2 token
data are streamed from b2safe,to the http client, avoiding
to cache them at the HTTP API server
193/5/2019
HTTP API authentication
User claudio is authenticating with username/password
A new file is ready to be uploaded
203/5/2019
HTTP API upload / download
Upload
Download
213/5/2019
DPM web portal: policy editor
22
http://hdl.handle.net/11304/e43b2e3f-83c5-4e3f-b8b7-18d38d37a6cd
3/5/2019
Acceptable Usage Policy
233/5/2019
Featured use cases
Use cases
CLARIN
https://www.eudat.eu/communities/common-language-resources-and-
technology-infrastructure
ClimateModel
https://www.eudat.eu/communities/support-to-scientific-research-on-
seasonal-to-decadal-climate-and-air-quality-modelling
EISCAT https://www.eudat.eu/communities/unified-access-to-eiscat-radar-data
EPOS https://www.eudat.eu/communities/european-plate-observing-system
Herbadrop
https://www.eudat.eu/communities/long-term-preservation-of-herbarium-
specimen-images
IST
https://www.eudat.eu/communities/eudat-services-to-guarantee-long-
time-archiving-and-visibility-to-the-repository-of-ist
VPH https://www.eudat.eu/communities/virtual-humans
SDC https://www.seadatanet.org/About-us/SeaDataCloud
24
The SeaDataNet portal (CDI: Common Data Index) collects only part of
the data produced by more than one hundreds of marine research
institutions.
The others are stored locally from the institutions and offered to the
users after a request via email. They are made accessible via a
temporary web service endpoint.
The quality checks are performed by the local institutions, without any
central mechanism, therefore the risk of inconsistencies and
duplications is high.
There is not a Virtual Research Environment, but a set of desktop and
web applications , independent from each other. The user is forced to
upload the data set that she wants to analyze and to download the
result: there is not a shared data space, neither there is a personal one.
3/5/2019
SeaDataCloud: the challenge
253/5/2019
SeaDataCloud: b2safe and b2stage
B2HANDLE
26
B2SAFE and B2STAGE services are hidden behind the community web portal (CDI) which takes care to
manage user and community specific metadata registration (DATA DISCOVERY).
Each of the five EUDAT data centers offers a B2SAFE instance federated with the others.
Each data center provides two storage areas:
- one for the ingestion of the new data uploaded by the data producers, which are the hundreds of marine
science institutions of SeaDataNet (DATA TRANSFER);
- one for the production ready data, which have been validated by the data manager through the community
web portal.
The community web portal triggers quality check workflows on the B2SAFE and B2HOST side (DATA
PLANNING, DATA CURATION).
Once moved into the production area, the data are replicated following a star pattern: each replica has
the same master copy. And a B2HANDLE PID is associated to them (LONG TERM DATA PRESERVATION)
Data can then be shared with applications running on the B2HOST environment (DATA TRANSFER)
3/5/2019
SeaDataCloud: the solution
273/5/2019
SeaDataCloud: b2safe solution
B2HANDLE
Data planning
SDC community web
portal
Data discovery
Data
access
Data
transfer
Data transfer
Long term data preservation
Data curation
28
https://eudat.eu/services/userdoc/b2safe
https://github.com/EUDAT-Training/B2SAFE-B2STAGE-
Training
https://github.com/EUDAT-B2SAFE/B2SAFE-core/wiki
https://github.com/EUDAT-B2SAFE/B2SAFE-DPM/wiki
3/5/2019
Documentations
eosc-hub.eu @EOSC_eu

Mais conteúdo relacionado

Semelhante a EUDAT B2SAFE & EOSC-hub

EUDAT Booklet Mar22 (2).pdf
EUDAT Booklet Mar22 (2).pdfEUDAT Booklet Mar22 (2).pdf
EUDAT Booklet Mar22 (2).pdfEUDAT
 
Information server prototype for GFIS
Information server prototype for GFISInformation server prototype for GFIS
Information server prototype for GFISJarmo Saarikko
 
Data Preservation Service Area
Data Preservation Service AreaData Preservation Service Area
Data Preservation Service AreaEUDAT
 
D3.4.1 Data fusion tools
D3.4.1 Data fusion toolsD3.4.1 Data fusion tools
D3.4.1 Data fusion toolsFOODIE_Project
 
B2STAGE- how to shift large amounts of data| www.eudat.eu |
B2STAGE- how to shift large amounts of data| www.eudat.eu | B2STAGE- how to shift large amounts of data| www.eudat.eu |
B2STAGE- how to shift large amounts of data| www.eudat.eu | EUDAT
 
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu |
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu | How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu |
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu | EUDAT
 
EGI-EUDAT interoperability| www.eudat.eu |
EGI-EUDAT interoperability| www.eudat.eu | EGI-EUDAT interoperability| www.eudat.eu |
EGI-EUDAT interoperability| www.eudat.eu | EUDAT
 
DataPorts Blockchain Overview
DataPorts Blockchain OverviewDataPorts Blockchain Overview
DataPorts Blockchain OverviewDataPortsProject
 
EOSC-hub & EUDAT B2FIND service
EOSC-hub & EUDAT B2FIND serviceEOSC-hub & EUDAT B2FIND service
EOSC-hub & EUDAT B2FIND serviceEOSC-hub project
 
EUDAT Collaborative Data Infrastructure: Data Access and Re-use Service Area
EUDAT Collaborative Data Infrastructure: Data Access and Re-use Service AreaEUDAT Collaborative Data Infrastructure: Data Access and Re-use Service Area
EUDAT Collaborative Data Infrastructure: Data Access and Re-use Service AreaEUDAT
 
A cloud environment for backup and data storage
A cloud environment for backup and data storageA cloud environment for backup and data storage
A cloud environment for backup and data storageIGEEKS TECHNOLOGIES
 
A cloud enviroment for backup and data storage
A cloud enviroment for backup and data storageA cloud enviroment for backup and data storage
A cloud enviroment for backup and data storageIGEEKS TECHNOLOGIES
 
An efficient and secure data storage in cloud computing using modified RSA pu...
An efficient and secure data storage in cloud computing using modified RSA pu...An efficient and secure data storage in cloud computing using modified RSA pu...
An efficient and secure data storage in cloud computing using modified RSA pu...IJECEIAES
 
D3.1.2 heterogeneous data repositories and related services
D3.1.2 heterogeneous data repositories and related servicesD3.1.2 heterogeneous data repositories and related services
D3.1.2 heterogeneous data repositories and related servicesFOODIE_Project
 
Automatic Management of Wireless Sensor Networks through Cloud Computing
Automatic Management of Wireless Sensor Networks through Cloud ComputingAutomatic Management of Wireless Sensor Networks through Cloud Computing
Automatic Management of Wireless Sensor Networks through Cloud Computingyousef emami
 
SeaDataCloud - upgrading the Common Data Index (CDI) Data Discovery and Acces...
SeaDataCloud - upgrading the Common Data Index (CDI) Data Discovery and Acces...SeaDataCloud - upgrading the Common Data Index (CDI) Data Discovery and Acces...
SeaDataCloud - upgrading the Common Data Index (CDI) Data Discovery and Acces...EUDAT
 
Linked Data for Ambient Intelligence
Linked Data for Ambient IntelligenceLinked Data for Ambient Intelligence
Linked Data for Ambient IntelligenceFulvio Corno
 
App for peer-to-peer file transfer
App for peer-to-peer file transferApp for peer-to-peer file transfer
App for peer-to-peer file transferIRJET Journal
 

Semelhante a EUDAT B2SAFE & EOSC-hub (20)

EUDAT Booklet Mar22 (2).pdf
EUDAT Booklet Mar22 (2).pdfEUDAT Booklet Mar22 (2).pdf
EUDAT Booklet Mar22 (2).pdf
 
EUDAT B2DROP & EOSC-hub
EUDAT B2DROP & EOSC-hubEUDAT B2DROP & EOSC-hub
EUDAT B2DROP & EOSC-hub
 
Information server prototype for GFIS
Information server prototype for GFISInformation server prototype for GFIS
Information server prototype for GFIS
 
Data Preservation Service Area
Data Preservation Service AreaData Preservation Service Area
Data Preservation Service Area
 
D3.4.1 Data fusion tools
D3.4.1 Data fusion toolsD3.4.1 Data fusion tools
D3.4.1 Data fusion tools
 
B2STAGE- how to shift large amounts of data| www.eudat.eu |
B2STAGE- how to shift large amounts of data| www.eudat.eu | B2STAGE- how to shift large amounts of data| www.eudat.eu |
B2STAGE- how to shift large amounts of data| www.eudat.eu |
 
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu |
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu | How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu |
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu |
 
EGI-EUDAT interoperability| www.eudat.eu |
EGI-EUDAT interoperability| www.eudat.eu | EGI-EUDAT interoperability| www.eudat.eu |
EGI-EUDAT interoperability| www.eudat.eu |
 
Dq36708711
Dq36708711Dq36708711
Dq36708711
 
DataPorts Blockchain Overview
DataPorts Blockchain OverviewDataPorts Blockchain Overview
DataPorts Blockchain Overview
 
EOSC-hub & EUDAT B2FIND service
EOSC-hub & EUDAT B2FIND serviceEOSC-hub & EUDAT B2FIND service
EOSC-hub & EUDAT B2FIND service
 
EUDAT Collaborative Data Infrastructure: Data Access and Re-use Service Area
EUDAT Collaborative Data Infrastructure: Data Access and Re-use Service AreaEUDAT Collaborative Data Infrastructure: Data Access and Re-use Service Area
EUDAT Collaborative Data Infrastructure: Data Access and Re-use Service Area
 
A cloud environment for backup and data storage
A cloud environment for backup and data storageA cloud environment for backup and data storage
A cloud environment for backup and data storage
 
A cloud enviroment for backup and data storage
A cloud enviroment for backup and data storageA cloud enviroment for backup and data storage
A cloud enviroment for backup and data storage
 
An efficient and secure data storage in cloud computing using modified RSA pu...
An efficient and secure data storage in cloud computing using modified RSA pu...An efficient and secure data storage in cloud computing using modified RSA pu...
An efficient and secure data storage in cloud computing using modified RSA pu...
 
D3.1.2 heterogeneous data repositories and related services
D3.1.2 heterogeneous data repositories and related servicesD3.1.2 heterogeneous data repositories and related services
D3.1.2 heterogeneous data repositories and related services
 
Automatic Management of Wireless Sensor Networks through Cloud Computing
Automatic Management of Wireless Sensor Networks through Cloud ComputingAutomatic Management of Wireless Sensor Networks through Cloud Computing
Automatic Management of Wireless Sensor Networks through Cloud Computing
 
SeaDataCloud - upgrading the Common Data Index (CDI) Data Discovery and Acces...
SeaDataCloud - upgrading the Common Data Index (CDI) Data Discovery and Acces...SeaDataCloud - upgrading the Common Data Index (CDI) Data Discovery and Acces...
SeaDataCloud - upgrading the Common Data Index (CDI) Data Discovery and Acces...
 
Linked Data for Ambient Intelligence
Linked Data for Ambient IntelligenceLinked Data for Ambient Intelligence
Linked Data for Ambient Intelligence
 
App for peer-to-peer file transfer
App for peer-to-peer file transferApp for peer-to-peer file transfer
App for peer-to-peer file transfer
 

Mais de EOSC-hub project

EOSC-hub Early Adopter Programme
EOSC-hub Early Adopter ProgrammeEOSC-hub Early Adopter Programme
EOSC-hub Early Adopter ProgrammeEOSC-hub project
 
2019 05-21 egi and eosc - final
2019 05-21 egi and eosc - final2019 05-21 egi and eosc - final
2019 05-21 egi and eosc - finalEOSC-hub project
 
Introduction to service management and FitSM
Introduction to service management and FitSMIntroduction to service management and FitSM
Introduction to service management and FitSMEOSC-hub project
 
Service management board (SMB), Service providers’ forum (SPF)
Service management board (SMB), Service providers’ forum (SPF)Service management board (SMB), Service providers’ forum (SPF)
Service management board (SMB), Service providers’ forum (SPF)EOSC-hub project
 
Joining the EOSC-hub as a Service Provider
Joining the EOSC-hub as a Service ProviderJoining the EOSC-hub as a Service Provider
Joining the EOSC-hub as a Service ProviderEOSC-hub project
 
PID services - understandability and findability of data
PID services - understandability and findability of dataPID services - understandability and findability of data
PID services - understandability and findability of dataEOSC-hub project
 
Software for data management and exploitation
Software for data management and exploitationSoftware for data management and exploitation
Software for data management and exploitationEOSC-hub project
 
Repositories for long-term preservation - certification
Repositories for long-term preservation - certificationRepositories for long-term preservation - certification
Repositories for long-term preservation - certificationEOSC-hub project
 
EOSC working group on FAIR
EOSC working group on FAIREOSC working group on FAIR
EOSC working group on FAIREOSC-hub project
 
Updates on the FAIR Data Maturity Model RDA Working Group & the DG RTD FAIR i...
Updates on the FAIR Data Maturity Model RDA Working Group & the DG RTD FAIR i...Updates on the FAIR Data Maturity Model RDA Working Group & the DG RTD FAIR i...
Updates on the FAIR Data Maturity Model RDA Working Group & the DG RTD FAIR i...EOSC-hub project
 
Services to support FAIR data - Introduction
Services to support FAIR data - IntroductionServices to support FAIR data - Introduction
Services to support FAIR data - IntroductionEOSC-hub project
 
Pathways for EOSC-hub and MaX collaboration
Pathways for EOSC-hub and MaX collaborationPathways for EOSC-hub and MaX collaboration
Pathways for EOSC-hub and MaX collaborationEOSC-hub project
 
Overview on the HPC CoEs panorama
Overview on the HPC CoEs panoramaOverview on the HPC CoEs panorama
Overview on the HPC CoEs panoramaEOSC-hub project
 
Overview of the Onboarding and validation process and the Rules of Participat...
Overview of the Onboarding and validation process and the Rules of Participat...Overview of the Onboarding and validation process and the Rules of Participat...
Overview of the Onboarding and validation process and the Rules of Participat...EOSC-hub project
 
ELIXIR Competence Centre in EOSC-hub
ELIXIR Competence Centre in EOSC-hubELIXIR Competence Centre in EOSC-hub
ELIXIR Competence Centre in EOSC-hubEOSC-hub project
 

Mais de EOSC-hub project (20)

EOSC-hub Early Adopter Programme
EOSC-hub Early Adopter ProgrammeEOSC-hub Early Adopter Programme
EOSC-hub Early Adopter Programme
 
2019 05-21 egi and eosc - final
2019 05-21 egi and eosc - final2019 05-21 egi and eosc - final
2019 05-21 egi and eosc - final
 
Introduction to service management and FitSM
Introduction to service management and FitSMIntroduction to service management and FitSM
Introduction to service management and FitSM
 
Service management board (SMB), Service providers’ forum (SPF)
Service management board (SMB), Service providers’ forum (SPF)Service management board (SMB), Service providers’ forum (SPF)
Service management board (SMB), Service providers’ forum (SPF)
 
Joining the EOSC-hub as a Service Provider
Joining the EOSC-hub as a Service ProviderJoining the EOSC-hub as a Service Provider
Joining the EOSC-hub as a Service Provider
 
PID services - understandability and findability of data
PID services - understandability and findability of dataPID services - understandability and findability of data
PID services - understandability and findability of data
 
Software for data management and exploitation
Software for data management and exploitationSoftware for data management and exploitation
Software for data management and exploitation
 
Repositories for long-term preservation - certification
Repositories for long-term preservation - certificationRepositories for long-term preservation - certification
Repositories for long-term preservation - certification
 
EOSC working group on FAIR
EOSC working group on FAIREOSC working group on FAIR
EOSC working group on FAIR
 
Updates on the FAIR Data Maturity Model RDA Working Group & the DG RTD FAIR i...
Updates on the FAIR Data Maturity Model RDA Working Group & the DG RTD FAIR i...Updates on the FAIR Data Maturity Model RDA Working Group & the DG RTD FAIR i...
Updates on the FAIR Data Maturity Model RDA Working Group & the DG RTD FAIR i...
 
Services to support FAIR data - Introduction
Services to support FAIR data - IntroductionServices to support FAIR data - Introduction
Services to support FAIR data - Introduction
 
EOSC-synergy
EOSC-synergyEOSC-synergy
EOSC-synergy
 
ExPaNDS
ExPaNDSExPaNDS
ExPaNDS
 
EOSC-Pillar
EOSC-PillarEOSC-Pillar
EOSC-Pillar
 
NI4OS-Europe
NI4OS-EuropeNI4OS-Europe
NI4OS-Europe
 
Excellerat CoE
Excellerat CoEExcellerat CoE
Excellerat CoE
 
Pathways for EOSC-hub and MaX collaboration
Pathways for EOSC-hub and MaX collaborationPathways for EOSC-hub and MaX collaboration
Pathways for EOSC-hub and MaX collaboration
 
Overview on the HPC CoEs panorama
Overview on the HPC CoEs panoramaOverview on the HPC CoEs panorama
Overview on the HPC CoEs panorama
 
Overview of the Onboarding and validation process and the Rules of Participat...
Overview of the Onboarding and validation process and the Rules of Participat...Overview of the Onboarding and validation process and the Rules of Participat...
Overview of the Onboarding and validation process and the Rules of Participat...
 
ELIXIR Competence Centre in EOSC-hub
ELIXIR Competence Centre in EOSC-hubELIXIR Competence Centre in EOSC-hub
ELIXIR Competence Centre in EOSC-hub
 

Último

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 

Último (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 

EUDAT B2SAFE & EOSC-hub

  • 1. eosc-hub.eu @EOSC_eu EOSC-hub receives funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 777536. EUDAT B2SAFE
  • 2. 2 Motivation and driving consideration about the service Service architecture and interfaces: overview - How the user can access the service  E.g.: REST, GUI, CLIs, etc. - Service options and attributes Acceptable Usage Policy (AUP) Use cases Documentation/tutorial/information 3/5/2019 Content
  • 3. 3 A research community wants to improve the services offered to its users granting that: - their data will still be available after years; - the data are easily accessible from a researcher with just a browser as well as from a data manager who needs to transfer massive amounts of data; - the data are easily discoverable through a well defined set of metadata attributes and tools; - the data can be moved to computing resources when needed and back; - those improvements will not disrupt the user workflows because they will be inplemented in a transparent way through a seamless integration with the current community services, which will enforce the authorization policies defined by the community. 3/5/2019 Motivation Data planning Long term data preservation Data curation Data access Data discovery Well defined API and protocols Data transfer
  • 4. Distributed architecture 1 EUDAT infrastructure (CDI) • Different administrative domains • We need to federate them to offer a common user management Easy access
  • 5. Distributed architecture 2 Storage systems User interfaces Federation
  • 7. EUDAT has built an additional layer on top of iRODS to streamline the processes which supports the replication and long term data archiving. iRODS + EUDAT B2SAFE package + back-end storage = B2SAFE service http://eudat.eu/services/userdoc/configure-b2safe https://github.com/EUDAT-B2SAFE/B2SAFE-core Architecture 2
  • 8. EUDAT B2SAFE package EUDAT B2SAFE package = rules + scripts
  • 10. 103/5/2019 B2SAFE Data Policy Manager architecture DPM
  • 11. Interfaces towards other services DPM metadata PID catalog
  • 12. 123/5/2019 Interfaces towards other services: data flow 1 thanks to www.vecteezy.com for the pictures Community data Policies: data are stored according to the rules defined by the community data are identified data are registereddata are made discoverable data can be easily retrieved data can be easily moved data are secured 1234 1234 1234
  • 13. 133/5/2019 Interfaces towards other services: data flow 2 Data are stored according to the rules defined by the community Data are identified Data are made discoverable Data are registered Data can be easily retrieved Data are secured Data can be easily moved A set of EUDAT rules is defined: they implement the most common data flows. Community specific rules are added when needed Longtermdatapreservation Persistent Identifiers (PIDs) are associated to the data and registered in the B2HANDLE service Persistent Identifiers (PIDs) are globally resolvable, they can be used in B2SHARE and B2STAGE services Data are replicated according to the defined policy across different nodes of the EUDAT CDI, making them tolerant to single node failures and single copy corruption HTTP API and GridFTP allow to download and upload data using standard protocols. Data discovery Data transfer
  • 14. 143/5/2019 Interfaces towards other services: Data Policy Manager 1 DPM Definition of policies for data management Policies life cycle management Policies translation Policies enforcement User authentication ? Data manager Resource provider Resource provider feedback
  • 15. 153/5/2019 Interfaces towards other services: Data Policy Manager 2 DPM relies on B2ACCESS for the authentication through the Shibboleth protocol Policies are implemented as XML documents which can be created through a web portal The B2SAFE rules are scheduled according to the policy trigger and executed by the rule engine The policies, described in high level language, are translated into B2SAFE rules The status of the policy is reported back to the data manager. It can be waiting in a queue, enforced, rejected by the resource provider or completed User authentication Definition of policies for data management Policies translation Policies enforcement Resource provider feedback Datacuration Policies life cycle management Policies are stored in an XML DB and identified through a unique id. They can be modified and removed
  • 16. iRODS icommands: it is a set of CLI commands which can be deployed through RPM or DEB packages. (https://irods.org/download) Davrods: it is webDAV interface on top of iRODS. (https://github.com/UtrechtUniversity/davrods) The B2STAGE service offers two interfaces for B2SAFE: - The GridFTP iRODS-DSI to enable fast data transfer through the GridFTP protocol; - HTTP API interface to furnish a RESTful interface towards EUDAT services. How to access the service 1
  • 17. The GridFTP iRODS-DSI ● DSI (Data Storage Interface): GridFTP can be extended to support different underlying storage system ● Implemented making use of the iRODS C API ● Supports the main iRODS operations (get, put, delete, list, checksum calculation) UberFTP Globus Online globus-url-copy WebFTS FTS3 Rest CLI data The GridFTP iRODS-DSI allows users to manage data on EUDAT nodes (B2SAFE) through any standard GridFTP client
  • 18. 183/5/2019 HTTP API User is authenticating with username/password Upload Download Oauth2: HTTP API get a oauth2 token from B2ACCESS and provides an api token to the user data are streamed from the http client to b2safe, avoiding to cache them at the HTTP API server B2SAFE validates the oauth2 token and gets user attributes to map the user on a local account HTTP API talks with B2SAFE on behalf of the user, using the oauth2 token data are streamed from b2safe,to the http client, avoiding to cache them at the HTTP API server
  • 19. 193/5/2019 HTTP API authentication User claudio is authenticating with username/password A new file is ready to be uploaded
  • 20. 203/5/2019 HTTP API upload / download Upload Download
  • 21. 213/5/2019 DPM web portal: policy editor
  • 23. 233/5/2019 Featured use cases Use cases CLARIN https://www.eudat.eu/communities/common-language-resources-and- technology-infrastructure ClimateModel https://www.eudat.eu/communities/support-to-scientific-research-on- seasonal-to-decadal-climate-and-air-quality-modelling EISCAT https://www.eudat.eu/communities/unified-access-to-eiscat-radar-data EPOS https://www.eudat.eu/communities/european-plate-observing-system Herbadrop https://www.eudat.eu/communities/long-term-preservation-of-herbarium- specimen-images IST https://www.eudat.eu/communities/eudat-services-to-guarantee-long- time-archiving-and-visibility-to-the-repository-of-ist VPH https://www.eudat.eu/communities/virtual-humans SDC https://www.seadatanet.org/About-us/SeaDataCloud
  • 24. 24 The SeaDataNet portal (CDI: Common Data Index) collects only part of the data produced by more than one hundreds of marine research institutions. The others are stored locally from the institutions and offered to the users after a request via email. They are made accessible via a temporary web service endpoint. The quality checks are performed by the local institutions, without any central mechanism, therefore the risk of inconsistencies and duplications is high. There is not a Virtual Research Environment, but a set of desktop and web applications , independent from each other. The user is forced to upload the data set that she wants to analyze and to download the result: there is not a shared data space, neither there is a personal one. 3/5/2019 SeaDataCloud: the challenge
  • 26. 26 B2SAFE and B2STAGE services are hidden behind the community web portal (CDI) which takes care to manage user and community specific metadata registration (DATA DISCOVERY). Each of the five EUDAT data centers offers a B2SAFE instance federated with the others. Each data center provides two storage areas: - one for the ingestion of the new data uploaded by the data producers, which are the hundreds of marine science institutions of SeaDataNet (DATA TRANSFER); - one for the production ready data, which have been validated by the data manager through the community web portal. The community web portal triggers quality check workflows on the B2SAFE and B2HOST side (DATA PLANNING, DATA CURATION). Once moved into the production area, the data are replicated following a star pattern: each replica has the same master copy. And a B2HANDLE PID is associated to them (LONG TERM DATA PRESERVATION) Data can then be shared with applications running on the B2HOST environment (DATA TRANSFER) 3/5/2019 SeaDataCloud: the solution
  • 27. 273/5/2019 SeaDataCloud: b2safe solution B2HANDLE Data planning SDC community web portal Data discovery Data access Data transfer Data transfer Long term data preservation Data curation