SlideShare uma empresa Scribd logo
1 de 23
www.structuralbiology.eu
A distributed infrastructure for integrated structural biology
Antonio Rosato
rosato@cerm.unifi.it
This presentation is under the CC-BY-NC-SA license
What is Structural Biology and the SB Interest Group
Chairs:
Lucia Banci (U. Florence)
Chris Morris (STFC)
Antonio Rosato (U. Florence)
RDA Plenary 3 provided most of the indications that
lead to the implementation of the West-Life VRE
Fields of research underpinned and supported by BMS RIs
16 Instruct Centres
157 platforms offered
for access
>2650 registered users
6 National Affiliated
Centres
11 user networks
Operational since 2012
For all Instruct platforms:
www.structuralbiology.eu
Instruct is the major player for integrative structural biology
across Europe, making possible to tackle demanding projects that
require joint technical efforts.
It links also to other biomedical infrastructures via the CORBEL
project.
Instruct offers
economic gains
through new
marketable
technologies, new
drug and vaccine
development,
and improved
health.
Impact
Scarselli,Cantini,Banci,Rappuolietal.,
ScienceTransl.Med.2011
By knowing the structural properties of the antigens
and of the epitopes in all the variants, a chimera
antigen was produced which elicits complete
protective immunity. Patent WO 2011051893 A1
Structure-based design of a Vaccine
against Meningococcus B
What do we have and what do we need regarding DATA?
 Open Data culture pervasive in the Life Sciences (except
pharma). However this only applies to final (i.e. atomic
coordinates) and supporting (~intermediate) data. Structures
have their own DOIs, so are citable
 Structural data are discoverable and accessible via the
wwPDB. Supporting data not necessarily so (but
INSTRUCT mandates that). Raw data are not discoverable,
usually unregistered
 There have been significant efforts to ensure that
supporting data are reusable, i.e. structural data can be
re-computed
 The above applies to traditional, single-technique
studies
What do we have and what do we need regarding DATA?
In the INSTRUCT data policy:
"storage of data is the responsibility of the User to whom it
belongs. [...] Instruct Centres are not required to take responsibility for storing
data beyond the immediate acquisition visit or the time taken for post
experimental analysis if the latter is also provided by the Centre. However, Instruct
Centres aspire to offer an archive to store data, especially in cases where the data
volume makes this more practical that transferring the data, although there is not
a concrete time limit for this implementation"
 The data belong to the Users until they are published.
Then, the Users must make the data available
 Specialized trusted repositories exist for most
individual types of structural data.
What about integration of different structural data?
INSTRUCT is “Integrative Structural Biology”
However, if the data
from each technique
are deposited in
separate repositories,
re-use will be not
possible
The wwPDB task force
recommended:
In addition to archiving the models themselves, all relevant experimental data
and metadata as well as experimental and computational protocols should be
archived. Inclusivity is key.
A. Sali et al. Structure 2015
Key issues for INSTRUCT users
 SB researchers visit multiple experimental facilities
and use multiple techniques to obtain info on their
systems
 Correspondingly, they collect data in a variety of
different formats and described different
metadata, even for the same feature (e.g. sample
preparation
 The aforementioned visits take place over several
weeks or even months, also depending on the
information obtained
 Facilities have different policies for users’ data
storage, sometimes providing only very short term
storage
What do we need regarding DATA?
With integrated structural biology becoming routine we will
need to
 Re-evaluate and merge ontologies and metadata in a
cross-technique manner.
 Metadata need to be generated early in integrated SB to
allow data to be meaningfully combined later on.
However the infrastructure only sees the users when their
samples are ready.
 Document how raw/intermediate data are combined to
get to a final structural description.
Users worry more about data
processing than data management
Agree
Disagree
If a web portal offers
integrated access to
data archives and to
processing software,
I will use it
Last year I
discarded some
samples or files
because their
provenance was
not recorded
well enough.
Last year I
repeated
some work
because I
could not
find the
sample or file
produced.
I would use
combined
techniques if
the software
was easier to
get and use.
If a web portal
offers
integrated
access to data
archives and to
processing
software, I will
use it
Potential
conflict
with the
open
access
paradigm
A VRE for structural biology
west-life.eu
For a description of the life cycle of SB data: https://goo.gl/G8wqrE
Toward a new VRE portal
Also: OneData from the INDIGO-DataCloud project
Virtualising portals using EC3
Data challenges and opportunities for a
federated data infrastructure
• Provide collaborative environments to share / process /
analyze data (“à la Dropbox”)
• Provide simple mechanisms to manage the data: sharing /
protection / deletion
• Various access levels depending on security requirements
– Personal X.509 certificated – not user friendly
– robot certificates
– open access
• Control data quality
• Provide mechanisms for citation / recognition
Schematic diagram of a structure determination data pipeline.
From Westbrook et al. NAR, 2003
Open data access
• What is the data lifecycle in Structural Biology?
• Often linked to a specific grant
Current data model
Multiple experimental methods
and facilities
• Mean 3.9 techniques / respondent
– Biologists, not technique experts
324 respondents Other techniques used in past year:
X-ray
diffraction
NMR Spectroscopy
Electron
Microscopy
SAXS Modelling CD SPR Calorimetry Total
Preferred technique:
X-ray diffraction 52 36 68 89 74 64 80 463
NMR spectroscopy 35 9 21 44 50 24 39 222
Electron microscopy 12 3 6 13 6 6 10 56
SAXS 15 8 5 14 21 15 17 95
Modelling 20 14 5 10 24 21 22 116
Total 82 77 55 105 160 175 130 168 952

Mais conteúdo relacionado

Mais procurados

dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021dkNET
 
Rda in a_nutshell_february_2017_updated
Rda in a_nutshell_february_2017_updatedRda in a_nutshell_february_2017_updated
Rda in a_nutshell_february_2017_updatedResearch Data Alliance
 
RDA FAIR Data Maturity Model
RDA FAIR Data Maturity ModelRDA FAIR Data Maturity Model
RDA FAIR Data Maturity ModelOpenAIRE
 
FAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeFAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeTom Plasterer
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Tom Plasterer
 
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...EUDAT
 
Linked Data for Biopharma
Linked Data for BiopharmaLinked Data for Biopharma
Linked Data for BiopharmaTom Plasterer
 
Altman RDAP11 Policy-based Data Management
Altman RDAP11 Policy-based Data ManagementAltman RDAP11 Policy-based Data Management
Altman RDAP11 Policy-based Data ManagementASIS&T
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchersSarah Jones
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataTom Plasterer
 
THOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier LinkingTHOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier LinkingMaaike Duine
 
Smith RDAP11 NSF Data Management Plan Case Studies
Smith RDAP11 NSF Data Management Plan Case StudiesSmith RDAP11 NSF Data Management Plan Case Studies
Smith RDAP11 NSF Data Management Plan Case StudiesASIS&T
 
BioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative AdvantageBioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative AdvantageTom Plasterer
 
Global registries initiative frumkin omodei
Global registries initiative frumkin omodeiGlobal registries initiative frumkin omodei
Global registries initiative frumkin omodeiASIS&T
 
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...Tom Plasterer
 
LIBER Webinar: 23 Things About Research Data Management
LIBER Webinar: 23 Things About Research Data ManagementLIBER Webinar: 23 Things About Research Data Management
LIBER Webinar: 23 Things About Research Data ManagementLIBER Europe
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsTom Plasterer
 

Mais procurados (20)

dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
 
Rda in a_nutshell_february_2017_updated
Rda in a_nutshell_february_2017_updatedRda in a_nutshell_february_2017_updated
Rda in a_nutshell_february_2017_updated
 
RDA FAIR Data Maturity Model
RDA FAIR Data Maturity ModelRDA FAIR Data Maturity Model
RDA FAIR Data Maturity Model
 
FAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeFAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to Practice
 
Borgman - Privacy, Policy and Data Governance in the University
Borgman - Privacy, Policy and Data Governance in the UniversityBorgman - Privacy, Policy and Data Governance in the University
Borgman - Privacy, Policy and Data Governance in the University
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
 
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
FAIR Data in Trustworthy Data Repositories Webinar - 12-13 December 2016| www...
 
Linked Data for Biopharma
Linked Data for BiopharmaLinked Data for Biopharma
Linked Data for Biopharma
 
Altman RDAP11 Policy-based Data Management
Altman RDAP11 Policy-based Data ManagementAltman RDAP11 Policy-based Data Management
Altman RDAP11 Policy-based Data Management
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* Data
 
THOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier LinkingTHOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier Linking
 
Tijerina-RDA-NISO-Task Groups-sept11
Tijerina-RDA-NISO-Task Groups-sept11Tijerina-RDA-NISO-Task Groups-sept11
Tijerina-RDA-NISO-Task Groups-sept11
 
Smith RDAP11 NSF Data Management Plan Case Studies
Smith RDAP11 NSF Data Management Plan Case StudiesSmith RDAP11 NSF Data Management Plan Case Studies
Smith RDAP11 NSF Data Management Plan Case Studies
 
BioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative AdvantageBioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative Advantage
 
Global registries initiative frumkin omodei
Global registries initiative frumkin omodeiGlobal registries initiative frumkin omodei
Global registries initiative frumkin omodei
 
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
 
LIBER Webinar: 23 Things About Research Data Management
LIBER Webinar: 23 Things About Research Data ManagementLIBER Webinar: 23 Things About Research Data Management
LIBER Webinar: 23 Things About Research Data Management
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge Graphs
 
Levine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal ConsiderationsLevine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal Considerations
 

Destaque

ACTRIS - Aerosol, Clouds, and Trace gases Research Infrastructure
ACTRIS - Aerosol, Clouds, and Trace gases Research Infrastructure ACTRIS - Aerosol, Clouds, and Trace gases Research Infrastructure
ACTRIS - Aerosol, Clouds, and Trace gases Research Infrastructure Research Data Alliance
 
Cultural Heritage: when data are much worst than one can believe
Cultural Heritage: when data are much worst than one can believe Cultural Heritage: when data are much worst than one can believe
Cultural Heritage: when data are much worst than one can believe Research Data Alliance
 
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...Mark Wilkinson
 
EGI Services for Structural Biology - S Brewer EGI
EGI Services for Structural Biology - S Brewer EGIEGI Services for Structural Biology - S Brewer EGI
EGI Services for Structural Biology - S Brewer EGISteve Brewer
 
Adapting federated cyberinfrastructure for shared data collection facilities ...
Adapting federated cyberinfrastructure for shared data collection facilities ...Adapting federated cyberinfrastructure for shared data collection facilities ...
Adapting federated cyberinfrastructure for shared data collection facilities ...Boston Consulting Group
 
Markus Zweckstetter
Markus ZweckstetterMarkus Zweckstetter
Markus ZweckstetterAlzforum
 
Cliff Brangwynne
Cliff BrangwynneCliff Brangwynne
Cliff BrangwynneAlzforum
 
Reproducible research in molecular biophysics and structural biology
Reproducible research in molecular biophysics and structural biologyReproducible research in molecular biophysics and structural biology
Reproducible research in molecular biophysics and structural biologykhinsen
 
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Research Data Alliance
 
New Directions in Structural Biology at Diamond
New Directions in Structural Biology at DiamondNew Directions in Structural Biology at Diamond
New Directions in Structural Biology at Diamondwarwick_amr
 
Protein interaction networks from yeast to human
Protein interaction networks from yeast to humanProtein interaction networks from yeast to human
Protein interaction networks from yeast to humanLars Juhl Jensen
 
Structure Based Drug Design
Structure Based Drug DesignStructure Based Drug Design
Structure Based Drug Designdikheidi
 
UFOs: The Reality of the Impossible
UFOs: The Reality of the ImpossibleUFOs: The Reality of the Impossible
UFOs: The Reality of the ImpossibleMichael Hughes
 
general organization and characterstics of virus
general organization and characterstics of virusgeneral organization and characterstics of virus
general organization and characterstics of virusMohd Asif Kanth
 
Conservation and availability of plant genetic diversity
Conservation and availability of plant genetic diversity Conservation and availability of plant genetic diversity
Conservation and availability of plant genetic diversity Bioversity International
 
L01 ecture 01-
L01 ecture 01-L01 ecture 01-
L01 ecture 01-MUBOSScz
 
Anaesthetic management of valvular heart disease for non cardiac surgery
Anaesthetic management of valvular heart disease for non cardiac surgeryAnaesthetic management of valvular heart disease for non cardiac surgery
Anaesthetic management of valvular heart disease for non cardiac surgeryNaveen Cheran
 
Data sharing in neuroimaging: incentives, tools, and challenges
Data sharing in neuroimaging: incentives, tools, and challengesData sharing in neuroimaging: incentives, tools, and challenges
Data sharing in neuroimaging: incentives, tools, and challengesKrzysztof Gorgolewski
 
Genomics Crash Course for Data Engineers
Genomics Crash Course for Data EngineersGenomics Crash Course for Data Engineers
Genomics Crash Course for Data EngineersAllen Day, PhD
 
Structure based drug design
Structure based drug designStructure based drug design
Structure based drug designADAM S
 

Destaque (20)

ACTRIS - Aerosol, Clouds, and Trace gases Research Infrastructure
ACTRIS - Aerosol, Clouds, and Trace gases Research Infrastructure ACTRIS - Aerosol, Clouds, and Trace gases Research Infrastructure
ACTRIS - Aerosol, Clouds, and Trace gases Research Infrastructure
 
Cultural Heritage: when data are much worst than one can believe
Cultural Heritage: when data are much worst than one can believe Cultural Heritage: when data are much worst than one can believe
Cultural Heritage: when data are much worst than one can believe
 
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
 
EGI Services for Structural Biology - S Brewer EGI
EGI Services for Structural Biology - S Brewer EGIEGI Services for Structural Biology - S Brewer EGI
EGI Services for Structural Biology - S Brewer EGI
 
Adapting federated cyberinfrastructure for shared data collection facilities ...
Adapting federated cyberinfrastructure for shared data collection facilities ...Adapting federated cyberinfrastructure for shared data collection facilities ...
Adapting federated cyberinfrastructure for shared data collection facilities ...
 
Markus Zweckstetter
Markus ZweckstetterMarkus Zweckstetter
Markus Zweckstetter
 
Cliff Brangwynne
Cliff BrangwynneCliff Brangwynne
Cliff Brangwynne
 
Reproducible research in molecular biophysics and structural biology
Reproducible research in molecular biophysics and structural biologyReproducible research in molecular biophysics and structural biology
Reproducible research in molecular biophysics and structural biology
 
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
 
New Directions in Structural Biology at Diamond
New Directions in Structural Biology at DiamondNew Directions in Structural Biology at Diamond
New Directions in Structural Biology at Diamond
 
Protein interaction networks from yeast to human
Protein interaction networks from yeast to humanProtein interaction networks from yeast to human
Protein interaction networks from yeast to human
 
Structure Based Drug Design
Structure Based Drug DesignStructure Based Drug Design
Structure Based Drug Design
 
UFOs: The Reality of the Impossible
UFOs: The Reality of the ImpossibleUFOs: The Reality of the Impossible
UFOs: The Reality of the Impossible
 
general organization and characterstics of virus
general organization and characterstics of virusgeneral organization and characterstics of virus
general organization and characterstics of virus
 
Conservation and availability of plant genetic diversity
Conservation and availability of plant genetic diversity Conservation and availability of plant genetic diversity
Conservation and availability of plant genetic diversity
 
L01 ecture 01-
L01 ecture 01-L01 ecture 01-
L01 ecture 01-
 
Anaesthetic management of valvular heart disease for non cardiac surgery
Anaesthetic management of valvular heart disease for non cardiac surgeryAnaesthetic management of valvular heart disease for non cardiac surgery
Anaesthetic management of valvular heart disease for non cardiac surgery
 
Data sharing in neuroimaging: incentives, tools, and challenges
Data sharing in neuroimaging: incentives, tools, and challengesData sharing in neuroimaging: incentives, tools, and challenges
Data sharing in neuroimaging: incentives, tools, and challenges
 
Genomics Crash Course for Data Engineers
Genomics Crash Course for Data EngineersGenomics Crash Course for Data Engineers
Genomics Crash Course for Data Engineers
 
Structure based drug design
Structure based drug designStructure based drug design
Structure based drug design
 

Semelhante a INSTRUCT - Integrated Structural Biology Infrastructure

The BlueBRIDGE approach to collaborative research
The BlueBRIDGE approach to collaborative researchThe BlueBRIDGE approach to collaborative research
The BlueBRIDGE approach to collaborative researchBlue BRIDGE
 
VREs and Research Tools - supporting collaborative research
VREs and Research Tools - supporting collaborative researchVREs and Research Tools - supporting collaborative research
VREs and Research Tools - supporting collaborative researchChristopher Brown
 
Virtual research environments for implementing long tail open science
Virtual research environments for implementing long tail open scienceVirtual research environments for implementing long tail open science
Virtual research environments for implementing long tail open scienceBlue BRIDGE
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesASIS&T
 
A Distributed Architecture for Sharing Ecological Data Sets with Access and U...
A Distributed Architecture for Sharing Ecological Data Sets with Access and U...A Distributed Architecture for Sharing Ecological Data Sets with Access and U...
A Distributed Architecture for Sharing Ecological Data Sets with Access and U...Javier González
 
Building a Blockchain-based Reputation Infrastructure for Open Research. Ca...
  Building a Blockchain-based Reputation Infrastructure for Open Research. Ca...  Building a Blockchain-based Reputation Infrastructure for Open Research. Ca...
Building a Blockchain-based Reputation Infrastructure for Open Research. Ca...Carmen Holotescu
 
Building a Blockchain-based Reputation Infrastructure for Open Research. Case...
Building a Blockchain-based Reputation Infrastructure for Open Research. Case...Building a Blockchain-based Reputation Infrastructure for Open Research. Case...
Building a Blockchain-based Reputation Infrastructure for Open Research. Case...Carmen Holotescu
 
Metid Match 2014 - SEEK for Science
Metid Match 2014 - SEEK for ScienceMetid Match 2014 - SEEK for Science
Metid Match 2014 - SEEK for Scienceale93756
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2Alex Hardisty
 
The Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresThe Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresguest0dc425
 
Author's workflow and the role of open access
Author's workflow and the role of open accessAuthor's workflow and the role of open access
Author's workflow and the role of open accessPaola Gargiulo
 
CREW VRE Release 5 - 2009 May
CREW VRE Release 5 - 2009 MayCREW VRE Release 5 - 2009 May
CREW VRE Release 5 - 2009 MayMartin Turner
 
Sansone bio sharing introduction
Sansone bio sharing introductionSansone bio sharing introduction
Sansone bio sharing introductionMIBBI Checklists
 
The Repository Roadmap - are we heading in the right direction?
The Repository Roadmap - are we heading in the right direction?The Repository Roadmap - are we heading in the right direction?
The Repository Roadmap - are we heading in the right direction?Eduserv Foundation
 
Andy Powell Presentation
Andy Powell PresentationAndy Powell Presentation
Andy Powell PresentationDonggi heo
 
COLLABORATIVE BIBLIOGRAPHIC SYSTEM FOR REVIEW/SURVEY ARTICLES
COLLABORATIVE BIBLIOGRAPHIC SYSTEM FOR REVIEW/SURVEY ARTICLESCOLLABORATIVE BIBLIOGRAPHIC SYSTEM FOR REVIEW/SURVEY ARTICLES
COLLABORATIVE BIBLIOGRAPHIC SYSTEM FOR REVIEW/SURVEY ARTICLESijcsit
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureRoss Mounce
 
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Blue BRIDGE
 

Semelhante a INSTRUCT - Integrated Structural Biology Infrastructure (20)

The BlueBRIDGE approach to collaborative research
The BlueBRIDGE approach to collaborative researchThe BlueBRIDGE approach to collaborative research
The BlueBRIDGE approach to collaborative research
 
VREs and Research Tools - supporting collaborative research
VREs and Research Tools - supporting collaborative researchVREs and Research Tools - supporting collaborative research
VREs and Research Tools - supporting collaborative research
 
Virtual research environments for implementing long tail open science
Virtual research environments for implementing long tail open scienceVirtual research environments for implementing long tail open science
Virtual research environments for implementing long tail open science
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
A Distributed Architecture for Sharing Ecological Data Sets with Access and U...
A Distributed Architecture for Sharing Ecological Data Sets with Access and U...A Distributed Architecture for Sharing Ecological Data Sets with Access and U...
A Distributed Architecture for Sharing Ecological Data Sets with Access and U...
 
Building a Blockchain-based Reputation Infrastructure for Open Research. Ca...
  Building a Blockchain-based Reputation Infrastructure for Open Research. Ca...  Building a Blockchain-based Reputation Infrastructure for Open Research. Ca...
Building a Blockchain-based Reputation Infrastructure for Open Research. Ca...
 
Building a Blockchain-based Reputation Infrastructure for Open Research. Case...
Building a Blockchain-based Reputation Infrastructure for Open Research. Case...Building a Blockchain-based Reputation Infrastructure for Open Research. Case...
Building a Blockchain-based Reputation Infrastructure for Open Research. Case...
 
Metid Match 2014 - SEEK for Science
Metid Match 2014 - SEEK for ScienceMetid Match 2014 - SEEK for Science
Metid Match 2014 - SEEK for Science
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2
 
The Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresThe Developing Needs for e-infrastructures
The Developing Needs for e-infrastructures
 
Author's workflow and the role of open access
Author's workflow and the role of open accessAuthor's workflow and the role of open access
Author's workflow and the role of open access
 
CREW VRE Release 5 - 2009 May
CREW VRE Release 5 - 2009 MayCREW VRE Release 5 - 2009 May
CREW VRE Release 5 - 2009 May
 
Sansone bio sharing introduction
Sansone bio sharing introductionSansone bio sharing introduction
Sansone bio sharing introduction
 
The Repository Roadmap - are we heading in the right direction?
The Repository Roadmap - are we heading in the right direction?The Repository Roadmap - are we heading in the right direction?
The Repository Roadmap - are we heading in the right direction?
 
Andy Powell Presentation
Andy Powell PresentationAndy Powell Presentation
Andy Powell Presentation
 
COLLABORATIVE BIBLIOGRAPHIC SYSTEM FOR REVIEW/SURVEY ARTICLES
COLLABORATIVE BIBLIOGRAPHIC SYSTEM FOR REVIEW/SURVEY ARTICLESCOLLABORATIVE BIBLIOGRAPHIC SYSTEM FOR REVIEW/SURVEY ARTICLES
COLLABORATIVE BIBLIOGRAPHIC SYSTEM FOR REVIEW/SURVEY ARTICLES
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | Future
 
UCIAD overview
UCIAD overviewUCIAD overview
UCIAD overview
 
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
 
User engagement in research data curation
User engagement in research data curationUser engagement in research data curation
User engagement in research data curation
 

Mais de Research Data Alliance

The Value of the Research Data Alliance to Individuals
The Value of the Research Data Alliance to IndividualsThe Value of the Research Data Alliance to Individuals
The Value of the Research Data Alliance to IndividualsResearch Data Alliance
 
The Value of the Research Data Alliance to Individuals
The Value of the Research Data Alliance to IndividualsThe Value of the Research Data Alliance to Individuals
The Value of the Research Data Alliance to IndividualsResearch Data Alliance
 
RDA Value for Infrastructure Providers
RDA Value for Infrastructure ProvidersRDA Value for Infrastructure Providers
RDA Value for Infrastructure ProvidersResearch Data Alliance
 
The Value of the Rda Value for Organisations Performing Research
The Value of the Rda Value for Organisations Performing ResearchThe Value of the Rda Value for Organisations Performing Research
The Value of the Rda Value for Organisations Performing ResearchResearch Data Alliance
 

Mais de Research Data Alliance (20)

RDA in a Nutshell - September 2020
RDA in a Nutshell - September 2020RDA in a Nutshell - September 2020
RDA in a Nutshell - September 2020
 
RDA in a Nutshell - August 2020
RDA in a Nutshell - August 2020RDA in a Nutshell - August 2020
RDA in a Nutshell - August 2020
 
RDA in a Nutshell - July 2020
RDA in a Nutshell - July 2020RDA in a Nutshell - July 2020
RDA in a Nutshell - July 2020
 
RDA in a Nutshell - June 2020
RDA in a Nutshell - June 2020RDA in a Nutshell - June 2020
RDA in a Nutshell - June 2020
 
RDA in a Nutshell - May 2020
RDA in a Nutshell - May 2020RDA in a Nutshell - May 2020
RDA in a Nutshell - May 2020
 
RDA in a Nutshell - April 2020
RDA in a Nutshell - April 2020RDA in a Nutshell - April 2020
RDA in a Nutshell - April 2020
 
RDA in a Nutshell - March 2020
RDA in a Nutshell - March 2020RDA in a Nutshell - March 2020
RDA in a Nutshell - March 2020
 
RDA in a Nutshell - February 2020
RDA in a Nutshell - February 2020RDA in a Nutshell - February 2020
RDA in a Nutshell - February 2020
 
RDA in a Nutshell - January 2020
RDA in a Nutshell - January 2020RDA in a Nutshell - January 2020
RDA in a Nutshell - January 2020
 
Rda in a Nutshell - December 2019
Rda in a Nutshell - December 2019Rda in a Nutshell - December 2019
Rda in a Nutshell - December 2019
 
Rda in a Nutshell - November 2019
Rda in a Nutshell - November 2019Rda in a Nutshell - November 2019
Rda in a Nutshell - November 2019
 
RDA in a Nutshell - October 2019
RDA in a Nutshell - October 2019RDA in a Nutshell - October 2019
RDA in a Nutshell - October 2019
 
The Value of the Research Data Alliance to Individuals
The Value of the Research Data Alliance to IndividualsThe Value of the Research Data Alliance to Individuals
The Value of the Research Data Alliance to Individuals
 
The Value of the Research Data Alliance to Individuals
The Value of the Research Data Alliance to IndividualsThe Value of the Research Data Alliance to Individuals
The Value of the Research Data Alliance to Individuals
 
RDA Value for Infrastructure Providers
RDA Value for Infrastructure ProvidersRDA Value for Infrastructure Providers
RDA Value for Infrastructure Providers
 
Rda in a nutshell september 2019
Rda in a nutshell september 2019Rda in a nutshell september 2019
Rda in a nutshell september 2019
 
The Value of the Rda Value for Organisations Performing Research
The Value of the Rda Value for Organisations Performing ResearchThe Value of the Rda Value for Organisations Performing Research
The Value of the Rda Value for Organisations Performing Research
 
RDA Value for Libraries
RDA Value for LibrariesRDA Value for Libraries
RDA Value for Libraries
 
The Value of the RDA for Funders
The Value of the RDA for FundersThe Value of the RDA for Funders
The Value of the RDA for Funders
 
Rda in a nutshell august 2019
Rda in a nutshell august 2019Rda in a nutshell august 2019
Rda in a nutshell august 2019
 

Último

NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingsocarem879
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...KarteekMane1
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Milind Agarwal
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 

Último (20)

NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 

INSTRUCT - Integrated Structural Biology Infrastructure

  • 1. www.structuralbiology.eu A distributed infrastructure for integrated structural biology Antonio Rosato rosato@cerm.unifi.it This presentation is under the CC-BY-NC-SA license
  • 2. What is Structural Biology and the SB Interest Group Chairs: Lucia Banci (U. Florence) Chris Morris (STFC) Antonio Rosato (U. Florence) RDA Plenary 3 provided most of the indications that lead to the implementation of the West-Life VRE
  • 3. Fields of research underpinned and supported by BMS RIs
  • 4. 16 Instruct Centres 157 platforms offered for access >2650 registered users 6 National Affiliated Centres 11 user networks Operational since 2012 For all Instruct platforms: www.structuralbiology.eu
  • 5. Instruct is the major player for integrative structural biology across Europe, making possible to tackle demanding projects that require joint technical efforts. It links also to other biomedical infrastructures via the CORBEL project. Instruct offers economic gains through new marketable technologies, new drug and vaccine development, and improved health. Impact Scarselli,Cantini,Banci,Rappuolietal., ScienceTransl.Med.2011 By knowing the structural properties of the antigens and of the epitopes in all the variants, a chimera antigen was produced which elicits complete protective immunity. Patent WO 2011051893 A1 Structure-based design of a Vaccine against Meningococcus B
  • 6. What do we have and what do we need regarding DATA?  Open Data culture pervasive in the Life Sciences (except pharma). However this only applies to final (i.e. atomic coordinates) and supporting (~intermediate) data. Structures have their own DOIs, so are citable  Structural data are discoverable and accessible via the wwPDB. Supporting data not necessarily so (but INSTRUCT mandates that). Raw data are not discoverable, usually unregistered  There have been significant efforts to ensure that supporting data are reusable, i.e. structural data can be re-computed  The above applies to traditional, single-technique studies
  • 7.
  • 8. What do we have and what do we need regarding DATA? In the INSTRUCT data policy: "storage of data is the responsibility of the User to whom it belongs. [...] Instruct Centres are not required to take responsibility for storing data beyond the immediate acquisition visit or the time taken for post experimental analysis if the latter is also provided by the Centre. However, Instruct Centres aspire to offer an archive to store data, especially in cases where the data volume makes this more practical that transferring the data, although there is not a concrete time limit for this implementation"  The data belong to the Users until they are published. Then, the Users must make the data available  Specialized trusted repositories exist for most individual types of structural data.
  • 9. What about integration of different structural data? INSTRUCT is “Integrative Structural Biology” However, if the data from each technique are deposited in separate repositories, re-use will be not possible The wwPDB task force recommended: In addition to archiving the models themselves, all relevant experimental data and metadata as well as experimental and computational protocols should be archived. Inclusivity is key. A. Sali et al. Structure 2015
  • 10. Key issues for INSTRUCT users  SB researchers visit multiple experimental facilities and use multiple techniques to obtain info on their systems  Correspondingly, they collect data in a variety of different formats and described different metadata, even for the same feature (e.g. sample preparation  The aforementioned visits take place over several weeks or even months, also depending on the information obtained  Facilities have different policies for users’ data storage, sometimes providing only very short term storage
  • 11. What do we need regarding DATA? With integrated structural biology becoming routine we will need to  Re-evaluate and merge ontologies and metadata in a cross-technique manner.  Metadata need to be generated early in integrated SB to allow data to be meaningfully combined later on. However the infrastructure only sees the users when their samples are ready.  Document how raw/intermediate data are combined to get to a final structural description.
  • 12. Users worry more about data processing than data management Agree Disagree If a web portal offers integrated access to data archives and to processing software, I will use it Last year I discarded some samples or files because their provenance was not recorded well enough. Last year I repeated some work because I could not find the sample or file produced. I would use combined techniques if the software was easier to get and use. If a web portal offers integrated access to data archives and to processing software, I will use it Potential conflict with the open access paradigm
  • 13. A VRE for structural biology west-life.eu For a description of the life cycle of SB data: https://goo.gl/G8wqrE
  • 14. Toward a new VRE portal Also: OneData from the INDIGO-DataCloud project
  • 16.
  • 17.
  • 18. Data challenges and opportunities for a federated data infrastructure • Provide collaborative environments to share / process / analyze data (“à la Dropbox”) • Provide simple mechanisms to manage the data: sharing / protection / deletion • Various access levels depending on security requirements – Personal X.509 certificated – not user friendly – robot certificates – open access • Control data quality • Provide mechanisms for citation / recognition
  • 19.
  • 20. Schematic diagram of a structure determination data pipeline. From Westbrook et al. NAR, 2003
  • 21. Open data access • What is the data lifecycle in Structural Biology? • Often linked to a specific grant
  • 23. Multiple experimental methods and facilities • Mean 3.9 techniques / respondent – Biologists, not technique experts 324 respondents Other techniques used in past year: X-ray diffraction NMR Spectroscopy Electron Microscopy SAXS Modelling CD SPR Calorimetry Total Preferred technique: X-ray diffraction 52 36 68 89 74 64 80 463 NMR spectroscopy 35 9 21 44 50 24 39 222 Electron microscopy 12 3 6 13 6 6 10 56 SAXS 15 8 5 14 21 15 17 95 Modelling 20 14 5 10 24 21 22 116 Total 82 77 55 105 160 175 130 168 952