SlideShare uma empresa Scribd logo
1 de 58
Baixar para ler offline
Digital preservation
caring for our data to foster
 knowledge discovery and
       dissemination

     Claudia Bauzer Medeiros
      Institute of Computing
             UNICAMP
Pre-Saervare
 (Before) – (Save)
= save before disappears
Maintain
    Manu-tenere

= being able to get/find it
Dec 2008




Feb 2010
Data deluge
• At end of 2011 – info created and replicated > 1.8 zettabytes

• 90% data created in the last 2 years

• 5 hour flight – 240 Tbytes

• Facebook – 200 million users, >70 languages

• Each person in England is filmed 300 times/day

• Teenagers in the US send average 110 phone text messages a day

=> We need to build arks during the deluge - PRESERVATION
Outline
•   Why preserve?
•   What to preserve?
•   How to preserve?
•   Where to preserve?

And a few associated challenges
Outline
•   Why preserve?
•   What to preserve?
•   How to preserve?
•   Where to preserve?

And a few associated challenges
WHY PRESERVE
• Costly to produce

• Contribute to progress of science

• Intrinsic value
  culture/science/sustainability
WHY PRESERVE
• Costly to produce
   – Infrastructure, power, software, models, visualization,
     people
   – Hardware, Software, Peopleware
• Contribute to progress of science
   – Reproducibility and reusability
   – Publication and sharing
   – Quality
• Intrinsic value culture/science/sustainability
   – Digital humanities
   – Domesday project
   – Fonoteca Neotropical Jacques Vieillard
WHY PRESERVE
• Costly to produce
   – Infrastructure, power, software, models, visualization,
     people
   – Hardware, Software, Peopleware
• Contribute to progress of science
   – Reproducibility and reusability
   – Publication and sharing
   – Quality
• Intrinsic value culture/science/sustainability
   – Digital humanities
   – Domesday project
   – Fonoteca Neotropical Jacques Vieillard
WHY PRESERVE
• Costly to produce
   – Infrastructure, power, software, models, visualization,
     people
   – Hardware, Software, Peopleware
• Contribute to progress of science
   – Reproducibility and reusability
   – Publication and sharing
   – Quality
• Intrinsic value culture/science/sustainability
   – Digital humanities
   – Domesday project
   – Fonoteca Neotropical Jacques Vieillard
The Domesday Project 1086-1986
• Digital decay
• Equipment obsolescence
• Software obsolescence
Domesday reloaded
Fonoteca
Neotropical
Jacques
Vieillard
Outline
• Why preserve?

• What to preserve?
• How to preserve?

And associated challenges
What to preserve?
• Data

• BUT what is “data”?



• Only data?
What to preserve?
• Data
• BUT what is “data”?
  – Files and records
  – Models, documentation, annotations, sketches,
    experiments, recordings
• Only data?
What to preserve?
• Data
• BUT what is “data”?
  – Files and records
  – Models, documentation, annotations, sketches,
    experiments, recordings
• Only data?
  – How produced it – workflows, devices,
    methodologies, materials and methods,
    reasonings, logs --- provenance
What to preserve?
• Data
• Environment in which was produced

• Data needed to preserve occupies more space
  than the data itself
• Preservation means storing more than object
  itself
What about our research data?
               (slide adapted from Jim Gray)
Experiments
 Instruments

  Files                           Questions

  Papers                          Answers

   Simulations
          Models


             DATA



Data-driven science                    “Collaboratory”


                                                         23/10000
Data sources?
    Table of Product Characteristics
   id        Property name Value
 MilkProd     productsrep     MilkA
 MilkProd       quantity      10000
 MilkProd     validity date 10/06/2006
CheeseProd productsr          Minas
CheeseProd    epquantity      2000
CheeseProd validity date 12/02/2006
CheeseProd      shape        Circular




                                                         24/10000
eEnvironmental Science
• Direct and indirect observations




                                     25/10000
Data sources




               26/10000
27/10000
We are
 DATASCOPE
 engineers


Software is the
      device/tool
Outline
• Why preserve?
• What to preserve?

• How to preserve?

And associated challenges
How to preserve?

How to construct the ark during the
             deluge?

Presaervare, Manutenere and Share
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures
• To afford maintenance costs
  – Cloud? CAP theorem?
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures
• To afford maintenance costs
  – Cloud? CAP theorem?
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures
• To afford maintenance costs
  – Cloud? CAP theorem?
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures, metadata,standards
• To afford maintenance costs
  – Cloud? CAP theorem?
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay

• To ensure quality
  – Curation procedures,metadata, standards
• To afford maintenance costs
  – Cloud? CAP theorem? =======     WHERE
How to preserve?
• To ensure retrievability and sharing
  – Index structures
  – Ontologies, metadata, keywords, standards
  – Workflows
• To ensure longevity
  – Media decay, software decay, hardware decay
  – PEOPLE DECAY
• To ensure quality
  – Curation procedures,metadata, standards
• To afford maintenance costs
  – Cloud? CAP theorem? =======     WHERE
Sharing and open access

NSF Data Management Policy

 Paper and data publication
Sharing of Data Leads to Progress on Alzheimer’s
                                        By GINA KOLATA
                                   Published: August 12, 2010
                                      = NEW YORK TIMES

In 2003, a group of scientists and executives from the National Institutes of Health, the Food and
Drug Administration, the drug and medical-imaging industries, universities and nonprofit groups
joined in a project that experts say had no precedent: a collaborative effort to find the biological
          markers that show the progression of Alzheimer’s disease in the human brain.



   share all the data, making every single
  finding public immediately, available to
 anyone with a computer anywhere in the
                    world
        => AVAILABILITY and REUSE
• Data must be properly curated throughout its
  life-cycle and released with the appropriate
  high-quality metadata.
• Medical Research Council UK




                                           40/10000
• Research data should be made available for
  use by other researchers. Researchers must
  retain research data, including electronic data,
  in a durable, indexed and retrievable form.
• Australian Govnmt National Health and
  Medical Research Council



                                              41/10000
Microsoft Academic Search
40M publications
19M authors
75 publishers (Wiley, Springer, ACM, IEEE …)




                                               42/10000
Google Scholar Citations




                      43/10000
• Citing data is as important as citing papers
• For researchers, publishers, data centers
• Over 1M DOI, several major national research
  libraries
  – Germany, France, Korea, Netherlands, Australia,
    USA...
• Present manager – German National Library of
  Science and Technology

                                                 44/10000
Publish on the Cloud
Add metadata
Pre-print sharing




                       45/10000
FNJV
       proj.lis.ic.unicamp.br/fnjv
• Sharing by publishing on the Web
• Retrievability by extending metadata




                                         46/10000
CURATION AND USE OF STANDARDS
Workflows and model preservation
Workflows and model preservation
         Comb-e-Chem
                   Video
                                                    Simulation

                                                                 Properties

                           Analysis
  Diffractometer




                                           Structures
                                           Database




X-Ray                                                                   Properties
e-Lab                                                                   e-Lab

                                      Grid Middleware

                                                                          52/10000
The cloud and CAP
Outline
•   Why preserve?
•   What to preserve?
•   How to preserve?
•   Where to preserve?

And a few associated challenges
PRE-SAVE and MANU-TENERE
Outline
• Why preserve?
  – Costly to produce (hardware, software, peopleware)
  – Contribute to progress of science
  – Value – culture, science, sustainability
• What to preserve?
  – Data [WHAT IS DATA?]
  – Context of production and use
• How to preserve?
  – Accessibility and sharing – standards, metadata,
    ontologies
  – Integrity and quality – context to use (hw, sw),
    standards
References
•




             56/10000
References
NSF – CISE Data management policy
The Domesday Project
http://www.atsf.co.uk/dottext/domesday.html
The CLARIN Project (languages)
Eigenfactor.org
Altmetrics movement
Thank you!!!!

Mais conteúdo relacionado

Mais procurados

Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iJose Enrique Ruiz
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleCarole Goble
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?myGrid team
 
Status of Alien Invasive Species Information in Canada
Status of Alien Invasive Species Information in CanadaStatus of Alien Invasive Species Information in Canada
Status of Alien Invasive Species Information in CanadaHans Herrmann
 
Saving private data, sharing Open Data? Role of libraries and institutional r...
Saving private data, sharing Open Data? Role of libraries and institutional r...Saving private data, sharing Open Data? Role of libraries and institutional r...
Saving private data, sharing Open Data? Role of libraries and institutional r...Chris Rusbridge
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceGarethKnight
 
Federation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudFederation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudOpenStack
 

Mais procurados (8)

Just Digitise It - Daniel Wilksch - 2015
Just Digitise It - Daniel Wilksch - 2015Just Digitise It - Daniel Wilksch - 2015
Just Digitise It - Daniel Wilksch - 2015
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote Goble
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?
 
Status of Alien Invasive Species Information in Canada
Status of Alien Invasive Species Information in CanadaStatus of Alien Invasive Species Information in Canada
Status of Alien Invasive Species Information in Canada
 
Saving private data, sharing Open Data? Role of libraries and institutional r...
Saving private data, sharing Open Data? Role of libraries and institutional r...Saving private data, sharing Open Data? Role of libraries and institutional r...
Saving private data, sharing Open Data? Role of libraries and institutional r...
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support Service
 
Federation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research CloudFederation and Interoperability in the Nectar Research Cloud
Federation and Interoperability in the Nectar Research Cloud
 

Destaque

Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...ariadnenetwork
 
Legal Hold and Data Preservation Best Practices
Legal Hold and Data Preservation Best PracticesLegal Hold and Data Preservation Best Practices
Legal Hold and Data Preservation Best PracticesZapproved
 
Research bites: Digital Preservation for Research Data
Research bites: Digital Preservation for Research DataResearch bites: Digital Preservation for Research Data
Research bites: Digital Preservation for Research DataLancaster University Library
 
D.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital PreservationD.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital PreservationPRELIDA Project
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservationsmtcd
 
Digital preservation
Digital preservationDigital preservation
Digital preservationSarika Sawant
 
Basic of Human Resource Management
Basic of Human Resource ManagementBasic of Human Resource Management
Basic of Human Resource ManagementAshit Jain
 
Introduction to human resource management
Introduction to human resource managementIntroduction to human resource management
Introduction to human resource managementTanuj Poddar
 
Human resource management ppt
Human resource management ppt Human resource management ppt
Human resource management ppt Babasab Patil
 
Human Resource Management
Human Resource ManagementHuman Resource Management
Human Resource Managementgumbhir singh
 

Destaque (13)

Data preservation
Data preservationData preservation
Data preservation
 
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
Small Solutions for Small Institutions – Steps Towards Archiving and Preserva...
 
Legal Hold and Data Preservation Best Practices
Legal Hold and Data Preservation Best PracticesLegal Hold and Data Preservation Best Practices
Legal Hold and Data Preservation Best Practices
 
Research bites: Digital Preservation for Research Data
Research bites: Digital Preservation for Research DataResearch bites: Digital Preservation for Research Data
Research bites: Digital Preservation for Research Data
 
D.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital PreservationD.3.1: State of the Art - Linked Data and Digital Preservation
D.3.1: State of the Art - Linked Data and Digital Preservation
 
Data preservation 101
Data preservation 101Data preservation 101
Data preservation 101
 
Is Violent Crime Increasing or Decreasing?
Is Violent Crime Increasing or Decreasing?Is Violent Crime Increasing or Decreasing?
Is Violent Crime Increasing or Decreasing?
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservation
 
Digital preservation
Digital preservationDigital preservation
Digital preservation
 
Basic of Human Resource Management
Basic of Human Resource ManagementBasic of Human Resource Management
Basic of Human Resource Management
 
Introduction to human resource management
Introduction to human resource managementIntroduction to human resource management
Introduction to human resource management
 
Human resource management ppt
Human resource management ppt Human resource management ppt
Human resource management ppt
 
Human Resource Management
Human Resource ManagementHuman Resource Management
Human Resource Management
 

Semelhante a Claudia Bauzer Medeiros Digital preservation – caring for our data to foster knowledge discovery and dissemination

ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012Lee Dirks
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...Bonnie Hurwitz
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management EcosystemJohn Kunze
 
RDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemRDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemASIS&T
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12ASIS&T
 
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...Ardan Patwardhan
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarshiptsbbbu
 
Graham Pryor
Graham PryorGraham Pryor
Graham PryorEduserv
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...BigData_Europe
 
Collaboration and Sharing
Collaboration and SharingCollaboration and Sharing
Collaboration and SharingJisc
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Ola Spjuth
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourcePhilippa Griffin
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2Alex Hardisty
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Jeroen Rombouts
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3guru122
 

Semelhante a Claudia Bauzer Medeiros Digital preservation – caring for our data to foster knowledge discovery and dissemination (20)

ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
 
RDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management EcosystemRDAP13 John Kunze: The Data Management Ecosystem
RDAP13 John Kunze: The Data Management Ecosystem
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
 
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarship
 
Graham Pryor
Graham PryorGraham Pryor
Graham Pryor
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
 
Researh data management
Researh data managementResearh data management
Researh data management
 
Collaboration and Sharing
Collaboration and SharingCollaboration and Sharing
Collaboration and Sharing
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data Resource
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10
 
Big Data
Big Data Big Data
Big Data
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
 

Mais de Beniamino Murgante

Analyzing and assessing ecological transition in building sustainable cities
Analyzing and assessing ecological transition in building sustainable citiesAnalyzing and assessing ecological transition in building sustainable cities
Analyzing and assessing ecological transition in building sustainable citiesBeniamino Murgante
 
Smart Cities: New Science for the Cities
Smart Cities: New Science for the CitiesSmart Cities: New Science for the Cities
Smart Cities: New Science for the CitiesBeniamino Murgante
 
The evolution of spatial analysis and modeling in decision processes
The evolution of spatial analysis and modeling in decision processesThe evolution of spatial analysis and modeling in decision processes
The evolution of spatial analysis and modeling in decision processesBeniamino Murgante
 
Involving citizens in smart energy approaches: the experience of an energy pa...
Involving citizens in smart energy approaches: the experience of an energy pa...Involving citizens in smart energy approaches: the experience of an energy pa...
Involving citizens in smart energy approaches: the experience of an energy pa...Beniamino Murgante
 
Programmazione per la governance territoriale in tema di tutela della biodive...
Programmazione per la governance territoriale in tema di tutela della biodive...Programmazione per la governance territoriale in tema di tutela della biodive...
Programmazione per la governance territoriale in tema di tutela della biodive...Beniamino Murgante
 
Involving Citizens in a Participation Process for Increasing Walkability
Involving Citizens in a Participation Process for Increasing WalkabilityInvolving Citizens in a Participation Process for Increasing Walkability
Involving Citizens in a Participation Process for Increasing WalkabilityBeniamino Murgante
 
Presentation of ICCSA 2019 at the University of Saint petersburg
Presentation of ICCSA 2019 at the University of Saint petersburg Presentation of ICCSA 2019 at the University of Saint petersburg
Presentation of ICCSA 2019 at the University of Saint petersburg Beniamino Murgante
 
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...Beniamino Murgante
 
Presentation of ICCSA 2017 at the University of trieste
Presentation of ICCSA 2017 at the University of triestePresentation of ICCSA 2017 at the University of trieste
Presentation of ICCSA 2017 at the University of triesteBeniamino Murgante
 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...Beniamino Murgante
 
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...Beniamino Murgante
 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector Beniamino Murgante
 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...Beniamino Murgante
 
Garden in motion. An experience of citizens involvement in public space regen...
Garden in motion. An experience of citizens involvement in public space regen...Garden in motion. An experience of citizens involvement in public space regen...
Garden in motion. An experience of citizens involvement in public space regen...Beniamino Murgante
 
Planning and Smartness: the true challenge
Planning and Smartness: the true challengePlanning and Smartness: the true challenge
Planning and Smartness: the true challengeBeniamino Murgante
 
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...Beniamino Murgante
 
Informazione Geografica, Città, Smartness
Informazione Geografica, Città, Smartness Informazione Geografica, Città, Smartness
Informazione Geografica, Città, Smartness Beniamino Murgante
 
Tecnologie, Territorio, Smartness
Tecnologie, Territorio, SmartnessTecnologie, Territorio, Smartness
Tecnologie, Territorio, SmartnessBeniamino Murgante
 

Mais de Beniamino Murgante (20)

Analyzing and assessing ecological transition in building sustainable cities
Analyzing and assessing ecological transition in building sustainable citiesAnalyzing and assessing ecological transition in building sustainable cities
Analyzing and assessing ecological transition in building sustainable cities
 
Smart Cities: New Science for the Cities
Smart Cities: New Science for the CitiesSmart Cities: New Science for the Cities
Smart Cities: New Science for the Cities
 
The evolution of spatial analysis and modeling in decision processes
The evolution of spatial analysis and modeling in decision processesThe evolution of spatial analysis and modeling in decision processes
The evolution of spatial analysis and modeling in decision processes
 
Smart City or Urban Science?
Smart City or Urban Science?Smart City or Urban Science?
Smart City or Urban Science?
 
Involving citizens in smart energy approaches: the experience of an energy pa...
Involving citizens in smart energy approaches: the experience of an energy pa...Involving citizens in smart energy approaches: the experience of an energy pa...
Involving citizens in smart energy approaches: the experience of an energy pa...
 
Programmazione per la governance territoriale in tema di tutela della biodive...
Programmazione per la governance territoriale in tema di tutela della biodive...Programmazione per la governance territoriale in tema di tutela della biodive...
Programmazione per la governance territoriale in tema di tutela della biodive...
 
Involving Citizens in a Participation Process for Increasing Walkability
Involving Citizens in a Participation Process for Increasing WalkabilityInvolving Citizens in a Participation Process for Increasing Walkability
Involving Citizens in a Participation Process for Increasing Walkability
 
Presentation of ICCSA 2019 at the University of Saint petersburg
Presentation of ICCSA 2019 at the University of Saint petersburg Presentation of ICCSA 2019 at the University of Saint petersburg
Presentation of ICCSA 2019 at the University of Saint petersburg
 
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
RISCHIO TERRITORIALE NEL GOVERNO DEL TERRITORIO: Ricerca e formazione nelle s...
 
Presentation of ICCSA 2017 at the University of trieste
Presentation of ICCSA 2017 at the University of triestePresentation of ICCSA 2017 at the University of trieste
Presentation of ICCSA 2017 at the University of trieste
 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
 
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
Focussing Energy Consumers’ Behaviour Change towards Energy Efficiency and Lo...
 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
Socio-Economic Planning profiles: Sciences VS Daily activities in public sector 
 
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
GEOGRAPHIC INFORMATION – NEED TO KNOW (GI-N2K) Towards a more demand-driven g...
 
Garden in motion. An experience of citizens involvement in public space regen...
Garden in motion. An experience of citizens involvement in public space regen...Garden in motion. An experience of citizens involvement in public space regen...
Garden in motion. An experience of citizens involvement in public space regen...
 
Planning and Smartness: the true challenge
Planning and Smartness: the true challengePlanning and Smartness: the true challenge
Planning and Smartness: the true challenge
 
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
GeoSDI: una piattaforma social di dati geografici basata sui principi di INSP...
 
Murgante smart energy
Murgante smart energyMurgante smart energy
Murgante smart energy
 
Informazione Geografica, Città, Smartness
Informazione Geografica, Città, Smartness Informazione Geografica, Città, Smartness
Informazione Geografica, Città, Smartness
 
Tecnologie, Territorio, Smartness
Tecnologie, Territorio, SmartnessTecnologie, Territorio, Smartness
Tecnologie, Territorio, Smartness
 

Último

Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxDr. Ravikiran H M Gowda
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...Amil baba
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17Celine George
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxPooja Bhuva
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...Nguyen Thanh Tu Collection
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma
 

Último (20)

Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 

Claudia Bauzer Medeiros Digital preservation – caring for our data to foster knowledge discovery and dissemination

  • 1. Digital preservation caring for our data to foster knowledge discovery and dissemination Claudia Bauzer Medeiros Institute of Computing UNICAMP
  • 2. Pre-Saervare (Before) – (Save) = save before disappears
  • 3. Maintain Manu-tenere = being able to get/find it
  • 4.
  • 6. Data deluge • At end of 2011 – info created and replicated > 1.8 zettabytes • 90% data created in the last 2 years • 5 hour flight – 240 Tbytes • Facebook – 200 million users, >70 languages • Each person in England is filmed 300 times/day • Teenagers in the US send average 110 phone text messages a day => We need to build arks during the deluge - PRESERVATION
  • 7. Outline • Why preserve? • What to preserve? • How to preserve? • Where to preserve? And a few associated challenges
  • 8. Outline • Why preserve? • What to preserve? • How to preserve? • Where to preserve? And a few associated challenges
  • 9. WHY PRESERVE • Costly to produce • Contribute to progress of science • Intrinsic value culture/science/sustainability
  • 10. WHY PRESERVE • Costly to produce – Infrastructure, power, software, models, visualization, people – Hardware, Software, Peopleware • Contribute to progress of science – Reproducibility and reusability – Publication and sharing – Quality • Intrinsic value culture/science/sustainability – Digital humanities – Domesday project – Fonoteca Neotropical Jacques Vieillard
  • 11. WHY PRESERVE • Costly to produce – Infrastructure, power, software, models, visualization, people – Hardware, Software, Peopleware • Contribute to progress of science – Reproducibility and reusability – Publication and sharing – Quality • Intrinsic value culture/science/sustainability – Digital humanities – Domesday project – Fonoteca Neotropical Jacques Vieillard
  • 12. WHY PRESERVE • Costly to produce – Infrastructure, power, software, models, visualization, people – Hardware, Software, Peopleware • Contribute to progress of science – Reproducibility and reusability – Publication and sharing – Quality • Intrinsic value culture/science/sustainability – Digital humanities – Domesday project – Fonoteca Neotropical Jacques Vieillard
  • 13. The Domesday Project 1086-1986 • Digital decay • Equipment obsolescence • Software obsolescence
  • 16.
  • 17.
  • 18. Outline • Why preserve? • What to preserve? • How to preserve? And associated challenges
  • 19. What to preserve? • Data • BUT what is “data”? • Only data?
  • 20. What to preserve? • Data • BUT what is “data”? – Files and records – Models, documentation, annotations, sketches, experiments, recordings • Only data?
  • 21. What to preserve? • Data • BUT what is “data”? – Files and records – Models, documentation, annotations, sketches, experiments, recordings • Only data? – How produced it – workflows, devices, methodologies, materials and methods, reasonings, logs --- provenance
  • 22. What to preserve? • Data • Environment in which was produced • Data needed to preserve occupies more space than the data itself • Preservation means storing more than object itself
  • 23. What about our research data? (slide adapted from Jim Gray) Experiments Instruments Files Questions Papers Answers Simulations Models DATA Data-driven science “Collaboratory” 23/10000
  • 24. Data sources? Table of Product Characteristics id Property name Value MilkProd productsrep MilkA MilkProd quantity 10000 MilkProd validity date 10/06/2006 CheeseProd productsr Minas CheeseProd epquantity 2000 CheeseProd validity date 12/02/2006 CheeseProd shape Circular 24/10000
  • 25. eEnvironmental Science • Direct and indirect observations 25/10000
  • 26. Data sources 26/10000
  • 28. We are DATASCOPE engineers Software is the device/tool
  • 29. Outline • Why preserve? • What to preserve? • How to preserve? And associated challenges
  • 30. How to preserve? How to construct the ark during the deluge? Presaervare, Manutenere and Share
  • 31. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures • To afford maintenance costs – Cloud? CAP theorem?
  • 32. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures • To afford maintenance costs – Cloud? CAP theorem?
  • 33. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures • To afford maintenance costs – Cloud? CAP theorem?
  • 34. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures, metadata,standards • To afford maintenance costs – Cloud? CAP theorem?
  • 35. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay • To ensure quality – Curation procedures,metadata, standards • To afford maintenance costs – Cloud? CAP theorem? ======= WHERE
  • 36. How to preserve? • To ensure retrievability and sharing – Index structures – Ontologies, metadata, keywords, standards – Workflows • To ensure longevity – Media decay, software decay, hardware decay – PEOPLE DECAY • To ensure quality – Curation procedures,metadata, standards • To afford maintenance costs – Cloud? CAP theorem? ======= WHERE
  • 37. Sharing and open access NSF Data Management Policy Paper and data publication
  • 38.
  • 39. Sharing of Data Leads to Progress on Alzheimer’s By GINA KOLATA Published: August 12, 2010 = NEW YORK TIMES In 2003, a group of scientists and executives from the National Institutes of Health, the Food and Drug Administration, the drug and medical-imaging industries, universities and nonprofit groups joined in a project that experts say had no precedent: a collaborative effort to find the biological markers that show the progression of Alzheimer’s disease in the human brain. share all the data, making every single finding public immediately, available to anyone with a computer anywhere in the world => AVAILABILITY and REUSE
  • 40. • Data must be properly curated throughout its life-cycle and released with the appropriate high-quality metadata. • Medical Research Council UK 40/10000
  • 41. • Research data should be made available for use by other researchers. Researchers must retain research data, including electronic data, in a durable, indexed and retrievable form. • Australian Govnmt National Health and Medical Research Council 41/10000
  • 42. Microsoft Academic Search 40M publications 19M authors 75 publishers (Wiley, Springer, ACM, IEEE …) 42/10000
  • 44. • Citing data is as important as citing papers • For researchers, publishers, data centers • Over 1M DOI, several major national research libraries – Germany, France, Korea, Netherlands, Australia, USA... • Present manager – German National Library of Science and Technology 44/10000
  • 45. Publish on the Cloud Add metadata Pre-print sharing 45/10000
  • 46. FNJV proj.lis.ic.unicamp.br/fnjv • Sharing by publishing on the Web • Retrievability by extending metadata 46/10000
  • 47.
  • 48.
  • 49. CURATION AND USE OF STANDARDS
  • 50. Workflows and model preservation
  • 51.
  • 52. Workflows and model preservation Comb-e-Chem Video Simulation Properties Analysis Diffractometer Structures Database X-Ray Properties e-Lab e-Lab Grid Middleware 52/10000
  • 54. Outline • Why preserve? • What to preserve? • How to preserve? • Where to preserve? And a few associated challenges PRE-SAVE and MANU-TENERE
  • 55. Outline • Why preserve? – Costly to produce (hardware, software, peopleware) – Contribute to progress of science – Value – culture, science, sustainability • What to preserve? – Data [WHAT IS DATA?] – Context of production and use • How to preserve? – Accessibility and sharing – standards, metadata, ontologies – Integrity and quality – context to use (hw, sw), standards
  • 56. References • 56/10000
  • 57. References NSF – CISE Data management policy The Domesday Project http://www.atsf.co.uk/dottext/domesday.html The CLARIN Project (languages) Eigenfactor.org Altmetrics movement