SlideShare uma empresa Scribd logo
1 de 14
Data-PASS:
How collaborative preservation works
Micah Altman, Harvard University

IASSIST 2010, Ithaca New York
                                       1
What’s next?

 ✤   What is Data-PASS?

 ✤   Challenges of preserving scientific evidence

 ✤   Converging trends

 ✤   Benefits of institutional collaboration

 ✤   Evolving structure of collaboration

 ✤   Services and infrastructure

                                                   2
How collaborative preservation works.
Collaborators and Co-Conspirators

 ✤   Margaret Adams, Caroline Arms, Ed Bachman, Nitin Borwankar,
     Adam Buchbinder, Ken Bollen, Bryan Beecher, Steve Burling,
     Jonathan Crabtree, Darrell Donakowski, Myron Gutmann, Gary
     King, Patrick King, Jared Lyle, Marc Maynard, Amy Pienta, Lois
     Timms-Ferrarra, Copeland Young.

 ✤   Research Support
     Thanks to the Library of Congress (PA#NDP03-1), the National
     Science Foundation (DMS-0835500, SES 0112072), IMLS
     (LG-05-09-0041-09), the Harvard University Library, the Institute for
     Quantitative Social Science, the Harvard-MIT Data Center, and the
     Murray Research Archive.
                                                                             3
How collaborative preservation works.
What is Data-PASS?

 ✤   Data-PASS is a broad-based partnership of data archives dedicated to acquiring and preserving data at-
     risk of being lost to the social science research community.

 ✤   Data-PASS partners have rescued thousands of data sets and created the largest catalog of social science
     data in existence.

 ✤   Data-PASS partners collaborate to

     ✤   identify and promote good archival practices,

     ✤   seek out at-risk research data,

     ✤   build preservation infrastructure,

     ✤   and mutually safeguard collections.

 ✤   Our current initiatives include:

     ✤   improving data citation practices,

     ✤   automatic policy-based archival replication
                                                                                                                4
How collaborative preservation works.
Challenges of Preserving
    Scientific Evidence
✤   Scientists expectations are changing

    ✤    Movements toward open access and open data

    ✤    Specialized workflow systems

    ✤    Diversity of approaches to managing replication and community data

✤   Scientific change creates technical challenges:

    ✤    Forms, formats, and research workflows change

    ✤    Data is not self-documenting

    ✤    Intellectual property & privacy law are evolving

    ✤    Resources to deal with these changes are limited

✤   Much of the empirical base of science becomes lost                                          Source: Wikimedia Commons



    ✤    Journal articles & books are only summaries

    ✤    Full replication is expensive or impossible

    ✤    This slows scientific progress:
         cooked results, publication bias, citation authority distortion, challenges of meta-
         analysis
                                                                                                                            5
How collaborative preservation works.
Converging trends in preservation
 ✤   Standardized criteria for evaluating trustworthiness of archives

     ✤   TRAC; NARA TDR; Drambora

 ✤   Collaborative stewardship by memory institutions

     ✤   Meta-Archive, CLOCKSS, COPUL, PeDALS, ADN, Chronopolis

 ✤   Technology for replication and verification

     ✤   Solutions developed within the library/archival community:
         LOCKSS, IRODS, ACE, Duraspace

     ✤   Commercial HPC and Cloud solutions:
         Hadoop, Crashplan, Mozy, AWS, etc.

     ✤   P2P sharing:
         freenet, gnunet, Taho-LAFS
                                                                        6
How collaborative preservation works.
Benefits of Collaboration
  "Nothing new that is really interesting comes without collaboration" -- James Watson



 ✤
     General Benefits
     ✤   Exposure to funding opportunities; collection development leads
     ✤   Division of labor in tracking law, technology, information science
     ✤   Combined experience in preservation practice
 ✤   Data-PASS Focus*
     ✤
         Expanded discoverability of collections
         ✤
             Reach new audiences
         ✤
             Holdings across the joint collection are more complete
         ✤
             Virtual collections can be built from slices of the joint collection
     ✤
         Development and advocacy of archival good practices
         ✤
             (Current initiative: outreach to professional associations in support of data citation)
     ✤
         Insurance against institutional and technological failure                                         7
How collaborative preservation works.             * And the museum of obsolete data storage technologies
How Collaborative Stewardship acts as
 Insurance Against Preservation Failure
 ✤   Collaborative replication & stewardship can substantially
     mitigate preservation risk from:

     ✤   External threats to institution failure:

         ✤   funding loss; attacks;
             legal regime change;
             mission drift

     ✤   Institutional failure:

         ✤   Unintentional curatorial modification;
             Loss of institutional knowledge;
             Change in mission

 ✤   And also reduce preservation risk from:

     ✤   Media failure (from storage & media characteristics);
         Software & hardware infrastructure failures
                                                                 8
How collaborative preservation works.
Shared Infrastructure
 ✤   Shared infrastructure can

     ✤   reduce costs

     ✤   reduce risk

     ✤   coordinate operations

     ✤   validate shared standards

 ✤   Data-PASS Shared Infrastructure

     ✤   Shared Catalog

     ✤   Policy-Driven Distributed Replication
         (in development)

     ✤   The Dataverse Network
         (overlapping infrastructure)
                                                 9
How collaborative preservation works.
Shared Catalog
 ✤   Unified Discovery

     ✤   Simple & fielded search

     ✤   Virtual collection across entire catalog

     ✤   Browse by subject, data, source

 ✤   Metadata delivery

     ✤   Descriptive study, file, and variable
         information

     ✤   Provenance & rights metadata

     ✤   Human, OAI, Z39.50 interfaces

 ✤   Layered Services

     ✤   Data reformatting for delivery

     ✤   On-line analysis
                                                    10
How collaborative preservation works.
The Dataverse Network ®
                       For Organizations                                              For Scholars




✤     Dataverses are Data-PASS ready -- all dataverses can provide:   ✤   The Dataverse Network System is Open-Source and
      ✤    DDI (2.x) metadata export (intuitive form-based entry)     ✤   Creating a Dataverse requires no software.
      ✤    Catalog access through OAI-PMH (and Z39.50)                ✤   IQSS & MRA host an open DVN and offer no-cost
      ✤    LOCKSS compatibility                                           permanent storage:

      ✤    Version control (new); Terms of use metadata; Flexible                  http://dvn.iq.harvard.edu
           contributor-curator-editor workflows
                                                                                                                            11
How   collaborative preservation works. (better) self-archiving
           -- ideal for “living collections” &
Policy-Driven Distributed Replication

 ✤   Policy Based

     ✤   Preservation requirements shape policy

     ✤   Policy drives replication rules

     ✤   Auditing demonstrates conformance with
         preservation requirements

 ✤   Copies are distributed

     ✤   Across space

     ✤   Among institutions

     ✤   Across time (version history retained)

 ✤   Commitments scaled to participant resources

     ✤   Collection size

     ✤   Technology
                                                   12
How collaborative preservation works.
Structure of Collaboration
Areas of collaboration...                                 Steps to participation
 ✤   Partnership agreements                                 Partners agree to...

     agreement on good practice;                             ✤   Publishing metadata
     permission to preserve;
     partners offer to accept data transfer if archive fails ✤   Use of replication system

 ✤   Coordinated operations                                  ✤   Good archival practice
                                                                 (TRAC compliance not required)
     shared leads;
     regular communication;
                                                             ✤   Transfer protocols
     collegial review available
                                                            Partners use the following technlogies
 ✤   Shared good practice                                    ✤   Light-weight protocols:
                                                                 OAI-PMH + DDI 2-lite +
     metadata; preservation; confidentiality
                                                                 HTTP harvestable data
 ✤   Circle of gifts norm                                    ✤   Software:
                                                                 Could use a hosted dataverse or;
     in-kind effort & resource;                                  install open source OAI-PMH server, etc.
     contributions are voluntary & proportional
                                                             ✤   No fear - we can help!                     13
More Questions?



 ✤   Know of research data at risk of loss?

 ✤   Need help preserving your research data?

 ✤   Want more visibility and protection for your collections ?

                               http://data-pass.org
                           data-pass@icpsr.umich.edu
                                                                  14
How collaborative preservation works.

Mais conteúdo relacionado

Semelhante a Data-PASS: How Collaborative Presentation Works

The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
Research data spring: a consortial approach to RDM within SaS
Research data spring: a consortial approach to RDM within SaSResearch data spring: a consortial approach to RDM within SaS
Research data spring: a consortial approach to RDM within SaSJisc RDM
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing dataSarah Jones
 
Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...ASIS&T
 
The Dark Side of Digital Preservation: Distributed Digital Preservation
The Dark Side of Digital Preservation: Distributed Digital PreservationThe Dark Side of Digital Preservation: Distributed Digital Preservation
The Dark Side of Digital Preservation: Distributed Digital PreservationEducopia
 
Graham Pryor
Graham PryorGraham Pryor
Graham PryorEduserv
 
Supporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementSupporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementMarieke Guy
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data managementopl10
 
FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData ManagementUlrike Wittig
 
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...Artefactual Systems - Archivematica
 
Auditing Distributed Preservation Networks
Auditing Distributed Preservation Networks Auditing Distributed Preservation Networks
Auditing Distributed Preservation Networks Micah Altman
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareRobin Rice
 
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)dri_ireland
 
Converged IT and Data Commons
Converged IT and Data CommonsConverged IT and Data Commons
Converged IT and Data CommonsSimon Twigger
 
(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)
(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)
(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)OpenAIRE
 
Research methods group accelarating impact by sharing data
Research methods group  accelarating impact by sharing dataResearch methods group  accelarating impact by sharing data
Research methods group accelarating impact by sharing dataWorld Agroforestry (ICRAF)
 

Semelhante a Data-PASS: How Collaborative Presentation Works (20)

The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Research data spring: a consortial approach to RDM within SaS
Research data spring: a consortial approach to RDM within SaSResearch data spring: a consortial approach to RDM within SaS
Research data spring: a consortial approach to RDM within SaS
 
Long Term Preservation Dale Peters
Long Term Preservation Dale PetersLong Term Preservation Dale Peters
Long Term Preservation Dale Peters
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...
 
The Dark Side of Digital Preservation: Distributed Digital Preservation
The Dark Side of Digital Preservation: Distributed Digital PreservationThe Dark Side of Digital Preservation: Distributed Digital Preservation
The Dark Side of Digital Preservation: Distributed Digital Preservation
 
Graham Pryor
Graham PryorGraham Pryor
Graham Pryor
 
DC101 UWE
DC101 UWEDC101 UWE
DC101 UWE
 
Supporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data ManagementSupporting Libraries in Leading the Way in Research Data Management
Supporting Libraries in Leading the Way in Research Data Management
 
Accelerating your research with Microsoft Azure
Accelerating your research with Microsoft AzureAccelerating your research with Microsoft Azure
Accelerating your research with Microsoft Azure
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
 
FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData Management
 
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
 
Auditing Distributed Preservation Networks
Auditing Distributed Preservation Networks Auditing Distributed Preservation Networks
Auditing Distributed Preservation Networks
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
 
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
 
Converged IT and Data Commons
Converged IT and Data CommonsConverged IT and Data Commons
Converged IT and Data Commons
 
(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)
(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)
(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)
 
Research methods group accelarating impact by sharing data
Research methods group  accelarating impact by sharing dataResearch methods group  accelarating impact by sharing data
Research methods group accelarating impact by sharing data
 

Mais de Micah Altman

Selecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategiesSelecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategiesMicah Altman
 
Well-Being - A Sunset Conversation
Well-Being - A Sunset ConversationWell-Being - A Sunset Conversation
Well-Being - A Sunset ConversationMicah Altman
 
Matching Uses and Protections for Government Data Releases: Presentation at t...
Matching Uses and Protections for Government Data Releases: Presentation at t...Matching Uses and Protections for Government Data Releases: Presentation at t...
Matching Uses and Protections for Government Data Releases: Presentation at t...Micah Altman
 
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019Micah Altman
 
Well-being A Sunset Conversation
Well-being A Sunset ConversationWell-being A Sunset Conversation
Well-being A Sunset ConversationMicah Altman
 
Can We Fix Peer Review
Can We Fix Peer ReviewCan We Fix Peer Review
Can We Fix Peer ReviewMicah Altman
 
Academy Owned Peer Review
Academy Owned Peer ReviewAcademy Owned Peer Review
Academy Owned Peer ReviewMicah Altman
 
Redistricting in the US -- An Overview
Redistricting in the US -- An OverviewRedistricting in the US -- An Overview
Redistricting in the US -- An OverviewMicah Altman
 
A Future for Electoral Districting
A Future for Electoral DistrictingA Future for Electoral Districting
A Future for Electoral DistrictingMicah Altman
 
A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk  A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk Micah Altman
 
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...Micah Altman
 
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...Micah Altman
 
Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:Micah Altman
 
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-NotsCreative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-NotsMicah Altman
 
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...Micah Altman
 
Ndsa 2016 opening plenary
Ndsa 2016 opening plenaryNdsa 2016 opening plenary
Ndsa 2016 opening plenaryMicah Altman
 
Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...Micah Altman
 
Software Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental ScanSoftware Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental ScanMicah Altman
 
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...Micah Altman
 
Gary Price, MIT Program on Information Science
Gary Price, MIT Program on Information ScienceGary Price, MIT Program on Information Science
Gary Price, MIT Program on Information ScienceMicah Altman
 

Mais de Micah Altman (20)

Selecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategiesSelecting efficient and reliable preservation strategies
Selecting efficient and reliable preservation strategies
 
Well-Being - A Sunset Conversation
Well-Being - A Sunset ConversationWell-Being - A Sunset Conversation
Well-Being - A Sunset Conversation
 
Matching Uses and Protections for Government Data Releases: Presentation at t...
Matching Uses and Protections for Government Data Releases: Presentation at t...Matching Uses and Protections for Government Data Releases: Presentation at t...
Matching Uses and Protections for Government Data Releases: Presentation at t...
 
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019
 
Well-being A Sunset Conversation
Well-being A Sunset ConversationWell-being A Sunset Conversation
Well-being A Sunset Conversation
 
Can We Fix Peer Review
Can We Fix Peer ReviewCan We Fix Peer Review
Can We Fix Peer Review
 
Academy Owned Peer Review
Academy Owned Peer ReviewAcademy Owned Peer Review
Academy Owned Peer Review
 
Redistricting in the US -- An Overview
Redistricting in the US -- An OverviewRedistricting in the US -- An Overview
Redistricting in the US -- An Overview
 
A Future for Electoral Districting
A Future for Electoral DistrictingA Future for Electoral Districting
A Future for Electoral Districting
 
A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk  A History of the Internet :Scott Bradner’s Program on Information Science Talk
A History of the Internet :Scott Bradner’s Program on Information Science Talk
 
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
 
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
Labor And Reward In Science: Commentary on Cassidy Sugimoto’s Program on Info...
 
Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:Utilizing VR and AR in the Library Space:
Utilizing VR and AR in the Library Space:
 
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-NotsCreative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
Creative Data Literacy: Bridging the Gap Between Data-Haves and Have-Nots
 
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
SOLARSPELL: THE SOLAR POWERED EDUCATIONAL LEARNING LIBRARY - EXPERIENTIAL LEA...
 
Ndsa 2016 opening plenary
Ndsa 2016 opening plenaryNdsa 2016 opening plenary
Ndsa 2016 opening plenary
 
Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...Making Decisions in a World Awash in Data: We’re going to need a different bo...
Making Decisions in a World Awash in Data: We’re going to need a different bo...
 
Software Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental ScanSoftware Repositories for Research-- An Environmental Scan
Software Repositories for Research-- An Environmental Scan
 
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
The Open Access Network: Rebecca Kennison’s Talk for the MIT Prorgam on Infor...
 
Gary Price, MIT Program on Information Science
Gary Price, MIT Program on Information ScienceGary Price, MIT Program on Information Science
Gary Price, MIT Program on Information Science
 

Último

Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docxPoojaSen20
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.MateoGardella
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 

Último (20)

Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 

Data-PASS: How Collaborative Presentation Works

  • 1. Data-PASS: How collaborative preservation works Micah Altman, Harvard University IASSIST 2010, Ithaca New York 1
  • 2. What’s next? ✤ What is Data-PASS? ✤ Challenges of preserving scientific evidence ✤ Converging trends ✤ Benefits of institutional collaboration ✤ Evolving structure of collaboration ✤ Services and infrastructure 2 How collaborative preservation works.
  • 3. Collaborators and Co-Conspirators ✤ Margaret Adams, Caroline Arms, Ed Bachman, Nitin Borwankar, Adam Buchbinder, Ken Bollen, Bryan Beecher, Steve Burling, Jonathan Crabtree, Darrell Donakowski, Myron Gutmann, Gary King, Patrick King, Jared Lyle, Marc Maynard, Amy Pienta, Lois Timms-Ferrarra, Copeland Young. ✤ Research Support Thanks to the Library of Congress (PA#NDP03-1), the National Science Foundation (DMS-0835500, SES 0112072), IMLS (LG-05-09-0041-09), the Harvard University Library, the Institute for Quantitative Social Science, the Harvard-MIT Data Center, and the Murray Research Archive. 3 How collaborative preservation works.
  • 4. What is Data-PASS? ✤ Data-PASS is a broad-based partnership of data archives dedicated to acquiring and preserving data at- risk of being lost to the social science research community. ✤ Data-PASS partners have rescued thousands of data sets and created the largest catalog of social science data in existence. ✤ Data-PASS partners collaborate to ✤ identify and promote good archival practices, ✤ seek out at-risk research data, ✤ build preservation infrastructure, ✤ and mutually safeguard collections. ✤ Our current initiatives include: ✤ improving data citation practices, ✤ automatic policy-based archival replication 4 How collaborative preservation works.
  • 5. Challenges of Preserving Scientific Evidence ✤ Scientists expectations are changing ✤ Movements toward open access and open data ✤ Specialized workflow systems ✤ Diversity of approaches to managing replication and community data ✤ Scientific change creates technical challenges: ✤ Forms, formats, and research workflows change ✤ Data is not self-documenting ✤ Intellectual property & privacy law are evolving ✤ Resources to deal with these changes are limited ✤ Much of the empirical base of science becomes lost Source: Wikimedia Commons ✤ Journal articles & books are only summaries ✤ Full replication is expensive or impossible ✤ This slows scientific progress: cooked results, publication bias, citation authority distortion, challenges of meta- analysis 5 How collaborative preservation works.
  • 6. Converging trends in preservation ✤ Standardized criteria for evaluating trustworthiness of archives ✤ TRAC; NARA TDR; Drambora ✤ Collaborative stewardship by memory institutions ✤ Meta-Archive, CLOCKSS, COPUL, PeDALS, ADN, Chronopolis ✤ Technology for replication and verification ✤ Solutions developed within the library/archival community: LOCKSS, IRODS, ACE, Duraspace ✤ Commercial HPC and Cloud solutions: Hadoop, Crashplan, Mozy, AWS, etc. ✤ P2P sharing: freenet, gnunet, Taho-LAFS 6 How collaborative preservation works.
  • 7. Benefits of Collaboration "Nothing new that is really interesting comes without collaboration" -- James Watson ✤ General Benefits ✤ Exposure to funding opportunities; collection development leads ✤ Division of labor in tracking law, technology, information science ✤ Combined experience in preservation practice ✤ Data-PASS Focus* ✤ Expanded discoverability of collections ✤ Reach new audiences ✤ Holdings across the joint collection are more complete ✤ Virtual collections can be built from slices of the joint collection ✤ Development and advocacy of archival good practices ✤ (Current initiative: outreach to professional associations in support of data citation) ✤ Insurance against institutional and technological failure 7 How collaborative preservation works. * And the museum of obsolete data storage technologies
  • 8. How Collaborative Stewardship acts as Insurance Against Preservation Failure ✤ Collaborative replication & stewardship can substantially mitigate preservation risk from: ✤ External threats to institution failure: ✤ funding loss; attacks; legal regime change; mission drift ✤ Institutional failure: ✤ Unintentional curatorial modification; Loss of institutional knowledge; Change in mission ✤ And also reduce preservation risk from: ✤ Media failure (from storage & media characteristics); Software & hardware infrastructure failures 8 How collaborative preservation works.
  • 9. Shared Infrastructure ✤ Shared infrastructure can ✤ reduce costs ✤ reduce risk ✤ coordinate operations ✤ validate shared standards ✤ Data-PASS Shared Infrastructure ✤ Shared Catalog ✤ Policy-Driven Distributed Replication (in development) ✤ The Dataverse Network (overlapping infrastructure) 9 How collaborative preservation works.
  • 10. Shared Catalog ✤ Unified Discovery ✤ Simple & fielded search ✤ Virtual collection across entire catalog ✤ Browse by subject, data, source ✤ Metadata delivery ✤ Descriptive study, file, and variable information ✤ Provenance & rights metadata ✤ Human, OAI, Z39.50 interfaces ✤ Layered Services ✤ Data reformatting for delivery ✤ On-line analysis 10 How collaborative preservation works.
  • 11. The Dataverse Network ® For Organizations For Scholars ✤ Dataverses are Data-PASS ready -- all dataverses can provide: ✤ The Dataverse Network System is Open-Source and ✤ DDI (2.x) metadata export (intuitive form-based entry) ✤ Creating a Dataverse requires no software. ✤ Catalog access through OAI-PMH (and Z39.50) ✤ IQSS & MRA host an open DVN and offer no-cost ✤ LOCKSS compatibility permanent storage: ✤ Version control (new); Terms of use metadata; Flexible http://dvn.iq.harvard.edu contributor-curator-editor workflows 11 How collaborative preservation works. (better) self-archiving -- ideal for “living collections” &
  • 12. Policy-Driven Distributed Replication ✤ Policy Based ✤ Preservation requirements shape policy ✤ Policy drives replication rules ✤ Auditing demonstrates conformance with preservation requirements ✤ Copies are distributed ✤ Across space ✤ Among institutions ✤ Across time (version history retained) ✤ Commitments scaled to participant resources ✤ Collection size ✤ Technology 12 How collaborative preservation works.
  • 13. Structure of Collaboration Areas of collaboration... Steps to participation ✤ Partnership agreements Partners agree to... agreement on good practice; ✤ Publishing metadata permission to preserve; partners offer to accept data transfer if archive fails ✤ Use of replication system ✤ Coordinated operations ✤ Good archival practice (TRAC compliance not required) shared leads; regular communication; ✤ Transfer protocols collegial review available Partners use the following technlogies ✤ Shared good practice ✤ Light-weight protocols: OAI-PMH + DDI 2-lite + metadata; preservation; confidentiality HTTP harvestable data ✤ Circle of gifts norm ✤ Software: Could use a hosted dataverse or; in-kind effort & resource; install open source OAI-PMH server, etc. contributions are voluntary & proportional ✤ No fear - we can help! 13
  • 14. More Questions? ✤ Know of research data at risk of loss? ✤ Need help preserving your research data? ✤ Want more visibility and protection for your collections ? http://data-pass.org data-pass@icpsr.umich.edu 14 How collaborative preservation works.

Notas do Editor

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n