SlideShare uma empresa Scribd logo
1 de 26
SEAD: Sustainable Environment
through Actionable Data
Margaret Hedstrom
Professor of Information
Faculty Associate Institute for Social Research (ICPSR)
PI, SEAD
June 23, 2014
Overview
• What is SEAD?
• Vision and Rationale
• Target Audience and User Communities
• Current Status
• SEAD, Universities, and Libraries
• Some Lessons Learned (so far)
• Plans and Future Engagement
2
What is SEAD?
• A Cooperative Agreement funded by NSF to
develop sustainable cyberinfrastructure for
preservation and access to scientific data ($8
million/5 years)
• A partnership between the universities of
Michigan, Indiana and Illinois
• An emerging set of services for data management,
sharing, curation, discovery and preservation for
researchers in the “long tail”
• A case study of data needs in sustainability
science
3
SEAD Vision and Rationale
• Small teams, researchers with short-term
projects, and individual scientists (the long tail)
are under served by today’s data preservation
and access infrastructure
• These communities will take advantage of
evolving data preservation and access
infrastructure if:
– it supports science objectives and enables new kinds
of science
– it is easy to use
– collaborators and peers are also using it
• Sustainability science is a good test case
Researcher(s)
Create and
Analyze Data
Researchers
Publish
Results
?
Researchers
Deposit Data
Libraries
Acquire
Publications
Repositories
Curate Data
Researchers
Search for
Publications
Researchers
Integrate, Create
New Data
Researchers
Search for
Data
Data Preservation and Access Today
Researcher(s)
Create and
Analyze Data
Researchers
Publish
Results
?
Researchers
Deposit Data
Libraries
Acquire
Publications
Repositories
Curate Data
Researchers
Search for
Publications
Researchers
Integrate, Create
New Data, and
Analyze Data
Researchers
Search for
Data
Data Preservation and Access Today
Research
Question
SEARCH for
People
Publications
Data
Collaboration
Environment
Discovery and Access
Environments
Combine,
Integrate,
Analyze
Preservation
Environments
SEAD Vision
Share
Improve
Curate Data
Upload/Do
wnload
DataSEAD ACR
SEAD Virtual Archive
SEAD Social Network
Target Audience / User Communities
Sustainability Scientists
• Focused on problems that require data, methods,
tools, and expertise from multiple disciplines
• Requires many different types of data about physical,
natural, and social phenomena in order to understand
interactions between natural and human systems
• Uses a combination of observational (field) data,
experimental data, simulations, and models
• Conducts research in small to medium-sized labs or
centers under the direction of a single PI or a Center
Director.
8
Target Audience / User Communities
the “Long Tail” of Scientific Research
• Data discovery is via targeted foraging and word-of-mouth
• Almost all data are stored locally
• Minimal local IT support
• Metadata standards and ontologies, where they do exist, are based
on disciplinary norms or local practices
• Data formats and metadata standards are often controlled by
multiple independent third-parties (e.g. instrument and application
providers
• Data are vulnerable to interruptions in organizational arrangements
(graduate students finish PhD’s and move on – lab or center funding
sunsets)
• No single data set is likely to have great value standing alone, but
when aggregated, combined and integrated data become valuable
resources of discovery and innovation.
9
Overview
Project Start 10/01/11
User Requirements Report 5/12
NCED Repository Ingest 8/12
Prototype Review 4/22/13
SEAD 1.0 Released 10/13
DataOne Member Node 11/13
End User Workshop 4/11/14
10th User Group 5/11/14
36-Month Review 10/14/14
Renewal (?) for Years 6-10
10
Summary of Current Status
• Working Platform
– SEAD Active Content Repository (ACR)
• Collaboration / File Sharing Space for Research Projects
• Staging Area for Data Prior to Publishing or Archiving
– SEAD Virtual Archive
• Capability to push data from ACR and/or local research
environments to preservation and discovery services
(Institutional Repositories/DataONE)
– SEAD Research Network
• Researcher initiated profiles with harvesting of citations,
linkage of data-people-publications, reporting
11
SEAD Prototype
SEAD, Universities, and Libraries
• From the researcher’s perspective
– SEAD is an project work space that enables data
sharing, commenting, secure storage, extraction
of metadata, and active/social curation
• From the university research infrastructure
perspective
– SEAD is a staging area for data curation prior to
publication, submission, and preservation
13
Data Set Publishing Workflow
•Data content used
within ACR
•Researcher Profile
Established in VIVO
NCED Data Set
Ingested to ACR
•Data Set ready to
publish
NCED Data Set
Ingested to VA •DataCite minted
DOI attached to
finalized Data Set
NCED Data Set
Deposited with IR
•DOI Resolution to
designated IR
NCED Data Set
Published to
VIVO
Data Citation
Example - Person
Data Citation
Example - Dataset
DOI
Authors
Subject areas
Abstract
Geographic focus
Rights information
SEAD: Explore Sustainability Research
PEOPLE ORGANIZATIONS
RESEARCH
(DATA + PUBLICATIONS)
NCED Publications in VIVO
SEAD Virtual Archive
• Purpose: Long-term preservation and discovery
– Thin virtualization layer on top of multiple university
Institutional Repositories (IRs)
– Enhances IRs by being sustainability science-aware
• Team: IU Libraries, UIUC Libraries, and Data To
Insight Center at IU
• Starting point: Data Conservancy code (Johns
Hopkins U.)
– Extended for sustainability science long tail use cases
Making Data Sustainable: Use Case
Active Curation
Repository
(ACR)
SEAD Virtual
Archive
IUScholarwork
s
UIUC Ideals
Packaged
object
Preserve data
Keep private for 5 years
Index data, metadata
and relationships
• Collected data about Lower
Mississippi flood
• Stored in Active Repository
• Organized as a collection
• Marked “Ready for
publication”
• Collections visible to team only
for 5 years
• Deposited to repository based
on dataset creator affiliation
• Find by author, location,
keywords or repository
Preview
Data
Upload
Data to
VA
Run
Virus
Checking
File
Charact-
erization
Mint
DOI
Deposit
to IR (&
cloud)
Update
DOI
target
Index
Metadata
Index
Scientific
Metadata
Large
Dataset
Decision
Version
Data
IR
Match-
maker
Index
Scientific
Metadata
Accept
Repository
Agreement
Ingest Workflow into SEAD VA
Link to live demo http://seadva.d2i.indiana.edu:8181/sead-access/#login
Successful automatic
ingest into UIUC IDEALS
repository
Communication with IRs
Datasets deposited into IU SDA, IU
Scholworks and UIUC IDEALS
Some Lessons Learned
• Some researchers and projects in the “long tail”
have sophisticated demands for active data
services
• Supporting analysis of data in SEAD adds
complexity and cost
• Users want some degree of customization of
bare-bones file storage and active project space
• A big gap remains between data producers and
the campus/library/archive infrastructures for
long-term access and preservation.
24
SEAD Priorities and Future Plans
• Make SEAD more stable and more usable
• Attract a larger, broader, and more diverse
user community
– Network effects in the long tail
– Self service
• Expand repository options
• Resolve Governance and Sustainability
25
More info
• www.sead-data.net
• Give or send email to myersjd@umich.edu for
access to the SEAD Demo site

Mais conteúdo relacionado

Mais procurados

ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsSEAD
 
NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14SEAD
 
Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceRobert H. McDonald
 
RDAP14: Learning to Curate Panel
RDAP14: Learning to Curate Panel RDAP14: Learning to Curate Panel
RDAP14: Learning to Curate Panel ASIS&T
 
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectRDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectASIS&T
 
Repository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRepository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRobert H. McDonald
 
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...ASIS&T
 
RDAP 15 Navigating the Rocky Road to Research Data Acceptance
RDAP 15 Navigating the Rocky Road to Research Data AcceptanceRDAP 15 Navigating the Rocky Road to Research Data Acceptance
RDAP 15 Navigating the Rocky Road to Research Data AcceptanceASIS&T
 
Improving Data Management Capacity in the Mekong Basin Using SEAD
Improving Data Management Capacity in the Mekong Basin Using SEADImproving Data Management Capacity in the Mekong Basin Using SEAD
Improving Data Management Capacity in the Mekong Basin Using SEADSEAD
 
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017ARDC
 
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...ASIS&T
 
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and FuturePoster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and FutureASIS&T
 
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...ARDC
 

Mais procurados (20)

ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and Tools
 
NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14
 
Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability Science
 
RDAP14: Learning to Curate Panel
RDAP14: Learning to Curate Panel RDAP14: Learning to Curate Panel
RDAP14: Learning to Curate Panel
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectRDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Repository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRepository Federation: Towards Data Interoperability
Repository Federation: Towards Data Interoperability
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...
 
RDAP 15 Navigating the Rocky Road to Research Data Acceptance
RDAP 15 Navigating the Rocky Road to Research Data AcceptanceRDAP 15 Navigating the Rocky Road to Research Data Acceptance
RDAP 15 Navigating the Rocky Road to Research Data Acceptance
 
Improving Data Management Capacity in the Mekong Basin Using SEAD
Improving Data Management Capacity in the Mekong Basin Using SEADImproving Data Management Capacity in the Mekong Basin Using SEAD
Improving Data Management Capacity in the Mekong Basin Using SEAD
 
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
 
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
 
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and FuturePoster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
 
Zucca "Technology & Systems"
Zucca "Technology & Systems"Zucca "Technology & Systems"
Zucca "Technology & Systems"
 
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
 
NISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management PlanNISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management Plan
 

Semelhante a Presentation to the UM Library Emergent Research Series

Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Historic Environment Scotland
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...EDINA, University of Edinburgh
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Robin Rice
 
Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"Anita de Waard
 
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET
 
Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries? Robin Rice
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersIncisive_Events
 
Developing Research Data Management Policy and Services
Developing Research Data Management Policy and ServicesDeveloping Research Data Management Policy and Services
Developing Research Data Management Policy and ServicesRobin Rice
 
Introduction to Research Data Management at Lancaster University
Introduction to Research Data Management at Lancaster UniversityIntroduction to Research Data Management at Lancaster University
Introduction to Research Data Management at Lancaster UniversityLancaster University Library
 
Open data and research data management at the University of Edinburgh: polici...
Open data and research data management at the University of Edinburgh: polici...Open data and research data management at the University of Edinburgh: polici...
Open data and research data management at the University of Edinburgh: polici...Robin Rice
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...EDINA, University of Edinburgh
 
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...Natsuko Nicholls
 
Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...EDINA, University of Edinburgh
 
Research Data Service at the University of Edinburgh
Research Data Service at the University of EdinburghResearch Data Service at the University of Edinburgh
Research Data Service at the University of EdinburghRobin Rice
 
Staffing Research Data Services at University of Edinburgh
Staffing Research Data Services at University of EdinburghStaffing Research Data Services at University of Edinburgh
Staffing Research Data Services at University of EdinburghRobin Rice
 
Educause 2015 RDM Maturity
Educause 2015 RDM Maturity Educause 2015 RDM Maturity
Educause 2015 RDM Maturity ResearchSpace
 
Improving RDM through closer integration of electronic lab notebooks and data...
Improving RDM through closer integration of electronic lab notebooks and data...Improving RDM through closer integration of electronic lab notebooks and data...
Improving RDM through closer integration of electronic lab notebooks and data...rmacneil88
 
Open Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsOpen Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsMartin Donnelly
 

Semelhante a Presentation to the UM Library Emergent Research Series (20)

Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...
 
Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"
 
Engaging the Researcher in RDM
Engaging the Researcher in RDMEngaging the Researcher in RDM
Engaging the Researcher in RDM
 
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
 
Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producers
 
Developing Research Data Management Policy and Services
Developing Research Data Management Policy and ServicesDeveloping Research Data Management Policy and Services
Developing Research Data Management Policy and Services
 
Introduction to Research Data Management at Lancaster University
Introduction to Research Data Management at Lancaster UniversityIntroduction to Research Data Management at Lancaster University
Introduction to Research Data Management at Lancaster University
 
Open data and research data management at the University of Edinburgh: polici...
Open data and research data management at the University of Edinburgh: polici...Open data and research data management at the University of Edinburgh: polici...
Open data and research data management at the University of Edinburgh: polici...
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...
 
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
 
Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...
 
Research Data Service at the University of Edinburgh
Research Data Service at the University of EdinburghResearch Data Service at the University of Edinburgh
Research Data Service at the University of Edinburgh
 
Staffing Research Data Services at University of Edinburgh
Staffing Research Data Services at University of EdinburghStaffing Research Data Services at University of Edinburgh
Staffing Research Data Services at University of Edinburgh
 
Educause 2015 RDM Maturity
Educause 2015 RDM Maturity Educause 2015 RDM Maturity
Educause 2015 RDM Maturity
 
Improving RDM through closer integration of electronic lab notebooks and data...
Improving RDM through closer integration of electronic lab notebooks and data...Improving RDM through closer integration of electronic lab notebooks and data...
Improving RDM through closer integration of electronic lab notebooks and data...
 
Open Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsOpen Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and Solutions
 

Mais de SEAD

Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...
Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...
Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...SEAD
 
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...SEAD
 
Ignite@AGU14
Ignite@AGU14Ignite@AGU14
Ignite@AGU14SEAD
 
Preservation, Publishing, and People: A SEAD View
Preservation, Publishing, and People: A SEAD ViewPreservation, Publishing, and People: A SEAD View
Preservation, Publishing, and People: A SEAD ViewSEAD
 
An Overview of Plans for SEAD
An Overview of Plans for SEADAn Overview of Plans for SEAD
An Overview of Plans for SEADSEAD
 
SEAD Prototype: Data Curation and Preservation for Sustainability Science
SEAD Prototype: Data Curation and Preservation for Sustainability ScienceSEAD Prototype: Data Curation and Preservation for Sustainability Science
SEAD Prototype: Data Curation and Preservation for Sustainability ScienceSEAD
 
SEAD: Opening Data in the "Long Tail" for Active and Social Curation
SEAD: Opening Data in the "Long Tail" for Active and Social CurationSEAD: Opening Data in the "Long Tail" for Active and Social Curation
SEAD: Opening Data in the "Long Tail" for Active and Social CurationSEAD
 
SEAD: A system to support social and active data curation
SEAD: A system to support social and active data curationSEAD: A system to support social and active data curation
SEAD: A system to support social and active data curationSEAD
 
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...SEAD
 

Mais de SEAD (9)

Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...
Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...
Poster: Using SEAD to Support Collaboration among Land Managers, Scientists, ...
 
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
 
Ignite@AGU14
Ignite@AGU14Ignite@AGU14
Ignite@AGU14
 
Preservation, Publishing, and People: A SEAD View
Preservation, Publishing, and People: A SEAD ViewPreservation, Publishing, and People: A SEAD View
Preservation, Publishing, and People: A SEAD View
 
An Overview of Plans for SEAD
An Overview of Plans for SEADAn Overview of Plans for SEAD
An Overview of Plans for SEAD
 
SEAD Prototype: Data Curation and Preservation for Sustainability Science
SEAD Prototype: Data Curation and Preservation for Sustainability ScienceSEAD Prototype: Data Curation and Preservation for Sustainability Science
SEAD Prototype: Data Curation and Preservation for Sustainability Science
 
SEAD: Opening Data in the "Long Tail" for Active and Social Curation
SEAD: Opening Data in the "Long Tail" for Active and Social CurationSEAD: Opening Data in the "Long Tail" for Active and Social Curation
SEAD: Opening Data in the "Long Tail" for Active and Social Curation
 
SEAD: A system to support social and active data curation
SEAD: A system to support social and active data curationSEAD: A system to support social and active data curation
SEAD: A system to support social and active data curation
 
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
 

Último

Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 

Último (20)

Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 

Presentation to the UM Library Emergent Research Series

  • 1. SEAD: Sustainable Environment through Actionable Data Margaret Hedstrom Professor of Information Faculty Associate Institute for Social Research (ICPSR) PI, SEAD June 23, 2014
  • 2. Overview • What is SEAD? • Vision and Rationale • Target Audience and User Communities • Current Status • SEAD, Universities, and Libraries • Some Lessons Learned (so far) • Plans and Future Engagement 2
  • 3. What is SEAD? • A Cooperative Agreement funded by NSF to develop sustainable cyberinfrastructure for preservation and access to scientific data ($8 million/5 years) • A partnership between the universities of Michigan, Indiana and Illinois • An emerging set of services for data management, sharing, curation, discovery and preservation for researchers in the “long tail” • A case study of data needs in sustainability science 3
  • 4. SEAD Vision and Rationale • Small teams, researchers with short-term projects, and individual scientists (the long tail) are under served by today’s data preservation and access infrastructure • These communities will take advantage of evolving data preservation and access infrastructure if: – it supports science objectives and enables new kinds of science – it is easy to use – collaborators and peers are also using it • Sustainability science is a good test case
  • 5. Researcher(s) Create and Analyze Data Researchers Publish Results ? Researchers Deposit Data Libraries Acquire Publications Repositories Curate Data Researchers Search for Publications Researchers Integrate, Create New Data Researchers Search for Data Data Preservation and Access Today
  • 6. Researcher(s) Create and Analyze Data Researchers Publish Results ? Researchers Deposit Data Libraries Acquire Publications Repositories Curate Data Researchers Search for Publications Researchers Integrate, Create New Data, and Analyze Data Researchers Search for Data Data Preservation and Access Today
  • 7. Research Question SEARCH for People Publications Data Collaboration Environment Discovery and Access Environments Combine, Integrate, Analyze Preservation Environments SEAD Vision Share Improve Curate Data Upload/Do wnload DataSEAD ACR SEAD Virtual Archive SEAD Social Network
  • 8. Target Audience / User Communities Sustainability Scientists • Focused on problems that require data, methods, tools, and expertise from multiple disciplines • Requires many different types of data about physical, natural, and social phenomena in order to understand interactions between natural and human systems • Uses a combination of observational (field) data, experimental data, simulations, and models • Conducts research in small to medium-sized labs or centers under the direction of a single PI or a Center Director. 8
  • 9. Target Audience / User Communities the “Long Tail” of Scientific Research • Data discovery is via targeted foraging and word-of-mouth • Almost all data are stored locally • Minimal local IT support • Metadata standards and ontologies, where they do exist, are based on disciplinary norms or local practices • Data formats and metadata standards are often controlled by multiple independent third-parties (e.g. instrument and application providers • Data are vulnerable to interruptions in organizational arrangements (graduate students finish PhD’s and move on – lab or center funding sunsets) • No single data set is likely to have great value standing alone, but when aggregated, combined and integrated data become valuable resources of discovery and innovation. 9
  • 10. Overview Project Start 10/01/11 User Requirements Report 5/12 NCED Repository Ingest 8/12 Prototype Review 4/22/13 SEAD 1.0 Released 10/13 DataOne Member Node 11/13 End User Workshop 4/11/14 10th User Group 5/11/14 36-Month Review 10/14/14 Renewal (?) for Years 6-10 10
  • 11. Summary of Current Status • Working Platform – SEAD Active Content Repository (ACR) • Collaboration / File Sharing Space for Research Projects • Staging Area for Data Prior to Publishing or Archiving – SEAD Virtual Archive • Capability to push data from ACR and/or local research environments to preservation and discovery services (Institutional Repositories/DataONE) – SEAD Research Network • Researcher initiated profiles with harvesting of citations, linkage of data-people-publications, reporting 11
  • 13. SEAD, Universities, and Libraries • From the researcher’s perspective – SEAD is an project work space that enables data sharing, commenting, secure storage, extraction of metadata, and active/social curation • From the university research infrastructure perspective – SEAD is a staging area for data curation prior to publication, submission, and preservation 13
  • 14. Data Set Publishing Workflow •Data content used within ACR •Researcher Profile Established in VIVO NCED Data Set Ingested to ACR •Data Set ready to publish NCED Data Set Ingested to VA •DataCite minted DOI attached to finalized Data Set NCED Data Set Deposited with IR •DOI Resolution to designated IR NCED Data Set Published to VIVO
  • 16. Data Citation Example - Dataset DOI Authors Subject areas Abstract Geographic focus Rights information
  • 17. SEAD: Explore Sustainability Research PEOPLE ORGANIZATIONS RESEARCH (DATA + PUBLICATIONS)
  • 19. SEAD Virtual Archive • Purpose: Long-term preservation and discovery – Thin virtualization layer on top of multiple university Institutional Repositories (IRs) – Enhances IRs by being sustainability science-aware • Team: IU Libraries, UIUC Libraries, and Data To Insight Center at IU • Starting point: Data Conservancy code (Johns Hopkins U.) – Extended for sustainability science long tail use cases
  • 20. Making Data Sustainable: Use Case Active Curation Repository (ACR) SEAD Virtual Archive IUScholarwork s UIUC Ideals Packaged object Preserve data Keep private for 5 years Index data, metadata and relationships • Collected data about Lower Mississippi flood • Stored in Active Repository • Organized as a collection • Marked “Ready for publication” • Collections visible to team only for 5 years • Deposited to repository based on dataset creator affiliation • Find by author, location, keywords or repository
  • 21. Preview Data Upload Data to VA Run Virus Checking File Charact- erization Mint DOI Deposit to IR (& cloud) Update DOI target Index Metadata Index Scientific Metadata Large Dataset Decision Version Data IR Match- maker Index Scientific Metadata Accept Repository Agreement Ingest Workflow into SEAD VA Link to live demo http://seadva.d2i.indiana.edu:8181/sead-access/#login
  • 22. Successful automatic ingest into UIUC IDEALS repository
  • 23. Communication with IRs Datasets deposited into IU SDA, IU Scholworks and UIUC IDEALS
  • 24. Some Lessons Learned • Some researchers and projects in the “long tail” have sophisticated demands for active data services • Supporting analysis of data in SEAD adds complexity and cost • Users want some degree of customization of bare-bones file storage and active project space • A big gap remains between data producers and the campus/library/archive infrastructures for long-term access and preservation. 24
  • 25. SEAD Priorities and Future Plans • Make SEAD more stable and more usable • Attract a larger, broader, and more diverse user community – Network effects in the long tail – Self service • Expand repository options • Resolve Governance and Sustainability 25
  • 26. More info • www.sead-data.net • Give or send email to myersjd@umich.edu for access to the SEAD Demo site

Notas do Editor

  1. MH: Revise Slide and Clarify message. One might say the everyone is under-served by today’s DPAI but Interdisciplinary researchers have particular barriers / requirements Multiple Sources extracts from reference databases observations experimental results and model outputs images derived data products Multiple file types, data types, data structures, data models Multiple resolutions (spatial, temporal, granularity) Multiple metadata standards and ontologies Local standards and data practices developed on the fly Data are vulnerable to interruptions in organizational arrangements graduate students finish PhD’s and move on project funding lapses lab or center funding sunsets One might also say that the long tail under utilizes existing DPAI (which is true) but for good reasons.
  2. Build from Praveen’s life cycles. Mention some of the steps that occur in curation. Mention time lag
  3. Build from Praveen’s life cycles. Mention some of the steps that occur in curation. Mention time lag
  4. Support inter-disciplinary research and data driven research by: Enabling access to: Publications Data People (Expertise / Potential Collaborators in novel innovative ways that continuously anticipate and adapt to changes in technologies and in user needs and expectations; Specifically, Accelerate data discovery Support new types of analyses with heterogeneous data Reduce overall costs of curation [rather than shift costs between researchers and repositories] Accelerate the movement of data from researchers into preservation, discovery and access environments Increase the quantity, improve the quality, and enhance the utility of scientific data for reuse.
  5. Start 2:01 Stop 4:00 ACR Start 4:00 Stop: 4:53 Vivo Start 9:57 Stop: 11:21 11:55 – end Vivo
  6. Might move this section on Ingest workflow?
  7. Reporting (Extra win for SEAD) and responsive to the community
  8.  - less emphasis on features and functionality -  remove "context" slides (done) matchmaker workflow slide – simplify make multiple dimensions of decision-making process of matchmaker more clear - record a demo of how ingest and matchmaking works deposit to ideals; make decision-making process points clear through example of Praveen, and demonstrate visually the embargo in ideals - move DataNet slide to other decks (done)
  9. VA - ACR interactions - user or science side of the story A researcher at U of Illinois led the data collection effort related to Lower Mississippi flood. The data have been collected and uploaded to ACR. In ACR the data have been organized into collections, processed for easy previewing and described (tagged and annotated). One subcollection has been marked as “Ready to publish”, i.e., ready for long-term preservation. Praveen wants to preserve the subcollection, but keep it private for 5 years. SEAD Virtual Archive queries ACR and finds this subcollection. It packages the subcollection using its BagIT protocol and invokes its matchmaker algorithm to decide where to ingest the subcollection. The Matchmaker queries VIVO and finds that Praveen is from the University of Illinois. VA automatically creates a collection in IDEALS and marks it “embargoed” for 5 years. After the collection is ingested, it appears in Virtual Archive and in IDEALS. In Virtual Archive this collection can be found by searching by author, location, keywords and repository. In the future, it will also provide search by data types (e.g., images, geo, video, etc.), instruments (e.g., Lidar, Aviris) and methods (e.g., data models, experiments, etc.)
  10. - VA - IR communication - bring out details about solutions for large files (SDA), explain why numbers of files are so different for SDA, Scholarworks and Ideals (slide 10)
  11. After Lunch