Advancing the fourth paradigm of research: Assimilating repositories into active research phases
1. Title Here
Advancing the fourth paradigm of research:
Assimilating repositories into active
research phases
Tyler Walters
Dean, University Libraries, Virginia Tech
SPARC Conference, Kansas City, March 12, 2012 Title Here, Optional or
Unit Identifier
2. The Rise of
Virtual Environments
Repositories are being woven into “virtual ecosystems,”
they are holistic and support communities of practice
• Early stages / deposit: raw/early phase data, notes, etc.
• Annotating, sharing within research groups, commenting, etc.
• Research proposal writing, project planning, etc.
• Tools:
• Discovery, analysis, visualization, and text/data/image mining
are being used in concert with repositories
• Virtual communities and their communication tools
• e.g., social media and community networking capabilities
2
3. Which projects are highlighted?
TARDIS
Purdue University Research Repository | PURR
3
4. e-Research
Version 0.9
Protein Crystallography Research Data and Metadata Workflow 3/6/2011
Computer
Researcher's Computer '
Cluster
Research Admin
RM4
Repository
Synchrotron
MX1 / MX2
Metaman MyTardis MyTardis
Beamline
Monash Raw Data
Metadata Protein Crystallography The Australian Repositories
Extraction for Diffraction ImageS
Research Data Management Platform
(for raw, processed, refined, and published data)
Australian
ORCA Research Data
Commons
From Capture Metadata
Harvester
to Publication
MyTardis
MyTardis LEGEND
Institutional Data & Metadata
Research
Institutional MyTardis Metadata
Data
Research
Future?: Registry
Data
Proposed
Registry Institutional
Virtual lab Research
Data
system Registry
5. Early Stage/Deposit
• Move curation upstream in the data/information life cycle
• Automatically capture metadata, defined by the data producers
• Provide facilities for annotation and mark-up of data
5
6. Early Stage:
The Active Curation Model
Active Curation Social Media
Workflows
Data Review
Rating
Commenting
Metadata
7. Tools and Toolkits
• A Critical Intersection in the ‘Virtual Ecosystem’ is:
Tools + Repositories
• Developing toolkits for discovery, analysis, visualization,
and text/data/image mining… all are being used with
repositories
• Leveraging existing tools (open source and proprietary)
• Incorporating custom, discipline-specific tools
7
8. Tools & Toolkits
Functionality:
• By data type
• Search
• Visualization
• Subsetting
• Analysis
• Services
8
10. Communities and
Communication
• Co-authorship
• Co-funding
• Micro-citation
• Shared project
repositories
• Shared tags
• Threaded discussions
• Quoting, forwarding, …
• (reviewing, commenting)
(slide from SEAD)
11. Working Group Support
• File share
• Wikis
• To do lists
• Blogs
• Calendars
• Forums
• Project notes
• Commenting
• Tagging
• Proposal
writing
11
12. How do IRs and “papers” fit in?
IRs are being leveraged in these new developments
• Services over an Network of Data
Producers
active content layer
that is backed
by/harvested into a
Web User Interface
federated archive
infrastructure based Active Content Repository
Services Provided
on institutional Content Curation Archival
data
Other
Mining Decisions services
resources generation
Virtual Archives
Institutional Repositories
(slide from SEAD) Data RPI UIUC UM
IU ICPSR
Conservancy
User Network
13. Linked Data and Repositories
• Tag and annotate data
• Overlay it with reference data
• Organize it in domain terminology
• Link it to people, papers, projects,
conversations…
(slide from SEAD)
14. Thank you…
Tyler Walters
tyler.walters@vt.edu
tywalters1 = Skype / Twitter
Acknowledgements for slides and conversations:
• Robert McDonald (Indiana), SEAD
• William Michener (New Mexico), DataONE
• Antohny Bietz and Steve Androulakis (Monash), TARDIS
• Michael Witt (Purdue), PURR
• Suzie Allard (Tennessee), ORNL DAAC
• Sayeed Chourdhury (Johns Hopkins), Data Conservancy
14