SlideShare uma empresa Scribd logo
1 de 11
DataONE

Research Data Access & Preservation
21 March 2012

Suzie Allard, Ph.D.
University of Tennessee
DataONE vision and approach
Enable new science and knowledge creation through
universal access to data about life on earth and the
environment that sustains it.
1. Build on existing
   cyberinfrastructure   2. Create new
                            cyberinfrastructure   3. Support communities
                                                     of practice




                                                                           2
                                                                               2
DataONE Cyberinfrastructure
Three major components for a      Member Nodes
flexible, scalable, sustainable   • diverse institutions
                                  Coordinating Nodes
network                           • serve local community
                                  • retain complete metadata
                                  Investigator Toolkit
                                  • provide resources for
                                    catalog
                                    managing their data
                                  • indexing for search
                                  • retain copies of data
                                  • network-wide services
                                  • ensure content
                                    availability (preservation)
                                  • replication services




                                                                  3
Training in all elements of the data life cycle

                                Plan

                   Analyze               Collect
    Kepler




             Integrate                         Assure




                   Discover              Describe

                              Preserve
                                                        4
DataONE Education and Training

Summer Internships
Training at Conferences and Workshops
  • Supercomputing 2011
  • DataONE Implementation Workshop: Publishing data as a
    Member Node
  • Ecological Society of America (ESA)
  • American Geophysical Union (AGU)
Educational Modules
Graduate-level course
  • Summer Institute for Environmental Informatics

                                                            5
On-line Education Modules




                            6
Environmental Information Management (EIM) Institute
Graduate students biology, geology, ecology, or other
environmental sciences, environmental engineering, geography
or science librarianship
Conceptual and practical hands-on
training to effectively
design, manage, analyze, visualize, and
preserve data and information:
• Managing data files
• Creating databases and web portals
• Data analysis and visualization
• Techniques for
   managing, analyzing, and visualizing
   geospatial data

                                                               7
DataONE Team and Sponsors
       • Amber Budden, Roger Dahl, Rebecca                     • Ewa Deelman
         Koskela, Bill Michener, Robert Nahf, Mark
       • Servilla
         Dave Vieglais                                         • Peter Honeyman

       • Suzie Allard, Carol Tenopir, Maribeth                 • Jeff Horsburgh
         Manoff, Kimberley Douglass, Robert
       • Waltz, Bruce Wilson Giri
         John Cobb, Bob Cook,                                  • Robert Sandusky
        Palanismy, Line Pouchard
       • Patricia Cruse, John Kunze                            • Bertram Ludaescher

       • Sky Bristol, Mike Frame, Richard Huffine, Viv         • Peter Buneman
         Hutchison, Jeff Morisette, Jake Weltzin, Lisa Zolly
       • Chris Jones, Stephanie Hampton, Matt                  • Cliff Duke
         Jones
       • Paul Allen, Rick Bonney, Steve Kelling                • Carole Goble

       • Ryan Scherle, Todd Vision                             • Donald Hobern

       • Randy Butler                                          • David DeRoure


                                      LEON LEVY
                                      FOUNDATION                                      8
DataONE Team
               Year 1
                            Year 2




                        Year 3


                                     9
Questions




            10
A Science Use Case

               Diverse bird observations and           Model results
               environmental data from
               300,00 locations in the US      Occurrence of Indigo Bunting (2008)
               integrated and analyzed using
               High Performance Computing
               Resources


Land Cover


                                                 Jan   Ap     Jun   Sep    Dec
                                                       r
Meteorology
                                                 • Examine patterns of
                                                   migration
MODIS –        Spatio-Temporal Exploratory       • Infer how climate
Remote         Model identifies factors            change may affect
sensing data   affecting patterns of               bird migration
               migration


                                                                                     11

Mais conteúdo relacionado

Mais procurados

Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
SEAD
 
Global registries initiative frumkin omodei
Global registries initiative frumkin omodeiGlobal registries initiative frumkin omodei
Global registries initiative frumkin omodei
ASIS&T
 

Mais procurados (20)

Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
 
DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?
 
Data management: international challenges, national infrastructure, and insti...
Data management: international challenges, national infrastructure, and insti...Data management: international challenges, national infrastructure, and insti...
Data management: international challenges, national infrastructure, and insti...
 
EDI Training Module 4: Organizing Data Into Publishable Units
EDI Training Module 4: Organizing Data Into Publishable UnitsEDI Training Module 4: Organizing Data Into Publishable Units
EDI Training Module 4: Organizing Data Into Publishable Units
 
ANDS Applications Program: Building Tools to Facilitate Data Reuse
ANDS Applications Program: Building Tools to Facilitate Data ReuseANDS Applications Program: Building Tools to Facilitate Data Reuse
ANDS Applications Program: Building Tools to Facilitate Data Reuse
 
Provenance in Support of the ANDS Four Transformations
Provenance in Support of the ANDS Four TransformationsProvenance in Support of the ANDS Four Transformations
Provenance in Support of the ANDS Four Transformations
 
Natasha intro to rdm c3 dis may 2018.pptx
Natasha intro to rdm c3 dis may 2018.pptxNatasha intro to rdm c3 dis may 2018.pptx
Natasha intro to rdm c3 dis may 2018.pptx
 
John morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptxJohn morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptx
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
 
Birgit Schmidt: RDA for Libraries from an International Perspective
Birgit Schmidt: RDA for Libraries from an International PerspectiveBirgit Schmidt: RDA for Libraries from an International Perspective
Birgit Schmidt: RDA for Libraries from an International Perspective
 
Sue cook c3 dis dm-ps 1.pptx
Sue cook c3 dis dm-ps 1.pptxSue cook c3 dis dm-ps 1.pptx
Sue cook c3 dis dm-ps 1.pptx
 
Comeaux RDAP11 Data Archives in Federal Agencies
Comeaux RDAP11 Data Archives in Federal AgenciesComeaux RDAP11 Data Archives in Federal Agencies
Comeaux RDAP11 Data Archives in Federal Agencies
 
End of COBWEB Co-Design Projects Celebration
End of COBWEB Co-Design Projects Celebration		End of COBWEB Co-Design Projects Celebration
End of COBWEB Co-Design Projects Celebration
 
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
Using SEAD to Support Collaboration among Land Managers, Scientists, and the ...
 
Global registries initiative frumkin omodei
Global registries initiative frumkin omodeiGlobal registries initiative frumkin omodei
Global registries initiative frumkin omodei
 
Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014
 
Smith RDAP11 NSF Data Management Plan Case Studies
Smith RDAP11 NSF Data Management Plan Case StudiesSmith RDAP11 NSF Data Management Plan Case Studies
Smith RDAP11 NSF Data Management Plan Case Studies
 
A National Approach to Open Data in Ireland: Publishers and Research Data Man...
A National Approach to Open Data in Ireland: Publishers and Research Data Man...A National Approach to Open Data in Ireland: Publishers and Research Data Man...
A National Approach to Open Data in Ireland: Publishers and Research Data Man...
 
Altman RDAP11 Policy-based Data Management
Altman RDAP11 Policy-based Data ManagementAltman RDAP11 Policy-based Data Management
Altman RDAP11 Policy-based Data Management
 
Ignite@AGU14
Ignite@AGU14Ignite@AGU14
Ignite@AGU14
 

Destaque

DataONE User's Group Lifecycle Management: Planning
DataONE User's Group Lifecycle Management:  PlanningDataONE User's Group Lifecycle Management:  Planning
DataONE User's Group Lifecycle Management: Planning
Andrew Sallans
 

Destaque (7)

Hands-On Data Management Planning for Life Sciences
Hands-On Data Management Planning for Life SciencesHands-On Data Management Planning for Life Sciences
Hands-On Data Management Planning for Life Sciences
 
DataONE User's Group Lifecycle Management: Planning
DataONE User's Group Lifecycle Management:  PlanningDataONE User's Group Lifecycle Management:  Planning
DataONE User's Group Lifecycle Management: Planning
 
NSF Data Management Plan Case Study: UVa’s Response.
NSF Data Management Plan Case Study:  UVa’s Response.NSF Data Management Plan Case Study:  UVa’s Response.
NSF Data Management Plan Case Study: UVa’s Response.
 
NSF Data Management Plan - Implications for Librarians
NSF Data Management Plan - Implications for LibrariansNSF Data Management Plan - Implications for Librarians
NSF Data Management Plan - Implications for Librarians
 
Marketing With LinkedIn
Marketing With LinkedInMarketing With LinkedIn
Marketing With LinkedIn
 
Improving Integrity, Transparency, and Reproducibility Through Connection of ...
Improving Integrity, Transparency, and Reproducibility Through Connection of ...Improving Integrity, Transparency, and Reproducibility Through Connection of ...
Improving Integrity, Transparency, and Reproducibility Through Connection of ...
 
Badges to Acknowledge Open Practices
Badges to Acknowledge Open PracticesBadges to Acknowledge Open Practices
Badges to Acknowledge Open Practices
 

Semelhante a DataOne - Suzie Allard - RDAP12

Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
TERN Australia
 
Stuart Phinn and Andy Lowe_TERN's national ecosystem data infrastructure is d...
Stuart Phinn and Andy Lowe_TERN's national ecosystem data infrastructure is d...Stuart Phinn and Andy Lowe_TERN's national ecosystem data infrastructure is d...
Stuart Phinn and Andy Lowe_TERN's national ecosystem data infrastructure is d...
TERN Australia
 
Sharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem DataSharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem Data
TERN Australia
 
Research data lifecycle diagram
Research data lifecycle diagramResearch data lifecycle diagram
Research data lifecycle diagram
Steven Cracknell
 

Semelhante a DataOne - Suzie Allard - RDAP12 (20)

Michener Plenary PPSR2012
Michener Plenary PPSR2012Michener Plenary PPSR2012
Michener Plenary PPSR2012
 
NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...
NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...
NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...
 
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
 
ESI Supplemental Webinar 2 - DataONE presentation slides
ESI Supplemental Webinar 2 - DataONE presentation slides ESI Supplemental Webinar 2 - DataONE presentation slides
ESI Supplemental Webinar 2 - DataONE presentation slides
 
RDAP13 Ixchel Faniel: Can Quantitative Social Scientists Get Data Reuse Satis...
RDAP13 Ixchel Faniel: Can Quantitative Social Scientists Get Data Reuse Satis...RDAP13 Ixchel Faniel: Can Quantitative Social Scientists Get Data Reuse Satis...
RDAP13 Ixchel Faniel: Can Quantitative Social Scientists Get Data Reuse Satis...
 
DataONE_cobb_hubbub2012_20120924_v05
DataONE_cobb_hubbub2012_20120924_v05DataONE_cobb_hubbub2012_20120924_v05
DataONE_cobb_hubbub2012_20120924_v05
 
Stuart Phinn and Andy Lowe_TERN's national ecosystem data infrastructure is d...
Stuart Phinn and Andy Lowe_TERN's national ecosystem data infrastructure is d...Stuart Phinn and Andy Lowe_TERN's national ecosystem data infrastructure is d...
Stuart Phinn and Andy Lowe_TERN's national ecosystem data infrastructure is d...
 
Knowledge Exchange, Nov 2011, Bonn
Knowledge Exchange, Nov 2011, BonnKnowledge Exchange, Nov 2011, Bonn
Knowledge Exchange, Nov 2011, Bonn
 
Ausplots Training - Session 1
Ausplots Training - Session 1Ausplots Training - Session 1
Ausplots Training - Session 1
 
Research Data Sharing LERU
Research Data Sharing LERU Research Data Sharing LERU
Research Data Sharing LERU
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data Resource
 
Introduction to Research Data Management for postgraduate students
Introduction to Research Data Management for postgraduate studentsIntroduction to Research Data Management for postgraduate students
Introduction to Research Data Management for postgraduate students
 
Walker odi -uksg_2013-jenny_walker
Walker odi  -uksg_2013-jenny_walkerWalker odi  -uksg_2013-jenny_walker
Walker odi -uksg_2013-jenny_walker
 
Engaging the Researcher in RDM
Engaging the Researcher in RDMEngaging the Researcher in RDM
Engaging the Researcher in RDM
 
Sharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem DataSharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem Data
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and Libaries
 
Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...
Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...
Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...
 
Data Facilties Workshop - Panel on Global Data Sharing Exemplars
Data Facilties Workshop - Panel on Global Data Sharing ExemplarsData Facilties Workshop - Panel on Global Data Sharing Exemplars
Data Facilties Workshop - Panel on Global Data Sharing Exemplars
 
Research data lifecycle diagram
Research data lifecycle diagramResearch data lifecycle diagram
Research data lifecycle diagram
 
Geospatial Data Insfrastructures, Cybercartography and Open Data: The Need f...
Geospatial Data Insfrastructures, Cybercartography and Open Data:  The Need f...Geospatial Data Insfrastructures, Cybercartography and Open Data:  The Need f...
Geospatial Data Insfrastructures, Cybercartography and Open Data: The Need f...
 

Mais de ASIS&T

Mais de ASIS&T (20)

RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
 
RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...
 
RDAP 16: DMPs and Public Access: Agency and Data Service Experiences
RDAP 16: DMPs and Public Access: Agency and Data Service ExperiencesRDAP 16: DMPs and Public Access: Agency and Data Service Experiences
RDAP 16: DMPs and Public Access: Agency and Data Service Experiences
 
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
 
RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...
RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...
RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...
 
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
 
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
 
RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...
RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...
RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...
 
RDAP 16 Poster: Interpreting Local Data Policies in Practice
RDAP 16 Poster: Interpreting Local Data Policies in PracticeRDAP 16 Poster: Interpreting Local Data Policies in Practice
RDAP 16 Poster: Interpreting Local Data Policies in Practice
 
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
 
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...
 
RDAP 16 Lightning: Spreading the love: Bringing data management training to s...
RDAP 16 Lightning: Spreading the love: Bringing data management training to s...RDAP 16 Lightning: Spreading the love: Bringing data management training to s...
RDAP 16 Lightning: Spreading the love: Bringing data management training to s...
 
RDAP 16 Lightning: RDM Discussion Group: How'd that go?
RDAP 16 Lightning: RDM Discussion Group: How'd that go?RDAP 16 Lightning: RDM Discussion Group: How'd that go?
RDAP 16 Lightning: RDM Discussion Group: How'd that go?
 
RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...
RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...
RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...
 
RDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge Broker
RDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge BrokerRDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge Broker
RDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge Broker
 
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
 
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
 
RDAP 16 Lightning: Personas as a Policy Development Tool for Research Data
RDAP 16 Lightning: Personas as a Policy Development Tool for Research DataRDAP 16 Lightning: Personas as a Policy Development Tool for Research Data
RDAP 16 Lightning: Personas as a Policy Development Tool for Research Data
 
RDAP 16 Lightning: Growing Data in Utah: A Model for Statewide Collaboration
RDAP 16 Lightning: Growing Data in Utah: A Model for Statewide CollaborationRDAP 16 Lightning: Growing Data in Utah: A Model for Statewide Collaboration
RDAP 16 Lightning: Growing Data in Utah: A Model for Statewide Collaboration
 
RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...
RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...
RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...
 

Último

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 

DataOne - Suzie Allard - RDAP12

  • 1. DataONE Research Data Access & Preservation 21 March 2012 Suzie Allard, Ph.D. University of Tennessee
  • 2. DataONE vision and approach Enable new science and knowledge creation through universal access to data about life on earth and the environment that sustains it. 1. Build on existing cyberinfrastructure 2. Create new cyberinfrastructure 3. Support communities of practice 2 2
  • 3. DataONE Cyberinfrastructure Three major components for a Member Nodes flexible, scalable, sustainable • diverse institutions Coordinating Nodes network • serve local community • retain complete metadata Investigator Toolkit • provide resources for catalog managing their data • indexing for search • retain copies of data • network-wide services • ensure content availability (preservation) • replication services 3
  • 4. Training in all elements of the data life cycle Plan Analyze Collect Kepler Integrate Assure Discover Describe Preserve 4
  • 5. DataONE Education and Training Summer Internships Training at Conferences and Workshops • Supercomputing 2011 • DataONE Implementation Workshop: Publishing data as a Member Node • Ecological Society of America (ESA) • American Geophysical Union (AGU) Educational Modules Graduate-level course • Summer Institute for Environmental Informatics 5
  • 7. Environmental Information Management (EIM) Institute Graduate students biology, geology, ecology, or other environmental sciences, environmental engineering, geography or science librarianship Conceptual and practical hands-on training to effectively design, manage, analyze, visualize, and preserve data and information: • Managing data files • Creating databases and web portals • Data analysis and visualization • Techniques for managing, analyzing, and visualizing geospatial data 7
  • 8. DataONE Team and Sponsors • Amber Budden, Roger Dahl, Rebecca • Ewa Deelman Koskela, Bill Michener, Robert Nahf, Mark • Servilla Dave Vieglais • Peter Honeyman • Suzie Allard, Carol Tenopir, Maribeth • Jeff Horsburgh Manoff, Kimberley Douglass, Robert • Waltz, Bruce Wilson Giri John Cobb, Bob Cook, • Robert Sandusky Palanismy, Line Pouchard • Patricia Cruse, John Kunze • Bertram Ludaescher • Sky Bristol, Mike Frame, Richard Huffine, Viv • Peter Buneman Hutchison, Jeff Morisette, Jake Weltzin, Lisa Zolly • Chris Jones, Stephanie Hampton, Matt • Cliff Duke Jones • Paul Allen, Rick Bonney, Steve Kelling • Carole Goble • Ryan Scherle, Todd Vision • Donald Hobern • Randy Butler • David DeRoure LEON LEVY FOUNDATION 8
  • 9. DataONE Team Year 1 Year 2 Year 3 9
  • 10. Questions 10
  • 11. A Science Use Case Diverse bird observations and Model results environmental data from 300,00 locations in the US Occurrence of Indigo Bunting (2008) integrated and analyzed using High Performance Computing Resources Land Cover Jan Ap Jun Sep Dec r Meteorology • Examine patterns of migration MODIS – Spatio-Temporal Exploratory • Infer how climate Remote Model identifies factors change may affect sensing data affecting patterns of bird migration migration 11

Notas do Editor

  1. The DataONE mission/vision is to “enable new science and knowledge creation through universal access to data about life on earth and the environment that sustains it.” DataONE is based on three precepts. 1. We are leveraging existing infrastructure such as the hundreds of existing data centers and repositories, and the myriad of software tools. 2. We are focusing our efforts on developing new infrastructure that better enables interoperability across data centers and between scientific tools and data resources. [The new cyberinfrastructure being created by DataONE is illustrated on a future slide.] 3. We recognize that the largest challenges are sociocultural in nature, and thus we focus significant attention on engaging and supporting the broader community of stakeholders (e.g. scientists, students, librarians).
  2. DataONE is a federated data network built to improve access to Earth science data, and to support science by: engaging the relevant science, data, and policy communities; facilitating easy, secure, and persistent storage of data; and disseminating integrated and user-friendly tools for data discovery, analysis, visualization, and decision-making. There are three principal components:Member Nodes that include a diverse array of data centers and repositories that are associated with national and international agencies and research networks, universities, libraries, etc.Coordinating Nodes that support data replication across Member Nodes (i.e., data centers) as well as network wide services like 24/7 access to metadata at the CNs, indexing and rapid search and discovery, etc.An Investigator Toolkit that includes tools that are widely used by scientists, The tools are coupled with the DataONE resources so that it is, for example, possible to seamlessly and transparently access data at Member Nodes through the tool of your choice.
  3. Other development activities during years 3-5 will focus on expanding the suite of tools that are available through the Investigator Toolkit. New tool additions will be identified and prioritized by the DataONE Users Group.
  4. Other development activities during years 2-5 will focus on expanding the suite of tools that are available through the Investigator Toolkit. New tool additions will be identified and prioritized by the DataONE Users Group.
  5. This final slide illustrates the initial DataONE partners that have now been involved for over 3 years, since the proposal was conceived. The DataONE Users Group now includes significantly more partners and we expect to grow exponentially over the next five years.
  6. The DataONE team is growing!
  7. The Scientific Exploration, Visualization and Analysis Working Group is an example of a scientific use case. By running through a comprehensive case study, this working group was able to provide specific guidance on the challenges faced when conducting data intensive science. Challenges that were communicated to, and met by, the DataONE core CI team and developers.Science requires: Multiple cooperating extreme scale CI components (EVA/eBird pilot lesson learned)EVA pilot collaborated with TeraGrid (now XSEDE) to use HPC and “schlep” data as part of the workflow50K cpu-core hours (SU’s) last year(supporting SOTB 2011)3M hours allocated this year (Cornell CLO team has optimized code for 3-10X speedup, loosened data transfer bottleneck, so we will under run)Plan for 500 species (3 yr data) runs. Currently: 70/wk for 2011 campaignHPC use 10X 2 years in a row. Data increases as well.Conclusion: success breeds scale