SlideShare uma empresa Scribd logo
1 de 20
Facilitating	
  data	
  stewardship	
  
                    practices	
  for	
  scientists	
  
                                   	
  


Carly	
  Strasser	
  |	
  carly.strasser@ucop.edu	
  |	
  www.carlystrasser.net	
  
        Open	
  Access	
  symposium	
  |	
  University	
  of	
  North	
  Texas	
  |	
  May	
  2012	
  
UGLY	
  TRUTH	
  
                                                    Many	
  
                                                    Earth	
  |	
  Environmental	
  |	
  Ecological	
  
                                                    scientists…	
  	
  
                                                    	
  
5shortessays.blogspot.com	
  



                                                                 	
  
                          are	
  not	
  taught	
  data	
  management	
  
                          don’t	
  know	
  what	
  metadata	
  are	
  
                          can’t	
  name	
  data	
  centers	
  or	
  repositories	
  
                          don’t	
  share	
  data	
  publicly	
  or	
  store	
  it	
  in	
  an	
  archive	
  
                          aren’t	
  convinced	
  they	
  should	
  share	
  data	
  

                                                                           	
  
Where	
  data	
  end	
  up	
  
                                                       From	
  Flickr	
  by	
  diylibrarian	
  




                                                                                                  www




                         blog.order2disorder.com	
  




                                                                                                  From	
  Flickr	
  by	
  csessums	
  
  Data	
  
Metadata	
  




                                                                                                      From	
  Flickr	
  by	
  csessums	
  
                                                                         Recreated	
  from	
  Klump	
  et	
  al.	
  2006	
  
Where	
  data	
  end	
  up	
  
                                                                    From	
  Flickr	
  by	
  diylibrarian	
  




                                                                                                               www




  Data	
  
                                                                                          www
Metadata	
  
                             From	
  Flickr	
  by	
  torkildr	
  




                                                                                      Recreated	
  from	
  Klump	
  et	
  al.	
  2006	
  
Intercept	
  the	
  
 researchers	
  where	
  
they	
  already	
  work:	
  
Frequency	
  of	
  
                                                           Excel	
  use	
                    Rare	
  or	
  
                                                                                             occasional	
  
                                                                                             use	
  
                                                                                                        Moderate	
  
                                                                                                        use	
  
            Percent	
  of	
  respondents	
  who	
  use	
  
            Excel	
  for	
  these	
  tasks	
  
100	
                                                                                Every	
  day	
  
  90	
                                                                               or	
  almost	
  
  80	
                                                                               every	
  day	
  
  70	
  
  60	
  
  50	
  
  40	
  
  30	
  
  20	
  
  10	
  
    0	
  
             Organizing	
     Visualizing	
     Sta:s:cs	
     Sharing	
  data	
  
                data	
           data	
  
Facilitate	
  
                        Archiving	
  
        Data	
                              Data	
  Reuse	
  
management	
             Sharing	
  
&	
  organization	
                       Reproducibility	
  
                        Publishing	
  
•    Open	
  source	
  add-­‐in	
  &	
  web	
  application	
  
•    Facilitate	
  data	
  management,	
  sharing,	
  archiving	
  for	
  scientists	
  
•    Focus	
  on	
  atmospheric,	
  ecological,	
  hydrological,	
  and	
  
     oceanographic	
  data	
  
•    Collect	
  requirements	
  for	
  add-­‐in	
  from	
  scientists,	
  data	
  
     centers,	
  libraries	
  
Add-­‐in	
  &	
  Web	
  Application?	
  
Add-­‐in	
  	
  
•  Little	
  pieces	
  of	
  software	
  	
  
•  Download	
  to	
  extend	
  the	
  capabilities	
  of	
  Excel	
  
•  Appear	
  as	
  “ribbon”	
  in	
  Excel	
  
•  Only	
  work	
  with	
  Windows	
  Excel	
  2007+	
  
•  Available	
  offline	
  but	
  updates	
  difficult	
  




                                                                  www.ablebits.com	
  
Add-­‐in	
  &	
  Web	
  Application?	
  
Add-­‐in	
  	
  
•  Little	
  pieces	
  of	
  software	
  	
  
•  Download	
  to	
  extend	
  the	
  capabilities	
  of	
  Excel	
  
•  Appear	
  as	
  “ribbon”	
  in	
  Excel	
  
•  Only	
  work	
  with	
  Windows	
  Excel	
  2007+	
  
•  Available	
  offline	
  but	
  updates	
  difficult	
  
Web-­‐based	
  application	
  	
  
•  Websites	
  that	
  do	
  something	
  with	
  info/files	
  provided	
  by	
  user	
  
•  Examples:	
  Facebook,	
  YouTube	
  
•  No	
  program	
  download	
  required	
  but	
  updates	
  easy	
  
•  New	
  user	
  interface	
  to	
  learn	
  
What	
  will	
  DCXL	
  do?	
  




 What	
  do	
  scientists	
  
         need?	
  
~ 150	
  scientists	
  
•  No	
  data	
  preservation	
  
   –  Unaware	
  of	
  archives	
  
   –  Resistant	
  to	
  sharing	
  
•  Poor	
  data	
  documentation	
  
•  90%	
  use	
  other	
  programs	
  along	
  with	
  Excel	
  
Requirements	
  
1.   Must	
  work	
  for	
  Excel	
  users	
  without	
  the	
  add-­‐in	
  
2.   No	
  additional	
  software	
  necessary	
  
3.   Can	
  be	
  used	
  offline	
  
4.   Perform	
  CSV	
  compatibility	
  checks,	
  reporting,	
  and	
  automated	
  fixes	
  
5.   Add	
  Metadata	
  to	
  data	
  file	
  
      a.  Can	
  use	
  existing	
  metadata	
  as	
  a	
  template	
  
      b.  Add-­‐in	
  can	
  automatically	
  generate	
  some	
  of	
  the	
  metadata	
  
            where	
  the	
  info	
  is	
  available	
  from	
  the	
  file	
  
6.  Generate	
  a	
  citation	
  for	
  the	
  data	
  file	
  
7.  Deposit	
  data	
  and	
  metadata	
  in	
  a	
  repository	
  
	
  
Requirements	
  


Features	
  
1.  Compatibility	
  Check	
  
2.  Generate	
  metadata	
  
3.  Generate	
  citation	
  
4.  Post	
  data	
  to	
  repository	
  
DCXL	
  Add-­‐in	
  Ribbon	
  
Open	
  Access?	
  
Vision	
  for	
  Future	
  
•  Community	
  adoption	
  
•  Extension	
  to	
  other	
  programs	
  
   –  Google	
  Docs,	
  OpenOffice	
  
•  Incorporation	
  of	
  other	
  metadata	
  schemas	
  
•  Repository	
  adoption	
  
•  Partnerships:	
  FigShare,	
  F1000,	
  USGS,	
  etc.	
  
Website:	
  dcxl.cdlib.org	
  
dcxl.cdlib.org	
  
@dcxlCDL	
  
www.facebook.com/DCXLatCDL	
  


                                     www.carlystrasser.net	
  
                                 carlystrasser@gmail.com	
  
                                            @carlystrasser	
  

Mais conteúdo relacionado

Mais procurados

Landscape of Data Curation - Microsoft eScience 2012
Landscape of Data Curation - Microsoft eScience 2012Landscape of Data Curation - Microsoft eScience 2012
Landscape of Data Curation - Microsoft eScience 2012
Carly Strasser
 
UCLA: Data Management for Scientists
UCLA: Data Management for ScientistsUCLA: Data Management for Scientists
UCLA: Data Management for Scientists
Carly Strasser
 
Juliana Freire PPT
Juliana Freire PPTJuliana Freire PPT
Juliana Freire PPT
Laura Manley
 

Mais procurados (20)

Data Management: The Current Landscape
Data Management: The Current LandscapeData Management: The Current Landscape
Data Management: The Current Landscape
 
Data Herding for Scientists - UC Davis OA Week
Data Herding for Scientists - UC Davis OA WeekData Herding for Scientists - UC Davis OA Week
Data Herding for Scientists - UC Davis OA Week
 
Data Herding for Scientists - IGERT Symposium at UF
Data Herding for Scientists - IGERT Symposium at UFData Herding for Scientists - IGERT Symposium at UF
Data Herding for Scientists - IGERT Symposium at UF
 
Landscape of Data Curation - Microsoft eScience 2012
Landscape of Data Curation - Microsoft eScience 2012Landscape of Data Curation - Microsoft eScience 2012
Landscape of Data Curation - Microsoft eScience 2012
 
Data Management: Scientist Perspective - DLF 2012
Data Management: Scientist Perspective - DLF 2012Data Management: Scientist Perspective - DLF 2012
Data Management: Scientist Perspective - DLF 2012
 
Open Data & Open Access - DLF 2012
Open Data & Open Access - DLF 2012Open Data & Open Access - DLF 2012
Open Data & Open Access - DLF 2012
 
UCLA: Data Management for Scientists
UCLA: Data Management for ScientistsUCLA: Data Management for Scientists
UCLA: Data Management for Scientists
 
Digital Curation for Excel (DCXL)
Digital Curation for Excel (DCXL)Digital Curation for Excel (DCXL)
Digital Curation for Excel (DCXL)
 
DMPTool Overview for UC Merced Research Week
DMPTool Overview for UC Merced Research WeekDMPTool Overview for UC Merced Research Week
DMPTool Overview for UC Merced Research Week
 
UC Santa Cruz: Data Management for Scientists
UC Santa Cruz: Data Management for ScientistsUC Santa Cruz: Data Management for Scientists
UC Santa Cruz: Data Management for Scientists
 
Research Data and Scholarly Communication
Research Data and Scholarly CommunicationResearch Data and Scholarly Communication
Research Data and Scholarly Communication
 
Manufacturing Serendipity
Manufacturing SerendipityManufacturing Serendipity
Manufacturing Serendipity
 
Data Management Solutions from Libraries at NSF Large Facilities Workshop
Data Management Solutions from Libraries at NSF Large Facilities WorkshopData Management Solutions from Libraries at NSF Large Facilities Workshop
Data Management Solutions from Libraries at NSF Large Facilities Workshop
 
Research Data and Scholarly Communication (with notes)
Research Data and Scholarly Communication (with notes)Research Data and Scholarly Communication (with notes)
Research Data and Scholarly Communication (with notes)
 
Data Management Planning for ESA 2013
Data Management Planning for ESA 2013Data Management Planning for ESA 2013
Data Management Planning for ESA 2013
 
DataUp at ACRL 2013
DataUp at ACRL 2013DataUp at ACRL 2013
DataUp at ACRL 2013
 
RDAP 15: You’re in good company: Unifying campus research data services
RDAP 15: You’re in good company: Unifying campus research data servicesRDAP 15: You’re in good company: Unifying campus research data services
RDAP 15: You’re in good company: Unifying campus research data services
 
Library Orientation School of Medicine 2009
Library Orientation School of Medicine 2009Library Orientation School of Medicine 2009
Library Orientation School of Medicine 2009
 
The Internet, Science, and Transformations of Knowledge
The Internet, Science, and Transformations of KnowledgeThe Internet, Science, and Transformations of Knowledge
The Internet, Science, and Transformations of Knowledge
 
Juliana Freire PPT
Juliana Freire PPTJuliana Freire PPT
Juliana Freire PPT
 

Destaque (6)

Os nossos poetas
Os nossos poetasOs nossos poetas
Os nossos poetas
 
1 historia do surgimento da psicanalise
1   historia do surgimento da psicanalise1   historia do surgimento da psicanalise
1 historia do surgimento da psicanalise
 
Histeria
HisteriaHisteria
Histeria
 
Seminario freud
Seminario freudSeminario freud
Seminario freud
 
Freud
FreudFreud
Freud
 
Data Management Plans: Presentation for Data Governance Workshop
Data Management Plans: Presentation for Data Governance WorkshopData Management Plans: Presentation for Data Governance Workshop
Data Management Plans: Presentation for Data Governance Workshop
 

Semelhante a DataUp: Data Curation for Excel

DataUp: An overview for the DataONE Users Group
DataUp: An overview for the DataONE Users GroupDataUp: An overview for the DataONE Users Group
DataUp: An overview for the DataONE Users Group
Carly Strasser
 
Cni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesCni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferies
BDLSS
 
Lecture2 slides-march-29
Lecture2 slides-march-29Lecture2 slides-march-29
Lecture2 slides-march-29
Cyri Jones
 
Provenance Management to Enable Data Sharing
Provenance Management to Enable Data SharingProvenance Management to Enable Data Sharing
Provenance Management to Enable Data Sharing
University of Arizona
 
Module 1 - Chapter1.pptx
Module 1 - Chapter1.pptxModule 1 - Chapter1.pptx
Module 1 - Chapter1.pptx
SoniaDevi15
 

Semelhante a DataUp: Data Curation for Excel (20)

DataUp: An overview for the DataONE Users Group
DataUp: An overview for the DataONE Users GroupDataUp: An overview for the DataONE Users Group
DataUp: An overview for the DataONE Users Group
 
DataUp Overview: AGU 2012
DataUp Overview: AGU 2012DataUp Overview: AGU 2012
DataUp Overview: AGU 2012
 
2015 09 emc lsug
2015 09 emc lsug2015 09 emc lsug
2015 09 emc lsug
 
Sept 24 NISO Virtual Conference: Library Data in the Cloud
Sept 24 NISO Virtual Conference: Library Data in the CloudSept 24 NISO Virtual Conference: Library Data in the Cloud
Sept 24 NISO Virtual Conference: Library Data in the Cloud
 
Data Management Plans: Tips, Tricks and Tools
Data Management Plans: Tips, Tricks and ToolsData Management Plans: Tips, Tricks and Tools
Data Management Plans: Tips, Tricks and Tools
 
Cni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesCni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferies
 
Opening up: bibliographic data sharing & interoperability
Opening up: bibliographic data sharing & interoperabilityOpening up: bibliographic data sharing & interoperability
Opening up: bibliographic data sharing & interoperability
 
Data Matters for AGU Early Career Conference
Data Matters for AGU Early Career ConferenceData Matters for AGU Early Career Conference
Data Matters for AGU Early Career Conference
 
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
 
Duraspace Hot Topics Series 6: Metadata and Repository Services
Duraspace Hot Topics Series 6: Metadata and Repository ServicesDuraspace Hot Topics Series 6: Metadata and Repository Services
Duraspace Hot Topics Series 6: Metadata and Repository Services
 
Lecture2 slides-march-29
Lecture2 slides-march-29Lecture2 slides-march-29
Lecture2 slides-march-29
 
Data Stewardship for SPATIAL/IsoCamp 2014
Data Stewardship for SPATIAL/IsoCamp 2014Data Stewardship for SPATIAL/IsoCamp 2014
Data Stewardship for SPATIAL/IsoCamp 2014
 
Informatics Transform : Re-engineering Libraries for the Data Decade
Informatics Transform : Re-engineering Libraries for the Data DecadeInformatics Transform : Re-engineering Libraries for the Data Decade
Informatics Transform : Re-engineering Libraries for the Data Decade
 
Adoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific ResearchAdoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific Research
 
Coping with Data for WHOI JP Students
Coping with Data for WHOI JP StudentsCoping with Data for WHOI JP Students
Coping with Data for WHOI JP Students
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Provenance Management to Enable Data Sharing
Provenance Management to Enable Data SharingProvenance Management to Enable Data Sharing
Provenance Management to Enable Data Sharing
 
Module 1 - Chapter1.pptx
Module 1 - Chapter1.pptxModule 1 - Chapter1.pptx
Module 1 - Chapter1.pptx
 
Dataverse, Cloud Dataverse, and DataTags
Dataverse, Cloud Dataverse, and DataTagsDataverse, Cloud Dataverse, and DataTags
Dataverse, Cloud Dataverse, and DataTags
 

Mais de Carly Strasser

Mais de Carly Strasser (20)

Funders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeFunders and Publishers: Agents of Change
Funders and Publishers: Agents of Change
 
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
 
Lightning Talk on open data for #oaw14sky
Lightning Talk on open data for #oaw14skyLightning Talk on open data for #oaw14sky
Lightning Talk on open data for #oaw14sky
 
CDL Tools for DataCite 2014
CDL Tools for DataCite 2014CDL Tools for DataCite 2014
CDL Tools for DataCite 2014
 
ESA Ignite talk on quality control for data
ESA Ignite talk on quality control for dataESA Ignite talk on quality control for data
ESA Ignite talk on quality control for data
 
ESA Ignite talk on UC3 Dash platform for data sharing
ESA Ignite talk on UC3 Dash platform for data sharingESA Ignite talk on UC3 Dash platform for data sharing
ESA Ignite talk on UC3 Dash platform for data sharing
 
Data publication and Citation for CLIR postdoc seminar
Data publication and Citation for CLIR postdoc seminarData publication and Citation for CLIR postdoc seminar
Data publication and Citation for CLIR postdoc seminar
 
Data Management for Mountain Observatories Workshop
Data Management for Mountain Observatories WorkshopData Management for Mountain Observatories Workshop
Data Management for Mountain Observatories Workshop
 
Libraries & Research Data Management for CO Alliance of Resrch Libraries
Libraries & Research Data Management for CO Alliance of Resrch LibrariesLibraries & Research Data Management for CO Alliance of Resrch Libraries
Libraries & Research Data Management for CO Alliance of Resrch Libraries
 
Open Science for Australian Institute of Marine Science Workshop
Open Science for Australian Institute of Marine Science WorkshopOpen Science for Australian Institute of Marine Science Workshop
Open Science for Australian Institute of Marine Science Workshop
 
Research Life Cycle for GeoData 2014
Research Life Cycle for GeoData 2014Research Life Cycle for GeoData 2014
Research Life Cycle for GeoData 2014
 
Data management overview and UC3 tools for IASSIST 2014
Data management overview and UC3 tools for IASSIST 2014Data management overview and UC3 tools for IASSIST 2014
Data management overview and UC3 tools for IASSIST 2014
 
Dash for IASSIST 2014
Dash for IASSIST 2014Dash for IASSIST 2014
Dash for IASSIST 2014
 
DMPTool for UMass eScience Symposium
DMPTool for UMass eScience SymposiumDMPTool for UMass eScience Symposium
DMPTool for UMass eScience Symposium
 
DMPTool 2.0 for #IDCC14
DMPTool 2.0 for #IDCC14DMPTool 2.0 for #IDCC14
DMPTool 2.0 for #IDCC14
 
Data Publication at CDL for IDCC14
Data Publication at CDL for IDCC14Data Publication at CDL for IDCC14
Data Publication at CDL for IDCC14
 
Data Publication for UC Davis Publish or Perish
Data Publication for UC Davis Publish or PerishData Publication for UC Davis Publish or Perish
Data Publication for UC Davis Publish or Perish
 
DMPTool for IMLS #WebWise14
DMPTool for IMLS #WebWise14DMPTool for IMLS #WebWise14
DMPTool for IMLS #WebWise14
 
Bren - UCSB - Spooky spreadsheets
Bren - UCSB - Spooky spreadsheetsBren - UCSB - Spooky spreadsheets
Bren - UCSB - Spooky spreadsheets
 
Cal Poly - An Overview of Open Science
Cal Poly - An Overview of Open ScienceCal Poly - An Overview of Open Science
Cal Poly - An Overview of Open Science
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 

DataUp: Data Curation for Excel

  • 1. Facilitating  data  stewardship   practices  for  scientists     Carly  Strasser  |  carly.strasser@ucop.edu  |  www.carlystrasser.net   Open  Access  symposium  |  University  of  North  Texas  |  May  2012  
  • 2. UGLY  TRUTH   Many   Earth  |  Environmental  |  Ecological   scientists…       5shortessays.blogspot.com     are  not  taught  data  management   don’t  know  what  metadata  are   can’t  name  data  centers  or  repositories   don’t  share  data  publicly  or  store  it  in  an  archive   aren’t  convinced  they  should  share  data    
  • 3. Where  data  end  up   From  Flickr  by  diylibrarian   www blog.order2disorder.com   From  Flickr  by  csessums   Data   Metadata   From  Flickr  by  csessums   Recreated  from  Klump  et  al.  2006  
  • 4. Where  data  end  up   From  Flickr  by  diylibrarian   www Data   www Metadata   From  Flickr  by  torkildr   Recreated  from  Klump  et  al.  2006  
  • 5. Intercept  the   researchers  where   they  already  work:  
  • 6. Frequency  of   Excel  use   Rare  or   occasional   use   Moderate   use   Percent  of  respondents  who  use   Excel  for  these  tasks   100   Every  day   90   or  almost   80   every  day   70   60   50   40   30   20   10   0   Organizing   Visualizing   Sta:s:cs   Sharing  data   data   data  
  • 7.
  • 8. Facilitate   Archiving   Data   Data  Reuse   management   Sharing   &  organization   Reproducibility   Publishing  
  • 9. •  Open  source  add-­‐in  &  web  application   •  Facilitate  data  management,  sharing,  archiving  for  scientists   •  Focus  on  atmospheric,  ecological,  hydrological,  and   oceanographic  data   •  Collect  requirements  for  add-­‐in  from  scientists,  data   centers,  libraries  
  • 10. Add-­‐in  &  Web  Application?   Add-­‐in     •  Little  pieces  of  software     •  Download  to  extend  the  capabilities  of  Excel   •  Appear  as  “ribbon”  in  Excel   •  Only  work  with  Windows  Excel  2007+   •  Available  offline  but  updates  difficult   www.ablebits.com  
  • 11. Add-­‐in  &  Web  Application?   Add-­‐in     •  Little  pieces  of  software     •  Download  to  extend  the  capabilities  of  Excel   •  Appear  as  “ribbon”  in  Excel   •  Only  work  with  Windows  Excel  2007+   •  Available  offline  but  updates  difficult   Web-­‐based  application     •  Websites  that  do  something  with  info/files  provided  by  user   •  Examples:  Facebook,  YouTube   •  No  program  download  required  but  updates  easy   •  New  user  interface  to  learn  
  • 12. What  will  DCXL  do?   What  do  scientists   need?  
  • 13. ~ 150  scientists   •  No  data  preservation   –  Unaware  of  archives   –  Resistant  to  sharing   •  Poor  data  documentation   •  90%  use  other  programs  along  with  Excel  
  • 14. Requirements   1.  Must  work  for  Excel  users  without  the  add-­‐in   2.  No  additional  software  necessary   3.  Can  be  used  offline   4.  Perform  CSV  compatibility  checks,  reporting,  and  automated  fixes   5.  Add  Metadata  to  data  file   a.  Can  use  existing  metadata  as  a  template   b.  Add-­‐in  can  automatically  generate  some  of  the  metadata   where  the  info  is  available  from  the  file   6.  Generate  a  citation  for  the  data  file   7.  Deposit  data  and  metadata  in  a  repository    
  • 15. Requirements   Features   1.  Compatibility  Check   2.  Generate  metadata   3.  Generate  citation   4.  Post  data  to  repository  
  • 18. Vision  for  Future   •  Community  adoption   •  Extension  to  other  programs   –  Google  Docs,  OpenOffice   •  Incorporation  of  other  metadata  schemas   •  Repository  adoption   •  Partnerships:  FigShare,  F1000,  USGS,  etc.  
  • 20. dcxl.cdlib.org   @dcxlCDL   www.facebook.com/DCXLatCDL   www.carlystrasser.net   carlystrasser@gmail.com   @carlystrasser