SlideShare uma empresa Scribd logo
1 de 12
Baixar para ler offline
Executable Papers:
   publishing science
      that works

 Anita de Waard, Elsevier Labs
HCLS Scientific Discourse Group
         June 20, 2011
Elsevier Challenges
Goals:
-  Invite and survey ideas in innovative science publishing
-  Create a community of people working on similar issues, from different
   backgrounds/viewpoints
Rules:
-  Open submission; very interdisciplinary panel of judges; open publication of
   submissions
-  IPR stays with author; if commercial development, Elsevier has right of first
   refusal
Challenges so far:
-  2008/9: Elsevier Grand Challenge for knowledge enhancement in the life
   sciences: http://www.elseviergrandchallenge.com
-  2010/11: ISMB Killer App award: rewarding bioinformatics apps that work for
   biologists http://killerapp.iscb.org/
-  2011: Elsevier Executable Paper challenge: http://www.executablepapers.com/
Executable Paper Challenge
Driven by issues in publishing computational science:

-   How can we develop a model for executable files that is compatible
    with the user’s operating system and architecture and adaptable to
    future systems?

-   How do we manage very large file sizes?

-   How do we validate data and code, and decrease the reviewer’s
    workload?

-   How to support registering and tracking of actions taken on the
    ‘executable paper?’
Coorganised with International Conf on Computational Sci http://www.iccs-
meeting.org:

- For high-performance and (geo/eco/bio/chem)‘-informatics’ fields
- Actually challenge participants were a different community!
The Finalists:
           http://www.executablepapers.com/finalists.html
1.SHARE - a web portal for creating and sharing executable research
  papers http://sites.google.com/site/executablepaper/
2.A data and code model for reproducible research and executable papers http://
  dirac.cnrs-orleans.fr/~hinsen/executable_paper_challenge.tar.gz
3.A-R-E: The Author-Review-Execute environment
  http://iwb.fluidops.com:7878/resource/AREpaper
4.Planetary System: Web 3.0 and Active Documents
  https://trac.mathweb.org/planetary/wiki/EPCDemo
5.Paper Mache: Creating Dynamic Reproducible Science
  http://oware.cse.tamu.edu:8080/
6.A Provenance Based Infrastructure for Creating Executable Papers
  http://www.vistrails.org/index.php/ExecutablePapers
7.Universal Identifier for Computation Results
  http://vcr.stanford.edu
8.R2 Platform for Reproducible Research
  http://rsquared.stat.uni-muenchen.de/
9.The Collage Authoring Environment
  http://collage.cyfronet.pl
SHARE - a web portal for creating
               and sharing executable research papers
            http://sites.google.com/site/executablepaper/
-   built to house the submissions to the Transformation Tool Contest (TTC)
-   an environment where all software and data related to the paper is
    optimally installed and ready for (temporary and secure) evaluation
-   a specific virtual machine image can be instantiated within the paper
-   SHARE supports multiple operating systems both at the level of the remote
    virtual machines as well as at the level of the connecting clients running on
    the user’s machine
-   more than 100 heterogenous images have been contributed by different
    research communities so far
A-R-E: The Author-Review-Execute environment
http://iwb.fluidops.com:7878/resource/AREpaper
-   A data-driven, loosely coupled, and distributed approach to support
    the life cycle of an (executable) paper: authoring, reviewing, publication
    and study:
    -  finding out which paragraph is providing the information bit
       pertinent to the reference
    -  navigate from data points in a plot to the data items in raw
       experimental data that led to these points (e.g. point to an excel
       sheet column with experimental data)
    -  navigate into the program code that led to a specific data set
-   Based on a semantic wiki:
A Provenance Based Infrastructure for
                Creating Executable Papers
    http://www.vistrails.org/index.php/ExecutablePapers
-    VisTrails provides a mechanism to store provenance for workflows

-    Code and plug-ins for LaTeX, Wiki, Microsoft Word, and PowerPoint

-    CrowdLabs (http://www.crowdlabs.org) to allow papers to point to results
     that can be executed on a remote server and interactively explored from a
     Web browser
Universal Identifier for Computation Results
                  http://vcr.stanford.edu
-   Verifiable Computational Result (VCR): A
    computational result (eg. table, figure, chart, dataset),
    together with the metadata describing in detail the
    computations that created it every computation
    automatically generates a detailed chronicle of its
    inputs and outputs as part of the process execution.
    The chronicle is automatically stored in a standard
    format on a VCR repository for later access

-   Verifiable Result Repository (Repository): A web-
    services provider that archives VCRs and later serves
    up views of specific computational results

-    Verifiable Result Identifier (VRI): A URL (web address)
    that universally and permanently identifies a repository
    and causes it to serve up views of a specific VCRa
    DOI-like string that permanently and uniquely
    identifies the chronicle associated to that result and
    the repository that can serve views of that chronicle.
The Collage Authoring Environment
             http://collage.cyfronet.pl
-   environment which enables authors to seamlessly embed chunks of
    executable code (called assets) into scientific publications:
    - input forms: used by the user to feed input data into the running
      experiment
    - visualizations: render an experiment result which can be directly
      visualized in the research paper
    - code snippets: embed an editable view of the code which enacts a
      specific computation and may be used to generate additional assets

-   allow repeated execution of these assets on underlying computing and data
    storage resources:
Next step: The Executable Journal?

-   Ideally, we’d like all these tools to work together
-   In fact, we’d like that to be how we communicate informatics/
    computer science!
-   Submit a paper with a piece of working code
-   The code works on the platform
-   The code stays on the platform, and is available for other papers
    to run on, too!
-   Advantages:
    -   Clearer communication of software
    -   Less reinvention of the wheel
    -   More collaboration
In other words:
“I like the idea of [...] a research object corresponding to a
PhD thesis sitting on the (digital) library shelf and then being
re-executed as new data comes along.  So the thesis sits
there and new results (or papers, or research objects) pop
out. I like this example because it involves tying down the
method and letting the data flow, instead of the widely held
view that the data sits there and methods are applied to it.
[...]
These papers then become a way of distributing data and
methods in a highly usable and user-centric way [...]. So
scientists don't need to download and install tools and learn
user interfaces.They just interact with the published
executable papers...”
                 Dave De Roure, email to Wf4ever group
What does this have to do with HCLS?

-   Might be a good area to explore this in?

-   E.g. interchange of annotations that we are
    exploring w/Tim Clark’s group...

-   Next step:

    -   Funding?

    -   Format?

    -   Platform?

-   Thoughts??

Mais conteúdo relacionado

Semelhante a Executable papers

[3.6] Beyond Data Sharing - Pieter van Gorp [3TU.Datacentrum Symposium 2014, ...
[3.6] Beyond Data Sharing - Pieter van Gorp [3TU.Datacentrum Symposium 2014, ...[3.6] Beyond Data Sharing - Pieter van Gorp [3TU.Datacentrum Symposium 2014, ...
[3.6] Beyond Data Sharing - Pieter van Gorp [3TU.Datacentrum Symposium 2014, ...3TU.Datacentrum
 
Tell Me Quality Documentation
Tell Me Quality DocumentationTell Me Quality Documentation
Tell Me Quality DocumentationMarco Berlot
 
Towards an Infrastructure for Enabling Systematic Development and Research of...
Towards an Infrastructure for Enabling Systematic Development and Research of...Towards an Infrastructure for Enabling Systematic Development and Research of...
Towards an Infrastructure for Enabling Systematic Development and Research of...Rafael Ferreira da Silva
 
OSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOpen Science Fair
 
Cytoscape: Now and Future
Cytoscape: Now and FutureCytoscape: Now and Future
Cytoscape: Now and FutureKeiichiro Ono
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceDavid De Roure
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryCarole Goble
 
Plannen Code Jam OpenSocial gadgets
Plannen Code Jam OpenSocial gadgets Plannen Code Jam OpenSocial gadgets
Plannen Code Jam OpenSocial gadgets kirstenveelo
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchangelagoze
 
eROSA Stakeholder WS1: EOSC Architecture
eROSA Stakeholder WS1: EOSC ArchitectureeROSA Stakeholder WS1: EOSC Architecture
eROSA Stakeholder WS1: EOSC Architecturee-ROSA
 
DSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platformDSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platformAndrea Bollini
 
Plannen Code Jam OpenSocial gadgets
Plannen Code Jam OpenSocial gadgetsPlannen Code Jam OpenSocial gadgets
Plannen Code Jam OpenSocial gadgetskirstenveelo
 
Web2.0 2012 - lesson 7 - technologies and mashups
Web2.0 2012 - lesson 7 - technologies and mashups Web2.0 2012 - lesson 7 - technologies and mashups
Web2.0 2012 - lesson 7 - technologies and mashups Carlo Vaccari
 
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Artefactual Systems - AtoM
 
final proposal-Implement and create new documentation toolchain
final proposal-Implement and create new documentation toolchainfinal proposal-Implement and create new documentation toolchain
final proposal-Implement and create new documentation toolchainParamkusham Shruthi
 
Application Platforms and Developer Communities - New software tools and app...
Application Platforms and Developer Communities -  New software tools and app...Application Platforms and Developer Communities -  New software tools and app...
Application Platforms and Developer Communities - New software tools and app...Michael Habib
 
Gsoc proposal 2021 polaris
Gsoc proposal 2021 polarisGsoc proposal 2021 polaris
Gsoc proposal 2021 polarisAyushBansal122
 

Semelhante a Executable papers (20)

[3.6] Beyond Data Sharing - Pieter van Gorp [3TU.Datacentrum Symposium 2014, ...
[3.6] Beyond Data Sharing - Pieter van Gorp [3TU.Datacentrum Symposium 2014, ...[3.6] Beyond Data Sharing - Pieter van Gorp [3TU.Datacentrum Symposium 2014, ...
[3.6] Beyond Data Sharing - Pieter van Gorp [3TU.Datacentrum Symposium 2014, ...
 
Tell Me Quality Documentation
Tell Me Quality DocumentationTell Me Quality Documentation
Tell Me Quality Documentation
 
Towards an Infrastructure for Enabling Systematic Development and Research of...
Towards an Infrastructure for Enabling Systematic Development and Research of...Towards an Infrastructure for Enabling Systematic Development and Research of...
Towards an Infrastructure for Enabling Systematic Development and Research of...
 
OSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications database
 
Cytoscape: Now and Future
Cytoscape: Now and FutureCytoscape: Now and Future
Cytoscape: Now and Future
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems Science
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
 
Reproducible Science and Deep Software Variability
Reproducible Science and Deep Software VariabilityReproducible Science and Deep Software Variability
Reproducible Science and Deep Software Variability
 
Plannen Code Jam OpenSocial gadgets
Plannen Code Jam OpenSocial gadgets Plannen Code Jam OpenSocial gadgets
Plannen Code Jam OpenSocial gadgets
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchange
 
eROSA Stakeholder WS1: EOSC Architecture
eROSA Stakeholder WS1: EOSC ArchitectureeROSA Stakeholder WS1: EOSC Architecture
eROSA Stakeholder WS1: EOSC Architecture
 
DSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platformDSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platform
 
Plannen Code Jam OpenSocial gadgets
Plannen Code Jam OpenSocial gadgetsPlannen Code Jam OpenSocial gadgets
Plannen Code Jam OpenSocial gadgets
 
Web2.0 2012 - lesson 7 - technologies and mashups
Web2.0 2012 - lesson 7 - technologies and mashups Web2.0 2012 - lesson 7 - technologies and mashups
Web2.0 2012 - lesson 7 - technologies and mashups
 
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
 
UCIAD overview
UCIAD overviewUCIAD overview
UCIAD overview
 
final proposal-Implement and create new documentation toolchain
final proposal-Implement and create new documentation toolchainfinal proposal-Implement and create new documentation toolchain
final proposal-Implement and create new documentation toolchain
 
Application Platforms and Developer Communities - New software tools and app...
Application Platforms and Developer Communities -  New software tools and app...Application Platforms and Developer Communities -  New software tools and app...
Application Platforms and Developer Communities - New software tools and app...
 
Gsoc proposal 2021 polaris
Gsoc proposal 2021 polarisGsoc proposal 2021 polaris
Gsoc proposal 2021 polaris
 

Mais de Anita de Waard

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseAnita de Waard
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?Anita de Waard
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataAnita de Waard
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsAnita de Waard
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesAnita de Waard
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Anita de Waard
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?Anita de Waard
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data ManagementAnita de Waard
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseAnita de Waard
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of PublishingAnita de Waard
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsAnita de Waard
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryAnita de Waard
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data SharingAnita de Waard
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingAnita de Waard
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumAnita de Waard
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataAnita de Waard
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016Anita de Waard
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...Anita de Waard
 

Mais de Anita de Waard (20)

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data Commons
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring Guidelines
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
 
History of the future
History of the futureHistory of the future
History of the future
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with Dataverse
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost Recovery
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data Sharing
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly Publishing
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
 

Último

Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 

Último (20)

Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

Executable papers

  • 1. Executable Papers: publishing science that works Anita de Waard, Elsevier Labs HCLS Scientific Discourse Group June 20, 2011
  • 2. Elsevier Challenges Goals: - Invite and survey ideas in innovative science publishing - Create a community of people working on similar issues, from different backgrounds/viewpoints Rules: - Open submission; very interdisciplinary panel of judges; open publication of submissions - IPR stays with author; if commercial development, Elsevier has right of first refusal Challenges so far: - 2008/9: Elsevier Grand Challenge for knowledge enhancement in the life sciences: http://www.elseviergrandchallenge.com - 2010/11: ISMB Killer App award: rewarding bioinformatics apps that work for biologists http://killerapp.iscb.org/ - 2011: Elsevier Executable Paper challenge: http://www.executablepapers.com/
  • 3. Executable Paper Challenge Driven by issues in publishing computational science: - How can we develop a model for executable files that is compatible with the user’s operating system and architecture and adaptable to future systems? - How do we manage very large file sizes? - How do we validate data and code, and decrease the reviewer’s workload? - How to support registering and tracking of actions taken on the ‘executable paper?’ Coorganised with International Conf on Computational Sci http://www.iccs- meeting.org: - For high-performance and (geo/eco/bio/chem)‘-informatics’ fields - Actually challenge participants were a different community!
  • 4. The Finalists: http://www.executablepapers.com/finalists.html 1.SHARE - a web portal for creating and sharing executable research papers http://sites.google.com/site/executablepaper/ 2.A data and code model for reproducible research and executable papers http:// dirac.cnrs-orleans.fr/~hinsen/executable_paper_challenge.tar.gz 3.A-R-E: The Author-Review-Execute environment http://iwb.fluidops.com:7878/resource/AREpaper 4.Planetary System: Web 3.0 and Active Documents https://trac.mathweb.org/planetary/wiki/EPCDemo 5.Paper Mache: Creating Dynamic Reproducible Science http://oware.cse.tamu.edu:8080/ 6.A Provenance Based Infrastructure for Creating Executable Papers http://www.vistrails.org/index.php/ExecutablePapers 7.Universal Identifier for Computation Results http://vcr.stanford.edu 8.R2 Platform for Reproducible Research http://rsquared.stat.uni-muenchen.de/ 9.The Collage Authoring Environment http://collage.cyfronet.pl
  • 5. SHARE - a web portal for creating and sharing executable research papers http://sites.google.com/site/executablepaper/ - built to house the submissions to the Transformation Tool Contest (TTC) - an environment where all software and data related to the paper is optimally installed and ready for (temporary and secure) evaluation - a specific virtual machine image can be instantiated within the paper - SHARE supports multiple operating systems both at the level of the remote virtual machines as well as at the level of the connecting clients running on the user’s machine - more than 100 heterogenous images have been contributed by different research communities so far
  • 6. A-R-E: The Author-Review-Execute environment http://iwb.fluidops.com:7878/resource/AREpaper - A data-driven, loosely coupled, and distributed approach to support the life cycle of an (executable) paper: authoring, reviewing, publication and study: - finding out which paragraph is providing the information bit pertinent to the reference - navigate from data points in a plot to the data items in raw experimental data that led to these points (e.g. point to an excel sheet column with experimental data) - navigate into the program code that led to a specific data set - Based on a semantic wiki:
  • 7. A Provenance Based Infrastructure for Creating Executable Papers http://www.vistrails.org/index.php/ExecutablePapers - VisTrails provides a mechanism to store provenance for workflows - Code and plug-ins for LaTeX, Wiki, Microsoft Word, and PowerPoint - CrowdLabs (http://www.crowdlabs.org) to allow papers to point to results that can be executed on a remote server and interactively explored from a Web browser
  • 8. Universal Identifier for Computation Results http://vcr.stanford.edu - Verifiable Computational Result (VCR): A computational result (eg. table, figure, chart, dataset), together with the metadata describing in detail the computations that created it every computation automatically generates a detailed chronicle of its inputs and outputs as part of the process execution. The chronicle is automatically stored in a standard format on a VCR repository for later access - Verifiable Result Repository (Repository): A web- services provider that archives VCRs and later serves up views of specific computational results - Verifiable Result Identifier (VRI): A URL (web address) that universally and permanently identifies a repository and causes it to serve up views of a specific VCRa DOI-like string that permanently and uniquely identifies the chronicle associated to that result and the repository that can serve views of that chronicle.
  • 9. The Collage Authoring Environment http://collage.cyfronet.pl - environment which enables authors to seamlessly embed chunks of executable code (called assets) into scientific publications: - input forms: used by the user to feed input data into the running experiment - visualizations: render an experiment result which can be directly visualized in the research paper - code snippets: embed an editable view of the code which enacts a specific computation and may be used to generate additional assets - allow repeated execution of these assets on underlying computing and data storage resources:
  • 10. Next step: The Executable Journal? - Ideally, we’d like all these tools to work together - In fact, we’d like that to be how we communicate informatics/ computer science! - Submit a paper with a piece of working code - The code works on the platform - The code stays on the platform, and is available for other papers to run on, too! - Advantages: - Clearer communication of software - Less reinvention of the wheel - More collaboration
  • 11. In other words: “I like the idea of [...] a research object corresponding to a PhD thesis sitting on the (digital) library shelf and then being re-executed as new data comes along.  So the thesis sits there and new results (or papers, or research objects) pop out. I like this example because it involves tying down the method and letting the data flow, instead of the widely held view that the data sits there and methods are applied to it. [...] These papers then become a way of distributing data and methods in a highly usable and user-centric way [...]. So scientists don't need to download and install tools and learn user interfaces.They just interact with the published executable papers...” Dave De Roure, email to Wf4ever group
  • 12. What does this have to do with HCLS? - Might be a good area to explore this in? - E.g. interchange of annotations that we are exploring w/Tim Clark’s group... - Next step: - Funding? - Format? - Platform? - Thoughts??