Reproducibility, dissemination,
and management of modeling results

17 February 2014, Braunschweig

Dagmar Waltemath

http://sems.uni-rostock.de
Nature Blogs: Of Schemes and Dreams (2014) Nine Worrying Stats on the Effect of Poor Scientific Data Management

“We’ve been hearing a common theme from
the academic community – researchers are
having difficulty managing and accessing their
data. It seems to be an ongoing problem for
research scientists, at any stage of their
careers.”
(Nature Blogs: Of Schemes and Dreams (2014) Nine Worrying Stats on the Effect of Poor Scientific Data
Management)

Outline

reproducibility

dissemination

management
Outline

reproducibility

dissemination

management

“People can’t share knowledge if they don’t
speak a common language”
Tom Davenport, Lawrence Prusak (2000) Working Knowledge

Reproducible modeling results :: Standards

Model
Entities, network of reactions, math
Fig.: Goldbeter (1991), http://www.ncbi.nlm.nih.gov/pubmed/1833774

Annotations
Compartment: Cell GO:0005623
Publication: Goldbeter PMID:1833774
M = inactive CDC2 kinase: UniProt:CDK1A_XENLA
Behavior: Oscillation TEDDY_0000006
Algorithm: Gillespie KiSAO:0000029
Fig.: BioModels Database

Protocols
Fig.: BioModels Database
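
The annotations on this slide each pair a model element with a term from an external vocabulary. A minimal Python sketch of that pairing follows, assuming a plain `prefix:accession` notation and the URL pattern of the identifiers.org resolver. The `Annotation` class and the `resolver_url` helper are hypothetical names chosen for illustration; they are not part of BioModels or of any standard.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """One link from a model element to an external term (hypothetical structure)."""
    element: str    # what is annotated, e.g. a compartment
    prefix: str     # vocabulary or database, e.g. "GO"
    accession: str  # identifier within that vocabulary

    def curie(self) -> str:
        # Compact "prefix:accession" form of the identifier.
        return f"{self.prefix}:{self.accession}"


def resolver_url(annotation: Annotation) -> str:
    # Assumption: identifiers.org resolves https://identifiers.org/<prefix>:<accession>.
    return f"https://identifiers.org/{annotation.curie()}"


# Two annotations taken from the slide; "pubmed" is used as the prefix for the PMID.
ANNOTATIONS = [
    Annotation("Compartment: Cell", "GO", "0005623"),
    Annotation("Publication: Goldbeter", "pubmed", "1833774"),
]

if __name__ == "__main__":
    for a in ANNOTATIONS:
        print(a.element, "->", resolver_url(a))
```

Keeping prefix and accession separate makes it possible to group annotations by vocabulary without parsing strings again.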

Reproducible modeling results :: Towards publication

[Figure: five numbered steps towards publication]

Following: Waltemath et al (2013) Reproducibility of model-based results in systems biology. Springer
Outline

reproducibility

dissemination

management

[Quantitative] models will be only as useful as their access and reuse
is easy for all scientists.
Nicolas Le Novère (2006) Model storage, exchange and integration. BMC Neuroscience
Dissemination :: Model curation and annotation

Fig.: Li et al (2010) BioModels Database: An enhanced, curated and annotated resource for published quantitative kinetic
models. BMC Systems Biology
Dissemination :: Public model repositories

1. Higher visibility of research
2. Long-term availability
3. Link to other resources
4. Quality-checks
Fig.: Piwowar and Vision (2013) Data reuse and the open
data citation advantage. PeerJ

Dissemination :: Quality checks with functional curation

Fig.: Example for functional curation on heart model, http://travis.cs.ox.ac.uk/FunctionalCuration/db.html

Fig.: Cooper et al (under review) Through models to knowledge with virtual experiments

Martin Scharm
Outline

reproducibility

dissemination

management

“And that’s why we need model management.”
Following: http://www.indiana.edu/~hperp200/images/WhyWeNeedComputer_thumb.png

Management :: Integration of model-related data
“Which models are annotated with ‘Adenosine tri-phosphate’?”
“Which models contain reactions with ATP as reactant and ADP as product?”

• Relations between entities
• Links to concepts in bio-ontologies
• Graph store (Neo4j database)

[Figure: graph of the model document “Tyson1991 Cell Cycle 6 var” with the nodes Cell, C2, CP, pM and Reaction3, connected by the relations hasPart, isContainedIn, asReactant and asProduct, and annotated via is, isVersionOf and isDescribedBy with Pubmed:1831270, Kegg Pathway sce04111, EC-Code 3.1.3.16, Uniprot:P04551, Interpro:IPR006670 and GO:0005623]

Fig.: Henkel et al (2012) Considerations of graph-based concepts to manage of computational biology models and associated simulations. INFORMATIK 2012, Braunschweig

Ron Henkel
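
The two questions on this slide can be answered by following labelled edges in a graph. The sketch below illustrates the idea with a plain in-memory edge list in Python. It does not use Neo4j or its query language, and the model names, the `hasReaction` and `isAnnotatedWith` relations, and the direction of `asReactant` and `asProduct` are assumptions made only for this toy example.

```python
from collections import defaultdict

# Toy edge list (hypothetical data): (source, relation, target).
EDGES = [
    ("Model_A", "hasReaction", "R1"),
    ("R1", "asReactant", "ATP"),
    ("R1", "asProduct", "ADP"),
    ("Model_B", "hasReaction", "R2"),
    ("R2", "asReactant", "ADP"),
    ("R2", "asProduct", "ATP"),
    ("Model_B", "isAnnotatedWith", "ATP"),
]


def build_index(edges):
    """Map (source, relation) to the set of targets reachable over that relation."""
    index = defaultdict(set)
    for source, relation, target in edges:
        index[(source, relation)].add(target)
    return index


def models_converting(edges, reactant, product):
    """Models with at least one reaction that consumes `reactant` and yields `product`."""
    index = build_index(edges)
    hits = set()
    for source, relation, target in edges:
        if relation != "hasReaction":
            continue
        reaction = target
        if (reactant in index[(reaction, "asReactant")]
                and product in index[(reaction, "asProduct")]):
            hits.add(source)
    return sorted(hits)


def models_annotated_with(edges, term):
    """Models that carry a direct annotation edge to `term`."""
    return sorted({s for s, r, t in edges if r == "isAnnotatedWith" and t == term})


if __name__ == "__main__":
    print(models_converting(EDGES, "ATP", "ADP"))
    print(models_annotated_with(EDGES, "ATP"))
```

The first question needs only annotation edges, while the second has to combine two edges of the same reaction node, which is the kind of structural query a graph store is suited to.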
Management :: Integration of model-related data
[Figure: graph linking the model document “Tyson1991 Cell Cycle 6 var” (nodes Cell, C2, CP, pM, Reaction3; annotations Pubmed:1831270, Kegg Pathway sce04111, EC-Code 3.1.3.16, Uniprot:P04551, Interpro:IPR006670, GO:0005623) to its SED-ML document Tyson_1991 (Modelreference, Simulation, Task, Datagenerator, Variable, Output) and to terms of the KiSAO and SBO ontologies, using relations such as isDescribedBy, is_connected, is_mapped_to, isA, hasPart, isContainedIn and isVersionOf]

Fig.: Henkel et al (in preparation)
Management :: Combination of methods
Keywords describing a model of interest.

Steps shown: search, rank, retrieve, select simulation description, add simulation description to simulation software, simulate, compare with paper

Ranked results: Tyson’91, Novak’97, Maex’98

Retrieved model
ID: BIOMD000000005
Authors: Tyson JJ.
Date: 13 Sep 2005 12:31:08
Publication: pubmed:1831270
Species: cdc2k, cyclin …
Reaction: cyclin_cdc2k_dissociation, …

Simulation description
Name: Tyson’91 ODE plot
Model: BIOMD000000005
Algorithm: ODE solver
Type: time course
Output: plot

Fig.: Following Waltemath et al (2013) Reproducibility of model-based results in systems biology. Springer. Henkel et al (2010) Ranked retrieval of Computational Biology models. BMC Bioinformatics

Ron Henkel
Management :: Provenance
“Give me the best matching model published on the Cell Cycle
and considering cdk1.”

Lucene: species:cdk1, compartment:cell, …
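
The step from a natural-language request to a fielded query such as `species:cdk1, compartment:cell` can be illustrated with a toy ranking function. The Python sketch below is only a stand-in for the idea of ranked retrieval: it counts matching `field:term` pairs and does not reproduce Lucene's query syntax or scoring. The model names and field values are hypothetical.

```python
def parse_query(query):
    """Split 'field:term, field:term' into a set of lower-cased (field, term) pairs."""
    pairs = set()
    for part in query.split(","):
        part = part.strip()
        if ":" not in part:
            continue  # skip empty or elided parts such as "…"
        field, term = part.split(":", 1)
        pairs.add((field.strip().lower(), term.strip().lower()))
    return pairs


def rank(models, query):
    """Return model names ordered by the number of matching field:term pairs."""
    wanted = parse_query(query)
    scored = []
    for name, fields in models.items():
        have = {(f, t.lower()) for f, terms in fields.items() for t in terms}
        score = len(wanted & have)
        if score:
            scored.append((score, name))
    # Highest score first; ties are broken alphabetically for a stable order.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [name for _, name in scored]


# Hypothetical index entries; field names follow the query on the slide.
MODELS = {
    "model_1": {"species": ["cdk1", "cyclin"], "compartment": ["cell"]},
    "model_2": {"species": ["cyclin"], "compartment": ["cell"]},
    "model_3": {"species": ["calcium"], "compartment": ["cytosol"]},
}

if __name__ == "__main__":
    print(rank(MODELS, "species:cdk1, compartment:cell, …"))
```

A real index would also weight fields and terms differently; the count used here only shows why a model matching both fields is listed before one matching a single field.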

Fig.: Waltemath et al (2013) Improving the reuse of computational models through version control. Bioinformatics
Management :: Model version control

Fig.: courtesy Martin Scharm, BudHat, http://sems.uni-rostock.de/budhat

Martin Scharm
Summary :: SEMS projects & Contributions

ensure reproducibility
foster dissemination
improve management

[Figure: annotated graph of the model document “Tyson1991 Cell Cycle 6 var”, repeated from the slide “Management :: Integration of model-related data”]
Thank you for your attention.
Collaborators
Nicolas Le Novère

Christian Rosenke

David Nickerson

Wolfgang Müller

Jonathan Cooper

Falk Schreiber

Jon Olav Vik

SED-ML Editorial Board

Tommy Yu

SBML Editorial Board

HARMONY 2015
Wittenberg
HERMES-Forschungsförderung der Universität Rostock

http://sems.uni-rostock.de

@SemsProject

More Related Content

Similar to Modeling Reproducibility, Dissemination and Management

Data Provenance and Scientific Workflow Management
Data Provenance and Scientific Workflow ManagementData Provenance and Scientific Workflow Management
Data Provenance and Scientific Workflow ManagementNeuroMat
 
Cao report 2007-2012
Cao report 2007-2012Cao report 2007-2012
Cao report 2007-2012Elif Ceylan
 
Environmental Cheminformatics for Unknown ID UC Davis Nov 2018
Environmental Cheminformatics for Unknown ID UC Davis Nov 2018Environmental Cheminformatics for Unknown ID UC Davis Nov 2018
Environmental Cheminformatics for Unknown ID UC Davis Nov 2018Emma Schymanski
 
Journal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific ComputingJournal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific ComputingBram Zandbelt
 
Semantics and linked data at astra zeneca
Semantics and linked data at astra zenecaSemantics and linked data at astra zeneca
Semantics and linked data at astra zenecaKerstin Forsberg
 
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksResults Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksCarole Goble
 
Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium Kerstin Forsberg
 
Frankfurt Big Data Lab & Refugee Projeect
Frankfurt Big Data Lab & Refugee ProjeectFrankfurt Big Data Lab & Refugee Projeect
Frankfurt Big Data Lab & Refugee ProjeectGoethe Univeristy
 
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Kees van Bochove
 
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...Werner Leyh
 
Transparency in the Data Supply Chain
Transparency in the Data Supply ChainTransparency in the Data Supply Chain
Transparency in the Data Supply ChainPaul Groth
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?LEARN Project
 
II-SDV 2013 Text Mining at Work: Critical Assessment of the Completeness and ...
II-SDV 2013 Text Mining at Work: Critical Assessment of the Completeness and ...II-SDV 2013 Text Mining at Work: Critical Assessment of the Completeness and ...
II-SDV 2013 Text Mining at Work: Critical Assessment of the Completeness and ...Dr. Haxel Consult
 
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven ScienceCapturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven Sciencedgarijo
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Jisc
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsCarole Goble
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewPhilip Bourne
 
Force11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapeForce11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapemhaendel
 

Similar to Modeling Reproducibility, Dissemination and Management (20)

Data Provenance and Scientific Workflow Management
Data Provenance and Scientific Workflow ManagementData Provenance and Scientific Workflow Management
Data Provenance and Scientific Workflow Management
 
Model management for systems biology projects
Model management for systems biology projectsModel management for systems biology projects
Model management for systems biology projects
 
Cao report 2007-2012
Cao report 2007-2012Cao report 2007-2012
Cao report 2007-2012
 
Environmental Cheminformatics for Unknown ID UC Davis Nov 2018
Environmental Cheminformatics for Unknown ID UC Davis Nov 2018Environmental Cheminformatics for Unknown ID UC Davis Nov 2018
Environmental Cheminformatics for Unknown ID UC Davis Nov 2018
 
Journal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific ComputingJournal Club - Best Practices for Scientific Computing
Journal Club - Best Practices for Scientific Computing
 
Semantics and linked data at astra zeneca
Semantics and linked data at astra zenecaSemantics and linked data at astra zeneca
Semantics and linked data at astra zeneca
 
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksResults Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
 
Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium
 
Frankfurt Big Data Lab & Refugee Projeect
Frankfurt Big Data Lab & Refugee ProjeectFrankfurt Big Data Lab & Refugee Projeect
Frankfurt Big Data Lab & Refugee Projeect
 
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
 
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
 
Transparency in the Data Supply Chain
Transparency in the Data Supply ChainTransparency in the Data Supply Chain
Transparency in the Data Supply Chain
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
 
II-SDV 2013 Text Mining at Work: Critical Assessment of the Completeness and ...
II-SDV 2013 Text Mining at Work: Critical Assessment of the Completeness and ...II-SDV 2013 Text Mining at Work: Critical Assessment of the Completeness and ...
II-SDV 2013 Text Mining at Work: Critical Assessment of the Completeness and ...
 
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven ScienceCapturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research Objects
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
FAIR data management in biomedicine
FAIR data management  in biomedicineFAIR data management  in biomedicine
FAIR data management in biomedicine
 
Force11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapeForce11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscape
 

More from University Medicine Greifswald

A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...University Medicine Greifswald
 
COMBINE standards & tools: Getting model management right
COMBINE standards & tools: Getting model management rightCOMBINE standards & tools: Getting model management right
COMBINE standards & tools: Getting model management rightUniversity Medicine Greifswald
 
Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...University Medicine Greifswald
 
Model repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityModel repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityUniversity Medicine Greifswald
 
Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...University Medicine Greifswald
 
Using Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology modelsUsing Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology modelsUniversity Medicine Greifswald
 
Identifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational modelsIdentifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational modelsUniversity Medicine Greifswald
 
Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...University Medicine Greifswald
 
Masymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositoriesMasymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositoriesUniversity Medicine Greifswald
 
Possibilities for integrating model-related data in computational biology (DI...
Possibilities for integrating model-related data in computational biology (DI...Possibilities for integrating model-related data in computational biology (DI...
Possibilities for integrating model-related data in computational biology (DI...University Medicine Greifswald
 

More from University Medicine Greifswald (19)

A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...
 
When is a model FAIR – and why should we care?
When is a model FAIR – and why should we care?When is a model FAIR – and why should we care?
When is a model FAIR – and why should we care?
 
COMBINE standards & tools: Getting model management right
COMBINE standards & tools: Getting model management rightCOMBINE standards & tools: Getting model management right
COMBINE standards & tools: Getting model management right
 
Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...
 
Model repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityModel repositories and standard formats for model reusability
Model repositories and standard formats for model reusability
 
2019 07-04-model reuse-bonn
2019 07-04-model reuse-bonn2019 07-04-model reuse-bonn
2019 07-04-model reuse-bonn
 
Mehr Medizininformatik am Meer
Mehr Medizininformatik am MeerMehr Medizininformatik am Meer
Mehr Medizininformatik am Meer
 
Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...
 
Using Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology modelsUsing Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology models
 
Identifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational modelsIdentifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational models
 
Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...
 
Coming Soon: de.NBI and SBGN-ED @ SEMS
Coming Soon: de.NBI and SBGN-ED @ SEMSComing Soon: de.NBI and SBGN-ED @ SEMS
Coming Soon: de.NBI and SBGN-ED @ SEMS
 
Masymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositoriesMasymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositories
 
Possibilities for integrating model-related data in computational biology (DI...
Possibilities for integrating model-related data in computational biology (DI...Possibilities for integrating model-related data in computational biology (DI...
Possibilities for integrating model-related data in computational biology (DI...
 
SEMS: Model search and ranked Retrieval (Ron Henkel)
SEMS: Model search and ranked Retrieval (Ron Henkel)SEMS: Model search and ranked Retrieval (Ron Henkel)
SEMS: Model search and ranked Retrieval (Ron Henkel)
 
Simulation experiment descriptions and management
Simulation experiment descriptions and managementSimulation experiment descriptions and management
Simulation experiment descriptions and management
 
Sems project overview
Sems project overviewSems project overview
Sems project overview
 
Bio-Model Meta-Information and SED-ML
Bio-Model Meta-Information and SED-MLBio-Model Meta-Information and SED-ML
Bio-Model Meta-Information and SED-ML
 
Meta-Information for Bio-Models
Meta-Information for Bio-ModelsMeta-Information for Bio-Models
Meta-Information for Bio-Models
 

Recently uploaded

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 

Recently uploaded (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...

Modeling Reproducibility, Dissemination and Management

  • 1. Reproducibility, dissemination, and management of modeling results 17 February 2014, Braunschweig Dagmar Waltemath http://sems.uni-rostock.de
  • 2. Nature Blogs: Of Schemes and Dreams (2014) Nine Worrying Stats on the Effect of Poor Scientific Data Management
  • 3. Nature Blogs: Of Schemes and Dreams (2014) Nine Worrying Stats on the Effect of Poor Scientific Data Management
  • 4. “We’ve been hearing a common theme from the academic community – researchers are having difficulty managing and accessing their data. It seems to be an ongoing problem for research scientists, at any stage of their careers.” (Nature Blogs: Of Schemes and Dreams (2014) Nine Worrying Stats on the Effect of Poor Scientific Data Management)
  • 6. Outline: reproducibility, dissemination, management. “People can’t share knowledge if they don’t speak a common language” Tom Davenport, Lawrence Prusak (2000) Working Knowledge
  • 7. Reproducible modeling results :: Standards. Model: entities, network of reactions, math. Fig.: Goldbeter (1991), http://www.ncbi.nlm.nih.gov/pubmed/1833774. Annotations: Compartment: Cell GO:0005623; Publication: Goldbeter PMID:1833774; M = inactive CDC2 kinase: UniProt:CDK1a_XENIA (Fig.: BioModels Database); Behavior: Oscillation TEDDY_0000006; Algorithm: Gillespie KiSAO:000029. Protocols. Fig.: BioModels Database
  • 8. Reproducible modeling results :: Towards publication [Figure: five numbered steps, 1 to 5] Following: Waltemath et al (2013) Reproducibility of model-based results in systems biology. Springer
  • 9. Outline: reproducibility, dissemination, management. [Quantitative] models will be only as useful as their access and reuse is easy for all scientists. Nicolas Le Novère (2006) Model storage, exchange and integration. BMC Neuroscience
  • 10. Dissemination :: Model curation and annotation. Fig.: Li et al (2010) BioModels Database: An enhanced, curated and annotated resource for published quantitative kinetic models. BMC Systems Biology
  • 11. Dissemination :: Public model repositories. 1. Higher visibility of research 2. Long-term availability 3. Link to other resources 4. Quality checks. Fig.: Piwowar and Vision (2013) Data reuse and the open data citation advantage. PeerJ
  • 12. Dissemination :: Quality checks with functional curation. Fig.: Example for functional curation on heart model, http://travis.cs.ox.ac.uk/FunctionalCuration/db.html Fig.: Cooper et al (under review) Through models to knowledge with virtual experiments. Martin Scharm
  • 13. Outline: reproducibility, dissemination, management. “And that’s why we need model management.” Following: http://www.indiana.edu/~hperp200/images/WhyWeNeedComputer_thumb.png
  • 14. Management :: Integration of model-related data. Example queries: “Which models are annotated with ‘Adenosine tri-phosphate’?” and “Which models contain reactions with ATP as reactant and ADP as product?” Approach: relations between entities; links to concepts in bio-ontologies; graph store (Neo4J database). [Figure: graph around the document Tyson1991 Cell Cycle 6 var with nodes C2, CP, pM, Cell, and Reaction3, connected by the relations isDescribedBy, is, hasPart, isContainedIn, isVersion, isVersionOf, asReactant, and asProduct to Pubmed: 1831270, Kegg Pathway sce04111, EC-Code: 3.1.3.16, Uniprot:P04551, Interpro: IPR006670, and GO:0005623] Fig.: Henkel et al (2012) Considerations of graph-based concepts to manage of computational biology models and associated simulations. INFORMATIK2012, Braunschweig. Ron Henkel
  • 15. Management :: Integration of model-related data [Figure: graph linking the model document Tyson1991 Cell Cycle 6 var (Pubmed: 1831270, Kegg Pathway sce04111, EC-Code: 3.1.3.16, Uniprot:P04551, Interpro: IPR006670, GO:0005623) to a SEDML document for Tyson_1991 (Modelreference, Simulation, Task, Datagenerator, Output, Variable) and to terms of the KISAO and SBO ontologies, using relations such as isDescribedBy, is_connected, is_mapped_to, and isA] Fig.: Henkel et al (in preparation)
  • 16. Management :: Combination of methods. Keywords describing a model of interest. Workflow labels: search, retrieve, select simulation description, add simulation description to simulation software, simulate, compare with paper. Ranked results include Tyson’91, Novak’97, and Maex’98 (rank 3). Retrieved model: ID: BIOMD000000005; Authors: Tyson JJ.; Date: 13 Sep 2005 12:31:08; Publication: pubmed:1831270; Species: cdc2k, cyclin …; Reaction: cyclin_cdc2k_dissociation, … Simulation description: Name: Tyson’91 ODE plot; Model: BIOMD000000005; Algorithm: ODE solver; Type: time course; Output: plot. [Figure: the model and SEDML graphs from the previous slides, combined with the ranked result list] Fig.: Following Waltemath et al (2013) Reproducibility of model-based results in systems biology. Springer. Henkel et al (2010) Ranked retrieval of Computational Biology models. BMC bioinformatics. Ron Henkel
  • 17. Management :: Provenance. “Give me the best matching model published on the Cell Cycle and considering cdk1.” Lucene: species:cdk1, compartment:cell, … Fig.: Waltemath et al (2013) Improving the reuse of computational models through version control. Bioinformatics
  • 18. Management :: Model version control. Fig.: courtesy Martin Scharm, BudHat, http://sems.uni-rostock.de/budhat Martin Scharm
  • 19. Summary :: SEMS projects & Contributions: foster dissemination, improve management, ensure reproducibility. [Figure: annotated model graph for Tyson1991 Cell Cycle 6 var, as on slide 14]
  • 20. Thank you for your attention. Collaborators: Nicolas Le Novère, Christian Rosenke, David Nickerson, Wolfgang Müller, Jonathan Cooper, Falk Schreiber, Jon Olav Vik, SED-ML Editorial Board, Tommy Yu, SBML Editorial Board. HARMONY 2015 Wittenberg. HERMES-Forschungsförderung der Universität Rostock. http://sems.uni-rostock.de @SemsProject
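
The two example queries on slide 14 (models annotated with a given term, and models containing a reaction with ATP as reactant and ADP as product) can be illustrated with a small in-memory sketch. This is not the Neo4J-based graph store described in the talk; the class names, the model identifiers `model_a` and `model_b`, and the sample annotations are hypothetical and serve only to show the shape of the queries.

```python
from dataclasses import dataclass, field


@dataclass
class Reaction:
    """A reaction node with asReactant / asProduct links to species labels."""
    name: str
    reactants: set = field(default_factory=set)
    products: set = field(default_factory=set)


@dataclass
class Model:
    """A model document with ontology annotations and its reactions."""
    model_id: str
    annotations: set = field(default_factory=set)
    reactions: list = field(default_factory=list)


def models_annotated_with(models, term):
    """Return ids of models carrying the given annotation term."""
    return [m.model_id for m in models if term in m.annotations]


def models_with_reaction(models, reactant, product):
    """Return ids of models with a reaction that consumes `reactant`
    and produces `product`."""
    return [
        m.model_id
        for m in models
        if any(reactant in r.reactants and product in r.products
               for r in m.reactions)
    ]


# Hypothetical sample data (not taken from BioModels Database).
MODELS = [
    Model(
        model_id="model_a",
        annotations={"Adenosine tri-phosphate", "GO:0005623"},
        reactions=[Reaction("phosphorylation", {"ATP", "substrate"},
                            {"ADP", "substrate_P"})],
    ),
    Model(
        model_id="model_b",
        annotations={"GO:0005623"},
        reactions=[Reaction("synthesis", {"ADP", "Pi"}, {"ATP"})],
    ),
]

if __name__ == "__main__":
    print(models_annotated_with(MODELS, "Adenosine tri-phosphate"))
    print(models_with_reaction(MODELS, "ATP", "ADP"))
```

In a graph database the second query would be expressed as a pattern over reactant and product edges; the list comprehension here plays the same role on plain Python objects.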