The presentation discussed the future of scientific communication, noting that open access to digital data and methods will be key to enabling reproducibility and maximizing data integration to advance discovery, but that both technological innovations and changes to incentives and rewards systems will be needed to fully realize more open and interactive models of sharing research.
2024 02 15 AZ GOP LD4 Gen Meeting Minutes_FINAL_20240228.docx
Future of Research Communication 2011
1. The Future of Research
Communication
Judith Blake
The Jackson Laboratory
2. My Perspective
Future open access to digital data will speed discovery
Data generation is getting easier, data analysis is
getting harder, we are drowning in data
Key to scientific discourse is the ability to reproduce
and verify results - currently difficult for computational
results that do not include code and data upon
publication
Current „academic‟ publication and rewards systems
are inadequate for measuring scientific contributions
Digital data repositories, open access
publications, electronic journals, and semantic web
enhancements will all contribute to the success of
future of science communications
Rocky ISBC 12/8/11
3. Outline
1. Improving knowledge communication
- Vision: What are the communication functionalities
needed?
- Technology: What are the tools for doing this?
2. Impacting our world
- Social Aspects: How do we quantify impact of
use/reward system?
- Coolness: How do we make it attractive to do/ use?
3. Overcoming obstacles
-Financial Considerations: How do we make is
sustainable?
-Getting the ball rolling: How do we start? Rocky ISBC 12/8/11
6. Managing
Biological
Information is
Nothing New
Roxy Laybourne and
others, photo by Chip Clark
Rocky ISBC 12/8/11
Bird Collections at the Smithsonian Natural History Museum
7. TCTCTCCCCCGCCCCCCAGGCTCCCCCGGTCGCTCTCCTCCGGCGGTCGCCCGCGCTCGGTGGATGTGGC
From the birth of the field of genetics until a decade ago,
TGGCAGCTGCCGCCCCCTCCCTCGCTCGCCGCCTGCTCTTCCTCGGCCCTCCGCCTCCTCCCCTCCTCCT it was generally assumed
TCTCGTCTTCAGCCGCTCCTCTCGCCGCCGCCTCCACAGCCTGGGCCTCGCCGCGATGCCGGAGAAGAGG its function. In the vast
that the parental origin of a gene could have no effect on
CCCTTCGAGCGGCTGCCTGCCGATGTCTCCCCCATCAACTACAGCCTTTGCCTCAAGCCCGACTTGCTGG this paradigm has appeared
majority of studies carried out during the last 90 years,
to hold true. However, with increasingly sophisticated genetic and embryological
ACTTCACCTTCGAGGGCAAGCTGGAGGCCGCCGCCCAGGTGAGGCAGGCGACTAATCAGATTGTGATGAA
investigations in the mouse, important exceptions to this rule have been uncovered
TTGTGCTGATATTGATATTATTACAGCTTCATATGCACCAGAAGGAGATGAAGAAATACATGCTACAGGA
over the last decade. First, the results of nuclear transplantation experiments carried
TTTAACTATCAGAATGAAGATGAAAAAGTCACCTTGTCTTTCCCTAGTACTCTGCAAACAGGTACGGGAA
out with single-cell fertilized embryos have demonstrated an absolute requirement for
CCTTAAAGATAGATTTTGTTGGAGAGCTGAATGACAAAATGAAAGGTTTCTATAGAAGTAAATATACTAC
both a maternally-derived and a paternally-derived pronculeus to allow full-term
CCCTTCTGGAGAGGTGCGCTATGCTGCTGTAACACAGTTTGAGGCTACTGATGCCCGAAGGGCTTTTCCT
development (McGrath and Solter, 1983). Second, in animals that receive both
TGCTGGGATGAGCCTGCTATCAAAGCAACTTTTGATATCTCATTGGTTGTTCCTAAAGACAGAGTAGCTT
homologs of certain chromosomes or subchromosomal regions from one parent and
TATCAAACATGAATGTAATTGACCGGAAACCATACCCTGATGATGAAAATTTAGTGGAAGTGAAGTTTGC
not the other (through the mating of translocation heterozygotes as described in
CCGCACACCTGTTATGTCTACATATCTGGTGGCATTTGTTGTGGGTGAATATGACTTTGTAGAAACAAGGobserved including enhanced
Section 5.2.3), dramatic effects on development can be
TCAAAAGATGGTGTGTGTGTCCGTGTTTACACTCCTGTTGGCAAAGCAGAGCAAGGAAAATTTGCGTTAG Kirk, 1985). Third, either of
or retarded growth and outright lethality (Cattanach and
two deletions that cover a small region of mouse chromosome 17 can be transmitted
AGGTTGCTGCTAAAACCTTGCCTTTTTATAAGGACTACTTCAATGTTCCTTATCCTCTACCTAAAATTGA
TCTCATTGCTATTGCAGACTTTGCAGCTGGTGCCATGGAGAACTGGGGCCTTGTTACTTATAGGGAGACTdeletions cause prenatal
normally from a father to his offspring, but these same
lethality when they are maternally transmitted (Johnson, 1974; Winking and
GCATTGCTTATTGATCCAAAAAATTCCTGTTCTTCATCCCGCCAGTGGGTTGCTCTGGTTGTGGGACATG
Silver, 1984). Fourth, similar parent-of-origin effects have been observed on the
AACTCGCCCATCAATGGTTTGGAAATCTTGTTACTATGGAATGGTGGACTCATCTTTGGTTAAATGAAGG
TTTTGCATCCTGGATTGAATATCTGTGTGTAGACCACTGCTTCCCAGAGTATGATATTTGGACTCAGTTT knock-out allele at the Igf2
phenotypes expressed by animals that carry a targeted
locus (DeChiara et al., 1991). Finally, molecular techniques have been used to directly
GTTTCTGCTGATTACACCCGTGCCCAGGAGCTTGACGCCTTAGATAACAGCCATCCTATTGAAGTCAGTG
The trouble with facts is that there are so many of
demonstrate the expression of transcripts from one parental allele and not the other
TGGGCCATCCATCTGAGGTTGATGAGATATTTGATGCTATATCATATAGCAAAGGTGCATCTGTCATCCG
at the Igf2r locus (Barlow et al., 1991) and the H19 locus (Bartolomei et al., 1991). The
them.
AATGCTGCATGACTACATTGGGGATAAGGACTTTAAGAAAGGAATGAACATGTATTTAACCAAGTTCCAA
accumulated data indicate that a subset of mouse genes (on the order of 0.2%) will
CAAAAGAATGCTGCCACAGAGGATCTCTGGGAAAGTTTAGAAAATGCTAGTGGTAAACCTATAGCAGCTG The Gentle Reader (1903)
Samuel Crothers:
function differently in normal embryos depending on whether they have been
GTTTCTGCTGATTACACCCGTGCCCAGGAGCTTGACGCCTTAGATAACAGCCATCCTATTGAAGTCAGTG
inherited through the male or the female gamete, such that one allele will be expressed
TGGGCCATCCATCTGAGGTTGATGAGATATTTGATGCTATATCATATAGCAAAGGTGCATCTGTCATCCG
and the other will be silent. Genomic imprinting is the term that has been coined to
AATGCTGCATGACTACATTGGGGATAAGGACTTTAAGAAAGGAATGAACATGTATTTAACCAAGTTCCAA by a gene varies depending
describe this situation in which the phenotype expressed
on its parental origin (Sapienza, 1989). Further experiments have demonstrated
CAAAAGAATGCTGCCACAGAGGATCTCTGGGAAAGTTTAGAAAATGCTAGTGGTAAACCTATAGCAGCTG
that, in general, the "imprint" is erased and regenerated during gametogenesis so
that the function of an imprintable gene is fully determined by the sex of its
Rocky ISBC 12/8/11
progenitor alone, and not by earlier ancestors.
9. The knowledge is in
the details
Curators use controlled
terms from structured
vocabularies
(ontologies) to annotate
complex biological
systems described in
the literature
Rocky ISBC 12/8/11
10. Crash Blossoms
and other semantic ambiguities
translating what we say into
what we mean: data, words
and knowledge
“Violinist Linked to JAL Crash
Blossoms”
“Squad Helps Dog Bite
“MacArthur Flies Back toVictim”
Front”
“Red Tape Holds Up New
Bridge.” Rocky ISBC 12/8/11
11. Today there are many biomedical ontologies…
Open Biomedical
Ontologies
http://www.obofoundry.org/
The „s link to the
term request
trackers for the
listed ontologies.
Rocky ISBC 12/8/11
12. Something very important and very weird is happening to the book right
now: It‟s shedding its papery corpus and transmigrating into a bodiless
digital form, right before our eyes. We‟re witnessing the bibliographical
equivalent of the rapture. If anything we may be lowballing the weirdness
of it all. Rocky ISBC 12/8/11
Lev Grossman, NYTimes Book Review Sept
14. …..into the Future
The current back-propagation of biomedical literature
of semantic integration using ontologies is not scalable
to necessary level of granularity and context needed
A key element of data integration is the mark-up of
data at the time of generation
Reasoned Argument communication includes
providing methods and data to enable
reproducibility, and requires
open access to the semantically enriched
discussion, machine-readable metadata, accessible
datasets, peer review discussions, and possibility of
testing for reproducibility Rocky ISBC 12/8/11
15. 1 - Improving Knowledge
Communication
What are functionalities needed
detail methods, both wet and dry
provide data with appropriate metadata
support interactive results, i.e., tables and figures
track metrics of utility, usage, and impact
Technology
electronic lab books that are easy and functional
data collection and repositories that provide standards
and persistence
new models for data interconnections
real-time metrics available
Rocky ISBC 12/8/11
16. What‟s Changing - 2
Peer Review
Journal Impact (the Myth of Impact Factors)
Supplementary Material (i.e., the DATA) missing
and or incomplete
Metrics of Impact of Research
Persistence of Data
Rocky ISBC 12/8/11
17. Peer Review-1 – bad review
hard to engage expert reviews for multi-component
research
The Scientist
Rocky ISBC 12/8/11
18. Large computation analysis
require multiple coordinated
reviews
Rocky ISBC 12/8/11
Slide from Carol Bult
19. Peer Review – bad review 2
Computation analysis verity depends on data input
Ascertainment bias refers to a systematic
distortion in measuring the true frequency of
Faculty of
way in which 12/8/1
a phenomenon due to the Rocky ISBC the
1000 1
data are collected.
20. We need to enable
reproducibility
Rocky ISBC 12/8/11
22. Research data is simply not
available
50 highest impact research journals
1st 10 original research articles of 2009
88% of journals had „some‟ statement on sharing
of data
50% of articles did not meet journal standards
9% of articles in full compliance
algorithms and meta-date required for
reproducibility not required by any journal
Rocky ISBC 12/8/11
Alsheikh-Ali et al., PLoS One 2011;6(9):e24357. Epub 2011
Sep 7.
24. Changing Incentives to Publish - 2
Nature Medicine 16, 744 (2010) doi:10.1038/nm0710-744
Rocky ISBC 12/8/11
25. 2 - Impacting Our World
Open Access to data within supportive
environment accelerates knowledge
discovery
Social Aspects:
How do we quantify impact of use/reward
system?
Coolness:
How do we make it attractive to do/ use?
Rocky ISBC 12/8/11
27. Interactive Communication
Imbedded links to data for figures
Immediate access to referenced material
Interactive community
Blogs and commentary
Analytics of impact
Rocky ISBC 12/8/11
33. Key Points
Communication of research methods, data and results is
changing.
Access to research is limited outside northern hemisphere -
institutional access missing – thus limiting the globalization of
science.
Utility of research depends on inter-connections and access to
data and results; cloud-sourcing of science imminent.
The reward mechanism (tenure /cash) via publication record is
changing, but slowly for most of us.
The business model of scientific publishing is under great stress.
Government investment mechanisms for support and sharing of
science endeavors are under intense discussion.
New research communication mechanisms are coming; some
are already here. Rocky ISBC 12/8/11
34. Summary
We must continue to support the Reasoned Argument
- in context of massive amounts of digital data, this includes
comprehensive data access to maximize data integration
and enable reproducibility
The necessary upheaval in scientific communication
requires both technological and sociological
innovations
YOU can be part of this sea change in research
communication
Rocky ISBC 12/8/11
36. Acknowledgements
MGI PIs
Carol Bult, Janan Eppig, Jim Kadin, Joel Richardson, Martin Ringwald
GO Consortium PIs and Council
Michael Ashburner, Mike Cherry, Suzanna Lewis, Paul Thomas, Paul
Sternberg;
Rolf Apweiler, Rex Chisholm, Eva Huala
GO @ MGI
Alex Diehl, Mary Dolan, David Hill,
Li Ni, Harold Drabkin, Li Ni,
Dmitry Sitnikov
Funding: NIH-NHGRI P-41 grants to MGI and GOC; GM080646 to
PRO Rocky ISBC 12/8/11
Notas do Editor
Anita de WaardDisruptive Technologies Director, Elsevier LabsTim Clark Harvard neuroscientist, alzforum bioinformaticsIvan Herman Centrum Wiskunde & Informatica (CWI) performs fundamental scientific research in mathematics and computer science and is located in Amsterdam, the ...Eduard Hovy – information science instituteISI now Thomas Reuters
Gather Data, integrate data, reason over data, communicate results, reproducibility, scientific debate1860 British Association debate on the Theory of Evolution between Thomas Huxley and Samuel Witherspoon
in fact I would argue that
The Semantic Web architecture grounds itself on available standards for referingto entities, viz. Uniform resource identifiers (URIs) (Berners-Lee et al., 1998)2, andencoding of character symbols, i.e. Unicode (The Unicode Consortium, 2003), andreuses existing Web technologies like the Extensible Markup Language (XML) (Brayet al., 2000) for syntactic purposes. The core layers of the architecture, viz. XML,RDF, Ontology and Logic will be the subject of the subsequent sections on Syntax(Section 3.2 on page 46), Data (Section 3.3 on page 49), Semantics (Section 3.4 onpage 60) and Rules (Section 3.5 on page 70). The top layers providing Proof andTrust are starting to be addressed by research today.3 Consequently, we cannotgive an explicit account of these layers but only describe their intention. Accordingto (Berners-Lee, 1998), the ability to check the validity of statements made in the(Semantic) Web is important. Therefore the creators of statements should be ableto provide a proof of correctness of the statement which is verifiable by a machine.http://digbib.ubka.uni-karlsruhe.de/eva/2004/wiwi/2&search=manifest&format=1&page=55Web ontology reasoning with logic databases Volz, Raphael
China offers Cash Incentives to authors in elite journalsItaly, others offers support to institutions based on journals of publications…