SlideShare a Scribd company logo
1 of 57
The Reaming of Life Philip E. Bourne University of California San Diego pbourne@ucsd.edu Jim Gray eScience Award Lecture Oct. 12, 2010
Disclaimer I am a domain (life) scientist not a computer or information scientist I am fortunate enough to have a major biological resource (the Protein Data Bank) and a major biological journal (PLoS Computational Biology) as my playground I am part of the long tail I am naïve, but I am the majority Oct. 12, 2010 Jim Gray eScience Award Lecture
The Reaming of Life -What on Earth is He Talking About? A reamer is a tool for turning a roughly punched hole into an accurate and smooth one The digital data deluge has punched that rough hole For the life {other?} sciences to optimally advance we need an accurate and smooth conduit through which data can be distilled, analyzed, visualized, distributed and above all else comprehended Oct. 12, 2010 Jim Gray eScience Award Lecture
… and we need to accelerate the process by which this is donehere is why …. This is just another way of saying what Jim said and is embodied in the Fourth Paradigm Oct. 12, 2010 Jim Gray eScience Award Lecture
The Scientific Process is Too Slow to Respond to a Crisis – Either Global or Personal Oct. 12, 2010 Jim Gray eScience Award Lecture By the time the paper is published  we could all be dead http://knol.google.com/k/plos-currents-influenza# Motivation
In a time of crisis the need for fast access  to accurate data and any knowledge of that data are paramount Structure Summary page activity for H1N1 Influenza related structures Jan. 2008 Jan. 2009 Jan. 2010 Jul. 2009 Jul. 2008 Jul. 2010 3B7E: Neuraminidase of A/Brevig Mission/1/1918  H1N1 strain in complex with zanamivir 1RUZ: 1918 H1 Hemagglutinin * http://www.cdc.gov/h1n1flu/estimates/April_March_13.htm Motivation
If that is not enough…For some people the scientific process may be too slow to save their life Oct. 12, 2010 Jim Gray eScience Award Lecture Motivation
Josh Sommer – A Remarkable Young ManCo-founder & Executive Director the Chordoma Foundation Oct. 12, 2010 Jim Gray eScience Award Lecture http://sagecongress.org/Presentations/Sommer.pdf Motivation
Chordoma A rare form of brain cancer No known drugs Treatment – surgical resection followed by intense radiation therapy Oct. 12, 2010 Jim Gray eScience Award Lecture http://upload.wikimedia.org/wikipedia/commons/2/2b/Chordoma.JPG Motivation
Oct. 12, 2010 Jim Gray eScience Award Lecture http://sagecongress.org/Presentations/Sommer.pdf Motivation
Oct. 12, 2010 Jim Gray eScience Award Lecture http://sagecongress.org/Presentations/Sommer.pdf Motivation
Oct. 12, 2010 Jim Gray eScience Award Lecture http://sagecongress.org/Presentations/Sommer.pdf Motivation
Oct. 12, 2010 Jim Gray eScience Award Lecture If I have seen further it is only by standing on the shoulders of giants Isaac Isaac Newton From Josh’s point of view the climb  up just takes too long > 15 years and > $850M to be  more precise Adapted: http://sagecongress.org/Presentations/Sommer.pdf Motivation
Oct. 12, 2010 Jim Gray eScience Award Lecture http://sagecongress.org/Presentations/Sommer.pdf Motivation
Oct. 12, 2010 Jim Gray eScience Award Lecture http://sagecongress.org/Presentations/Sommer.pdf Motivation
Oct. 12, 2010 Jim Gray eScience Award Lecture http://fora.tv/2010/04/23/Sage_Commons_Josh_Sommer_Chordoma_Foundation Motivation
Now we are all hopefully motivated let us break this down to what actually needs to be done in my opinion Here are a few big things … Oct. 12, 2010 Jim Gray eScience Award Lecture
A Few Things to Accelerate the Rate of Scientific Discovery Better communication, data and knowledge access, and new modes of discovery, which means: We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives We need to be more open with both We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery Reward systems need to change We need scientist management tools We need to be less fixated on the big data problems We need to unleash the full power of the Internet Oct. 12, 2010 Jim Gray eScience Award Lecture Hard Easy
We Need Data and Knowledge About That Data to Interoperate The Knowledge and Data Cycle 0. Full text of PLoS papers stored  in a database 4. The composite view has links to pertinent blocks  of literature text and back to the PDB User clicks on content Metadata and webservices to data provide an interactiveview that can be annotated Selecting features provides a data/knowledge mashup Analysis leads to new content I can share 4. 1. 3. A composite view of journal and database content results 1. A link brings up figures  from the paper 3. 2. 2. Clicking the paper figure retrieves data from the PDB which is analyzed PLoS Comp. Biol. 2005 1(3) e34
We Need Data and Knowledge About That Data to Interoperate – What is Stopping US? Governance – publishers vs. database providers Reward Metadata standards for provenance, privacy etc. Exemplars  …. Oct. 12, 2010 Jim Gray eScience Award Lecture Caveat: Each discipline is different – I speak very much from a biomedical sciences perspective
Certainly the Argument for Interoperability in the Biomedical Sciences is Strong 1078 databases reported in NAR 2008 MetaBase http://biodatabase.org reports 2,651 entries edited 12,587 times PubMed contains 18,792,257 entries ~100,000 papers indexed per month In Feb 2009: 67,406,898 interactive searches were done 92,216,786 entries were viewed Data as of April 14, 2009 We need data and knowledge about that data to interoperate PLoS Comp. Biol. 2005 1(3) e34
A Small Example - The World Wide Protein Data Bank The single worldwide repository for data on the structure of biological macromolecules Vital for drug discovery and the life sciences 39 years old Free to all http://www.wwpdb.org We need data and knowledge about that data to interoperate PLoS Comp. Biol. 2005 1(3) e34
The World Wide Protein Data Bank – The Best Case Scenario Paper not published unless data are deposited – strong data to literature correspondence Highly structured data conforming to an extensive ontology DOI’s assigned to every structure http://www.wwpdb.org	 We need data and knowledge about that data to interoperate PLoS Comp. Biol. 2005 1(3) e34
Example Interoperability: The Database View www.rcsb.org/pdb/explore/literature.do?structureId=1TIM We need data and knowledge about that data to interoperate BMC Bioinformatics 2010 11:220
Example Interoperability: The Literature Viewhttp://biolit.ucsd.edu Nucleic Acids Research 2008 36(S2) W385-389 We need data and knowledge about that data to interoperate
ICTP Trieste, December 10, 2007 We need data and knowledge about that data to interoperate
Semantic Tagging & Widgets are a Powerful Tool to Integrate Data and Knowledge of that Data, But as Yet Not Used Much Oct. 12, 2010 Jim Gray eScience Award Lecture Will Widgets and Semantic Tagging Change Computational Biology?  PLoS Comp. Biol. 6(2) e1000673 We need data and knowledge about that data to interoperate
Semantic Tagging of Database Content in The Literature or Elsewhere http://www.rcsb.org/pdb/static.do?p=widgets/widgetShowcase.jsp PLoS Comp. Biol. 6(2) e1000673 Semantic Tagging
We need data and knowledge about that data to interoperate
The Publishers are Starting to Do It Oct. 12, 2010 Jim Gray eScience Award Lecture From Anita de Waard, Elsevier
This is Literature Post-processingBetter to Get the Authors Involved Authors are the absolute experts on the content More effective distribution of labor Add metadata before the article enters the publishing process We need data and knowledge about that data to interoperate
Word 2007 Add-in for authors Allows authors to add metadata as they write, before they submit the manuscript Authors are assisted by automated term recognition OBO ontologies Database IDs Metadata are embedded directly into the manuscript document via XML tags, OOXML format Open Machine-readable Open source, Microsoft Public License http://www.codeplex.com/ucsdbiolit We need data and knowledge about that data to interoperate
Challenges Authors  Carrot IF one or more publishers fast tracked a paper that had semantic markup it might catch on Publishers Carrot Competitive advantage We need data and knowledge about that data to interoperate
The Promise – A Hypothetical Example Cardiac Disease Literature Immunology Literature Shared Function We need data and knowledge about that data to interoperate
A Few Things to Accelerate the Rate of Scientific Discovery Better communication, data and knowledge access, and new modes of discovery, which means: We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives We need to be more open with both We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery Reward systems need to change We need scientist management tools We need to be less fixated on the big data problems We need to unleash the full power of the Internet Oct. 12, 2010 Jim Gray eScience Award Lecture Hard Easy
One Small Example – The Molecular Biology Toolkit (MBT) jMol, VMD … are de facto standard important tools for rendering biological molecules .. but They are not versatile ie do not for example: Respond to the data they are reading Offer views that match the users interests Allow the user to annotate the data Allow those annotations to be shared (published?) Oct. 12, 2010 Jim Gray eScience Award Lecture Think More About the Tools
MBT Featureshttp://mbt.sdsc.edu Offer a framework not an end user application Responds to the data type Support read write access Encourages others to write end user applications Discourages feature creep Oct. 12, 2010 Jim Gray eScience Award Lecture Immunologists Immunome Research, 2007 3(1):3  Medicinal Chemists BMC Bioinformatics 2005, 6:21. Think More About the Tools
A Few Things to Accelerate the Rate of Scientific Discovery Better communication, data and knowledge access, and new modes of discovery, which means: We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives We need to be more open with both We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery Reward systems need to change We need scientist management tools We need to be less fixated on the big data problems We need to unleash the full power of the Internet Oct. 12, 2010 Jim Gray eScience Award Lecture Hard Easy
Reward Systems Need to ChangeWhat is Needed? Author disambiguation Auditing (identification and metrics) of all scholarship - means new tools Seniors need to promote alternative forms of scholarship Juniors need to respond Oct. 12, 2010 Jim Gray eScience Award Lecture Ten Simple Rules for Getting Promoted as a Computational Biologist in Academia  PLoS Comp Biol to appear Reward Systems Need to Change
Example Tools Oct. 12, 2010 Jim Gray eScience Award Lecture http://www.researcherid.com/ http://pubnet.gersteinlab.org/ http://www.biomedexperts.com
What Are these Alternative Forms of Scholarship? Reviews Curation Research [Grants] Journal Article Poster Session Conference Paper Blogs Community Service/Data Reward Systems Need to Change
Reward Systems Need to Change
A Unique Identifier is Going to Happen  It is DOIs for people Some scientists will resist The winner is ORCHID? Reward Systems Need to Change
Ideally the ID will be Tagged to Every Piece of Scholarly Communication I an Not a Scientist I am a Number PLoS Comp. Biol. 2008 4(12) e1000247 Reward Systems Need to Change
A Few Things to Accelerate the Rate of Scientific Discovery Better communication, data and knowledge access, and new modes of discovery, which means: We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives We need to be more open with both We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery Reward systems need to change We need scientist management tools We need to be less fixated on the big data problems We need to unleash the full power of the Internet Oct. 12, 2010 Jim Gray eScience Award Lecture Hard Easy
The Truth About My Laboratory I have ?? mail folders! The intellectual memory of my laboratory is in those folders This is an unhealthy hub and spoke mentality We Need Scientist Management Tools
The Truth About My Laboratory I generate way more negative that positive data, but where is it?  Content management is a mess Slides, posters….. Data, lab notebooks …. Collaborations, Journal clubs … Software is open but where is it? Farewell is for the data too http://artbyvida.com/portfolio.php Computational Biology Resources Lack Persistence and Usability. PLoS Comp. Biol. 2008 4(7): e1000136 We Need Scientist Management Tools
Many Great Tools Out There Oct. 12, 2010 Jim Gray eScience Award Lecture Taverna We Need Scientist Management Tools
Where I See the Problems The long tail is confused Lack of interoperability between the options The reward (publishing) is still removed from the available tools Oct. 12, 2010 Jim Gray eScience Award Lecture We Need Scientist Management Tools
A Few Things to Accelerate the Rate of Scientific Discovery Better communication, data and knowledge access, and new modes of discovery, which means: We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives We need to be more open with both We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery Reward systems need to change We need scientist management tools We need to be less fixated on the big data problems We need to unleash the full power of the Internet Oct. 12, 2010 Jim Gray eScience Award Lecture Hard Easy
Yes YouTube Can Increase the Rate of Discovery Unleash the full power of the Internet
The Lab ExperimentPaper+Rich Media My students enjoyed the experience The shyest student was actually the most bold in front of the camera “We will become a generation of “science castors” They liked the exposure for the most part – rather than the PI it puts them out in front Unleash the full power of the Internet
Organic Growth 3 Years Later www.scivee.tv Some of their work viewed 20,000+ times Global audience of researchers, educators and academic/research institutions 60,000 unique visitors & 2M pageviews/month 16,000 registered users & 600 communities 5,000 uploads of video content (about journal articles, conferences, research news and classes) Growing 4-5% monthly Sustainability - evolving a business model supporting journals and conferences Unleash the full power of the Internet
What Emerged: SciveeCasts Products ApplicationProductPrimary Customers Journals	PubCastJournals, publishers, societies Meetings	PosterCast	Societies, conference orgs. SlideCast Comm.	PaperCast	Societies, journals Podcast SlideCast Education	PosterCast	Societies, universities SlideCast BooksBookCast	Publishers, book sellers Unleash the full power of the Internet
Summarizing the Reaming of Life By “Life” I mean experiences in the Life Sciences By “Reaming” I mean the the making of something smooth, fast and accurate The Monty Python parody is on conversation cards for getting a dialog going ..  The rest is just a few examples of the small ways we are trying to address big problems in the hope they will inspire us all to think more deeply about the problem Oct. 12, 2010 Jim Gray eScience Award Lecture
Acknowledgements BioLit Team Lynn Fink Parker Williams Marco Martinez RahulChandran Greg Quinn MBT John Moreland John Beaver Microsoft Scholarly Communications Pablo Fernicola Lee Dirks SavasParastitidas Alex Wade Tony Hey wwPDB team Andreas Prilc DimitrisDimitropoulos SciVee Team Apryl Bailey Leo Chalupa Lynn Fink Marc Friedman (CEO) Ken Liu Alex Ramos Willy Suwanto Ben Yukich http://www.scivee.tv http://biolit.ucsd.edu http//www.pdb.org http://www.codeplex.com/ucsdbiolit
pbourne@ucsd.edu Questions?

More Related Content

What's hot

Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data ManagementAmanda Whitmire
 
The Brain Imaging Data Structure and its use for fNIRS
The Brain Imaging Data Structure and its use for fNIRSThe Brain Imaging Data Structure and its use for fNIRS
The Brain Imaging Data Structure and its use for fNIRSRobert Oostenveld
 
Donders neuroimage toolkit - open science and good practices
Donders neuroimage toolkit -  open science and good practicesDonders neuroimage toolkit -  open science and good practices
Donders neuroimage toolkit - open science and good practicesRobert Oostenveld
 
TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'
TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'
TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'TERN Australia
 
Data curation issues for repositories
Data curation issues for repositoriesData curation issues for repositories
Data curation issues for repositoriesChris Rusbridge
 
20160719 23 Research Data Things
20160719 23 Research Data Things20160719 23 Research Data Things
20160719 23 Research Data ThingsKatina Toufexis
 
20160523 23 Research Data Things
20160523 23 Research Data Things20160523 23 Research Data Things
20160523 23 Research Data ThingsKatina Toufexis
 
Research Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsResearch Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsAaron Collie
 
CuttingEEG - Open Science, Open Data and BIDS for EEG
CuttingEEG - Open Science, Open Data and BIDS for EEGCuttingEEG - Open Science, Open Data and BIDS for EEG
CuttingEEG - Open Science, Open Data and BIDS for EEGRobert Oostenveld
 
Introduction to data management
Introduction to data managementIntroduction to data management
Introduction to data managementCunera Buys
 
Donders Repository - removing barriers for management and sharing of research...
Donders Repository - removing barriers for management and sharing of research...Donders Repository - removing barriers for management and sharing of research...
Donders Repository - removing barriers for management and sharing of research...Robert Oostenveld
 
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...Amanda Whitmire
 
Basics of Research Data Management
Basics of Research Data ManagementBasics of Research Data Management
Basics of Research Data ManagementOpenAIRE
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)James Hendler
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide WebJames Hendler
 
BIOMAG2018 - Denis Engemann - MNE-HCP
BIOMAG2018 - Denis Engemann - MNE-HCPBIOMAG2018 - Denis Engemann - MNE-HCP
BIOMAG2018 - Denis Engemann - MNE-HCPRobert Oostenveld
 
Public data archiving: Who does? Who doesn't? What can we do about it?
Public data archiving: Who does?  Who doesn't?  What can we do about it?Public data archiving: Who does?  Who doesn't?  What can we do about it?
Public data archiving: Who does? Who doesn't? What can we do about it?Heather Piwowar
 
Organizing EEG data using the Brain Imaging Data Structure
Organizing EEG data using the Brain Imaging Data Structure Organizing EEG data using the Brain Imaging Data Structure
Organizing EEG data using the Brain Imaging Data Structure Robert Oostenveld
 

What's hot (20)

Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
 
The Brain Imaging Data Structure and its use for fNIRS
The Brain Imaging Data Structure and its use for fNIRSThe Brain Imaging Data Structure and its use for fNIRS
The Brain Imaging Data Structure and its use for fNIRS
 
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
 
Donders neuroimage toolkit - open science and good practices
Donders neuroimage toolkit -  open science and good practicesDonders neuroimage toolkit -  open science and good practices
Donders neuroimage toolkit - open science and good practices
 
TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'
TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'
TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'
 
Data curation issues for repositories
Data curation issues for repositoriesData curation issues for repositories
Data curation issues for repositories
 
20160719 23 Research Data Things
20160719 23 Research Data Things20160719 23 Research Data Things
20160719 23 Research Data Things
 
20160523 23 Research Data Things
20160523 23 Research Data Things20160523 23 Research Data Things
20160523 23 Research Data Things
 
Research Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsResearch Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering Students
 
CuttingEEG - Open Science, Open Data and BIDS for EEG
CuttingEEG - Open Science, Open Data and BIDS for EEGCuttingEEG - Open Science, Open Data and BIDS for EEG
CuttingEEG - Open Science, Open Data and BIDS for EEG
 
Introduction to data management
Introduction to data managementIntroduction to data management
Introduction to data management
 
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
 
Donders Repository - removing barriers for management and sharing of research...
Donders Repository - removing barriers for management and sharing of research...Donders Repository - removing barriers for management and sharing of research...
Donders Repository - removing barriers for management and sharing of research...
 
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
 
Basics of Research Data Management
Basics of Research Data ManagementBasics of Research Data Management
Basics of Research Data Management
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide Web
 
BIOMAG2018 - Denis Engemann - MNE-HCP
BIOMAG2018 - Denis Engemann - MNE-HCPBIOMAG2018 - Denis Engemann - MNE-HCP
BIOMAG2018 - Denis Engemann - MNE-HCP
 
Public data archiving: Who does? Who doesn't? What can we do about it?
Public data archiving: Who does?  Who doesn't?  What can we do about it?Public data archiving: Who does?  Who doesn't?  What can we do about it?
Public data archiving: Who does? Who doesn't? What can we do about it?
 
Organizing EEG data using the Brain Imaging Data Structure
Organizing EEG data using the Brain Imaging Data Structure Organizing EEG data using the Brain Imaging Data Structure
Organizing EEG data using the Brain Imaging Data Structure
 

Viewers also liked

Where Have We Been & Where Are We Going?
Where Have We Been & Where Are We Going?Where Have We Been & Where Are We Going?
Where Have We Been & Where Are We Going?Philip Bourne
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Philip Bourne
 
Mongodb Meetup Group 052012
Mongodb Meetup Group 052012Mongodb Meetup Group 052012
Mongodb Meetup Group 052012scrazzl
 
Highlights from NIH Data Science
Highlights from NIH Data ScienceHighlights from NIH Data Science
Highlights from NIH Data SciencePhilip Bourne
 
scrazzl, our journey in slides
scrazzl, our journey in slidesscrazzl, our journey in slides
scrazzl, our journey in slidesscrazzl
 
From Where Have We Come & Where Are We Going
From Where Have We Come & Where Are We GoingFrom Where Have We Come & Where Are We Going
From Where Have We Come & Where Are We GoingPhilip Bourne
 
Open Science: Some Possible Actions by University Leaders on Behalf of Resear...
Open Science:Some Possible Actions by University Leaders on Behalf of Resear...Open Science:Some Possible Actions by University Leaders on Behalf of Resear...
Open Science: Some Possible Actions by University Leaders on Behalf of Resear...Philip Bourne
 
The Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataThe Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataPhilip Bourne
 
Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectivePhilip Bourne
 
reaming n tapping
reaming n tappingreaming n tapping
reaming n tappingRina Eryani
 
Operation research complete
Operation research completeOperation research complete
Operation research completeRohit Mishra
 
There is No Intelligent Life Down Here
There is No Intelligent Life Down HereThere is No Intelligent Life Down Here
There is No Intelligent Life Down HerePhilip Bourne
 
Ten Simple Rules for Open Access Publishers
Ten Simple Rules for Open Access PublishersTen Simple Rules for Open Access Publishers
Ten Simple Rules for Open Access PublishersPhilip Bourne
 
Towards Accelerating Innovation at UCSD
Towards Accelerating Innovation at UCSDTowards Accelerating Innovation at UCSD
Towards Accelerating Innovation at UCSDPhilip Bourne
 
Bioinformatics in the Bourne Lab.
Bioinformatics in the Bourne Lab.Bioinformatics in the Bourne Lab.
Bioinformatics in the Bourne Lab.Philip Bourne
 
Professional Development Lecture 1 2011
Professional Development Lecture 1 2011Professional Development Lecture 1 2011
Professional Development Lecture 1 2011Philip Bourne
 

Viewers also liked (20)

Where Have We Been & Where Are We Going?
Where Have We Been & Where Are We Going?Where Have We Been & Where Are We Going?
Where Have We Been & Where Are We Going?
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?
 
Mongodb Meetup Group 052012
Mongodb Meetup Group 052012Mongodb Meetup Group 052012
Mongodb Meetup Group 052012
 
Hpm100615
Hpm100615Hpm100615
Hpm100615
 
Highlights from NIH Data Science
Highlights from NIH Data ScienceHighlights from NIH Data Science
Highlights from NIH Data Science
 
scrazzl, our journey in slides
scrazzl, our journey in slidesscrazzl, our journey in slides
scrazzl, our journey in slides
 
From Where Have We Come & Where Are We Going
From Where Have We Come & Where Are We GoingFrom Where Have We Come & Where Are We Going
From Where Have We Come & Where Are We Going
 
Open Science: Some Possible Actions by University Leaders on Behalf of Resear...
Open Science:Some Possible Actions by University Leaders on Behalf of Resear...Open Science:Some Possible Actions by University Leaders on Behalf of Resear...
Open Science: Some Possible Actions by University Leaders on Behalf of Resear...
 
The Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataThe Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big Data
 
Vogal's mathod
Vogal's mathodVogal's mathod
Vogal's mathod
 
Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH Perspective
 
Reaming ppt
 Reaming ppt Reaming ppt
Reaming ppt
 
reaming n tapping
reaming n tappingreaming n tapping
reaming n tapping
 
Operation research complete
Operation research completeOperation research complete
Operation research complete
 
There is No Intelligent Life Down Here
There is No Intelligent Life Down HereThere is No Intelligent Life Down Here
There is No Intelligent Life Down Here
 
Ten Simple Rules for Open Access Publishers
Ten Simple Rules for Open Access PublishersTen Simple Rules for Open Access Publishers
Ten Simple Rules for Open Access Publishers
 
Towards Accelerating Innovation at UCSD
Towards Accelerating Innovation at UCSDTowards Accelerating Innovation at UCSD
Towards Accelerating Innovation at UCSD
 
Bioinformatics in the Bourne Lab.
Bioinformatics in the Bourne Lab.Bioinformatics in the Bourne Lab.
Bioinformatics in the Bourne Lab.
 
Professional Development Lecture 1 2011
Professional Development Lecture 1 2011Professional Development Lecture 1 2011
Professional Development Lecture 1 2011
 
RDAP 033111
RDAP 033111RDAP 033111
RDAP 033111
 

Similar to Jim Gray Award Lecture

UCSD Library Presentation 10182010
UCSD Library Presentation 10182010UCSD Library Presentation 10182010
UCSD Library Presentation 10182010Philip Bourne
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data SciencePhilip Bourne
 
Scholarly Communication for Bioinformatics Students
Scholarly Communication for Bioinformatics StudentsScholarly Communication for Bioinformatics Students
Scholarly Communication for Bioinformatics StudentsPhilip Bourne
 
One Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersOne Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersPhilip Bourne
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeLizLyon
 
The OpenCon Intro to Open Data
The OpenCon Intro to Open DataThe OpenCon Intro to Open Data
The OpenCon Intro to Open DataRoss Mounce
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
Open Data in a Global Ecosystem
Open Data in a Global EcosystemOpen Data in a Global Ecosystem
Open Data in a Global EcosystemPhilip Bourne
 
Open Data Driven Scholarly Communication in 2020
Open Data Driven Scholarly Communication in 2020Open Data Driven Scholarly Communication in 2020
Open Data Driven Scholarly Communication in 2020Philip Bourne
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data SciencePhilip Bourne
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangePhilip Bourne
 
Mind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeMind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeLizLyon
 
Ucsd library10182010
Ucsd library10182010Ucsd library10182010
Ucsd library10182010Philip Bourne
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?LEARN Project
 
Biomedical Research as Part of the Digital Enterprise
Biomedical Research as Part of the Digital EnterpriseBiomedical Research as Part of the Digital Enterprise
Biomedical Research as Part of the Digital EnterprisePhilip Bourne
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...African Open Science Platform
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 

Similar to Jim Gray Award Lecture (20)

UCSD Library Presentation 10182010
UCSD Library Presentation 10182010UCSD Library Presentation 10182010
UCSD Library Presentation 10182010
 
Cartegena051811
Cartegena051811Cartegena051811
Cartegena051811
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
 
Scholarly Communication for Bioinformatics Students
Scholarly Communication for Bioinformatics StudentsScholarly Communication for Bioinformatics Students
Scholarly Communication for Bioinformatics Students
 
Murpha11
Murpha11Murpha11
Murpha11
 
One Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersOne Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific Publishers
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
 
The OpenCon Intro to Open Data
The OpenCon Intro to Open DataThe OpenCon Intro to Open Data
The OpenCon Intro to Open Data
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
Open Data in a Global Ecosystem
Open Data in a Global EcosystemOpen Data in a Global Ecosystem
Open Data in a Global Ecosystem
 
Open Data Driven Scholarly Communication in 2020
Open Data Driven Scholarly Communication in 2020Open Data Driven Scholarly Communication in 2020
Open Data Driven Scholarly Communication in 2020
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 
Mind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeMind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and Practice
 
From byte to mind
From byte to mindFrom byte to mind
From byte to mind
 
Ucsd library10182010
Ucsd library10182010Ucsd library10182010
Ucsd library10182010
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
 
Biomedical Research as Part of the Digital Enterprise
Biomedical Research as Part of the Digital EnterpriseBiomedical Research as Part of the Digital Enterprise
Biomedical Research as Part of the Digital Enterprise
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 

More from Philip Bourne

AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationPhilip Bourne
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingPhilip Bourne
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityPhilip Bourne
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?Philip Bourne
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug DiscoveryPhilip Bourne
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AlonePhilip Bourne
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchPhilip Bourne
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewPhilip Bourne
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptxPhilip Bourne
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Philip Bourne
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision EducationPhilip Bourne
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Philip Bourne
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Philip Bourne
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance SustainabilityPhilip Bourne
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesPhilip Bourne
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in ResearchPhilip Bourne
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?Philip Bourne
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science LandscapePhilip Bourne
 
The UVA School of Data Science
The UVA School of Data ScienceThe UVA School of Data Science
The UVA School of Data SciencePhilip Bourne
 
The Most Important Ten Simple Rules
The Most Important Ten Simple RulesThe Most Important Ten Simple Rules
The Most Important Ten Simple RulesPhilip Bourne
 

More from Philip Bourne (20)

AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug Discovery
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in Research
 
SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
 
The UVA School of Data Science
The UVA School of Data ScienceThe UVA School of Data Science
The UVA School of Data Science
 
The Most Important Ten Simple Rules
The Most Important Ten Simple RulesThe Most Important Ten Simple Rules
The Most Important Ten Simple Rules
 

Recently uploaded

Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 

Recently uploaded (20)

Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 

Jim Gray Award Lecture

  • 1. The Reaming of Life Philip E. Bourne University of California San Diego pbourne@ucsd.edu Jim Gray eScience Award Lecture Oct. 12, 2010
  • 2. Disclaimer I am a domain (life) scientist not a computer or information scientist I am fortunate enough to have a major biological resource (the Protein Data Bank) and a major biological journal (PLoS Computational Biology) as my playground I am part of the long tail I am naïve, but I am the majority Oct. 12, 2010 Jim Gray eScience Award Lecture
  • 3. The Reaming of Life -What on Earth is He Talking About? A reamer is a tool for turning a roughly punched hole into an accurate and smooth one The digital data deluge has punched that rough hole For the life {other?} sciences to optimally advance we need an accurate and smooth conduit through which data can be distilled, analyzed, visualized, distributed and above all else comprehended Oct. 12, 2010 Jim Gray eScience Award Lecture
  • 4. … and we need to accelerate the process by which this is donehere is why …. This is just another way of saying what Jim said and is embodied in the Fourth Paradigm Oct. 12, 2010 Jim Gray eScience Award Lecture
  • 5. The Scientific Process is Too Slow to Respond to a Crisis – Either Global or Personal Oct. 12, 2010 Jim Gray eScience Award Lecture By the time the paper is published we could all be dead http://knol.google.com/k/plos-currents-influenza# Motivation
  • 6. In a time of crisis the need for fast access to accurate data and any knowledge of that data are paramount Structure Summary page activity for H1N1 Influenza related structures Jan. 2008 Jan. 2009 Jan. 2010 Jul. 2009 Jul. 2008 Jul. 2010 3B7E: Neuraminidase of A/Brevig Mission/1/1918 H1N1 strain in complex with zanamivir 1RUZ: 1918 H1 Hemagglutinin * http://www.cdc.gov/h1n1flu/estimates/April_March_13.htm Motivation
  • 7. If that is not enough…For some people the scientific process may be too slow to save their life Oct. 12, 2010 Jim Gray eScience Award Lecture Motivation
  • 8. Josh Sommer – A Remarkable Young ManCo-founder & Executive Director the Chordoma Foundation Oct. 12, 2010 Jim Gray eScience Award Lecture http://sagecongress.org/Presentations/Sommer.pdf Motivation
  • 9. Chordoma A rare form of brain cancer No known drugs Treatment – surgical resection followed by intense radiation therapy Oct. 12, 2010 Jim Gray eScience Award Lecture http://upload.wikimedia.org/wikipedia/commons/2/2b/Chordoma.JPG Motivation
  • 10. Oct. 12, 2010 Jim Gray eScience Award Lecture http://sagecongress.org/Presentations/Sommer.pdf Motivation
  • 11. Oct. 12, 2010 Jim Gray eScience Award Lecture http://sagecongress.org/Presentations/Sommer.pdf Motivation
  • 12. Oct. 12, 2010 Jim Gray eScience Award Lecture http://sagecongress.org/Presentations/Sommer.pdf Motivation
  • 13. Oct. 12, 2010 Jim Gray eScience Award Lecture If I have seen further it is only by standing on the shoulders of giants Isaac Isaac Newton From Josh’s point of view the climb up just takes too long > 15 years and > $850M to be more precise Adapted: http://sagecongress.org/Presentations/Sommer.pdf Motivation
  • 14. Oct. 12, 2010 Jim Gray eScience Award Lecture http://sagecongress.org/Presentations/Sommer.pdf Motivation
  • 15. Oct. 12, 2010 Jim Gray eScience Award Lecture http://sagecongress.org/Presentations/Sommer.pdf Motivation
  • 16. Oct. 12, 2010 Jim Gray eScience Award Lecture http://fora.tv/2010/04/23/Sage_Commons_Josh_Sommer_Chordoma_Foundation Motivation
  • 17. Now we are all hopefully motivated let us break this down to what actually needs to be done in my opinion Here are a few big things … Oct. 12, 2010 Jim Gray eScience Award Lecture
  • 18. A Few Things to Accelerate the Rate of Scientific Discovery Better communication, data and knowledge access, and new modes of discovery, which means: We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives We need to be more open with both We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery Reward systems need to change We need scientist management tools We need to be less fixated on the big data problems We need to unleash the full power of the Internet Oct. 12, 2010 Jim Gray eScience Award Lecture Hard Easy
  • 19. We Need Data and Knowledge About That Data to Interoperate The Knowledge and Data Cycle 0. Full text of PLoS papers stored in a database 4. The composite view has links to pertinent blocks of literature text and back to the PDB User clicks on content Metadata and webservices to data provide an interactiveview that can be annotated Selecting features provides a data/knowledge mashup Analysis leads to new content I can share 4. 1. 3. A composite view of journal and database content results 1. A link brings up figures from the paper 3. 2. 2. Clicking the paper figure retrieves data from the PDB which is analyzed PLoS Comp. Biol. 2005 1(3) e34
  • 20. We Need Data and Knowledge About That Data to Interoperate – What is Stopping US? Governance – publishers vs. database providers Reward Metadata standards for provenance, privacy etc. Exemplars …. Oct. 12, 2010 Jim Gray eScience Award Lecture Caveat: Each discipline is different – I speak very much from a biomedical sciences perspective
  • 21. Certainly the Argument for Interoperability in the Biomedical Sciences is Strong 1078 databases reported in NAR 2008 MetaBase http://biodatabase.org reports 2,651 entries edited 12,587 times PubMed contains 18,792,257 entries ~100,000 papers indexed per month In Feb 2009: 67,406,898 interactive searches were done 92,216,786 entries were viewed Data as of April 14, 2009 We need data and knowledge about that data to interoperate PLoS Comp. Biol. 2005 1(3) e34
  • 22. A Small Example - The World Wide Protein Data Bank The single worldwide repository for data on the structure of biological macromolecules Vital for drug discovery and the life sciences 39 years old Free to all http://www.wwpdb.org We need data and knowledge about that data to interoperate PLoS Comp. Biol. 2005 1(3) e34
  • 23. The World Wide Protein Data Bank – The Best Case Scenario Paper not published unless data are deposited – strong data to literature correspondence Highly structured data conforming to an extensive ontology DOI’s assigned to every structure http://www.wwpdb.org We need data and knowledge about that data to interoperate PLoS Comp. Biol. 2005 1(3) e34
  • 24. Example Interoperability: The Database View www.rcsb.org/pdb/explore/literature.do?structureId=1TIM We need data and knowledge about that data to interoperate BMC Bioinformatics 2010 11:220
  • 25. Example Interoperability: The Literature Viewhttp://biolit.ucsd.edu Nucleic Acids Research 2008 36(S2) W385-389 We need data and knowledge about that data to interoperate
  • 26. ICTP Trieste, December 10, 2007 We need data and knowledge about that data to interoperate
  • 27. Semantic Tagging & Widgets are a Powerful Tool to Integrate Data and Knowledge of that Data, But as Yet Not Used Much Oct. 12, 2010 Jim Gray eScience Award Lecture Will Widgets and Semantic Tagging Change Computational Biology? PLoS Comp. Biol. 6(2) e1000673 We need data and knowledge about that data to interoperate
  • 28. Semantic Tagging of Database Content in The Literature or Elsewhere http://www.rcsb.org/pdb/static.do?p=widgets/widgetShowcase.jsp PLoS Comp. Biol. 6(2) e1000673 Semantic Tagging
  • 29. We need data and knowledge about that data to interoperate
  • 30. The Publishers are Starting to Do It Oct. 12, 2010 Jim Gray eScience Award Lecture From Anita de Waard, Elsevier
  • 31. This is Literature Post-processingBetter to Get the Authors Involved Authors are the absolute experts on the content More effective distribution of labor Add metadata before the article enters the publishing process We need data and knowledge about that data to interoperate
  • 32. Word 2007 Add-in for authors Allows authors to add metadata as they write, before they submit the manuscript Authors are assisted by automated term recognition OBO ontologies Database IDs Metadata are embedded directly into the manuscript document via XML tags, OOXML format Open Machine-readable Open source, Microsoft Public License http://www.codeplex.com/ucsdbiolit We need data and knowledge about that data to interoperate
  • 33. Challenges Authors Carrot IF one or more publishers fast tracked a paper that had semantic markup it might catch on Publishers Carrot Competitive advantage We need data and knowledge about that data to interoperate
  • 34. The Promise – A Hypothetical Example Cardiac Disease Literature Immunology Literature Shared Function We need data and knowledge about that data to interoperate
  • 35. A Few Things to Accelerate the Rate of Scientific Discovery Better communication, data and knowledge access, and new modes of discovery, which means: We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives We need to be more open with both We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery Reward systems need to change We need scientist management tools We need to be less fixated on the big data problems We need to unleash the full power of the Internet Oct. 12, 2010 Jim Gray eScience Award Lecture Hard Easy
  • 36. One Small Example – The Molecular Biology Toolkit (MBT) jMol, VMD … are de facto standard important tools for rendering biological molecules .. but They are not versatile ie do not for example: Respond to the data they are reading Offer views that match the users interests Allow the user to annotate the data Allow those annotations to be shared (published?) Oct. 12, 2010 Jim Gray eScience Award Lecture Think More About the Tools
  • 37. MBT Featureshttp://mbt.sdsc.edu Offer a framework not an end user application Responds to the data type Support read write access Encourages others to write end user applications Discourages feature creep Oct. 12, 2010 Jim Gray eScience Award Lecture Immunologists Immunome Research, 2007 3(1):3 Medicinal Chemists BMC Bioinformatics 2005, 6:21. Think More About the Tools
  • 38. A Few Things to Accelerate the Rate of Scientific Discovery Better communication, data and knowledge access, and new modes of discovery, which means: We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives We need to be more open with both We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery Reward systems need to change We need scientist management tools We need to be less fixated on the big data problems We need to unleash the full power of the Internet Oct. 12, 2010 Jim Gray eScience Award Lecture Hard Easy
  • 39. Reward Systems Need to ChangeWhat is Needed? Author disambiguation Auditing (identification and metrics) of all scholarship - means new tools Seniors need to promote alternative forms of scholarship Juniors need to respond Oct. 12, 2010 Jim Gray eScience Award Lecture Ten Simple Rules for Getting Promoted as a Computational Biologist in Academia PLoS Comp Biol to appear Reward Systems Need to Change
  • 40. Example Tools Oct. 12, 2010 Jim Gray eScience Award Lecture http://www.researcherid.com/ http://pubnet.gersteinlab.org/ http://www.biomedexperts.com
  • 41. What Are these Alternative Forms of Scholarship? Reviews Curation Research [Grants] Journal Article Poster Session Conference Paper Blogs Community Service/Data Reward Systems Need to Change
  • 42. Reward Systems Need to Change
  • 43. A Unique Identifier is Going to Happen It is DOIs for people Some scientists will resist The winner is ORCHID? Reward Systems Need to Change
  • 44. Ideally the ID will be Tagged to Every Piece of Scholarly Communication I an Not a Scientist I am a Number PLoS Comp. Biol. 2008 4(12) e1000247 Reward Systems Need to Change
  • 45. A Few Things to Accelerate the Rate of Scientific Discovery Better communication, data and knowledge access, and new modes of discovery, which means: We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives We need to be more open with both We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery Reward systems need to change We need scientist management tools We need to be less fixated on the big data problems We need to unleash the full power of the Internet Oct. 12, 2010 Jim Gray eScience Award Lecture Hard Easy
  • 46. The Truth About My Laboratory I have ?? mail folders! The intellectual memory of my laboratory is in those folders This is an unhealthy hub and spoke mentality We Need Scientist Management Tools
  • 47. The Truth About My Laboratory I generate way more negative that positive data, but where is it? Content management is a mess Slides, posters….. Data, lab notebooks …. Collaborations, Journal clubs … Software is open but where is it? Farewell is for the data too http://artbyvida.com/portfolio.php Computational Biology Resources Lack Persistence and Usability. PLoS Comp. Biol. 2008 4(7): e1000136 We Need Scientist Management Tools
  • 48. Many Great Tools Out There Oct. 12, 2010 Jim Gray eScience Award Lecture Taverna We Need Scientist Management Tools
  • 49. Where I See the Problems The long tail is confused Lack of interoperability between the options The reward (publishing) is still removed from the available tools Oct. 12, 2010 Jim Gray eScience Award Lecture We Need Scientist Management Tools
  • 50. A Few Things to Accelerate the Rate of Scientific Discovery Better communication, data and knowledge access, and new modes of discovery, which means: We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives We need to be more open with both We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery Reward systems need to change We need scientist management tools We need to be less fixated on the big data problems We need to unleash the full power of the Internet Oct. 12, 2010 Jim Gray eScience Award Lecture Hard Easy
  • 51. Yes YouTube Can Increase the Rate of Discovery Unleash the full power of the Internet
  • 52. The Lab ExperimentPaper+Rich Media My students enjoyed the experience The shyest student was actually the most bold in front of the camera “We will become a generation of “science castors” They liked the exposure for the most part – rather than the PI it puts them out in front Unleash the full power of the Internet
  • 53. Organic Growth 3 Years Later www.scivee.tv Some of their work viewed 20,000+ times Global audience of researchers, educators and academic/research institutions 60,000 unique visitors & 2M pageviews/month 16,000 registered users & 600 communities 5,000 uploads of video content (about journal articles, conferences, research news and classes) Growing 4-5% monthly Sustainability - evolving a business model supporting journals and conferences Unleash the full power of the Internet
  • 54. What Emerged: SciveeCasts Products ApplicationProductPrimary Customers Journals PubCastJournals, publishers, societies Meetings PosterCast Societies, conference orgs. SlideCast Comm. PaperCast Societies, journals Podcast SlideCast Education PosterCast Societies, universities SlideCast BooksBookCast Publishers, book sellers Unleash the full power of the Internet
  • 55. Summarizing the Reaming of Life By “Life” I mean experiences in the Life Sciences By “Reaming” I mean the the making of something smooth, fast and accurate The Monty Python parody is on conversation cards for getting a dialog going .. The rest is just a few examples of the small ways we are trying to address big problems in the hope they will inspire us all to think more deeply about the problem Oct. 12, 2010 Jim Gray eScience Award Lecture
  • 56. Acknowledgements BioLit Team Lynn Fink Parker Williams Marco Martinez RahulChandran Greg Quinn MBT John Moreland John Beaver Microsoft Scholarly Communications Pablo Fernicola Lee Dirks SavasParastitidas Alex Wade Tony Hey wwPDB team Andreas Prilc DimitrisDimitropoulos SciVee Team Apryl Bailey Leo Chalupa Lynn Fink Marc Friedman (CEO) Ken Liu Alex Ramos Willy Suwanto Ben Yukich http://www.scivee.tv http://biolit.ucsd.edu http//www.pdb.org http://www.codeplex.com/ucsdbiolit