The document discusses the perspectives of the author as a data producer, overseer of data curation efforts, database provider, and data user. The author argues that open data is important for enabling new discoveries. However, data repositories currently make accessing and using data difficult. The author's vision for 2020 is for biological questions to be answered by operating on data in a simpler, more productive and reproducible manner. This will require improvements like a data registry and an "App+" store model.
1. Open Data Driving Scholarly Communications in 2020 Philip E. Bourne UCSD [email_address] 7th Int. Data Curation Conference Bristol UK Dec. 7, 2011
2.
3. This Lecture Will Try to Present All Aspects of this Perspective
4. But First: Why Open Data Are Important – The Story of Meredith
5. Meredith got data the old-fashioned way: she did not discover it in a broad and deep search; she read the papers and bugged the authors. Imagine what she could do if data were instantly discoverable, their value quantified in some way, and more simply used.
6.
7. Some Thoughts in Supporting Curation http://collections.plos.org/ploscompbiol/biocurators.php They really should do more to promote themselves.
8.
9.
10.
11. Some Happy Thoughts as a Database Provider [Chart: number of released entries by year] We manage to handle increased volume and complexity at a lesser cost. Usage increases and the community broadens. Increasingly these define future funding; could it be the H-factor mistake for data? Database Provision
12.
13.
14.
15.
16. Semantic Tagging & Widgets are a Powerful Tool to Integrate Data and Knowledge of that Data, But as Yet Not Used Much. "Will Widgets and Semantic Tagging Change Computational Biology?" PLoS Comp. Biol. 6(2) e1000673 Database Provision
17.
18. Example of Interoperability: The Database View www.rcsb.org/pdb/explore/literature.do?structureId=1TIM BMC Bioinformatics 2010 11:220 Database Provision