Issues and Challenges in Public Licensing of Biodiversity Data and Publications
1. Issues and Challenges in Public Licensing of Biodiversity Data and Publications. K-T Shao (邵廣昭), Research Center for Biodiversity, Academia Sinica; C-C Lin (林朝欽), Taiwan Forest Research Institute, Council of Agriculture. 2009-03-27, Institute of Information Science, Academia Sinica (25 mins).
5. Global Biodiversity Information Facility (GBIF), http://www.gbif.org/. GBIF is an international organisation working to make the world's biodiversity data accessible via the Internet: sharing primary scientific biodiversity data to benefit society, science and a sustainable future. GBIF has 82 members (countries and international organisations), 282 data providers and 170 million (170 M) records.
6. All GBIF data by one-degree cell (darker colours represent more records): 170 million (170 M) georeferenced records of museum specimens and their distributions, mapped to a 1 x 1 degree grid.
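The one-degree binning behind such a map can be sketched in a few lines. This is only an illustration under stated assumptions, not GBIF's actual procedure: the function names and the sample coordinates are hypothetical, and each cell is identified by the integer latitude and longitude of its south-west corner.

```python
import math
from collections import Counter


def one_degree_cell(lat, lon):
    """Return the south-west corner of the 1 x 1 degree cell containing a point.

    math.floor (rather than int) keeps negative coordinates in the right cell,
    e.g. latitude -0.5 belongs to the cell starting at -1.
    """
    return (math.floor(lat), math.floor(lon))


def count_by_cell(records):
    """Count how many (lat, lon) records fall into each one-degree cell."""
    return Counter(one_degree_cell(lat, lon) for lat, lon in records)


# Hypothetical sample coordinates in decimal degrees (not real GBIF records).
records = [(25.04, 121.61), (25.90, 121.02), (23.50, 120.90), (-0.5, -0.5)]
counts = count_by_cell(records)

for cell, n in sorted(counts.items()):
    print(cell, n)
```

A map like the one on this slide would then shade each cell by its count. Points lying exactly on latitude 90 or longitude 180 would need special handling, which this sketch leaves out.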
7. GBIF’s National Nodes or Portals. 82 members: 47 countries and 35 organizations, plus the CBD Secretariat. Each country has its own node.
8. Integrate the Taiwan Biodiversity National Information Network with global databases (TaiBIF/TELDAP, GBIF). Global: GBIF (Global Biodiversity Information Facility), Species 2000, ITIS, BIOS, BioNET-International ……; global organism databases, e.g. FishBase. Regional (Asia, Asian Oceania or Pacific): Species 2000 AO, ASEANET ……, PACINET, PBIF. Taiwan: TaiBIF/TELDAP (national node of GBIF), with TaiBNET (species checklist and experts name list), NDAP (specimen database), TBRD (distribution database), local organism databases (e.g. Fish Database of Taiwan), and local institutions, NGOs, heritages, projects, publications, etc.
9. GBIF Recommendation on Open Access to Biodiversity Data (adopted by the GBIF Governing Board on Jan. 1 2006): promote that species- and specimen-level data and associated metadata generated in funded projects are made publicly available through mechanisms cooperating with GBIF, within a specified period after completion of the supported research. In short, species- and specimen-level data obtained from publicly funded surveys should be opened within a fixed period after the research is completed.
10. The EU has recently been calling on scientists to sign a petition for guaranteed public access to publicly funded research results (http://www.ec-petition.eu/). "Our mission of disseminating knowledge is only half complete if the information is not made widely and readily available to society." (Berlin Declaration, October 2003.) 27,629 signatories since January 17th, 2007.
12. Biodiversity data vs. molecular data or environmental data. 2. Genetic or DNA data and ordinary environmental data can be obtained with laboratory instruments or with automatic, continuous monitoring equipment in the field, and can even be analysed by software. Biodiversity data therefore calls for more serious respect for the intellectual property rights (copyright) of the data providers.
14. 3. Photos have more commercial value than text. With respect to government publications, when a commissioning agency signed a contract with an author in the past, the contract merely specified that the original author holds the copyright and the commissioning agency holds the right of use. It did not spell out a right for the agency to authorize third-party use, nor did it adopt a Creative Commons (CC) license. As a result, most government publications, even those that exist as electronic files, cannot be uploaded to the web for public use, which is very regrettable. Government agencies should therefore in future ask authors to sign the CC Attribution License so that publications can be made publicly available online.
15. The first official “Catalog of Life of Taiwan”, a native species checklist, will be published with a CD-ROM in Dec. 2008 (ROC year 97, month 12), covering 50,000+ native species in total. Publication of the Taiwan species checklist and species diversity research.
21. Digital content produced is required to be uploaded to the Union Catalog. The Union Catalog planned to use the Creative Commons (CC) license model. Using CC in the Union Catalog requires the content producers’ agreement to CC. TELDAP then started an IPR inventory and found that not all properties are suitable for CC, so the holdings had to be inventoried again. The Union Catalog now uses a page describing its own copyright notices instead. The Culture Portal naturally wanted to derive content from the Union Catalog, but content producers license only to the Union Catalog, so the data cannot be used by the international portal of the same project. The Culture Portal therefore requests licenses on a per-project basis, which means signing new license agreements. The Culture Portal is now composing its copyright notices. Although the agreements have been signed, the data provided is very limited and of poor quality. Data cannot be used across different websites of the same project.
29. 3. In addition to the number of SCI journal papers and their impact factors, the number of digitized records (archived specimens, ecological distribution data, DNA sequences) that are archived and opened on the Internet, i.e. a “Repository Impact Factor”, should also be counted as an indicator in the merit system and in research performance evaluation.
30. 4. A mechanism for collecting data and opening access to it under the existing IPR regulations should be worked out promptly, in order to remove obstacles to data integration and to build mutual trust. Until the data provider retires, the organization in charge of data integration should have only custody, not ownership, of the data. The data and genetic materials must not be provided to any third party without the consent or authorization of the original providers.
34. Common Use Licensing: Status of Biodiversity Scientific Data (subdomain: digital status; data status).
Molecular Sequence & Gene/Genome Data: 95% digital; persistent digital, universally accessible data stores.
Species- & Specimen Data: <5% digital; persistent physical data stores, accessible with difficulty.
Ecological & Ecosystem Data: 80% (?) digital; persistent (?) digital and physical data stores, moderately accessible.
35. Problem of Common Use: Data Entropy (Michener et al. 1997). [Figure: information content plotted against time, starting at the time of publication; labels along the curve: specific details, general details, accident, retirement or career change, death.]
36. Problem of Common Use: Lack of Tools. [Diagram: from raw data to information/knowledge, through (1) management, archiving and curation; (2) discovery, retrieval and integration; (3) analysis, synthesis and forecasting.]
38. Catalog of Life in Taiwan (臺灣物種名錄), TaiBNET, http://TaiBNET.sinica.edu.tw