SlideShare uma empresa Scribd logo
1 de 53
Biodiversity Data Provider Software Hands-on exercises with TAPIR Stockholm Biodiversity Informatics Symposium 2008 (SBIS2008) Dag Terje Filip Endresen, Nordic Genetic Resources Center (Sweden) / Bioversity International (Italy)
Fallacies of Distributed Computing The network is reliable. Latency is zero. Bandwidth is infinite. The network is secure. Topology doesn't change. There is one administrator. Transport cost is zero. The network is homogeneous. This list of fallacies came about at Sun Microsystems around 1994. 2
TAPIR 3 Cartoon by Sasha Kopf (Creative Commons)
TAPIR TAPIR - TDWG Access Protocol for Information Retrieval. During the 2004 TDWG meeting in Christchurch, NZ, work started on a unified protocol and named TAPIR. TAPIR is based on the protocol from the two data provider software, BioCASE and DiGIR.  4
Provider software, wrappers DiGIR (2002, not active) http://digir.sourceforge.net BioCASE (2003, PyWrapper) http://www.biocase.org PyWrapper3 (2006, not active) http://trac.pywrapper.org/ TapirLink (2007) http://wiki.tdwg.org/twiki/bin/view/TAPIR/TapirLink GBIF Provider Toolkit (2009) http://code.google.com/p/gbif-providertoolkit 5
BioCASE 2.5.ORC 6 The BioCASE provider software is a product of the EU funded BioCASE project (2001-2004).  Developed at BGBM in Berlin.  Last updated in April 2008, with support for Python version 2.5 and less required external Implement the BioCASE provider to share data as ABCD 2.06. http://www.biocase.org
1. Make sure you have Python 2.5 installed  (command line: python –v) 2.Download the latest provider software from http://www.biocase.org 3. Uncompress the BioCASE provider software to a folder on your system [provider_software_2.5.0RC.tar.gz]     (tar –xzvf provider_...tar.gz) 4. Run setup.py, (python setup.py) 5. Configure your web server to mount biocase/www as http://localhost/biocase/ Hint: You will find an example for httpd.conf as the last terminal output from running setup.py 7 BioCASE 2.5.ORC
BioCASE 2.5.ORC 6. Visit the library test page:  http://localhost/biocase/utilities/testlibs.cgi 6a. Download latest 4 Suite  from http://4suite.org/ Uncompress and install  [4Suite-XML-1.0.2.tar.bz2] 6b. Install additional python  libraries, including  the desired  database driver. For each  python package:  (python setup.py install) 6c. Graphviz is useful to  visualize the database table structure. 8
BioCASE 2.5.ORC 7. Configuration ,[object Object]
 Database connection
 Database table structure
 Mapping of data model to standard schema9
BioCASE 2.5.ORC 8. Query Form The manual query form is illustrative for understanding exactly how the wrapper software works!   http://localhost/biocase/utilities/queryforms/qf_manual.cgi?dsa=sesto 10
11
PyWrapper3 Home:http://trac.pywrapper.org/ Primary developers: Markus Döring, Javier de la Torre 14/07/2008 - Development stalled We are sorry to inform you that development of the TAPIR branch of PyWrapper has been stalled. The latest 3.1 alpha version is not stable and not recommended for production!  (Message from the home page) ,[object Object]
PyWrapper 3.1.0 alpha (development version, works with Python 2.5)PyWrapper is tested and verified to work fine with Windows, Mac OS X and Linux. 12
Required configuration Web server: Any CGI compliant web server: Apache, IIS etc. (The built in CherryPy web server can also be used). Database: Major databases are supported, including MySQL, Oracle, SQLServer, Sybase, Access, PostgreSQL. Theoretically any database with a Python library should work.  Python PyWrapper is developed with the Python programming language. (The latest version from the SVN code repository works with Python version 2.5) 13 Apache, MySQL and Python are open source software, free to use - even for commercial products.
Installation http://trac.pywrapper.org/pywrapper/wiki/InstallationGuide 1. Download the latest PyWrapper3 installer. 	Use SVN export or checkout for Python 2.5 support 2. Uncompress to a folder of your choice. 	Example: “/usr/local/pywrapper3/” 	Example: “C:ywrapper Local installation: If you have a Subversion client installed, you may use the automatic installer. (Local Python and libraries are installed to your pywrapper folder) promt$  svn export svn://svn.pywrapper.org:80/pywrapper/trunk pywrapper promt$  cd pywrapper/tools promt$  /bin/shinstall.sh This will require that you have a bash shell, and probably that you have a Unix line system like e.g. FreeBSD, Linux or Mac OS X… 3. Execute: pywrapper/setup.py 	Example:  promt$  python setup.py (Mac OS X, Linux) 	On Windows locate setup.py and double-click 14
Start standalone server Execute start_server.py(default port is 8080) promt$ cdwebapp/  promt$ ./start_server.py 8088(example to start on port 8088) On a Windows system you may do this in a MS-DOS window (or double-click the file - if you accept the default port). Some messages will pass across your screen. Please be patient, this could take a minute. Wait for the message “start server …” and find find PyWrapper at: http://localhost:8088/pywrapper 15
Configuration After successful installation, you will need to configure your data provider. Follow the instructions from the PyWrapper documentation web page to configure. Data sources. If you provide more datasets or several databases they will be configured as individual data sources (dsa). Database connection. For PyWrapper to access your database. Database structure. Define the relevant database tables, the primary keys and foreign keys. Data model. Map your database model to the standard represented by the XML Schemas you choose. http://trac.pywrapper.org/pywrapper/wiki/Documentation 16
Screen examples PyWrapper comes with a graphical web based configuration tool For more information and more screen dumps from the configuration of PyWrapper, see: http://circa.gbif.net/Public/irc/gbif/nodes/library?l=/meetings/2007_10_amsterdam/tapir_pywrapper3/_EN_1.0_&a=i 17
TapirLink 0.6.1 18
TapirLink 0.6.1 Home: http://wiki.tdwg.org/twiki/bin/view/TAPIR/TapirLink Primary developers: Renato De Giovanni, Dave Vieglais Download: http://sourceforge.net/project/showfiles.php?group_id=38190 Uncompress PHP source code Eg: /usr/local/tapir/tapirlink Mount admin and www directory for your web server. Example: Apache “httpd.conf” Alias /tapirlink   "/usr/local/tapir/tapirlink/www” Alias /tapirlink-admin   "/usr/local/tapir/tapirlink/admin"  <Location /tapirlink>  	Order allow,deny  	Allow from all </Location>  <Location /tapirlink-admin>  	Order allow,deny  	Allow from all </Location>  Read permissions on all directories Write on cache, config, log, statistics 19
TapirLink 0.6.1 Start by adding a new resource http://localhost/tapirlink-admin/ Step 1: Describe your new resource 20
TapirLink 0.6.1 Step 2: Data source, database connection Step 3: Table structure 21
TapirLink 0.6.1 Step 4: Filter Step 5: Select mapping standards to use 22
TapirLink 0.6.1 Step 5b: Mapping data schema (ABCD 2.06) etc… 23
TapirLink 0.6.1 Step 5c: Mapping data concepts (Darwin Core) etc… Step 5d: Remember that DwC have an extension for geospatial descriptors etc… 24
TapirLink 0.6.1 Step 6: Settings New resource successfully configured 25
TapirLink 0.6.1 Test resource with client form: http://localhost/tapirlink/tapir_client.php The XML Client form is very illustrative for understanding exactly how the wrapper software works!  26
Service interface 27
Example of a service request 	All exchanged data is formatted with XML tags. 28
Example of a service response 29 ...
Example TAPIR service request 30
Example TAPIR service response singer:/sourcename singer:/taxonomy/genus singer:/taxonomy/species singer:/taxonomy/subspecies singer:/holding/ID singer:/holding/name singer:/origin/collecting/countrysource singer:/origin/collecting/countrysourceID singer:/status/biologicalstatus singer:/status/biologicalstatusID ... 31
Example TAPIR service SEARCH request 32
Example TAPIR service Search response 33
Example of OAI-PMH service request OAI-PMH requests are expressed as HTTP requests. OAI-PMH requests must be submitted using either the HTTP GET or POST methods. http://an.oa.org/OAI-script?verb=GetRecord &identifier=oai:arXiv.org:hep-th/9901001 &metadataPrefix=oai_dc 34
Example of OAI-PMH service RESPONSE OAI-PMH responses are formatted as HTTP responses. With The Content-Type as text/xml. 35
OAI-PMH PROTOCOL, metadata formats 36 Request types (verb): Identify ListMetadataFormats ListSets GetRecord ListIdentifiers ListRecords For purposes of interoperability, the metadataPrefix `oai_dc’ is reserved for Dublin Core. Communities adopt own metadataPrefixesfor own metadata fomats. Relevant formats/schemas for Biodiversity Informatics are Darwin Core and ABCD.
Why sharing data? 37
[http://data.gbif.org/] 38 [http://data.gbif.org/datasets/resource/1487]
GBIF PGR Network 2 [http://data.gbif.org/datasets/network/2] 39
40 Distributed network The image is from the BioCASE web site
41 Decentralized network GBIF (Global Biodiversity Information Facility) ALIS (Accession Level  Information System) USER Svalbard Global Seed Vault (Safe Backup) (USDA ARS National  Germplasm  Repositories...) Web  Services USDA GRIN (USA) SINGER (CGIAR) (CGIAR International Future Harvest  gene banks...) EURISCO (Europe) MCPD IHAR (Poland) WUR CGN (Netherlands) NordGen (Northern Europe) IPK Gatersleben (Germany) (Other European  gene banks...)
42 Crop Wild Relatives LKA ARM BOL National Datasets are shared with  the central  CWR data index. The national  datasets as well as  access to other International  datasets are provided from the CWR data portal. MDG EURISCO UZB http://www.cropwildrelatives.org SINGER
Data portal example 43
44 http://wwwdev.ngb.se/portal/index.php?scope=demo
45
46
47
48
49 Outlook The compatibility of data standards between PGR and biodiversity collections made it possible to integrate the worldwide germplasm collections into the biodiversity community. Using GBIF technology (and contributing to its development), the PGR community can easily establish specific PGR networks without duplicating GBIF's work. Use of GBIF technology and integration of PGR collection data into GBIF allows PGR users to simultaneously search PGR collections and other biodiversity collections, and to get access to the data (and possibly the material) of relevant biodiversity collections. The establishment of new data portals on a specific crop, a regional thematic network or similar subset of the total global biodiversity datasets; can be done with rather few efforts! This requires only that all the relevant datasets are provided by GBIF compatible web services (like the BioCASE PyWrapper).

Mais conteúdo relacionado

Mais procurados

Learning Apache HIVE - Data Warehouse and Query Language for Hadoop
Learning Apache HIVE - Data Warehouse and Query Language for HadoopLearning Apache HIVE - Data Warehouse and Query Language for Hadoop
Learning Apache HIVE - Data Warehouse and Query Language for HadoopSomeshwar Kale
 
Future of HCatalog - Hadoop Summit 2012
Future of HCatalog - Hadoop Summit 2012Future of HCatalog - Hadoop Summit 2012
Future of HCatalog - Hadoop Summit 2012Hortonworks
 

Mais procurados (20)

NEON HDF5
NEON HDF5NEON HDF5
NEON HDF5
 
ICESat-2 Metadata and Status
ICESat-2 Metadata and StatusICESat-2 Metadata and Status
ICESat-2 Metadata and Status
 
Utilizing HDF4 File Content Maps for the Cloud Computing
Utilizing HDF4 File Content Maps for the Cloud ComputingUtilizing HDF4 File Content Maps for the Cloud Computing
Utilizing HDF4 File Content Maps for the Cloud Computing
 
Efficiently serving HDF5 via OPeNDAP
Efficiently serving HDF5 via OPeNDAPEfficiently serving HDF5 via OPeNDAP
Efficiently serving HDF5 via OPeNDAP
 
NetCDF and HDF5
NetCDF and HDF5NetCDF and HDF5
NetCDF and HDF5
 
May 2013 HUG: HCatalog/Hive Data Out
May 2013 HUG: HCatalog/Hive Data OutMay 2013 HUG: HCatalog/Hive Data Out
May 2013 HUG: HCatalog/Hive Data Out
 
HDF Update for DAAC Managers (2017-02-27)
HDF Update for DAAC Managers (2017-02-27)HDF Update for DAAC Managers (2017-02-27)
HDF Update for DAAC Managers (2017-02-27)
 
HDF Update 2016
HDF Update 2016HDF Update 2016
HDF Update 2016
 
Learning Apache HIVE - Data Warehouse and Query Language for Hadoop
Learning Apache HIVE - Data Warehouse and Query Language for HadoopLearning Apache HIVE - Data Warehouse and Query Language for Hadoop
Learning Apache HIVE - Data Warehouse and Query Language for Hadoop
 
Product Designer Hub - Taking HPD to the Web
Product Designer Hub - Taking HPD to the WebProduct Designer Hub - Taking HPD to the Web
Product Designer Hub - Taking HPD to the Web
 
Hive hcatalog
Hive hcatalogHive hcatalog
Hive hcatalog
 
NASA HDF/HDF-EOS Data for Dummies (and Developers)
NASA HDF/HDF-EOS Data for Dummies (and Developers)NASA HDF/HDF-EOS Data for Dummies (and Developers)
NASA HDF/HDF-EOS Data for Dummies (and Developers)
 
Matlab, Big Data, and HDF Server
Matlab, Big Data, and HDF ServerMatlab, Big Data, and HDF Server
Matlab, Big Data, and HDF Server
 
HDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFView
HDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFViewHDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFView
HDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFView
 
Moving form HDF4 to HDF5/netCDF-4
Moving form HDF4 to HDF5/netCDF-4Moving form HDF4 to HDF5/netCDF-4
Moving form HDF4 to HDF5/netCDF-4
 
Implementing HDF5 in MATLAB
Implementing HDF5 in MATLABImplementing HDF5 in MATLAB
Implementing HDF5 in MATLAB
 
Data Analytics using MATLAB and HDF5
Data Analytics using MATLAB and HDF5Data Analytics using MATLAB and HDF5
Data Analytics using MATLAB and HDF5
 
HDF Tools Tutorial
HDF Tools TutorialHDF Tools Tutorial
HDF Tools Tutorial
 
Future of HCatalog - Hadoop Summit 2012
Future of HCatalog - Hadoop Summit 2012Future of HCatalog - Hadoop Summit 2012
Future of HCatalog - Hadoop Summit 2012
 
Hdf5 intro
Hdf5 introHdf5 intro
Hdf5 intro
 

Semelhante a Data exchange alternatives, SBIS conference in Stockholm (2008)

Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Dag Endresen
 
TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)
TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)
TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)Dag Endresen
 
Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...
Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...
Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...Thomas K. Y. Lam
 
User-space Network Processing
User-space Network ProcessingUser-space Network Processing
User-space Network ProcessingRyousei Takano
 
Datastage parallell jobs vs datastage server jobs
Datastage parallell jobs vs datastage server jobsDatastage parallell jobs vs datastage server jobs
Datastage parallell jobs vs datastage server jobsshanker_uma
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataRobert Grossman
 
Dynamic Hadoop Clusters
Dynamic Hadoop ClustersDynamic Hadoop Clusters
Dynamic Hadoop ClustersSteve Loughran
 
PCIC Data Portal 2.0
PCIC Data Portal 2.0PCIC Data Portal 2.0
PCIC Data Portal 2.0James Hiebert
 
Deploying Apache Spark and testing big data applications on servers powered b...
Deploying Apache Spark and testing big data applications on servers powered b...Deploying Apache Spark and testing big data applications on servers powered b...
Deploying Apache Spark and testing big data applications on servers powered b...Principled Technologies
 
Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin  Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin Kuberton
 
Hadoop and big data training
Hadoop and big data trainingHadoop and big data training
Hadoop and big data trainingagiamas
 
Introduction to Stacki at Atlanta Meetup February 2016
Introduction to Stacki at Atlanta Meetup February 2016Introduction to Stacki at Atlanta Meetup February 2016
Introduction to Stacki at Atlanta Meetup February 2016StackIQ
 
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014NoSQLmatters
 
Provisioning Servers Made Easy
Provisioning Servers Made EasyProvisioning Servers Made Easy
Provisioning Servers Made EasyAll Things Open
 
Programmability in spss 14
Programmability in spss 14Programmability in spss 14
Programmability in spss 14Armand Ruis
 
Survey Paper on Big Data and Hadoop
Survey Paper on Big Data and HadoopSurvey Paper on Big Data and Hadoop
Survey Paper on Big Data and HadoopIRJET Journal
 
Extending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with KubernetesExtending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with KubernetesNicola Ferraro
 
Kubernetes for the PHP developer
Kubernetes for the PHP developerKubernetes for the PHP developer
Kubernetes for the PHP developerPaul Czarkowski
 

Semelhante a Data exchange alternatives, SBIS conference in Stockholm (2008) (20)

Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)
 
TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)
TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)
TAPIR PyWrapper3, at GBIF GB14 nodes meeting (2007)
 
Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...
Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...
Quick start with pallaral meta on window10 virtual desktop-virtualbox linux u...
 
User-space Network Processing
User-space Network ProcessingUser-space Network Processing
User-space Network Processing
 
Datastage parallell jobs vs datastage server jobs
Datastage parallell jobs vs datastage server jobsDatastage parallell jobs vs datastage server jobs
Datastage parallell jobs vs datastage server jobs
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate Data
 
Dynamic Hadoop Clusters
Dynamic Hadoop ClustersDynamic Hadoop Clusters
Dynamic Hadoop Clusters
 
PCIC Data Portal 2.0
PCIC Data Portal 2.0PCIC Data Portal 2.0
PCIC Data Portal 2.0
 
Deploying Apache Spark and testing big data applications on servers powered b...
Deploying Apache Spark and testing big data applications on servers powered b...Deploying Apache Spark and testing big data applications on servers powered b...
Deploying Apache Spark and testing big data applications on servers powered b...
 
Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin  Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin
 
Hadoop and big data training
Hadoop and big data trainingHadoop and big data training
Hadoop and big data training
 
Introduction to Stacki at Atlanta Meetup February 2016
Introduction to Stacki at Atlanta Meetup February 2016Introduction to Stacki at Atlanta Meetup February 2016
Introduction to Stacki at Atlanta Meetup February 2016
 
Hadoop
HadoopHadoop
Hadoop
 
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
 
Provisioning Servers Made Easy
Provisioning Servers Made EasyProvisioning Servers Made Easy
Provisioning Servers Made Easy
 
Programmability in spss 14
Programmability in spss 14Programmability in spss 14
Programmability in spss 14
 
Survey Paper on Big Data and Hadoop
Survey Paper on Big Data and HadoopSurvey Paper on Big Data and Hadoop
Survey Paper on Big Data and Hadoop
 
Hadoop
HadoopHadoop
Hadoop
 
Extending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with KubernetesExtending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with Kubernetes
 
Kubernetes for the PHP developer
Kubernetes for the PHP developerKubernetes for the PHP developer
Kubernetes for the PHP developer
 

Mais de Dag Endresen

Joint GBIF Biodiversa+ symposium in Helsinki on 2024-04-16
Joint GBIF Biodiversa+ symposium in  Helsinki on 2024-04-16Joint GBIF Biodiversa+ symposium in  Helsinki on 2024-04-16
Joint GBIF Biodiversa+ symposium in Helsinki on 2024-04-16Dag Endresen
 
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...Dag Endresen
 
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...Dag Endresen
 
Ontologies for biodiversity informatics, UiO DSC June 2023
 Ontologies for biodiversity informatics, UiO DSC June 2023 Ontologies for biodiversity informatics, UiO DSC June 2023
Ontologies for biodiversity informatics, UiO DSC June 2023Dag Endresen
 
Evacuation of the Kherson herbarium
Evacuation of the Kherson herbariumEvacuation of the Kherson herbarium
Evacuation of the Kherson herbariumDag Endresen
 
2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC Rome2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC RomeDag Endresen
 
BioDT for the UiO Science section meeting 2023-03-24
BioDT for the UiO Science section meeting 2023-03-24BioDT for the UiO Science section meeting 2023-03-24
BioDT for the UiO Science section meeting 2023-03-24Dag Endresen
 
Data and Stats Forum at MINA NMBU - 2023-04-26
Data and Stats Forum at MINA NMBU - 2023-04-26Data and Stats Forum at MINA NMBU - 2023-04-26
Data and Stats Forum at MINA NMBU - 2023-04-26Dag Endresen
 
BioDATA final conference in Oslo, November 2022
BioDATA final conference in Oslo, November 2022BioDATA final conference in Oslo, November 2022
BioDATA final conference in Oslo, November 2022Dag Endresen
 
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20Dag Endresen
 
GBIF at Living Norway Open Science Lab 2022-03-03
GBIF at Living Norway Open Science Lab 2022-03-03GBIF at Living Norway Open Science Lab 2022-03-03
GBIF at Living Norway Open Science Lab 2022-03-03Dag Endresen
 
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...Dag Endresen
 
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19Dag Endresen
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18Dag Endresen
 
GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021Dag Endresen
 
2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portals2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portalsDag Endresen
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)Dag Endresen
 
GBIF and Open Science
GBIF and Open ScienceGBIF and Open Science
GBIF and Open ScienceDag Endresen
 
FAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementFAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementDag Endresen
 
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...Dag Endresen
 

Mais de Dag Endresen (20)

Joint GBIF Biodiversa+ symposium in Helsinki on 2024-04-16
Joint GBIF Biodiversa+ symposium in  Helsinki on 2024-04-16Joint GBIF Biodiversa+ symposium in  Helsinki on 2024-04-16
Joint GBIF Biodiversa+ symposium in Helsinki on 2024-04-16
 
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
Iliad webinar 2024-03-13, Accessing and publishing marine biodiversity data i...
 
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
Modelling Research Expeditions in Wikidata: Best Practice for Standardisation...
 
Ontologies for biodiversity informatics, UiO DSC June 2023
 Ontologies for biodiversity informatics, UiO DSC June 2023 Ontologies for biodiversity informatics, UiO DSC June 2023
Ontologies for biodiversity informatics, UiO DSC June 2023
 
Evacuation of the Kherson herbarium
Evacuation of the Kherson herbariumEvacuation of the Kherson herbarium
Evacuation of the Kherson herbarium
 
2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC Rome2023-05-08 GLIS SAC Rome
2023-05-08 GLIS SAC Rome
 
BioDT for the UiO Science section meeting 2023-03-24
BioDT for the UiO Science section meeting 2023-03-24BioDT for the UiO Science section meeting 2023-03-24
BioDT for the UiO Science section meeting 2023-03-24
 
Data and Stats Forum at MINA NMBU - 2023-04-26
Data and Stats Forum at MINA NMBU - 2023-04-26Data and Stats Forum at MINA NMBU - 2023-04-26
Data and Stats Forum at MINA NMBU - 2023-04-26
 
BioDATA final conference in Oslo, November 2022
BioDATA final conference in Oslo, November 2022BioDATA final conference in Oslo, November 2022
BioDATA final conference in Oslo, November 2022
 
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
GBIF data mobilisation for the Nansen Legacy, Tromsø, 2022-09-20
 
GBIF at Living Norway Open Science Lab 2022-03-03
GBIF at Living Norway Open Science Lab 2022-03-03GBIF at Living Norway Open Science Lab 2022-03-03
GBIF at Living Norway Open Science Lab 2022-03-03
 
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
 
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
Råd fra GBIF-Norge til datainfrastrukturutvalget i dialogmøte 2021-11-19
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18
 
GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021
 
2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portals2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portals
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)
 
GBIF and Open Science
GBIF and Open ScienceGBIF and Open Science
GBIF and Open Science
 
FAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementFAIR and open biodiversity collection data management
FAIR and open biodiversity collection data management
 
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
BioDATA capacity enhancement curriculum at GBIF GB26 Global Nodes Meeting in ...
 

Último

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 

Último (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 

Data exchange alternatives, SBIS conference in Stockholm (2008)

  • 1. Biodiversity Data Provider Software Hands-on exercises with TAPIR Stockholm Biodiversity Informatics Symposium 2008 (SBIS2008) Dag Terje Filip Endresen, Nordic Genetic Resources Center (Sweden) / Bioversity International (Italy)
  • 2. Fallacies of Distributed Computing The network is reliable. Latency is zero. Bandwidth is infinite. The network is secure. Topology doesn't change. There is one administrator. Transport cost is zero. The network is homogeneous. This list of fallacies came about at Sun Microsystems around 1994. 2
  • 3. TAPIR 3 Cartoon by Sasha Kopf (Creative Commons)
  • 4. TAPIR TAPIR - TDWG Access Protocol for Information Retrieval. During the 2004 TDWG meeting in Christchurch, NZ, work started on a unified protocol and named TAPIR. TAPIR is based on the protocol from the two data provider software, BioCASE and DiGIR. 4
  • 5. Provider software, wrappers DiGIR (2002, not active) http://digir.sourceforge.net BioCASE (2003, PyWrapper) http://www.biocase.org PyWrapper3 (2006, not active) http://trac.pywrapper.org/ TapirLink (2007) http://wiki.tdwg.org/twiki/bin/view/TAPIR/TapirLink GBIF Provider Toolkit (2009) http://code.google.com/p/gbif-providertoolkit 5
  • 6. BioCASE 2.5.ORC 6 The BioCASE provider software is a product of the EU funded BioCASE project (2001-2004). Developed at BGBM in Berlin. Last updated in April 2008, with support for Python version 2.5 and less required external Implement the BioCASE provider to share data as ABCD 2.06. http://www.biocase.org
  • 7. 1. Make sure you have Python 2.5 installed (command line: python –v) 2.Download the latest provider software from http://www.biocase.org 3. Uncompress the BioCASE provider software to a folder on your system [provider_software_2.5.0RC.tar.gz] (tar –xzvf provider_...tar.gz) 4. Run setup.py, (python setup.py) 5. Configure your web server to mount biocase/www as http://localhost/biocase/ Hint: You will find an example for httpd.conf as the last terminal output from running setup.py 7 BioCASE 2.5.ORC
  • 8. BioCASE 2.5.ORC 6. Visit the library test page: http://localhost/biocase/utilities/testlibs.cgi 6a. Download latest 4 Suite from http://4suite.org/ Uncompress and install [4Suite-XML-1.0.2.tar.bz2] 6b. Install additional python libraries, including the desired database driver. For each python package: (python setup.py install) 6c. Graphviz is useful to visualize the database table structure. 8
  • 9.
  • 11. Database table structure
  • 12. Mapping of data model to standard schema9
  • 13. BioCASE 2.5.ORC 8. Query Form The manual query form is illustrative for understanding exactly how the wrapper software works! http://localhost/biocase/utilities/queryforms/qf_manual.cgi?dsa=sesto 10
  • 14. 11
  • 15.
  • 16. PyWrapper 3.1.0 alpha (development version, works with Python 2.5)PyWrapper is tested and verified to work fine with Windows, Mac OS X and Linux. 12
  • 17. Required configuration Web server: Any CGI compliant web server: Apache, IIS etc. (The built in CherryPy web server can also be used). Database: Major databases are supported, including MySQL, Oracle, SQLServer, Sybase, Access, PostgreSQL. Theoretically any database with a Python library should work. Python PyWrapper is developed with the Python programming language. (The latest version from the SVN code repository works with Python version 2.5) 13 Apache, MySQL and Python are open source software, free to use - even for commercial products.
  • 18. Installation http://trac.pywrapper.org/pywrapper/wiki/InstallationGuide 1. Download the latest PyWrapper3 installer. Use SVN export or checkout for Python 2.5 support 2. Uncompress to a folder of your choice. Example: “/usr/local/pywrapper3/” Example: “C:ywrapper Local installation: If you have a Subversion client installed, you may use the automatic installer. (Local Python and libraries are installed to your pywrapper folder) promt$ svn export svn://svn.pywrapper.org:80/pywrapper/trunk pywrapper promt$ cd pywrapper/tools promt$ /bin/shinstall.sh This will require that you have a bash shell, and probably that you have a Unix line system like e.g. FreeBSD, Linux or Mac OS X… 3. Execute: pywrapper/setup.py Example: promt$ python setup.py (Mac OS X, Linux) On Windows locate setup.py and double-click 14
  • 19. Start standalone server Execute start_server.py(default port is 8080) promt$ cdwebapp/ promt$ ./start_server.py 8088(example to start on port 8088) On a Windows system you may do this in a MS-DOS window (or double-click the file - if you accept the default port). Some messages will pass across your screen. Please be patient, this could take a minute. Wait for the message “start server …” and find find PyWrapper at: http://localhost:8088/pywrapper 15
  • 20. Configuration After successful installation, you will need to configure your data provider. Follow the instructions from the PyWrapper documentation web page to configure. Data sources. If you provide more datasets or several databases they will be configured as individual data sources (dsa). Database connection. For PyWrapper to access your database. Database structure. Define the relevant database tables, the primary keys and foreign keys. Data model. Map your database model to the standard represented by the XML Schemas you choose. http://trac.pywrapper.org/pywrapper/wiki/Documentation 16
  • 21. Screen examples PyWrapper comes with a graphical web based configuration tool For more information and more screen dumps from the configuration of PyWrapper, see: http://circa.gbif.net/Public/irc/gbif/nodes/library?l=/meetings/2007_10_amsterdam/tapir_pywrapper3/_EN_1.0_&a=i 17
  • 23. TapirLink 0.6.1 Home: http://wiki.tdwg.org/twiki/bin/view/TAPIR/TapirLink Primary developers: Renato De Giovanni, Dave Vieglais Download: http://sourceforge.net/project/showfiles.php?group_id=38190 Uncompress PHP source code Eg: /usr/local/tapir/tapirlink Mount admin and www directory for your web server. Example: Apache “httpd.conf” Alias /tapirlink "/usr/local/tapir/tapirlink/www” Alias /tapirlink-admin "/usr/local/tapir/tapirlink/admin" <Location /tapirlink> Order allow,deny Allow from all </Location> <Location /tapirlink-admin> Order allow,deny Allow from all </Location> Read permissions on all directories Write on cache, config, log, statistics 19
  • 24. TapirLink 0.6.1 Start by adding a new resource http://localhost/tapirlink-admin/ Step 1: Describe your new resource 20
  • 25. TapirLink 0.6.1 Step 2: Data source, database connection Step 3: Table structure 21
  • 26. TapirLink 0.6.1 Step 4: Filter Step 5: Select mapping standards to use 22
  • 27. TapirLink 0.6.1 Step 5b: Mapping data schema (ABCD 2.06) etc… 23
  • 28. TapirLink 0.6.1 Step 5c: Mapping data concepts (Darwin Core) etc… Step 5d: Remember that DwC have an extension for geospatial descriptors etc… 24
  • 29. TapirLink 0.6.1 Step 6: Settings New resource successfully configured 25
  • 30. TapirLink 0.6.1 Test resource with client form: http://localhost/tapirlink/tapir_client.php The XML Client form is very illustrative for understanding exactly how the wrapper software works! 26
  • 32. Example of a service request All exchanged data is formatted with XML tags. 28
  • 33. Example of a service response 29 ...
  • 34. Example TAPIR service request 30
  • 35. Example TAPIR service response singer:/sourcename singer:/taxonomy/genus singer:/taxonomy/species singer:/taxonomy/subspecies singer:/holding/ID singer:/holding/name singer:/origin/collecting/countrysource singer:/origin/collecting/countrysourceID singer:/status/biologicalstatus singer:/status/biologicalstatusID ... 31
  • 36. Example TAPIR service SEARCH request 32
  • 37. Example TAPIR service Search response 33
  • 38. Example of OAI-PMH service request OAI-PMH requests are expressed as HTTP requests. OAI-PMH requests must be submitted using either the HTTP GET or POST methods. http://an.oa.org/OAI-script?verb=GetRecord &identifier=oai:arXiv.org:hep-th/9901001 &metadataPrefix=oai_dc 34
  • 39. Example of OAI-PMH service RESPONSE OAI-PMH responses are formatted as HTTP responses. With The Content-Type as text/xml. 35
  • 40. OAI-PMH PROTOCOL, metadata formats 36 Request types (verb): Identify ListMetadataFormats ListSets GetRecord ListIdentifiers ListRecords For purposes of interoperability, the metadataPrefix `oai_dc’ is reserved for Dublin Core. Communities adopt own metadataPrefixesfor own metadata fomats. Relevant formats/schemas for Biodiversity Informatics are Darwin Core and ABCD.
  • 43. GBIF PGR Network 2 [http://data.gbif.org/datasets/network/2] 39
  • 44. 40 Distributed network The image is from the BioCASE web site
  • 45. 41 Decentralized network GBIF (Global Biodiversity Information Facility) ALIS (Accession Level Information System) USER Svalbard Global Seed Vault (Safe Backup) (USDA ARS National Germplasm Repositories...) Web Services USDA GRIN (USA) SINGER (CGIAR) (CGIAR International Future Harvest gene banks...) EURISCO (Europe) MCPD IHAR (Poland) WUR CGN (Netherlands) NordGen (Northern Europe) IPK Gatersleben (Germany) (Other European gene banks...)
  • 46. 42 Crop Wild Relatives LKA ARM BOL National Datasets are shared with the central CWR data index. The national datasets as well as access to other International datasets are provided from the CWR data portal. MDG EURISCO UZB http://www.cropwildrelatives.org SINGER
  • 49. 45
  • 50. 46
  • 51. 47
  • 52. 48
  • 53. 49 Outlook The compatibility of data standards between PGR and biodiversity collections made it possible to integrate the worldwide germplasm collections into the biodiversity community. Using GBIF technology (and contributing to its development), the PGR community can easily establish specific PGR networks without duplicating GBIF's work. Use of GBIF technology and integration of PGR collection data into GBIF allows PGR users to simultaneously search PGR collections and other biodiversity collections, and to get access to the data (and possibly the material) of relevant biodiversity collections. The establishment of new data portals on a specific crop, a regional thematic network or similar subset of the total global biodiversity datasets; can be done with rather few efforts! This requires only that all the relevant datasets are provided by GBIF compatible web services (like the BioCASE PyWrapper).
  • 54. Participation and the sharing of your institute datasets with global and national biodiversity projects is important for your public and scientific visibility, promoting the use (usefulness) of your data and ultimately for the continued funding of your institutional activities. 50
  • 55. Special thanks to Bioversity International [http://www.bioversityinternational.org] GBIF, Global Biodiversity Information Facility [http://www.gbif.org] BioCASE, The Biological Collection Access Service for Europe. [http://www.biocase.org] TDWG, Biodiversity Information Standards [http://www.tdwg.org] 51
  • 56. Special thanks to BioCASE and PyWrapper3 software Markus Döring Javier de la Torre DiGIR and TapirLink software Renato de Giovanni Dave Vieglais 52
  • 57. Thank you for listening! 53

Notas do Editor

  1. Image source: University of Ottawa, Distributed Computing Research Group: http://www.genie.uottawa.ca/research/rsrch_site.php?lang=e&id=90 (Google Images).See also: http://en.wikipedia.org/wiki/Fallacies_of_Distributed_Computing
  2. IMAGE: http://blog.tapirtype.com/cartoons/ [Creative Commons License: http://creativecommons.org/licenses/by-nc-sa/3.0/us/]
  3. http://www.tdwg.org
  4. More details see:GBIF NODES meeting 2007 in Amsterdam.Agenda 09 Technical Training session - TAPIR/PyWrapper3:http://circa.gbif.net/Public/irc/gbif/nodes/library?l=/meetings/2007_10_amsterdam/tapir_pywrapper3/_EN_1.0_&a=i
  5. More details see:GBIF NODES meeting 2007 in Amsterdam.Agenda 09 Technical Training session - TAPIR/PyWrapper3:http://circa.gbif.net/Public/irc/gbif/nodes/library?l=/meetings/2007_10_amsterdam/tapir_pywrapper3/_EN_1.0_&a=i
  6. Apache HTTPD server:http://www.apache.org/MySQL database server:http://www.mysql.com/Python programming language: http://www.python.org
  7. In some cases with proxies:svn co svn://svn.pywrapper.org/pywrapper/trunk pywrapper
  8. More details see:GBIFNODES meeting 2007 in Amsterdam.Agenda 09 Technical Training session - TAPIR/PyWrapper3:http://circa.gbif.net/Public/irc/gbif/nodes/library?l=/meetings/2007_10_amsterdam/tapir_pywrapper3/_EN_1.0_&a=i
  9. IMAGE source: http://commons.wikimedia.org/wiki/Image:Handshake_(Workshop_Cologne_%2706).jpeg; Copyright: GNU Public Licence
  10. http://wwwdev.ngb.se/portal/index.php?scope=demohttp://chm.grinfo.net/index.php?app=data_providers
  11. http://wwwdev.ngb.se/portal/index.php?scope=demohttp://chm.grinfo.net/index.php?app=data_providers
  12. http://wwwdev.ngb.se/portal/index.php?scope=demohttp://chm.grinfo.net/index.php?app=data_providers
  13. http://wwwdev.ngb.se/portal/index.php?scope=demohttp://chm.grinfo.net/index.php?app=data_providers
  14. http://wwwdev.ngb.se/portal/index.php?scope=demohttp://chm.grinfo.net/index.php?app=data_providers
  15. Photographer Dag Terje Endresen (NordGen Picture Archive, image 002986)http://www.nordgen.org/sesto/index.php?scp=ngb&thm=pictures&mod=det&id=002986&img_size=768x512