SlideShare uma empresa Scribd logo
1 de 28
Descarga de datos de bases de datos
        públicas. Caso: GBIF y GENESYS

        Nora Patricia Castañeda-Álvarez

Foto: Neil Palmer, CIAT
Usos
• Modelación de especies
• Descripción de ambientes
• Mapas descriptivos (exploración de
  información)
• Análisis de conservación en áreas protegidas
• Estimaciones del impacto de cambio climático
Registros de herbario




                   Parientes silvestres de
                           Avena
Registros de germoplasma




                   Parientes silvestres de
                           Avena
Solanum peruvianum




Foto: TGRC Tomato Genetic Resources Center http://tgrc.ucdavis.edu
Kernel-density plot of the first two dimensions of an assessment based on variables
derived from temperature and precipitation, for the section Lycopersicoides, genus
Solanum
Solanum chilense
Puntos
georreferenciados

                       Algoritmo de
                    modelación (Maxent)


Capas ambientales




                                     Tomato (Solanum lycopersicum L.) wild relatives potential richness
                                   map. This maps depicts the number of taxa that are potentially found
                                       per unit of area. Darker colours represent greater richness of the
                                         tomato genepool. (Map by N. P. Castañeda Álvarez/May 2012)
Fuentes de datos

 Datos ambientales
Climate


  •Interpolated climate surfaces for the globe up to 1km resolution: WorldClim (www.worldclim.org)
  •Downscaled layers from future climate models (GCMs): Climate Change Agriculture and Food Security (CCAFS) (www.ccafs-climate.org)
  •Reconstructed paleoclimates: US National Oceanic and Atmospheric Administration (NOAA) (www.ncdc.noaa.gov/paleo/paleo.html)


  Topography

  •Elevation, watershed and related variables for the globe at 1km resolution: US Geological Survey (USGS) (http://eros.usgs.gov)
  •High-quality elevation data for large portions of the tropics and other areas of the developing world: SRTM 90m Elevation Data
  (http://srtm.csi.cgiar.org)

  Remote sensing (satellite)

  •Various land-cover datasets: Global Land Cover Facility (GLCF) (http://glcf.umiacs.umd.edu/data)
  •Various atmospheric and land products from the MODIS instrument: National Aeronautics and Space Administration (NASA)
  (http://modis.gsfc.nasa.gov/data)

  Soils

  •Harmonized World Soil Database (www.iiasa.ac.at/Research/LUC/External-World-soil-database/HTML)

  Other spatial data



  •Relevant links and data at DIVA-GIS website (country level, global level, global climate, species occurrence); near global 90-meter resolution
  elevation data, high-resolution satellite images (LandSat) (www.diva-gis.org/Data)
  •Spatial database of the world's administrative areas (or administrative boundaries): Global Administrative Areas (GADM) (www.gadm.org)
  •Database with eight million place names with geographical coordinates: GeoNames (www.geonames.org)
  •Automatic georeferencing tools: BioGeomancer (www.biogeomancer.org)


Van Zonneveld, M., Thomas, E., Galluzzi, E., Scheldeman, X. Mapping the ecogeographic distribution of biodiversity and GIS
tools for plant germplasm collectors. Collecting plant genetic diversity: Technical guidelines. 2011
Algoritmos de modelación
          Modelling algorithm                  Type of input required                   Software source
Maxent (Phillips et al. 2006)               Presence and absence data   www.cs.princeton.edu/~schapire/maxent
                                            (pseudo-absences allowed)
Bioclim                                     Presence data               http://diva-gis.org,
                                                                        http://openmodeller.sourceforge.net
DOMAIN (Carpenter et al. 1993)              Presence data               http://diva-gis.org
Artificial Neural Networks (ANN)            Presence data               http://openmodeller.sourceforge.net
Ecological-Niche Factor Analysis – ENFA-    Presence data               www2.unil.ch/biomapper ,
(Hirzel et al. 2002)                                                    http://openmodeller.sourceforge.net
Genetic Algorithm for Rule Set Production   Presence and absence data   www.nhm.ku.edu/desktopgarp,
–GARP- (Stockwell & Noble 1992)                                         http://openmodeller.sourceforge.net

HABITAT (Walker & Cocks 1991)               Presence data
Generalized Linear Model (GLM)              Presence and absence data   R: package “dismo”, function “glm”
                                                                        R: package “BIOMOD”
Generalized Additive Model (GAM)            Presence and absence data   R: package “mgcv”
                                                                        R: package “BIOMOD”
Mahalanobis Distance (MD)                   Presence data               www.jennessent.com/arcview/mahalanobis_grids.h
                                                                        tm
Classification Tree Analysis (CTA)                                      R: package “BIOMOD”
Surface Range Envelope (SRE)                                            R: package “BIOMOD”
Generalized Boosting Model (GBM)            Presence and absence data   R: package “BIOMOD”
Breiman and Cutler’s random forest for                                  R: package “BIOMOD”
classification and regression (RF)
Flexible Discriminant Analysis (FDA)                                    R: package “BIOMOD”
Multiple Adaptive Regression Splines        Presence and absence data   R: package “BIOMOD”
(MARS)
Fuentes de datos

  Datos biológicos
Nombre                                                           Página
JSTOR Plant Science                                              http://plants.jstor.org/
GENESYS                                                          http://www.genesys-pgr.org/
European Plant Genetic Resources Search Catalogue (EURISCO)      http://eurisco.ecpgr.org/nc/home_page.html


System-wide Information Network for Genetic Resources (SINGER)   http://singer.cgiar.org


Genetic Resources Information Network of the United States       www.ars-grin.gov/npgs/searchgrin.html
Department of Agriculture (GRIN)
ENSCONET                                                         http://enscobase.maich.gr/
Botanic Garden Conservation International (BGCI) database        http://www.bgci.org/plant_search.php
National program: Russia                                         http://www.agroatlas.ru/
National program: Brazil                                         http://www.cria.org.br/
National program: Japan                                          http://www.gene.affrc.go.jp/databases_en.php
National program: Mexico                                         http://www.biodiversidad.gob.mx/genes/proyectoMaices.html
Harold and Adele Lieberman Germplasm Bank (cereals)              http://www.tau.ac.il/lifesci/units/ICCI/genebank1.html
Manchester Museum                                                http://emu.man.ac.uk/mmcustom/BotQuery.php
Millennium Seed Bank, Kew                                        http://www.kew.org/science-conservation/save-seed-
                                                                 prosper/millennium-seed-bank/index.htm
National History Museum, UK                                      http://www.nhm.ac.uk/research-
                                                                 curation/collections/departmental-collections/botany-
                                                                 collections/search/index.php
Royal Botanic Gardens Kew                                        http://apps.kew.org/herbcat/navigator.do
Royal Botanical Garden of Edinburgh                              http://www.rbge.org.uk/databases
SolanaceaeSource                                                 http://www.nhm.ac.uk/research-
                                                                 curation/research/projects/solanaceaesource
United States Virtual Herbarium                                  http://usvirtualherbarium.org/
Por qué GBIF y GENESYS?
GBIF
GBIF
Genesys

          Eurisco

Singer              Grin



          Genesys
Estudio de caso:

Phaseolus coccineus
Ingreso a: data.gbif.org




    Campo para ingresar nombre de la
          especie de interes
Primeros resultados
Selección de campos para
        descargar
www.genesys-pgr.org




   Opciones de búsqueda
OJO: filtrar resultados!
            DATA SUMMARIES > By Genus / Species
Descarga de información
Descarga de información




                           OJO CON LA
                          SELECCION DE
                            CAMPOS!

Mais conteúdo relacionado

Mais procurados

BioMinds Poster!!!!!!!!
BioMinds Poster!!!!!!!!BioMinds Poster!!!!!!!!
BioMinds Poster!!!!!!!!
Zuleika86
 
Capfitogen side event gb5 wo_animation_nt
Capfitogen side event gb5 wo_animation_ntCapfitogen side event gb5 wo_animation_nt
Capfitogen side event gb5 wo_animation_nt
Mauricio Parra Quijano
 

Mais procurados (20)

Ncbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osuNcbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osu
 
EMBL
EMBLEMBL
EMBL
 
BioMinds Poster!!!!!!!!
BioMinds Poster!!!!!!!!BioMinds Poster!!!!!!!!
BioMinds Poster!!!!!!!!
 
Perspectives and Challenges of Phenotyping in Crop Improvement. - Copy.pptx
Perspectives and Challenges of Phenotyping in Crop Improvement. - Copy.pptxPerspectives and Challenges of Phenotyping in Crop Improvement. - Copy.pptx
Perspectives and Challenges of Phenotyping in Crop Improvement. - Copy.pptx
 
Targeted RNA Sequencing, Urban Metagenomics, and Astronaut Genomics
Targeted RNA Sequencing, Urban Metagenomics, and Astronaut GenomicsTargeted RNA Sequencing, Urban Metagenomics, and Astronaut Genomics
Targeted RNA Sequencing, Urban Metagenomics, and Astronaut Genomics
 
How to build ontologies - a case study of Agriculture Activity Ontology
How to build ontologies - a case study of Agriculture Activity OntologyHow to build ontologies - a case study of Agriculture Activity Ontology
How to build ontologies - a case study of Agriculture Activity Ontology
 
Pathways and genomes databases in bioinformatics
Pathways and genomes databases in bioinformaticsPathways and genomes databases in bioinformatics
Pathways and genomes databases in bioinformatics
 
Pre breeding in maize
Pre breeding in maizePre breeding in maize
Pre breeding in maize
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Standards and best practices for genebank management
Standards and best practices for genebank managementStandards and best practices for genebank management
Standards and best practices for genebank management
 
Current trends in plant breeding
Current trends in plant breedingCurrent trends in plant breeding
Current trends in plant breeding
 
Features of biological databases
Features of biological databasesFeatures of biological databases
Features of biological databases
 
Capfitogen side event gb5 wo_animation_nt
Capfitogen side event gb5 wo_animation_ntCapfitogen side event gb5 wo_animation_nt
Capfitogen side event gb5 wo_animation_nt
 
Biological Databases
Biological DatabasesBiological Databases
Biological Databases
 
Metagenomics as a tool for biodiversity and health
Metagenomics as a tool for biodiversity and healthMetagenomics as a tool for biodiversity and health
Metagenomics as a tool for biodiversity and health
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Ddbj
DdbjDdbj
Ddbj
 
Metagenomics
MetagenomicsMetagenomics
Metagenomics
 
Metagenomics: An overview
Metagenomics: An overviewMetagenomics: An overview
Metagenomics: An overview
 
metagenomics
metagenomicsmetagenomics
metagenomics
 

Semelhante a Castaneda2013 capfitogen bases_de_datos

Castaneda2009 Modelamiento Distribucion Especies
Castaneda2009 Modelamiento Distribucion EspeciesCastaneda2009 Modelamiento Distribucion Especies
Castaneda2009 Modelamiento Distribucion Especies
Nora P. Castañeda-Álvarez
 
Nora C Gap Analysis focused on Acutifolii and Rugosi sections
Nora C Gap Analysis focused on Acutifolii and Rugosi sectionsNora C Gap Analysis focused on Acutifolii and Rugosi sections
Nora C Gap Analysis focused on Acutifolii and Rugosi sections
Nora P. Castañeda-Álvarez
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
David Ruau
 
Buffington ecn 2012
Buffington ecn 2012Buffington ecn 2012
Buffington ecn 2012
ECNOfficer
 

Semelhante a Castaneda2013 capfitogen bases_de_datos (20)

Trait data mining using FIGS (2006)
Trait data mining using FIGS (2006)Trait data mining using FIGS (2006)
Trait data mining using FIGS (2006)
 
Castaneda2009 Modelamiento Distribucion Especies
Castaneda2009 Modelamiento Distribucion EspeciesCastaneda2009 Modelamiento Distribucion Especies
Castaneda2009 Modelamiento Distribucion Especies
 
Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)
 
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
 
Global Information Systems for Plant Genetic Resources, SeedNet training cour...
Global Information Systems for Plant Genetic Resources, SeedNet training cour...Global Information Systems for Plant Genetic Resources, SeedNet training cour...
Global Information Systems for Plant Genetic Resources, SeedNet training cour...
 
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3
 
Personal Genomes: what can I do with my data?
Personal Genomes: what can I do with my data?Personal Genomes: what can I do with my data?
Personal Genomes: what can I do with my data?
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 
Overview on arabidopsis and rice genome
Overview on arabidopsis and rice genomeOverview on arabidopsis and rice genome
Overview on arabidopsis and rice genome
 
Nora C Gap Analysis focused on Acutifolii and Rugosi sections
Nora C Gap Analysis focused on Acutifolii and Rugosi sectionsNora C Gap Analysis focused on Acutifolii and Rugosi sections
Nora C Gap Analysis focused on Acutifolii and Rugosi sections
 
Predictive association between trait data and eco-geographic data for Nordic ...
Predictive association between trait data and eco-geographic data for Nordic ...Predictive association between trait data and eco-geographic data for Nordic ...
Predictive association between trait data and eco-geographic data for Nordic ...
 
Plant phenotyping platforms
Plant phenotyping platformsPlant phenotyping platforms
Plant phenotyping platforms
 
Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)
 
FAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseFAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use Case
 
PAG-2004-Roe
PAG-2004-RoePAG-2004-Roe
PAG-2004-Roe
 
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
 
OptIPuter: Metagenomics at Light Speed
OptIPuter: Metagenomics at Light SpeedOptIPuter: Metagenomics at Light Speed
OptIPuter: Metagenomics at Light Speed
 
Buffington ecn 2012
Buffington ecn 2012Buffington ecn 2012
Buffington ecn 2012
 
Module 5 - EN - Promoting data use III: Most frequent data analysis techniques
Module 5 - EN - Promoting data use III: Most frequent data analysis techniques Module 5 - EN - Promoting data use III: Most frequent data analysis techniques
Module 5 - EN - Promoting data use III: Most frequent data analysis techniques
 

Mais de Nora P. Castañeda-Álvarez (6)

Castaneda2014 biodiversidad como estrategia para la adaptación al cambio
Castaneda2014 biodiversidad como estrategia para la adaptación al cambioCastaneda2014 biodiversidad como estrategia para la adaptación al cambio
Castaneda2014 biodiversidad como estrategia para la adaptación al cambio
 
Castaneda2013 capfitogen
Castaneda2013 capfitogenCastaneda2013 capfitogen
Castaneda2013 capfitogen
 
Castaneda2011 tomates silvestres
Castaneda2011 tomates silvestresCastaneda2011 tomates silvestres
Castaneda2011 tomates silvestres
 
Castaneda2010 gapanalysis
Castaneda2010 gapanalysisCastaneda2010 gapanalysis
Castaneda2010 gapanalysis
 
Castaneda2010 biogeomancer
Castaneda2010 biogeomancerCastaneda2010 biogeomancer
Castaneda2010 biogeomancer
 
Castaneda2010 Importancia Puntos
Castaneda2010 Importancia PuntosCastaneda2010 Importancia Puntos
Castaneda2010 Importancia Puntos
 

Castaneda2013 capfitogen bases_de_datos

  • 1. Descarga de datos de bases de datos públicas. Caso: GBIF y GENESYS Nora Patricia Castañeda-Álvarez Foto: Neil Palmer, CIAT
  • 2. Usos • Modelación de especies • Descripción de ambientes • Mapas descriptivos (exploración de información) • Análisis de conservación en áreas protegidas • Estimaciones del impacto de cambio climático
  • 3. Registros de herbario Parientes silvestres de Avena
  • 4. Registros de germoplasma Parientes silvestres de Avena
  • 5. Solanum peruvianum Foto: TGRC Tomato Genetic Resources Center http://tgrc.ucdavis.edu
  • 6. Kernel-density plot of the first two dimensions of an assessment based on variables derived from temperature and precipitation, for the section Lycopersicoides, genus Solanum
  • 8. Puntos georreferenciados Algoritmo de modelación (Maxent) Capas ambientales Tomato (Solanum lycopersicum L.) wild relatives potential richness map. This maps depicts the number of taxa that are potentially found per unit of area. Darker colours represent greater richness of the tomato genepool. (Map by N. P. Castañeda Álvarez/May 2012)
  • 9. Fuentes de datos Datos ambientales
  • 10. Climate •Interpolated climate surfaces for the globe up to 1km resolution: WorldClim (www.worldclim.org) •Downscaled layers from future climate models (GCMs): Climate Change Agriculture and Food Security (CCAFS) (www.ccafs-climate.org) •Reconstructed paleoclimates: US National Oceanic and Atmospheric Administration (NOAA) (www.ncdc.noaa.gov/paleo/paleo.html) Topography •Elevation, watershed and related variables for the globe at 1km resolution: US Geological Survey (USGS) (http://eros.usgs.gov) •High-quality elevation data for large portions of the tropics and other areas of the developing world: SRTM 90m Elevation Data (http://srtm.csi.cgiar.org) Remote sensing (satellite) •Various land-cover datasets: Global Land Cover Facility (GLCF) (http://glcf.umiacs.umd.edu/data) •Various atmospheric and land products from the MODIS instrument: National Aeronautics and Space Administration (NASA) (http://modis.gsfc.nasa.gov/data) Soils •Harmonized World Soil Database (www.iiasa.ac.at/Research/LUC/External-World-soil-database/HTML) Other spatial data •Relevant links and data at DIVA-GIS website (country level, global level, global climate, species occurrence); near global 90-meter resolution elevation data, high-resolution satellite images (LandSat) (www.diva-gis.org/Data) •Spatial database of the world's administrative areas (or administrative boundaries): Global Administrative Areas (GADM) (www.gadm.org) •Database with eight million place names with geographical coordinates: GeoNames (www.geonames.org) •Automatic georeferencing tools: BioGeomancer (www.biogeomancer.org) Van Zonneveld, M., Thomas, E., Galluzzi, E., Scheldeman, X. Mapping the ecogeographic distribution of biodiversity and GIS tools for plant germplasm collectors. Collecting plant genetic diversity: Technical guidelines. 2011
  • 11. Algoritmos de modelación Modelling algorithm Type of input required Software source Maxent (Phillips et al. 2006) Presence and absence data www.cs.princeton.edu/~schapire/maxent (pseudo-absences allowed) Bioclim Presence data http://diva-gis.org, http://openmodeller.sourceforge.net DOMAIN (Carpenter et al. 1993) Presence data http://diva-gis.org Artificial Neural Networks (ANN) Presence data http://openmodeller.sourceforge.net Ecological-Niche Factor Analysis – ENFA- Presence data www2.unil.ch/biomapper , (Hirzel et al. 2002) http://openmodeller.sourceforge.net Genetic Algorithm for Rule Set Production Presence and absence data www.nhm.ku.edu/desktopgarp, –GARP- (Stockwell & Noble 1992) http://openmodeller.sourceforge.net HABITAT (Walker & Cocks 1991) Presence data Generalized Linear Model (GLM) Presence and absence data R: package “dismo”, function “glm” R: package “BIOMOD” Generalized Additive Model (GAM) Presence and absence data R: package “mgcv” R: package “BIOMOD” Mahalanobis Distance (MD) Presence data www.jennessent.com/arcview/mahalanobis_grids.h tm Classification Tree Analysis (CTA) R: package “BIOMOD” Surface Range Envelope (SRE) R: package “BIOMOD” Generalized Boosting Model (GBM) Presence and absence data R: package “BIOMOD” Breiman and Cutler’s random forest for R: package “BIOMOD” classification and regression (RF) Flexible Discriminant Analysis (FDA) R: package “BIOMOD” Multiple Adaptive Regression Splines Presence and absence data R: package “BIOMOD” (MARS)
  • 12. Fuentes de datos Datos biológicos
  • 13. Nombre Página JSTOR Plant Science http://plants.jstor.org/ GENESYS http://www.genesys-pgr.org/ European Plant Genetic Resources Search Catalogue (EURISCO) http://eurisco.ecpgr.org/nc/home_page.html System-wide Information Network for Genetic Resources (SINGER) http://singer.cgiar.org Genetic Resources Information Network of the United States www.ars-grin.gov/npgs/searchgrin.html Department of Agriculture (GRIN) ENSCONET http://enscobase.maich.gr/ Botanic Garden Conservation International (BGCI) database http://www.bgci.org/plant_search.php National program: Russia http://www.agroatlas.ru/ National program: Brazil http://www.cria.org.br/ National program: Japan http://www.gene.affrc.go.jp/databases_en.php National program: Mexico http://www.biodiversidad.gob.mx/genes/proyectoMaices.html Harold and Adele Lieberman Germplasm Bank (cereals) http://www.tau.ac.il/lifesci/units/ICCI/genebank1.html Manchester Museum http://emu.man.ac.uk/mmcustom/BotQuery.php Millennium Seed Bank, Kew http://www.kew.org/science-conservation/save-seed- prosper/millennium-seed-bank/index.htm National History Museum, UK http://www.nhm.ac.uk/research- curation/collections/departmental-collections/botany- collections/search/index.php Royal Botanic Gardens Kew http://apps.kew.org/herbcat/navigator.do Royal Botanical Garden of Edinburgh http://www.rbge.org.uk/databases SolanaceaeSource http://www.nhm.ac.uk/research- curation/research/projects/solanaceaesource United States Virtual Herbarium http://usvirtualherbarium.org/
  • 14. Por qué GBIF y GENESYS?
  • 15. GBIF
  • 16. GBIF
  • 17. Genesys Eurisco Singer Grin Genesys
  • 19. Ingreso a: data.gbif.org Campo para ingresar nombre de la especie de interes
  • 21.
  • 22.
  • 23. Selección de campos para descargar
  • 24.
  • 25. www.genesys-pgr.org Opciones de búsqueda
  • 26. OJO: filtrar resultados! DATA SUMMARIES > By Genus / Species
  • 28. Descarga de información OJO CON LA SELECCION DE CAMPOS!