SlideShare uma empresa Scribd logo
1 de 3
Baixar para ler offline
[18:00 30/3/2009 Bioinformatics-btp105.tex] Page: 1096 1096–1098
BIOINFORMATICS APPLICATIONS NOTE
Vol. 25 no. 8 2009, pages 1096–1098
doi:10.1093/bioinformatics/btp105
Databases and ontologies
INTEGRALL: a database and search engine for integrons,
integrases and gene cassettes
Alexandra Moura1,∗, Mário Soares2, Carolina Pereira3, Nuno Leitão2, Isabel Henriques1
and António Correia1
1CESAM & Department of Biology, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro,
2Sindesi – Informatic Systems and Solutions Lda., Rua Manuel Barbuda e Vasconcelos, 62-2D, 3810-498 Aveiro
and 3Department of Biology, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
Received on September 29, 2008; revised on February 16, 2009; accepted on February 17, 2009
Advance Access publication February 19, 2009
Associate Editor: Alex Bateman
ABSTRACT
Summary: INTEGRALL is a freely available, text-based search
system developed with the aim of collecting and organizing infor-
mation on integrons in a single database. The current release (1.2)
contains more than 4800 integron sequences and provides a
public genetic repository for sequence data and nomenclature,
offering scientists an easy and interactive access to integron’s DNA
sequences, their molecular arrangements as well as their genetic
contexts.
Availability: http://integrall.bio.ua.pt
Contact: amoura@ua.pt
1 INTRODUCTION
Prokaryotic genomes are characterized by a high degree of plasticity
and an enormous evolutive potential (van Passel et al., 2008).
One of the major forces contributing to genome evolution is the
exogenous acquisition of genetic material, so-called horizontal
gene transfer (HGT). HGT contributes to adaptation to different
ecological niches in response to selective pressures generating novel
gene combinations (Sørensen et al., 2005).
HGT may overcome species boundaries and has been implicated
in the dispersion of genes involved in complex cellular processes,
including pathogenicity (Hentschel and Hacker, 2001).
The high number of studies regarding mobile genetic elements
(MGEs) has quickly generated a huge amount of information
concerning these structures. As a consequence of this informative
burst, two bottlenecks are limiting the easy access and analysis
of molecular sequences representing mobile elements: confusing
nomenclatures and absence of filtration of public databases to
eliminate non-reliable data. To overcome that, databases dedicated
to MGEs such as plasmids and phages (ACLAME; Leplae et al.,
2004) and insertion sequences (ISfinder; Siguier et al., 2006) have
been created.
Two decades have passed since the discovery of integrons,
but despite their importance and their extraordinary diversity of
organization there no database dedicated to the deposition and
annotation of molecular data concerning these structures. Integrons
are mobilizable platforms that play a key role in the enhancement
∗To whom correspondence should be addressed.
of genetic diversity in bacteria. They act as bacterial recombination
systems that mediate the capture and expression of gene cassettes
(Hall et al., 1999). Integrons are considered as the primary
mechanism for antibiotic resistance gene acquisition among bacteria
and are frequently associated with transposons and conjugative
plasmids (Gillings et al., 2008). Integrons are composed of an
integrase-encoding gene (intI), a recombination site (attI) and one
or two promoters that control the expression of gene cassettes (Hall
et al., 1999). Their genetic organization can be quite diverse, with
different combinations of gene cassettes. In bacterial communities
integrons also represent an important fraction of the mobile gene
pool (Mazel, 2006). Furthermore, integrons are present in ∼9%
of completely or partially sequenced genomes (Boucher et al.,
2007). For a long time and until recently, research on integron
diversity had been restricted to bacterial isolates obtained mainly
from clinical sources. However, recent studies report the existence
of natural environments as reservoirs of gene cassettes and sources
of functionally diverse gene products often recovered by methods
that access the non-cultivable bacterial fraction (Boucher et al.,
2007).
To overcome one of the difficulties produced by the disclosure
of many new sequences, handling and managing data, we propose
a database dedicated to integrons. The INTEGRALL web-based
platform was developed by microbiologists in collaboration with
computer scientists. INTEGRALL aims to be a repository for 20
years of data about integrons. We believe that this database is crucial
to congregate information on the phylogeny of the bacterial hosts
and their ecology, the molecular diversity of inserted gene cassettes
and the type of integrases. The correlation between these data is
essential to understand the role of integrons in bacterial adaptive
responses and interactions.
2 THE SITE
INTEGRALL was developed in PHP 5, using a MySQL 5.0 database
backend. Implementation uses standard PC servers, with a Linux-
based distribution and Apache 1.3 server running PHP as FastCGI.
The current version of INTEGRALL (1.2) includes:
(i) background information regarding integrons;
(ii) a LIST tool;
1096 © The Author 2009. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
byguestonNovember23,2016http://bioinformatics.oxfordjournals.org/Downloadedfrom
[18:00 30/3/2009 Bioinformatics-btp105.tex] Page: 1097 1096–1098
INTEGRALL database
(iii) a SEARCH tool and BLAST interface;
(iv) guidelines for nomenclature of gene cassettes;
(v) a GLOSSARY of integron-associated genes; and
(vi) a FORUM discussion on mobile genetic elements.
The LIST tool provides a table of all integrase genes and/or gene
cassette arrays deposited at NCBI GenBank (http://www.ncbi.nlm.
nih.gov/Genbank).
For every entry, information regarding identification of the
bacterial host, type of integrase and gene cassettes arrays (if
present) is provided. Other remarks may also be included in this
section, such as integron type. Additional data and functions like
information about the source of isolation or direct access to FASTA
sequences together with bibliographic references are also available.
When possible, users may also access information concerning the
genetic context of integrons (as in the case of completely sequenced
plasmids and genomes) by pushing the context button. Moreover,
all entries have a direct link to the complete gene sequence and
additional information deposited at NCBI GenBank.
The SEARCH tool allows two modes of operation: simple or
advanced search. Used in advanced mode, SEARCH is able to
simultaneously combine two or more of the following criteria:
accession number, journal, authors, organism and source of isolation,
integrase gene or gene cassettes. Other features, such as other
integron-associated genes and integron genomic location (plasmid
and/or chromosome) may also be assessed. Sorting of query results
is done by alphabetical order of organism by default, but users may
also sort results by gene cassette array or intI genes.
Results obtained by using the SEARCH tool can be exported to
a separate file in FASTA format allowing data to be used in further
analysis.
The BLAST interface provides an alignment tool allowing the
comparison of sequences submitted to the website against the
INTEGRALL database (Altschul et al., 1997).
NOMENCLATURE guidelines and GLOSSARY of integron-
associated genes are also present and aim to facilitate and standardize
gene identification, as multiple names are often attributed to identical
genes (for instance, in what concerns gene cassettes coding for
aminoglycoside- and trimethoprim-resistance proteins).
The INTEGRALL platform also includes an external link to a
discussion forum on integrons and other mobile genetic elements in
order to promote communication and the exchange of ideas among
scientists worldwide.
To avoid duplicate submissions, all entries are linked to the NCBI
GenBank through their accession number. Daily updates will be
performed to include in INTEGRALL newly released sequences.
3 THE DATABASE
To date, INTEGRALL comprises more than 4800 integron
or integron-related nucleotide sequences and this number is
continually increasing. Up to 70% of integron-related sequences
correspond to uncultured bacteria and 27% were identified in
the genomes of organisms belonging to the γ-Proteobacteria.
The remaining 3% include a few bacterial representatives of
α-, β-, δ- and ε-Proteobacteria, as well as Actinobacteria,
Firmicutes, Cyanobacteria, Chlamydiae/Verrucomicrobia group,
Bacteroidetes/Chlorobi group, Spirochaetes and Planctomycetes.
Thus, integrons appear to have a broad-host range, despite their
higher prevalence in γ-Proteobacteria, which might be related to a
higher number of studies of integrons in clinical settings comparing
to natural environments.
Ecology of integron-carrying microorganisms, and thus
environmental conditions that allow their dispersal, is also
remarkably diverse: it includes wastewaters, river and marine
waters, deep-sea vents, contaminated mine tailings, manure, soil
and sediments, livestock and wild animals, food and clinical
samples.
Regarding the gene cassette pool, an important part of it is strongly
linked to antibiotic resistance: 51% (3134 out of 6116) of gene
cassettes reported encode antibiotic-resistance proteins. In parallel,
46.5% (2846 out of 6116) of gene cassettes, mostly recovered from
genetic libraries constructed using culture-independent methods,
potentially encode proteins with unknown functions. Studies are
needed to elucidate these functions and whether the mobilization
and transfer of these genes contribute to the fitness of bacterial
strains in certain environments thus playing an important role in
the adaptation and evolution of bacterial communities.
4 CONCLUSIONS
Here, we describe a web-based platform dedicated to compile
information on integrons and designed to organize all the data
available for these genetic structures.
This search engine is still a work in progress and scientists
working with these structures are very welcome to contribute with
their suggestions and data. Depending on the feedback more utilities
will be added over time.
ACKNOWLEDGEMENTS
Authors wish to thank Ana Pamplona (Instituto de Medicina
Molecular and Instituto Gulbenkian de Ciência, Portugal) for
revision of the manuscript, Maria João Carvalho (University
of Aveiro, Portugal) for all the assistance provided, António
Nogueira (University of Aveiro, Portugal) for web-server hosting
and Thomas Jové (University of Limoges, France) for all the
valuable contributions made to the INTEGRALL database.
Funding: A.M. and I.H. are funded by Fundação para a Ciência e a
Tecnologia (SFRH/BD/19443/2004 and SFRH/BPD/21384/2005).
Conflict of Interest: none declared.
REFERENCES
Altschul,S.F. et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein
database search programs. Nucleic Acids Res., 25, 3389–3402.
Boucher,Y. et al. (2007) Integrons: mobilizable platforms that promote genetic diversity
in bacteria. Trends Microbiol., 15, 301–309.
Gillings,M. et al. (2008) The evolution of class 1 integrons and the rise of antibiotic
resistance. J. Bacteriol., 190, 5095–5100.
Hall,R.M. et al. (1999) Mobile gene cassettes and integrons in evolution. Ann. N. Y.
Acad. Sci., 870, 68–80.
Hentschel,U. and Hacker,J. (2001) Pathogenicity islands: the tip of the iceberg. Microbes
Infect., 3, 545–548.
Leplae,R. et al. (2004) ACLAME: a classification of mobile genetic elements. Nucleic
Acids Res., 32, D45–D49.
Mazel,D. (2006) Integrons: agents of bacterial evolution. Nat. Rev. Microbiol., 4,
608–620.
1097
byguestonNovember23,2016http://bioinformatics.oxfordjournals.org/Downloadedfrom
[18:00 30/3/2009 Bioinformatics-btp105.tex] Page: 1098 1096–1098
A.Moura et al.
Siguier,P. et al. (2006) ISfinder: the reference centre for bacterial insertion sequences.
Nucleic Acids Res., 34, D32–D36.
Sørensen,S. et al. (2005) Studying plasmid horizontal transfer in situ: a critical review.
Nat. Rev. Microbiol., 3, 700–710.
van Passel,M.W.J. et al. (2008) The emergence and fate of horizontally acquired genes
in Escherichia coli. PLoS Comput. Biol., 4, e1000059.
1098
byguestonNovember23,2016http://bioinformatics.oxfordjournals.org/Downloadedfrom

Mais conteúdo relacionado

Mais procurados

Computational Biology and Bioinformatics
Computational Biology and BioinformaticsComputational Biology and Bioinformatics
Computational Biology and BioinformaticsSharif Shuvo
 
Careers in bioinformatics, Scope, Skills and Jobs
Careers in bioinformatics, Scope, Skills and JobsCareers in bioinformatics, Scope, Skills and Jobs
Careers in bioinformatics, Scope, Skills and JobsM Abdullah Chaudhry
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsElena Sügis
 
Bioinformatics for beginners (exam point of view)
Bioinformatics for beginners (exam point of view)Bioinformatics for beginners (exam point of view)
Bioinformatics for beginners (exam point of view)Sijo A
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchAnshika Bansal
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformaticsAtai Rabby
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p collegeSKUASTKashmir
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsAmna Jalil
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformaticsbiinoida
 
bioinformatics simple
bioinformatics simple bioinformatics simple
bioinformatics simple nadeem akhter
 
Publicly available tools and open resources in Bioinformatics
Publicly available  tools and open resources in BioinformaticsPublicly available  tools and open resources in Bioinformatics
Publicly available tools and open resources in BioinformaticsArindam Ghosh
 
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...mohd younus wani
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu KAUSHAL SAHU
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsBivek Rai
 

Mais procurados (20)

Computational Biology and Bioinformatics
Computational Biology and BioinformaticsComputational Biology and Bioinformatics
Computational Biology and Bioinformatics
 
Careers in bioinformatics, Scope, Skills and Jobs
Careers in bioinformatics, Scope, Skills and JobsCareers in bioinformatics, Scope, Skills and Jobs
Careers in bioinformatics, Scope, Skills and Jobs
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in Bioinformatics
 
Bioinformatics for beginners (exam point of view)
Bioinformatics for beginners (exam point of view)Bioinformatics for beginners (exam point of view)
Bioinformatics for beginners (exam point of view)
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics in present and its future
Bioinformatics in present and its futureBioinformatics in present and its future
Bioinformatics in present and its future
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p college
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
bioinformatics simple
bioinformatics simple bioinformatics simple
bioinformatics simple
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Publicly available tools and open resources in Bioinformatics
Publicly available  tools and open resources in BioinformaticsPublicly available  tools and open resources in Bioinformatics
Publicly available tools and open resources in Bioinformatics
 
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 

Destaque

Cibercrimen diapos
Cibercrimen diaposCibercrimen diapos
Cibercrimen diaposedberdavid
 
حدیث کساء Quran Hints Islam Shia @quranhints
حدیث کساء Quran Hints Islam Shia @quranhintsحدیث کساء Quran Hints Islam Shia @quranhints
حدیث کساء Quran Hints Islam Shia @quranhintsAli Dalili
 
FIRE SAFETY PRECAUTIONS TAKEN AS A RESULT OF FIRE ENVIRONMENT ANALYSIS IN INT...
FIRE SAFETY PRECAUTIONS TAKEN AS A RESULT OF FIRE ENVIRONMENT ANALYSIS IN INT...FIRE SAFETY PRECAUTIONS TAKEN AS A RESULT OF FIRE ENVIRONMENT ANALYSIS IN INT...
FIRE SAFETY PRECAUTIONS TAKEN AS A RESULT OF FIRE ENVIRONMENT ANALYSIS IN INT...Zuhal Şimşek
 
H. Con. Res. 150 Introduced 9.9.2016
H. Con. Res. 150 Introduced 9.9.2016H. Con. Res. 150 Introduced 9.9.2016
H. Con. Res. 150 Introduced 9.9.2016Kathy Scholpp
 
Informe a 6
Informe a 6Informe a 6
Informe a 6sidokar
 
Proceso de votacion primarias 2013
Proceso de votacion primarias 2013Proceso de votacion primarias 2013
Proceso de votacion primarias 2013Vicente Bove
 
Informe 1111
Informe 1111Informe 1111
Informe 1111sidokar
 

Destaque (20)

Dunia quintero f_cris[1]
Dunia quintero f_cris[1]Dunia quintero f_cris[1]
Dunia quintero f_cris[1]
 
Cibercrimen diapos
Cibercrimen diaposCibercrimen diapos
Cibercrimen diapos
 
حدیث کساء Quran Hints Islam Shia @quranhints
حدیث کساء Quran Hints Islam Shia @quranhintsحدیث کساء Quran Hints Islam Shia @quranhints
حدیث کساء Quran Hints Islam Shia @quranhints
 
FIRE SAFETY PRECAUTIONS TAKEN AS A RESULT OF FIRE ENVIRONMENT ANALYSIS IN INT...
FIRE SAFETY PRECAUTIONS TAKEN AS A RESULT OF FIRE ENVIRONMENT ANALYSIS IN INT...FIRE SAFETY PRECAUTIONS TAKEN AS A RESULT OF FIRE ENVIRONMENT ANALYSIS IN INT...
FIRE SAFETY PRECAUTIONS TAKEN AS A RESULT OF FIRE ENVIRONMENT ANALYSIS IN INT...
 
Los talleres fotos
Los talleres fotosLos talleres fotos
Los talleres fotos
 
H. Con. Res. 150 Introduced 9.9.2016
H. Con. Res. 150 Introduced 9.9.2016H. Con. Res. 150 Introduced 9.9.2016
H. Con. Res. 150 Introduced 9.9.2016
 
Informe a 6
Informe a 6Informe a 6
Informe a 6
 
De la web a losmoviles
De la web a losmoviles De la web a losmoviles
De la web a losmoviles
 
D redes
D redesD redes
D redes
 
Decoraciones[1]
Decoraciones[1]Decoraciones[1]
Decoraciones[1]
 
El verbo
El verboEl verbo
El verbo
 
La musica
La musicaLa musica
La musica
 
Ipad2
Ipad2Ipad2
Ipad2
 
Practica5
Practica5Practica5
Practica5
 
Proceso de votacion primarias 2013
Proceso de votacion primarias 2013Proceso de votacion primarias 2013
Proceso de votacion primarias 2013
 
97 2003.
97 2003.97 2003.
97 2003.
 
MARKETING
MARKETINGMARKETING
MARKETING
 
Informe 1111
Informe 1111Informe 1111
Informe 1111
 
Ejemplo la oracion
Ejemplo la oracionEjemplo la oracion
Ejemplo la oracion
 
Mediació
MediacióMediació
Mediació
 

Semelhante a Bioinformatics-2009-Moura-1096-8

LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSMSCW Mysore
 
A Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkA Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkLisa Muthukumar
 
Deep learning methods in metagenomics: a review
Deep learning methods in metagenomics: a reviewDeep learning methods in metagenomics: a review
Deep learning methods in metagenomics: a reviewssuser6fc73c
 
Group 3 presentation.pptx
Group 3 presentation.pptxGroup 3 presentation.pptx
Group 3 presentation.pptxRoyBisenti
 
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...Jonathan Eisen
 
Meta genomics talks
Meta genomics talksMeta genomics talks
Meta genomics talksAteeq Khaliq
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysisDr. Naveen Gaurav srivastava
 
EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18Jonathan Eisen
 
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONCOMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONcsandit
 
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...ijitcs
 
57.insilico studies of cellulase from Aspergillus terreus
57.insilico studies of cellulase from Aspergillus terreus57.insilico studies of cellulase from Aspergillus terreus
57.insilico studies of cellulase from Aspergillus terreusAnnadurai B
 

Semelhante a Bioinformatics-2009-Moura-1096-8 (20)

Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
 
A Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkA Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration Framework
 
Deep learning methods in metagenomics: a review
Deep learning methods in metagenomics: a reviewDeep learning methods in metagenomics: a review
Deep learning methods in metagenomics: a review
 
Group 3 presentation.pptx
Group 3 presentation.pptxGroup 3 presentation.pptx
Group 3 presentation.pptx
 
50120140504019 2
50120140504019 250120140504019 2
50120140504019 2
 
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
 
Meta genomics talks
Meta genomics talksMeta genomics talks
Meta genomics talks
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Major biological nucleotide databases
Major biological nucleotide databasesMajor biological nucleotide databases
Major biological nucleotide databases
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysis
 
String.pptx
String.pptxString.pptx
String.pptx
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
 
EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18
 
D1803012022
D1803012022D1803012022
D1803012022
 
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONCOMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
 
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
 
call for papers, research paper publishing, where to publish research paper, ...
call for papers, research paper publishing, where to publish research paper, ...call for papers, research paper publishing, where to publish research paper, ...
call for papers, research paper publishing, where to publish research paper, ...
 
Pattemore 2015
Pattemore 2015Pattemore 2015
Pattemore 2015
 
57.insilico studies of cellulase from Aspergillus terreus
57.insilico studies of cellulase from Aspergillus terreus57.insilico studies of cellulase from Aspergillus terreus
57.insilico studies of cellulase from Aspergillus terreus
 

Bioinformatics-2009-Moura-1096-8

  • 1. [18:00 30/3/2009 Bioinformatics-btp105.tex] Page: 1096 1096–1098 BIOINFORMATICS APPLICATIONS NOTE Vol. 25 no. 8 2009, pages 1096–1098 doi:10.1093/bioinformatics/btp105 Databases and ontologies INTEGRALL: a database and search engine for integrons, integrases and gene cassettes Alexandra Moura1,∗, Mário Soares2, Carolina Pereira3, Nuno Leitão2, Isabel Henriques1 and António Correia1 1CESAM & Department of Biology, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, 2Sindesi – Informatic Systems and Solutions Lda., Rua Manuel Barbuda e Vasconcelos, 62-2D, 3810-498 Aveiro and 3Department of Biology, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal Received on September 29, 2008; revised on February 16, 2009; accepted on February 17, 2009 Advance Access publication February 19, 2009 Associate Editor: Alex Bateman ABSTRACT Summary: INTEGRALL is a freely available, text-based search system developed with the aim of collecting and organizing infor- mation on integrons in a single database. The current release (1.2) contains more than 4800 integron sequences and provides a public genetic repository for sequence data and nomenclature, offering scientists an easy and interactive access to integron’s DNA sequences, their molecular arrangements as well as their genetic contexts. Availability: http://integrall.bio.ua.pt Contact: amoura@ua.pt 1 INTRODUCTION Prokaryotic genomes are characterized by a high degree of plasticity and an enormous evolutive potential (van Passel et al., 2008). One of the major forces contributing to genome evolution is the exogenous acquisition of genetic material, so-called horizontal gene transfer (HGT). HGT contributes to adaptation to different ecological niches in response to selective pressures generating novel gene combinations (Sørensen et al., 2005). HGT may overcome species boundaries and has been implicated in the dispersion of genes involved in complex cellular processes, including pathogenicity (Hentschel and Hacker, 2001). The high number of studies regarding mobile genetic elements (MGEs) has quickly generated a huge amount of information concerning these structures. As a consequence of this informative burst, two bottlenecks are limiting the easy access and analysis of molecular sequences representing mobile elements: confusing nomenclatures and absence of filtration of public databases to eliminate non-reliable data. To overcome that, databases dedicated to MGEs such as plasmids and phages (ACLAME; Leplae et al., 2004) and insertion sequences (ISfinder; Siguier et al., 2006) have been created. Two decades have passed since the discovery of integrons, but despite their importance and their extraordinary diversity of organization there no database dedicated to the deposition and annotation of molecular data concerning these structures. Integrons are mobilizable platforms that play a key role in the enhancement ∗To whom correspondence should be addressed. of genetic diversity in bacteria. They act as bacterial recombination systems that mediate the capture and expression of gene cassettes (Hall et al., 1999). Integrons are considered as the primary mechanism for antibiotic resistance gene acquisition among bacteria and are frequently associated with transposons and conjugative plasmids (Gillings et al., 2008). Integrons are composed of an integrase-encoding gene (intI), a recombination site (attI) and one or two promoters that control the expression of gene cassettes (Hall et al., 1999). Their genetic organization can be quite diverse, with different combinations of gene cassettes. In bacterial communities integrons also represent an important fraction of the mobile gene pool (Mazel, 2006). Furthermore, integrons are present in ∼9% of completely or partially sequenced genomes (Boucher et al., 2007). For a long time and until recently, research on integron diversity had been restricted to bacterial isolates obtained mainly from clinical sources. However, recent studies report the existence of natural environments as reservoirs of gene cassettes and sources of functionally diverse gene products often recovered by methods that access the non-cultivable bacterial fraction (Boucher et al., 2007). To overcome one of the difficulties produced by the disclosure of many new sequences, handling and managing data, we propose a database dedicated to integrons. The INTEGRALL web-based platform was developed by microbiologists in collaboration with computer scientists. INTEGRALL aims to be a repository for 20 years of data about integrons. We believe that this database is crucial to congregate information on the phylogeny of the bacterial hosts and their ecology, the molecular diversity of inserted gene cassettes and the type of integrases. The correlation between these data is essential to understand the role of integrons in bacterial adaptive responses and interactions. 2 THE SITE INTEGRALL was developed in PHP 5, using a MySQL 5.0 database backend. Implementation uses standard PC servers, with a Linux- based distribution and Apache 1.3 server running PHP as FastCGI. The current version of INTEGRALL (1.2) includes: (i) background information regarding integrons; (ii) a LIST tool; 1096 © The Author 2009. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org byguestonNovember23,2016http://bioinformatics.oxfordjournals.org/Downloadedfrom
  • 2. [18:00 30/3/2009 Bioinformatics-btp105.tex] Page: 1097 1096–1098 INTEGRALL database (iii) a SEARCH tool and BLAST interface; (iv) guidelines for nomenclature of gene cassettes; (v) a GLOSSARY of integron-associated genes; and (vi) a FORUM discussion on mobile genetic elements. The LIST tool provides a table of all integrase genes and/or gene cassette arrays deposited at NCBI GenBank (http://www.ncbi.nlm. nih.gov/Genbank). For every entry, information regarding identification of the bacterial host, type of integrase and gene cassettes arrays (if present) is provided. Other remarks may also be included in this section, such as integron type. Additional data and functions like information about the source of isolation or direct access to FASTA sequences together with bibliographic references are also available. When possible, users may also access information concerning the genetic context of integrons (as in the case of completely sequenced plasmids and genomes) by pushing the context button. Moreover, all entries have a direct link to the complete gene sequence and additional information deposited at NCBI GenBank. The SEARCH tool allows two modes of operation: simple or advanced search. Used in advanced mode, SEARCH is able to simultaneously combine two or more of the following criteria: accession number, journal, authors, organism and source of isolation, integrase gene or gene cassettes. Other features, such as other integron-associated genes and integron genomic location (plasmid and/or chromosome) may also be assessed. Sorting of query results is done by alphabetical order of organism by default, but users may also sort results by gene cassette array or intI genes. Results obtained by using the SEARCH tool can be exported to a separate file in FASTA format allowing data to be used in further analysis. The BLAST interface provides an alignment tool allowing the comparison of sequences submitted to the website against the INTEGRALL database (Altschul et al., 1997). NOMENCLATURE guidelines and GLOSSARY of integron- associated genes are also present and aim to facilitate and standardize gene identification, as multiple names are often attributed to identical genes (for instance, in what concerns gene cassettes coding for aminoglycoside- and trimethoprim-resistance proteins). The INTEGRALL platform also includes an external link to a discussion forum on integrons and other mobile genetic elements in order to promote communication and the exchange of ideas among scientists worldwide. To avoid duplicate submissions, all entries are linked to the NCBI GenBank through their accession number. Daily updates will be performed to include in INTEGRALL newly released sequences. 3 THE DATABASE To date, INTEGRALL comprises more than 4800 integron or integron-related nucleotide sequences and this number is continually increasing. Up to 70% of integron-related sequences correspond to uncultured bacteria and 27% were identified in the genomes of organisms belonging to the γ-Proteobacteria. The remaining 3% include a few bacterial representatives of α-, β-, δ- and ε-Proteobacteria, as well as Actinobacteria, Firmicutes, Cyanobacteria, Chlamydiae/Verrucomicrobia group, Bacteroidetes/Chlorobi group, Spirochaetes and Planctomycetes. Thus, integrons appear to have a broad-host range, despite their higher prevalence in γ-Proteobacteria, which might be related to a higher number of studies of integrons in clinical settings comparing to natural environments. Ecology of integron-carrying microorganisms, and thus environmental conditions that allow their dispersal, is also remarkably diverse: it includes wastewaters, river and marine waters, deep-sea vents, contaminated mine tailings, manure, soil and sediments, livestock and wild animals, food and clinical samples. Regarding the gene cassette pool, an important part of it is strongly linked to antibiotic resistance: 51% (3134 out of 6116) of gene cassettes reported encode antibiotic-resistance proteins. In parallel, 46.5% (2846 out of 6116) of gene cassettes, mostly recovered from genetic libraries constructed using culture-independent methods, potentially encode proteins with unknown functions. Studies are needed to elucidate these functions and whether the mobilization and transfer of these genes contribute to the fitness of bacterial strains in certain environments thus playing an important role in the adaptation and evolution of bacterial communities. 4 CONCLUSIONS Here, we describe a web-based platform dedicated to compile information on integrons and designed to organize all the data available for these genetic structures. This search engine is still a work in progress and scientists working with these structures are very welcome to contribute with their suggestions and data. Depending on the feedback more utilities will be added over time. ACKNOWLEDGEMENTS Authors wish to thank Ana Pamplona (Instituto de Medicina Molecular and Instituto Gulbenkian de Ciência, Portugal) for revision of the manuscript, Maria João Carvalho (University of Aveiro, Portugal) for all the assistance provided, António Nogueira (University of Aveiro, Portugal) for web-server hosting and Thomas Jové (University of Limoges, France) for all the valuable contributions made to the INTEGRALL database. Funding: A.M. and I.H. are funded by Fundação para a Ciência e a Tecnologia (SFRH/BD/19443/2004 and SFRH/BPD/21384/2005). Conflict of Interest: none declared. REFERENCES Altschul,S.F. et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res., 25, 3389–3402. Boucher,Y. et al. (2007) Integrons: mobilizable platforms that promote genetic diversity in bacteria. Trends Microbiol., 15, 301–309. Gillings,M. et al. (2008) The evolution of class 1 integrons and the rise of antibiotic resistance. J. Bacteriol., 190, 5095–5100. Hall,R.M. et al. (1999) Mobile gene cassettes and integrons in evolution. Ann. N. Y. Acad. Sci., 870, 68–80. Hentschel,U. and Hacker,J. (2001) Pathogenicity islands: the tip of the iceberg. Microbes Infect., 3, 545–548. Leplae,R. et al. (2004) ACLAME: a classification of mobile genetic elements. Nucleic Acids Res., 32, D45–D49. Mazel,D. (2006) Integrons: agents of bacterial evolution. Nat. Rev. Microbiol., 4, 608–620. 1097 byguestonNovember23,2016http://bioinformatics.oxfordjournals.org/Downloadedfrom
  • 3. [18:00 30/3/2009 Bioinformatics-btp105.tex] Page: 1098 1096–1098 A.Moura et al. Siguier,P. et al. (2006) ISfinder: the reference centre for bacterial insertion sequences. Nucleic Acids Res., 34, D32–D36. Sørensen,S. et al. (2005) Studying plasmid horizontal transfer in situ: a critical review. Nat. Rev. Microbiol., 3, 700–710. van Passel,M.W.J. et al. (2008) The emergence and fate of horizontally acquired genes in Escherichia coli. PLoS Comput. Biol., 4, e1000059. 1098 byguestonNovember23,2016http://bioinformatics.oxfordjournals.org/Downloadedfrom