SlideShare uma empresa Scribd logo
1 de 8
Baixar para ler offline
International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013
DOI: 10.5121/ijbb.2013.3204 37
Amino acid sequence based in silico analysis of β-
galactosidases
Ratnaboli Bose, Shikha Arora, Vivek Dhar Dwivedi & Amit Pandey
Forest Pathology Division, Forest Research Institute, Dehradun, India
vivek_bioinformatics@yahoo.com
Abstract
Amino acid sequences of β-galactosidase enzyme belonging to different families of bacteria, fungi and
plants retrieved from GenPept database were analyzed for multiple sequence alignment, cluster analysis,
conserved motif discovery and their Pfam analysis using different bioinformatics tools. The multiple
sequence alignment revealed different conserved residues of amino acids exclusively for each groups
except fungi. The cluster analysis for different groups uniformly showed three major clusters based on the
closeness of the β-galactosidase protein sequences irrespective of the source organisms. Seven conserved
motifs belonging to different families were assessed. These identified motifs showed the evolutionary
closeness among species at the molecular level.
Keywords
β -galactosidase, conserved motif, cluster analysis, residues
1. Introduction
β-galactosidases are hydrolase enzymes which are involved in the hydrolysis of β-galactosides
into monosaccharides. It is widely distributed enzyme among bacteria, fungi and plants.
Sequencing and analysis of amino acid sequences of β-galactosidases originates many ideas about
their structural and functional activity. In bacteria, the 1024 amino acids of E. coli β-galactosidase
were first sequenced [1] and its structure determined after twenty-four years [2]. The protein is a
464-kDa homotetramer. Each unit of β-galactosidase consists of five domains; domain 1 is a
jelly-roll type barrel, domain 2 and 4 are fibronectin type III-like barrels, domain 5 a β-sandwich,
while the central domain 3 is a TIM-type barrel. The third domain contains the active site [3]. In
fungi a genomic copy of the β-galactosidase gene of Hypocrea jecorina was cloned [4], and this
copy encodes a 1,023-amino-acid protein with a 20-amino-acid signal sequence. This protein has
a molecular mass of 109.3 kDa, belongs to glycosyl hydrolase family 35, and is the major
extracellular β-galactosidase during growth on lactose. In Plants the relationship between fruit
softening and beta-Gal during banana fruit ripening, a beta-Gal cDNA fragment, named MA-Gal,
has been cloned from banana fruit pulp using RT-PCR in this study. The results of sequence
analysis showed that MA-Gal contained 927 bp, encoding a polypeptide of 309 amino acids, the
deduced protein was highly homologous to plant beta-galactosidase expressed in fruit ripening.
The MA-Gal putative amino acids have five homologous domains [5]. In light of above, the study
of β-galactosidase amino acid sequences from various sources is very important. In the present
analysis, we performed the In-silico analysis including conserved motif assessment their family
identification, MSA, and cluster analysis of β-galactosidase amino acid sequences from bacteria,
fungi and plants.
International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013
38
2. Materials and methods
2.1 Sequence retrieval
The 30 full length amino acid sequences of β-galactosidase from bacteria, fungi and plants were
retrieved from GenPept database (http://www.ncbi.nlm.nih.gov/protein). The sequences were
arranged in bacterial, fungal and plant profiles, respectively [6, 7, 8, 9]
2.2 Multiple sequence alignment
The multiple sequence alignment of the individual profiles was performed using MUSCLE at the
European Bioinformatics Institute [10].
2.3 Conserved Motif identification
Motifs were identified in profiles using the expectation maximization approach implemented in
Multiple EM for Motif Elicitation server [11].
2.4 Conserved Motif family identification
Motif families were identified by sequence searching in Pfam database [12]
2.5 Cluster analysis
The UPGMA approach implemented in the Mega program was employed for constructing
phylogenetic relationships among sequences [13]
3. Results
3.1 Sequence retrieval
All the sequences belonging to different families of bacteria, fungi and plants were searched and
retrieved from NCBI protein database (GenPept) and listed in Table 1 along with their accession
number, species name, family and origin.
3.2 Multiple sequence alignment
MSA showed the presence of some conserved residues in all the sequences from different
sources, while others were restricted only to their groups [14]. Four tryptophan, four
phenylalanine, three tyrosine, two proline, two alanine, one glycine, one aspartic acid, one
isoleucine and one glutamic acid were found to be identically conserved residues in all analysed
species of plant. One proline and one glycine were found to be identically conserved residues in
all analyzed sequences of bacteria while no residue was found to be conserved in fungal profile.
3.3 Conserved motif identification
Seven conserved motifs were identified after the analysis of bacterial, fungal and plant profiles
separately. Three conserved motifs were observed in bacterial profile, three in plant profile
whereas a single conserved motif was identified in fungal profile (Table. 2).
3.4 Conserved motif family identification
The seven identified conserved motifs were applied for their family identification in Pfam data
base using sequence search option. First two conserved motifs identified in bacterial profile
belonged to Glyco hydro 42 domain family while the Pfam entry of third bacterial conserved
motif was not found. All the three conserved motifs identified in plant profile belonged to Glyco
International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013
39
hydro 35 domain family while a single conserved motif identified in fungal profile belonged to
Beta Gal dom2 domain family (Table. 2).
3.5 Cluster analysis
3.5.1. Cluster analysis of bacterial profile
Cluster analysis of bacteria showed two major clusters as shown in Figure 1. Cluster A consisted
of six species which was further divided into two sub-clusters. Sub-cluster A contains three
species (Thermus thermophilus, Meiothermus ruber and Streptomyces flavogriseus). Sub-cluster
B contains two species (Bacteroides salanitronis and Bacteroides ovatus). Niastella koreensis
was found to be distantly related and therefore outgrouped from both sub-clusters. Cluster B
consisted of two species namely Xanthomonas axonopodis, and Streptomyces coelicolor.
Frateuria aurantia and Niabella soli were outgrouped from both clusters.
Figure 1. Phylogenetic tree of bacterial profile using UPGMA method
3.5.2. Cluster analysis of fungal profile
Cluster analysis of fungi showed a single major cluster as shown in Figure 2. This cluster
consisted of seven species which was further divided into two sub-clusters. Sub-cluster A
contains five species (Metarhizium anisopliae, Metarhizium acridum, Penicillium decumbens,
Beauveria bassiana and Aspergillus kawachii). Sub-cluster B contains two species (Verticillium
dahlia and Verticillium albo-atrum). Colletotrichum orbiculare, Cordyceps militaris and
Colletotrichum higginsianum were found to be outgrouped from both sub-clusters and therefore
these were distantly related.
3.5.3. Cluster analysis of plant profile
Cluster analysis of plant showed two major clusters as shown in Figure 3. Cluster A
consisted of eight species which was further divided into two sub-clusters. Sub-cluster A
contains three species (Prunus salicina, Pyrus communis and Cicer arietinum). Sub-
cluster B contains two species (Solanum lycopersicum and Capsicum annuum). Oryza
sativa, Brassica oleracea, Medicago truncatula were found to be distantly related and
therefore outgrouped from both sub-clusters. Cluster B consisted of two species namely
Arabidopsis thaliana and Aegilops tauschii.
International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013
40
Figure 2.Phylogenetic tree of fungal profile using UPGMA method
Figure 3.Phylogenetic tree of plant profile using UPGMA method
3.5.4. Cluster analysis of joint bacterial, fungal and plant profile
Three major clusters were obtained by Cluster analysis of joint bacterial, fungal and plant profile
(Figure 4). Cluster A consisted of seventeen species which were further divided into two
subclusters. Subcluster A contained eight species of plants, and one species of bacteria.
Subcluster B consisted of seven species of fungi and one species of bacteria. Cluster B consisted
of six species of bacteria. One species of bacteria was outgrouped from Cluster B. Cluster C
consisted of two species of plant and two species of fungi. One bacterial species and one fungal
species were outgrouped from all three clusters.
International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013
41
Figure4. Phylogenetic tree of joint profile of bacteria, fungi and plants using UPGMA method
International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013
42
Table. 1 Retrieved sequences, source, species name, family and their accession number
Serial
no.
Source Name of Organisms Family Accession no.
1. Bacteria Bacteroides salanitronis Bacteroidaceae ADY37532.1
2. Bacteria Bacteroides ovatus Bacteroidaceae ZP_06725189
3. Bacteria Xanthomonas axonopodis Xanthomonadaceae AGH78562.1
4. Bacteria Frateuria aurantia Xanthomonadaceae YP_005377482.1.
5. Bacteria Niastella koreensis Chitinophagaceae YP_005008117.1
6. Bacteria Niabella soli Chitinophagaceae ZP_09632360.1
7. Bacteria Streptomyces coelicolor Streptomycetaceae NP_733571.1
8. Bacteria Streptomyces flavogriseus Streptomycetaceae ADW06353.1
9. Bacteria Thermus thermophilus Thermaceae ABI35985.1
10. Fungi Metarhizium anisopliae Clavicipitaceae EFZ03727.1
11. Fungi Metarhizium acridum Clavicipitaceae EFY85580.1
12. Fungi Colletotrichum orbiculare Glomerellaceae ENH80113.1
13. Fungi Penicillium decumbens Trichocomaceae AFR36805.1
14. Fungi Aspergillus kawachii Trichocomaceae GAA90667.1
15. Fungi Cordyceps militaris Cordycipitaceae EGX94612.1
16. Fungi Beauveria bassiana Cordycipitaceae EJP64431.
17. Fungi Verticillium dahlia Plectosphaerellaceae EGY23296.1
18. Fungi Verticillium albo-atrum Plectosphaerellaceae EEY14998.1
19. Fungi Colletotrichum orbiculare Glomerellaceae ENH80113.1
20. Fungi Colletotrichum
higginsianum
Glomerellaceae CCF38689.1
21. Plants Brassica oleracea Brassicaceae CAA59162.1
22. Plants Arabidopsis thaliana Brassicaceae AEE79231.1
23. Plants Oryza sativa Poaceae AAM34271.1
24. Plants Aegilops tauschii Poaceae EMT17876.1
International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013
43
25. Plants Solanum lycopersicum Fabaceae AAC25984.1
26. Plants Capsicum annuum Solanaceae BAC10578.2
27. Plants Cicer arietinum Fabaceae CAA06309.1
28. Plants Medicago truncatula Fabaceae AET04927.1
29. Plants Prunus salicina Rosaceae ABY71826.1
30. Plants Pyrus communis Rosaceae CAH18936.1
Table.2 Motifs identified using MEME program and their Pfam analysis using Pfam database
Serial no Motif Width Present in
number of
sequences
Family Source
1. EFAWNQLEPEPGKYDFSWLD 20 10 Glyco
hydro 42
Bacteria
2. YGNHPAVIMWQIDNE 15 10 Glyco
hydro 42
Bacteria
3. EQWKEDLKKMREMG 14 10 Pfam
entry not
found
Bacteria
4. GLDVIQTYVFWNGHEPSPGKY 21 10 Glyco
hydro 35
Plant
5. LYVNLRIGPYVCAEWNFGGFP 21 10 Glyco
hydro 35
Plant
6. INGQRRILISGSIHYPRSTPQ 21 10 Glyco
hydro 35
Plant
7. RDSKIHVTDYPVGDHTLLYSTAEIFTWKK 29 10 Beta Gal
dom2
Fungi
4. Conclusions
Identification of conserved regions in a profile of protein sequences determines common
ancestry combined with conservative evolutionary pressure to maintain important
residues at functionally important parts of the protein. MSA revealed the presence of
some conserved residues in plant and bacterial profile separately while no residue was
found to be conserved in fungal profile. This suggests that the analyzed sequences of
fungi showed high variability when compared to bacteria and plants. Seven conserved
motifs belonging to different families were identified. Three major sequence clusters
were obtained by cluster analysis of all retrieved sequences from different sources
indicating the evolutionary history of β-galactosidases.
International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013
44
References
1. Fowler A.V., & Zabin I. (1977). The amino acid sequence of beta-galactosidase of Escherichia
coli. Proceedings of the National Academy of Sciences, 74(4), 1507-1510.
2. Jacobson R.H., Zhang X. J., Dubose R. F., Matthews B. W. (1994). Three-dimensional structure
of β-galactosidase from E. Coli. Nature 369 (6483): 761–766
3. Matthews B.W. (June 2005). The structure of E. coli beta-galactosidase. C. R. Biol. 328 (6): 549–
56.
4. Seiboth B., Hartl L., Salovuori N., Lanthaler K., Robson G.D., Vehmaanpera, J., & Kubicek C. P.
(2005). Role of the bga1-encoded extracellular β-galactosidase of Hypocrea jecorina in cellulase
induction by lactose. Applied and environmental microbiology, 71(2), 851-857.
5. Zhuang J.P., Su J., Li X.P., & Chen W.X. (2006). Cloning and expression analysis of beta-
galactosidase gene related to softening of banana (Musa sp.) fruit. Zhi wu sheng li yu fen zi sheng
wu xue xue bao= Journal of plant physiology and molecular biology, 32(4), 411.
6. Dwivedi V.D., Arora S., Kumar A. and Mishra S.K. (2013). Computational analysis of xanthine
dehydrogenase enzyme from different source organisms, Network Modeling Analysis in Health
Informatics and Bioinformatics, DOI : 10.1007/s13721-013-0029- 7.
7. Dhar D. V., Tanuj S., Amit P., & Kumar M. S. (2012). INSIGHTS TO SEQUENCE
INFORMATION OF ALPHA AMYLASE ENZYME FROM DIFFERENT SOURCE
ORGANISMS. International Journal of Advanced Biotechnology and Bioinformatics, 1(1), 87-91.
8. Dhar D. V., Tanuj S., Kumar M. S., & Kumar P. A. (2012). Insights to Sequence Information of
Lactoylglutathione Lyase Enzyme from Different Source Organisms. I. Res. J. Biological Sci.,
1(6), 38-42.
9. Yadav .SK., Dubey A.K., Yadav S., Bisht D., Darmwal N.S., Yadav D., Amino acid sequences
based phylogenetic and motif assessment of lipases from different organisms, Online J Bioinform.,
13(3):400-417, 2012.
10. Edgar R.C., (2004). MUSCLE: multiple sequence alignment with high accuracy and high
throughput, Nucleic Acids Res., 19: 32(5), 1792-7.
11. Bailey T.L., Elkan C., (1995). Unsupervised learning of multiple motifs in biopolymers using
expectation maximization, Mach Learn 21 (51), 80-33.
12. Punta M., Coggill P.C., Eberhardt R.Y., Mistry J., Tate J., Boursnell C., Pang N., Forslund K.,
Ceric G., Clements J., Heger A., Holm L., Sonnhammer E.L.L., Eddy S.R., Bateman A., and Finn
R.D. The Pfam Protein Families Database, Nucleic Acids Research Database (2012).
13. Kumar S., Dudley J., Nei M, and Tamura K. (2008). MEGA: a biologist-centric software for
evolutionary analysis of DNA and protein sequences, Briefings in Bioinformatics, 9, 299-306.
14. Malviya N., Srivastava M., Diwakar S. K.. and Mishra S. K. (2011). Insights to sequence
information of polyphenol oxidase enzyme from different source organisms,” Applied
Biochemistry and Biotechnology, 165: 397–405

Mais conteúdo relacionado

Mais procurados

Jocelyn Miller FINAL
Jocelyn Miller FINALJocelyn Miller FINAL
Jocelyn Miller FINAL
Jawwad Mirza
 
Katia roriguez y_karla_santos_29nov2011
Katia roriguez y_karla_santos_29nov2011Katia roriguez y_karla_santos_29nov2011
Katia roriguez y_karla_santos_29nov2011
Programa_BRIC
 
Cronje, 2012, Planta article - Copy
Cronje, 2012, Planta article - CopyCronje, 2012, Planta article - Copy
Cronje, 2012, Planta article - Copy
Christelle Sherman
 
In vitro experiments of prokaryotic and eukaryotic antimicrobial peptide cyto...
In vitro experiments of prokaryotic and eukaryotic antimicrobial peptide cyto...In vitro experiments of prokaryotic and eukaryotic antimicrobial peptide cyto...
In vitro experiments of prokaryotic and eukaryotic antimicrobial peptide cyto...
AI Publications
 
Polyalkoxybenzenes JMC2011
Polyalkoxybenzenes JMC2011Polyalkoxybenzenes JMC2011
Polyalkoxybenzenes JMC2011
Alex Kiselyov
 
Collaborative science to identify novel inhibitors for the pseudokinase TRIB2
Collaborative science to identify novel inhibitors for the pseudokinase TRIB2Collaborative science to identify novel inhibitors for the pseudokinase TRIB2
Collaborative science to identify novel inhibitors for the pseudokinase TRIB2
Morgan Focas
 
Genetic Variability for Antioxidant Activity and Total Phenolic Content in Fo...
Genetic Variability for Antioxidant Activity and Total Phenolic Content in Fo...Genetic Variability for Antioxidant Activity and Total Phenolic Content in Fo...
Genetic Variability for Antioxidant Activity and Total Phenolic Content in Fo...
CrimsonpublishersNTNF
 

Mais procurados (20)

Metabolic engineering ppt
Metabolic engineering pptMetabolic engineering ppt
Metabolic engineering ppt
 
Biotech lett
Biotech lettBiotech lett
Biotech lett
 
Synthetic microbial communities : Microbial consortia engineering
Synthetic microbial communities : Microbial consortia engineeringSynthetic microbial communities : Microbial consortia engineering
Synthetic microbial communities : Microbial consortia engineering
 
Study of Chlorophyll Mutations and Chlorophyll Content in Young Oil Palm (Ela...
Study of Chlorophyll Mutations and Chlorophyll Content in Young Oil Palm (Ela...Study of Chlorophyll Mutations and Chlorophyll Content in Young Oil Palm (Ela...
Study of Chlorophyll Mutations and Chlorophyll Content in Young Oil Palm (Ela...
 
bryA publication
bryA publicationbryA publication
bryA publication
 
Engineering of secondary metabolism
Engineering of secondary metabolismEngineering of secondary metabolism
Engineering of secondary metabolism
 
HPTLC determination of carotenoid profile in the leaf and bark samples of lor...
HPTLC determination of carotenoid profile in the leaf and bark samples of lor...HPTLC determination of carotenoid profile in the leaf and bark samples of lor...
HPTLC determination of carotenoid profile in the leaf and bark samples of lor...
 
Aijrfans14 257
Aijrfans14 257Aijrfans14 257
Aijrfans14 257
 
Sortase enzyme family
Sortase enzyme familySortase enzyme family
Sortase enzyme family
 
Jocelyn Miller FINAL
Jocelyn Miller FINALJocelyn Miller FINAL
Jocelyn Miller FINAL
 
Katia roriguez y_karla_santos_29nov2011
Katia roriguez y_karla_santos_29nov2011Katia roriguez y_karla_santos_29nov2011
Katia roriguez y_karla_santos_29nov2011
 
Cronje, 2012, Planta article - Copy
Cronje, 2012, Planta article - CopyCronje, 2012, Planta article - Copy
Cronje, 2012, Planta article - Copy
 
In vitro experiments of prokaryotic and eukaryotic antimicrobial peptide cyto...
In vitro experiments of prokaryotic and eukaryotic antimicrobial peptide cyto...In vitro experiments of prokaryotic and eukaryotic antimicrobial peptide cyto...
In vitro experiments of prokaryotic and eukaryotic antimicrobial peptide cyto...
 
Polyalkoxybenzenes JMC2011
Polyalkoxybenzenes JMC2011Polyalkoxybenzenes JMC2011
Polyalkoxybenzenes JMC2011
 
Prodigiosin induce apoptosis
Prodigiosin induce apoptosisProdigiosin induce apoptosis
Prodigiosin induce apoptosis
 
21 kebere bezaweletaw 207-217
21 kebere bezaweletaw 207-21721 kebere bezaweletaw 207-217
21 kebere bezaweletaw 207-217
 
Collaborative science to identify novel inhibitors for the pseudokinase TRIB2
Collaborative science to identify novel inhibitors for the pseudokinase TRIB2Collaborative science to identify novel inhibitors for the pseudokinase TRIB2
Collaborative science to identify novel inhibitors for the pseudokinase TRIB2
 
PROTEOMICS INTRODUCTION AND TECHNIQUES
PROTEOMICS  INTRODUCTION AND TECHNIQUESPROTEOMICS  INTRODUCTION AND TECHNIQUES
PROTEOMICS INTRODUCTION AND TECHNIQUES
 
strain improvement and preservation
strain improvement and preservationstrain improvement and preservation
strain improvement and preservation
 
Genetic Variability for Antioxidant Activity and Total Phenolic Content in Fo...
Genetic Variability for Antioxidant Activity and Total Phenolic Content in Fo...Genetic Variability for Antioxidant Activity and Total Phenolic Content in Fo...
Genetic Variability for Antioxidant Activity and Total Phenolic Content in Fo...
 

Destaque

Phylogenetic Analysis of the Potential Microorganism for Remediation of Heavy...
Phylogenetic Analysis of the Potential Microorganism for Remediation of Heavy...Phylogenetic Analysis of the Potential Microorganism for Remediation of Heavy...
Phylogenetic Analysis of the Potential Microorganism for Remediation of Heavy...
CSCJournals
 
Presentation in English
Presentation in EnglishPresentation in English
Presentation in English
Sarita17
 
Gene expression mining for predicting survivability of patients in earlystage...
Gene expression mining for predicting survivability of patients in earlystage...Gene expression mining for predicting survivability of patients in earlystage...
Gene expression mining for predicting survivability of patients in earlystage...
ijbbjournal
 
Application of three graph Laplacian based semisupervised learning methods to...
Application of three graph Laplacian based semisupervised learning methods to...Application of three graph Laplacian based semisupervised learning methods to...
Application of three graph Laplacian based semisupervised learning methods to...
ijbbjournal
 
Class of 84 on 13th april 700pm at sangeet natak academy, gomti nagar lucknow
Class of 84 on 13th april 700pm at sangeet natak academy, gomti nagar lucknowClass of 84 on 13th april 700pm at sangeet natak academy, gomti nagar lucknow
Class of 84 on 13th april 700pm at sangeet natak academy, gomti nagar lucknow
Theatrefest Repertwahr
 
Novel adaptive filter (naf) for impulse noise suppression from digital images
Novel adaptive filter (naf) for impulse noise suppression from digital imagesNovel adaptive filter (naf) for impulse noise suppression from digital images
Novel adaptive filter (naf) for impulse noise suppression from digital images
ijbbjournal
 
Bacterial Identification by 16s rRNA Sequencing.ppt
Bacterial Identification by 16s rRNA Sequencing.pptBacterial Identification by 16s rRNA Sequencing.ppt
Bacterial Identification by 16s rRNA Sequencing.ppt
Rakesh Kumar
 

Destaque (20)

Phylogenetic Analysis of the Potential Microorganism for Remediation of Heavy...
Phylogenetic Analysis of the Potential Microorganism for Remediation of Heavy...Phylogenetic Analysis of the Potential Microorganism for Remediation of Heavy...
Phylogenetic Analysis of the Potential Microorganism for Remediation of Heavy...
 
DNA IMAGE SEGMENTATION – A COMPARISON OF EDGE DETECTION TECHNIQUES
DNA IMAGE SEGMENTATION – A COMPARISON OF EDGE DETECTION TECHNIQUESDNA IMAGE SEGMENTATION – A COMPARISON OF EDGE DETECTION TECHNIQUES
DNA IMAGE SEGMENTATION – A COMPARISON OF EDGE DETECTION TECHNIQUES
 
A comparative in silico study finds a functional co-relation between human hs...
A comparative in silico study finds a functional co-relation between human hs...A comparative in silico study finds a functional co-relation between human hs...
A comparative in silico study finds a functional co-relation between human hs...
 
Presentation in English
Presentation in EnglishPresentation in English
Presentation in English
 
Gene expression mining for predicting survivability of patients in earlystage...
Gene expression mining for predicting survivability of patients in earlystage...Gene expression mining for predicting survivability of patients in earlystage...
Gene expression mining for predicting survivability of patients in earlystage...
 
A SIMPLE APPROACH FOR RELATIVELY AUTOMATED HIPPOCAMPUS SEGMENTATION FROM SAGI...
A SIMPLE APPROACH FOR RELATIVELY AUTOMATED HIPPOCAMPUS SEGMENTATION FROM SAGI...A SIMPLE APPROACH FOR RELATIVELY AUTOMATED HIPPOCAMPUS SEGMENTATION FROM SAGI...
A SIMPLE APPROACH FOR RELATIVELY AUTOMATED HIPPOCAMPUS SEGMENTATION FROM SAGI...
 
Application of three graph Laplacian based semisupervised learning methods to...
Application of three graph Laplacian based semisupervised learning methods to...Application of three graph Laplacian based semisupervised learning methods to...
Application of three graph Laplacian based semisupervised learning methods to...
 
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
 
Anti-Synchronizing Backstepping Control Design for Arneodo Chaotic System
Anti-Synchronizing Backstepping Control Design for Arneodo Chaotic SystemAnti-Synchronizing Backstepping Control Design for Arneodo Chaotic System
Anti-Synchronizing Backstepping Control Design for Arneodo Chaotic System
 
Study of vitellogenin motif
Study of vitellogenin motifStudy of vitellogenin motif
Study of vitellogenin motif
 
Class of 84 on 13th april 700pm at sangeet natak academy, gomti nagar lucknow
Class of 84 on 13th april 700pm at sangeet natak academy, gomti nagar lucknowClass of 84 on 13th april 700pm at sangeet natak academy, gomti nagar lucknow
Class of 84 on 13th april 700pm at sangeet natak academy, gomti nagar lucknow
 
2015 Princeton Quant Trading Conference, Future of Finance: Cyber Finance
2015 Princeton Quant Trading Conference, Future of Finance: Cyber Finance2015 Princeton Quant Trading Conference, Future of Finance: Cyber Finance
2015 Princeton Quant Trading Conference, Future of Finance: Cyber Finance
 
Novel adaptive filter (naf) for impulse noise suppression from digital images
Novel adaptive filter (naf) for impulse noise suppression from digital imagesNovel adaptive filter (naf) for impulse noise suppression from digital images
Novel adaptive filter (naf) for impulse noise suppression from digital images
 
Biometric System Penetration in Resource Constrained Mobile Device
Biometric System Penetration in Resource Constrained Mobile DeviceBiometric System Penetration in Resource Constrained Mobile Device
Biometric System Penetration in Resource Constrained Mobile Device
 
Approaches to analysing 1000s of bacterial isolates - ICEID 2015 Atlanta, USA...
Approaches to analysing 1000s of bacterial isolates - ICEID 2015 Atlanta, USA...Approaches to analysing 1000s of bacterial isolates - ICEID 2015 Atlanta, USA...
Approaches to analysing 1000s of bacterial isolates - ICEID 2015 Atlanta, USA...
 
Community Detection in Networks Using Page Rank Vectors
Community Detection in Networks Using Page Rank Vectors Community Detection in Networks Using Page Rank Vectors
Community Detection in Networks Using Page Rank Vectors
 
Community Detection in Networks Using Page Rank Vectors
Community Detection in Networks Using Page Rank Vectors Community Detection in Networks Using Page Rank Vectors
Community Detection in Networks Using Page Rank Vectors
 
Bacterial Identification by 16s rRNA Sequencing.ppt
Bacterial Identification by 16s rRNA Sequencing.pptBacterial Identification by 16s rRNA Sequencing.ppt
Bacterial Identification by 16s rRNA Sequencing.ppt
 
identification of bacteria
identification of bacteriaidentification of bacteria
identification of bacteria
 
SURVEY ON MODELLING METHODS APPLICABLE TO GENE REGULATORY NETWORK
SURVEY ON MODELLING METHODS APPLICABLE TO GENE REGULATORY NETWORKSURVEY ON MODELLING METHODS APPLICABLE TO GENE REGULATORY NETWORK
SURVEY ON MODELLING METHODS APPLICABLE TO GENE REGULATORY NETWORK
 

Semelhante a Amino acid sequence based in silico analysis of β- galactosidases

Cloning and Functional Analysis of TCAP3 Gene in Taxus Chinensis var. mairei
Cloning and Functional Analysis of TCAP3 Gene in Taxus Chinensis var. maireiCloning and Functional Analysis of TCAP3 Gene in Taxus Chinensis var. mairei
Cloning and Functional Analysis of TCAP3 Gene in Taxus Chinensis var. mairei
Agriculture Journal IJOEAR
 
Jmb022 12-19 fdoc-1
Jmb022 12-19 fdoc-1Jmb022 12-19 fdoc-1
Jmb022 12-19 fdoc-1
duarfelipe
 
Isolation and characterization of plant growth promoting bacteria (PGPB) from...
Isolation and characterization of plant growth promoting bacteria (PGPB) from...Isolation and characterization of plant growth promoting bacteria (PGPB) from...
Isolation and characterization of plant growth promoting bacteria (PGPB) from...
Agriculture Journal IJOEAR
 
10.11648.j.ajbio.s.2015030201.15
10.11648.j.ajbio.s.2015030201.1510.11648.j.ajbio.s.2015030201.15
10.11648.j.ajbio.s.2015030201.15
nazir uddin
 
THEME – 4 Genomic diversity of domestication in soybean
THEME – 4 Genomic diversity of domestication in soybeanTHEME – 4 Genomic diversity of domestication in soybean
THEME – 4 Genomic diversity of domestication in soybean
ICARDA
 
Proteomic analysis of the interaction between the plant growth promoting fhiz...
Proteomic analysis of the interaction between the plant growth promoting fhiz...Proteomic analysis of the interaction between the plant growth promoting fhiz...
Proteomic analysis of the interaction between the plant growth promoting fhiz...
kys9723331
 
MOLECULAR ANALYSIS OF BACTERIAL GENE CODING CHITINASE ENZYMES, FAMILY 19 STR...
MOLECULAR ANALYSIS OF BACTERIAL GENE CODING CHITINASE  ENZYMES, FAMILY 19 STR...MOLECULAR ANALYSIS OF BACTERIAL GENE CODING CHITINASE  ENZYMES, FAMILY 19 STR...
MOLECULAR ANALYSIS OF BACTERIAL GENE CODING CHITINASE ENZYMES, FAMILY 19 STR...
IJSIT Editor
 
2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM
2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM
2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM
Monica Pava-Ripoll
 

Semelhante a Amino acid sequence based in silico analysis of β- galactosidases (20)

Evaluation of seed storage proteins in common bean by some biplot analysis
Evaluation of seed storage proteins in common bean by some biplot analysisEvaluation of seed storage proteins in common bean by some biplot analysis
Evaluation of seed storage proteins in common bean by some biplot analysis
 
1471 2148-6-99
1471 2148-6-991471 2148-6-99
1471 2148-6-99
 
Cloning and Functional Analysis of TCAP3 Gene in Taxus Chinensis var. mairei
Cloning and Functional Analysis of TCAP3 Gene in Taxus Chinensis var. maireiCloning and Functional Analysis of TCAP3 Gene in Taxus Chinensis var. mairei
Cloning and Functional Analysis of TCAP3 Gene in Taxus Chinensis var. mairei
 
Bioactive chemical analysis of enterobacter aerogenes and test of its anti fu...
Bioactive chemical analysis of enterobacter aerogenes and test of its anti fu...Bioactive chemical analysis of enterobacter aerogenes and test of its anti fu...
Bioactive chemical analysis of enterobacter aerogenes and test of its anti fu...
 
د. رافد هادي حميد
د. رافد هادي حميدد. رافد هادي حميد
د. رافد هادي حميد
 
Glycerol-3-phosphate Acyltransferase (GPAT) genes of Eugenia uniflora L. and ...
Glycerol-3-phosphate Acyltransferase (GPAT) genes of Eugenia uniflora L. and ...Glycerol-3-phosphate Acyltransferase (GPAT) genes of Eugenia uniflora L. and ...
Glycerol-3-phosphate Acyltransferase (GPAT) genes of Eugenia uniflora L. and ...
 
Jmb022 12-19 fdoc-1
Jmb022 12-19 fdoc-1Jmb022 12-19 fdoc-1
Jmb022 12-19 fdoc-1
 
Ethical and bio-safety issues related to GM crops
Ethical and bio-safety issues related to GM cropsEthical and bio-safety issues related to GM crops
Ethical and bio-safety issues related to GM crops
 
Isolation and characterization of plant growth promoting bacteria (PGPB) from...
Isolation and characterization of plant growth promoting bacteria (PGPB) from...Isolation and characterization of plant growth promoting bacteria (PGPB) from...
Isolation and characterization of plant growth promoting bacteria (PGPB) from...
 
10.11648.j.ajbio.s.2015030201.15
10.11648.j.ajbio.s.2015030201.1510.11648.j.ajbio.s.2015030201.15
10.11648.j.ajbio.s.2015030201.15
 
Aia review (1)
Aia review (1)Aia review (1)
Aia review (1)
 
MIB200A at UCDavis Module: Microbial Phylogeny; Class 3
MIB200A at UCDavis Module: Microbial Phylogeny; Class 3MIB200A at UCDavis Module: Microbial Phylogeny; Class 3
MIB200A at UCDavis Module: Microbial Phylogeny; Class 3
 
IOSR Journal of Pharmacy (IOSRPHR)
IOSR Journal of Pharmacy (IOSRPHR)IOSR Journal of Pharmacy (IOSRPHR)
IOSR Journal of Pharmacy (IOSRPHR)
 
THEME – 4 Genomic diversity of domestication in soybean
THEME – 4 Genomic diversity of domestication in soybeanTHEME – 4 Genomic diversity of domestication in soybean
THEME – 4 Genomic diversity of domestication in soybean
 
Proteomic analysis of the interaction between the plant growth promoting fhiz...
Proteomic analysis of the interaction between the plant growth promoting fhiz...Proteomic analysis of the interaction between the plant growth promoting fhiz...
Proteomic analysis of the interaction between the plant growth promoting fhiz...
 
85202026
8520202685202026
85202026
 
MOLECULAR ANALYSIS OF BACTERIAL GENE CODING CHITINASE ENZYMES, FAMILY 19 STR...
MOLECULAR ANALYSIS OF BACTERIAL GENE CODING CHITINASE  ENZYMES, FAMILY 19 STR...MOLECULAR ANALYSIS OF BACTERIAL GENE CODING CHITINASE  ENZYMES, FAMILY 19 STR...
MOLECULAR ANALYSIS OF BACTERIAL GENE CODING CHITINASE ENZYMES, FAMILY 19 STR...
 
EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18
 
2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM
2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM
2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM
 
DRB 2013 Nutrigenomics
DRB 2013 NutrigenomicsDRB 2013 Nutrigenomics
DRB 2013 Nutrigenomics
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Amino acid sequence based in silico analysis of β- galactosidases

  • 1. International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013 DOI: 10.5121/ijbb.2013.3204 37 Amino acid sequence based in silico analysis of β- galactosidases Ratnaboli Bose, Shikha Arora, Vivek Dhar Dwivedi & Amit Pandey Forest Pathology Division, Forest Research Institute, Dehradun, India vivek_bioinformatics@yahoo.com Abstract Amino acid sequences of β-galactosidase enzyme belonging to different families of bacteria, fungi and plants retrieved from GenPept database were analyzed for multiple sequence alignment, cluster analysis, conserved motif discovery and their Pfam analysis using different bioinformatics tools. The multiple sequence alignment revealed different conserved residues of amino acids exclusively for each groups except fungi. The cluster analysis for different groups uniformly showed three major clusters based on the closeness of the β-galactosidase protein sequences irrespective of the source organisms. Seven conserved motifs belonging to different families were assessed. These identified motifs showed the evolutionary closeness among species at the molecular level. Keywords β -galactosidase, conserved motif, cluster analysis, residues 1. Introduction β-galactosidases are hydrolase enzymes which are involved in the hydrolysis of β-galactosides into monosaccharides. It is widely distributed enzyme among bacteria, fungi and plants. Sequencing and analysis of amino acid sequences of β-galactosidases originates many ideas about their structural and functional activity. In bacteria, the 1024 amino acids of E. coli β-galactosidase were first sequenced [1] and its structure determined after twenty-four years [2]. The protein is a 464-kDa homotetramer. Each unit of β-galactosidase consists of five domains; domain 1 is a jelly-roll type barrel, domain 2 and 4 are fibronectin type III-like barrels, domain 5 a β-sandwich, while the central domain 3 is a TIM-type barrel. The third domain contains the active site [3]. In fungi a genomic copy of the β-galactosidase gene of Hypocrea jecorina was cloned [4], and this copy encodes a 1,023-amino-acid protein with a 20-amino-acid signal sequence. This protein has a molecular mass of 109.3 kDa, belongs to glycosyl hydrolase family 35, and is the major extracellular β-galactosidase during growth on lactose. In Plants the relationship between fruit softening and beta-Gal during banana fruit ripening, a beta-Gal cDNA fragment, named MA-Gal, has been cloned from banana fruit pulp using RT-PCR in this study. The results of sequence analysis showed that MA-Gal contained 927 bp, encoding a polypeptide of 309 amino acids, the deduced protein was highly homologous to plant beta-galactosidase expressed in fruit ripening. The MA-Gal putative amino acids have five homologous domains [5]. In light of above, the study of β-galactosidase amino acid sequences from various sources is very important. In the present analysis, we performed the In-silico analysis including conserved motif assessment their family identification, MSA, and cluster analysis of β-galactosidase amino acid sequences from bacteria, fungi and plants.
  • 2. International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013 38 2. Materials and methods 2.1 Sequence retrieval The 30 full length amino acid sequences of β-galactosidase from bacteria, fungi and plants were retrieved from GenPept database (http://www.ncbi.nlm.nih.gov/protein). The sequences were arranged in bacterial, fungal and plant profiles, respectively [6, 7, 8, 9] 2.2 Multiple sequence alignment The multiple sequence alignment of the individual profiles was performed using MUSCLE at the European Bioinformatics Institute [10]. 2.3 Conserved Motif identification Motifs were identified in profiles using the expectation maximization approach implemented in Multiple EM for Motif Elicitation server [11]. 2.4 Conserved Motif family identification Motif families were identified by sequence searching in Pfam database [12] 2.5 Cluster analysis The UPGMA approach implemented in the Mega program was employed for constructing phylogenetic relationships among sequences [13] 3. Results 3.1 Sequence retrieval All the sequences belonging to different families of bacteria, fungi and plants were searched and retrieved from NCBI protein database (GenPept) and listed in Table 1 along with their accession number, species name, family and origin. 3.2 Multiple sequence alignment MSA showed the presence of some conserved residues in all the sequences from different sources, while others were restricted only to their groups [14]. Four tryptophan, four phenylalanine, three tyrosine, two proline, two alanine, one glycine, one aspartic acid, one isoleucine and one glutamic acid were found to be identically conserved residues in all analysed species of plant. One proline and one glycine were found to be identically conserved residues in all analyzed sequences of bacteria while no residue was found to be conserved in fungal profile. 3.3 Conserved motif identification Seven conserved motifs were identified after the analysis of bacterial, fungal and plant profiles separately. Three conserved motifs were observed in bacterial profile, three in plant profile whereas a single conserved motif was identified in fungal profile (Table. 2). 3.4 Conserved motif family identification The seven identified conserved motifs were applied for their family identification in Pfam data base using sequence search option. First two conserved motifs identified in bacterial profile belonged to Glyco hydro 42 domain family while the Pfam entry of third bacterial conserved motif was not found. All the three conserved motifs identified in plant profile belonged to Glyco
  • 3. International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013 39 hydro 35 domain family while a single conserved motif identified in fungal profile belonged to Beta Gal dom2 domain family (Table. 2). 3.5 Cluster analysis 3.5.1. Cluster analysis of bacterial profile Cluster analysis of bacteria showed two major clusters as shown in Figure 1. Cluster A consisted of six species which was further divided into two sub-clusters. Sub-cluster A contains three species (Thermus thermophilus, Meiothermus ruber and Streptomyces flavogriseus). Sub-cluster B contains two species (Bacteroides salanitronis and Bacteroides ovatus). Niastella koreensis was found to be distantly related and therefore outgrouped from both sub-clusters. Cluster B consisted of two species namely Xanthomonas axonopodis, and Streptomyces coelicolor. Frateuria aurantia and Niabella soli were outgrouped from both clusters. Figure 1. Phylogenetic tree of bacterial profile using UPGMA method 3.5.2. Cluster analysis of fungal profile Cluster analysis of fungi showed a single major cluster as shown in Figure 2. This cluster consisted of seven species which was further divided into two sub-clusters. Sub-cluster A contains five species (Metarhizium anisopliae, Metarhizium acridum, Penicillium decumbens, Beauveria bassiana and Aspergillus kawachii). Sub-cluster B contains two species (Verticillium dahlia and Verticillium albo-atrum). Colletotrichum orbiculare, Cordyceps militaris and Colletotrichum higginsianum were found to be outgrouped from both sub-clusters and therefore these were distantly related. 3.5.3. Cluster analysis of plant profile Cluster analysis of plant showed two major clusters as shown in Figure 3. Cluster A consisted of eight species which was further divided into two sub-clusters. Sub-cluster A contains three species (Prunus salicina, Pyrus communis and Cicer arietinum). Sub- cluster B contains two species (Solanum lycopersicum and Capsicum annuum). Oryza sativa, Brassica oleracea, Medicago truncatula were found to be distantly related and therefore outgrouped from both sub-clusters. Cluster B consisted of two species namely Arabidopsis thaliana and Aegilops tauschii.
  • 4. International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013 40 Figure 2.Phylogenetic tree of fungal profile using UPGMA method Figure 3.Phylogenetic tree of plant profile using UPGMA method 3.5.4. Cluster analysis of joint bacterial, fungal and plant profile Three major clusters were obtained by Cluster analysis of joint bacterial, fungal and plant profile (Figure 4). Cluster A consisted of seventeen species which were further divided into two subclusters. Subcluster A contained eight species of plants, and one species of bacteria. Subcluster B consisted of seven species of fungi and one species of bacteria. Cluster B consisted of six species of bacteria. One species of bacteria was outgrouped from Cluster B. Cluster C consisted of two species of plant and two species of fungi. One bacterial species and one fungal species were outgrouped from all three clusters.
  • 5. International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013 41 Figure4. Phylogenetic tree of joint profile of bacteria, fungi and plants using UPGMA method
  • 6. International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013 42 Table. 1 Retrieved sequences, source, species name, family and their accession number Serial no. Source Name of Organisms Family Accession no. 1. Bacteria Bacteroides salanitronis Bacteroidaceae ADY37532.1 2. Bacteria Bacteroides ovatus Bacteroidaceae ZP_06725189 3. Bacteria Xanthomonas axonopodis Xanthomonadaceae AGH78562.1 4. Bacteria Frateuria aurantia Xanthomonadaceae YP_005377482.1. 5. Bacteria Niastella koreensis Chitinophagaceae YP_005008117.1 6. Bacteria Niabella soli Chitinophagaceae ZP_09632360.1 7. Bacteria Streptomyces coelicolor Streptomycetaceae NP_733571.1 8. Bacteria Streptomyces flavogriseus Streptomycetaceae ADW06353.1 9. Bacteria Thermus thermophilus Thermaceae ABI35985.1 10. Fungi Metarhizium anisopliae Clavicipitaceae EFZ03727.1 11. Fungi Metarhizium acridum Clavicipitaceae EFY85580.1 12. Fungi Colletotrichum orbiculare Glomerellaceae ENH80113.1 13. Fungi Penicillium decumbens Trichocomaceae AFR36805.1 14. Fungi Aspergillus kawachii Trichocomaceae GAA90667.1 15. Fungi Cordyceps militaris Cordycipitaceae EGX94612.1 16. Fungi Beauveria bassiana Cordycipitaceae EJP64431. 17. Fungi Verticillium dahlia Plectosphaerellaceae EGY23296.1 18. Fungi Verticillium albo-atrum Plectosphaerellaceae EEY14998.1 19. Fungi Colletotrichum orbiculare Glomerellaceae ENH80113.1 20. Fungi Colletotrichum higginsianum Glomerellaceae CCF38689.1 21. Plants Brassica oleracea Brassicaceae CAA59162.1 22. Plants Arabidopsis thaliana Brassicaceae AEE79231.1 23. Plants Oryza sativa Poaceae AAM34271.1 24. Plants Aegilops tauschii Poaceae EMT17876.1
  • 7. International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013 43 25. Plants Solanum lycopersicum Fabaceae AAC25984.1 26. Plants Capsicum annuum Solanaceae BAC10578.2 27. Plants Cicer arietinum Fabaceae CAA06309.1 28. Plants Medicago truncatula Fabaceae AET04927.1 29. Plants Prunus salicina Rosaceae ABY71826.1 30. Plants Pyrus communis Rosaceae CAH18936.1 Table.2 Motifs identified using MEME program and their Pfam analysis using Pfam database Serial no Motif Width Present in number of sequences Family Source 1. EFAWNQLEPEPGKYDFSWLD 20 10 Glyco hydro 42 Bacteria 2. YGNHPAVIMWQIDNE 15 10 Glyco hydro 42 Bacteria 3. EQWKEDLKKMREMG 14 10 Pfam entry not found Bacteria 4. GLDVIQTYVFWNGHEPSPGKY 21 10 Glyco hydro 35 Plant 5. LYVNLRIGPYVCAEWNFGGFP 21 10 Glyco hydro 35 Plant 6. INGQRRILISGSIHYPRSTPQ 21 10 Glyco hydro 35 Plant 7. RDSKIHVTDYPVGDHTLLYSTAEIFTWKK 29 10 Beta Gal dom2 Fungi 4. Conclusions Identification of conserved regions in a profile of protein sequences determines common ancestry combined with conservative evolutionary pressure to maintain important residues at functionally important parts of the protein. MSA revealed the presence of some conserved residues in plant and bacterial profile separately while no residue was found to be conserved in fungal profile. This suggests that the analyzed sequences of fungi showed high variability when compared to bacteria and plants. Seven conserved motifs belonging to different families were identified. Three major sequence clusters were obtained by cluster analysis of all retrieved sequences from different sources indicating the evolutionary history of β-galactosidases.
  • 8. International Journal on Bioinformatics & Biosciences (IJBB) Vol.3, No.2, June 2013 44 References 1. Fowler A.V., & Zabin I. (1977). The amino acid sequence of beta-galactosidase of Escherichia coli. Proceedings of the National Academy of Sciences, 74(4), 1507-1510. 2. Jacobson R.H., Zhang X. J., Dubose R. F., Matthews B. W. (1994). Three-dimensional structure of β-galactosidase from E. Coli. Nature 369 (6483): 761–766 3. Matthews B.W. (June 2005). The structure of E. coli beta-galactosidase. C. R. Biol. 328 (6): 549– 56. 4. Seiboth B., Hartl L., Salovuori N., Lanthaler K., Robson G.D., Vehmaanpera, J., & Kubicek C. P. (2005). Role of the bga1-encoded extracellular β-galactosidase of Hypocrea jecorina in cellulase induction by lactose. Applied and environmental microbiology, 71(2), 851-857. 5. Zhuang J.P., Su J., Li X.P., & Chen W.X. (2006). Cloning and expression analysis of beta- galactosidase gene related to softening of banana (Musa sp.) fruit. Zhi wu sheng li yu fen zi sheng wu xue xue bao= Journal of plant physiology and molecular biology, 32(4), 411. 6. Dwivedi V.D., Arora S., Kumar A. and Mishra S.K. (2013). Computational analysis of xanthine dehydrogenase enzyme from different source organisms, Network Modeling Analysis in Health Informatics and Bioinformatics, DOI : 10.1007/s13721-013-0029- 7. 7. Dhar D. V., Tanuj S., Amit P., & Kumar M. S. (2012). INSIGHTS TO SEQUENCE INFORMATION OF ALPHA AMYLASE ENZYME FROM DIFFERENT SOURCE ORGANISMS. International Journal of Advanced Biotechnology and Bioinformatics, 1(1), 87-91. 8. Dhar D. V., Tanuj S., Kumar M. S., & Kumar P. A. (2012). Insights to Sequence Information of Lactoylglutathione Lyase Enzyme from Different Source Organisms. I. Res. J. Biological Sci., 1(6), 38-42. 9. Yadav .SK., Dubey A.K., Yadav S., Bisht D., Darmwal N.S., Yadav D., Amino acid sequences based phylogenetic and motif assessment of lipases from different organisms, Online J Bioinform., 13(3):400-417, 2012. 10. Edgar R.C., (2004). MUSCLE: multiple sequence alignment with high accuracy and high throughput, Nucleic Acids Res., 19: 32(5), 1792-7. 11. Bailey T.L., Elkan C., (1995). Unsupervised learning of multiple motifs in biopolymers using expectation maximization, Mach Learn 21 (51), 80-33. 12. Punta M., Coggill P.C., Eberhardt R.Y., Mistry J., Tate J., Boursnell C., Pang N., Forslund K., Ceric G., Clements J., Heger A., Holm L., Sonnhammer E.L.L., Eddy S.R., Bateman A., and Finn R.D. The Pfam Protein Families Database, Nucleic Acids Research Database (2012). 13. Kumar S., Dudley J., Nei M, and Tamura K. (2008). MEGA: a biologist-centric software for evolutionary analysis of DNA and protein sequences, Briefings in Bioinformatics, 9, 299-306. 14. Malviya N., Srivastava M., Diwakar S. K.. and Mishra S. K. (2011). Insights to sequence information of polyphenol oxidase enzyme from different source organisms,” Applied Biochemistry and Biotechnology, 165: 397–405