SlideShare uma empresa Scribd logo
1 de 11
Baixar para ler offline
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME
175
COMPRESSIVE ASSESSMENT OF BIOINFORMATICS IN BIOMEDICAL
IMAGING AND IMAGE PROCESSING
Shashikant S. Patil Sachin A Sonawane
Associate Professor Assistant Professor
SVKM’s NMIMS SVKM’s NMIMS
Nischay Upadhyay Aanchal Srivastava
SVKM’s NMIMS SVKM’s NMIMS
ABSTRACT
Bioinformatics is basically the method of storing, organizing, retrieving and analyzing the
biological data. The protein sequences of the organisms are stored in databases and can be retrieved
as per requirement. This paper reviews the bioinformatics and its importance in detail with the
description of various analyzing methods. Whenever a new virus is found, its genetic material is
compared with the existing genetic information. Based on the comparison of results, the protein
sequence that is required for curing can be identified. Based on this observation the appropriate
medicine is prescribed. The data stored in the databases is updated as per the discovery of new
protein sequences. Hence, it is essential to keep the information in the database in the most organized
and optimized form for its quick retrieval.
Keywords: Bioinformatics, Biomedical Imaging, Compressive assessment, Genomics analysis,
Protein sequence.
1. INTRODUCTION
Biological data are being produced at a phenomenal rate. For instance till the year 2000 there
were 88,166 sequences of protein database in SWISS-PROT and 8,214,000 sequences of neuclic acid
contained in the GenBank. Typically observing these scenario the databases are expanding by twice
every 15 months []. Moreover, complete sequences for database for more than 4 organisms have
been released that ranges from 50 genes to as much as 100,000. Appending to the data from the
uncountable number of related projects that is based on gene expressions tells us the protein structure
INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING &
TECHNOLOGY (IJCET)
ISSN 0976 – 6367(Print)
ISSN 0976 – 6375(Online)
Volume 5, Issue 4, April (2014), pp. 175-185
© IAEME: www.iaeme.com/ijcet.asp
Journal Impact Factor (2014): 8.5328 (Calculated by GISI)
www.jifactor.com
IJCET
© I A E M E
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME
176
encoded by the genes and also gives information about how these product will interact with each
other. This indeed gives our imagination a wider sense about the variety and amount of information
that is being processed [15, 19]. This overflowing of data leads to the necessity of computers in the
biological research. Introducing the computer in the system is an ideal approach as computer has
capability of dealing with a large amount of data and solving the complex problems with an ease.
Bioinformatics, the subject of the current review, is often defined as an interdisciplinary field of
science that has develop various techniques to store, retrieve, organize and analyse the biological
data. Life itself revolves around information technology [2, 23]. An individual’s psychology is
largely correlated with its genes which at its most primitive form can be looked as a digitalized
information. Simultaneously with the emergence in the bioinformatics there has been advanced
enhancements in the technologies that supply the initial raw data. Recently in the studies of Anthony
Kerlavage of Celera has stated that an experimental laboratory is able to produce data over 100
Gigabytes within a day easily [9, 14, 32]. To match the incredibly large data the computer employed
should be able to deal with the data with incredible processing powers. The more the data processing
is there more is the challenges in the computing field. The computer employed should be of high
efficiency, larger disk storage, should allow faster computation and should be optimized to its
highest possible limit.
2. AIMS OF BIOINFORMATICS
The bioinformatics has some simple aim that can be explained in three-folds. Firstly, the
bioinformatics should organize and optimize the data in such a way that it can be easily accessible by
the researchers and new entries in the database can be made easily [6, 7, 11]. While curing the data is
one of the important tasks, hence, the information that is stored in the databases are not fruitful until
and unless they are analysed. Therefore, the functionality of bioinformatics is further extended.
Secondly, the aim of bioinformatics is to develop techniques and resources that will help in the
analysing phase of data. For instance, if some sequence of protein is found, then it is of researcher’s
interest to compare it with the previously characterized sequences of protein. This search of the
previously matching sequence is not a mere search of data from a stack of data rather it requires
programs like FASTA and PSI-BLAST that does the biological significant matching of the
sequences. The development in the resources is therefore demanding the expertise in the
computational field and thorough understanding of the biology and its related terms. Thirdly, the aim
of bioinformatics is to use the provided tools to analyse the given data and interpret the outcomes in
a biologically understandable manner [23]. Customarily the biological studies goes on by examining
the subject in detail and comparing the gathered data with the related previously characterized data.
In bioinformatics now we can compare the database globally which has led to uncovering general
principles that may apply around various present system and helps to highlight the novel features
present. The biological data gathered in the examining process goes through many processing stages
which includes areas such as computer science, mathematical analysis, statistics and engineering.
3. HISTORY
On the basis of importance of data transmission, accumulation and processing in biological
systems, PaulienHogeweg, discovered the term “Bioinformatics” in 1978. The term implies to the
study of data processes in biotic systems [3]. Because of its definition this field is similar to
biochemistry or biophysics. The data accumulation and maintenance in models of prebiotic evolution
and social interaction structures by simple behavioural rules are the relevant examples of biological
information processes which were studied earlier under bioinformatics [5].
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME
177
Elvin A. Kabat, contributed to bioinformatics with his comprehensive volumes of antibody
sequences which was released with Tai Te Wu between 1980 and 1991. Margaret Oakley Dayhoff is
another significant pioneer in this field who has been hailed by David Lipman, director of the
National Centre for Biotechnology Information, as the "mother and father of bioinformatics [16, 20,
27].
The term bioinformatics was re-invented at the beginning of the "genomic revolution", to
refer to the creation and maintenance of a database to store biological data such as amino acid
sequences and nucleotide sequences [8, 10]. This type of database included not only design issues
but also the development of complex interfaces so that the researchers could easily access the
existing data and also submit the revised or new data.
4. GOALS
The biological data are combined together to study the alterations of cellular activities in
various disease states [14, 25]. The field of bioinformatics includes protein domains, amino acid and
nucleotide sequences and protein structures which emerged this field in such a way that now it
includes the interpretation and analysis of several types of data. The actual term for analyzing and
interpreting data is known as computational biology [31, 33]. The vital sub-disciplines within
computational biology and bioinformatics include:
• The implementation and the development of tools that enables effective access for use and
management of several kind of data.
• The improvement in statistics and new mathematical formula to access the interaction among
members of large data sets. For instance, the protein structure and/or function and cluster
protein sequences are predicted into the families of related sequences in order to locate a gene
within a sequence.
The initial goal of bioinformatics is to escalate the understanding of biological processes. Its
focus on improving and applying computationally intensive methods to achieve this goal, makes this
field different from other approaches. Samples include: data mining, machine learning algorithms,
pattern recognition and visualization [21]. Key research efforts in the field include gene finding, drug
design, sequence alignment, drug discovery, genome assembly, protein structure prediction, protein
structure alignment, genome-wide association studies, prediction of gene expression and protein-
protein interactions, and the modeling of evolution [18].
In order to solve practical and formal problems that rise from the analysis and the
management of biological information, bioinformatics [29] requires the creation and development of
algorithms, databases, statistical and computational methods.
The combination of continuous development in information technologies and genomic and
other molecular research [14] technologies has given rise to a large amount of data related to
molecular biology. Thus, the name given to these computing and mathematical approaches is
“bioinformatics” for the understanding of biological processes.
5. APPROACHES
Mapping and analysing DNA and several protein sequences and aligning them with different
DNA [13] and protein sequences in order to compare them and then creating the 3-D models of these
sequences and viewing it, these are the common activates in bioinformatics.
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME
178
The two ways of modelling a biological system for instance, living cell are as follows:
1) Static : sequences- Peptides and Nucleic acids and Proteins Relationship among the above
entities including metabolites, microarray information and Networks of proteins
2) Dynamic: Structures- Nucleic acids, Proteins, Peptides and Ligands System biology, under
this category, includes variable concentrations of metabolites and reaction fluxes
6. MAJOR RESEARCH AREAS
The major research areas of bioinformatics is bioinformatics and computational biology,
genetics and genomics and lastly system biology. Bioinformatics has emerged as an essential part of
numerous areas of biology [33]. In starting level of experimental biology many image processing
techniques and signal processing techniques are combined and used to encapsulate the important
outcomes from a huge quantity of raw data. Bioinformatics has been working in the field of genetics
and genomes in which it helps in sequencing and interpreting genomes[14] and simultaneously
observes their transformations. It plays an essential role in making a note of the recorded data and
arranging it sequentially in an organized manner in the databases. It has a role in the analyzing stages
of genes and protein sequences. In the area of system biology it aids in the modelling and simulation
of several microscopic level things like the RNA, DNA[23,25] and the protein structures and also
their interaction at molecular level of studies.
7. SEQUENCE ANALYSIS
Sequence analysis is basically a process in which the protein sequence, DNA or RNA are
subjected to the broader range of methods that analytically analyzes the subjects in order to
understand its characteristics in detail. When we talk about a characteristics of the peptide sequences
we talk about its features, its functions, its structure and its evolution. The analytical methods used
consist of the alignment of the sequences[12,32] and comparison of the sequence with the previous
biological databases. The sequence analysis of the functions include a wide range of similar topics:
• The comparison of the sequences with the biological databases to find the similarities
between them and also infer whether they are homologous [14, 22].
• Intrinsic features of the sequence is identified using the sequence analysis.
• Genetic marking based upon the differences and variation from the biological database
sequences is used to identify the signals.
• Finally after comparing the new sequence with the old database we reveal the genetic
diversity and evolution of the new sequence or organism.
• Sequence alone is sufficient to identify the molecular structure of the organism.
Intrinsic features mentioned in one of the points above refer to the features like post
translation modification sites, active sites, gene structures, reading frames, distribution of exons,
introns [5, 9] and other regulatory elements.
Sequence analysis is followed in a particular fashion. The main contents in the analysis are
the sequence alignment which is followed by profile comparison. The profile comparison is done by
aligning the new sequence with the old database. After the profile comparison the next stage is of the
sequence assemble and gene prediction after which the protein structure prediction takes place.
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME
179
8. GENOME ANNOTATION
Genome can be defined as a complete set of DNA within a single cell of organism. The very
first software system that can explain the genome was designed by Owen White in 1995. He with his
team analyzed and sequenced the very first genome of the bacterium haemophilus influenza. Genes
are defined as the fragments of genomic sequence [6, 8] that encode the protein within. Owen White
then build a software that was capable of finding the genes, transferring RNA’s and moreover can
assign initial functions to the genes. Since the development in the computer field is tremendous
hence the programs now available are more advanced, changing and improved in its own ways.
9. COMPUTATIONAL EVOLUTIONARY BIOLOGY
The study of the origin of a species, its descent as well as the change over time is known as
Evolution biology [3]. In simple words the meaning of evolution biology can be judged from its
name itself i.e. it is the study of an evolution of species. The invention of bioinformatics in
evolutionary biology has enabled the researchers to trace the descent of an uncountable number of
species. Also bioinformatics has aided in the study of the evolution of more complex species that
include the cases of gene[2,10] duplication, gene transfer horizontally, etc. It has also helped in
estimating the result of any population over time. Bioinformatics has main usage in tracking and
sharing the information of a huge number species and organism.
10. LITERATURE ANALYSIS
As the number of researcher are growing, in the same manner the number of papers published
are increasing at a tremendous rate [1, 5]. Hence to get into the expanding library of the textual
resources, literature analysis is required. Literature analysis targets to make use of various
computational and statistical linguistics for this analysis. It uses abbreviation recognition, named
entity recognition and protein- protein interaction for the same. In abbreviation recognition it finds
out the full form of the abbreviated biological [18, 20] terms. In named entity recognition it
recognizes the other biological terms like the names of the genes etc. whereas in the protein-protein
interaction it identifies that which protein is interacting with which protein from the given text.
11. ANALYSIS OF GENE EXPRESSION
To analyze the gene expression we have to measure its mRNA level. There are various
techniques to measure the mRNA level, some of which are microarrays, serial analysis of gene
expression, expressed cDNA sequence[17,32] tag (EST) sequencing, massively parallel signature
sequencing (MPSS), RNA-Seq. that is also called as "Whole Transcriptome Shotgun Sequencing"
(WTSS). The techniques mentioned above are all noise prone. Most of the efficiency and time in the
computational biology is absorbed in developing various statistical tools that are capable of
separating the noise from signal.
12. ANALYSIS OF REGULATION
Before analysis of regulation we should first understand what regulation is all about.
Regulation can be defined as a complex arrangement of the events[12,21] initiating with a hormone
or any extracellular signal that eventually leads to the increment or the decrement in one or more
activity of any number of proteins. For the in-depth study of the process bioinformatics is required.
To infer about a gene regulation expression data can be used [14]. If the analysis is restricted to a
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME
180
single celled species then the various stages of the cell cycle can be compared under different
conditions of stress i.e. under heat, shock, etc. After getting the expression data we can apply the
clustering algorithm in order to tell that which of the genes are co-expressed. Clustering algorithm
used for gene clustering are like k-means clustering, SOMs (self-organizing maps), consensus
clustering and hierarchical clustering.
13. ANALYSIS OF PROTEIN EXPRESSION
The proteins present in the biological sample can be judged by providing a snapshot using the
protein microarrays and high throughout (HT) mass spectrometry (MS). These snapshots can be
turned into sensible data using bioinformatics [9, 12]. It faced a problem of matching large quantity
of data against the previously available protein database and also there were multiple complicated
statistical analysis of samples whereas it detected the incomplete peptides from each protein.
14. COMPARATIVE GENOMICS
Comparative genomics is a biological research field that is used for the comparison of the
genomic features of different organisms. The features that are compared between organisms are
generally the DNA sequences, the genes, the regulatory sequences, the gene order and various other
genomic structural landmarks[41]. The comparative genomics has the ability to find out the
similarities as well as differences in biological terms between different species. Evolutionary
relationships between the alike organisms can also be found out by using comparative genomics. The
principle of the comparative genomics is that each and every organism has its features encoded
within its DNA and these features are evolutionary conserved within them. Hence, the common
features of two organisms are also encoded within their DNAs [4, 6]. Therefore, comparative
genomics makes some sort of alignment of the genome sequences and then finds the common
sequence in the aligned sequences. Finally it verifies that to which extend are the sequences
conserved.
15. NETWORK AND SYSTEMS BIOLOGY
15.1. Network Biology
The relationships between biological networks such as the metabolic network or the protein-
protein network are understood by doing the network analysis. Network biology seldom contains
different data types like small molecules, proteins[31], data of gene expression and many other types
that may be connected physically or/and functionally. A biological network can be formed from a
single type of genes or molecules but still the above mentioned data types are always attempted to be
integrated [19].
15.2. Systems biology
In order to visualize and analyze the complex connections of the cellular processes the
computer simulation of cellular subsystems are used. Cellular subsystem [23] comprises a network of
enzymes and metabolites which includes metabolism, gene regulatory networks and signal
transduction pathways. With the aid of the simple (artificial) life forms computer stimulations, the
virtual evolution also known as artificial life tries to comprehend the evolutionary processes [9].
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME
181
16. HIGH-THROUGHPUT IMAGE ANALYSIS
The biomedical imagery has a lot of information stored into it. For complete analysis of these
images computer technologies are deployed which in-turn accelerate or sometimes fully automate the
processing, analysis[22] and quantification of the large amount of data stored in the biological
images. The analysis systems in the modern times has increased the ability of the observer to take
measurements from larger distances and has also improved the accuracy, speed or objectivity. Some
of the modern fully developed analysis systems has completely removed the observer. These
enhanced systems[29] are not only related to the field of biomedical imaging. Nowadays biomedical
imaging is becoming useful for both the research and diagnostics purposes.
17. SOFTWARE AND TOOLS
Several software tools used for bioinformatics are varied from simple command-line tools to
more complex graphical programs. Further the software and tools in bioinformatics are classified as
follows:
17.1. Open-source bioinformatics software
Since 1980s, there have been many free and open source software tools that existed and are
further continuing to grow. Regardless of the funding arrangements, the association of several open
code bases that are freely available, innovation in silicoexperiments [6, 27] and necessity for new
algorithms for the analysis of evolving types of biological readouts have helped all research groups
to contribute to both the variety of open source software available and bioinformatics. These open
source tools often serve as the brooding place for ideas, or community supported plug-ins in
commercial applications [25, 28].
The variety of open-source software packages involves titles such as BioPerl, BioJava,
Taverna workbench, Bioconductor, UGENE [31], Biopython, BioRuby, EMBOSS, .NET Bio and
Bioclipse. The annual Bioinformatics Open Source Conference (BOSC) has been supported by the
non-profit Open Bioinformatics Foundation since 2000, to maintain the tradition create further
opportunities.
17.2. Web services in bioinformatics
For a wide range of applications in bioinformatics, several interfaces based on SOAP and
REST [16, 17] have been developed which allows an application to run on one computer in one part
of the world so as to use information, algorithms and computing resources on servers in other parts
of the world. The main advantage of using web services in bioinformatics is that there is no need for
the end users to deal with database maintenance overheads and software.
EBI further sub-divides the basic bioinformatics services into three categories which are as follows:
1) BSA (Biological Sequence Analysis)
2) SSS (Sequence Search Services)
3) MSA (Multiple Sequence Alignment)
The applicability of web-based bioinformatics solutions is illustrated through the availability
of these service oriented bioinformatics resources that varies from an assembly of standalone tools
having a common information pattern under a single web-based interface[28], to distributed,
integrative and extensible bioinformatics workflow management systems.
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME
182
17.3. Bioinformatics workflow management systems
Bioinformatics workflow management system is defined as a specialized kind of a workflow
management system. The main aim of this system is to constitute and execute a series of
computational or information manipulation steps, or a workflow in a bioinformatics application. This
system helps scientists to create their own workflows[11, 19] for individual application by providing
them an easy to use environment. It also provides them various interactive tools that enable them to
execute their workflows and view the results in real time. This system simplifies the process of
reusing and sharing workflows between the scientists. Thus, it helps the scientists to track the
workflow creation steps and the provenance[17] of the workflow execution results. Galaxy, Kepler,
Anyaya and TavernaAnduril are four such platforms that provide the above mentioned service
currently.
18. PRACTICAL APPLICATIONS
18.1. Finding Homologues
The search for similarities between unlike biomolecules is one of the driving forces behind
bioinformatics. The identification of protein homologues has some direct practical uses in
bioinformatics, apart from the systematic organization [22, 24] of information. The key application
of this, is transferring of information between related proteins. For instance, it is possible to look for
homologues that are better understood and also applies some of the knowledge as compared to a
given poorly characterized protein. The theoretical models of proteins are usually built on
experimentally solved structures of close homologues specifically in the case of structural data. The
method used in fold recognition is same as the above mentioned technique in which tertiary structure
predictions rely on searching structures of remote homologues[26,30] and then further it is checked
that whether the prediction is energetically viable or not. Wherever there is lack in structural and
biochemical data, the studies could be made in low-level organisms such as yeast and the results of
these experiments could be applied to high level organisms like humans, where the experiments are
more demanding. A corresponding approach is also applied in genomics. Basically homologue
finding is used in newly sequenced genomes to confirm coding regions and the functional
information is recurrently transferred to annotate individual genes [31].
The problem of understanding complex genomes is simplified by first analysing simple
organisms and then applying the same techniques to high level complicated organisms. This is
applicable on large scale and is one of the reasons why the projects of early structural genomics
focussed on Mycoplasma genitalium[17]. Ironically, this identical idea can also be applied vice
versa. By checking the missing microbial proteins in humans, one can quickly discover the potential
drug targets. Drug molecules may be designed on a small scale by harnessing the structural
differences between alike proteins, that specifically bind to one structure but not another.
18.2. Rational Drug Design
Assisting rational drug design has been one of the earliest medical applications of
bioinformatics. Using the MLH1 gene[14] product is the commonly cited approach as an example of
drug target. MLH1 is a human gene which is used to encode a mismatch repair protein (mmr) that is
situated on the shortarm of chromosome [1, 5]. After going through the linkage analysis, it has been
observed that MLH1 gene is similar to the mmr genes in mice, therefore the gene has now been
implicated in nonpolyposis colorectal cancer. Translation software can be used to determine the
probable amino acid sequence of the encoded protein when the nucleotide sequence is given. Firstly
homologues are found in model organisms by using sequence search techniques and then on the
basis of the similarity of the sequence, the structure of the human protein is modeled based on the
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME
183
experimentally characterized structures [22, 23]. Finally, to bind the model structure, molecules are
designed using docking algorithms.
18.3. Large-scale censuses
The bioinformatics database is capable of storing all the information and sequences of the
protein, genomes and their structure and datasets expression[29] but on the other hand it is very
important to condense that information in the understandable form by the user.
In order to identify the subjects and areas of interest for further analysis[2,4] and detailed study the
Broad generalization helps in doing so and also it aids in placing the new observations at appropriate
place. This sorting assists the user to judge whether the content is unusual in any pf the possible
ways.
The surveys held at a particularly large scale can give rise to a number of questions related to
bioinformatics. Questions such as whether the protein fold are related to a particular groups of
phylogenetic? In a particular organism how the different folds are common? What percentages of the
folds are commonly shared between similar organisms?
We have also integrated data on to the protein functions. We know that certain protein folds
are responsible for the particular biochemical functions. The sharing of the folds by an organism
indeed follows the routine phylogenetic classification [6, 11]. A query also arises that to which
extent of sharing parallel the deprived relatedness are derived from the traditional evolutionary trees?
Expression data is the most electrifying and latest source of genomic information. The question after
the combination of the expression information with the functional and the structural classification of
the protein the high presence of the protein fold in a genome is the revealing of the higher levels of
expression. The subcellular[22] localization of the protein and the interactions between them is
included in the surveys at larger scale to get the genomic data. Along with the union of the structural
data we can start by executing the maps of all the protein-protein interaction that takes place within
an organism.
18.4. Further applications in medical sciences
The current application in the biomedical sciences has been wrapped around the gene
expression analysis. In the biomedical disease detection the compiled expression data of the affected
cell due to any disease like cancer, etc. is compared with the normal expression levels of the
unaffected cells. The gene expressed by the affected cells provides us with the cause of the disease
and therefore indicates the various possible treatments for the same using specialized
drugs[23].Given a lead compound, microarray trials can then be helpful in evaluating the responses
to the pharmacological intervention, moreover it aids in predicting the toxicity [26, 29] of the applied
drugs and is used to facilitate with the early test results.
The combination of experimental genomics with the advancement in bioinformatics is
predicted to bring about a revolution in the future of healthcare for the individuals. For a patient a
distinctive scenario may arise with post natal genotyping in order to assess immunity from specific
pathogens and diseases. Therefore, keeping in mind this information, in order to reduce the costs
related to healthcare, a distinct combination of vaccines could be prescribed saving the unnecessary
treatments and expecting the onslaught of diseases later in life.
19. REFERENCES
1. Waterman, Michael S. (1995). Introduction to Computational Biology: Sequences, Maps and
Genomes. CRC Press.ISBN 0-412-9991-0.
2. Mount, David W. (May 2002). Bioinformatics: Sequence and Genome Analysis. Spring
Harbor Press. ISBN 0-879-69608-7.
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME
184
3. Claverie, J.M.; Notredame, C. (2003). Bioinformatics for Dummies. Wiley. ISBN 0-765-
1696-5.
4. Hogeweg, P. (2011). "The Roots of Bioinformatics in Theoretical Biology". In Searls, David
B. PLoS Computational Biology 7 (3):.
5. Hesper B, Hogeweg P (1970). Bioinformatica: eenwerkconcept 1 (6). Kameleon. pp. 28–29.
6. Hogeweg, P. (1978). "Simulating the growth of cellular forms". Simulation 31 (3):
90–96.doi:10.1177/001754977803100305.
7. Johnson, George; Tai TeWua (January 2000). "Kabat Database and its applications: 30 years
after the first variability plot". Nucleic Acids Res 28 (1): 214–218.
doi:10.1093/nar/28.1.214. PMC 102231.PMID PMC102231.
8. Moody, Glyn (2004). Digital Code of Life: How Bioinformatics is Revolutionizing Science,
Medicine, and Business. ISBN 978-0-471-32788-2.
9. Attwood TK, Gisel A, Eriksson N-E, Bongcam-Rudloff E (2011). "Concepts, Historical
Milestones and the Central Place of Bioinformatics in Modern Biology: A European
Perspective". Bioinformatics - Trends and Methodologies. InTech. Retrieved 8 Jan 2012.
10. Sanger F, Air GM, Barrell BG, Brown NL, Coulson AR, Fiddes CA, Hutchison CA,
Slocombe PM, Smith M (February 1977). "Nucleotide sequence of bacteriophage phi X174
DNA". Nature 265 (5596):.
11. Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL (January 2008).
"GenBank". Nucleic Acids Res.6 (Database issue): D25–30.
doi:10.1093/nar/gkm929. PMC 2218932. PMID 18073190.
12. Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ,
Tomb JF, Dougherty BA, Merrick JM (July 1995). "Whole-genome random sequencing and
assembly of Haemophilus influenzae Rd". Science 269 (5223): 496–512.
Bibcode:1995Sci...269..496F. doi:10.1126/science.7532800. PMID 7532800.
13. Antonio Carvajal-Rodríguez (2012). "Simulation of Genes and Genomes Forward in
Time". Current Genomics(Bentham Science Publishers Ltd.) 11 (1): 58–61.
doi:10.2174/118920210790218007. PMC 2851118.PMID 20808525. More than one of
|work= and |journal= specified (help).
14. Abu-Jamous B, Fa R, Roberts DJ, Nandi AK, Peddada SD (11 February 2013). "Paradigm of
Tunable Clustering Using Binarization of Consensus Partition Matrices (Bi-CoPaM) for Gene
Discovery". In Peddada, ShyamalD.PLoS ONE 8 (2):
e56232. doi:10.1171/journal.pone.0056232. PMC 3169326. PMID 209186.
15. Abu-Jamous B, Fa R, Roberts DJ, Nandi AK (24 January 2013). "Yeast gene
CMR1/YDL156W is consistently co-expressed with genes participating in DNA-metabolic
processes in a variety of stringent clustering experiments". Journal of the Royal Society
Interface 10 (81): 20120990–20120990. doi:10.1098/rsif.2012.0990.
16. "Open Bioinformatics Foundation: About us". Official website. Open Bioinformatics
Foundation. Retrieved 10 May 2011.
17. "Open Bioinformatics Foundation: BOSC". Official website. Open Bioinformatics
Foundation. Retrieved 10 May 2011.
18. ROSALIND: An Addictive Bioinformatics Learning Site.
19. ROSALIND (Project Euler for bioinformatics).
20. Researchers Launch Innovative, Hands-on Online Tool for Science Education.
21. Rosalind is a platform for learning bioinformatics through problem solving.
22. "Q&A: UCSD's Philip Compeau Discusses Rosalind, a Problem-Oriented Bioinformatics
Education Platform". Bioinform. 5 Oct 2012.
23. Achuthsankar S Nair Computational Biology & Bioinformatics – A gentle Overview,
Communications of Computer Society of India, January 2007.
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME
185
24. Aluru, Srinivas, ed. Handbook of Computational Molecular Biology. Chapman & Hall/Crc,
2006. ISBN 1-58488-46-1 (Chapman & Hall/Crc Computer and Information Science Series).
25. Baldi, P and Brunak, S, Bioinformatics: The Machine Learning Approach, 2nd edition. MIT
Press, 2001. ISBN 0-262-02506-X
26. Barnes, M.R. and Gray, I.C., eds., Bioinformatics for Geneticists, first edition. Wiley,
2003. ISBN 0-470-8294-2.
27. Baxevanis, A.D. and Ouellette, B.F.F., eds., Bioinformatics: A Practical Guide to the
Analysis of Genes and Proteins, third edition. Wiley, 2005. ISBN 0-471-47878-4.
28. Baxevanis, A.D., Petsko, G.A., Stein, L.D., and Stormo, G.D., eds., Current Protocols in
Bioinformatics. Wiley, 2007. ISBN 0-471-25093-7.
29. Cristianini, N. and Hahn, M. Introduction to Computational Genomics, Cambridge
University Press, 2006. (ISBN 9780521671910 | ISBN 0-521-67191-4).
30. Durbin, R., S. Eddy, A. Krogh and G. Mitchison, Biological sequence analysis. Cambridge
University Press, 1998. ISBN 0-521-62971-3.
31. Gilbert, D. Bioinformatics software resources. Briefings in Bioinformatics, Briefings in
Bioinformatics, 2004 5(3):300–304.
32. Keedwell, E., Intelligent Bioinformatics: The Application of Artificial Intelligence
Techniques to Bioinformatics Problems. Wiley, 2005. ISBN 0-470-02175-6.
33. Kohane, et al. Microarrays for an Integrative Genomics. The MIT Press, 2002. ISBN 0-262-
11271-X.
34. Lund, O. et al. Immunological Bioinformatics. The MIT Press, 2005. ISBN 0-262-12280-4.
35. Ali E. Taki El_Deen, Mohy E. Abo-Elsoud and Salma M. Saif, “Highly Secured Ciphered -
Watermarked Biomedical Images”, International Journal of Advanced Research in
Engineering & Technology (IJARET), Volume 5, Issue 1, 2014, pp. 123 - 137, ISSN Print:
0976-6480, ISSN Online: 0976-6499.
36. J.Rajarajan and Dr.G.Kalivarathan, “Influence of Local Segmentation in the Context of
Digital Image Processing – A Feasibility Study”, International Journal of Computer
Engineering & Technology (IJCET), Volume 3, Issue 3, 2012, pp. 340 - 347, ISSN Print:
0976 – 6367, ISSN Online: 0976 – 6375.
37. J Ashvini Bhamare, Prof. Sachin Sonawane and Prof. Shashikantpatil, “Comprehensive
Assessment, Performance Overview and Review of All Existing Techniques of Reversible
Image Watermarking for Medical Imaging”, International Journal of Computer Engineering
& Technology (IJCET), Volume 4, Issue 6, 2013, pp. 95 - 102, ISSN Print: 0976 – 6367,
ISSN Online: 0976 – 6375.

Mais conteúdo relacionado

Mais procurados

Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informatics
Daniela Rotariu
 
Bioinformatics Project Training for 2,4,6 month
Bioinformatics Project Training for 2,4,6 monthBioinformatics Project Training for 2,4,6 month
Bioinformatics Project Training for 2,4,6 month
biinoida
 

Mais procurados (20)

David Tyrpak CV
David Tyrpak CVDavid Tyrpak CV
David Tyrpak CV
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biology
 
Current Trends & Developments of Bioinformatics
Current Trends & Developments of BioinformaticsCurrent Trends & Developments of Bioinformatics
Current Trends & Developments of Bioinformatics
 
Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informatics
 
Bioinformatics Project Training for 2,4,6 month
Bioinformatics Project Training for 2,4,6 monthBioinformatics Project Training for 2,4,6 month
Bioinformatics Project Training for 2,4,6 month
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
50120140501018
5012014050101850120140501018
50120140501018
 
Omics Logic Genomics Program
Omics Logic Genomics ProgramOmics Logic Genomics Program
Omics Logic Genomics Program
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier Datathon
 
Bioinformatics Projects And Applications
Bioinformatics Projects And ApplicationsBioinformatics Projects And Applications
Bioinformatics Projects And Applications
 
Functional Analysis & Screening Technologies Congress
Functional Analysis & Screening Technologies CongressFunctional Analysis & Screening Technologies Congress
Functional Analysis & Screening Technologies Congress
 
Ijetcas14 325
Ijetcas14 325Ijetcas14 325
Ijetcas14 325
 
150522 bioinfo gis lr
150522 bioinfo gis lr150522 bioinfo gis lr
150522 bioinfo gis lr
 
The Role of Bioinformatics in The Drug Discovery Process
The Role of Bioinformatics in The Drug Discovery ProcessThe Role of Bioinformatics in The Drug Discovery Process
The Role of Bioinformatics in The Drug Discovery Process
 
Introduction to Chemoinfornatics
Introduction to ChemoinfornaticsIntroduction to Chemoinfornatics
Introduction to Chemoinfornatics
 
Data for AI models, the past, the present, the future
Data for AI models, the past, the present, the futureData for AI models, the past, the present, the future
Data for AI models, the past, the present, the future
 
Novel modelling of clustering for enhanced classification performance on gene...
Novel modelling of clustering for enhanced classification performance on gene...Novel modelling of clustering for enhanced classification performance on gene...
Novel modelling of clustering for enhanced classification performance on gene...
 
Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019
 
Role of bioinformatics in drug designing
Role of bioinformatics in drug designingRole of bioinformatics in drug designing
Role of bioinformatics in drug designing
 

Destaque

Training and development
Training and developmentTraining and development
Training and development
danny688
 
Logistica y Cadena de Suministros
Logistica y Cadena de SuministrosLogistica y Cadena de Suministros
Logistica y Cadena de Suministros
Gonzalo Lagunes
 
Guia informaticabasica
Guia informaticabasicaGuia informaticabasica
Guia informaticabasica
linda31
 
Gfpi f-019 guia de aprendizaje 01 tda orientar fpi
Gfpi f-019 guia de aprendizaje 01 tda orientar fpiGfpi f-019 guia de aprendizaje 01 tda orientar fpi
Gfpi f-019 guia de aprendizaje 01 tda orientar fpi
lisbet bravo
 
Manual de cadena de custodia
Manual de cadena de custodiaManual de cadena de custodia
Manual de cadena de custodia
Mona Beautifull
 

Destaque (20)

E portafolio verónica casas castrillon grupo 201512_55
E  portafolio verónica casas castrillon grupo 201512_55 E  portafolio verónica casas castrillon grupo 201512_55
E portafolio verónica casas castrillon grupo 201512_55
 
E portafolio leidy-johannadelgado_201512_103
E portafolio leidy-johannadelgado_201512_103E portafolio leidy-johannadelgado_201512_103
E portafolio leidy-johannadelgado_201512_103
 
ETAPAS DE LA ENFERMEDAD
ETAPAS DE LA ENFERMEDADETAPAS DE LA ENFERMEDAD
ETAPAS DE LA ENFERMEDAD
 
Unidad i y ii
Unidad i y iiUnidad i y ii
Unidad i y ii
 
Costos
CostosCostos
Costos
 
Historia natural de la enfermedad
Historia natural de la enfermedadHistoria natural de la enfermedad
Historia natural de la enfermedad
 
Training and development
Training and developmentTraining and development
Training and development
 
Logistica y Cadena de Suministros
Logistica y Cadena de SuministrosLogistica y Cadena de Suministros
Logistica y Cadena de Suministros
 
La logistica en la gestion de cadenas de suministro
La logistica en la gestion de cadenas de suministroLa logistica en la gestion de cadenas de suministro
La logistica en la gestion de cadenas de suministro
 
20100116 01 Word Pressテンプレートのカスタマイズ&Xml出力
20100116 01 Word Pressテンプレートのカスタマイズ&Xml出力20100116 01 Word Pressテンプレートのカスタマイズ&Xml出力
20100116 01 Word Pressテンプレートのカスタマイズ&Xml出力
 
Guia de aprendizaje diagnostico
Guia de aprendizaje  diagnosticoGuia de aprendizaje  diagnostico
Guia de aprendizaje diagnostico
 
Guia informaticabasica
Guia informaticabasicaGuia informaticabasica
Guia informaticabasica
 
Guia 01
Guia 01Guia 01
Guia 01
 
Gfpi f-019 guia de aprendizaje 01 tda orientar fpi
Gfpi f-019 guia de aprendizaje 01 tda orientar fpiGfpi f-019 guia de aprendizaje 01 tda orientar fpi
Gfpi f-019 guia de aprendizaje 01 tda orientar fpi
 
Guia fund. teoricos Elaboracion Guias Aprendizaje
Guia fund. teoricos Elaboracion Guias AprendizajeGuia fund. teoricos Elaboracion Guias Aprendizaje
Guia fund. teoricos Elaboracion Guias Aprendizaje
 
Guia fund. teoricos
Guia fund. teoricosGuia fund. teoricos
Guia fund. teoricos
 
Manual de cadena de custodia
Manual de cadena de custodiaManual de cadena de custodia
Manual de cadena de custodia
 
Cápsula 1. estudios de mercado
Cápsula 1. estudios de mercadoCápsula 1. estudios de mercado
Cápsula 1. estudios de mercado
 
C:\Fakepath\Christie
C:\Fakepath\ChristieC:\Fakepath\Christie
C:\Fakepath\Christie
 
Curso Gestión de Procesos FEB.2014 - Dr. Miguel Aguilar Serrano
Curso Gestión de Procesos FEB.2014 - Dr. Miguel Aguilar SerranoCurso Gestión de Procesos FEB.2014 - Dr. Miguel Aguilar Serrano
Curso Gestión de Procesos FEB.2014 - Dr. Miguel Aguilar Serrano
 

Semelhante a 50120140504019 2

Computational of Bioinformatics
Computational of BioinformaticsComputational of Bioinformatics
Computational of Bioinformatics
ijtsrd
 
Protein sequence classification in data mining– a study
Protein sequence classification in data mining– a studyProtein sequence classification in data mining– a study
Protein sequence classification in data mining– a study
Zac Darcy
 
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDYPROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
Zac Darcy
 
Applications of data mining in bioinformatics
Applications of data mining in bioinformaticsApplications of data mining in bioinformatics
Applications of data mining in bioinformatics
kalai75
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
biinoida
 

Semelhante a 50120140504019 2 (20)

LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
 
Computational of Bioinformatics
Computational of BioinformaticsComputational of Bioinformatics
Computational of Bioinformatics
 
Protein sequence classification in data mining– a study
Protein sequence classification in data mining– a studyProtein sequence classification in data mining– a study
Protein sequence classification in data mining– a study
 
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDYPROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
 
Health Informatics- Module 5-Chapter 3.pptx
Health Informatics- Module 5-Chapter 3.pptxHealth Informatics- Module 5-Chapter 3.pptx
Health Informatics- Module 5-Chapter 3.pptx
 
Bio informatics
Bio informaticsBio informatics
Bio informatics
 
Bio informatics
Bio informaticsBio informatics
Bio informatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Applications of data mining in bioinformatics
Applications of data mining in bioinformaticsApplications of data mining in bioinformatics
Applications of data mining in bioinformatics
 
Bioinformatics-General_Intro
Bioinformatics-General_IntroBioinformatics-General_Intro
Bioinformatics-General_Intro
 
Presentation.pptx
Presentation.pptxPresentation.pptx
Presentation.pptx
 
Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
 
97320630009588
9732063000958897320630009588
97320630009588
 
PERFORMANCE EVALUATION OF STRUCTURED AND SEMI-STRUCTURED BIOINFORMATICS TOOLS...
PERFORMANCE EVALUATION OF STRUCTURED AND SEMI-STRUCTURED BIOINFORMATICS TOOLS...PERFORMANCE EVALUATION OF STRUCTURED AND SEMI-STRUCTURED BIOINFORMATICS TOOLS...
PERFORMANCE EVALUATION OF STRUCTURED AND SEMI-STRUCTURED BIOINFORMATICS TOOLS...
 
PERFORMANCE EVALUATION OF STRUCTURED AND SEMI-STRUCTURED BIOINFORMATICS TOOLS...
PERFORMANCE EVALUATION OF STRUCTURED AND SEMI-STRUCTURED BIOINFORMATICS TOOLS...PERFORMANCE EVALUATION OF STRUCTURED AND SEMI-STRUCTURED BIOINFORMATICS TOOLS...
PERFORMANCE EVALUATION OF STRUCTURED AND SEMI-STRUCTURED BIOINFORMATICS TOOLS...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Basic of bioinformatics
Basic of bioinformaticsBasic of bioinformatics
Basic of bioinformatics
 
Bioinformatics in the Clinical Pipeline: Contribution in Genomic Medicine
Bioinformatics in the Clinical Pipeline: Contribution in Genomic MedicineBioinformatics in the Clinical Pipeline: Contribution in Genomic Medicine
Bioinformatics in the Clinical Pipeline: Contribution in Genomic Medicine
 

Mais de IAEME Publication

A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSA STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
IAEME Publication
 
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSBROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
IAEME Publication
 
GANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICEGANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICE
IAEME Publication
 
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
IAEME Publication
 
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
IAEME Publication
 
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
IAEME Publication
 
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
IAEME Publication
 
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
IAEME Publication
 
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
IAEME Publication
 
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
IAEME Publication
 

Mais de IAEME Publication (20)

IAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdfIAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdf
 
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
 
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSA STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
 
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSBROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
 
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONSDETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
 
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONSANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
 
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINOVOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
 
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
 
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMYVISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
 
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
 
GANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICEGANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICE
 
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
 
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
 
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
 
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
 
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
 
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
 
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
 
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
 
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENTA MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Último (20)

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 

50120140504019 2

  • 1. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME 175 COMPRESSIVE ASSESSMENT OF BIOINFORMATICS IN BIOMEDICAL IMAGING AND IMAGE PROCESSING Shashikant S. Patil Sachin A Sonawane Associate Professor Assistant Professor SVKM’s NMIMS SVKM’s NMIMS Nischay Upadhyay Aanchal Srivastava SVKM’s NMIMS SVKM’s NMIMS ABSTRACT Bioinformatics is basically the method of storing, organizing, retrieving and analyzing the biological data. The protein sequences of the organisms are stored in databases and can be retrieved as per requirement. This paper reviews the bioinformatics and its importance in detail with the description of various analyzing methods. Whenever a new virus is found, its genetic material is compared with the existing genetic information. Based on the comparison of results, the protein sequence that is required for curing can be identified. Based on this observation the appropriate medicine is prescribed. The data stored in the databases is updated as per the discovery of new protein sequences. Hence, it is essential to keep the information in the database in the most organized and optimized form for its quick retrieval. Keywords: Bioinformatics, Biomedical Imaging, Compressive assessment, Genomics analysis, Protein sequence. 1. INTRODUCTION Biological data are being produced at a phenomenal rate. For instance till the year 2000 there were 88,166 sequences of protein database in SWISS-PROT and 8,214,000 sequences of neuclic acid contained in the GenBank. Typically observing these scenario the databases are expanding by twice every 15 months []. Moreover, complete sequences for database for more than 4 organisms have been released that ranges from 50 genes to as much as 100,000. Appending to the data from the uncountable number of related projects that is based on gene expressions tells us the protein structure INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET) ISSN 0976 – 6367(Print) ISSN 0976 – 6375(Online) Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME: www.iaeme.com/ijcet.asp Journal Impact Factor (2014): 8.5328 (Calculated by GISI) www.jifactor.com IJCET © I A E M E
  • 2. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME 176 encoded by the genes and also gives information about how these product will interact with each other. This indeed gives our imagination a wider sense about the variety and amount of information that is being processed [15, 19]. This overflowing of data leads to the necessity of computers in the biological research. Introducing the computer in the system is an ideal approach as computer has capability of dealing with a large amount of data and solving the complex problems with an ease. Bioinformatics, the subject of the current review, is often defined as an interdisciplinary field of science that has develop various techniques to store, retrieve, organize and analyse the biological data. Life itself revolves around information technology [2, 23]. An individual’s psychology is largely correlated with its genes which at its most primitive form can be looked as a digitalized information. Simultaneously with the emergence in the bioinformatics there has been advanced enhancements in the technologies that supply the initial raw data. Recently in the studies of Anthony Kerlavage of Celera has stated that an experimental laboratory is able to produce data over 100 Gigabytes within a day easily [9, 14, 32]. To match the incredibly large data the computer employed should be able to deal with the data with incredible processing powers. The more the data processing is there more is the challenges in the computing field. The computer employed should be of high efficiency, larger disk storage, should allow faster computation and should be optimized to its highest possible limit. 2. AIMS OF BIOINFORMATICS The bioinformatics has some simple aim that can be explained in three-folds. Firstly, the bioinformatics should organize and optimize the data in such a way that it can be easily accessible by the researchers and new entries in the database can be made easily [6, 7, 11]. While curing the data is one of the important tasks, hence, the information that is stored in the databases are not fruitful until and unless they are analysed. Therefore, the functionality of bioinformatics is further extended. Secondly, the aim of bioinformatics is to develop techniques and resources that will help in the analysing phase of data. For instance, if some sequence of protein is found, then it is of researcher’s interest to compare it with the previously characterized sequences of protein. This search of the previously matching sequence is not a mere search of data from a stack of data rather it requires programs like FASTA and PSI-BLAST that does the biological significant matching of the sequences. The development in the resources is therefore demanding the expertise in the computational field and thorough understanding of the biology and its related terms. Thirdly, the aim of bioinformatics is to use the provided tools to analyse the given data and interpret the outcomes in a biologically understandable manner [23]. Customarily the biological studies goes on by examining the subject in detail and comparing the gathered data with the related previously characterized data. In bioinformatics now we can compare the database globally which has led to uncovering general principles that may apply around various present system and helps to highlight the novel features present. The biological data gathered in the examining process goes through many processing stages which includes areas such as computer science, mathematical analysis, statistics and engineering. 3. HISTORY On the basis of importance of data transmission, accumulation and processing in biological systems, PaulienHogeweg, discovered the term “Bioinformatics” in 1978. The term implies to the study of data processes in biotic systems [3]. Because of its definition this field is similar to biochemistry or biophysics. The data accumulation and maintenance in models of prebiotic evolution and social interaction structures by simple behavioural rules are the relevant examples of biological information processes which were studied earlier under bioinformatics [5].
  • 3. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME 177 Elvin A. Kabat, contributed to bioinformatics with his comprehensive volumes of antibody sequences which was released with Tai Te Wu between 1980 and 1991. Margaret Oakley Dayhoff is another significant pioneer in this field who has been hailed by David Lipman, director of the National Centre for Biotechnology Information, as the "mother and father of bioinformatics [16, 20, 27]. The term bioinformatics was re-invented at the beginning of the "genomic revolution", to refer to the creation and maintenance of a database to store biological data such as amino acid sequences and nucleotide sequences [8, 10]. This type of database included not only design issues but also the development of complex interfaces so that the researchers could easily access the existing data and also submit the revised or new data. 4. GOALS The biological data are combined together to study the alterations of cellular activities in various disease states [14, 25]. The field of bioinformatics includes protein domains, amino acid and nucleotide sequences and protein structures which emerged this field in such a way that now it includes the interpretation and analysis of several types of data. The actual term for analyzing and interpreting data is known as computational biology [31, 33]. The vital sub-disciplines within computational biology and bioinformatics include: • The implementation and the development of tools that enables effective access for use and management of several kind of data. • The improvement in statistics and new mathematical formula to access the interaction among members of large data sets. For instance, the protein structure and/or function and cluster protein sequences are predicted into the families of related sequences in order to locate a gene within a sequence. The initial goal of bioinformatics is to escalate the understanding of biological processes. Its focus on improving and applying computationally intensive methods to achieve this goal, makes this field different from other approaches. Samples include: data mining, machine learning algorithms, pattern recognition and visualization [21]. Key research efforts in the field include gene finding, drug design, sequence alignment, drug discovery, genome assembly, protein structure prediction, protein structure alignment, genome-wide association studies, prediction of gene expression and protein- protein interactions, and the modeling of evolution [18]. In order to solve practical and formal problems that rise from the analysis and the management of biological information, bioinformatics [29] requires the creation and development of algorithms, databases, statistical and computational methods. The combination of continuous development in information technologies and genomic and other molecular research [14] technologies has given rise to a large amount of data related to molecular biology. Thus, the name given to these computing and mathematical approaches is “bioinformatics” for the understanding of biological processes. 5. APPROACHES Mapping and analysing DNA and several protein sequences and aligning them with different DNA [13] and protein sequences in order to compare them and then creating the 3-D models of these sequences and viewing it, these are the common activates in bioinformatics.
  • 4. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME 178 The two ways of modelling a biological system for instance, living cell are as follows: 1) Static : sequences- Peptides and Nucleic acids and Proteins Relationship among the above entities including metabolites, microarray information and Networks of proteins 2) Dynamic: Structures- Nucleic acids, Proteins, Peptides and Ligands System biology, under this category, includes variable concentrations of metabolites and reaction fluxes 6. MAJOR RESEARCH AREAS The major research areas of bioinformatics is bioinformatics and computational biology, genetics and genomics and lastly system biology. Bioinformatics has emerged as an essential part of numerous areas of biology [33]. In starting level of experimental biology many image processing techniques and signal processing techniques are combined and used to encapsulate the important outcomes from a huge quantity of raw data. Bioinformatics has been working in the field of genetics and genomes in which it helps in sequencing and interpreting genomes[14] and simultaneously observes their transformations. It plays an essential role in making a note of the recorded data and arranging it sequentially in an organized manner in the databases. It has a role in the analyzing stages of genes and protein sequences. In the area of system biology it aids in the modelling and simulation of several microscopic level things like the RNA, DNA[23,25] and the protein structures and also their interaction at molecular level of studies. 7. SEQUENCE ANALYSIS Sequence analysis is basically a process in which the protein sequence, DNA or RNA are subjected to the broader range of methods that analytically analyzes the subjects in order to understand its characteristics in detail. When we talk about a characteristics of the peptide sequences we talk about its features, its functions, its structure and its evolution. The analytical methods used consist of the alignment of the sequences[12,32] and comparison of the sequence with the previous biological databases. The sequence analysis of the functions include a wide range of similar topics: • The comparison of the sequences with the biological databases to find the similarities between them and also infer whether they are homologous [14, 22]. • Intrinsic features of the sequence is identified using the sequence analysis. • Genetic marking based upon the differences and variation from the biological database sequences is used to identify the signals. • Finally after comparing the new sequence with the old database we reveal the genetic diversity and evolution of the new sequence or organism. • Sequence alone is sufficient to identify the molecular structure of the organism. Intrinsic features mentioned in one of the points above refer to the features like post translation modification sites, active sites, gene structures, reading frames, distribution of exons, introns [5, 9] and other regulatory elements. Sequence analysis is followed in a particular fashion. The main contents in the analysis are the sequence alignment which is followed by profile comparison. The profile comparison is done by aligning the new sequence with the old database. After the profile comparison the next stage is of the sequence assemble and gene prediction after which the protein structure prediction takes place.
  • 5. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME 179 8. GENOME ANNOTATION Genome can be defined as a complete set of DNA within a single cell of organism. The very first software system that can explain the genome was designed by Owen White in 1995. He with his team analyzed and sequenced the very first genome of the bacterium haemophilus influenza. Genes are defined as the fragments of genomic sequence [6, 8] that encode the protein within. Owen White then build a software that was capable of finding the genes, transferring RNA’s and moreover can assign initial functions to the genes. Since the development in the computer field is tremendous hence the programs now available are more advanced, changing and improved in its own ways. 9. COMPUTATIONAL EVOLUTIONARY BIOLOGY The study of the origin of a species, its descent as well as the change over time is known as Evolution biology [3]. In simple words the meaning of evolution biology can be judged from its name itself i.e. it is the study of an evolution of species. The invention of bioinformatics in evolutionary biology has enabled the researchers to trace the descent of an uncountable number of species. Also bioinformatics has aided in the study of the evolution of more complex species that include the cases of gene[2,10] duplication, gene transfer horizontally, etc. It has also helped in estimating the result of any population over time. Bioinformatics has main usage in tracking and sharing the information of a huge number species and organism. 10. LITERATURE ANALYSIS As the number of researcher are growing, in the same manner the number of papers published are increasing at a tremendous rate [1, 5]. Hence to get into the expanding library of the textual resources, literature analysis is required. Literature analysis targets to make use of various computational and statistical linguistics for this analysis. It uses abbreviation recognition, named entity recognition and protein- protein interaction for the same. In abbreviation recognition it finds out the full form of the abbreviated biological [18, 20] terms. In named entity recognition it recognizes the other biological terms like the names of the genes etc. whereas in the protein-protein interaction it identifies that which protein is interacting with which protein from the given text. 11. ANALYSIS OF GENE EXPRESSION To analyze the gene expression we have to measure its mRNA level. There are various techniques to measure the mRNA level, some of which are microarrays, serial analysis of gene expression, expressed cDNA sequence[17,32] tag (EST) sequencing, massively parallel signature sequencing (MPSS), RNA-Seq. that is also called as "Whole Transcriptome Shotgun Sequencing" (WTSS). The techniques mentioned above are all noise prone. Most of the efficiency and time in the computational biology is absorbed in developing various statistical tools that are capable of separating the noise from signal. 12. ANALYSIS OF REGULATION Before analysis of regulation we should first understand what regulation is all about. Regulation can be defined as a complex arrangement of the events[12,21] initiating with a hormone or any extracellular signal that eventually leads to the increment or the decrement in one or more activity of any number of proteins. For the in-depth study of the process bioinformatics is required. To infer about a gene regulation expression data can be used [14]. If the analysis is restricted to a
  • 6. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME 180 single celled species then the various stages of the cell cycle can be compared under different conditions of stress i.e. under heat, shock, etc. After getting the expression data we can apply the clustering algorithm in order to tell that which of the genes are co-expressed. Clustering algorithm used for gene clustering are like k-means clustering, SOMs (self-organizing maps), consensus clustering and hierarchical clustering. 13. ANALYSIS OF PROTEIN EXPRESSION The proteins present in the biological sample can be judged by providing a snapshot using the protein microarrays and high throughout (HT) mass spectrometry (MS). These snapshots can be turned into sensible data using bioinformatics [9, 12]. It faced a problem of matching large quantity of data against the previously available protein database and also there were multiple complicated statistical analysis of samples whereas it detected the incomplete peptides from each protein. 14. COMPARATIVE GENOMICS Comparative genomics is a biological research field that is used for the comparison of the genomic features of different organisms. The features that are compared between organisms are generally the DNA sequences, the genes, the regulatory sequences, the gene order and various other genomic structural landmarks[41]. The comparative genomics has the ability to find out the similarities as well as differences in biological terms between different species. Evolutionary relationships between the alike organisms can also be found out by using comparative genomics. The principle of the comparative genomics is that each and every organism has its features encoded within its DNA and these features are evolutionary conserved within them. Hence, the common features of two organisms are also encoded within their DNAs [4, 6]. Therefore, comparative genomics makes some sort of alignment of the genome sequences and then finds the common sequence in the aligned sequences. Finally it verifies that to which extend are the sequences conserved. 15. NETWORK AND SYSTEMS BIOLOGY 15.1. Network Biology The relationships between biological networks such as the metabolic network or the protein- protein network are understood by doing the network analysis. Network biology seldom contains different data types like small molecules, proteins[31], data of gene expression and many other types that may be connected physically or/and functionally. A biological network can be formed from a single type of genes or molecules but still the above mentioned data types are always attempted to be integrated [19]. 15.2. Systems biology In order to visualize and analyze the complex connections of the cellular processes the computer simulation of cellular subsystems are used. Cellular subsystem [23] comprises a network of enzymes and metabolites which includes metabolism, gene regulatory networks and signal transduction pathways. With the aid of the simple (artificial) life forms computer stimulations, the virtual evolution also known as artificial life tries to comprehend the evolutionary processes [9].
  • 7. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME 181 16. HIGH-THROUGHPUT IMAGE ANALYSIS The biomedical imagery has a lot of information stored into it. For complete analysis of these images computer technologies are deployed which in-turn accelerate or sometimes fully automate the processing, analysis[22] and quantification of the large amount of data stored in the biological images. The analysis systems in the modern times has increased the ability of the observer to take measurements from larger distances and has also improved the accuracy, speed or objectivity. Some of the modern fully developed analysis systems has completely removed the observer. These enhanced systems[29] are not only related to the field of biomedical imaging. Nowadays biomedical imaging is becoming useful for both the research and diagnostics purposes. 17. SOFTWARE AND TOOLS Several software tools used for bioinformatics are varied from simple command-line tools to more complex graphical programs. Further the software and tools in bioinformatics are classified as follows: 17.1. Open-source bioinformatics software Since 1980s, there have been many free and open source software tools that existed and are further continuing to grow. Regardless of the funding arrangements, the association of several open code bases that are freely available, innovation in silicoexperiments [6, 27] and necessity for new algorithms for the analysis of evolving types of biological readouts have helped all research groups to contribute to both the variety of open source software available and bioinformatics. These open source tools often serve as the brooding place for ideas, or community supported plug-ins in commercial applications [25, 28]. The variety of open-source software packages involves titles such as BioPerl, BioJava, Taverna workbench, Bioconductor, UGENE [31], Biopython, BioRuby, EMBOSS, .NET Bio and Bioclipse. The annual Bioinformatics Open Source Conference (BOSC) has been supported by the non-profit Open Bioinformatics Foundation since 2000, to maintain the tradition create further opportunities. 17.2. Web services in bioinformatics For a wide range of applications in bioinformatics, several interfaces based on SOAP and REST [16, 17] have been developed which allows an application to run on one computer in one part of the world so as to use information, algorithms and computing resources on servers in other parts of the world. The main advantage of using web services in bioinformatics is that there is no need for the end users to deal with database maintenance overheads and software. EBI further sub-divides the basic bioinformatics services into three categories which are as follows: 1) BSA (Biological Sequence Analysis) 2) SSS (Sequence Search Services) 3) MSA (Multiple Sequence Alignment) The applicability of web-based bioinformatics solutions is illustrated through the availability of these service oriented bioinformatics resources that varies from an assembly of standalone tools having a common information pattern under a single web-based interface[28], to distributed, integrative and extensible bioinformatics workflow management systems.
  • 8. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME 182 17.3. Bioinformatics workflow management systems Bioinformatics workflow management system is defined as a specialized kind of a workflow management system. The main aim of this system is to constitute and execute a series of computational or information manipulation steps, or a workflow in a bioinformatics application. This system helps scientists to create their own workflows[11, 19] for individual application by providing them an easy to use environment. It also provides them various interactive tools that enable them to execute their workflows and view the results in real time. This system simplifies the process of reusing and sharing workflows between the scientists. Thus, it helps the scientists to track the workflow creation steps and the provenance[17] of the workflow execution results. Galaxy, Kepler, Anyaya and TavernaAnduril are four such platforms that provide the above mentioned service currently. 18. PRACTICAL APPLICATIONS 18.1. Finding Homologues The search for similarities between unlike biomolecules is one of the driving forces behind bioinformatics. The identification of protein homologues has some direct practical uses in bioinformatics, apart from the systematic organization [22, 24] of information. The key application of this, is transferring of information between related proteins. For instance, it is possible to look for homologues that are better understood and also applies some of the knowledge as compared to a given poorly characterized protein. The theoretical models of proteins are usually built on experimentally solved structures of close homologues specifically in the case of structural data. The method used in fold recognition is same as the above mentioned technique in which tertiary structure predictions rely on searching structures of remote homologues[26,30] and then further it is checked that whether the prediction is energetically viable or not. Wherever there is lack in structural and biochemical data, the studies could be made in low-level organisms such as yeast and the results of these experiments could be applied to high level organisms like humans, where the experiments are more demanding. A corresponding approach is also applied in genomics. Basically homologue finding is used in newly sequenced genomes to confirm coding regions and the functional information is recurrently transferred to annotate individual genes [31]. The problem of understanding complex genomes is simplified by first analysing simple organisms and then applying the same techniques to high level complicated organisms. This is applicable on large scale and is one of the reasons why the projects of early structural genomics focussed on Mycoplasma genitalium[17]. Ironically, this identical idea can also be applied vice versa. By checking the missing microbial proteins in humans, one can quickly discover the potential drug targets. Drug molecules may be designed on a small scale by harnessing the structural differences between alike proteins, that specifically bind to one structure but not another. 18.2. Rational Drug Design Assisting rational drug design has been one of the earliest medical applications of bioinformatics. Using the MLH1 gene[14] product is the commonly cited approach as an example of drug target. MLH1 is a human gene which is used to encode a mismatch repair protein (mmr) that is situated on the shortarm of chromosome [1, 5]. After going through the linkage analysis, it has been observed that MLH1 gene is similar to the mmr genes in mice, therefore the gene has now been implicated in nonpolyposis colorectal cancer. Translation software can be used to determine the probable amino acid sequence of the encoded protein when the nucleotide sequence is given. Firstly homologues are found in model organisms by using sequence search techniques and then on the basis of the similarity of the sequence, the structure of the human protein is modeled based on the
  • 9. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME 183 experimentally characterized structures [22, 23]. Finally, to bind the model structure, molecules are designed using docking algorithms. 18.3. Large-scale censuses The bioinformatics database is capable of storing all the information and sequences of the protein, genomes and their structure and datasets expression[29] but on the other hand it is very important to condense that information in the understandable form by the user. In order to identify the subjects and areas of interest for further analysis[2,4] and detailed study the Broad generalization helps in doing so and also it aids in placing the new observations at appropriate place. This sorting assists the user to judge whether the content is unusual in any pf the possible ways. The surveys held at a particularly large scale can give rise to a number of questions related to bioinformatics. Questions such as whether the protein fold are related to a particular groups of phylogenetic? In a particular organism how the different folds are common? What percentages of the folds are commonly shared between similar organisms? We have also integrated data on to the protein functions. We know that certain protein folds are responsible for the particular biochemical functions. The sharing of the folds by an organism indeed follows the routine phylogenetic classification [6, 11]. A query also arises that to which extent of sharing parallel the deprived relatedness are derived from the traditional evolutionary trees? Expression data is the most electrifying and latest source of genomic information. The question after the combination of the expression information with the functional and the structural classification of the protein the high presence of the protein fold in a genome is the revealing of the higher levels of expression. The subcellular[22] localization of the protein and the interactions between them is included in the surveys at larger scale to get the genomic data. Along with the union of the structural data we can start by executing the maps of all the protein-protein interaction that takes place within an organism. 18.4. Further applications in medical sciences The current application in the biomedical sciences has been wrapped around the gene expression analysis. In the biomedical disease detection the compiled expression data of the affected cell due to any disease like cancer, etc. is compared with the normal expression levels of the unaffected cells. The gene expressed by the affected cells provides us with the cause of the disease and therefore indicates the various possible treatments for the same using specialized drugs[23].Given a lead compound, microarray trials can then be helpful in evaluating the responses to the pharmacological intervention, moreover it aids in predicting the toxicity [26, 29] of the applied drugs and is used to facilitate with the early test results. The combination of experimental genomics with the advancement in bioinformatics is predicted to bring about a revolution in the future of healthcare for the individuals. For a patient a distinctive scenario may arise with post natal genotyping in order to assess immunity from specific pathogens and diseases. Therefore, keeping in mind this information, in order to reduce the costs related to healthcare, a distinct combination of vaccines could be prescribed saving the unnecessary treatments and expecting the onslaught of diseases later in life. 19. REFERENCES 1. Waterman, Michael S. (1995). Introduction to Computational Biology: Sequences, Maps and Genomes. CRC Press.ISBN 0-412-9991-0. 2. Mount, David W. (May 2002). Bioinformatics: Sequence and Genome Analysis. Spring Harbor Press. ISBN 0-879-69608-7.
  • 10. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME 184 3. Claverie, J.M.; Notredame, C. (2003). Bioinformatics for Dummies. Wiley. ISBN 0-765- 1696-5. 4. Hogeweg, P. (2011). "The Roots of Bioinformatics in Theoretical Biology". In Searls, David B. PLoS Computational Biology 7 (3):. 5. Hesper B, Hogeweg P (1970). Bioinformatica: eenwerkconcept 1 (6). Kameleon. pp. 28–29. 6. Hogeweg, P. (1978). "Simulating the growth of cellular forms". Simulation 31 (3): 90–96.doi:10.1177/001754977803100305. 7. Johnson, George; Tai TeWua (January 2000). "Kabat Database and its applications: 30 years after the first variability plot". Nucleic Acids Res 28 (1): 214–218. doi:10.1093/nar/28.1.214. PMC 102231.PMID PMC102231. 8. Moody, Glyn (2004). Digital Code of Life: How Bioinformatics is Revolutionizing Science, Medicine, and Business. ISBN 978-0-471-32788-2. 9. Attwood TK, Gisel A, Eriksson N-E, Bongcam-Rudloff E (2011). "Concepts, Historical Milestones and the Central Place of Bioinformatics in Modern Biology: A European Perspective". Bioinformatics - Trends and Methodologies. InTech. Retrieved 8 Jan 2012. 10. Sanger F, Air GM, Barrell BG, Brown NL, Coulson AR, Fiddes CA, Hutchison CA, Slocombe PM, Smith M (February 1977). "Nucleotide sequence of bacteriophage phi X174 DNA". Nature 265 (5596):. 11. Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL (January 2008). "GenBank". Nucleic Acids Res.6 (Database issue): D25–30. doi:10.1093/nar/gkm929. PMC 2218932. PMID 18073190. 12. Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb JF, Dougherty BA, Merrick JM (July 1995). "Whole-genome random sequencing and assembly of Haemophilus influenzae Rd". Science 269 (5223): 496–512. Bibcode:1995Sci...269..496F. doi:10.1126/science.7532800. PMID 7532800. 13. Antonio Carvajal-Rodríguez (2012). "Simulation of Genes and Genomes Forward in Time". Current Genomics(Bentham Science Publishers Ltd.) 11 (1): 58–61. doi:10.2174/118920210790218007. PMC 2851118.PMID 20808525. More than one of |work= and |journal= specified (help). 14. Abu-Jamous B, Fa R, Roberts DJ, Nandi AK, Peddada SD (11 February 2013). "Paradigm of Tunable Clustering Using Binarization of Consensus Partition Matrices (Bi-CoPaM) for Gene Discovery". In Peddada, ShyamalD.PLoS ONE 8 (2): e56232. doi:10.1171/journal.pone.0056232. PMC 3169326. PMID 209186. 15. Abu-Jamous B, Fa R, Roberts DJ, Nandi AK (24 January 2013). "Yeast gene CMR1/YDL156W is consistently co-expressed with genes participating in DNA-metabolic processes in a variety of stringent clustering experiments". Journal of the Royal Society Interface 10 (81): 20120990–20120990. doi:10.1098/rsif.2012.0990. 16. "Open Bioinformatics Foundation: About us". Official website. Open Bioinformatics Foundation. Retrieved 10 May 2011. 17. "Open Bioinformatics Foundation: BOSC". Official website. Open Bioinformatics Foundation. Retrieved 10 May 2011. 18. ROSALIND: An Addictive Bioinformatics Learning Site. 19. ROSALIND (Project Euler for bioinformatics). 20. Researchers Launch Innovative, Hands-on Online Tool for Science Education. 21. Rosalind is a platform for learning bioinformatics through problem solving. 22. "Q&A: UCSD's Philip Compeau Discusses Rosalind, a Problem-Oriented Bioinformatics Education Platform". Bioinform. 5 Oct 2012. 23. Achuthsankar S Nair Computational Biology & Bioinformatics – A gentle Overview, Communications of Computer Society of India, January 2007.
  • 11. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 4, April (2014), pp. 175-185 © IAEME 185 24. Aluru, Srinivas, ed. Handbook of Computational Molecular Biology. Chapman & Hall/Crc, 2006. ISBN 1-58488-46-1 (Chapman & Hall/Crc Computer and Information Science Series). 25. Baldi, P and Brunak, S, Bioinformatics: The Machine Learning Approach, 2nd edition. MIT Press, 2001. ISBN 0-262-02506-X 26. Barnes, M.R. and Gray, I.C., eds., Bioinformatics for Geneticists, first edition. Wiley, 2003. ISBN 0-470-8294-2. 27. Baxevanis, A.D. and Ouellette, B.F.F., eds., Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, third edition. Wiley, 2005. ISBN 0-471-47878-4. 28. Baxevanis, A.D., Petsko, G.A., Stein, L.D., and Stormo, G.D., eds., Current Protocols in Bioinformatics. Wiley, 2007. ISBN 0-471-25093-7. 29. Cristianini, N. and Hahn, M. Introduction to Computational Genomics, Cambridge University Press, 2006. (ISBN 9780521671910 | ISBN 0-521-67191-4). 30. Durbin, R., S. Eddy, A. Krogh and G. Mitchison, Biological sequence analysis. Cambridge University Press, 1998. ISBN 0-521-62971-3. 31. Gilbert, D. Bioinformatics software resources. Briefings in Bioinformatics, Briefings in Bioinformatics, 2004 5(3):300–304. 32. Keedwell, E., Intelligent Bioinformatics: The Application of Artificial Intelligence Techniques to Bioinformatics Problems. Wiley, 2005. ISBN 0-470-02175-6. 33. Kohane, et al. Microarrays for an Integrative Genomics. The MIT Press, 2002. ISBN 0-262- 11271-X. 34. Lund, O. et al. Immunological Bioinformatics. The MIT Press, 2005. ISBN 0-262-12280-4. 35. Ali E. Taki El_Deen, Mohy E. Abo-Elsoud and Salma M. Saif, “Highly Secured Ciphered - Watermarked Biomedical Images”, International Journal of Advanced Research in Engineering & Technology (IJARET), Volume 5, Issue 1, 2014, pp. 123 - 137, ISSN Print: 0976-6480, ISSN Online: 0976-6499. 36. J.Rajarajan and Dr.G.Kalivarathan, “Influence of Local Segmentation in the Context of Digital Image Processing – A Feasibility Study”, International Journal of Computer Engineering & Technology (IJCET), Volume 3, Issue 3, 2012, pp. 340 - 347, ISSN Print: 0976 – 6367, ISSN Online: 0976 – 6375. 37. J Ashvini Bhamare, Prof. Sachin Sonawane and Prof. Shashikantpatil, “Comprehensive Assessment, Performance Overview and Review of All Existing Techniques of Reversible Image Watermarking for Medical Imaging”, International Journal of Computer Engineering & Technology (IJCET), Volume 4, Issue 6, 2013, pp. 95 - 102, ISSN Print: 0976 – 6367, ISSN Online: 0976 – 6375.