SlideShare uma empresa Scribd logo
1 de 7
Baixar para ler offline
TELKOMNIKA Telecommunication, Computing, Electronics and Control
Vol. 19, No. 1, February 2021, pp. 310~316
ISSN: 1693-6930, accredited First Grade by Kemenristekdikti, Decree No: 21/E/KPT/2018
DOI: 10.12928/TELKOMNIKA.v19i1.16381  310
Journal homepage: http://journal.uad.ac.id/index.php/TELKOMNIKA
A genetic algorithm approach for predicting ribonucleic acid
sequencing data classification using KNN and decision tree
Micheal Olaolu Arowolo, Marion Olubunmi Adebiyi, Ayodele Ariyo Adebiyi
Department of Computer Science, Landmark University, Omu-Aran, Kwara State, Nigeria
Article Info ABSTRACT
Article history:
Received Apr 14, 2020
Revised Jun 9, 2020
Accepted Sep 24, 2020
Malaria larvae accept explosive variable lifecycle as they spread across
numerous mosquito vector stratosphere. Transcriptomes arise in thousands of
diverse parasites. Ribonucleic acid sequencing (RNA-seq) is a prevalent gene
expression that has led to enhanced understanding of genetic queries. RNA-
seq tests transcript of gene expression, and provides methodological
enhancements to machine learning procedures. Researchers have proposed
several methods in evaluating and learning biological data. Genetic algorithm
(GA) as a feature selection process is used in this study to fetch relevant
information from the RNA-Seq Mosquito Anopheles gambiae malaria vector
dataset, and evaluates the results using kth nearest neighbor (KNN) and
decision tree classification algorithms. The experimental results obtained a
classification accuracy of 88.3 and 98.3 percents respectively.
Keywords:
Decision tree
Genetic algorithm
KNN
Mosquito anopheles
Ribonucleic acid sequencing This is an open access article under the CC BY-SA license.
Corresponding Author:
Micheal Olaolu Arowolo
Department of Computer Science
Lamdmark University
Omu-Aran, Kwara State Nigeria
Email: arowolo.olaolu@lmu.edu.ng
1. INTRODUCTION
Next-generation high-throughput sequencing technology has created profuse wide-ranging datasets,
this enormous data expanse helps biologists to analyze and perform daunting gene transcripts, such as disease
related and RNA such as infections (malaria), cancer, inherited, genetics, physiology, among others [1].
Blood-sucking mosquitoes such as Mosquito Anopheles with vectors of malaria plasmodium falciparum are
found in Africa. Mosquito Anopheles is a deadly malaria parasite, responsible for demises of thousands of
humans daily. Antimalaria combat suppositories blowouts, state-of-the-art antimalarials treatment upsurges,
fetching for ground-breaking medications requires improved biotic studies of this infenctions. The parasite
tolerates precise parameter of gene expression query enormously and necessitates making enhanced thorough
extrapolative model transcriptions of vectors [2].
Approachable revealing genetic inquiries have been made in ribonucleic acid sequencing (RNA-seq)
study by unfolding a cautious purposeful biological strategy for enhancement of the learning. RNA-Seq data
requires removal of expletive high-dimension, such as; noises, complaints, repetition, irrelevant, inactivity,
unfitting data, and others [3]. New capabilities strengthen solutions to the development of ground-breaking
healthcare frameworks such as effective public wellbeing nursing systems, advanced interventions and medical
diagnosis and disorders [4].
Machine learning means have been established with convincing uniqueness to investigate the
enormous amount of cutting-edge RNA-Seq knowledge by studying the naturally material structures [5].
Scientists have used machine learning algorithms with relevant achievement for gene expression data results
TELKOMNIKA Telecommun Comput El Control 
A genetic algorithm approach for predicting ribonucleic acid sequencing data… (Micheal Olaolu Arowolo)
311
of RNA-Seq [6-8]. In this study, a genetic algorithm (GA) pre-processor, to obtain reduced dimensionality of
data with kth nearest neighbor (KNN) and decision tree classifiers are proposed to classify discrete genetic
structures and obtain advances that are suitable system for predicting and detecting innovative genes for malaria
ailments in human.
2. REVIEWS
Computational procedures are based on enormous samples of individuals genes with or without
diseases, mutations may be found accountable for the precense of diseases. Differential expressed genes (DEG)
are defined through some methods. Machine Learning measures are important for spotting the variation
between genes found from human genome. Machine learning techniques have been emulated severally in
investigating and classifying various profiles of diseases gene expressions. Various machine learning
approaches are reported and reviewed, using recent trends in the evaluations [4].
Machine learning for predicting Autism spectrum ailment was experimented and classify transcripts,
using RNA data from gene omnibus expression data. This study ranked cluster analysis and relatively
discriminated, using SVM and KNN classifiers, an estimate accuracy of 94% was achieved [9]. Clustering and
classification of RNA-Seq data was carried out by performing a mutual valuation, and emphasizing the
expertise and ploys of methods occurring in recent time as predominant shifts, uing nonlinear and linear
dimension reduction systems, by combining scRNA-seq data [10]. Group of RNA-Seq genes for ranking genes
set of huge ensembles using a supervised learning approach was carried out using random forests classification
method, on 1210 samples of tumor RNA-Seq datasets showed hidden supervised learning selection approaches
necessity on analysis [11]. A supervised single-cell RNA-Seq data classification model was proposed using a
comprehensive approach by combining independent feature selection approaches. scPred RNA-seq datasets
showed high accuracy [12]. RNA-DNA machine learning analysis was proposed to indicate small genome
expression to influence PAH ailment, feature selection algorithm was proposed to classify relevant genes with
an outcome that reveals unique PAH [13].
Stomach tumor gene expression data using CNN classification procedure was developed based on
deep learning approach, 60,000 data made up of stomach tumor genes were evaluated using PCA), heatmaps,
and CNN algorithms with an accuracy of 96% and 51% [14]. RNA-Seq hidden transcripts in malaria parasites
was proposed by relating variations of procedures to deconvolute transcriptional differences for distinct
mosquitos and revealed hidden distinct transcriptional signatures [15].
An ensemble classification algorithm for cancer dataset was developed using decision tree, ensemble
decision trees algorithms on available cancerous microarray, the results enhances than the decision trees
classification [16]. An investigative cancer gene expression ensemble classification method was
proposed using a hybrid RFE-Adaboost algorithm to fetch significant features for enhancing classification
performancet [17]. Classification of cancer data was carried out using an effective ensemble classification
method by increasing the classification, the result were less contingent [18]. A metaheuristics system for
fetching relevant RNA/DNA data genes for classification was proposed by briefing recent developments of
metaheuristic-based methods in embedded feature selection methods, useful data for operatives for ranking
coefficients of SVM classifier is used [19]. A GA presenting a state-of-the-art approach was proposed using
filter-wrapper based feature selection on five biological datasets, the results showed an important reduction of
features for classification [20]. An enhanced ensemble classification for certain features was proposed for
learning an ensemble-based feature selection approach with random trees using a subset, the method removes
the unfitting structures and picks the best structures by means of a probability weighing value for classification
evaluation using RF, SVM, and NB [21]. Review of several feature extraction algorithms for gene expression
investigation, such as the PCA, ICA, PLS, and LLE was carried out and discussed for the purpose of the
machine learning applications [22].
3. MATERIALS AND METHODS
Several high-dimensional data enhancement methods are in place, this paper carries out a feature
selection using GA technique and ensemble classification algorithm for fetching relevant information in a huge
dimensional data and classification. A western Kenya RNA-Seq data mosquitos’ genes with 2457 instances
with 7 gene attributes [23], MATLAB environment tool is used carry out the experiment using GA to select
relevant subset of features from the dataset as shown in Table 1, Ensemble algorithm approach is used as a
classifier on the selected features [24].
 ISSN: 1693-6930
TELKOMNIKA Telecommun Comput El Control, Vol. 19, No. 1, February 2021: 310 - 316
312
Table 1. Features of the data
Dataset Attributes Instances
Mosquito Anopheles Gambiae 7 2457
3.1. GA
GA is a wrapper-based feature selection approach that examines suitable features from a given high
dimensional datasets, with numerous parameter procedures, where mutation and crossover operators are
associated with relevant recognized binary constraints features [19]. RNA features with N number denotes a
feature having selected and unselected values 1 and 0 respectively. GA is important and helps to find feature
subset models with selected figure features for composite classifications. GA structure is adopted and
well-defined in Algorithm 1 [20]:
Algorithm 1. Genetic Algorithm
Necessitate: Set parameters nPop = m, tmax, t = 0;
Confirm: Optimal feature subset with the maximum suitable value.
1: while (t<=tmax) do
2: Create pop m, tmax;
3: For k = 1 to m do
4: Parents [m1, m2] = system selection (m, nPop)
5: Child = Xor [m1, m2]
6: M u = mutation [Child}
7: End for
8: Replace m with Child1, Child2, …, Childm
9: t = t+ 1;
10: End while
11: Store the Highest fitness value;
m = population size, r = random number 0 to 1, chrome = certain or non-certain feature through threshold δ,
set value = 0.5, and α = threshold number of picked features. Selecting maximum fit features from the
predictable datasets is the main problem of the GA technique.
3.2. KNN
A supervised learning K-nearest neighbor classification technique for gene datasets, performs
neighborhood classification evaluation value of innovative application occurrence. KNN algorithm classifies
innovative entity developed on instances, attributes as well as training models. KNN classifiers do not train
models to fit but built on retention. The features selected are assumed as input to segments. The K value of
nearest neighbors are selected nearest to the query spot. Detachment between query-instance and training
models are considered and sorted based on the Kth
minimum determined distance. Group Y of the nearest
neighbors is fetched. The unassuming common of the group of nearest neighbors as the estimate amount of the
query instance is used. Bonds can fragment randomly [25].
3.3. Decision trees
Decision tree classification algorithm divides recursively instance spacing with hyperplanes
orthogonally. Decision tree model assembles derivative nodes signifying attributes, based on instance space
attribute value roles selected inversely for algorithms, using its values. Advanced data sub-space iteratively
divides till end principle is determined and terminal nodes (leaf nodes) are allocated to class labels
characterizing the classification. Accurate conventional end procedure is a significant tree with too huge,
overfitted and trivial trees, underfitted and suffers loss in accuracy. Algorithms have assembled overfitting
strategies, labelled trimming, classifying new instances by leading the tree basis down a leaf, with respect to
the examination result along the pathway [26]. Competent models are discovered using decision tree classifiers
and ensembles, with unbalanced varying trained datasets, with resultant models totally unalike.
3.4. Performance evaluation and applications
Machine learning model need evaluation and validation of performance metrics using a confusion
matrix and its formula [4, 27]. Expression of gene analysis suggest enhanced RNA-Seq data path identification,
to learn applicable helpful genes in advancing applications such as treatment modifications, diseases diagnosis,
drugs and gene discoveries, classification of cancers, typhoid, malaria, among other ailments. Designs and
inconsistency findings between machine learning data has discovered great algorithms applicable to many
fields such as engineering, banking, health sectors among others. MATLAB 2015A is proposed as an
experimental and executing tool for the prognosis of malaria infections on an iCore2 processor, 4GB RAM
size, 64-bit System.
TELKOMNIKA Telecommun Comput El Control 
A genetic algorithm approach for predicting ribonucleic acid sequencing data… (Micheal Olaolu Arowolo)
313
4. RESULTS AND ANALYSIS
In this study, 2457 instances of RNA-Seq dataset “Mosquitoes Anopheles Gambiae” containing
resistants and susceptibles of genes is used on a GA to draw optimal reduced number of subsets in the data,
taking away uncorrelated attributes to pick maximum variance features. The result shows important gene
evidence suitable for KNN and decision tree classification algorithm study on MATLAB environment for the
model experiment. Genetic algorithm makes use of 0.5 threshold and achieves 708 optimal subset features of
significant genes. Classifiers used 10-folds cross validation was used on KNN and decision tree classifiers, to
implement evaluations of the model’s performance with 0.05 holdout training data parameter and classifier
accuracy tests the data with 25%. A learning classification procedure evaluation train and test evaluates using
10-fold cross validation to remove the partiality in sampling. The performance metrics and time computation
is evaluated [27] and relates the model classification performance, by means of KNN (bagging) and decision
tree, with 98.3% and 88.3% accuracy separately using confusion matrix and result outpus as shown in
Figure 1. Related components were fetched by GA from the full data shown in Figure 1, the subset data features
pass into KNN as well as decision tree and shows the Confusion matix result in the Figure 2 and Figure 3 to
derive the solution to the performance metrics. KNN classification algorithm achieves an accuracy of 88.3%,
while the decision tree classification algorithm achieved an accuracy of 98.3%, metrics of other performance
are shown in Table 1.
Figure 1. Loaded mosquito anopheles gambiae on MATLAB environment
In this study, RNA-Seq data uses a mosquito anopheles gambiae dataset [28], to test the machine
learning method performance. Genetic algorithm dimensionality reduction model selects 708 subset features
from 2457 features of genes form the data. The selected components were classified using classification
algorithms (KNN and decision tree) performance evaluation. The efficiency of machine learning approach in
genes are shown in the results to confirm the method, the outcomes are revealed and related in Table 2 showing
GA-decision tree outperforms GA-KNN terms of accuracy. In this study, an improved classification of malaria
vector data is analyzed using GA with decision tree and KNN algorithms respectivel, numerous works have
been reviewed, the results prove that GA enhances classification yield for KNN and decision tree.
 ISSN: 1693-6930
TELKOMNIKA Telecommun Comput El Control, Vol. 19, No. 1, February 2021: 310 - 316
314
Figure 2. RNA-Seq confusion matrix using decision tree algorithm TP=38; TN=21; FP=0; FN=1
Figure 3. RNA-Seq confusion matrix using KNN TP=36; TN=17; FP=4; FN=3
Table 2. Performance metrics table for the confusion matrix
Performance Metrics GA-Decision Tree Classification GA-KNN Classification
Accuracy (%) 98.3 88.3
Sensitivity (%) 97.4 92.3
Specificity (%) 100 81.0
Precision (%) 100 90.0
Recall (%) 97.4 92.3
F-Score (%) 98.7 91.1
5. CONCLUSION
In this study, improvements efficience for predicting and detecting malaria ailments in human are
proposed using machine learning dimensionality reduction and classification techniques. GA feature selection
dimensionality reduction and KNN and decision tree classifiers were employed by performing evaluating and
TELKOMNIKA Telecommun Comput El Control 
A genetic algorithm approach for predicting ribonucleic acid sequencing data… (Micheal Olaolu Arowolo)
315
analysing the performance results obtained. This study enhanced malaria vector data classification, and
compared with quite a lot of proposed works in reviews by numerous researchers, the outcomes demonstrates
that, GA dimensionality reduction model helps to develop classification output such as decision tree.
Investigating current works proposed in literature can improve feature selection models and algorithms and
compared with recent other state-of-the-art classification algorithm.
REFERENCES
[1] S. Shanwen, et al., “Machine Learning and Its Applications in Plant Molecular Studies,” Briefings in Functional
Genomics, vol. 19, no. 1, pp. 40-48, 2019.
[2] F. David., et al., “Predicting Gene Expression in the Human Malaria Parasite Plasmodium Falciparum Using Histone
Modification, Nucleosome Positioning, and 3D Localization Features,” PLOS Computational Biology, vol. 15, no. 9, 2019.
[3] M. Arowolo, et al., “A Dimensional Reduced Model for the Classification of RNA-Seq Anopheles Gambiae Data,”
Journal of Theoretical and Applied Information Technology, vol. 97, no. 23, pp. 3487-96, 2019.
[4] S. Karthik, et al., “A Survey on Machine Learning Approaches in Gene Expression Classification in Modelling
Computational Diagnostic System for Complex Diseases,” International Journal of Engineering and Advanced
Technology, vol. 8, no. 2, pp. 182-191, 2018.
[5] N. Johnson, et al., “Biological classification with RNA-seq data: Can alternatively spliced transcript expression
enhance machine learning classifiers?,” Cold Spring Harbor Laboratory Press for the RNA Society, vol. 24, no. 9,
pp. 1119-1132, 2018.
[6] M. Libbrecht, et al., “Machine learning applications in genetics and genomics,” Nat Rev Genetics, vol. 16,
pp. 321-332, 2015.
[7] Z. Jagga, et al., “Classification models for clear cell renal carcinoma stage progression, based on tumor RNAseq
expression trained supervised machine learning algorithms,” BMC Proceedings, vol. 8, no. 2, 2014.
[8] Anopheles gambiae 1000 Genomes Consortium, “Genetic diversity of the African malaria vector Anopheles
gambiae,” Nature, vol. 552, no. 7683, pp. 96-100, 2017.
[9] D. Oh, et al., “Predicting Autism Spectrum Disorder Using Blood-based Gene Expression Signatures and Machine
Learning,” Clin Psychopharmacology Neuroscience, vol. 15, no. 1, pp. 47-52, 2017.
[10] Q. Ren, et al., “Clustering and Classification Methods for Single-cell RNA-Seq Data,” Briefings in Bioinformatics,
vol. 21, no. 4, pp. 1-13, 2019.
[11] W. Stephen, et al., “Using Supervised Learning Methods for Gene Selection in RNA-Seq Case-Control Studies,”
Frontiers in Genetic, vol. 9, no. 297, pp. 1-6, 2018.
[12] J. Alquicira-Hernandez, et al., “scPred: Accurate Supervised Method for Cell-type Classification from Single-cell
RNA-seq Data,” Genome Biology, vol. 20, no. 264, 2019.
[13] S. Cui, et al., “Machine Learning-based Microarray Analyses Indicate Low-Expression Genes Might Collectively
Influence PAH Disease,” PLOS Computational Biology, vol. 15, no. 8, 2019.
[14] H. Shon, et al., “Classification of Stomach Canacer Gene Expression Data Using CNN Algorithm of Deep Learning,”
Journal of Biomedical Translation Research, vol. 20, no. 1, pp. 15-20, 2019.
[15] J. R. Adam, et al., “Single-cell RNA-seq reveals hidden transcriptional variation in malaria parasites,” eLife Sciences,
vol. 7, pp. 1-29, 2018.
[16] A. Tan, et al., “Ensemble Machine Learning on Gene Expression Data for Cancer Classification,” Applied
Bioinformatics, vol. 2, no. 3, pp. 75-83, 2003.
[17] N. Song, et al., “Design and Analysis of Ensemble Classifier for Gene Expression Data of Cancer,” Advancement in
Genetic Engineering, vol. 5, no. 1, pp. 1-7, 2016.
[18] S. Tarek, et al., “Gene Expression Based Cancer Classification,” Egyptian Informatics Journal, vol. 18, no. 3,
pp. 151-159, 2017.
[19] Duval, et al., “Advances in Metaheuristics for Gene Selectio and Classification of Microarray Data,” Briefings in
Bioinformatics, vol. 11, no. 1, pp. 127-141, 2010.
[20] A. Shukla, et al., “A New Hybrid Feature Subset Selection Framework Based on Binary Genetic Algorithm and
Information Theory,” International Journal of Computational Intelligence and Applications, vol. 18, no. 3, pp. 1-11,
2019.
[21] A. Tan, et al., “Ensemble Machine Learning on Gene Expression Data for Cancer Classification,” Applied
Bioinformatics, vol. 3, pp. 1-10, 2003.
[22] K. Kamran, Kiana J. M., Mojtaba H., Sanjana M., Laura B., Donald B., “Text Classification Algorithms: A Survey,”
Information MDPI, vol. 10, no. 150, pp. 2-68, 2019.
[23] B. Mariangela, et al., “RNA-seq analyses of changes in the Anopheles gambiae transcriptome associated with
resistance to pyrethroids in Kenya: identification of candidate-resistance genes and candidate-resistance SNPs,”
Parasites and Vector, vol. 8, no. 474, pp. 1-13, 2015.
[24] G. James, et al., “An introduction to statistical learning with application in R,” New York (NY): Springer; 2013.
[25] J. Bose, et al., “A Hybrid GA/KNN/SVM Algorithm for Classification of Data,” BioHouse Journal of Computer
science, vol. 2, no. 2, pp. 5-11, 2016.
[26] I. Polaka, et al., “Decision Tree Classifiers in Bioinformatics,” Scientific Journal of Riga Technical University,
pp. 110-123, 2010.
[27] M. Arowolo, et al., “A Comparative Analysis of Feature Selection and Feature Extraction Models for Classifying
Microarray Dataset,” Computing and Information System, vol. 22, no. 2, pp. 29-38, 2018.
 ISSN: 1693-6930
TELKOMNIKA Telecommun Comput El Control, Vol. 19, No. 1, February 2021: 310 - 316
316
[28] Mariangela Bonizzoni, et al., “Additional file 4: of RNA-seq analyses of changes in the Anopheles gambiae
transcriptome associated with resistance to pyrethroids in Kenya: identification of candidate-resistance genes and
candidate-resistance SNPs,” 2015. https://figshare.com/articles/Additional_file_4_of_RNAseq_analyses_of_changes
_in_the_Anopheles_gambiae_transcriptome_associated_with_resistance_to_pyrethroids_in_Kenya_identification_o
f_candidateresistance_genes_and_candidateresistance_SNPs/4346279/1
BIOGRAPHIES OF AUTHORS
Micheal Olaolu Arowolo, is a faculty of the Department of Computer Science at Landmark
University, Omu-Aran Nigeria. He holds a Bachelor Degree from Al-Hikmah University,
Ilorin, Nigeria and a Masters Degree from Kwara State University, Malete Nigeria, he is
presently a PhD Student of Landmark University, Omu-Aran Nigeria. His area of research
interest includes Machine Learning, Bioinformatics, Datamining, Cyber Security and
Computer Arithmetic. He has published widely in local and international reputable journals,
he is a member of IAENG, APISE, SDIWC, and an Oracle Certified Expert.
Dr. Marion Olubunmi Adebiyi, is a faculty of the Department of Computer Science at
Landmark University, Omu-Aran, Nigeria. She holds a BSc Degree from University of
Ilorin, Ilorin Nigeria. She had her MSc and PhD Degree in Computer Science from Covenant
University, Nigeria respectively. Her research interests include, Bioinformatics of Infectious
(African) Diseases/ Population, Organism’s Inter-pathway analysis, High throughput data
analytics, Homology modellin and Artificial Intelligence. She has published widely in local
and international reputable journals She is a member of Nigerian Computer Society (NCS),
the Computer Registration Council of Nigeria (CPN) and IEEE member.
Professor Ayodele Ariyo Adebiyi, is a faculty and former Head of Department of Computer
and Information Sciences, Covenant University, Ota Nigeria. He is currently the Head of
Department of Computer Science at Landmark University, Omu-Aran, Nigeria, a sister
University to Covenant University. He holds a BSc degree in Computer Science and MBA
degree from University of Ilorin, Ilorin Nigeria. He had his MSc and PhD degree in
Management Information System (MIS) from Covenant University, Nigeria respectively.
His research interests include, application of soft computing techniques in solving real life
problems, software engineering and information system research. He has successfully
mentored and supervised several postgraduate students at Masters and PhD level. He has
published widely in local and international reputable journals. He is a member of Nigerian
Computer Society (NCS), the Computer Registration Council of Nigeria (CPN) and
IEEE member.

Mais conteúdo relacionado

Mais procurados

A Fault tolerant system based on Genetic Algorithm for Target Tracking in Wir...
A Fault tolerant system based on Genetic Algorithm for Target Tracking in Wir...A Fault tolerant system based on Genetic Algorithm for Target Tracking in Wir...
A Fault tolerant system based on Genetic Algorithm for Target Tracking in Wir...Editor IJCATR
 
K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...
K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...
K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...IRJET Journal
 
DATA TRANSMISSION IN WIRELESS SENSOR NETWORKS FOR EFFECTIVE AND SECURE COMMUN...
DATA TRANSMISSION IN WIRELESS SENSOR NETWORKS FOR EFFECTIVE AND SECURE COMMUN...DATA TRANSMISSION IN WIRELESS SENSOR NETWORKS FOR EFFECTIVE AND SECURE COMMUN...
DATA TRANSMISSION IN WIRELESS SENSOR NETWORKS FOR EFFECTIVE AND SECURE COMMUN...IJEEE
 
Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...
Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...
Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...Drjabez
 
A clonal based algorithm for the reconstruction of genetic network using s sy...
A clonal based algorithm for the reconstruction of genetic network using s sy...A clonal based algorithm for the reconstruction of genetic network using s sy...
A clonal based algorithm for the reconstruction of genetic network using s sy...eSAT Journals
 
LIFETIME IMPROVEMENT USING MOBILE AGENT IN WIRELESS SENSOR NETWORK
LIFETIME IMPROVEMENT USING MOBILE AGENT IN WIRELESS SENSOR NETWORKLIFETIME IMPROVEMENT USING MOBILE AGENT IN WIRELESS SENSOR NETWORK
LIFETIME IMPROVEMENT USING MOBILE AGENT IN WIRELESS SENSOR NETWORKJournal For Research
 
A clonal based algorithm for the reconstruction of
A clonal based algorithm for the reconstruction ofA clonal based algorithm for the reconstruction of
A clonal based algorithm for the reconstruction ofeSAT Publishing House
 
NRNB Annual Report 2018
NRNB Annual Report 2018NRNB Annual Report 2018
NRNB Annual Report 2018Alexander Pico
 
NRNB Annual Report 2012
NRNB Annual Report 2012NRNB Annual Report 2012
NRNB Annual Report 2012Alexander Pico
 
An Approach of Mobile Wireless Sensor Network for Target Coverage and Network...
An Approach of Mobile Wireless Sensor Network for Target Coverage and Network...An Approach of Mobile Wireless Sensor Network for Target Coverage and Network...
An Approach of Mobile Wireless Sensor Network for Target Coverage and Network...IRJET Journal
 
NRNB Annual Report 2017
NRNB Annual Report 2017NRNB Annual Report 2017
NRNB Annual Report 2017Alexander Pico
 
AN EFFICIENT DEPLOYMENT APPROACH FOR IMPROVED COVERAGE IN WIRELESS SENSOR NET...
AN EFFICIENT DEPLOYMENT APPROACH FOR IMPROVED COVERAGE IN WIRELESS SENSOR NET...AN EFFICIENT DEPLOYMENT APPROACH FOR IMPROVED COVERAGE IN WIRELESS SENSOR NET...
AN EFFICIENT DEPLOYMENT APPROACH FOR IMPROVED COVERAGE IN WIRELESS SENSOR NET...csandit
 
A new approach for area coverage problem in wireless sensor networks with hyb...
A new approach for area coverage problem in wireless sensor networks with hyb...A new approach for area coverage problem in wireless sensor networks with hyb...
A new approach for area coverage problem in wireless sensor networks with hyb...ijmnct
 
Analysis of Various Cluster Algorithms Based on Flying Insect Wing Beat Sound...
Analysis of Various Cluster Algorithms Based on Flying Insect Wing Beat Sound...Analysis of Various Cluster Algorithms Based on Flying Insect Wing Beat Sound...
Analysis of Various Cluster Algorithms Based on Flying Insect Wing Beat Sound...IRJET Journal
 
Efficient Data Gathering with Compressive Sensing in Wireless Sensor Networks
Efficient Data Gathering with Compressive Sensing in Wireless Sensor NetworksEfficient Data Gathering with Compressive Sensing in Wireless Sensor Networks
Efficient Data Gathering with Compressive Sensing in Wireless Sensor NetworksIRJET Journal
 
NRNB Annual Report 2013
NRNB Annual Report 2013NRNB Annual Report 2013
NRNB Annual Report 2013Alexander Pico
 
Comparative study between metaheuristic algorithms for internet of things wir...
Comparative study between metaheuristic algorithms for internet of things wir...Comparative study between metaheuristic algorithms for internet of things wir...
Comparative study between metaheuristic algorithms for internet of things wir...IJECEIAES
 
Solution for intra/inter-cluster event-reporting problem in cluster-based pro...
Solution for intra/inter-cluster event-reporting problem in cluster-based pro...Solution for intra/inter-cluster event-reporting problem in cluster-based pro...
Solution for intra/inter-cluster event-reporting problem in cluster-based pro...IJECEIAES
 

Mais procurados (20)

A Fault tolerant system based on Genetic Algorithm for Target Tracking in Wir...
A Fault tolerant system based on Genetic Algorithm for Target Tracking in Wir...A Fault tolerant system based on Genetic Algorithm for Target Tracking in Wir...
A Fault tolerant system based on Genetic Algorithm for Target Tracking in Wir...
 
K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...
K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...
K Mean and Fuzzy Clustering Algorithm Predicated Brain Tumor Segmentation And...
 
DATA TRANSMISSION IN WIRELESS SENSOR NETWORKS FOR EFFECTIVE AND SECURE COMMUN...
DATA TRANSMISSION IN WIRELESS SENSOR NETWORKS FOR EFFECTIVE AND SECURE COMMUN...DATA TRANSMISSION IN WIRELESS SENSOR NETWORKS FOR EFFECTIVE AND SECURE COMMUN...
DATA TRANSMISSION IN WIRELESS SENSOR NETWORKS FOR EFFECTIVE AND SECURE COMMUN...
 
Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...
Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...
Novel Methodology of Data Management in Ad Hoc Network Formulated using Nanos...
 
A clonal based algorithm for the reconstruction of genetic network using s sy...
A clonal based algorithm for the reconstruction of genetic network using s sy...A clonal based algorithm for the reconstruction of genetic network using s sy...
A clonal based algorithm for the reconstruction of genetic network using s sy...
 
LIFETIME IMPROVEMENT USING MOBILE AGENT IN WIRELESS SENSOR NETWORK
LIFETIME IMPROVEMENT USING MOBILE AGENT IN WIRELESS SENSOR NETWORKLIFETIME IMPROVEMENT USING MOBILE AGENT IN WIRELESS SENSOR NETWORK
LIFETIME IMPROVEMENT USING MOBILE AGENT IN WIRELESS SENSOR NETWORK
 
A clonal based algorithm for the reconstruction of
A clonal based algorithm for the reconstruction ofA clonal based algorithm for the reconstruction of
A clonal based algorithm for the reconstruction of
 
NRNB Annual Report 2018
NRNB Annual Report 2018NRNB Annual Report 2018
NRNB Annual Report 2018
 
NRNB Annual Report 2012
NRNB Annual Report 2012NRNB Annual Report 2012
NRNB Annual Report 2012
 
Ac36167172
Ac36167172Ac36167172
Ac36167172
 
An Approach of Mobile Wireless Sensor Network for Target Coverage and Network...
An Approach of Mobile Wireless Sensor Network for Target Coverage and Network...An Approach of Mobile Wireless Sensor Network for Target Coverage and Network...
An Approach of Mobile Wireless Sensor Network for Target Coverage and Network...
 
NRNB Annual Report 2017
NRNB Annual Report 2017NRNB Annual Report 2017
NRNB Annual Report 2017
 
AN EFFICIENT DEPLOYMENT APPROACH FOR IMPROVED COVERAGE IN WIRELESS SENSOR NET...
AN EFFICIENT DEPLOYMENT APPROACH FOR IMPROVED COVERAGE IN WIRELESS SENSOR NET...AN EFFICIENT DEPLOYMENT APPROACH FOR IMPROVED COVERAGE IN WIRELESS SENSOR NET...
AN EFFICIENT DEPLOYMENT APPROACH FOR IMPROVED COVERAGE IN WIRELESS SENSOR NET...
 
A new approach for area coverage problem in wireless sensor networks with hyb...
A new approach for area coverage problem in wireless sensor networks with hyb...A new approach for area coverage problem in wireless sensor networks with hyb...
A new approach for area coverage problem in wireless sensor networks with hyb...
 
Analysis of Various Cluster Algorithms Based on Flying Insect Wing Beat Sound...
Analysis of Various Cluster Algorithms Based on Flying Insect Wing Beat Sound...Analysis of Various Cluster Algorithms Based on Flying Insect Wing Beat Sound...
Analysis of Various Cluster Algorithms Based on Flying Insect Wing Beat Sound...
 
Efficient Data Gathering with Compressive Sensing in Wireless Sensor Networks
Efficient Data Gathering with Compressive Sensing in Wireless Sensor NetworksEfficient Data Gathering with Compressive Sensing in Wireless Sensor Networks
Efficient Data Gathering with Compressive Sensing in Wireless Sensor Networks
 
NRNB Annual Report 2013
NRNB Annual Report 2013NRNB Annual Report 2013
NRNB Annual Report 2013
 
Comparative study between metaheuristic algorithms for internet of things wir...
Comparative study between metaheuristic algorithms for internet of things wir...Comparative study between metaheuristic algorithms for internet of things wir...
Comparative study between metaheuristic algorithms for internet of things wir...
 
Matlab 2013 14 papers astract
Matlab 2013 14 papers astractMatlab 2013 14 papers astract
Matlab 2013 14 papers astract
 
Solution for intra/inter-cluster event-reporting problem in cluster-based pro...
Solution for intra/inter-cluster event-reporting problem in cluster-based pro...Solution for intra/inter-cluster event-reporting problem in cluster-based pro...
Solution for intra/inter-cluster event-reporting problem in cluster-based pro...
 

Semelhante a A genetic algorithm approach for predicting ribonucleic acid sequencing data classification using KNN and decision tree

EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...IJDKP
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...IJDKP
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...rahulmonikasharma
 
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...mlaij
 
Feature Selection Approach based on Firefly Algorithm and Chi-square
Feature Selection Approach based on Firefly Algorithm and Chi-square Feature Selection Approach based on Firefly Algorithm and Chi-square
Feature Selection Approach based on Firefly Algorithm and Chi-square IJECEIAES
 
Corrected 2e-5
Corrected 2e-5Corrected 2e-5
Corrected 2e-5Dago Noel
 
Corrected 2e-5
Corrected 2e-5Corrected 2e-5
Corrected 2e-5Dago Noel
 
An evaluation of machine learning algorithms coupled to an electronic olfact...
An evaluation of machine learning algorithms coupled to an  electronic olfact...An evaluation of machine learning algorithms coupled to an  electronic olfact...
An evaluation of machine learning algorithms coupled to an electronic olfact...IJECEIAES
 
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...IJTET Journal
 
2014 Gene expressionmicroarrayclassification usingPCA–BEL.
2014 Gene expressionmicroarrayclassification usingPCA–BEL.2014 Gene expressionmicroarrayclassification usingPCA–BEL.
2014 Gene expressionmicroarrayclassification usingPCA–BEL.Ehsan Lotfi
 
ANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna Design
ANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna DesignANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna Design
ANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna DesignAIRCC Publishing Corporation
 
ANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna Design
ANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna DesignANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna Design
ANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna DesignAIRCC Publishing Corporation
 
PLANT LEAF DISEASE CLASSIFICATION USING CNN
PLANT LEAF DISEASE CLASSIFICATION USING CNNPLANT LEAF DISEASE CLASSIFICATION USING CNN
PLANT LEAF DISEASE CLASSIFICATION USING CNNIRJET Journal
 
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic MethodsAnalytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic Methodscscpconf
 
SCDT: FC-NNC-structured Complex Decision Technique for Gene Analysis Using Fu...
SCDT: FC-NNC-structured Complex Decision Technique for Gene Analysis Using Fu...SCDT: FC-NNC-structured Complex Decision Technique for Gene Analysis Using Fu...
SCDT: FC-NNC-structured Complex Decision Technique for Gene Analysis Using Fu...IJECEIAES
 
RT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationRT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationIJAEMSJORNAL
 
Molecular marker and its application in breed improvement and conservation.docx
Molecular marker and its application in breed improvement and conservation.docxMolecular marker and its application in breed improvement and conservation.docx
Molecular marker and its application in breed improvement and conservation.docxTrilokMandal2
 
Application of Molecular Markers SNP and DArT in Plant Breeding: A Review Paper
Application of Molecular Markers SNP and DArT in Plant Breeding: A Review PaperApplication of Molecular Markers SNP and DArT in Plant Breeding: A Review Paper
Application of Molecular Markers SNP and DArT in Plant Breeding: A Review PaperJournal of Agriculture and Crops
 

Semelhante a A genetic algorithm approach for predicting ribonucleic acid sequencing data classification using KNN and decision tree (20)

EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
project
projectproject
project
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
 
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
 
Feature Selection Approach based on Firefly Algorithm and Chi-square
Feature Selection Approach based on Firefly Algorithm and Chi-square Feature Selection Approach based on Firefly Algorithm and Chi-square
Feature Selection Approach based on Firefly Algorithm and Chi-square
 
Corrected 2e-5
Corrected 2e-5Corrected 2e-5
Corrected 2e-5
 
Corrected 2e-5
Corrected 2e-5Corrected 2e-5
Corrected 2e-5
 
An evaluation of machine learning algorithms coupled to an electronic olfact...
An evaluation of machine learning algorithms coupled to an  electronic olfact...An evaluation of machine learning algorithms coupled to an  electronic olfact...
An evaluation of machine learning algorithms coupled to an electronic olfact...
 
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
 
2014 Gene expressionmicroarrayclassification usingPCA–BEL.
2014 Gene expressionmicroarrayclassification usingPCA–BEL.2014 Gene expressionmicroarrayclassification usingPCA–BEL.
2014 Gene expressionmicroarrayclassification usingPCA–BEL.
 
ANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna Design
ANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna DesignANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna Design
ANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna Design
 
ANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna Design
ANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna DesignANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna Design
ANN Based Features Selection Approach Using Hybrid GA-PSO for Sirna Design
 
PLANT LEAF DISEASE CLASSIFICATION USING CNN
PLANT LEAF DISEASE CLASSIFICATION USING CNNPLANT LEAF DISEASE CLASSIFICATION USING CNN
PLANT LEAF DISEASE CLASSIFICATION USING CNN
 
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic MethodsAnalytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
SCDT: FC-NNC-structured Complex Decision Technique for Gene Analysis Using Fu...
SCDT: FC-NNC-structured Complex Decision Technique for Gene Analysis Using Fu...SCDT: FC-NNC-structured Complex Decision Technique for Gene Analysis Using Fu...
SCDT: FC-NNC-structured Complex Decision Technique for Gene Analysis Using Fu...
 
RT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationRT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferation
 
Molecular marker and its application in breed improvement and conservation.docx
Molecular marker and its application in breed improvement and conservation.docxMolecular marker and its application in breed improvement and conservation.docx
Molecular marker and its application in breed improvement and conservation.docx
 
Application of Molecular Markers SNP and DArT in Plant Breeding: A Review Paper
Application of Molecular Markers SNP and DArT in Plant Breeding: A Review PaperApplication of Molecular Markers SNP and DArT in Plant Breeding: A Review Paper
Application of Molecular Markers SNP and DArT in Plant Breeding: A Review Paper
 

Mais de TELKOMNIKA JOURNAL

Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...TELKOMNIKA JOURNAL
 
Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...TELKOMNIKA JOURNAL
 
Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...TELKOMNIKA JOURNAL
 
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...TELKOMNIKA JOURNAL
 
Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...TELKOMNIKA JOURNAL
 
Efficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaEfficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaTELKOMNIKA JOURNAL
 
Design and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireDesign and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireTELKOMNIKA JOURNAL
 
Wavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkWavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkTELKOMNIKA JOURNAL
 
A novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsA novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsTELKOMNIKA JOURNAL
 
Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...TELKOMNIKA JOURNAL
 
Brief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesBrief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesTELKOMNIKA JOURNAL
 
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...TELKOMNIKA JOURNAL
 
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemEvaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemTELKOMNIKA JOURNAL
 
Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...TELKOMNIKA JOURNAL
 
Reagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorReagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorTELKOMNIKA JOURNAL
 
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...TELKOMNIKA JOURNAL
 
A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...TELKOMNIKA JOURNAL
 
Electroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksElectroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksTELKOMNIKA JOURNAL
 
Adaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingAdaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingTELKOMNIKA JOURNAL
 
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...TELKOMNIKA JOURNAL
 

Mais de TELKOMNIKA JOURNAL (20)

Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...
 
Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...
 
Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...
 
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
 
Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...
 
Efficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaEfficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antenna
 
Design and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireDesign and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fire
 
Wavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkWavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio network
 
A novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsA novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bands
 
Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...
 
Brief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesBrief note on match and miss-match uncertainties
Brief note on match and miss-match uncertainties
 
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
 
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemEvaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
 
Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...
 
Reagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorReagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensor
 
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
 
A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...
 
Electroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksElectroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networks
 
Adaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingAdaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imaging
 
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
 

Último

Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Vivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design SpainVivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design Spaintimesproduction05
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTbhaskargani46
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01KreezheaRecto
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLManishPatel169454
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Christo Ananth
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfRagavanV2
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 

Último (20)

Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
Vivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design SpainVivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design Spain
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 

A genetic algorithm approach for predicting ribonucleic acid sequencing data classification using KNN and decision tree

  • 1. TELKOMNIKA Telecommunication, Computing, Electronics and Control Vol. 19, No. 1, February 2021, pp. 310~316 ISSN: 1693-6930, accredited First Grade by Kemenristekdikti, Decree No: 21/E/KPT/2018 DOI: 10.12928/TELKOMNIKA.v19i1.16381  310 Journal homepage: http://journal.uad.ac.id/index.php/TELKOMNIKA A genetic algorithm approach for predicting ribonucleic acid sequencing data classification using KNN and decision tree Micheal Olaolu Arowolo, Marion Olubunmi Adebiyi, Ayodele Ariyo Adebiyi Department of Computer Science, Landmark University, Omu-Aran, Kwara State, Nigeria Article Info ABSTRACT Article history: Received Apr 14, 2020 Revised Jun 9, 2020 Accepted Sep 24, 2020 Malaria larvae accept explosive variable lifecycle as they spread across numerous mosquito vector stratosphere. Transcriptomes arise in thousands of diverse parasites. Ribonucleic acid sequencing (RNA-seq) is a prevalent gene expression that has led to enhanced understanding of genetic queries. RNA- seq tests transcript of gene expression, and provides methodological enhancements to machine learning procedures. Researchers have proposed several methods in evaluating and learning biological data. Genetic algorithm (GA) as a feature selection process is used in this study to fetch relevant information from the RNA-Seq Mosquito Anopheles gambiae malaria vector dataset, and evaluates the results using kth nearest neighbor (KNN) and decision tree classification algorithms. The experimental results obtained a classification accuracy of 88.3 and 98.3 percents respectively. Keywords: Decision tree Genetic algorithm KNN Mosquito anopheles Ribonucleic acid sequencing This is an open access article under the CC BY-SA license. Corresponding Author: Micheal Olaolu Arowolo Department of Computer Science Lamdmark University Omu-Aran, Kwara State Nigeria Email: arowolo.olaolu@lmu.edu.ng 1. INTRODUCTION Next-generation high-throughput sequencing technology has created profuse wide-ranging datasets, this enormous data expanse helps biologists to analyze and perform daunting gene transcripts, such as disease related and RNA such as infections (malaria), cancer, inherited, genetics, physiology, among others [1]. Blood-sucking mosquitoes such as Mosquito Anopheles with vectors of malaria plasmodium falciparum are found in Africa. Mosquito Anopheles is a deadly malaria parasite, responsible for demises of thousands of humans daily. Antimalaria combat suppositories blowouts, state-of-the-art antimalarials treatment upsurges, fetching for ground-breaking medications requires improved biotic studies of this infenctions. The parasite tolerates precise parameter of gene expression query enormously and necessitates making enhanced thorough extrapolative model transcriptions of vectors [2]. Approachable revealing genetic inquiries have been made in ribonucleic acid sequencing (RNA-seq) study by unfolding a cautious purposeful biological strategy for enhancement of the learning. RNA-Seq data requires removal of expletive high-dimension, such as; noises, complaints, repetition, irrelevant, inactivity, unfitting data, and others [3]. New capabilities strengthen solutions to the development of ground-breaking healthcare frameworks such as effective public wellbeing nursing systems, advanced interventions and medical diagnosis and disorders [4]. Machine learning means have been established with convincing uniqueness to investigate the enormous amount of cutting-edge RNA-Seq knowledge by studying the naturally material structures [5]. Scientists have used machine learning algorithms with relevant achievement for gene expression data results
  • 2. TELKOMNIKA Telecommun Comput El Control  A genetic algorithm approach for predicting ribonucleic acid sequencing data… (Micheal Olaolu Arowolo) 311 of RNA-Seq [6-8]. In this study, a genetic algorithm (GA) pre-processor, to obtain reduced dimensionality of data with kth nearest neighbor (KNN) and decision tree classifiers are proposed to classify discrete genetic structures and obtain advances that are suitable system for predicting and detecting innovative genes for malaria ailments in human. 2. REVIEWS Computational procedures are based on enormous samples of individuals genes with or without diseases, mutations may be found accountable for the precense of diseases. Differential expressed genes (DEG) are defined through some methods. Machine Learning measures are important for spotting the variation between genes found from human genome. Machine learning techniques have been emulated severally in investigating and classifying various profiles of diseases gene expressions. Various machine learning approaches are reported and reviewed, using recent trends in the evaluations [4]. Machine learning for predicting Autism spectrum ailment was experimented and classify transcripts, using RNA data from gene omnibus expression data. This study ranked cluster analysis and relatively discriminated, using SVM and KNN classifiers, an estimate accuracy of 94% was achieved [9]. Clustering and classification of RNA-Seq data was carried out by performing a mutual valuation, and emphasizing the expertise and ploys of methods occurring in recent time as predominant shifts, uing nonlinear and linear dimension reduction systems, by combining scRNA-seq data [10]. Group of RNA-Seq genes for ranking genes set of huge ensembles using a supervised learning approach was carried out using random forests classification method, on 1210 samples of tumor RNA-Seq datasets showed hidden supervised learning selection approaches necessity on analysis [11]. A supervised single-cell RNA-Seq data classification model was proposed using a comprehensive approach by combining independent feature selection approaches. scPred RNA-seq datasets showed high accuracy [12]. RNA-DNA machine learning analysis was proposed to indicate small genome expression to influence PAH ailment, feature selection algorithm was proposed to classify relevant genes with an outcome that reveals unique PAH [13]. Stomach tumor gene expression data using CNN classification procedure was developed based on deep learning approach, 60,000 data made up of stomach tumor genes were evaluated using PCA), heatmaps, and CNN algorithms with an accuracy of 96% and 51% [14]. RNA-Seq hidden transcripts in malaria parasites was proposed by relating variations of procedures to deconvolute transcriptional differences for distinct mosquitos and revealed hidden distinct transcriptional signatures [15]. An ensemble classification algorithm for cancer dataset was developed using decision tree, ensemble decision trees algorithms on available cancerous microarray, the results enhances than the decision trees classification [16]. An investigative cancer gene expression ensemble classification method was proposed using a hybrid RFE-Adaboost algorithm to fetch significant features for enhancing classification performancet [17]. Classification of cancer data was carried out using an effective ensemble classification method by increasing the classification, the result were less contingent [18]. A metaheuristics system for fetching relevant RNA/DNA data genes for classification was proposed by briefing recent developments of metaheuristic-based methods in embedded feature selection methods, useful data for operatives for ranking coefficients of SVM classifier is used [19]. A GA presenting a state-of-the-art approach was proposed using filter-wrapper based feature selection on five biological datasets, the results showed an important reduction of features for classification [20]. An enhanced ensemble classification for certain features was proposed for learning an ensemble-based feature selection approach with random trees using a subset, the method removes the unfitting structures and picks the best structures by means of a probability weighing value for classification evaluation using RF, SVM, and NB [21]. Review of several feature extraction algorithms for gene expression investigation, such as the PCA, ICA, PLS, and LLE was carried out and discussed for the purpose of the machine learning applications [22]. 3. MATERIALS AND METHODS Several high-dimensional data enhancement methods are in place, this paper carries out a feature selection using GA technique and ensemble classification algorithm for fetching relevant information in a huge dimensional data and classification. A western Kenya RNA-Seq data mosquitos’ genes with 2457 instances with 7 gene attributes [23], MATLAB environment tool is used carry out the experiment using GA to select relevant subset of features from the dataset as shown in Table 1, Ensemble algorithm approach is used as a classifier on the selected features [24].
  • 3.  ISSN: 1693-6930 TELKOMNIKA Telecommun Comput El Control, Vol. 19, No. 1, February 2021: 310 - 316 312 Table 1. Features of the data Dataset Attributes Instances Mosquito Anopheles Gambiae 7 2457 3.1. GA GA is a wrapper-based feature selection approach that examines suitable features from a given high dimensional datasets, with numerous parameter procedures, where mutation and crossover operators are associated with relevant recognized binary constraints features [19]. RNA features with N number denotes a feature having selected and unselected values 1 and 0 respectively. GA is important and helps to find feature subset models with selected figure features for composite classifications. GA structure is adopted and well-defined in Algorithm 1 [20]: Algorithm 1. Genetic Algorithm Necessitate: Set parameters nPop = m, tmax, t = 0; Confirm: Optimal feature subset with the maximum suitable value. 1: while (t<=tmax) do 2: Create pop m, tmax; 3: For k = 1 to m do 4: Parents [m1, m2] = system selection (m, nPop) 5: Child = Xor [m1, m2] 6: M u = mutation [Child} 7: End for 8: Replace m with Child1, Child2, …, Childm 9: t = t+ 1; 10: End while 11: Store the Highest fitness value; m = population size, r = random number 0 to 1, chrome = certain or non-certain feature through threshold δ, set value = 0.5, and α = threshold number of picked features. Selecting maximum fit features from the predictable datasets is the main problem of the GA technique. 3.2. KNN A supervised learning K-nearest neighbor classification technique for gene datasets, performs neighborhood classification evaluation value of innovative application occurrence. KNN algorithm classifies innovative entity developed on instances, attributes as well as training models. KNN classifiers do not train models to fit but built on retention. The features selected are assumed as input to segments. The K value of nearest neighbors are selected nearest to the query spot. Detachment between query-instance and training models are considered and sorted based on the Kth minimum determined distance. Group Y of the nearest neighbors is fetched. The unassuming common of the group of nearest neighbors as the estimate amount of the query instance is used. Bonds can fragment randomly [25]. 3.3. Decision trees Decision tree classification algorithm divides recursively instance spacing with hyperplanes orthogonally. Decision tree model assembles derivative nodes signifying attributes, based on instance space attribute value roles selected inversely for algorithms, using its values. Advanced data sub-space iteratively divides till end principle is determined and terminal nodes (leaf nodes) are allocated to class labels characterizing the classification. Accurate conventional end procedure is a significant tree with too huge, overfitted and trivial trees, underfitted and suffers loss in accuracy. Algorithms have assembled overfitting strategies, labelled trimming, classifying new instances by leading the tree basis down a leaf, with respect to the examination result along the pathway [26]. Competent models are discovered using decision tree classifiers and ensembles, with unbalanced varying trained datasets, with resultant models totally unalike. 3.4. Performance evaluation and applications Machine learning model need evaluation and validation of performance metrics using a confusion matrix and its formula [4, 27]. Expression of gene analysis suggest enhanced RNA-Seq data path identification, to learn applicable helpful genes in advancing applications such as treatment modifications, diseases diagnosis, drugs and gene discoveries, classification of cancers, typhoid, malaria, among other ailments. Designs and inconsistency findings between machine learning data has discovered great algorithms applicable to many fields such as engineering, banking, health sectors among others. MATLAB 2015A is proposed as an experimental and executing tool for the prognosis of malaria infections on an iCore2 processor, 4GB RAM size, 64-bit System.
  • 4. TELKOMNIKA Telecommun Comput El Control  A genetic algorithm approach for predicting ribonucleic acid sequencing data… (Micheal Olaolu Arowolo) 313 4. RESULTS AND ANALYSIS In this study, 2457 instances of RNA-Seq dataset “Mosquitoes Anopheles Gambiae” containing resistants and susceptibles of genes is used on a GA to draw optimal reduced number of subsets in the data, taking away uncorrelated attributes to pick maximum variance features. The result shows important gene evidence suitable for KNN and decision tree classification algorithm study on MATLAB environment for the model experiment. Genetic algorithm makes use of 0.5 threshold and achieves 708 optimal subset features of significant genes. Classifiers used 10-folds cross validation was used on KNN and decision tree classifiers, to implement evaluations of the model’s performance with 0.05 holdout training data parameter and classifier accuracy tests the data with 25%. A learning classification procedure evaluation train and test evaluates using 10-fold cross validation to remove the partiality in sampling. The performance metrics and time computation is evaluated [27] and relates the model classification performance, by means of KNN (bagging) and decision tree, with 98.3% and 88.3% accuracy separately using confusion matrix and result outpus as shown in Figure 1. Related components were fetched by GA from the full data shown in Figure 1, the subset data features pass into KNN as well as decision tree and shows the Confusion matix result in the Figure 2 and Figure 3 to derive the solution to the performance metrics. KNN classification algorithm achieves an accuracy of 88.3%, while the decision tree classification algorithm achieved an accuracy of 98.3%, metrics of other performance are shown in Table 1. Figure 1. Loaded mosquito anopheles gambiae on MATLAB environment In this study, RNA-Seq data uses a mosquito anopheles gambiae dataset [28], to test the machine learning method performance. Genetic algorithm dimensionality reduction model selects 708 subset features from 2457 features of genes form the data. The selected components were classified using classification algorithms (KNN and decision tree) performance evaluation. The efficiency of machine learning approach in genes are shown in the results to confirm the method, the outcomes are revealed and related in Table 2 showing GA-decision tree outperforms GA-KNN terms of accuracy. In this study, an improved classification of malaria vector data is analyzed using GA with decision tree and KNN algorithms respectivel, numerous works have been reviewed, the results prove that GA enhances classification yield for KNN and decision tree.
  • 5.  ISSN: 1693-6930 TELKOMNIKA Telecommun Comput El Control, Vol. 19, No. 1, February 2021: 310 - 316 314 Figure 2. RNA-Seq confusion matrix using decision tree algorithm TP=38; TN=21; FP=0; FN=1 Figure 3. RNA-Seq confusion matrix using KNN TP=36; TN=17; FP=4; FN=3 Table 2. Performance metrics table for the confusion matrix Performance Metrics GA-Decision Tree Classification GA-KNN Classification Accuracy (%) 98.3 88.3 Sensitivity (%) 97.4 92.3 Specificity (%) 100 81.0 Precision (%) 100 90.0 Recall (%) 97.4 92.3 F-Score (%) 98.7 91.1 5. CONCLUSION In this study, improvements efficience for predicting and detecting malaria ailments in human are proposed using machine learning dimensionality reduction and classification techniques. GA feature selection dimensionality reduction and KNN and decision tree classifiers were employed by performing evaluating and
  • 6. TELKOMNIKA Telecommun Comput El Control  A genetic algorithm approach for predicting ribonucleic acid sequencing data… (Micheal Olaolu Arowolo) 315 analysing the performance results obtained. This study enhanced malaria vector data classification, and compared with quite a lot of proposed works in reviews by numerous researchers, the outcomes demonstrates that, GA dimensionality reduction model helps to develop classification output such as decision tree. Investigating current works proposed in literature can improve feature selection models and algorithms and compared with recent other state-of-the-art classification algorithm. REFERENCES [1] S. Shanwen, et al., “Machine Learning and Its Applications in Plant Molecular Studies,” Briefings in Functional Genomics, vol. 19, no. 1, pp. 40-48, 2019. [2] F. David., et al., “Predicting Gene Expression in the Human Malaria Parasite Plasmodium Falciparum Using Histone Modification, Nucleosome Positioning, and 3D Localization Features,” PLOS Computational Biology, vol. 15, no. 9, 2019. [3] M. Arowolo, et al., “A Dimensional Reduced Model for the Classification of RNA-Seq Anopheles Gambiae Data,” Journal of Theoretical and Applied Information Technology, vol. 97, no. 23, pp. 3487-96, 2019. [4] S. Karthik, et al., “A Survey on Machine Learning Approaches in Gene Expression Classification in Modelling Computational Diagnostic System for Complex Diseases,” International Journal of Engineering and Advanced Technology, vol. 8, no. 2, pp. 182-191, 2018. [5] N. Johnson, et al., “Biological classification with RNA-seq data: Can alternatively spliced transcript expression enhance machine learning classifiers?,” Cold Spring Harbor Laboratory Press for the RNA Society, vol. 24, no. 9, pp. 1119-1132, 2018. [6] M. Libbrecht, et al., “Machine learning applications in genetics and genomics,” Nat Rev Genetics, vol. 16, pp. 321-332, 2015. [7] Z. Jagga, et al., “Classification models for clear cell renal carcinoma stage progression, based on tumor RNAseq expression trained supervised machine learning algorithms,” BMC Proceedings, vol. 8, no. 2, 2014. [8] Anopheles gambiae 1000 Genomes Consortium, “Genetic diversity of the African malaria vector Anopheles gambiae,” Nature, vol. 552, no. 7683, pp. 96-100, 2017. [9] D. Oh, et al., “Predicting Autism Spectrum Disorder Using Blood-based Gene Expression Signatures and Machine Learning,” Clin Psychopharmacology Neuroscience, vol. 15, no. 1, pp. 47-52, 2017. [10] Q. Ren, et al., “Clustering and Classification Methods for Single-cell RNA-Seq Data,” Briefings in Bioinformatics, vol. 21, no. 4, pp. 1-13, 2019. [11] W. Stephen, et al., “Using Supervised Learning Methods for Gene Selection in RNA-Seq Case-Control Studies,” Frontiers in Genetic, vol. 9, no. 297, pp. 1-6, 2018. [12] J. Alquicira-Hernandez, et al., “scPred: Accurate Supervised Method for Cell-type Classification from Single-cell RNA-seq Data,” Genome Biology, vol. 20, no. 264, 2019. [13] S. Cui, et al., “Machine Learning-based Microarray Analyses Indicate Low-Expression Genes Might Collectively Influence PAH Disease,” PLOS Computational Biology, vol. 15, no. 8, 2019. [14] H. Shon, et al., “Classification of Stomach Canacer Gene Expression Data Using CNN Algorithm of Deep Learning,” Journal of Biomedical Translation Research, vol. 20, no. 1, pp. 15-20, 2019. [15] J. R. Adam, et al., “Single-cell RNA-seq reveals hidden transcriptional variation in malaria parasites,” eLife Sciences, vol. 7, pp. 1-29, 2018. [16] A. Tan, et al., “Ensemble Machine Learning on Gene Expression Data for Cancer Classification,” Applied Bioinformatics, vol. 2, no. 3, pp. 75-83, 2003. [17] N. Song, et al., “Design and Analysis of Ensemble Classifier for Gene Expression Data of Cancer,” Advancement in Genetic Engineering, vol. 5, no. 1, pp. 1-7, 2016. [18] S. Tarek, et al., “Gene Expression Based Cancer Classification,” Egyptian Informatics Journal, vol. 18, no. 3, pp. 151-159, 2017. [19] Duval, et al., “Advances in Metaheuristics for Gene Selectio and Classification of Microarray Data,” Briefings in Bioinformatics, vol. 11, no. 1, pp. 127-141, 2010. [20] A. Shukla, et al., “A New Hybrid Feature Subset Selection Framework Based on Binary Genetic Algorithm and Information Theory,” International Journal of Computational Intelligence and Applications, vol. 18, no. 3, pp. 1-11, 2019. [21] A. Tan, et al., “Ensemble Machine Learning on Gene Expression Data for Cancer Classification,” Applied Bioinformatics, vol. 3, pp. 1-10, 2003. [22] K. Kamran, Kiana J. M., Mojtaba H., Sanjana M., Laura B., Donald B., “Text Classification Algorithms: A Survey,” Information MDPI, vol. 10, no. 150, pp. 2-68, 2019. [23] B. Mariangela, et al., “RNA-seq analyses of changes in the Anopheles gambiae transcriptome associated with resistance to pyrethroids in Kenya: identification of candidate-resistance genes and candidate-resistance SNPs,” Parasites and Vector, vol. 8, no. 474, pp. 1-13, 2015. [24] G. James, et al., “An introduction to statistical learning with application in R,” New York (NY): Springer; 2013. [25] J. Bose, et al., “A Hybrid GA/KNN/SVM Algorithm for Classification of Data,” BioHouse Journal of Computer science, vol. 2, no. 2, pp. 5-11, 2016. [26] I. Polaka, et al., “Decision Tree Classifiers in Bioinformatics,” Scientific Journal of Riga Technical University, pp. 110-123, 2010. [27] M. Arowolo, et al., “A Comparative Analysis of Feature Selection and Feature Extraction Models for Classifying Microarray Dataset,” Computing and Information System, vol. 22, no. 2, pp. 29-38, 2018.
  • 7.  ISSN: 1693-6930 TELKOMNIKA Telecommun Comput El Control, Vol. 19, No. 1, February 2021: 310 - 316 316 [28] Mariangela Bonizzoni, et al., “Additional file 4: of RNA-seq analyses of changes in the Anopheles gambiae transcriptome associated with resistance to pyrethroids in Kenya: identification of candidate-resistance genes and candidate-resistance SNPs,” 2015. https://figshare.com/articles/Additional_file_4_of_RNAseq_analyses_of_changes _in_the_Anopheles_gambiae_transcriptome_associated_with_resistance_to_pyrethroids_in_Kenya_identification_o f_candidateresistance_genes_and_candidateresistance_SNPs/4346279/1 BIOGRAPHIES OF AUTHORS Micheal Olaolu Arowolo, is a faculty of the Department of Computer Science at Landmark University, Omu-Aran Nigeria. He holds a Bachelor Degree from Al-Hikmah University, Ilorin, Nigeria and a Masters Degree from Kwara State University, Malete Nigeria, he is presently a PhD Student of Landmark University, Omu-Aran Nigeria. His area of research interest includes Machine Learning, Bioinformatics, Datamining, Cyber Security and Computer Arithmetic. He has published widely in local and international reputable journals, he is a member of IAENG, APISE, SDIWC, and an Oracle Certified Expert. Dr. Marion Olubunmi Adebiyi, is a faculty of the Department of Computer Science at Landmark University, Omu-Aran, Nigeria. She holds a BSc Degree from University of Ilorin, Ilorin Nigeria. She had her MSc and PhD Degree in Computer Science from Covenant University, Nigeria respectively. Her research interests include, Bioinformatics of Infectious (African) Diseases/ Population, Organism’s Inter-pathway analysis, High throughput data analytics, Homology modellin and Artificial Intelligence. She has published widely in local and international reputable journals She is a member of Nigerian Computer Society (NCS), the Computer Registration Council of Nigeria (CPN) and IEEE member. Professor Ayodele Ariyo Adebiyi, is a faculty and former Head of Department of Computer and Information Sciences, Covenant University, Ota Nigeria. He is currently the Head of Department of Computer Science at Landmark University, Omu-Aran, Nigeria, a sister University to Covenant University. He holds a BSc degree in Computer Science and MBA degree from University of Ilorin, Ilorin Nigeria. He had his MSc and PhD degree in Management Information System (MIS) from Covenant University, Nigeria respectively. His research interests include, application of soft computing techniques in solving real life problems, software engineering and information system research. He has successfully mentored and supervised several postgraduate students at Masters and PhD level. He has published widely in local and international reputable journals. He is a member of Nigerian Computer Society (NCS), the Computer Registration Council of Nigeria (CPN) and IEEE member.