SlideShare uma empresa Scribd logo
1 de 4
Baixar para ler offline
International Journal of Trend in Scientific Research and Development (IJTSRD)
Volume 3 Issue 5, August 2019 Available Online: www.ijtsrd.com e-ISSN: 2456 – 6470
@ IJTSRD | Unique Paper ID – IJTSRD26545 | Volume – 3 | Issue – 5 | July - August 2019 Page 1392
Using ID3 Decision Tree Algorithm to the
Student Grade Analysis and Prediction
Khin Khin Lay, San San Nwe
Associate Professor, University of Computer Studies, Maubin, Myanmar
How to cite this paper: Khin Khin Lay |
San San Nwe "Using ID3 Decision Tree
Algorithm to the Student Grade Analysis
and Prediction"
Published in
International
Journal of Trend in
Scientific Research
and Development
(ijtsrd), ISSN: 2456-
6470, Volume-3 |
Issue-5, August 2019, pp.1392-1395,
https://doi.org/10.31142/ijtsrd26545
Copyright © 2019 by author(s) and
International Journalof Trendin Scientific
Research and Development Journal. This
is an Open Access article distributed
under the terms of
the Creative
CommonsAttribution
License (CC BY 4.0)
(http://creativecommons.org/licenses/by
/4.0)
ABSTRACT
Data mining techniques play an important role in data analysis. For the
construction of a classification model which could predict performance of
students, particularly for engineering branches, a decision tree algorithm
associated with the data mining techniques have been used in the research. A
number of factors may affect the performance of students. Data mining
technology which can related to this student grade well and we also used
classification algorithms prediction. In this paper, we used educational data
mining to predict students' final grade based on their performance. We
proposed student data classification using ID3(Iterative Dichotomiser 3)
Decision Tree Algorithm.
KEYWORDS: Classification, ID3, Data Mining, Decision Tree, Predicting
Performance
I. INTRODUCTION
Educational data mining is an interesting research area which extracts useful,
previously unknown patterns from educational database for better
understanding, improved educational performance and assessment of the
student learning process (Surjeet & Saurabh, 2012). The main functionality of
data mining techniques is applying various methods and algorithms inorder to
discover and extract patterns of stored data. These interesting patterns are
presented to the user and may be stored as new knowledge in knowledge base.
Data mining has been used in areas such as database
systems, data warehousing,statistics,machinelearning, data
visualization, and information retrieval.
Data mining techniques have been introduced to new areas
including neural networks, patternsrecognition,spatialdata
analysis, image databases and many application fields such
as business, economics and bioinformatics. Some types of
data mining techniques are: Clustering, Association Rule
Mining, Neural Networks, Genetic Algorithms, Nearest
Neighbor Method, Classification Rule Mining, Decision trees
and many others. The outcome of their resultsindicated that
Decision Tree model had better prediction than other
models.
A decision tree is a flow-chart-liketreestructure,where each
internal node is denoted by rectangles, and leaf nodes are
denoted by ovals. All internal nodes have two or more child
nodes. All internal nodes contain splits, which test the value
of an expression of the attributes. Arcsfromaninternalnode
to its children are labelled with distinct outcomes of thetest.
Each leaf node has a class label associated with it.
Decision tree are commonly used forgaininginformation for
the purpose of decision -making. Decision tree starts with a
root node on which it is for users to take actions.
From this node, users split each node recursively according
to decision tree learning algorithm. The final result is a
decision tree in which each branch represents a possible
scenario of decision and its outcome (Surjeet & Saurabh,
2012).
In data mining, decision trees can be described also as the
combination of mathematical and computationaltechniques
to aid the description, categorization and generalization of a
given set of data. The four widely used decision treelearning
algorithms are: ID3, CART, CHAID and C4.5.
II. RELATED WORK
In order to predict the performance of students the
researcher took into consideration the work of other 14 A
Decision Tree Approach for Predicting Students Academic
Performance researchers that are in the same direction.
Other researchers have looked at the work of predicting
students’ performance by applying many approaches and
coming up with diverse results.
Three supervised data mining algorithms, i.e. Bayesian,
Decision trees and Neural Networks which were applied by
[1] on the preoperative assessmentdatatopredictsuccess in
a course (to produce result as either passed or failed) and
the performance of the learning methods were evaluated
based on their predictive accuracy, ease of learningand user
friendly characteristics. The researchers observed that that
this methodology can be used to help students and teachers
to improve student’s performance; reduce failing ratio by
taking appropriate steps at right time to improve the quality
of learning.
IJTSRD26545
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD26545 | Volume – 3 | Issue – 5 | July - August 2019 Page 1393
[2] compared four different classifiers and combined the
results into a multiple classifier. Their research divided the
data into three (3) different classes weighing the features
and using a genetic algorithm to minimize the error rate
improves the prediction accuracy at least 10% in the all
cases of 2, 3 and 9-Classes. In cases where the number of
features is low, the feature weighting worked much better
than feature selection. The successful optimization of
student classification in all three cases demonstrates the
merits of using the LON-CAPA data to predict the students‟
final grades based on their features, which are extracted
from the homework data. However, the research in this case
was based on an online course as opposed to the regular
classroom class that the present study considers.
Furthermore, [3] observed that in the problem of prediction
of performance, it is possible to automatically predict
students’ performance. Moreover by using extensible
classification formalism such as Bayesian networks, which
was employed in their research it becomes possibletoeasily
and uniformly integrate such knowledge into the learning
task. The researchers‟ experiments also show the need for
methods aimed at predicting performance and exploring
more learning algorithms.
Also, [8] used Iterative Dichotomiser 3 (ID3) decision tree
algorithm to predict the university students‟ grade of a
university in Nigeria. A prediction accuracy of 79,556 was
obtained from the model. They further suggested the use of
other decision based model to predict student’s
performance.
III. OUR PROPOSED METHOD
A. The ID3 Decision Tree
ID3 is a simple decision tree learning algorithm developed
by Ross Quinlan (1983). The basic idea of ID3 algorithmis to
construct the decision tree by employingatop-down,greedy
search through the given sets to test each attribute at every
tree node. In order to select the attribute that is most useful
for classifying a given sets, we introduce a metric-
information gain.
To find an optimal way to classify a learning set, what we
need to do is to minimize the questions asked (i.e.
minimizing the depth of the tree). Thus, we need some
function which can measure which questions provide the
most balanced splitting. The information gain metric is such
a function.
The basic idea of ID3 algorithm is to construct the decision
tree by employing a top-down, greedy search through the
given sets to test each attribute at every tree node. In order
to select the attribute that is most useful for classifying a
given sets, we introduce a metric - information gain. To find
an optimal way to classify a learning set we need some
function which provides the most balanced splitting. The
information gain metric is such a function. Given adatatable
that contains attributes and class of the attributes, we can
measure homogeneity of the table based on the classes. The
index used to measure degree of impurityisEntropy[2]. The
Entropy is calculated as follows: Splitting criteria used for
splitting of nodes of the tree is Information gain. To
determine the best attribute for a particular node in the tree
we use the measure called Information Gain.
B. Advantage of ID3
Understandable prediction rules are created from the
training data.
Builds the fastest tree.
Builds a short tree.
Only need to test enough attributes until all data is
classified.
Finding leaf nodes enables test data to be pruned,
reducing number of tests.
C. Disadvantage of ID3
Data may be over-fitted or over classified, if a small
sample is tested.
Only one attribute at a time is tested for making a
decision.
Classifying continuous data may be computationally
expensive, as many trees must be generated to see
where to break the continuum.
IV. Data Preparation
The first step in this paper is to collect data. It is importantto
select the most suitable attributes which influence the
student performance. We have training set of 30 under
graduate students. We were provided withatrainingdataset
consisting of information about students admitted to the
first year in Table I.
TableI Training Data Set
Sr. no. Roll no. Attend-ance Apti- tute Assign-ment Test Presentation Grade
1 IT1 Good Avg Yes Pass Good Excellent
2 IT2 Good Avg Yes Pass Good Excellent
3 IT 3 Good Avg Yes Pass Good Excellent
4 IT4 Good Avg Yes Pass Good Excellent
5 IT5 Good Avg Yes Pass Good Excellent
6 IT6 Avg Avg Yes Pass Avg Good
7 IT7 Poor Good Yes Pass Avg Good
8 IT8 Avg Good Yes Pass Avg Good
9 IT9 Avg Good Yes Pass Avg Good
10 IT10 Poor Poor No Fail Poor Fail
11 IT11 Poor Poor No Fail Poor Fail
12 IT12 Avg Age Yes Pass Age Good
13 IT13 Good Good Yes Pass Good Excellent
14 IT14 Good Good Yes Pass Good Excellent
15 IT15 Good Good Yes Pass Good Excellent
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD26545 | Volume – 3 | Issue – 5 | July - August 2019 Page 1394
16 IT16 Good Good Yes Pass Good Excellent
17 IT17 Good Avg Yes Pass Good Excellent
18 IT18 Good Avg Yes Pass Good Excellent
19 IT19 Good Avg Yes Pass Good Excellent
20 IT20 Good Poor Yes Pass Good Excellent
21 IT21 Good Poor Yes Pass Good Excellent
22 IT22 Good Poor Yes Pass Good Excellent
23 IT23 Good Poor Yes Pass Good Excellent
24 IT24 Good Poor Yes Pass Good Excellent
25 IT25 Poor Poor No Fail Poor Fail
26 IT26 Avg Good Yes Pass Avg Good
27 IT27 Poor Good No Fail Poor Fail
28 IT28 Good Good Yes Pass Good Excellent
29 IT29 Good Good Yes Pass Good Excellent
30 IT30 Good Good Yes Pass Good Excellent
To work out the information gain for A relative to S, we first
need to calculate the entropy of S(Grade). Here S(Grade) is a
set of 30 examples are 20“Excellent(Ex)”, 6 “Good(G)” and 4
“Fail(F)”.
Entropy(S) = - PEx log2(PEx) - PGlog2(PG) -PF log2(PF ) (1.1)
= - [20/30]log2[ 21/30] -[ 6/30]log2[ 6/30]
- [4/30]log2[ 4/30]
= 1.241946
To determine the best attribute for a particular node in the
tree we use the measure called Information Gain. The
information gain, Gain (S, A) of an attribute A in Table II,
relative to a collection of examples S,
Gain(S, Attendance) = Entropy(S)-│SG │Entropy(SG )
│S│
- Entropy(S)-│SAvg│Entropy(SAvg )
│S│
- Entropy(S)-│SPoor│Entropy(SPoor ) (1.2)
│S│
= 1.241946 - 0.1203213
= 1.1216247
Table II Information Gain Value Table
Gain Value
Gain(S, Attendance) 1.1216247
Gain(S, Aptitude) 0.234518
Gain(S, Assignment) 0.5665102
Gain(S, Test) 0.5665095
Gain(S, Presentation) 1.241946
Therefore, “Presentation” attribute is the decision attribute
in the root node. “Presentation” as root node has three
possible values – Good, Average, Poor. as shown in figure 1.
Figure1. Presentation as rood node
This process goes on until all data classified perfectly or run
out of attributes. The knowledge represented by decision
tree can be extracted and represented in the form of IF-
THEN rules in figure II.
IF Presentation = ‟Good” AND Attendance
= ‟ Good” THEN Grade = “Excellent”
IF Presentation = ‟Average” AND Test
= ‟ Pass” THEN Grade = “Good”
IF Presentation = ‟ Poor” AND Test
= ‟ Fail” THEN Grade = “Fail”
Figure2. Rule Set generated by Decision Tree
V. CONCLUSIONS
A classification model has been proposed in this study for
predicting student’s grades particularly for IT under
graduate students. In this paper, the classification task is
used on student database to predict the students division on
the basis of previous database. As there are many
approaches that are used for data classification, thedecision
tree method is used here. Information’s like Attendance,
Class test, Aptitude, Presentation and Assignment marks
were collected from the student’s previous database, to
predict the performance at the end of the semester.
REFERENCES
[1] Osmanbegovic E., Suljic M. “Data mining approach for
predicting student performance” Economic Review-
Journal of Economics and Business. Volume 10(1)
(2012)
[2] Behrouz, M, Karshy, D, Korlemeyer G, Punch, W.
“Predicting student performance: an application of
data Mining methods with the educational web-based
system” Lon-capa. 33rd ASEE/IEEE Frontiers in
Education Conference. Boulder C.O. USA, (2003).
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD26545 | Volume – 3 | Issue – 5 | July - August 2019 Page 1395
[3] Bekele, R., Menzel, W. “A bayesian approach to predict
performance of a student (BAPPS): A Case with
Ethiopian Students”. Journal of Information Science
(2013).
[4] Kovacic, Z. “Early prediction of studentsuccess:Mining
student enrollment data” Proceedings of Informing
Science & IT Education Conference. (2010).
[5] Surjeet K, Yadav, Bharadwaj, B. Pal B.” Data Mining
Applications: A comparative Study for Predicting
Student’s performance.” International journal of
innovative technology & creative engineering. Volume
1(12). (2012).
[6] Nnamani, C. N, Dikko, H. G and Kinta, L. M. “Impact of
students‟ financial strength on their academic
performance: Kaduna Polytechnic experience”. African
Research Review 8(1), (2014).
[7] Ogunde A.O., Ajibade D.A. “A data Mining System for
Predicting University Students F=Graduation Grade
Using ID3 Decision Tree approach”, Journal of
Computer Science and Information Technology,
Volume 2(1) (2014).
[8] Undavia, J. N., Dolia, P. M.; Shah, N. P. “Prediction of
Graduate Students for Master Degree based on Their
Past Performance using Decision Tree in Weka
Environment”. International Journal of Computer
Applications; Volume 74 (21), (2013).
https://www.wekatutorial.com/

Mais conteúdo relacionado

Mais procurados

An Empirical Study of the Applications of Classification Techniques in Studen...
An Empirical Study of the Applications of Classification Techniques in Studen...An Empirical Study of the Applications of Classification Techniques in Studen...
An Empirical Study of the Applications of Classification Techniques in Studen...
IJERA Editor
 
Vertical intent prediction approach based on Doc2vec and convolutional neural...
Vertical intent prediction approach based on Doc2vec and convolutional neural...Vertical intent prediction approach based on Doc2vec and convolutional neural...
Vertical intent prediction approach based on Doc2vec and convolutional neural...
IJECEIAES
 
Predicting students performance using classification techniques in data mining
Predicting students performance using classification techniques in data miningPredicting students performance using classification techniques in data mining
Predicting students performance using classification techniques in data mining
Lovely Professional University
 
11.software modules clustering an effective approach for reusability
11.software modules clustering an effective approach for  reusability11.software modules clustering an effective approach for  reusability
11.software modules clustering an effective approach for reusability
Alexander Decker
 

Mais procurados (18)

Assessment of Decision Tree Algorithms on Student’s Recital
Assessment of Decision Tree Algorithms on Student’s RecitalAssessment of Decision Tree Algorithms on Student’s Recital
Assessment of Decision Tree Algorithms on Student’s Recital
 
Evaluating the efficiency of rule techniques for file classification
Evaluating the efficiency of rule techniques for file classificationEvaluating the efficiency of rule techniques for file classification
Evaluating the efficiency of rule techniques for file classification
 
Evaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for fileEvaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for file
 
L016136369
L016136369L016136369
L016136369
 
An Empirical Study of the Applications of Classification Techniques in Studen...
An Empirical Study of the Applications of Classification Techniques in Studen...An Empirical Study of the Applications of Classification Techniques in Studen...
An Empirical Study of the Applications of Classification Techniques in Studen...
 
Analyzing undergraduate students’ performance in various perspectives using d...
Analyzing undergraduate students’ performance in various perspectives using d...Analyzing undergraduate students’ performance in various perspectives using d...
Analyzing undergraduate students’ performance in various perspectives using d...
 
IRJET- Student Placement Prediction using Machine Learning
IRJET- Student Placement Prediction using Machine LearningIRJET- Student Placement Prediction using Machine Learning
IRJET- Student Placement Prediction using Machine Learning
 
27 11 sep17 29aug 8513 9956-1-ed (edit)
27 11 sep17 29aug 8513 9956-1-ed (edit)27 11 sep17 29aug 8513 9956-1-ed (edit)
27 11 sep17 29aug 8513 9956-1-ed (edit)
 
Ijciet 10 02_007
Ijciet 10 02_007Ijciet 10 02_007
Ijciet 10 02_007
 
Vertical intent prediction approach based on Doc2vec and convolutional neural...
Vertical intent prediction approach based on Doc2vec and convolutional neural...Vertical intent prediction approach based on Doc2vec and convolutional neural...
Vertical intent prediction approach based on Doc2vec and convolutional neural...
 
Analysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry SystemAnalysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry System
 
Predicting students performance using classification techniques in data mining
Predicting students performance using classification techniques in data miningPredicting students performance using classification techniques in data mining
Predicting students performance using classification techniques in data mining
 
11.software modules clustering an effective approach for reusability
11.software modules clustering an effective approach for  reusability11.software modules clustering an effective approach for  reusability
11.software modules clustering an effective approach for reusability
 
Educational Data Mining to Analyze Students Performance – Concept Plan
Educational Data Mining to Analyze Students Performance – Concept PlanEducational Data Mining to Analyze Students Performance – Concept Plan
Educational Data Mining to Analyze Students Performance – Concept Plan
 
Data Mining Application in Advertisement Management of Higher Educational Ins...
Data Mining Application in Advertisement Management of Higher Educational Ins...Data Mining Application in Advertisement Management of Higher Educational Ins...
Data Mining Application in Advertisement Management of Higher Educational Ins...
 
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARECLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
 
[IJCT-V3I2P26] Authors: Sunny Sharma
[IJCT-V3I2P26] Authors: Sunny Sharma[IJCT-V3I2P26] Authors: Sunny Sharma
[IJCT-V3I2P26] Authors: Sunny Sharma
 
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
 

Semelhante a Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction

Distributed Digital Artifacts on the Semantic Web
Distributed Digital Artifacts on the Semantic WebDistributed Digital Artifacts on the Semantic Web
Distributed Digital Artifacts on the Semantic Web
Editor IJCATR
 
EDM_IJTIR_Article_201504020
EDM_IJTIR_Article_201504020EDM_IJTIR_Article_201504020
EDM_IJTIR_Article_201504020
Ritika Saxena
 
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerStudy and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
IJERA Editor
 
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...
IIRindia
 
A Survey of Modern Data Classification Techniques
A Survey of Modern Data Classification TechniquesA Survey of Modern Data Classification Techniques
A Survey of Modern Data Classification Techniques
ijsrd.com
 

Semelhante a Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction (20)

Student Performance Evaluation in Education Sector Using Prediction and Clust...
Student Performance Evaluation in Education Sector Using Prediction and Clust...Student Performance Evaluation in Education Sector Using Prediction and Clust...
Student Performance Evaluation in Education Sector Using Prediction and Clust...
 
Using Naive Bayesian Classifier for Predicting Performance of a Student
Using Naive Bayesian Classifier for Predicting Performance of a StudentUsing Naive Bayesian Classifier for Predicting Performance of a Student
Using Naive Bayesian Classifier for Predicting Performance of a Student
 
IRJET- Analysis of Student Performance using Machine Learning Techniques
IRJET- Analysis of Student Performance using Machine Learning TechniquesIRJET- Analysis of Student Performance using Machine Learning Techniques
IRJET- Analysis of Student Performance using Machine Learning Techniques
 
Distributed Digital Artifacts on the Semantic Web
Distributed Digital Artifacts on the Semantic WebDistributed Digital Artifacts on the Semantic Web
Distributed Digital Artifacts on the Semantic Web
 
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYCLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
 
IRJET- Performance for Student Higher Education using Decision Tree to Predic...
IRJET- Performance for Student Higher Education using Decision Tree to Predic...IRJET- Performance for Student Higher Education using Decision Tree to Predic...
IRJET- Performance for Student Higher Education using Decision Tree to Predic...
 
Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...
Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...
Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...
 
EDM_IJTIR_Article_201504020
EDM_IJTIR_Article_201504020EDM_IJTIR_Article_201504020
EDM_IJTIR_Article_201504020
 
M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...
M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...
M-Learners Performance Using Intelligence and Adaptive E-Learning Classify th...
 
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerStudy and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
 
Analysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry SystemAnalysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry System
 
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
 
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
 
Clustering Students of Computer in Terms of Level of Programming
Clustering Students of Computer in Terms of Level of ProgrammingClustering Students of Computer in Terms of Level of Programming
Clustering Students of Computer in Terms of Level of Programming
 
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...
Performance Evaluation of Feature Selection Algorithms in Educational Data Mi...
 
Extending the Student’s Performance via K-Means and Blended Learning
Extending the Student’s Performance via K-Means and Blended Learning Extending the Student’s Performance via K-Means and Blended Learning
Extending the Student’s Performance via K-Means and Blended Learning
 
A Survey of Modern Data Classification Techniques
A Survey of Modern Data Classification TechniquesA Survey of Modern Data Classification Techniques
A Survey of Modern Data Classification Techniques
 
Review of Algorithms for Crime Analysis & Prediction
Review of Algorithms for Crime Analysis & PredictionReview of Algorithms for Crime Analysis & Prediction
Review of Algorithms for Crime Analysis & Prediction
 
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
 
IJET-V2I6P22
IJET-V2I6P22IJET-V2I6P22
IJET-V2I6P22
 

Mais de ijtsrd

‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation
ijtsrd
 
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and ProspectsDynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
ijtsrd
 
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
ijtsrd
 
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
ijtsrd
 
Problems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A StudyProblems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A Study
ijtsrd
 
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
ijtsrd
 
A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...
ijtsrd
 
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
ijtsrd
 
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
ijtsrd
 
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. SadikuSustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
ijtsrd
 
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
ijtsrd
 
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
ijtsrd
 
Activating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment MapActivating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment Map
ijtsrd
 
Educational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger SocietyEducational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger Society
ijtsrd
 
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
ijtsrd
 

Mais de ijtsrd (20)

‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation
 
Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...
 
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and ProspectsDynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
 
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
 
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
 
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
 
Problems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A StudyProblems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A Study
 
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
 
The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...
 
A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...
 
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
 
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
 
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. SadikuSustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
 
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
 
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
 
Activating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment MapActivating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment Map
 
Educational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger SocietyEducational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger Society
 
Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...
 
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
 
Streamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine LearningStreamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine Learning
 

Último

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 

Último (20)

Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 

Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction

  • 1. International Journal of Trend in Scientific Research and Development (IJTSRD) Volume 3 Issue 5, August 2019 Available Online: www.ijtsrd.com e-ISSN: 2456 – 6470 @ IJTSRD | Unique Paper ID – IJTSRD26545 | Volume – 3 | Issue – 5 | July - August 2019 Page 1392 Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction Khin Khin Lay, San San Nwe Associate Professor, University of Computer Studies, Maubin, Myanmar How to cite this paper: Khin Khin Lay | San San Nwe "Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456- 6470, Volume-3 | Issue-5, August 2019, pp.1392-1395, https://doi.org/10.31142/ijtsrd26545 Copyright © 2019 by author(s) and International Journalof Trendin Scientific Research and Development Journal. This is an Open Access article distributed under the terms of the Creative CommonsAttribution License (CC BY 4.0) (http://creativecommons.org/licenses/by /4.0) ABSTRACT Data mining techniques play an important role in data analysis. For the construction of a classification model which could predict performance of students, particularly for engineering branches, a decision tree algorithm associated with the data mining techniques have been used in the research. A number of factors may affect the performance of students. Data mining technology which can related to this student grade well and we also used classification algorithms prediction. In this paper, we used educational data mining to predict students' final grade based on their performance. We proposed student data classification using ID3(Iterative Dichotomiser 3) Decision Tree Algorithm. KEYWORDS: Classification, ID3, Data Mining, Decision Tree, Predicting Performance I. INTRODUCTION Educational data mining is an interesting research area which extracts useful, previously unknown patterns from educational database for better understanding, improved educational performance and assessment of the student learning process (Surjeet & Saurabh, 2012). The main functionality of data mining techniques is applying various methods and algorithms inorder to discover and extract patterns of stored data. These interesting patterns are presented to the user and may be stored as new knowledge in knowledge base. Data mining has been used in areas such as database systems, data warehousing,statistics,machinelearning, data visualization, and information retrieval. Data mining techniques have been introduced to new areas including neural networks, patternsrecognition,spatialdata analysis, image databases and many application fields such as business, economics and bioinformatics. Some types of data mining techniques are: Clustering, Association Rule Mining, Neural Networks, Genetic Algorithms, Nearest Neighbor Method, Classification Rule Mining, Decision trees and many others. The outcome of their resultsindicated that Decision Tree model had better prediction than other models. A decision tree is a flow-chart-liketreestructure,where each internal node is denoted by rectangles, and leaf nodes are denoted by ovals. All internal nodes have two or more child nodes. All internal nodes contain splits, which test the value of an expression of the attributes. Arcsfromaninternalnode to its children are labelled with distinct outcomes of thetest. Each leaf node has a class label associated with it. Decision tree are commonly used forgaininginformation for the purpose of decision -making. Decision tree starts with a root node on which it is for users to take actions. From this node, users split each node recursively according to decision tree learning algorithm. The final result is a decision tree in which each branch represents a possible scenario of decision and its outcome (Surjeet & Saurabh, 2012). In data mining, decision trees can be described also as the combination of mathematical and computationaltechniques to aid the description, categorization and generalization of a given set of data. The four widely used decision treelearning algorithms are: ID3, CART, CHAID and C4.5. II. RELATED WORK In order to predict the performance of students the researcher took into consideration the work of other 14 A Decision Tree Approach for Predicting Students Academic Performance researchers that are in the same direction. Other researchers have looked at the work of predicting students’ performance by applying many approaches and coming up with diverse results. Three supervised data mining algorithms, i.e. Bayesian, Decision trees and Neural Networks which were applied by [1] on the preoperative assessmentdatatopredictsuccess in a course (to produce result as either passed or failed) and the performance of the learning methods were evaluated based on their predictive accuracy, ease of learningand user friendly characteristics. The researchers observed that that this methodology can be used to help students and teachers to improve student’s performance; reduce failing ratio by taking appropriate steps at right time to improve the quality of learning. IJTSRD26545
  • 2. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD26545 | Volume – 3 | Issue – 5 | July - August 2019 Page 1393 [2] compared four different classifiers and combined the results into a multiple classifier. Their research divided the data into three (3) different classes weighing the features and using a genetic algorithm to minimize the error rate improves the prediction accuracy at least 10% in the all cases of 2, 3 and 9-Classes. In cases where the number of features is low, the feature weighting worked much better than feature selection. The successful optimization of student classification in all three cases demonstrates the merits of using the LON-CAPA data to predict the students‟ final grades based on their features, which are extracted from the homework data. However, the research in this case was based on an online course as opposed to the regular classroom class that the present study considers. Furthermore, [3] observed that in the problem of prediction of performance, it is possible to automatically predict students’ performance. Moreover by using extensible classification formalism such as Bayesian networks, which was employed in their research it becomes possibletoeasily and uniformly integrate such knowledge into the learning task. The researchers‟ experiments also show the need for methods aimed at predicting performance and exploring more learning algorithms. Also, [8] used Iterative Dichotomiser 3 (ID3) decision tree algorithm to predict the university students‟ grade of a university in Nigeria. A prediction accuracy of 79,556 was obtained from the model. They further suggested the use of other decision based model to predict student’s performance. III. OUR PROPOSED METHOD A. The ID3 Decision Tree ID3 is a simple decision tree learning algorithm developed by Ross Quinlan (1983). The basic idea of ID3 algorithmis to construct the decision tree by employingatop-down,greedy search through the given sets to test each attribute at every tree node. In order to select the attribute that is most useful for classifying a given sets, we introduce a metric- information gain. To find an optimal way to classify a learning set, what we need to do is to minimize the questions asked (i.e. minimizing the depth of the tree). Thus, we need some function which can measure which questions provide the most balanced splitting. The information gain metric is such a function. The basic idea of ID3 algorithm is to construct the decision tree by employing a top-down, greedy search through the given sets to test each attribute at every tree node. In order to select the attribute that is most useful for classifying a given sets, we introduce a metric - information gain. To find an optimal way to classify a learning set we need some function which provides the most balanced splitting. The information gain metric is such a function. Given adatatable that contains attributes and class of the attributes, we can measure homogeneity of the table based on the classes. The index used to measure degree of impurityisEntropy[2]. The Entropy is calculated as follows: Splitting criteria used for splitting of nodes of the tree is Information gain. To determine the best attribute for a particular node in the tree we use the measure called Information Gain. B. Advantage of ID3 Understandable prediction rules are created from the training data. Builds the fastest tree. Builds a short tree. Only need to test enough attributes until all data is classified. Finding leaf nodes enables test data to be pruned, reducing number of tests. C. Disadvantage of ID3 Data may be over-fitted or over classified, if a small sample is tested. Only one attribute at a time is tested for making a decision. Classifying continuous data may be computationally expensive, as many trees must be generated to see where to break the continuum. IV. Data Preparation The first step in this paper is to collect data. It is importantto select the most suitable attributes which influence the student performance. We have training set of 30 under graduate students. We were provided withatrainingdataset consisting of information about students admitted to the first year in Table I. TableI Training Data Set Sr. no. Roll no. Attend-ance Apti- tute Assign-ment Test Presentation Grade 1 IT1 Good Avg Yes Pass Good Excellent 2 IT2 Good Avg Yes Pass Good Excellent 3 IT 3 Good Avg Yes Pass Good Excellent 4 IT4 Good Avg Yes Pass Good Excellent 5 IT5 Good Avg Yes Pass Good Excellent 6 IT6 Avg Avg Yes Pass Avg Good 7 IT7 Poor Good Yes Pass Avg Good 8 IT8 Avg Good Yes Pass Avg Good 9 IT9 Avg Good Yes Pass Avg Good 10 IT10 Poor Poor No Fail Poor Fail 11 IT11 Poor Poor No Fail Poor Fail 12 IT12 Avg Age Yes Pass Age Good 13 IT13 Good Good Yes Pass Good Excellent 14 IT14 Good Good Yes Pass Good Excellent 15 IT15 Good Good Yes Pass Good Excellent
  • 3. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD26545 | Volume – 3 | Issue – 5 | July - August 2019 Page 1394 16 IT16 Good Good Yes Pass Good Excellent 17 IT17 Good Avg Yes Pass Good Excellent 18 IT18 Good Avg Yes Pass Good Excellent 19 IT19 Good Avg Yes Pass Good Excellent 20 IT20 Good Poor Yes Pass Good Excellent 21 IT21 Good Poor Yes Pass Good Excellent 22 IT22 Good Poor Yes Pass Good Excellent 23 IT23 Good Poor Yes Pass Good Excellent 24 IT24 Good Poor Yes Pass Good Excellent 25 IT25 Poor Poor No Fail Poor Fail 26 IT26 Avg Good Yes Pass Avg Good 27 IT27 Poor Good No Fail Poor Fail 28 IT28 Good Good Yes Pass Good Excellent 29 IT29 Good Good Yes Pass Good Excellent 30 IT30 Good Good Yes Pass Good Excellent To work out the information gain for A relative to S, we first need to calculate the entropy of S(Grade). Here S(Grade) is a set of 30 examples are 20“Excellent(Ex)”, 6 “Good(G)” and 4 “Fail(F)”. Entropy(S) = - PEx log2(PEx) - PGlog2(PG) -PF log2(PF ) (1.1) = - [20/30]log2[ 21/30] -[ 6/30]log2[ 6/30] - [4/30]log2[ 4/30] = 1.241946 To determine the best attribute for a particular node in the tree we use the measure called Information Gain. The information gain, Gain (S, A) of an attribute A in Table II, relative to a collection of examples S, Gain(S, Attendance) = Entropy(S)-│SG │Entropy(SG ) │S│ - Entropy(S)-│SAvg│Entropy(SAvg ) │S│ - Entropy(S)-│SPoor│Entropy(SPoor ) (1.2) │S│ = 1.241946 - 0.1203213 = 1.1216247 Table II Information Gain Value Table Gain Value Gain(S, Attendance) 1.1216247 Gain(S, Aptitude) 0.234518 Gain(S, Assignment) 0.5665102 Gain(S, Test) 0.5665095 Gain(S, Presentation) 1.241946 Therefore, “Presentation” attribute is the decision attribute in the root node. “Presentation” as root node has three possible values – Good, Average, Poor. as shown in figure 1. Figure1. Presentation as rood node This process goes on until all data classified perfectly or run out of attributes. The knowledge represented by decision tree can be extracted and represented in the form of IF- THEN rules in figure II. IF Presentation = ‟Good” AND Attendance = ‟ Good” THEN Grade = “Excellent” IF Presentation = ‟Average” AND Test = ‟ Pass” THEN Grade = “Good” IF Presentation = ‟ Poor” AND Test = ‟ Fail” THEN Grade = “Fail” Figure2. Rule Set generated by Decision Tree V. CONCLUSIONS A classification model has been proposed in this study for predicting student’s grades particularly for IT under graduate students. In this paper, the classification task is used on student database to predict the students division on the basis of previous database. As there are many approaches that are used for data classification, thedecision tree method is used here. Information’s like Attendance, Class test, Aptitude, Presentation and Assignment marks were collected from the student’s previous database, to predict the performance at the end of the semester. REFERENCES [1] Osmanbegovic E., Suljic M. “Data mining approach for predicting student performance” Economic Review- Journal of Economics and Business. Volume 10(1) (2012) [2] Behrouz, M, Karshy, D, Korlemeyer G, Punch, W. “Predicting student performance: an application of data Mining methods with the educational web-based system” Lon-capa. 33rd ASEE/IEEE Frontiers in Education Conference. Boulder C.O. USA, (2003).
  • 4. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD26545 | Volume – 3 | Issue – 5 | July - August 2019 Page 1395 [3] Bekele, R., Menzel, W. “A bayesian approach to predict performance of a student (BAPPS): A Case with Ethiopian Students”. Journal of Information Science (2013). [4] Kovacic, Z. “Early prediction of studentsuccess:Mining student enrollment data” Proceedings of Informing Science & IT Education Conference. (2010). [5] Surjeet K, Yadav, Bharadwaj, B. Pal B.” Data Mining Applications: A comparative Study for Predicting Student’s performance.” International journal of innovative technology & creative engineering. Volume 1(12). (2012). [6] Nnamani, C. N, Dikko, H. G and Kinta, L. M. “Impact of students‟ financial strength on their academic performance: Kaduna Polytechnic experience”. African Research Review 8(1), (2014). [7] Ogunde A.O., Ajibade D.A. “A data Mining System for Predicting University Students F=Graduation Grade Using ID3 Decision Tree approach”, Journal of Computer Science and Information Technology, Volume 2(1) (2014). [8] Undavia, J. N., Dolia, P. M.; Shah, N. P. “Prediction of Graduate Students for Master Degree based on Their Past Performance using Decision Tree in Weka Environment”. International Journal of Computer Applications; Volume 74 (21), (2013). https://www.wekatutorial.com/