SlideShare uma empresa Scribd logo
1 de 11
Baixar para ler offline
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
DOI: 10.5121/ijbb.2014.4101 1
A NEW SDM CLASSIFIER USING JACCARD
MINING PROCEDURE
CASE STUDY: RHEUMATIC FEVER DATA
1
M.E. Abd El-Monsef, 2
E. A. Rady, 3
A. M. Kozea,
4
W. A. Hassanein, 5
S. Abd El-Badie
1,3,4,5
Mathematics Department, Faculty of Science, Tanta University, Tanta, Egypt
2
Institute of Statistical Studies & Research (ISSR), Cairo University, Cairo, Egypt
ABSTRACT
In this paper, a new Statistical Data Mining (SDM) technique is proposed using Jaccard Mining Procedure
(JMP) contributing a novel classifier & predictor by applying very effective stages on the training data
depending on Jaccard (J) distance matrix Linked with the Gini Index Measure as precision measure for
initiating a new classifier and a new predictor, The proposed SDM technique using JMP is applied and
examined on a Rheumatic Fever Data to demonstrate its applicability.
KEYWORDS
Jaccard, Classifier, Mining, Gini, Rheumatic Fever, Data
1. INTRODUCTION
Classification [3, 4, 5, 7], is a data mining function that assigns items in a collection to target
categories or classes. The goal of classification is to accurately predict the target class for each
case in the data. For example, a classification model could be used to identify loan applicants as
low, medium, or high credit risks.
A classification task begins with a data set in which the class assignments are known. For
example, a classification model that predicts credit risk could be developed based on observed
data for many loan applicants over a period of time. In addition to the historical credit rating, the
data might track employment history, home ownership or rental, years of residence, number and
type of investments, and so on. Credit rating would be the target, the other attributes would be the
predictors, and the data for each customer would constitute a case.
The simplest type of classification problem is binary classification. In binary classification, the
target attribute has only two possible values: for example, high credit rating or low credit rating.
Multiclass targets have more than two values: for example, low, medium, high, or unknown credit
rating.
Different classification algorithms [7] use different techniques for finding relationships. These
relationships are summarized in a model, which can then be applied to a different data set in
which the class assignments are unknown. Classification has many applications in customer
segmentation, business modeling, marketing, credit analysis, and biomedical and drug response
modeling.
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
2
There is a difference between the definition of the algorithm and the flowchart [6], an algorithm is
just a detailed sequence of simple steps that are needed to solve a problem, from the other side; a
flowchart is a graphical representation of an algorithm. So we will introduce the flowchart and the
algorithm o our paper.
The sequence of this paper is organized as follows; Section 2 represents the materials & methods
which are subtitled into, Rheumatic Fever Data Characteristics, Jaccard distance (J) & Gini Index.
Section 3 presents the JMP algorithm, JMP flow chart and JMP systematic structure stages using
Jaccard distance, Jaccard classes, Jaccard classifiers linking the previous stages with the Gini
Index as precision measures for initiating a new classifier and concluding a new predictor.
Finally, the conclusion and the future work are introduced in Section 4.
2. JMP MATERIALS & METHODS
2.1 Rheumatic Fever Data: Characteristic
No doubt that the Rheumatic Fever is a very common disease [2] and it has many symptoms
differs from patient to another though the diagnosis is the same. So, we obtained the following
example on seven rheumatic fever patients from Tanta University Hospital, Egypt. All patients
are between 9-12 years old with history of Arthritis began from age 3-5 years. This disease has
many symptoms and it is usually started in young age and still with the patient along his life.
Table (1) introduced seven patients characterized by 8 symptoms (attributes) [1] using them to
decide the diagnosis for each patient (decision attribute). Table (2) introduces the training coded
data. Where, {S, F, A, R, K, E, P, H} are the conditional attributes, {P1, P2, P3,…, P7} in Table
(2) are the Rheumatic Fever training data objects and the diagnosis attribute (D) as the decision
attribute.
Table (1): Rheumatic Fever Data Description
At. Name ِ◌At. Symbol At. Code At. Description
Sex S
1 Male
2 Female
Pharyngitis F
1 No
2 Yes
Arthritis A
1 No Arthritis
2 Arthritis
Carditis R
1 Not Affected
2 Affected
Chorea K
1 No
2 Yes
ESR E
1 Normal
2 High
Abdominal
Pain
P
1 No
2 Yes
Headache H
1 No
2 Yes
Diagnosis D
1 Rheumatic Arthritis
2 Rheumatic Carditis
3 Rheumatic Arthritis &Carditis
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
3
S F A R K E P H D
P1 2 2 2 2 2 1 1 1 3
P2 1 2 2 2 2 2 1 2 3
P3 2 2 2 2 1 1 1 1 3
P4 1 2 2 1 1 1 1 1 1
P5 1 1 1 2 1 1 2 1 2
P6 2 2 2 2 1 2 1 1 3
P7 1 2 2 2 1 1 1 2 3
Table (2): Coded Rheumatic Fever Data
2.2 Jaccard Distance (JDist.)
It is very important to note that, Jaccard coefficient is a measure for similarity between two
variables and Jaccard distance [3] is a measure of dissimilarity and both of them are measurement
of asymmetric information on binary and non-binary variables. The definition of the Jaccard
similarity coefficient and the Jaccard distance are as follows;
Def.1: Jaccard similarity between binary variables A and B
۸‫ܕܑ܁‬ሺ‫,ۯ‬ ۰ሻ =
‫۾‬ሺ‫۰∩ۯ‬ሻ
‫۾‬ሺ‫۰∪ۯ‬ሻ
Eq. (1)
Def.2: Jaccard Distance between binary variables A and B
۸۲ܑ‫.ܜܛ‬ሺ‫,ۯ‬ ۰ሻ = ૚ − ۸‫ܕܑ܁‬ሺ‫,ۯ‬ ۰ሻ Eq. (2)
Example: (Jaccard Distance Calculations)
S F
P1 2 2
P2 1 2
P3 2 2
P4 1 2
P5 1 1
P6 2 2
P7 1 2
JSim (S, F) = 4/7 → JDist (S, F) = 3/7= 0.43
Similarly for the entire conditional attributes as in Table (3)
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
4
2.3 Gini Index
The Gini index [3], measures the impurity of the target attribute, so we will use it as a precision
method in our paper. The Gini index of a data partition or set of training tuples, as
۵ܑ‫	ܑܖ‬ሺ‫ۯ‬ሻ = ૚ − ∑ ‫۾‬‫ܒ‬
૛‫ܖ‬
‫ܒ‬ୀ૚ Eq. (3)
Where pj is the relative frequency of class j in A.
The Gini index considers a binary split for each attribute. Let’s first consider the case
where B is a discrete-valued attribute having v distinct values, {b1, b2… bv}, occurring in
A. When considering a binary split, we compute a weighted sum of the impurity of each
resulting partition. For example, if a binary split on B partitions the attribute A into A1
and A2, the Gini index of the attribute A given that partitioning is,
۵ܑ‫ܑܖ‬۰ሺ‫ۯ‬ሻ =
|‫ۯ‬૚|
|‫|ۯ‬
	۵ܑ‫ܑܖ‬ሺ‫ۯ‬૚ሻ +
|‫ۯ‬૛|
|‫|ۯ‬
	۵ܑ‫ܑܖ‬ሺ‫ۯ‬૛ሻ					 Eq. (4)
In General the Gini of the split formula is,
۵ܑ‫	ܑܖ‬ሺ‫ۯ‬ሻ = ૚ −	∑ [ࡼሺ࢐ ࡭ሻ]⁄ ૛
࢐ 																													 Eq. (5)
The most important characteristics of the Gini Index that,
1) It varies between 0 and 1.
2) If Gini = 0 → Complete Equality
3) If Gini = 1 → Complete Inequality
4) If 0 < Gini < 1 → the higher the Gini index, the greater the inequality.
5) Biased to multivalued attributes.
6) Has difficulty when number of classes is large.
7) Tends to favor tests that result in equal-sized partitions and purity in both partitions.
Example: (Gini Index Calculations)
Calculating the Gini of the diagnosis attribute (D) using Eq. (3),
( ) ( ) ( ) 45.0]7/57/17/1[1)( 222
=++−=DGINI
& calculating the Gini of the attribute (S) using Eq. (4),
( ) ( ) 5.0]4/24/2[1)( 22
1 =+−=sGINI & ( ) ( ) 0]2/03/3[1)( 22
2 =+−=sGINI
→ )(SGINI = [(4/7) (0.5) + (3/7) (0)] = 0.286
Similarly for the entire conditional attributes as in Table (6).
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
5
3. JMP Systematic Structure Stages
3.1 JMP Algorithm
In this section the JMP algorithm steps will be introduced in the following.
Input
Training Data = ),( DAU U
Processing
1. {}←J
2. Do
3. JClassesJ ←
4. ClassesJrsClasssifieJ ←
5. {}←IndexGini
5. IndexGinirsClasssifieJ ←
6. IndexGiniAveragesIndexGini ←
7. AveragesIndexGiniClassifierJMPBest ←
Output
Best JMP Classifier & JMP Predictor
Where,
• U is the universe of all attributes, A are the conditional attributes & D is the decision
attribute
• J is the Jaccard Matrix
3.2 JMP Flow Chart
In this section the flow chart of the JMP Systematic Structure Stages
flowchart Figure (1).
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
6
Figure (1): JMP Flow Chart
3.3 Jaccard Matrix
The 1st
stage in our JMP, Calculating Jaccard matrix of the Rheumatic Fever Data conditional
attributes of Table (2).The results are given below in Table (3) Approximated to 2 decimals.
S F A R K E P H
S 0.00 0.43 0.43 0.43 0.43 0.43 0.57 0.71
F 0.43 0.00 0.00 0.29 0.57 0.57 1.00 0.57
A 0.43 0.00 0.00 0.29 0.57 0.57 1.00 0.57
R 0.43 0.29 0.29 0.00 0.57 0.57 0.71 0.57
K 0.43 0.57 0.57 0.57 0.00 0.29 0.43 0.29
E 0.43 0.57 0.57 0.57 0.29 0.00 0.43 0.29
P 0.57 1.00 1.00 0.71 0.43 0.43 0.00 0.43
H 0.71 0.57 0.57 0.57 0.29 0.29 0.43 0.00
Table (3): Rheumatic Fever Conditional Attributes Jaccard Matrix
3.4 Jaccard Classes
The next stage in our classifier to divide the values of the Jaccard matrix of Table (3) into three
classes; (J = 0, 0 < J ≤ 0.5, 0.5 < J ≤ 1) and obtaining the conditional attributes classes introducing
the output of this stage in Table (4) of the conditional attributes Jaccard classes.
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
7
J= 0 0 < J ≤ 0.5 0.5 < J ≤ 1
S {ܵ} {‫,ܨ‬ ‫,ܣ‬ ܴ, ‫,ܭ‬ ‫}ܧ‬ {ܲ, ‫}ܪ‬
F {‫,ܨ‬ ‫}ܣ‬ {ܵ, ܴ} {‫,ܭ‬ ‫,ܧ‬ ܲ, ‫}ܪ‬
A {‫,ܨ‬ ‫}ܣ‬ {ܵ, ܴ} {‫,ܭ‬ ‫,ܧ‬ ܲ, ‫}ܪ‬
R {ܴ} {ܵ, ‫,ܨ‬ ‫}ܣ‬ {‫,ܭ‬ ‫,ܧ‬ ܲ, ‫}ܪ‬
K {‫}ܭ‬ {ܵ, ‫,ܧ‬ ܲ, ‫}ܪ‬ {‫,ܨ‬ ‫,ܣ‬ ܴ}
E {‫}ܧ‬ {ܵ, ‫,ܭ‬ ܲ, ‫}ܪ‬ {‫,ܨ‬ ‫,ܣ‬ ܴ}
P {ܲ} {‫,ܭ‬ ‫,ܧ‬ ‫}ܪ‬ {ܵ, ‫,ܨ‬ ‫,ܣ‬ ܴ}
H {‫}ܪ‬ {‫,ܭ‬ ‫,ܧ‬ ܲ} {ܵ, ‫,ܨ‬ ‫,ܣ‬ ܴ}
Table (4): Conditional Attributes Jaccard Classes3.5 JMP Classifiers
3.5 JMP Classifiers
Comparing the conditional attributes row classes of Table (4) to get the Jaccard classifiers in
Table (5).
J= 0 0 < J ≤ 0.5 0.5 < J ≤ 1
{ܵ} {ܵ} {ܵ}
{‫,ܨ‬ ‫}ܣ‬ {‫,ܨ‬ ‫}ܣ‬ {‫,ܨ‬ ‫,ܣ‬ ܴ}
{ܴ} {ܴ} {‫,ܭ‬ ‫}ܧ‬
{‫}ܭ‬ {‫}ܭ‬ {ܲ, ‫}ܪ‬
{‫}ܧ‬ {‫}ܧ‬
{ܲ} {ܲ}
{‫}ܪ‬ {‫}ܪ‬
Table (5): Conditional Attributes Jaccard Classifier
One can note that the mining Jaccard classifier is, 0.5 < J ≤ 1 classifier which has the least
number of sets and the division of the attribute sets are very appropriate with the medical sight.
Deciding which the best classifier is depends on the following calculations.
3.6 Rheumatic Fever Data Gini Index
Applying the Gini index rules in Section (2.3) on the data in Table (2), the values of the Gini
Index of each attribute is calculated and the results are summarized and approximated to two
decimal spaces in Table (6) and Chart (1) using the frequencies of all attributes which are
represented in Figure (2) respectively.
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
8
Table (6): Rheumatic Fever Gini Index
Chart (1): Rheumatic Fever Attributes Gini Index
Figure (2): Rheumatic Fever Data Frequencies
0
0.1
0.2
0.3
0.4
0.5
S F A R K E P H D
GiniIndex
Attributes
Gini IndexAt.
0.29S
0.24F
0.21A
0.24R
0.34K
0.34E
0.24P
0.19H
0.45D
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
3.7 Best JMP Classifier
Deciding the best Jaccard classifier of Table (5) depends on the conditional attributes Gini Index
of Table (6). So, the JMP classifier will be 0.5 < J
3.8 JMP Predictor
This stage to determine which set of the best JMP classifier sets to be the JMP predictor which we
can use it with any prediction technique for predicti
calculating the Gini index averges for each set of the best JMP classifier of Table (7)
in Table (8) and Chart (2).
Table (8): Best JMP Classifier Gini Averages
Chart (2): Best JMP Classifier Gini Averages
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.45
0.5
{S}
GiniIndexAverges
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
Deciding the best Jaccard classifier of Table (5) depends on the conditional attributes Gini Index
of Table (6). So, the JMP classifier will be 0.5 < J ≤ 1 the best JMP classifier as in Table (7).
0<J ≤ 1
{ܵ}
{‫,ܨ‬ ‫,ܣ‬ ܴሽ
ሼ‫,ܭ‬ ‫ܧ‬ሽ
ሼܲ, ‫ܪ‬ሽ
Table (7): Best JMP Classifier
This stage to determine which set of the best JMP classifier sets to be the JMP predictor which we
can use it with any prediction technique for predicting the diagnosis for any test data, by
calculating the Gini index averges for each set of the best JMP classifier of Table (7)
Gini Average0 ≤ J ≤ 1
0.29ሼܵሽ
0.23ሼ‫,ܨ‬ ‫,ܣ‬ ܴሽ
0.34ሼ‫,ܭ‬ ‫ܧ‬ሽ
0.22ሼܲ, ‫ܪ‬ሽ
0.45D
Table (8): Best JMP Classifier Gini Averages
Best JMP Classifier
Chart (2): Best JMP Classifier Gini Averages
{F,A,R} {K,E} {P,H} D
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
9
Deciding the best Jaccard classifier of Table (5) depends on the conditional attributes Gini Index
1 the best JMP classifier as in Table (7).
This stage to determine which set of the best JMP classifier sets to be the JMP predictor which we
ng the diagnosis for any test data, by
calculating the Gini index averges for each set of the best JMP classifier of Table (7) represented
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
10
From Chart (2) we can find that the value of the Gini average of {K,E}is the closest value to the
diagnosis Gini and this indicates that {K,E}will be the JMP predictor attributes.
JMP Predictor = ሼࡷ, ࡱ} Eq. (6)
4.CONCLUSION
A new Statistical Data Mining (SDM) technique is initiated in this paper using Jaccard Mining
Procedure (JMP) contributing a novel classifier & predictor depending on Jaccard (J) distance
matrix and Gini Index Measure. Applying JMP on real life application of Rheumatic Fever data
diagnosis to see the accuracy of its applicability and the result was very accurate for the diagnosis
of the data specialist. JMP opens the way for other new SDM techniques using an alternative
distance measures and other accuracy measures according to the data type.
REFERENCES
[1] A.M. Kozae, M.M.E. Abd El-Monsef, and S. Abd El-Badie, New Approaches for Data Reduction in
Generalized Multi-valued Decision Information System (GMDIS): Case Study of Rheumatic Fever
Patients, The 2nd International Conference on MATHEMATICS: TRENDS AND
DEVELOPMENTS, Egyptian Mathematical Society (ETMS), December 27–30, 232-BM, 2007,
Cairo – EGYPT.
[2] Chun LT, Reddy DV, Yamamoto LG. Rheumatic fever in children and adolescents in Hawaii.
Pediatrics. 1987; 79: 549–552.
[3] Han J. and Kamber M. Data Mining: Concepts and Technique, 2nd ed., the Morgan Kaufmann Series
in Data Management Systems, Jim Gray, Series Editor Morgan Kaufmann Publishers, 2006. ISBN 1-
55860-901-6.
[4] M.E. Abd El-Monsef, E. A. Rady, A. M. Kozea, W. A. Hassanein and S. Abd El-Badie What is the
Major Power Linking Statistics & Data Mining?, International Journal of Data Mining & Knowledge
Management Process , 2013, Volume 3, Number 6, ISSN 2230-9608, AIRCC Co.
[5] Oracle, Database Online Documentation Library Master Index, Copyright © 2014, Oracle and/or its
affiliates. All rights reserved. http://docs.oracle.com/cd/E11882_01/nav/mindx.html
[6] Paul E. Black, "algorithm", in Dictionary of Algorithms and Data Structures [online], Vreda Pieterse
and Paul E. Black, eds. 10 January 2007. (accessed TODAY) Available from:
http://www.nist.gov/dads/HTML/algorithm.html
[7] Thair Nu Phyu, Survey of Classification Techniques in Data Mining, International Multi Conference
of Engineers and Computer Scientists (IMECS), 2009, Hong Kong, Vol I.
International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014
11
Authors
Mohamed Ezzat Abd El- Monsef, Professor of Pure Mathematics, Mathematics
Department, Faculty of Science, Tanta university, Egypt. He was the former dean of the
faculty. He is a member in many mathematical societies. He got the membership of the
National Committee for Mathematics. He is a supervisor on many MS. C. and Ph. D.
Thesis in many universities in and out of Egypt. He is a member in, ERS Group,
http://www.cba.edu.kw/abo/rough-sets-working-group.html and the Egyptian
Mathematical Society http://etms-eg.org Homepage: http://at.yorku.ca/h/a/a/a/48.html
El-Houssainy Abd El-Bar Rady, Professor of Applied Statistics Applied Statistics
Department, I.S.S.R, Cairo, Egypt. His Ph.D. in Statistics, from Oregon State University
(OSU), USA. He was the Director of the Statistical and Econometrical Consultation
Center. He got many awards, Sarhan Award of the Scientific research Academy, Egypt,
Lee Award, OSU, USA, Member of the Honor Society of Phi Kappa Phi, USA and Thabet
El-Sherief Award, ISSR, Cairo University. Homepage: http://at.yorku.ca/h/a/a/a/48.html
Abd El-Monem Mohamed Kozae, Professor of Pure Mathematics, Mathematics
Department, Faculty of Science, Tanta university, Egypt He is a supervisor on many MS.
in C. and Ph. D. Thesis in and many universities out of Egypt. He is a member in many
mathematical societies, ERS Group, http://www.cba.edu.kw/abo/rough-sets-working-
group.html, IRSS http://roughsets.home.pl/www/ and Egyptian Mathematical
Societyhttp://etmseg.org Homepage:http://telc.tanta.edu.eg/hosting/pro6/pro6_index.html
Wafaa Anwar Abd El-Latif, Lecturer of Mathematical Statistics, Mathematics
Department, Faculty of Science, Tanta University, Egypt. She got her BS. C., 2000, by
excellence with honors degree, Her Ms. C. degree, 2004. And she got her Ph.D. degree,
2007, Titled “Uncertainty in Statistics”, 2007, Faculty of Science, Tanta University,
Egypt. She is a member in, ERS Group, http://www.cba.edu.kw/abo/rough-sets-
working-group.html and Egyptian Mathematical Society http://etms-eg.org Homepage:
http://tdb.tanta.edu.eg/staff_data/Staff%20Detailed%20Data-ar.aspx?MemberID=847
Soaad Abd El-Badie Attia El-Afefy, Ph. D. Student, Mathematics Department, Tanta
University, Egypt. She got her Ms. C. Titled ”A New Data Reduction Approach”, 2006,
Faculty of Science, Tanta University, Egypt. She got the Best Student Presentation and
Best Student Paper Awards. She worked as teaching assistant at GUC, She is a member
in, ERS Group http://www.cba.edu.kw/abo/rough-sets-working-group.html,IRSS
http://roughsets.home.pl/www/, and Egyptian Mathematical Society http://etms-eg.org
Homepage: http://www.savvymore.mysite.com

Mais conteúdo relacionado

Mais procurados

Parametric Sensitivity Analysis of a Mathematical Model of Two Interacting Po...
Parametric Sensitivity Analysis of a Mathematical Model of Two Interacting Po...Parametric Sensitivity Analysis of a Mathematical Model of Two Interacting Po...
Parametric Sensitivity Analysis of a Mathematical Model of Two Interacting Po...IOSR Journals
 
Integrating Fuzzy Dematel and SMAA-2 for Maintenance Expenses
Integrating Fuzzy Dematel and SMAA-2 for Maintenance ExpensesIntegrating Fuzzy Dematel and SMAA-2 for Maintenance Expenses
Integrating Fuzzy Dematel and SMAA-2 for Maintenance Expensesinventionjournals
 
A Comparison of Accuracy Measures for Remote Sensing Image Classification: Ca...
A Comparison of Accuracy Measures for Remote Sensing Image Classification: Ca...A Comparison of Accuracy Measures for Remote Sensing Image Classification: Ca...
A Comparison of Accuracy Measures for Remote Sensing Image Classification: Ca...CSCJournals
 
Discriminant Analysis in Sports
Discriminant Analysis in SportsDiscriminant Analysis in Sports
Discriminant Analysis in SportsJ P Verma
 
Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...
Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...
Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...Seval Çapraz
 
K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...
K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...
K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...ijscmcj
 
hb2s5_BSc scriptie Steyn Heskes
hb2s5_BSc scriptie Steyn Heskeshb2s5_BSc scriptie Steyn Heskes
hb2s5_BSc scriptie Steyn HeskesSteyn Heskes
 
Linear Discriminant Analysis (LDA)
Linear Discriminant Analysis (LDA)Linear Discriminant Analysis (LDA)
Linear Discriminant Analysis (LDA)Anmol Dwivedi
 
Parametric sensitivity analysis of a mathematical model of facultative mutualism
Parametric sensitivity analysis of a mathematical model of facultative mutualismParametric sensitivity analysis of a mathematical model of facultative mutualism
Parametric sensitivity analysis of a mathematical model of facultative mutualismIOSR Journals
 
Group Testing with Test Errors Made Easier
Group Testing with Test Errors Made EasierGroup Testing with Test Errors Made Easier
Group Testing with Test Errors Made EasierWaqas Tariq
 
Assessing Discriminatory Performance of a Binary Logistic Regression Model
Assessing Discriminatory Performance of a Binary Logistic Regression ModelAssessing Discriminatory Performance of a Binary Logistic Regression Model
Assessing Discriminatory Performance of a Binary Logistic Regression Modelsajjalp
 
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTIONINFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTIONIJDKP
 
Sparse representation based classification of mr images of brain for alzheime...
Sparse representation based classification of mr images of brain for alzheime...Sparse representation based classification of mr images of brain for alzheime...
Sparse representation based classification of mr images of brain for alzheime...eSAT Publishing House
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Uncertainity
UncertainityUncertainity
UncertainityVIGNESH C
 

Mais procurados (18)

Parametric Sensitivity Analysis of a Mathematical Model of Two Interacting Po...
Parametric Sensitivity Analysis of a Mathematical Model of Two Interacting Po...Parametric Sensitivity Analysis of a Mathematical Model of Two Interacting Po...
Parametric Sensitivity Analysis of a Mathematical Model of Two Interacting Po...
 
MISHRA DISTRIBUTION
MISHRA DISTRIBUTIONMISHRA DISTRIBUTION
MISHRA DISTRIBUTION
 
Sample A_Book Typesetting
Sample A_Book TypesettingSample A_Book Typesetting
Sample A_Book Typesetting
 
Integrating Fuzzy Dematel and SMAA-2 for Maintenance Expenses
Integrating Fuzzy Dematel and SMAA-2 for Maintenance ExpensesIntegrating Fuzzy Dematel and SMAA-2 for Maintenance Expenses
Integrating Fuzzy Dematel and SMAA-2 for Maintenance Expenses
 
A Comparison of Accuracy Measures for Remote Sensing Image Classification: Ca...
A Comparison of Accuracy Measures for Remote Sensing Image Classification: Ca...A Comparison of Accuracy Measures for Remote Sensing Image Classification: Ca...
A Comparison of Accuracy Measures for Remote Sensing Image Classification: Ca...
 
Discriminant Analysis in Sports
Discriminant Analysis in SportsDiscriminant Analysis in Sports
Discriminant Analysis in Sports
 
Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...
Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...
Statistical Data Analysis on a Data Set (Diabetes 130-US hospitals for years ...
 
K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...
K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...
K-Medoids Clustering Using Partitioning Around Medoids for Performing Face Re...
 
hb2s5_BSc scriptie Steyn Heskes
hb2s5_BSc scriptie Steyn Heskeshb2s5_BSc scriptie Steyn Heskes
hb2s5_BSc scriptie Steyn Heskes
 
Linear Discriminant Analysis (LDA)
Linear Discriminant Analysis (LDA)Linear Discriminant Analysis (LDA)
Linear Discriminant Analysis (LDA)
 
Parametric sensitivity analysis of a mathematical model of facultative mutualism
Parametric sensitivity analysis of a mathematical model of facultative mutualismParametric sensitivity analysis of a mathematical model of facultative mutualism
Parametric sensitivity analysis of a mathematical model of facultative mutualism
 
Group Testing with Test Errors Made Easier
Group Testing with Test Errors Made EasierGroup Testing with Test Errors Made Easier
Group Testing with Test Errors Made Easier
 
Assessing Discriminatory Performance of a Binary Logistic Regression Model
Assessing Discriminatory Performance of a Binary Logistic Regression ModelAssessing Discriminatory Performance of a Binary Logistic Regression Model
Assessing Discriminatory Performance of a Binary Logistic Regression Model
 
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTIONINFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
 
Sparse representation based classification of mr images of brain for alzheime...
Sparse representation based classification of mr images of brain for alzheime...Sparse representation based classification of mr images of brain for alzheime...
Sparse representation based classification of mr images of brain for alzheime...
 
Bf32370372
Bf32370372Bf32370372
Bf32370372
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
Uncertainity
UncertainityUncertainity
Uncertainity
 

Semelhante a A new sdm classifier using jaccard mining procedure case study rheumatic fever data

MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMS
MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMSMULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMS
MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMSijcsit
 
Fuzzy Regression Model for Knee Osteoarthritis Disease Diagnosis
Fuzzy Regression Model for Knee Osteoarthritis Disease DiagnosisFuzzy Regression Model for Knee Osteoarthritis Disease Diagnosis
Fuzzy Regression Model for Knee Osteoarthritis Disease DiagnosisIRJET Journal
 
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...IJMIT JOURNAL
 
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...IJMIT JOURNAL
 
Statistical techniques used in measurement
Statistical techniques used in measurementStatistical techniques used in measurement
Statistical techniques used in measurementShivamKhajuria3
 
Face recognition vendor test 2002 evaluation report
Face recognition vendor test 2002  evaluation reportFace recognition vendor test 2002  evaluation report
Face recognition vendor test 2002 evaluation reportSungkwan Park
 
Android Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction SystemAndroid Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction Systemijtsrd
 
A Thresholding Method to Estimate Quantities of Each Class
A Thresholding Method to Estimate Quantities of Each ClassA Thresholding Method to Estimate Quantities of Each Class
A Thresholding Method to Estimate Quantities of Each ClassWaqas Tariq
 
Multimodal Biometrics Recognition by Dimensionality Diminution Method
Multimodal Biometrics Recognition by Dimensionality Diminution MethodMultimodal Biometrics Recognition by Dimensionality Diminution Method
Multimodal Biometrics Recognition by Dimensionality Diminution MethodIJERA Editor
 
Preprocessing and Classification in WEKA Using Different Classifiers
Preprocessing and Classification in WEKA Using Different ClassifiersPreprocessing and Classification in WEKA Using Different Classifiers
Preprocessing and Classification in WEKA Using Different ClassifiersIJERA Editor
 
Accuracy, Sensitivity and Specificity Measurement of Various Classification T...
Accuracy, Sensitivity and Specificity Measurement of Various Classification T...Accuracy, Sensitivity and Specificity Measurement of Various Classification T...
Accuracy, Sensitivity and Specificity Measurement of Various Classification T...IOSR Journals
 
Smooth Support Vector Machine for Suicide-Related Behaviours Prediction
Smooth Support Vector Machine for Suicide-Related Behaviours Prediction Smooth Support Vector Machine for Suicide-Related Behaviours Prediction
Smooth Support Vector Machine for Suicide-Related Behaviours Prediction IJECEIAES
 
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATARESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATAorajjournal
 
Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix IJECEIAES
 
DETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATA
DETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATADETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATA
DETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATAIJCSEA Journal
 
Classification of Heart Diseases Patients using Data Mining Techniques
Classification of Heart Diseases Patients using Data Mining TechniquesClassification of Heart Diseases Patients using Data Mining Techniques
Classification of Heart Diseases Patients using Data Mining TechniquesLovely Professional University
 
IRJET- Differential Transform Method for the Vaccination Model of the Cholera...
IRJET- Differential Transform Method for the Vaccination Model of the Cholera...IRJET- Differential Transform Method for the Vaccination Model of the Cholera...
IRJET- Differential Transform Method for the Vaccination Model of the Cholera...IRJET Journal
 
IRJET- Dengue Possibility Forecasting Model using Machine Learning Algorithms
IRJET- Dengue Possibility Forecasting Model using Machine Learning AlgorithmsIRJET- Dengue Possibility Forecasting Model using Machine Learning Algorithms
IRJET- Dengue Possibility Forecasting Model using Machine Learning AlgorithmsIRJET Journal
 
Diagnosis of rheumatoid arthritis using an ensemble learning approach
Diagnosis of rheumatoid arthritis using an ensemble learning approachDiagnosis of rheumatoid arthritis using an ensemble learning approach
Diagnosis of rheumatoid arthritis using an ensemble learning approachcsandit
 
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH cscpconf
 

Semelhante a A new sdm classifier using jaccard mining procedure case study rheumatic fever data (20)

MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMS
MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMSMULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMS
MULTI-PARAMETER BASED PERFORMANCE EVALUATION OF CLASSIFICATION ALGORITHMS
 
Fuzzy Regression Model for Knee Osteoarthritis Disease Diagnosis
Fuzzy Regression Model for Knee Osteoarthritis Disease DiagnosisFuzzy Regression Model for Knee Osteoarthritis Disease Diagnosis
Fuzzy Regression Model for Knee Osteoarthritis Disease Diagnosis
 
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
 
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
PREDICTING CLASS-IMBALANCED BUSINESS RISK USING RESAMPLING, REGULARIZATION, A...
 
Statistical techniques used in measurement
Statistical techniques used in measurementStatistical techniques used in measurement
Statistical techniques used in measurement
 
Face recognition vendor test 2002 evaluation report
Face recognition vendor test 2002  evaluation reportFace recognition vendor test 2002  evaluation report
Face recognition vendor test 2002 evaluation report
 
Android Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction SystemAndroid Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction System
 
A Thresholding Method to Estimate Quantities of Each Class
A Thresholding Method to Estimate Quantities of Each ClassA Thresholding Method to Estimate Quantities of Each Class
A Thresholding Method to Estimate Quantities of Each Class
 
Multimodal Biometrics Recognition by Dimensionality Diminution Method
Multimodal Biometrics Recognition by Dimensionality Diminution MethodMultimodal Biometrics Recognition by Dimensionality Diminution Method
Multimodal Biometrics Recognition by Dimensionality Diminution Method
 
Preprocessing and Classification in WEKA Using Different Classifiers
Preprocessing and Classification in WEKA Using Different ClassifiersPreprocessing and Classification in WEKA Using Different Classifiers
Preprocessing and Classification in WEKA Using Different Classifiers
 
Accuracy, Sensitivity and Specificity Measurement of Various Classification T...
Accuracy, Sensitivity and Specificity Measurement of Various Classification T...Accuracy, Sensitivity and Specificity Measurement of Various Classification T...
Accuracy, Sensitivity and Specificity Measurement of Various Classification T...
 
Smooth Support Vector Machine for Suicide-Related Behaviours Prediction
Smooth Support Vector Machine for Suicide-Related Behaviours Prediction Smooth Support Vector Machine for Suicide-Related Behaviours Prediction
Smooth Support Vector Machine for Suicide-Related Behaviours Prediction
 
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATARESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
 
Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix
 
DETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATA
DETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATADETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATA
DETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATA
 
Classification of Heart Diseases Patients using Data Mining Techniques
Classification of Heart Diseases Patients using Data Mining TechniquesClassification of Heart Diseases Patients using Data Mining Techniques
Classification of Heart Diseases Patients using Data Mining Techniques
 
IRJET- Differential Transform Method for the Vaccination Model of the Cholera...
IRJET- Differential Transform Method for the Vaccination Model of the Cholera...IRJET- Differential Transform Method for the Vaccination Model of the Cholera...
IRJET- Differential Transform Method for the Vaccination Model of the Cholera...
 
IRJET- Dengue Possibility Forecasting Model using Machine Learning Algorithms
IRJET- Dengue Possibility Forecasting Model using Machine Learning AlgorithmsIRJET- Dengue Possibility Forecasting Model using Machine Learning Algorithms
IRJET- Dengue Possibility Forecasting Model using Machine Learning Algorithms
 
Diagnosis of rheumatoid arthritis using an ensemble learning approach
Diagnosis of rheumatoid arthritis using an ensemble learning approachDiagnosis of rheumatoid arthritis using an ensemble learning approach
Diagnosis of rheumatoid arthritis using an ensemble learning approach
 
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
 

Último

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 

Último (20)

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 

A new sdm classifier using jaccard mining procedure case study rheumatic fever data

  • 1. International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 DOI: 10.5121/ijbb.2014.4101 1 A NEW SDM CLASSIFIER USING JACCARD MINING PROCEDURE CASE STUDY: RHEUMATIC FEVER DATA 1 M.E. Abd El-Monsef, 2 E. A. Rady, 3 A. M. Kozea, 4 W. A. Hassanein, 5 S. Abd El-Badie 1,3,4,5 Mathematics Department, Faculty of Science, Tanta University, Tanta, Egypt 2 Institute of Statistical Studies & Research (ISSR), Cairo University, Cairo, Egypt ABSTRACT In this paper, a new Statistical Data Mining (SDM) technique is proposed using Jaccard Mining Procedure (JMP) contributing a novel classifier & predictor by applying very effective stages on the training data depending on Jaccard (J) distance matrix Linked with the Gini Index Measure as precision measure for initiating a new classifier and a new predictor, The proposed SDM technique using JMP is applied and examined on a Rheumatic Fever Data to demonstrate its applicability. KEYWORDS Jaccard, Classifier, Mining, Gini, Rheumatic Fever, Data 1. INTRODUCTION Classification [3, 4, 5, 7], is a data mining function that assigns items in a collection to target categories or classes. The goal of classification is to accurately predict the target class for each case in the data. For example, a classification model could be used to identify loan applicants as low, medium, or high credit risks. A classification task begins with a data set in which the class assignments are known. For example, a classification model that predicts credit risk could be developed based on observed data for many loan applicants over a period of time. In addition to the historical credit rating, the data might track employment history, home ownership or rental, years of residence, number and type of investments, and so on. Credit rating would be the target, the other attributes would be the predictors, and the data for each customer would constitute a case. The simplest type of classification problem is binary classification. In binary classification, the target attribute has only two possible values: for example, high credit rating or low credit rating. Multiclass targets have more than two values: for example, low, medium, high, or unknown credit rating. Different classification algorithms [7] use different techniques for finding relationships. These relationships are summarized in a model, which can then be applied to a different data set in which the class assignments are unknown. Classification has many applications in customer segmentation, business modeling, marketing, credit analysis, and biomedical and drug response modeling.
  • 2. International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 2 There is a difference between the definition of the algorithm and the flowchart [6], an algorithm is just a detailed sequence of simple steps that are needed to solve a problem, from the other side; a flowchart is a graphical representation of an algorithm. So we will introduce the flowchart and the algorithm o our paper. The sequence of this paper is organized as follows; Section 2 represents the materials & methods which are subtitled into, Rheumatic Fever Data Characteristics, Jaccard distance (J) & Gini Index. Section 3 presents the JMP algorithm, JMP flow chart and JMP systematic structure stages using Jaccard distance, Jaccard classes, Jaccard classifiers linking the previous stages with the Gini Index as precision measures for initiating a new classifier and concluding a new predictor. Finally, the conclusion and the future work are introduced in Section 4. 2. JMP MATERIALS & METHODS 2.1 Rheumatic Fever Data: Characteristic No doubt that the Rheumatic Fever is a very common disease [2] and it has many symptoms differs from patient to another though the diagnosis is the same. So, we obtained the following example on seven rheumatic fever patients from Tanta University Hospital, Egypt. All patients are between 9-12 years old with history of Arthritis began from age 3-5 years. This disease has many symptoms and it is usually started in young age and still with the patient along his life. Table (1) introduced seven patients characterized by 8 symptoms (attributes) [1] using them to decide the diagnosis for each patient (decision attribute). Table (2) introduces the training coded data. Where, {S, F, A, R, K, E, P, H} are the conditional attributes, {P1, P2, P3,…, P7} in Table (2) are the Rheumatic Fever training data objects and the diagnosis attribute (D) as the decision attribute. Table (1): Rheumatic Fever Data Description At. Name ِ◌At. Symbol At. Code At. Description Sex S 1 Male 2 Female Pharyngitis F 1 No 2 Yes Arthritis A 1 No Arthritis 2 Arthritis Carditis R 1 Not Affected 2 Affected Chorea K 1 No 2 Yes ESR E 1 Normal 2 High Abdominal Pain P 1 No 2 Yes Headache H 1 No 2 Yes Diagnosis D 1 Rheumatic Arthritis 2 Rheumatic Carditis 3 Rheumatic Arthritis &Carditis
  • 3. International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 3 S F A R K E P H D P1 2 2 2 2 2 1 1 1 3 P2 1 2 2 2 2 2 1 2 3 P3 2 2 2 2 1 1 1 1 3 P4 1 2 2 1 1 1 1 1 1 P5 1 1 1 2 1 1 2 1 2 P6 2 2 2 2 1 2 1 1 3 P7 1 2 2 2 1 1 1 2 3 Table (2): Coded Rheumatic Fever Data 2.2 Jaccard Distance (JDist.) It is very important to note that, Jaccard coefficient is a measure for similarity between two variables and Jaccard distance [3] is a measure of dissimilarity and both of them are measurement of asymmetric information on binary and non-binary variables. The definition of the Jaccard similarity coefficient and the Jaccard distance are as follows; Def.1: Jaccard similarity between binary variables A and B ۸‫ܕܑ܁‬ሺ‫,ۯ‬ ۰ሻ = ‫۾‬ሺ‫۰∩ۯ‬ሻ ‫۾‬ሺ‫۰∪ۯ‬ሻ Eq. (1) Def.2: Jaccard Distance between binary variables A and B ۸۲ܑ‫.ܜܛ‬ሺ‫,ۯ‬ ۰ሻ = ૚ − ۸‫ܕܑ܁‬ሺ‫,ۯ‬ ۰ሻ Eq. (2) Example: (Jaccard Distance Calculations) S F P1 2 2 P2 1 2 P3 2 2 P4 1 2 P5 1 1 P6 2 2 P7 1 2 JSim (S, F) = 4/7 → JDist (S, F) = 3/7= 0.43 Similarly for the entire conditional attributes as in Table (3)
  • 4. International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 4 2.3 Gini Index The Gini index [3], measures the impurity of the target attribute, so we will use it as a precision method in our paper. The Gini index of a data partition or set of training tuples, as ۵ܑ‫ ܑܖ‬ሺ‫ۯ‬ሻ = ૚ − ∑ ‫۾‬‫ܒ‬ ૛‫ܖ‬ ‫ܒ‬ୀ૚ Eq. (3) Where pj is the relative frequency of class j in A. The Gini index considers a binary split for each attribute. Let’s first consider the case where B is a discrete-valued attribute having v distinct values, {b1, b2… bv}, occurring in A. When considering a binary split, we compute a weighted sum of the impurity of each resulting partition. For example, if a binary split on B partitions the attribute A into A1 and A2, the Gini index of the attribute A given that partitioning is, ۵ܑ‫ܑܖ‬۰ሺ‫ۯ‬ሻ = |‫ۯ‬૚| |‫|ۯ‬ ۵ܑ‫ܑܖ‬ሺ‫ۯ‬૚ሻ + |‫ۯ‬૛| |‫|ۯ‬ ۵ܑ‫ܑܖ‬ሺ‫ۯ‬૛ሻ Eq. (4) In General the Gini of the split formula is, ۵ܑ‫ ܑܖ‬ሺ‫ۯ‬ሻ = ૚ − ∑ [ࡼሺ࢐ ࡭ሻ]⁄ ૛ ࢐ Eq. (5) The most important characteristics of the Gini Index that, 1) It varies between 0 and 1. 2) If Gini = 0 → Complete Equality 3) If Gini = 1 → Complete Inequality 4) If 0 < Gini < 1 → the higher the Gini index, the greater the inequality. 5) Biased to multivalued attributes. 6) Has difficulty when number of classes is large. 7) Tends to favor tests that result in equal-sized partitions and purity in both partitions. Example: (Gini Index Calculations) Calculating the Gini of the diagnosis attribute (D) using Eq. (3), ( ) ( ) ( ) 45.0]7/57/17/1[1)( 222 =++−=DGINI & calculating the Gini of the attribute (S) using Eq. (4), ( ) ( ) 5.0]4/24/2[1)( 22 1 =+−=sGINI & ( ) ( ) 0]2/03/3[1)( 22 2 =+−=sGINI → )(SGINI = [(4/7) (0.5) + (3/7) (0)] = 0.286 Similarly for the entire conditional attributes as in Table (6).
  • 5. International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 5 3. JMP Systematic Structure Stages 3.1 JMP Algorithm In this section the JMP algorithm steps will be introduced in the following. Input Training Data = ),( DAU U Processing 1. {}←J 2. Do 3. JClassesJ ← 4. ClassesJrsClasssifieJ ← 5. {}←IndexGini 5. IndexGinirsClasssifieJ ← 6. IndexGiniAveragesIndexGini ← 7. AveragesIndexGiniClassifierJMPBest ← Output Best JMP Classifier & JMP Predictor Where, • U is the universe of all attributes, A are the conditional attributes & D is the decision attribute • J is the Jaccard Matrix 3.2 JMP Flow Chart In this section the flow chart of the JMP Systematic Structure Stages flowchart Figure (1).
  • 6. International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 6 Figure (1): JMP Flow Chart 3.3 Jaccard Matrix The 1st stage in our JMP, Calculating Jaccard matrix of the Rheumatic Fever Data conditional attributes of Table (2).The results are given below in Table (3) Approximated to 2 decimals. S F A R K E P H S 0.00 0.43 0.43 0.43 0.43 0.43 0.57 0.71 F 0.43 0.00 0.00 0.29 0.57 0.57 1.00 0.57 A 0.43 0.00 0.00 0.29 0.57 0.57 1.00 0.57 R 0.43 0.29 0.29 0.00 0.57 0.57 0.71 0.57 K 0.43 0.57 0.57 0.57 0.00 0.29 0.43 0.29 E 0.43 0.57 0.57 0.57 0.29 0.00 0.43 0.29 P 0.57 1.00 1.00 0.71 0.43 0.43 0.00 0.43 H 0.71 0.57 0.57 0.57 0.29 0.29 0.43 0.00 Table (3): Rheumatic Fever Conditional Attributes Jaccard Matrix 3.4 Jaccard Classes The next stage in our classifier to divide the values of the Jaccard matrix of Table (3) into three classes; (J = 0, 0 < J ≤ 0.5, 0.5 < J ≤ 1) and obtaining the conditional attributes classes introducing the output of this stage in Table (4) of the conditional attributes Jaccard classes.
  • 7. International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 7 J= 0 0 < J ≤ 0.5 0.5 < J ≤ 1 S {ܵ} {‫,ܨ‬ ‫,ܣ‬ ܴ, ‫,ܭ‬ ‫}ܧ‬ {ܲ, ‫}ܪ‬ F {‫,ܨ‬ ‫}ܣ‬ {ܵ, ܴ} {‫,ܭ‬ ‫,ܧ‬ ܲ, ‫}ܪ‬ A {‫,ܨ‬ ‫}ܣ‬ {ܵ, ܴ} {‫,ܭ‬ ‫,ܧ‬ ܲ, ‫}ܪ‬ R {ܴ} {ܵ, ‫,ܨ‬ ‫}ܣ‬ {‫,ܭ‬ ‫,ܧ‬ ܲ, ‫}ܪ‬ K {‫}ܭ‬ {ܵ, ‫,ܧ‬ ܲ, ‫}ܪ‬ {‫,ܨ‬ ‫,ܣ‬ ܴ} E {‫}ܧ‬ {ܵ, ‫,ܭ‬ ܲ, ‫}ܪ‬ {‫,ܨ‬ ‫,ܣ‬ ܴ} P {ܲ} {‫,ܭ‬ ‫,ܧ‬ ‫}ܪ‬ {ܵ, ‫,ܨ‬ ‫,ܣ‬ ܴ} H {‫}ܪ‬ {‫,ܭ‬ ‫,ܧ‬ ܲ} {ܵ, ‫,ܨ‬ ‫,ܣ‬ ܴ} Table (4): Conditional Attributes Jaccard Classes3.5 JMP Classifiers 3.5 JMP Classifiers Comparing the conditional attributes row classes of Table (4) to get the Jaccard classifiers in Table (5). J= 0 0 < J ≤ 0.5 0.5 < J ≤ 1 {ܵ} {ܵ} {ܵ} {‫,ܨ‬ ‫}ܣ‬ {‫,ܨ‬ ‫}ܣ‬ {‫,ܨ‬ ‫,ܣ‬ ܴ} {ܴ} {ܴ} {‫,ܭ‬ ‫}ܧ‬ {‫}ܭ‬ {‫}ܭ‬ {ܲ, ‫}ܪ‬ {‫}ܧ‬ {‫}ܧ‬ {ܲ} {ܲ} {‫}ܪ‬ {‫}ܪ‬ Table (5): Conditional Attributes Jaccard Classifier One can note that the mining Jaccard classifier is, 0.5 < J ≤ 1 classifier which has the least number of sets and the division of the attribute sets are very appropriate with the medical sight. Deciding which the best classifier is depends on the following calculations. 3.6 Rheumatic Fever Data Gini Index Applying the Gini index rules in Section (2.3) on the data in Table (2), the values of the Gini Index of each attribute is calculated and the results are summarized and approximated to two decimal spaces in Table (6) and Chart (1) using the frequencies of all attributes which are represented in Figure (2) respectively.
  • 8. International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 8 Table (6): Rheumatic Fever Gini Index Chart (1): Rheumatic Fever Attributes Gini Index Figure (2): Rheumatic Fever Data Frequencies 0 0.1 0.2 0.3 0.4 0.5 S F A R K E P H D GiniIndex Attributes Gini IndexAt. 0.29S 0.24F 0.21A 0.24R 0.34K 0.34E 0.24P 0.19H 0.45D
  • 9. International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 3.7 Best JMP Classifier Deciding the best Jaccard classifier of Table (5) depends on the conditional attributes Gini Index of Table (6). So, the JMP classifier will be 0.5 < J 3.8 JMP Predictor This stage to determine which set of the best JMP classifier sets to be the JMP predictor which we can use it with any prediction technique for predicti calculating the Gini index averges for each set of the best JMP classifier of Table (7) in Table (8) and Chart (2). Table (8): Best JMP Classifier Gini Averages Chart (2): Best JMP Classifier Gini Averages 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 {S} GiniIndexAverges International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 Deciding the best Jaccard classifier of Table (5) depends on the conditional attributes Gini Index of Table (6). So, the JMP classifier will be 0.5 < J ≤ 1 the best JMP classifier as in Table (7). 0<J ≤ 1 {ܵ} {‫,ܨ‬ ‫,ܣ‬ ܴሽ ሼ‫,ܭ‬ ‫ܧ‬ሽ ሼܲ, ‫ܪ‬ሽ Table (7): Best JMP Classifier This stage to determine which set of the best JMP classifier sets to be the JMP predictor which we can use it with any prediction technique for predicting the diagnosis for any test data, by calculating the Gini index averges for each set of the best JMP classifier of Table (7) Gini Average0 ≤ J ≤ 1 0.29ሼܵሽ 0.23ሼ‫,ܨ‬ ‫,ܣ‬ ܴሽ 0.34ሼ‫,ܭ‬ ‫ܧ‬ሽ 0.22ሼܲ, ‫ܪ‬ሽ 0.45D Table (8): Best JMP Classifier Gini Averages Best JMP Classifier Chart (2): Best JMP Classifier Gini Averages {F,A,R} {K,E} {P,H} D International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 9 Deciding the best Jaccard classifier of Table (5) depends on the conditional attributes Gini Index 1 the best JMP classifier as in Table (7). This stage to determine which set of the best JMP classifier sets to be the JMP predictor which we ng the diagnosis for any test data, by calculating the Gini index averges for each set of the best JMP classifier of Table (7) represented
  • 10. International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 10 From Chart (2) we can find that the value of the Gini average of {K,E}is the closest value to the diagnosis Gini and this indicates that {K,E}will be the JMP predictor attributes. JMP Predictor = ሼࡷ, ࡱ} Eq. (6) 4.CONCLUSION A new Statistical Data Mining (SDM) technique is initiated in this paper using Jaccard Mining Procedure (JMP) contributing a novel classifier & predictor depending on Jaccard (J) distance matrix and Gini Index Measure. Applying JMP on real life application of Rheumatic Fever data diagnosis to see the accuracy of its applicability and the result was very accurate for the diagnosis of the data specialist. JMP opens the way for other new SDM techniques using an alternative distance measures and other accuracy measures according to the data type. REFERENCES [1] A.M. Kozae, M.M.E. Abd El-Monsef, and S. Abd El-Badie, New Approaches for Data Reduction in Generalized Multi-valued Decision Information System (GMDIS): Case Study of Rheumatic Fever Patients, The 2nd International Conference on MATHEMATICS: TRENDS AND DEVELOPMENTS, Egyptian Mathematical Society (ETMS), December 27–30, 232-BM, 2007, Cairo – EGYPT. [2] Chun LT, Reddy DV, Yamamoto LG. Rheumatic fever in children and adolescents in Hawaii. Pediatrics. 1987; 79: 549–552. [3] Han J. and Kamber M. Data Mining: Concepts and Technique, 2nd ed., the Morgan Kaufmann Series in Data Management Systems, Jim Gray, Series Editor Morgan Kaufmann Publishers, 2006. ISBN 1- 55860-901-6. [4] M.E. Abd El-Monsef, E. A. Rady, A. M. Kozea, W. A. Hassanein and S. Abd El-Badie What is the Major Power Linking Statistics & Data Mining?, International Journal of Data Mining & Knowledge Management Process , 2013, Volume 3, Number 6, ISSN 2230-9608, AIRCC Co. [5] Oracle, Database Online Documentation Library Master Index, Copyright © 2014, Oracle and/or its affiliates. All rights reserved. http://docs.oracle.com/cd/E11882_01/nav/mindx.html [6] Paul E. Black, "algorithm", in Dictionary of Algorithms and Data Structures [online], Vreda Pieterse and Paul E. Black, eds. 10 January 2007. (accessed TODAY) Available from: http://www.nist.gov/dads/HTML/algorithm.html [7] Thair Nu Phyu, Survey of Classification Techniques in Data Mining, International Multi Conference of Engineers and Computer Scientists (IMECS), 2009, Hong Kong, Vol I.
  • 11. International Journal on Bioinformatics & Biosciences (IJBB) Vol.4, No.1, March 2014 11 Authors Mohamed Ezzat Abd El- Monsef, Professor of Pure Mathematics, Mathematics Department, Faculty of Science, Tanta university, Egypt. He was the former dean of the faculty. He is a member in many mathematical societies. He got the membership of the National Committee for Mathematics. He is a supervisor on many MS. C. and Ph. D. Thesis in many universities in and out of Egypt. He is a member in, ERS Group, http://www.cba.edu.kw/abo/rough-sets-working-group.html and the Egyptian Mathematical Society http://etms-eg.org Homepage: http://at.yorku.ca/h/a/a/a/48.html El-Houssainy Abd El-Bar Rady, Professor of Applied Statistics Applied Statistics Department, I.S.S.R, Cairo, Egypt. His Ph.D. in Statistics, from Oregon State University (OSU), USA. He was the Director of the Statistical and Econometrical Consultation Center. He got many awards, Sarhan Award of the Scientific research Academy, Egypt, Lee Award, OSU, USA, Member of the Honor Society of Phi Kappa Phi, USA and Thabet El-Sherief Award, ISSR, Cairo University. Homepage: http://at.yorku.ca/h/a/a/a/48.html Abd El-Monem Mohamed Kozae, Professor of Pure Mathematics, Mathematics Department, Faculty of Science, Tanta university, Egypt He is a supervisor on many MS. in C. and Ph. D. Thesis in and many universities out of Egypt. He is a member in many mathematical societies, ERS Group, http://www.cba.edu.kw/abo/rough-sets-working- group.html, IRSS http://roughsets.home.pl/www/ and Egyptian Mathematical Societyhttp://etmseg.org Homepage:http://telc.tanta.edu.eg/hosting/pro6/pro6_index.html Wafaa Anwar Abd El-Latif, Lecturer of Mathematical Statistics, Mathematics Department, Faculty of Science, Tanta University, Egypt. She got her BS. C., 2000, by excellence with honors degree, Her Ms. C. degree, 2004. And she got her Ph.D. degree, 2007, Titled “Uncertainty in Statistics”, 2007, Faculty of Science, Tanta University, Egypt. She is a member in, ERS Group, http://www.cba.edu.kw/abo/rough-sets- working-group.html and Egyptian Mathematical Society http://etms-eg.org Homepage: http://tdb.tanta.edu.eg/staff_data/Staff%20Detailed%20Data-ar.aspx?MemberID=847 Soaad Abd El-Badie Attia El-Afefy, Ph. D. Student, Mathematics Department, Tanta University, Egypt. She got her Ms. C. Titled ”A New Data Reduction Approach”, 2006, Faculty of Science, Tanta University, Egypt. She got the Best Student Presentation and Best Student Paper Awards. She worked as teaching assistant at GUC, She is a member in, ERS Group http://www.cba.edu.kw/abo/rough-sets-working-group.html,IRSS http://roughsets.home.pl/www/, and Egyptian Mathematical Society http://etms-eg.org Homepage: http://www.savvymore.mysite.com