TY  - JOUR
ID  - 84900
TI  - Filter-Based Feature Selection Using Information Theory and Binary Cuckoo Optimisation Algorithm
JO  - Journal of Information Technology Management
JA  - JITM
LA  - en
SN  - 
AU  - Usman, Ali Muhammad
AU  - Yusof, Umi Kalsom
AU  - Sabudin, Maziani
AD  - chool of Computer Sciences, University Sains Malaysia 11800 Pulau Pinang, Malaysia; Department of Computer Sciences, Federal College of Education (Technical) Gombe, Nigeria
AD  - Assistant Professor, School of Computer Sciences, University Sains Malaysia 11800 Pulau Pinang, Malaysia.
AD  - School of Computer Sciences, University Sains Malaysia 11800 Pulau Pinang, Malaysia.
Y1  - 2022
PY  - 2022
VL  - 14
IS  - Special Issue: 5th International Conference of Reliable Information and Communication Technology (IRICT 2020)
SP  - 203
EP  - 222
KW  - Feature Selection
KW  - Filter-Based
KW  - Binary Cuckoo Optimization
KW  - information theory
DO  - 10.22059/jitm.2022.84900
N2  - Dimensionality reduction is among the data mining process that is used to reduce the noise and complexity of features in various datasets. Feature selection (FS) is one of the most commonly used dimensionalities that reduces the unwanted features from the datasets. FS can be either wrapper or filter. Wrappers select subsets of the feature with better classification performance but are computationally expensive. On the other hand, filters are computationally fast but lack feature interaction among selected subsets of features which in turn affect the classification performance of the chosen subsets of features. This study proposes two concepts of information theory mutual information (MI). As well as entropy (E). Both were used together with binary cuckoo optimization algorithm BCOA (BCOA-MI and BCOA-EI). The target is to improve classification performance (reduce the error rate and computational complexity) on eight datasets with varying degrees of complexity. A support vector machine classifier was used to measure and computes the error rates of each of the datasets for both BCOA-MI and BCOA-E. The analysis of the results showed that BCOA-E selects a fewer number of features and performed better in terms of error rate. In contrast, BCOA-MI is computationally faster but chooses a larger number of features. Comparison with other methods found in the literature shows that the proposed BCOA-MI and BCOA-E performed better in terms of accuracy, the number of selected features, and execution time in most of the datasets.
UR  - https://jitm.ut.ac.ir/article_84900.html
L1  - https://jitm.ut.ac.ir/article_84900_0a6603fdfdf3514291700cf7edb1497b.pdf
ER  -