TY - JOUR ID - 84900 TI - Filter-Based Feature Selection Using Information Theory and Binary Cuckoo Optimisation Algorithm JO - Journal of Information Technology Management JA - JITM LA - en SN - AU - Usman, Ali Muhammad AU - Yusof, Umi Kalsom AU - Sabudin, Maziani AD - chool of Computer Sciences, University Sains Malaysia 11800 Pulau Pinang, Malaysia; Department of Computer Sciences, Federal College of Education (Technical) Gombe, Nigeria AD - Assistant Professor, School of Computer Sciences, University Sains Malaysia 11800 Pulau Pinang, Malaysia. AD - School of Computer Sciences, University Sains Malaysia 11800 Pulau Pinang, Malaysia. Y1 - 2022 PY - 2022 VL - 14 IS - Special Issue: 5th International Conference of Reliable Information and Communication Technology (IRICT 2020) SP - 203 EP - 222 KW - Feature Selection KW - Filter-Based KW - Binary Cuckoo Optimization KW - information theory DO - 10.22059/jitm.2022.84900 N2 - Dimensionality reduction is among the data mining process that is used to reduce the noise and complexity of features in various datasets. Feature selection (FS) is one of the most commonly used dimensionalities that reduces the unwanted features from the datasets. FS can be either wrapper or filter. Wrappers select subsets of the feature with better classification performance but are computationally expensive. On the other hand, filters are computationally fast but lack feature interaction among selected subsets of features which in turn affect the classification performance of the chosen subsets of features. This study proposes two concepts of information theory mutual information (MI). As well as entropy (E). Both were used together with binary cuckoo optimization algorithm BCOA (BCOA-MI and BCOA-EI). The target is to improve classification performance (reduce the error rate and computational complexity) on eight datasets with varying degrees of complexity. A support vector machine classifier was used to measure and computes the error rates of each of the datasets for both BCOA-MI and BCOA-E. The analysis of the results showed that BCOA-E selects a fewer number of features and performed better in terms of error rate. In contrast, BCOA-MI is computationally faster but chooses a larger number of features. Comparison with other methods found in the literature shows that the proposed BCOA-MI and BCOA-E performed better in terms of accuracy, the number of selected features, and execution time in most of the datasets. UR - https://jitm.ut.ac.ir/article_84900.html L1 - https://jitm.ut.ac.ir/article_84900_0a6603fdfdf3514291700cf7edb1497b.pdf ER -