%0 Journal Article
%T Filter-Based Feature Selection Using Information Theory and Binary Cuckoo Optimisation Algorithm
%J Journal of Information Technology Management
%I Faculty of Management, University of Tehran
%Z 2980-7972
%A Usman, Ali Muhammad
%A Yusof, Umi Kalsom
%A Sabudin, Maziani
%D 2022
%\ 02/01/2022
%V 14
%N Special Issue: 5th International Conference of Reliable Information and Communication Technology (IRICT 2020)
%P 203-222
%! Filter-Based Feature Selection Using Information Theory and Binary Cuckoo Optimisation Algorithm
%K Feature Selection
%K Filter-Based
%K Binary Cuckoo Optimization
%K information theory
%R 10.22059/jitm.2022.84900
%X Dimensionality reduction is a data mining process used to reduce the noise and complexity of features in various datasets. Feature selection (FS) is one of the most commonly used dimensionality reduction techniques; it removes unwanted features from the datasets. FS methods can be either wrappers or filters. Wrappers select feature subsets with better classification performance but are computationally expensive. Filters, on the other hand, are computationally fast but do not account for feature interaction among the selected features, which in turn affects the classification performance of the chosen subsets. This study proposes using two concepts from information theory, mutual information (MI) and entropy (E), together with the binary cuckoo optimization algorithm (BCOA), giving two methods: BCOA-MI and BCOA-E. The target is to improve classification performance (reduce the error rate and computational complexity) on eight datasets with varying degrees of complexity. A support vector machine classifier was used to compute the error rate on each dataset for both BCOA-MI and BCOA-E. Analysis of the results showed that BCOA-E selects fewer features and performs better in terms of error rate. In contrast, BCOA-MI is computationally faster but chooses a larger number of features. Comparison with other methods found in the literature shows that the proposed BCOA-MI and BCOA-E performed better in terms of accuracy, number of selected features, and execution time on most of the datasets.
%U https://jitm.ut.ac.ir/article_84900_0a6603fdfdf3514291700cf7edb1497b.pdf