Enhancing Fake News Detection by Attention-Based BiLSTM and Hybrid Whale-Multi-Verse Optimization

Document Type : Research Paper

Authors

Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, A.P.-522 502, India.

10.22059/jitm.2025.102975

Abstract

The proliferation of fake news, characterized by the dissemination of inaccurate information to deceive audiences, has become a pressing concern in recent times. Traditional approaches to phony news detection, often focused on analyzing Twitter content, are susceptible to noise and variations in input sequences, leading to suboptimal performance. To address these challenges, this study proposes a novel method called Multi-Head Attention-Hierarchical Bidirectional Long Short-Term Memory (MHA-HBiLSTM) Networks. Our approach involves two phases: training and testing, wherein we employ tweet pre-processing techniques such as stemming, punctuation removal, stop-word elimination, URL handling, and Twitter control removal. Features are represented using the Glove word embedding technique for experimental evaluation and comparison. The MHA-HBiLSTM model integrates multi-head attention and hierarchical concepts, allowing meaningful information extraction from Twitter data. Notably, our model utilizes dual-level attention mechanisms and a hierarchical structure, reflecting the inherent hierarchy in documents and prioritizing key material during document representation. The effectiveness of the proposed MHA-HBiLSTM algorithm is evaluated using the Whale & Multi-Verse (W-MVO) Optimizer approach, with tests conducted on Kaggle and FakeNewsNet datasets. Comparative analysis with traditional machine learning approaches and deep learning models demonstrates the superior performance of the MHA-HBiLSTM approach in fake news detection.

Keywords


Adedoyin, Z., & Mariyappan, B. (2022). Fake news detection using machine learning algorithms and recurrent neural networks. Advance Preprint, 1(1), 1–31. https://doi.org/10.31124/advance.20751379.v1
Ahmed, H., Traore, I., & Saad, S. (2017). Detection of online fake news using N-gram analysis and machine learning techniques. In Lecture Notes in Computer Science (Vol. 10618, pp. 127–138). Springer. https://doi.org/10.1007/978-3-319-69155-8_9
Bahad, P., Saxena, P., & Kamal, R. (2019). Fake news detection using bidirectional LSTM recurrent neural network. Procedia Computer Science, 165, 74–82. https://doi.org/10.1016/j.procs.2020.01.072
Bergstra, J., & Bengio, Y. (2012). Random search for hyper-parameter optimization. Journal of Machine Learning Research, 13, 281–305. https://dl.acm.org/doi/10.5555/2188385.2188395
Castillo, C., Mendoza, M., & Poblete, B. (2011). Information credibility on Twitter. In Proceedings of the 20th International Conference on World Wide Web (pp. 675–684). https://doi.org/10.1145/1963405.1963500
Chen, T., Li, X., Yin, H., & Zhang, J. (2018). Call attention to rumors: Deep attention-based recurrent neural networks for early rumor detection. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining (pp. 40–52). Springer. https://doi.org/10.48550/arXiv.1704.05973
Chen, X., Lian, C., Wang, L., Deng, H., Fung, S. H., Nie, D., Thung, K. H., Yap, P. T., Gateno, J., & Xia, J. J. (2019). One-shot generative adversarial learning for MRI segmentation of cranio-maxillofacial bony structures. IEEE Transactions on Medical Imaging, 39(3), 787–796. https://doi.org/10.1109/TMI.2019.2935409
Chen, Y., Sui, J., Hu, L., & Gong, W. (2019). Attention-residual network with CNN for rumor detection. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management (pp. 1121–1130). https://doi.org/10.1145/3357384.3357950
Ciampaglia, G. L., Shiralkar, P., Rocha, L. M., Bollen, J., Menczer, F., & Flammini, A. (2015). Computational fact-checking from knowledge networks. PLoS ONE, 10(6), Article e0128193. https://doi.org/10.1371/journal.pone.0128193
Conroy, N. J., Rubin, V. L., & Chen, Y. (2015). Automatic deception detection: Methods for finding fake news. In Proceedings of the 78th ASIS&T Annual Meeting: Information Science with Impact: Research in and for the Community (pp. 1–4). https://doi.org/10.1002/pra2.2015.145052010082
Dhiman, P., Kaur, A., & Bonkra, A. (2023). Fake information detection using deep learning methods: A survey. In Proceedings of the 2023 International Conference on Artificial Intelligence and Smart Communication (AISC) (pp. 858–863). https://doi.org/10.1109/aisc56616.2023.10085519
Dong, M., Yao, L., Wang, X., Benatallah, B., Sheng, Q. Z., & Huang, H. (2018). Dual: A deep unified attention model with latent relation representations for fake news detection. In Proceedings of the International Conference on Web Information Systems Engineering (pp. 199–209). https://doi.org/10.1007/978-3-030-02922-7_14
Guacho, G. B., Abdali, S., Shah, N., & Papalexakis, E. E. (2018). Semi-supervised content-based detection of misinformation via tensor embeddings. In Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (pp. 322–325). https://doi.org/10.48550/arXiv.1804.09088
Hu, G., Ding, Y., Qi, S., Wang, X., & Liao, Q. (2019). Multi-depth graph convolutional networks for fake news detection. In Proceedings of the Natural Language Processing and Chinese Computing (pp. 698–710). https://doi.org/10.1007/978-3-030-32233-5_54
Kim, Y. (2014). Convolutional neural networks for sentence classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 1746–1751). https://doi.org/10.48550/arXiv.1408.5882
Konkobo, P. M., Zhang, R., Huang, S., Minoungou, T. T., Ouedraogo, J. A., & Li, L. (2020). A deep learning model for early detection of fake news on social media. In Proceedings of the 7th International Conference on Behavioural and Social Computing (BESC) (pp. 1–6). https://doi.org/10.1109/BESC51023.2020.9348311
Kuang, Q., & Xu, X. (2010). Improvement and application of TF-IDF method based on text classification. In Proceedings of the IEEE International Conference on Internet Technology and Applications (pp. 1–4). https://doi.org/10.1109/ITAPP.2010.5566113
Li, X., Lu, P., Hu, L., Wang, X., & Lu, L. (2021). A novel self-learning semi-supervised deep learning network to detect fake news on social media. Multimedia Tools and Applications, 1–9. https://doi.org/10.1007/s11042-021-11065-x
Liashchynskyi, P., & Liashchynskyi, P. (2019). Grid search, random search, genetic algorithm: A big comparison for NAS. https://doi.org/10.48550/arXiv.1912.06059
Lin, Z., Feng, M., dos Santos, C. N., Yu, M., Xiang, B., Zhou, B., & Bengio, Y. (2017). A structured self-attentive sentence embedding. In Proceedings of the 5th International Conference on Learning Representations (ICLR). https://doi.org/10.48550/arXiv.1703.03130
Long, Y., Lu, Q., Xiang, R., Li, M., & Huang, C. R. (2017). Fake news detection through multi-perspective speaker profiles. In Proceedings of the 8th International Joint Conference on Natural Language Processing (pp. 252–256). https://aclanthology.org/I17-2043.pdf
Ma, J., Gao, W., & Wong, K. F. (2019). Detect rumors on Twitter by promoting information campaigns with generative adversarial learning. In Proceedings of the International World Wide Web Conferences (pp. 3049–3055). https://doi.org/10.1145/3308558.3313741
Ma, J., Gao, W., Wei, Z., Lu, Y., & Wong, K. F. (2015). Detect rumors using time series of social context information on microblogging websites. In Proceedings of the 24th ACM International Conference on Information and Knowledge Management (CIKM) (pp. 1751–1754). https://doi.org/10.1145/2806416.2806607
Ma, Z., Yu, C., & Hu, B. (2018). Detecting rumors from microblogs with recurrent neural networks. In Proceedings of the 26th International Conference on World Wide Web (pp. 1531–1539). https://ink.library.smu.edu.sg/sis_research/4630/
McCallum, A., & Nigam, K. (1998). A comparison of event models for naive Bayes text classification. In Proceedings of the Workshop on Learning for Text Categorization (pp. 91–98). http://www.kamalnigam.com/papers/multinomial-aaaiws98.pdf
Naithani, K., Raiwani, Y. P., Alam, I., & Aknan, M. (2023). Analyzing hybrid C4.5 algorithm for sentiment extraction over lexical and semantic interpretation. Journal of Information Technology Management, 15(Special Issue), 57–79. https://doi.org/10.22059/jitm.2023.95246
Potthast, M., Köpsel, S., Stein, B., & Hagen, M. (2017). A stylometric inquiry into hyperpartisan and fake news. In Proceedings of the 18th ACM Conference on Information and Knowledge Management (pp. 167–176). https://doi.org/10.18653/v1/P18-1022
Qiu, S., Zhao, Y., Jiao, J., Wei, Y., & Wei, S. (2019). Referring image segmentation by generative adversarial learning. IEEE Transactions on Multimedia, 22(5), 1333–1344. https://doi.org/10.1109/TMM.2019.2942480
Rathore, R. S., Sangwan, S., & Prakash, S. (2020). Hybrid WGWO: Whale grey wolf optimization-based novel energy-efficient clustering for EH-WSNs. Journal of Wireless Communications and Networking, 2020(101). https://doi.org/10.1186/s13638-020-01721-5
Ruchansky, N., Seo, S., & Liu, Y. (2017). CSI: A hybrid deep model for fake news detection. In Proceedings of the 26th International Conference on World Wide Web (pp. 797–806). https://doi.org/10.1145/3132847.3132877
Samadi, M., & Momtazi, S. (2023). Multichannel convolutional neural networks for detecting COVID-19 fake news. Digital Scholarship in the Humanities, 38(1), 379–389. https://doi.org/10.1093/llc/fqac023
Shikalgar, M. B., & Arage, C. S. (2023). Fake news detection using hybrid BiLSTM-TCN model with attention mechanism. In Proceedings of the 2023 2nd International Conference on Applied Artificial Intelligence and Computing (ICAAIC) (pp. 1130–1136). https://doi.org/10.1109/ICAAIC56838.2023.10140491
Shu, K., Mahudeswaran, D., Wang, S., & Liu, H. (2017). Fake news detection on social media: A data mining perspective. ACM SIGKDD Explorations Newsletter, 19(1), 22–36. https://doi.org/10.1145/3137597.3137600
Shu, K., Mahudeswaran, D., Wang, S., Lee, D., & Liu, H. (2020). FakeNewsNet: A data repository with news content, social context, and spatiotemporal information for studying fake news on social media. Big Data, 8(3), 171–188. https://doi.org/10.48550/arXiv.1809.01286
Shu, K., Wang, S., & Liu, H. (2018). Understanding user profiles on social media for fake news detection. In Proceedings of the 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR) (pp. 430–435). https://doi.org/10.1109/MIPR.2018.00092
Singh, M. K., & Kumar, A. (2023). Cucumber leaf disease detection and classification using a deep convolutional neural network. Journal of Information Technology Management, 15(Special Issue: Intelligent and Security for Communication, Computing Application (ISCCA-2022)), 94–110. https://doi.org/10.22059/jitm.2023.95248
Tanuku, S. R. (2022). Novel approach to capture fake news classification using LSTM and GRU networks. In Proceedings of the 2022 International Conference on Futuristic Technologies (INCOFT) (pp. 1–4). https://doi.org/10.1109/incoft55651.2022.10094467
Trueman, T. E., Kumar, A. J., Narayanasamy, P., & Vidya, J. (2021). Attention-based C-BiLSTM for fake news detection. Applied Soft Computing, 110, 107600. https://doi.org/10.1016/j.asoc.2021.107600
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. In Proceedings of the Neural Information Processing Systems (NIPS). https://doi.org/10.48550/arXiv.1706.03762
Vosoughi, S., Roy, D., & Aral, S. (2018). The spread of true and false news online. Science, 359, 1146–1151. https://doi.org/10.1126/science.aap9559
Wang, W. Y. (2017). "Liar, liar pants on fire": A new benchmark dataset for fake news detection. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL) (Vol. 2, pp. 422–426). https://doi.org/10.18653/v1/p17-2067
Wu, L., Rao, Y., Jin, H., Nazir, A., & Sun, L. (2019). Different absorption from the same sharing: Sifted multi-task learning for fake news detection. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (pp. 4644–4653). https://arxiv.org/pdf/1909.01720.pdf
Yu, F., Liu, Q., Wu, S., Wang, L., & Tan, T. (2019). Attention-based convolutional approach for misinformation identification from massive and noisy microblog posts. Computers & Security, 83, 106–121. https://doi.org/10.1016/j.cose.2019.02.003
Yuan, C., Ma, Q., Zhou, W., Han, J., & Hu, S. (2020). Early detection of fake news by utilizing the credibility of news, publishers, and users based on weakly supervised learning. In Proceedings of the 28th International Conference on Computational Linguistics (COLING) (pp. 5444–5454). https://doi.org/10.48550/arXiv.2012.04233
Zhang, D., & Lee, W. S. (2003). Question classification using support vector machines. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 26–32). https://doi.org/10.1145/860435.860443
Zhang, Y., Chen, W., Yeo, C. K., Lau, C. T., & Lee, B. S. (2017). Detecting rumors on online social networks using multi-layer autoencoder. In Proceedings of the 2017 IEEE Technology & Engineering Management Conference (TEMSCON) (pp. 437–441). https://doi.org/10.1109/TEMSCON.2017.7998415