Artificial Intelligence-Driven Cyberbullying Detection: A Survey of Current Techniques

Document Type : Research Paper

Authors

1 Department of Information Technology, College of Computer Qassim University Buraydah 51452, Saudi Arabia.

2 Department of Computer Science, College of Computer, Qassim University, Buraydah 52571, Saudi Arabia.

10.22059/jitm.2024.99050

Abstract

Cyberbullying involves using hurtful or offensive language that goes against basic rules of respect and politeness. It harms the online environment and can negatively affect people by causing harassment, discrimination, or emotional pain. To combat this, it is crucial to develop automated methods for detecting and preventing the dissemination of such content. Deep learning, a branch of artificial intelligence, leverages neural networks to learn from data and perform complex tasks, effectively capturing semantic and grammatical nuances to differentiate between abusive and non-abusive language. This survey paper reviews current techniques and advancements in deep learning-based approaches for detecting cyberbullying content on online platforms, aiming to provide a comprehensive understanding of existing methodologies and identify potential avenues for future research to mitigate the spread and impact of such behaviors on the internet.

Keywords


Ahmed, M. T., Rahman, M., Nur, S., Islam, A. Z. M. T., & Das, D. (2021). Natural language processing and machine learning based cyberbullying detection for Bangla and Romanized Bangla texts. TELKOMNIKA (Telecommunication Computing Electronics and Control), 20(1), 89-97.
Ahmed, Md. T., Islam, A., Rahman, M., Rashed, Md. G., Urmi, A. S., & Das, D. (2023). Cyberbullying detection based on hybrid ensemble method using deep learning technique in Bangla dataset. International Journal of Advanced Computer Science and Applications, 14(9), 545-551.
Akhter, M. P., Jiangbin, Z., Naqvi, I. R., AbdelMajeed, M., & Zia, T. (2022). Abusive language detection from social media comments using conventional machine learning and deep learning approaches. Multimedia Systems, 28(6), 1925-1940.
Al Shamsi, A. A. (2019). Effectiveness of cyber security awareness program for young children: A case study in UAE. International Journal of Information Technology and Language Studies, 3(2), 8-29.
Alakrot, A., Murray, L., & Nikolov, N. S. (2018). Dataset construction for the detection of anti-social behaviour in online communication in Arabic. Procedia Computer Science, 142, 174-181.
Alabdulwahab, A., Haq, M. A., & Alshehri, M. (2023). Cyberbullying detection using machine learning and deep learning. International Journal of Advanced Computer Science and Applications, 14(10), 424–432.
Alduailaj, A. M., & Belghith, A. (2023). Detecting Arabic cyberbullying tweets using machine learning. Machine Learning and Knowledge Extraction, 5(1), 29-42.
AlFarah, M. E., Alharbi, B. Y., & Ibrahim, D. M. (2022). Arabic cyberbullying detection from imbalanced dataset using machine learning. Soft Computing and Its Engineering Applications, 397–409.
AlHarbi, B. Y., AlHarbi, M. S., AlZahrani, N. J., Alsheail, M. M., Alshobaili, J. F., & Ibrahim, D. M. (2019). Automatic cyberbullying detection in Arabic social media. International Journal of Engineering Research and Technology, 12(12), 2330-2335.
AlHarbi, B. Y., AlHarbi, M. S., AlZahrani, N. J., Alsheail, M. M., & Ibrahim, D. M. (2020). Using machine learning algorithms for automatic cyberbullying detection in Arabic social media. Journal of Information Technology Management, 12(2), 123-130.
Al-Hashedi, M., Soon, L. K., Goh, H. N., Lim, A. H. L., & Siew, E. G. (2023). Cyberbullying detection based on emotion. IEEE Access, 11, 53907-53918.
Almutiry, S., & Abdel Fattah, M. (2021). Arabic cyberbullying detection using Arabic sentiment analysis. The Egyptian Journal of Language Engineering, 8(1), 39-50.
Alrashidi, B., Jamal, A., & Alkhathlan, A. (2023). Abusive content detection in Arabic tweets using multi-task learning and transformer-based models. Applied Sciences, 13(10), 5825.
Alzaqebah, M., Jaradat, G. M., Nassan, D., Alnasser, R., Alsmadi, M. K., Almarashdeh, I., & Alkhushayni, S. (2023). Cyberbullying detection framework for short and imbalanced Arabic datasets. Journal of King Saud University-Computer and Information Sciences, 35(8), 101652.
Balakrishnan, V., Khan, S., & Arabnia, H. R. (2020). Improving cyberbullying detection using Twitter users’ psychological features and machine learning. Computers & Security, 90, 101710.
Bouliche, A., & Rezoug, A. (2022). Detection of cyberbullying in Arabic social media using dynamic graph neural network. In TACC (pp. 1-11).
Charbuty, B., & Abdulazeez, A. (2021). Classification based on decision tree algorithm for machine learning. Journal of Applied Science and Technology Trends, 2(01), 20-28.
Chen, Y., Zhou, Y., Zhu, S., & Xu, H. (2012, September). Detecting offensive language in social media to protect adolescent online safety. In 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Conference on Social Computing (pp. 71-80). IEEE.
Cheng, L., Silva, Y. N., Hall, D., & Liu, H. (2020). Session-based cyberbullying detection: Problems and challenges. IEEE Internet Computing, 25(2), 66-72.
Chowdhury, S. A., Mubarak, H., Abdelali, A., Jung, S. G., Jansen, B. J., & Salminen, J. (2020, May). A multi-platform Arabic news comment dataset for offensive language detection. In Proceedings of the Twelfth Language Resources and Evaluation Conference (pp. 6203-6212).
Craig, W. M., & Pepler, D. J. (2007). Understanding bullying: From research to practice. Canadian Psychology/Psychologie Canadienne, 48(2), 86.
Daniel, R., Murthy, T., Kumari, C., Lydia, L., Ishak, M. K., Hadjouni, M., & Mostafa, S. M. (2023). Ensemble learning with tournament selected glowworm swarm optimization algorithm for cyberbullying detection on social media. IEEE Access, 20, 1–8.
Dinakar, K., et al. (2012). Common sense reasoning for detection, prevention, and mitigation of cyberbullying. ACM Transactions on Interactive Intelligent Systems (TiiS), 2(3), 1-30.
Emon, E. A., Rahman, S., Banarjee, J., Das, A. K., & Mittra, T. (2019, June). A deep learning approach to detect abusive Bengali text. In 2019 7th International Conference on Smart Computing & Communications (ICSCC) (pp. 1-5). IEEE.
Gudumotu, C. E., Nukala, S. R., Reddy, K., Konduri, A., & Gireesh, C. (2023). A survey on deep learning models to detect hate speech and bullying in social media. In Artificial Intelligence for Societal Issues (pp. 27-44). Cham: Springer International Publishing.
Guo, Y., Liu, Y., Oerlemans, A., Lao, S., Wu, S., & Lew, M. S. (2016). Deep learning for visual understanding: A review. Neurocomputing, 187, 27-48.
Haidar, B., Chamoun, M., & Yamout, F. (2016). Cyberbullying detection: A survey on multilingual techniques. In UKSIM 2016. Pisa, Italy: Publisher.
Haidar, B., Chamoun, M., & Serhrouchni, A. (2017, October). Multilingual cyberbullying detection system: Detecting cyberbullying in Arabic content. In 2017 1st Cyber Security in Networking Conference (CSNet) (pp. 1-8). IEEE.
Hasan, M., Al, E., Hossain, M. S., Akter, A., Ahmed, M., & Islam, S. (2023). A review on deep-learning-based cyberbullying detection. Future Internet, 15(179), 1-47.
Hoque, M. N., Chakraborty, P., & Seddiqui, M. H. (2023). The challenges and approaches during the detection of cyberbullying text for low-resource language: A literature review. ECTI Transactions on Computer and Information Technology (ECTI-CIT), 17(2), 192-214.
Husain, F., & Uzuner, O. (2021). Transfer learning approach for Arabic offensive language detection system—BERT-based model. arXiv preprint arXiv:2102.05708.
Husain, F., Lee, J., Henry, S., & Uzuner, O. (2020). SalamNet at SemEval-2020 Task 12: Deep learning approach for Arabic offensive language detection. arXiv preprint arXiv:2007.13974.
Kanam, V. O. (2020). Comparative sentiment analysis of techniques for cyberbullying detection on Twitter (Doctoral dissertation, Strathmore University).
Kanan, T., Aldaaja, A., & Hawashin, B. (2020). Cyberbullying and cyberharassment detection using supervised machine learning techniques in Arabic social media contents. Journal of Internet Technology, 21(5), 1409-1421.
Khairy, M., Mahmoud, T. M., Omar, A., & Abd El-Hafeez, T. (2023). Comparative performance of ensemble machine learning for Arabic cyberbullying and offensive language detection. Language Resources and Evaluation, 1-18.
Kompally, P., Sethuraman, S. C., Walczak, S., Johnson, S., & Cruz, M. V. (2021). Malang: A decentralized deep learning approach for detecting abusive textual content. Applied Sciences, 11(18), 8701.
Kufel, J., et al. (2023). What is machine learning, artificial neural networks and deep learning?—Examples of practical applications in medicine. Diagnostics, 13(15), 2582.
Kumari, K., & Singh, J. P. (2020, May). AI_ML_NIT_Patna@TRAC-2: Deep learning approach for multilingual aggression identification. In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying (pp. 113-119).
Kumari, K., & Singh, J. P. (2019). AI ML NIT Patna at HASOC 2019: Deep learning approach for identification of abusive content. FIRE (Working Notes), 2517, 328-335.
Mahesh, Batta. (2020). Machine learning algorithms: A review. International Journal of Science and Research (IJSR), 9(1), 381-386.
Mahmud, T., Ptaszynski, M., Eronen, J., & Masui, F. (2023). Cyberbullying detection for low-resource languages and dialects: Review of the state of the art. Information Processing & Management, 60(5), 103454.
Mat Ali, H. (2005). Censorship agent: Identifying and determining offensive words.
McFarland, L. A., & Ployhart, R. E. (2015). Social media: A contextual framework to guide research and practice. Journal of Applied Psychology, 100(6), 1653–1677.
Mohaouchane, H., Mourhir, A., & Nikolov, N. S. (2019, October). Detecting offensive language on Arabic social media using deep learning. In 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS) (pp. 466-471). IEEE.
Mubarak, H., et al. (2020). Arabic offensive language on Twitter: Analysis and experiments. arXiv preprint arXiv:2004.02192.
Muneer, A., et al. (2023). Cyberbullying detection on social media using stacking ensemble learning and enhanced BERT. Information, 14(8), 467.
Nandakumar, V., Kovoor, B. C., & Sreeja, M. U. (2018). Cyberbullying revelation in Twitter data using Naïve Bayes classifier algorithm. International Journal of Advanced Research in Computer Science, 9(1).
Narayanan, S., & Panayiotis, G. G. (2013). Behavioral signal processing: Deriving human behavioral informatics from speech and language. Proceedings of the IEEE, 101(5), 1203-1233.
Noviantho, S. I., & Livia, A. (2017). Cyberbullying classification using text mining. 2017 1st International Conference on Informatics and Computational Sciences (ICICoS).
Osisanwo, F. Y., et al. (2017). Supervised machine learning algorithms: Classification and comparison. International Journal of Computer Trends and Technology (IJCTT), 48(3), 128-138.
PK, F. A. (1984). What is Artificial Intelligence? In Success is No Accident: It is Hard Work, Perseverance, Learning, Studying, Sacrifice and Most of All, Love of What You are Doing or Learning to Do (p. 65).
Reynolds, K., Kontostathis, A., & Edwards, L. (2011, December). Using machine learning to detect cyberbullying. In 2011 10th International Conference on Machine Learning and Applications and Workshops (Vol. 2, pp. 241-244). IEEE.
Roy, P. K., & Mali, F. U. (2022). Cyberbullying detection using deep transfer learning. Complex & Intelligent Systems, 8(6), 5449–5467.
Salawu, S., Lumsden, J., & He, Y. (2021, August). A large-scale English multi-label Twitter dataset for cyberbullying and online abuse detection. In The 5th Workshop on Online Abuse and Harms (pp. 146-156). Association for Computational Linguistics.
Sari, T. I., Ardilla, Z. N., Hayatin, N., & Maskat, R. (2022). Abusive comment identification on Indonesian social media data using hybrid deep learning. IAES International Journal of Artificial Intelligence, 11(3), 895-904.
Sarker, I. H., Furhad, M. H., & Nowrozy, R. (2021). AI-driven cybersecurity: An overview, security intelligence modeling, and research directions. SN Computer Science, 2, 1-18.
Sarker, M., Hossain, M. F., Liza, F. R., Sakib, S. N., & Al Farooq, A. (2022, February). A machine learning approach to classify anti-social Bengali comments on social media. In 2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE) (pp. 1-6). IEEE.
Shalev-Shwartz, S., & Ben-David, S. (2014). Understanding machine learning: From theory to algorithms. Cambridge University Press.
Shannag, F., Hammo, B. H., & Faris, H. (2022). The design, construction, and evaluation of annotated Arabic cyberbullying corpus. Education and Information Technologies, 27(8), 10977-11023.
Shannaq, F., et al. (2022). Offensive language detection in Arabic social networks using evolutionary-based classifiers learned from fine-tuned embeddings. IEEE Access, 10, 75018–75039.
Shapiro, S. C. (1992). Encyclopedia of artificial intelligence (2nd ed.). New Jersey: Wiley Inter-science.
Willard, N. E. (2007). Cyberbullying and cyberthreats: Responding to the challenge of online social aggression, threats, and distress. Research Press.
Yastira, E., & Cahyono, H. D. (2023, October). Intelligent web service system for detecting cyberbullying on Twitter based on support vector machine and random forest algorithms. In 2023 International Conference on Converging Technology in Electrical and Information Engineering (ICCTEIE) (pp. 50-55). IEEE.
Yi, P., & Zubiaga, A. (2023). Session-based cyberbullying detection in social media: A survey. Online Social Networks and Media, 36, 100250.
Zheng, A., & Casari, A. (2018). Feature engineering for machine learning: Principles and techniques for data scientists. O'Reilly Media, Inc.
Židová, M., Hollá, K., & Rybanský, Ľ. (2021). Parental control and cyberbullying. EDULEARN21 Proceedings. IATED.