Attentional Deep Learning with Inverse Transform Sampling for Robust Respiratory Sound Classification

Document Type : Research Paper

Authors

1 Department of Computer Science, Christ University, Bengaluru, India.

2 Department of Computer Applications, Presidency College, Bangalore, India.

3 Department of Commerce, Manipal Academy of Higher Education, Manipal, India.

4 Assistant Professor, Department of Computer Science and Engineering, Excel Engineering College Komarpalayampalyam, Tamilnadu, India.

5 Associate Professor, Department of Computer Science and Engineering, Vinayaka Mission`s Kirupananda Variyar Engineering College, Salem (Vinayaka Mission`s Research Foundation), India.

6 Associate professor, Department of Computer Science and Engineering, Bannari Amman Institute of Technology, Erode, Tamilnadu, India.

10.22059/jitm.2026.106257

Abstract

The necessity for efficient breathing sound classification systems originates from respiratory diseases, which impair oxygen-carbon dioxide exchange and impact lung function. Feature extraction and pattern categorization are general components of such systems. Because of their effectiveness with big datasets, deep neural networks have acquired popularity recently in the category of breathing sounds. Enhancing medical care requires cooperation amongst researchers, medical professionals, and patients. An attentional deep learning model with inverse transform sampling is presented in this study to classify respiratory diseases from audio data. Robust models were developed to classify and detect respiratory elements using the Respiratory Sound dataset. The primary objectives include effectively determining lung sounds and determining respiratory illnesses. The architectures of CNN, VGG16, and ResNet50 were developed to extract features and categorize data. Also, the pre-trained models ResNet50 and VGG16 identify critical characteristics in spectrum pictures more accurately. Inverse transfer sampling is used to rectify class imbalance in respiratory datasets.  The models achieved 98% accuracy with the CNN model, 83% accuracy with VGG16, and 95% accuracy with ResNet50. Moreover, LSTM and CRNN models offer more information on how respiratory illnesses are classified.

Keywords


Ali, Shams Nafisa, et al. "An End-to-end Deep Learning Framework for Real-Time Denoising of Heart Sounds for Effective Computer-Aided Auscultation in Unseen Noise." Authorea Preprints (2023).
Anupama, H. S., K. R. Pradeep, G. Shreeya, Pratiksha Rao, and S. K. Tejasvi. "Detection of Chronic Lung Disorders using Deep Learning." In 2022 Fourth International Conference on Cognitive Computing and Information Processing (CCIP), pp. 1-5. IEEE, 2022.
Bapa, Aditya, Omkar Bandgar, Arnav Ekapure, and Jignesh Sisodia. "Respiratory disorder classification based on lung auscultation using MFCC, Mel Spectrogram and Chroma STFT." In 2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1), pp. 1-7. IEEE, 2023.
Khanaghavalle GR, Manoj A, Karthikeyan J, Murali R. A Deep Learning Framework for Multiclass Categorization of Pulmonary Diseases. In 2023 World Conference on Communication & Computing (WCONF), pp. 1-6. IEEE, 2023.
Kilic, M., Barua, P. D., Keles, T., Yildiz, A. M., Tuncer, I., Dogan, S., ... & Acharya, U. R. (2024). GCLP: An automated asthma detection model based on global chaotic logistic pattern using cough sounds. Engineering Applications of Artificial Intelligence, 127, 107184.
Koshta, Vaibhav, Bikesh Kumar Singh, Ajoy K. Behera, and Ranganath T. Ganga. "Classification of Asthma, COPD and Healthy Lung Sounds Using Fourier Bessel Series Expansion in Machine Learning and Deep Learning Paradigm." In 2023 11th International Conference on Intelligent Systems and Embedded Design (ISED), pp. 1-6. IEEE, 2023.
Kumar, Dhirendra. "Multi Spectral Feature Extraction to Improve Lung Sound Classification using CNN." In 2023 10th International Conference on Signal Processing and Integrated Networks (SPIN), pp. 186-191. IEEE, 2023.
Ma, W. B., Deng, X. Y., Yang, Y., & Fang, W. C. (2022). An effective lung sound classification system for respiratory disease diagnosis using DenseNet CNN model with sound preprocessing engine. In 2022 IEEE Biomedical Circuits and Systems Conference (BioCAS), pp. 218-222. IEEE.
Pham, Lam, Huy Phan, Ramaswamy Palaniappan, Alfred Mertins, and Ian McLoughlin. "CNN-MoE based framework for classification of respiratory anomalies and lung disease detection." IEEE Journal of Biomedical and Health Informatics, 25(8), 2938–2947, 2021.
Phettom, R., Theera-Umpon, N., & Auephanwiriyakul, S. (2023). Automatic Identification of Abnormal Lung Sounds Using Time-Frequency Analysis and CNN. In 2023 ICITEE, pp. 1-6. IEEE.
Roy, Arka, and Udit Satija. "RDLINet: A Novel Lightweight Inception Network for Respiratory Disease Classification Using Lung Sounds." IEEE Transactions on Instrumentation and Measurement (2023).
Shuvo, S. B., Ali, S. N., Swapnil, S. I., Hasan, T., & Bhuiyan, M. I. H. (2020). A lightweight CNN model for detecting respiratory diseases from lung auscultation sounds using EMD–CWT-based hybrid scalogram. IEEE Journal of Biomedical and Health Informatics, 25(7), 2595–2603.
Tariq, Zeenat, Sayed Khushal Shah, and Yugyung Lee. "Lung disease classification using deep convolutional neural network." In 2019 IEEE BIBM, pp. 732-735. IEEE, 2019.
Tariq, Zeenat, Sayed Khushal Shah, and Yugyung Lee. "Multimodal lung disease classification using deep convolutional neural network." In 2020 IEEE BIBM, pp. 2530-2537. IEEE, 2020.
Tsai, K. H., Wang, W. C., Cheng, C. H., Tsai, C. Y., Wang, J. K., Lin, T. H., ... & Tsao, Y. (2020). Blind monaural source separation on heart and lung sounds based on periodic-coded deep autoencoder. IEEE Journal of Biomedical and Health Informatics, 24(11), 3203–3214.
Wang, F., Yuan, X., & Meng, B. (2023). Classification of Abnormal Lung Sounds Using Deep Learning. In 2023 ICSIP, pp. 506-510. IEEE.
Yamashita, M. (2021). Classification Between Normal and Abnormal Respiration Using Ergodic HMM for Intermittent Abnormal Sounds. In 2021 European Signal Processing Conference (EUSIPCO), pp. 1187-1190. IEEE.
Zakaria, Neili, Fezari Mohamed, Redjati Abdelghani, and Kenneth Sundaraj. "VGG16, ResNet-50, and GoogLeNet deep learning architecture for breathing sound classification: a comparative study." In 2021 AI-CSP Conference, pp. 1-6. IEEE, 2021.