Development of Boosting Algorithm Machine Learning Models for Diabetic Retinopathy Classification Using Symlet Wavelets
Main Article Content
Abstract
This research aims to 1) develop and evaluate the performance of boosting machine learning models, and 2) classify retinal images with diabetic retinopathy using symlet wavelet transform feature extraction from 10 sub-bands. A total of 60 symlet features were obtained by extracting six statistical measures, namely mean, standard deviation, energy, entropy, skewness, and kurtosis, from each of the 10 sub-bands. The feature set was fed into three boosting machine learning algorithms, specifically XGBoost, LightGBM, and CatBoost, for classification model development and performance comparison. Model performance was assessed through 10-fold cross-validation and confusion matrix analysis to determine accuracy, sensitivity, specificity, and F1-score metrics for optimal model selection. Comparative performance evaluation demonstrated that the LightGBM model achieved the best overall performance across nearly all evaluation metrics, with an accuracy of 94.47%, sensitivity of 95.24%, specificity of 93.69%, and F1-score of 94.56%. These findings suggest that integrating symlet wavelet features with boosting-based machine learning algorithms provides a promising approach for diabetic retinopathy screening and diagnosis.
Downloads
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
The owner (Research and Development Institute, Kalasin University), the authors agree that any copies of the article or any part thereof distributed or posted by them in print or electronic format as permitted will include the notice of copyright as stipulated in the journal and a full citation to the final published version of the contribution in the journal as published by Research and Development Institute, Kalasin University.
References
Chanlalit W. Ocular complications from diabetes mellitus. J Med Health Sci. 2016; 23(2): 36-45. https://shorturl.asia/bz4ST
กรกัญจน์ จิตโศภิษฐ์. ความชุกของภาวะเบาหวานขึ้นจอประสาทตาและระดับน้ำตาลสะสมในเลือดที่สัมพันธ์กับ การเกิดภาวะเบาหวานขึ้นจอประสาทตาในผู้ป่วยโรคเบาหวาน โรงพยาบาลสันป่าตอง จังหวัดเชียงใหม่. วารสารวิชาการป้องกันควบคุมโรค สคร.2 พิษณุโลก. 2566; 10(2): 15-25. สืบค้นจาก https://he01.tci-thaijo.org/index.php/dpcphs/article/view/262223
Onjun R, Lowmunkhong S, Kaennakham S, Phattaramarut K. Diabetic retinopathy detection using a convolutional neural network enhanced by wavelet transformation pooling. In: Proceedings of the 2024 16th International Conference on Bioinformatics and Biomedical Technology (ICBBT '24); 2024 May 24-26; Chongqing. Association for Computing Machinery; 2024. p. 265-269.
https://dl.acm.org/doi/10.1145/3674658.3674699
Tomasi C, Manduchi R. Bilateral filtering for gray and color images. In: Proceedings of the 6th International Conference on Computer Vision; 1998 Jan 7; Bombay. IEEE; 1998. p. 839-846. https://ieeexplore.ieee.org/document/710815
Rasta SH, Partovi ME, Seyedarabi H, Javadzadeh A. A comparative study on preprocessing techniques in diabetic retinopathy retinal images: illumination correction and contrast enhancement. J Med Signals Sens. 2015; 5(1): 40-48. Available from https://shorturl.asia/0x8hs
Mutawa AM, Al-Sabti K, Raizada S, Sruthi S. A deep learning model for detecting diabetic retinopathy stages with discrete wavelet transform. Appl Sci. 2024; 14(11): 4428. https://doi.org/10.3390/app14114428
Darabi P. Diagnosis of diabetic retinopathy [dataset]. ResearchGate; 2024. https://www.researchgate.net/publication/382264856_Diagnosis_of_Diabetic_Retinopathy
PyWavelets. PyWavelets: wavelet transforms in python [Internet]. Read the Docs; [cited 2025 Nov 5]. Available from: https://pywavelets.readthedocs.io/en/latest/
Scikit-Image. Scikit-image: image processing in python [Internet]. PyPI; [cited 2025 Nov 6]. Available from: https://pypi.org/project/scikit-image
OpenCV. opencv-python [Internet]. PyPI; [cited 2025 Nov 19]. Available from: https://pypi.org/project/opencv-python/
Ganie SM, Pramanik PKD, Bashir Malik M, Mallik S, Qin H. An ensemble learning approach for diabetes prediction using boosting techniques. Front Genet. 2023; 14: 1252159. https://doi.org/10.3389/fgene.2023.1252159
Chen T, Guestrin C. XGBoost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2016 Aug 13-17; San Francisco. New York: ACM; 2016. p. 785-794. https://dl.acm.org/doi/pdf/10.1145/2939672.2939785
Yang P, Yang B. Development and validation of predictive models for diabetic retinopathy using machine learning. PLoS One. 2025; 20(2): e0318226. https://doi.org/10.1371/journal.pone.0318226
Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, et al. LightGBM: a highly efficient gradient boosting decision tree. In: Proceedings of the 31st International Conference on Neural Information Processing Systems; 2017 Dec 4; Long Beach. Red Hook, NY: Curran Associates; 2017. p. 3149-3157. https://dl.acm.org/doi/10.5555/3294996.3295074
Rufo DD, Debelee TG, Ibenthal A, Negera WG. Diagnosis of diabetes mellitus using gradient boosting machine (LightGBM). Diagnostics. 2021; 11(9): 1714. https://doi.org/10.3390/diagnostics11091714
มนัสพร ตรีรุ่งโรจน์. ตัวแบบการเรียนรู้ของเครื่องอิทธิพลผสมสำหรับการวิเคราะห์การรอดชีพเวลาไม่ต่อเนื่อง [วิทยานิพนธ์ปริญญาดุษฎีบัณฑิต]. กรุงเทพฯ: คณะพาณิชยศาสตร์และการบัญชี จุฬาลงกรณ์มหาวิทยาลัย; 2565. https://digital.car.chula.ac.th/chulaetd/6671
Xu W, Wang W, Ren H, Li X, Wen Y. Prediction and analysis of risk factors for diabetic retinopathy based on machine learning and interpretable models. Heliyon. 2024; 10(9): e29497. https://doi.org/10.1016/j.heliyon.2024.e29497
Pandas development team. Pandas: python data analysis library [Internet]. [cited 2025 Nov 9]. Available from: https://pandas.pydata.org
NumPy. NumPy: Fundamental package for array computing in Python [Internet]. PyPI; [cited 2025 Dec 7]. Available from: https://pypi.org/project/numpy
Scikit-learn. A set of python modules for machine learning and data mining [Internet]. PyPI; [cited 2025 Dec 7]. Available from: https://pypi.org/project/scikit-learn
SciPy. SciPy: Fundamental algorithms for scientific computing in Python [Internet]. PyPI; [cited 2025 Nov 14]. Available from: https://pypi.org/project/scipy
Kohavi R. A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence; 1995 Aug 20; Montreal. San Francisco: Morgan Kaufmann; 1995. p.1137-1143. https://dl.acm.org/doi/10.5555/1643031.1643047
Bidwai P, Gite S, Pradhan B, Almari A. Explainable diabetic retinopathy detection using a distributed CNN and LightGBM framework. Comput Mater Contin. 2025; 84(2): 2645-2676. https://www.techscience.com/cmc/v84n2/62859
Bapatla S, Harikiran J. LuNet-LightGBM: an effective hybrid approach for lesion segmentation and DR grading. Comput Syst Sci Eng. 2023; 46(1): 597-617. https://doi.org/10.32604/csse.2023.034998
Aziz A, Tezel NS, Kaçmaz S, Attallah Y. Early diabetic retinopathy detection from OCT images using multifractal analysis and multi-layer perceptron classification. Diagnostics. 2025; 15(13): https://doi.org/10.3390/diagnostics15131616