Comparison of DeepLab v3+’s base networks for crack segmentation under limited resources

Main Article Content

Thitiporn Lertrusdachakul
Pierre-Emmanuel Leni

Abstract

Accurate crack segmentation plays a crucial role in infrastructure assessment and preventive maintenance. This research explored the crack segmentation efficacy of DeepLab v3+, a modern and advanced semantic segmentation network with a high performance and reduced computational cost. The performance comparison was investigated of DeepLab v3+ with different base networks, including Inception-ResNet-v2, Xception, ResNet-50, MobileNet-v2, and ResNet-18. The objective of this paper was to recommend the base network and its optimizer of DeepLab v3+ architecture in terms of crack segmentation of structure for structural health assessment and monitoring under limited resources. The optimizer algorithm, mini-batch size, learning rate, and squared gradient decay factor were adjusted to obtain the best model for each base network considering limited resources of graphics processing unit (GPU) for model training. The best results were analyzed in terms of mean accuracy, class accuracy, and weighted IoU whilst taking the model size into account. The recommended models ranked from the most accurate to the smallest in size are DeepLab v3+ network based on ResNet-50 with Adam optimizer, Xception with RMSProp optimizer, ResNet-18 with SGDM optimizer, and MobileNet-v2 with RMSProp optimizer, respectively. The findings assist in choosing a suitable network architecture for specific applications considering the compromise between model size and performance. The results also highlight the feasibility of the network architecture with tested conditions in terms of structural crack segmentation under limited computational resources.

Downloads

Download data is not yet available.

Article Details

How to Cite
Lertrusdachakul, T., & Leni, P.-E. (2025). Comparison of DeepLab v3+’s base networks for crack segmentation under limited resources. Science, Engineering and Health Studies, 19, 25020009. https://doi.org/10.69598/sehs.19.25020009
Section
Physical sciences

References

Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., & Adam, H. (2018). Encoder-decoder with atrous separable convolution for semantic image segmentation. In V. Ferrari, M. Hebert, C. Sminchisescu, & Y. Weiss (Eds.), Computer vision – ECCV 2018. Lecture note in computer science, volume 11211 (pp. 833–851). Springer. https://doi.org/10.1007/978-3-030-01234-2_49

Chollet, F. (2017). Xception: Deep learning with depthwise separable convolutions. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 1800–1807). IEEE. https://doi.org/10.1109/CVPR.2017.195

Cui, X., Wang, Q., Dai, J., Li, S., Xie, C., & Wang, J. (2022). Pixel-level intelligent recognition of concrete cracks based on DRACNN. Materials Letters, 306, Article 130867. https://doi.org/10.1016/j.matlet.2021.130867

Du, J. (2019). The frontier of SGD and its variants in machine learning. Journal of Physics: Conference Series, 1229, Article 012046. https://doi.org/10.1088/1742-6596/1229/1/012046

Fu, H., Meng, D., Li, W., & Wang, Y. (2021). Bridge crack semantic segmentation based on improved Deeplabv3+. Journal of Marine Science and Engineering, 9(6), Article 671. https://doi.org/10.3390/jmse9060671

Han, C., Ma, T., Huyan, J., Huang, X., & Zhang, Y. (2022). CrackW-Net: A novel pavement crack image segmentation convolutional neural network. IEEE Transactions on Intelligent Transportation Systems, 23(11), 22135–22144. https://doi.org/10.1109/TITS.2021.3095507

He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 770–778). IEEE. https://doi.org/10.1109/CVPR.2016.90

Kheradmandi, N., & Mehranfar, V. (2022). A critical review and comparative study on image segmentation-based techniques for pavement crack detection. Construction and Building Materials, 321, Article 126162. https://doi.org/10.1016/j.conbuildmat.2021.126162

Kim, J., Shim, S., Cha, Y., & Cho, G.-C. (2021). Lightweight pixel-wise segmentation for efficient concrete crack detection using hierarchical convolutional neural network. Smart Materials and Structures, 30(4), Article 045023. https://doi.org/10.1088/1361-665X/abea1e

Lau, S. L. H., Chong, E. K. P., Yang, X., & Wang, X. (2020). Automated pavement crack segmentation using U-Net-based convolutional neural network. IEEE Access, 8, 114892–114899. https://doi.org/10.1109/ACCESS.2020.3003638

Li, H., Wang, W., Wang, M., Li, L., & Vimlund, V. (2022). A review of deep learning methods for pixel-level crack detection. Journal of Traffic and Transportation Engineering (English Edition), 9(6), 945–968. https://doi.org/10.1016/j.jtte.2022.11.003

Liu, J., Yang, X., Lau, S., Wang, X., Luo, S., Lee, V. C.-S., & Ding, L. (2020). Automated pavement crack detection and segmentation based on two-step convolutional neural network. Computer-Aided Civil and Infrastructure Engineering, 35(11), 1291–1305. https://doi.org/10.1111/mice.12622

Liu, Y., Yao, J., Lu, X., Xie, R., & Li, L. (2019). DeepCrack: A deep hierarchical feature learning architecture for crack segmentation. Neurocomputing, 338, 139–153. https://doi.org/10.1016/j.neucom.2019.01.036

Mei, Q., & Gül, M. (2020). A cost effective solution for pavement crack inspection using cameras and deep neural networks. Construction and Building Materials, 256, Article 119397. https://doi.org/10.1016/j.conbuildmat.2020.119397

Mohan, A., & Poobal, S. (2018). Crack detection using image processing: A critical review and analysis. Alexandria Engineering Journal, 57(2), 787–798. https://doi.org/10.1016/j.aej.2017.01.020

Nguyen, N. H. T., Perry, S., Bone, D., Le, H. T., & Nguyen, T. T. (2021). Two-stage convolutional neural network for road crack detection and segmentation. Expert Systems with Applications, 186, Article 115718. https://doi.org/10.1016/j.eswa.2021.115718

Nguyen, T.-G., Do, T.-L., Nguyen, T.-N., & Nguyen, N.-N. (2024). Semantic segmentation of cracks using DeepLabv3+. In J. N. Reddy, C. M. Wang, V. H. Luong, & A. T. Le (Eds.), Proceedings of the Third International Conference on Sustainable Civil Engineering and Architecture (ICSCEA 2023) (pp. 1539–1546). Springer. https://doi.org/10.1007/978-981-99-7434-4_165

Pu, R., Ren, G., Li, H., Jiang, W., Zhang, J., & Qin, H. (2022). Autonomous concrete crack semantic segmentation using deep fully convolutional encoder–decoder network in concrete structures inspection. Buildings, 12(11), Article 2019. https://doi.org/10.3390/Buildings12112019

Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., & Chen, L.-C. (2018). MobileNetV2: Inverted residuals and linear bottlenecks. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 4510–4520). IEEE. https://doi.org/10.1109/CVPR.2018.00474

Su, C., & Wang, W. (2020). Concrete cracks detection using convolutional neural network based on transfer learning. Mathematical Problems in Engineering, 2020(1), Article 7240129. https://doi.org/10.1155/2020/7240129

Su, H., Wang, X., Han, T., Wang, Z., Zhao, Z., & Zhang, P. (2022). Research on a U-Net bridge crack identification and feature-calculation methods based on a CBAM attention mechanism. Buildings, 12(10), Article 1561. https://doi.org/10.3390/buildings12101561

Sun, X., Xie, Y., Jiang, L., Cao, Y., & Liu, B. (2022). DMA-Net: DeepLab with multi-scale attention for pavement crack segmentation. IEEE Transactions on Intelligent Transportation Systems, 23(10), 18392–18403. https://doi.org/10.1109/TITS.2022.3158670

Szegedy, C., Ioffe, S., Vanhoucke, V., & Alemi, A. (2017). Inception-v4, Inception-ResNet and the impact of residual connections on learning. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1), 4278–4284. https://doi.org/10.1609/aaai.v31i1.11231

Talab, A. M. A., Huang, Z., Xi, F., & HaiMing, L. (2016). Detection crack in image using Otsu method and multiple filtering in image processing techniques. Optik, 127(3), 1030–1033. https://doi.org/10.1016/j.ijleo.2015.09.147

Xie, Z., Lu, Q., Guo, J., Lin, W., Ge, G., Tang, Y., Pasini, D., & Wang, W. (2024). Semantic segmentation for tooth cracks using improved DeepLabv3+ model. Heliyon, 10(4), Article e25892. https://doi.org/10.1016/j.heliyon.2024.e25892

Xu, G., Yue, Q., & Liu, X. (2023). Deep learning algorithm for real-time automatic crack detection, segmentation, qualification. Engineering Applications of Artificial Intelligence, 126(Part C), Article 107085. https://doi.org/10.1016/j.engappai.2023.107085

Yang, L., Huang, H., Kong, S., & Liu, Y. (2023). A deep segmentation network for crack detection with progressive and hierarchical context fusion. Journal of Building Engineering, 75, Article 106886. https://doi.org/10.1016/j.jobe.2023.106886

Zhou, Z., Zheng, Y., Zhang, J., & Yang, H. (2023). Fast detection algorithm for cracks on tunnel linings based on deep semantic segmentation. Frontiers of Structural and Civil Engineering, 17(5), 732–744. https://doi.org/10.1007/s11709-023-0965-y