Comparison of DeepLab v3+’s base networks for crack segmentation under limited resources
Main Article Content
Abstract
Accurate crack segmentation plays a crucial role in infrastructure assessment and preventive maintenance. This research explored the crack segmentation efficacy of DeepLab v3+, a modern and advanced semantic segmentation network with a high performance and reduced computational cost. The performance comparison was investigated of DeepLab v3+ with different base networks, including Inception-ResNet-v2, Xception, ResNet-50, MobileNet-v2, and ResNet-18. The objective of this paper was to recommend the base network and its optimizer of DeepLab v3+ architecture in terms of crack segmentation of structure for structural health assessment and monitoring under limited resources. The optimizer algorithm, mini-batch size, learning rate, and squared gradient decay factor were adjusted to obtain the best model for each base network considering limited resources of graphics processing unit (GPU) for model training. The best results were analyzed in terms of mean accuracy, class accuracy, and weighted IoU whilst taking the model size into account. The recommended models ranked from the most accurate to the smallest in size are DeepLab v3+ network based on ResNet-50 with Adam optimizer, Xception with RMSProp optimizer, ResNet-18 with SGDM optimizer, and MobileNet-v2 with RMSProp optimizer, respectively. The findings assist in choosing a suitable network architecture for specific applications considering the compromise between model size and performance. The results also highlight the feasibility of the network architecture with tested conditions in terms of structural crack segmentation under limited computational resources.
Downloads
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
References
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., & Adam, H. (2018). Encoder-decoder with atrous separable convolution for semantic image segmentation. In V. Ferrari, M. Hebert, C. Sminchisescu, & Y. Weiss (Eds.), Computer vision – ECCV 2018. Lecture note in computer science, volume 11211 (pp. 833–851). Springer. https://doi.org/10.1007/978-3-030-01234-2_49
Chollet, F. (2017). Xception: Deep learning with depthwise separable convolutions. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 1800–1807). IEEE. https://doi.org/10.1109/CVPR.2017.195
Cui, X., Wang, Q., Dai, J., Li, S., Xie, C., & Wang, J. (2022). Pixel-level intelligent recognition of concrete cracks based on DRACNN. Materials Letters, 306, Article 130867. https://doi.org/10.1016/j.matlet.2021.130867
Du, J. (2019). The frontier of SGD and its variants in machine learning. Journal of Physics: Conference Series, 1229, Article 012046. https://doi.org/10.1088/1742-6596/1229/1/012046
Fu, H., Meng, D., Li, W., & Wang, Y. (2021). Bridge crack semantic segmentation based on improved Deeplabv3+. Journal of Marine Science and Engineering, 9(6), Article 671. https://doi.org/10.3390/jmse9060671
Han, C., Ma, T., Huyan, J., Huang, X., & Zhang, Y. (2022). CrackW-Net: A novel pavement crack image segmentation convolutional neural network. IEEE Transactions on Intelligent Transportation Systems, 23(11), 22135–22144. https://doi.org/10.1109/TITS.2021.3095507
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 770–778). IEEE. https://doi.org/10.1109/CVPR.2016.90
Kheradmandi, N., & Mehranfar, V. (2022). A critical review and comparative study on image segmentation-based techniques for pavement crack detection. Construction and Building Materials, 321, Article 126162. https://doi.org/10.1016/j.conbuildmat.2021.126162
Kim, J., Shim, S., Cha, Y., & Cho, G.-C. (2021). Lightweight pixel-wise segmentation for efficient concrete crack detection using hierarchical convolutional neural network. Smart Materials and Structures, 30(4), Article 045023. https://doi.org/10.1088/1361-665X/abea1e
Lau, S. L. H., Chong, E. K. P., Yang, X., & Wang, X. (2020). Automated pavement crack segmentation using U-Net-based convolutional neural network. IEEE Access, 8, 114892–114899. https://doi.org/10.1109/ACCESS.2020.3003638
Li, H., Wang, W., Wang, M., Li, L., & Vimlund, V. (2022). A review of deep learning methods for pixel-level crack detection. Journal of Traffic and Transportation Engineering (English Edition), 9(6), 945–968. https://doi.org/10.1016/j.jtte.2022.11.003
Liu, J., Yang, X., Lau, S., Wang, X., Luo, S., Lee, V. C.-S., & Ding, L. (2020). Automated pavement crack detection and segmentation based on two-step convolutional neural network. Computer-Aided Civil and Infrastructure Engineering, 35(11), 1291–1305. https://doi.org/10.1111/mice.12622
Liu, Y., Yao, J., Lu, X., Xie, R., & Li, L. (2019). DeepCrack: A deep hierarchical feature learning architecture for crack segmentation. Neurocomputing, 338, 139–153. https://doi.org/10.1016/j.neucom.2019.01.036
Mei, Q., & Gül, M. (2020). A cost effective solution for pavement crack inspection using cameras and deep neural networks. Construction and Building Materials, 256, Article 119397. https://doi.org/10.1016/j.conbuildmat.2020.119397
Mohan, A., & Poobal, S. (2018). Crack detection using image processing: A critical review and analysis. Alexandria Engineering Journal, 57(2), 787–798. https://doi.org/10.1016/j.aej.2017.01.020
Nguyen, N. H. T., Perry, S., Bone, D., Le, H. T., & Nguyen, T. T. (2021). Two-stage convolutional neural network for road crack detection and segmentation. Expert Systems with Applications, 186, Article 115718. https://doi.org/10.1016/j.eswa.2021.115718
Nguyen, T.-G., Do, T.-L., Nguyen, T.-N., & Nguyen, N.-N. (2024). Semantic segmentation of cracks using DeepLabv3+. In J. N. Reddy, C. M. Wang, V. H. Luong, & A. T. Le (Eds.), Proceedings of the Third International Conference on Sustainable Civil Engineering and Architecture (ICSCEA 2023) (pp. 1539–1546). Springer. https://doi.org/10.1007/978-981-99-7434-4_165
Pu, R., Ren, G., Li, H., Jiang, W., Zhang, J., & Qin, H. (2022). Autonomous concrete crack semantic segmentation using deep fully convolutional encoder–decoder network in concrete structures inspection. Buildings, 12(11), Article 2019. https://doi.org/10.3390/Buildings12112019
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., & Chen, L.-C. (2018). MobileNetV2: Inverted residuals and linear bottlenecks. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 4510–4520). IEEE. https://doi.org/10.1109/CVPR.2018.00474
Su, C., & Wang, W. (2020). Concrete cracks detection using convolutional neural network based on transfer learning. Mathematical Problems in Engineering, 2020(1), Article 7240129. https://doi.org/10.1155/2020/7240129
Su, H., Wang, X., Han, T., Wang, Z., Zhao, Z., & Zhang, P. (2022). Research on a U-Net bridge crack identification and feature-calculation methods based on a CBAM attention mechanism. Buildings, 12(10), Article 1561. https://doi.org/10.3390/buildings12101561
Sun, X., Xie, Y., Jiang, L., Cao, Y., & Liu, B. (2022). DMA-Net: DeepLab with multi-scale attention for pavement crack segmentation. IEEE Transactions on Intelligent Transportation Systems, 23(10), 18392–18403. https://doi.org/10.1109/TITS.2022.3158670
Szegedy, C., Ioffe, S., Vanhoucke, V., & Alemi, A. (2017). Inception-v4, Inception-ResNet and the impact of residual connections on learning. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1), 4278–4284. https://doi.org/10.1609/aaai.v31i1.11231
Talab, A. M. A., Huang, Z., Xi, F., & HaiMing, L. (2016). Detection crack in image using Otsu method and multiple filtering in image processing techniques. Optik, 127(3), 1030–1033. https://doi.org/10.1016/j.ijleo.2015.09.147
Xie, Z., Lu, Q., Guo, J., Lin, W., Ge, G., Tang, Y., Pasini, D., & Wang, W. (2024). Semantic segmentation for tooth cracks using improved DeepLabv3+ model. Heliyon, 10(4), Article e25892. https://doi.org/10.1016/j.heliyon.2024.e25892
Xu, G., Yue, Q., & Liu, X. (2023). Deep learning algorithm for real-time automatic crack detection, segmentation, qualification. Engineering Applications of Artificial Intelligence, 126(Part C), Article 107085. https://doi.org/10.1016/j.engappai.2023.107085
Yang, L., Huang, H., Kong, S., & Liu, Y. (2023). A deep segmentation network for crack detection with progressive and hierarchical context fusion. Journal of Building Engineering, 75, Article 106886. https://doi.org/10.1016/j.jobe.2023.106886
Zhou, Z., Zheng, Y., Zhang, J., & Yang, H. (2023). Fast detection algorithm for cracks on tunnel linings based on deep semantic segmentation. Frontiers of Structural and Civil Engineering, 17(5), 732–744. https://doi.org/10.1007/s11709-023-0965-y