Open Access

Workers and Safety Helmets Detection in Day and Night Scenes based on improved YOLOv5

 and   
Jul 02, 2024

Cite
Download Cover

Li Z, Liu F, Yang W, et al. A survey of convolutional neural networks: analysis, applications, and prospects[J]. IEEE transactions on neural networks and learning systems, 2021. Search in Google Scholar

Akinosho T D, Oyedele L O, Bilal M, et al. Deep learning in the construction industry: A review of present status and future innovations[J]. Journal of Building Engineering, 2020, 32: 101827. Search in Google Scholar

Paneru S, Jeelani I. Computer vision applications in construction: Current state, opportunities & challenges[J]. Automation in Construction, 2021, 132: 103940. Search in Google Scholar

Zhou F, Zhao H, Nie Z. Safety helmet detection based on YOLOv5[C]//2021 IEEE International conference on power electronics, computer applications (ICPECA). IEEE, 2021: 6–11. Search in Google Scholar

Rebholz, F. E., Al-Kaisy, A. F., Nassar, K., Liu, L., Soibelman, L., El-Rayes, K., Bradley University. Dept. of Civil Engineering and Construction, & University of Illinois at Urbana-Champaign. Dept. of Civil and Environmental Engineering. (2004). Nighttime construction: Evaluation of construction operations : final report. (ITRC FR 00/01-5). https://rosap.ntl.bts.gov/view/dot/16121 Search in Google Scholar

Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2014: 580–587. Search in Google Scholar

Uijlings J R R, Van De Sande K E A, Gevers T, et al. Selective search for object recognition[J]. International journal of computer vision, 2013, 104(2): 154–171. Search in Google Scholar

Hearst M A, Dumais S T, Osuna E, et al. Support vector machines[J]. IEEE Intelligent Systems and their applications, 1998, 13(4): 18–28. Search in Google Scholar

He K, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE transactions on pattern analysis and machine intelligence, 2015, 37(9): 1904–1916. Search in Google Scholar

Lazebnik S, Schmid C, Ponce J. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories[C]//2006 IEEE computer society conference on computer vision and pattern recognition (CVPR’06). IEEE, 2006, 2: 2169–2178. Search in Google Scholar

Ren S, He K, Girshick R, et al. Faster r-cnn: Towards real-time object detection with region proposal networks[J]. Advances in neural information processing systems, 2015, 28. Search in Google Scholar

Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 779–788. Search in Google Scholar

Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International conference on machine learning (ICML), pp 448–456 Search in Google Scholar

Wagstaff K, Cardie C, Rogers S, Schr?dl S (2001) Constrained k-means clustering with background knowledge. In: International conference on machine learning (ICML), pp 577–584 Search in Google Scholar

He KM, Zhang XY, Ren SQ, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 770–778 Search in Google Scholar

Lin TY, Dollar P, Girshick R, He KM, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 936–944 Search in Google Scholar

Redmon J, Farhadi A. YOLO9000: better, faster, stronger[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 7263–7271. Search in Google Scholar

Redmon J, Farhadi A. Yolov3: An incremental improvement[J]. arXiv preprint arXiv:1804.02767, 2018. Search in Google Scholar

Bochkovskiy A, Wang C Y, Liao H Y M. Yolov4: Optimal speed and accuracy of object detection[J]. arXiv preprint arXiv:2004.10934, 2020. Search in Google Scholar

Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) SSD: single shot multibox detector. In: European conference on computer vision (ECCV), pp 21–37 Search in Google Scholar

Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[J]. arXiv preprint arXiv:1409.1556, 2014. Search in Google Scholar

Xiao Y, Tian Z, Yu J, et al. A review of object detection based on deep learning[J]. Multimedia Tools and Applications, 2020, 79(33): 23729–23791. Search in Google Scholar

Son H, Choi H, Seong H, et al. Detection of construction workers under varying poses and changing background in image sequences via very deep residual networks[J]. Automation in Construction, 2019, 99: 27–38. Search in Google Scholar

Kang K S, Cho Y W, Jin K H, et al. Application of one-stage instance segmentation with weather conditions in surveillance cameras at construction sites[J]. Automation in Construction, 2022, 133: 104034. Search in Google Scholar

He K, Gkioxari G, Dollár P, et al. Mask r-cnn[C]//Proceedings of the IEEE international conference on computer vision. 2017: 2961–2969. Search in Google Scholar

Xie Z, Liu H, Li Z, et al. A convolutional neural network based approach towards real-time hard hat detection[C]//2018 IEEE International Conference on Progress in Informatics and Computing (PIC). IEEE, 2018: 430–434. Search in Google Scholar

Shen J, Xiong X, Li Y, et al. Detecting safety helmet wearing on construction sites with bounding‐box regression and deep transfer learning[J]. Computer‐Aided Civil and Infrastructure Engineering, 2021, 36(2): 180–196. Search in Google Scholar

Wang L, Xie L, Yang P, et al. Hardhat-wearing detection based on a lightweight convolutional neural network with multi-scale features and a top-down module[J]. Sensors, 2020, 20(7): 1868. Search in Google Scholar

Zhang C, Tian Z, Song J, et al. Construction worker hardhat-wearing detection based on an improved BiFPN[C]//2020 25th International Conference on Pattern Recognition (ICPR). IEEE, 2021: 8600–8607. Search in Google Scholar

Tan M, Pang R, Le Q V. Efficientdet: Scalable and efficient object detection[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2020: 10781–10790. Search in Google Scholar

Wang W, Peng Y, Cao G, et al. Low-illumination image enhancement for night-time UAV pedestrian detection[J]. IEEE Transactions on Industrial Informatics, 2020, 17(8): 5208–5217. Search in Google Scholar

Liu S, Wu H, Rahman M A, et al. Enhancement of low illumination images based on an optimal hyperbolic tangent profile[J]. computers & electrical engineering, 2018, 70: 538–550. Search in Google Scholar

Danielyan A, Katkovnik V, Egiazarian K. BM3D frames and variational image deblurring[J]. IEEE Transactions on image processing, 2011, 21(4): 1715–1728. Search in Google Scholar

Xiao B, Lin Q, Chen Y. A vision-based method for automatic tracking of construction machines at nighttime based on deep learning illumination enhancement[J]. Automation in Construction, 2021, 127: 103721. Search in Google Scholar

Leung H K, Chen X Z, Yu C W, et al. A deep-learning-based vehicle detection approach for insufficient and nighttime illumination conditions[J]. Applied Sciences, 2019, 9(22): 4769. Search in Google Scholar

Kang K S, Cho Y W, Jin K H, et al. Application of one-stage instance segmentation with weather conditions in surveillance cameras at construction sites[J]. Automation in Construction, 2022, 133: 104034. Search in Google Scholar

Jhong S Y, Chen Y Y, Hsia C H, et al. Nighttime object detection system with lightweight deep network for internet of vehicles[J]. Journal of Real-Time Image Processing, 2021, 18(4): 1141–1155. Search in Google Scholar

Chen Y, Shin H. Pedestrian detection at night in infrared images using an attention-guided encoder-decoder convolutional neural network[J]. Applied Sciences, 2020, 10(3): 809. Search in Google Scholar

Xiao B, Kang S C. Development of an image data set of construction machines for deep learning object detection[J]. Journal of Computing in Civil Engineering, 2021, 35(2): 05020005. Search in Google Scholar

GitHub. YOLOv5-Master. 2021. Available online: https://github.com/ultralytics/yolov5. Search in Google Scholar

Zheng Z, Wang P, Liu W, et al. Distance-IoU loss: Faster and better learning for bounding box regression[C]//Proceedings of the AAAI conference on artificial intelligence. 2020, 34(07): 12993–13000. Search in Google Scholar

Zhang Y F, Ren W, Zhang Z, et al. Focal and efficient IOU loss for accurate bounding box regression[J]. Neurocomputing, 2022, 506: 146–157. Search in Google Scholar

Rosenfeld A, Thurston M. Edge and curve detection for visual scene analysis[J]. IEEE Transactions on computers, 1971, 100(5): 562–569. Search in Google Scholar

Bodla N, Singh B, Chellappa R, et al. Soft-NMS--improving object detection with one line of code[C]//Proceedings of the IEEE international conference on computer vision. 2017: 5561–5569. Search in Google Scholar

Borji A, Itti L. State-of-the-art in visual attention modeling[J]. IEEE transactions on pattern analysis and machine intelligence, 2012, 35(1): 185–207. Search in Google Scholar

Mnih V, Heess N, Graves A. Recurrent models of visual attention[J]. Advances in neural information processing systems, 2014, 27. Search in Google Scholar

Li X, Wang W, Hu X, et al. Selective kernel networks[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2019: 510–519. Search in Google Scholar

Wang X, Girshick R, Gupta A, et al. Non-local neural networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 7794–7803. Search in Google Scholar

Lin Z, Feng M, Santos C N, et al. A structured self-attentive sentence embedding[J]. arXiv preprint arXiv:1703.03130, 2017. Search in Google Scholar

Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 7132–7141. Search in Google Scholar

MacQueen J. Classification and analysis of multivariate observations[C]//5th Berkeley Symp. Math. Statist. Probability. Los Angeles LA USA: University of California, 1967: 281–297. Search in Google Scholar

Arthur D, Vassilvitskii S. K-means++ the advantages of careful seeding[C]//Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms. 2007: 1027–1035. Search in Google Scholar

Halevy A, Norvig P, Pereira F. The unreasonable effectiveness of data[J]. IEEE intelligent systems, 2009, 24(2): 8–12. Search in Google Scholar

Joffrey L, Taghi M K, Richard B. A survey on addressing high-class imbalance in big data. J Big Data (2018) 5: 42[J]. Search in Google Scholar

Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks. 2012 Advances in Neural Information Processing Systems (NIPS)[J]. Neural Information Processing Systems Foundation, La Jolla, CA, 2012. Search in Google Scholar

Shorten C, Khoshgoftaar T M. A survey on image data augmentation for deep learning[J]. Journal of big data, 2019, 6(1): 1–48. Search in Google Scholar

Kang G, Dong X, Zheng L, et al. Patchshuffle regularization[J]. arXiv preprint arXiv:1707.07103, 2017. Search in Google Scholar

Zhong Z, Zheng L, Kang G, et al. Random erasing data augmentation[C]//Proceedings of the AAAI conference on artificial intelligence. 2020, 34(07): 13001–13008. Search in Google Scholar

DeVries T, Taylor G W. Dataset augmentation in feature space[J]. arXiv preprint arXiv:1702.05538, 2017. Search in Google Scholar

Goodfellow I J, Shlens J, Szegedy C. Explaining and harnessing adversarial examples[J]. arXiv preprint arXiv:1412.6572, 2014. Search in Google Scholar

Goodfellow I, Pouget-Abadie J, Mirza M, et al. Generative adversarial networks[J]. Communications of the ACM, 2020, 63(11): 139–144. Search in Google Scholar

Zhang D, Han J, Cheng G, et al. Weakly supervised object localization and detection: A survey[J]. IEEE transactions on pattern analysis and machine intelligence, 2021, 44(9): 5866–5885. Search in Google Scholar

Language:
English