Open Access

Workers and Safety Helmets Detection in Day and Night Scenes based on improved YOLOv5


Cite

Li Z, Liu F, Yang W, et al. A survey of convolutional neural networks: analysis, applications, and prospects[J]. IEEE transactions on neural networks and learning systems, 2021. Search in Google Scholar

Akinosho T D, Oyedele L O, Bilal M, et al. Deep learning in the construction industry: A review of present status and future innovations[J]. Journal of Building Engineering, 2020, 32: 101827. Search in Google Scholar

Paneru S, Jeelani I. Computer vision applications in construction: Current state, opportunities & challenges[J]. Automation in Construction, 2021, 132: 103940. Search in Google Scholar

Zhou F, Zhao H, Nie Z. Safety helmet detection based on YOLOv5[C]//2021 IEEE International conference on power electronics, computer applications (ICPECA). IEEE, 2021: 6–11. Search in Google Scholar

Rebholz, F. E., Al-Kaisy, A. F., Nassar, K., Liu, L., Soibelman, L., El-Rayes, K., Bradley University. Dept. of Civil Engineering and Construction, & University of Illinois at Urbana-Champaign. Dept. of Civil and Environmental Engineering. (2004). Nighttime construction: Evaluation of construction operations : final report. (ITRC FR 00/01-5). https://rosap.ntl.bts.gov/view/dot/16121 Search in Google Scholar

Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2014: 580–587. Search in Google Scholar

Uijlings J R R, Van De Sande K E A, Gevers T, et al. Selective search for object recognition[J]. International journal of computer vision, 2013, 104(2): 154–171. Search in Google Scholar

Hearst M A, Dumais S T, Osuna E, et al. Support vector machines[J]. IEEE Intelligent Systems and their applications, 1998, 13(4): 18–28. Search in Google Scholar

He K, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE transactions on pattern analysis and machine intelligence, 2015, 37(9): 1904–1916. Search in Google Scholar

Lazebnik S, Schmid C, Ponce J. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories[C]//2006 IEEE computer society conference on computer vision and pattern recognition (CVPR’06). IEEE, 2006, 2: 2169–2178. Search in Google Scholar

Ren S, He K, Girshick R, et al. Faster r-cnn: Towards real-time object detection with region proposal networks[J]. Advances in neural information processing systems, 2015, 28. Search in Google Scholar

Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 779–788. Search in Google Scholar

Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International conference on machine learning (ICML), pp 448–456 Search in Google Scholar

Wagstaff K, Cardie C, Rogers S, Schr?dl S (2001) Constrained k-means clustering with background knowledge. In: International conference on machine learning (ICML), pp 577–584 Search in Google Scholar

He KM, Zhang XY, Ren SQ, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 770–778 Search in Google Scholar

Lin TY, Dollar P, Girshick R, He KM, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 936–944 Search in Google Scholar

Redmon J, Farhadi A. YOLO9000: better, faster, stronger[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 7263–7271. Search in Google Scholar

Redmon J, Farhadi A. Yolov3: An incremental improvement[J]. arXiv preprint arXiv:1804.02767, 2018. Search in Google Scholar

Bochkovskiy A, Wang C Y, Liao H Y M. Yolov4: Optimal speed and accuracy of object detection[J]. arXiv preprint arXiv:2004.10934, 2020. Search in Google Scholar

Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) SSD: single shot multibox detector. In: European conference on computer vision (ECCV), pp 21–37 Search in Google Scholar

Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[J]. arXiv preprint arXiv:1409.1556, 2014. Search in Google Scholar

Xiao Y, Tian Z, Yu J, et al. A review of object detection based on deep learning[J]. Multimedia Tools and Applications, 2020, 79(33): 23729–23791. Search in Google Scholar

Son H, Choi H, Seong H, et al. Detection of construction workers under varying poses and changing background in image sequences via very deep residual networks[J]. Automation in Construction, 2019, 99: 27–38. Search in Google Scholar

Kang K S, Cho Y W, Jin K H, et al. Application of one-stage instance segmentation with weather conditions in surveillance cameras at construction sites[J]. Automation in Construction, 2022, 133: 104034. Search in Google Scholar

He K, Gkioxari G, Dollár P, et al. Mask r-cnn[C]//Proceedings of the IEEE international conference on computer vision. 2017: 2961–2969. Search in Google Scholar

Xie Z, Liu H, Li Z, et al. A convolutional neural network based approach towards real-time hard hat detection[C]//2018 IEEE International Conference on Progress in Informatics and Computing (PIC). IEEE, 2018: 430–434. Search in Google Scholar

Shen J, Xiong X, Li Y, et al. Detecting safety helmet wearing on construction sites with bounding‐box regression and deep transfer learning[J]. Computer‐Aided Civil and Infrastructure Engineering, 2021, 36(2): 180–196. Search in Google Scholar

Wang L, Xie L, Yang P, et al. Hardhat-wearing detection based on a lightweight convolutional neural network with multi-scale features and a top-down module[J]. Sensors, 2020, 20(7): 1868. Search in Google Scholar

Zhang C, Tian Z, Song J, et al. Construction worker hardhat-wearing detection based on an improved BiFPN[C]//2020 25th International Conference on Pattern Recognition (ICPR). IEEE, 2021: 8600–8607. Search in Google Scholar

Tan M, Pang R, Le Q V. Efficientdet: Scalable and efficient object detection[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2020: 10781–10790. Search in Google Scholar

Wang W, Peng Y, Cao G, et al. Low-illumination image enhancement for night-time UAV pedestrian detection[J]. IEEE Transactions on Industrial Informatics, 2020, 17(8): 5208–5217. Search in Google Scholar

Liu S, Wu H, Rahman M A, et al. Enhancement of low illumination images based on an optimal hyperbolic tangent profile[J]. computers & electrical engineering, 2018, 70: 538–550. Search in Google Scholar

Danielyan A, Katkovnik V, Egiazarian K. BM3D frames and variational image deblurring[J]. IEEE Transactions on image processing, 2011, 21(4): 1715–1728. Search in Google Scholar

Xiao B, Lin Q, Chen Y. A vision-based method for automatic tracking of construction machines at nighttime based on deep learning illumination enhancement[J]. Automation in Construction, 2021, 127: 103721. Search in Google Scholar

Leung H K, Chen X Z, Yu C W, et al. A deep-learning-based vehicle detection approach for insufficient and nighttime illumination conditions[J]. Applied Sciences, 2019, 9(22): 4769. Search in Google Scholar

Kang K S, Cho Y W, Jin K H, et al. Application of one-stage instance segmentation with weather conditions in surveillance cameras at construction sites[J]. Automation in Construction, 2022, 133: 104034. Search in Google Scholar

Jhong S Y, Chen Y Y, Hsia C H, et al. Nighttime object detection system with lightweight deep network for internet of vehicles[J]. Journal of Real-Time Image Processing, 2021, 18(4): 1141–1155. Search in Google Scholar

Chen Y, Shin H. Pedestrian detection at night in infrared images using an attention-guided encoder-decoder convolutional neural network[J]. Applied Sciences, 2020, 10(3): 809. Search in Google Scholar

Xiao B, Kang S C. Development of an image data set of construction machines for deep learning object detection[J]. Journal of Computing in Civil Engineering, 2021, 35(2): 05020005. Search in Google Scholar

GitHub. YOLOv5-Master. 2021. Available online: https://github.com/ultralytics/yolov5. Search in Google Scholar

Zheng Z, Wang P, Liu W, et al. Distance-IoU loss: Faster and better learning for bounding box regression[C]//Proceedings of the AAAI conference on artificial intelligence. 2020, 34(07): 12993–13000. Search in Google Scholar

Zhang Y F, Ren W, Zhang Z, et al. Focal and efficient IOU loss for accurate bounding box regression[J]. Neurocomputing, 2022, 506: 146–157. Search in Google Scholar

Rosenfeld A, Thurston M. Edge and curve detection for visual scene analysis[J]. IEEE Transactions on computers, 1971, 100(5): 562–569. Search in Google Scholar

Bodla N, Singh B, Chellappa R, et al. Soft-NMS--improving object detection with one line of code[C]//Proceedings of the IEEE international conference on computer vision. 2017: 5561–5569. Search in Google Scholar

Borji A, Itti L. State-of-the-art in visual attention modeling[J]. IEEE transactions on pattern analysis and machine intelligence, 2012, 35(1): 185–207. Search in Google Scholar

Mnih V, Heess N, Graves A. Recurrent models of visual attention[J]. Advances in neural information processing systems, 2014, 27. Search in Google Scholar

Li X, Wang W, Hu X, et al. Selective kernel networks[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2019: 510–519. Search in Google Scholar

Wang X, Girshick R, Gupta A, et al. Non-local neural networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 7794–7803. Search in Google Scholar

Lin Z, Feng M, Santos C N, et al. A structured self-attentive sentence embedding[J]. arXiv preprint arXiv:1703.03130, 2017. Search in Google Scholar

Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 7132–7141. Search in Google Scholar

MacQueen J. Classification and analysis of multivariate observations[C]//5th Berkeley Symp. Math. Statist. Probability. Los Angeles LA USA: University of California, 1967: 281–297. Search in Google Scholar

Arthur D, Vassilvitskii S. K-means++ the advantages of careful seeding[C]//Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms. 2007: 1027–1035. Search in Google Scholar

Halevy A, Norvig P, Pereira F. The unreasonable effectiveness of data[J]. IEEE intelligent systems, 2009, 24(2): 8–12. Search in Google Scholar

Joffrey L, Taghi M K, Richard B. A survey on addressing high-class imbalance in big data. J Big Data (2018) 5: 42[J]. Search in Google Scholar

Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks. 2012 Advances in Neural Information Processing Systems (NIPS)[J]. Neural Information Processing Systems Foundation, La Jolla, CA, 2012. Search in Google Scholar

Shorten C, Khoshgoftaar T M. A survey on image data augmentation for deep learning[J]. Journal of big data, 2019, 6(1): 1–48. Search in Google Scholar

Kang G, Dong X, Zheng L, et al. Patchshuffle regularization[J]. arXiv preprint arXiv:1707.07103, 2017. Search in Google Scholar

Zhong Z, Zheng L, Kang G, et al. Random erasing data augmentation[C]//Proceedings of the AAAI conference on artificial intelligence. 2020, 34(07): 13001–13008. Search in Google Scholar

DeVries T, Taylor G W. Dataset augmentation in feature space[J]. arXiv preprint arXiv:1702.05538, 2017. Search in Google Scholar

Goodfellow I J, Shlens J, Szegedy C. Explaining and harnessing adversarial examples[J]. arXiv preprint arXiv:1412.6572, 2014. Search in Google Scholar

Goodfellow I, Pouget-Abadie J, Mirza M, et al. Generative adversarial networks[J]. Communications of the ACM, 2020, 63(11): 139–144. Search in Google Scholar

Zhang D, Han J, Cheng G, et al. Weakly supervised object localization and detection: A survey[J]. IEEE transactions on pattern analysis and machine intelligence, 2021, 44(9): 5866–5885. Search in Google Scholar

eISSN:
2444-8656
Language:
English
Publication timeframe:
Volume Open
Journal Subjects:
Life Sciences, other, Mathematics, Applied Mathematics, General Mathematics, Physics