Effects of dataset characteristics on the performance of fatigue detection for crane operators using hybrid deep neural networks

Liu, Pengkun; Chi, Hung Lin; Li, Xiao; Guo, Jingjing

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1016/j.autcon.2021.103901
Scopus: eid_2-s2.0-85114986468
WOS: WOS:000703892600002
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Civil Engineering: Journal/Magazine Articles

Article: Effects of dataset characteristics on the performance of fatigue detection for crane operators using hybrid deep neural networks

Title	Effects of dataset characteristics on the performance of fatigue detection for crane operators using hybrid deep neural networks
Authors	Liu, Pengkun Chi, Hung Lin Li, Xiao Guo, Jingjing
Keywords	Construction safety Convolutional neural network (CNN) Fatigue detection Long short-term memory network (LSTM) Multi-sources datasets Tower crane operator
Issue Date	2021
Citation	Automation in Construction, 2021, v. 132, article no. 103901 How to Cite? DOI: http://dx.doi.org/10.1016/j.autcon.2021.103901
Abstract	Fatigue of operators due to intensive workloads and long working time is a significant constraint that leads to inefficient crane operations and increased risk of safety issues. It can be potentially prevented through early warnings of fatigue for further appropriate work shift arrangements. Many deep neural networks have recently been developed for the fatigue detection of vehicle drivers through training and processing the facial image or video data from the public driver's datasets. However, these datasets are difficult to directly use for the fatigue detections under crane operation scenarios due to the variations of facial features and head movement patterns between crane operators and vehicle drivers. Furthermore, there is no representative and public dataset with the facial information of crane operators under construction scenarios. Therefore, this study aims to explore and analyse the features of multi-sources datasets and the corresponding data acquisition methods which are suitable for crane operators' fatigue detection, further providing collection guidelines of crane operators dataset. Variations on public datasets such as real or pretend facial expression, the segment level of human-verified labelling, camera positions, acquisition scenarios, and illumination conditions are analysed. A hybrid learning architecture is proposed by combining convolutional neural networks (CNN) and long short-term memory (LSTM) for fatigue detection. In order to establish a unified evaluation criterion, the effort of the study includes relabelling three public vehicle drivers datasets, NTHU-DDD, UTA-RLDD, and YawnDD, with human-verified labels at the frame and minute segment levels, and training the corresponding hybrid fatigue detection models accordingly. The average detection accuracies and losses are identified for the trained models of UTA-RLDD, NTHU-DDD, and YawnDD individually. The trained models are used to evaluate the fatigue status of facial videos from licensed crane operators under simulated crane operation scenarios. The results suggest the necessary considerations of different influential factors for establishing a large and public fatigue dataset for crane operators.
Persistent Identifier	http://hdl.handle.net/10722/326298
ISSN	0926-5805 2023 Impact Factor: 9.6 2023 SCImago Journal Rankings: 2.626
ISI Accession Number ID	WOS:000703892600002

DC Field	Value	Language
dc.contributor.author	Liu, Pengkun	-
dc.contributor.author	Chi, Hung Lin	-
dc.contributor.author	Li, Xiao	-
dc.contributor.author	Guo, Jingjing	-
dc.date.accessioned	2023-03-09T09:59:35Z	-
dc.date.available	2023-03-09T09:59:35Z	-
dc.date.issued	2021	-
dc.identifier.citation	Automation in Construction, 2021, v. 132, article no. 103901	-
dc.identifier.issn	0926-5805	-
dc.identifier.uri	http://hdl.handle.net/10722/326298	-
dc.description.abstract	Fatigue of operators due to intensive workloads and long working time is a significant constraint that leads to inefficient crane operations and increased risk of safety issues. It can be potentially prevented through early warnings of fatigue for further appropriate work shift arrangements. Many deep neural networks have recently been developed for the fatigue detection of vehicle drivers through training and processing the facial image or video data from the public driver's datasets. However, these datasets are difficult to directly use for the fatigue detections under crane operation scenarios due to the variations of facial features and head movement patterns between crane operators and vehicle drivers. Furthermore, there is no representative and public dataset with the facial information of crane operators under construction scenarios. Therefore, this study aims to explore and analyse the features of multi-sources datasets and the corresponding data acquisition methods which are suitable for crane operators' fatigue detection, further providing collection guidelines of crane operators dataset. Variations on public datasets such as real or pretend facial expression, the segment level of human-verified labelling, camera positions, acquisition scenarios, and illumination conditions are analysed. A hybrid learning architecture is proposed by combining convolutional neural networks (CNN) and long short-term memory (LSTM) for fatigue detection. In order to establish a unified evaluation criterion, the effort of the study includes relabelling three public vehicle drivers datasets, NTHU-DDD, UTA-RLDD, and YawnDD, with human-verified labels at the frame and minute segment levels, and training the corresponding hybrid fatigue detection models accordingly. The average detection accuracies and losses are identified for the trained models of UTA-RLDD, NTHU-DDD, and YawnDD individually. The trained models are used to evaluate the fatigue status of facial videos from licensed crane operators under simulated crane operation scenarios. The results suggest the necessary considerations of different influential factors for establishing a large and public fatigue dataset for crane operators.	-
dc.language	eng	-
dc.relation.ispartof	Automation in Construction	-
dc.subject	Construction safety	-
dc.subject	Convolutional neural network (CNN)	-
dc.subject	Fatigue detection	-
dc.subject	Long short-term memory network (LSTM)	-
dc.subject	Multi-sources datasets	-
dc.subject	Tower crane operator	-
dc.title	Effects of dataset characteristics on the performance of fatigue detection for crane operators using hybrid deep neural networks	-
dc.type	Article	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1016/j.autcon.2021.103901	-
dc.identifier.scopus	eid_2-s2.0-85114986468	-
dc.identifier.volume	132	-
dc.identifier.spage	article no. 103901	-
dc.identifier.epage	article no. 103901	-
dc.identifier.isi	WOS:000703892600002	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Effects of dataset characteristics on the performance of fatigue detection for crane operators using hybrid deep neural networks

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats