File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Effects of dataset characteristics on the performance of fatigue detection for crane operators using hybrid deep neural networks

TitleEffects of dataset characteristics on the performance of fatigue detection for crane operators using hybrid deep neural networks
Authors
KeywordsConstruction safety
Convolutional neural network (CNN)
Fatigue detection
Long short-term memory network (LSTM)
Multi-sources datasets
Tower crane operator
Issue Date2021
Citation
Automation in Construction, 2021, v. 132, article no. 103901 How to Cite?
AbstractFatigue of operators due to intensive workloads and long working time is a significant constraint that leads to inefficient crane operations and increased risk of safety issues. It can be potentially prevented through early warnings of fatigue for further appropriate work shift arrangements. Many deep neural networks have recently been developed for the fatigue detection of vehicle drivers through training and processing the facial image or video data from the public driver's datasets. However, these datasets are difficult to directly use for the fatigue detections under crane operation scenarios due to the variations of facial features and head movement patterns between crane operators and vehicle drivers. Furthermore, there is no representative and public dataset with the facial information of crane operators under construction scenarios. Therefore, this study aims to explore and analyse the features of multi-sources datasets and the corresponding data acquisition methods which are suitable for crane operators' fatigue detection, further providing collection guidelines of crane operators dataset. Variations on public datasets such as real or pretend facial expression, the segment level of human-verified labelling, camera positions, acquisition scenarios, and illumination conditions are analysed. A hybrid learning architecture is proposed by combining convolutional neural networks (CNN) and long short-term memory (LSTM) for fatigue detection. In order to establish a unified evaluation criterion, the effort of the study includes relabelling three public vehicle drivers datasets, NTHU-DDD, UTA-RLDD, and YawnDD, with human-verified labels at the frame and minute segment levels, and training the corresponding hybrid fatigue detection models accordingly. The average detection accuracies and losses are identified for the trained models of UTA-RLDD, NTHU-DDD, and YawnDD individually. The trained models are used to evaluate the fatigue status of facial videos from licensed crane operators under simulated crane operation scenarios. The results suggest the necessary considerations of different influential factors for establishing a large and public fatigue dataset for crane operators.
Persistent Identifierhttp://hdl.handle.net/10722/326298
ISSN
2023 Impact Factor: 9.6
2023 SCImago Journal Rankings: 2.626
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorLiu, Pengkun-
dc.contributor.authorChi, Hung Lin-
dc.contributor.authorLi, Xiao-
dc.contributor.authorGuo, Jingjing-
dc.date.accessioned2023-03-09T09:59:35Z-
dc.date.available2023-03-09T09:59:35Z-
dc.date.issued2021-
dc.identifier.citationAutomation in Construction, 2021, v. 132, article no. 103901-
dc.identifier.issn0926-5805-
dc.identifier.urihttp://hdl.handle.net/10722/326298-
dc.description.abstractFatigue of operators due to intensive workloads and long working time is a significant constraint that leads to inefficient crane operations and increased risk of safety issues. It can be potentially prevented through early warnings of fatigue for further appropriate work shift arrangements. Many deep neural networks have recently been developed for the fatigue detection of vehicle drivers through training and processing the facial image or video data from the public driver's datasets. However, these datasets are difficult to directly use for the fatigue detections under crane operation scenarios due to the variations of facial features and head movement patterns between crane operators and vehicle drivers. Furthermore, there is no representative and public dataset with the facial information of crane operators under construction scenarios. Therefore, this study aims to explore and analyse the features of multi-sources datasets and the corresponding data acquisition methods which are suitable for crane operators' fatigue detection, further providing collection guidelines of crane operators dataset. Variations on public datasets such as real or pretend facial expression, the segment level of human-verified labelling, camera positions, acquisition scenarios, and illumination conditions are analysed. A hybrid learning architecture is proposed by combining convolutional neural networks (CNN) and long short-term memory (LSTM) for fatigue detection. In order to establish a unified evaluation criterion, the effort of the study includes relabelling three public vehicle drivers datasets, NTHU-DDD, UTA-RLDD, and YawnDD, with human-verified labels at the frame and minute segment levels, and training the corresponding hybrid fatigue detection models accordingly. The average detection accuracies and losses are identified for the trained models of UTA-RLDD, NTHU-DDD, and YawnDD individually. The trained models are used to evaluate the fatigue status of facial videos from licensed crane operators under simulated crane operation scenarios. The results suggest the necessary considerations of different influential factors for establishing a large and public fatigue dataset for crane operators.-
dc.languageeng-
dc.relation.ispartofAutomation in Construction-
dc.subjectConstruction safety-
dc.subjectConvolutional neural network (CNN)-
dc.subjectFatigue detection-
dc.subjectLong short-term memory network (LSTM)-
dc.subjectMulti-sources datasets-
dc.subjectTower crane operator-
dc.titleEffects of dataset characteristics on the performance of fatigue detection for crane operators using hybrid deep neural networks-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1016/j.autcon.2021.103901-
dc.identifier.scopuseid_2-s2.0-85114986468-
dc.identifier.volume132-
dc.identifier.spagearticle no. 103901-
dc.identifier.epagearticle no. 103901-
dc.identifier.isiWOS:000703892600002-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats