File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Temporal Memory Relation Network for Workflow Recognition from Surgical Video

TitleTemporal Memory Relation Network for Workflow Recognition from Surgical Video
Authors
Keywordslong-range memory clue
multi-scale temporal convolution
non-local operation
Surgical workflow recognition
Issue Date2021
Citation
IEEE Transactions on Medical Imaging, 2021, v. 40, n. 7, p. 1911-1923 How to Cite?
AbstractAutomatic surgical workflow recognition is a key component for developing context-aware computer-assisted systems in the operating theatre. Previous works either jointly modeled the spatial features with short fixed-range temporal information, or separately learned visual and long temporal cues. In this paper, we propose a novel end-to-end temporal memory relation network (TMRNet) for relating long-range and multi-scale temporal patterns to augment the present features. We establish a long-range memory bank to serve as a memory cell storing the rich supportive information. Through our designed temporal variation layer, the supportive cues are further enhanced by multi-scale temporal-only convolutions. To effectively incorporate the two types of cues without disturbing the joint learning of spatio-temporal features, we introduce a non-local bank operator to attentively relate the past to the present. In this regard, our TMRNet enables the current feature to view the long-range temporal dependency, as well as tolerate complex temporal extents. We have extensively validated our approach on two benchmark surgical video datasets, M2CAI challenge dataset and Cholec80 dataset. Experimental results demonstrate the outstanding performance of our method, consistently exceeding the state-of-the-art methods by a large margin (e.g., 67.0% v.s. 78.9% Jaccard on Cholec80 dataset).
Persistent Identifierhttp://hdl.handle.net/10722/349552
ISSN
2023 Impact Factor: 8.9
2023 SCImago Journal Rankings: 3.703
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorJin, Yueming-
dc.contributor.authorLong, Yonghao-
dc.contributor.authorChen, Cheng-
dc.contributor.authorZhao, Zixu-
dc.contributor.authorDou, Qi-
dc.contributor.authorHeng, Pheng Ann-
dc.date.accessioned2024-10-17T06:59:17Z-
dc.date.available2024-10-17T06:59:17Z-
dc.date.issued2021-
dc.identifier.citationIEEE Transactions on Medical Imaging, 2021, v. 40, n. 7, p. 1911-1923-
dc.identifier.issn0278-0062-
dc.identifier.urihttp://hdl.handle.net/10722/349552-
dc.description.abstractAutomatic surgical workflow recognition is a key component for developing context-aware computer-assisted systems in the operating theatre. Previous works either jointly modeled the spatial features with short fixed-range temporal information, or separately learned visual and long temporal cues. In this paper, we propose a novel end-to-end temporal memory relation network (TMRNet) for relating long-range and multi-scale temporal patterns to augment the present features. We establish a long-range memory bank to serve as a memory cell storing the rich supportive information. Through our designed temporal variation layer, the supportive cues are further enhanced by multi-scale temporal-only convolutions. To effectively incorporate the two types of cues without disturbing the joint learning of spatio-temporal features, we introduce a non-local bank operator to attentively relate the past to the present. In this regard, our TMRNet enables the current feature to view the long-range temporal dependency, as well as tolerate complex temporal extents. We have extensively validated our approach on two benchmark surgical video datasets, M2CAI challenge dataset and Cholec80 dataset. Experimental results demonstrate the outstanding performance of our method, consistently exceeding the state-of-the-art methods by a large margin (e.g., 67.0% v.s. 78.9% Jaccard on Cholec80 dataset).-
dc.languageeng-
dc.relation.ispartofIEEE Transactions on Medical Imaging-
dc.subjectlong-range memory clue-
dc.subjectmulti-scale temporal convolution-
dc.subjectnon-local operation-
dc.subjectSurgical workflow recognition-
dc.titleTemporal Memory Relation Network for Workflow Recognition from Surgical Video-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/TMI.2021.3069471-
dc.identifier.pmid33780335-
dc.identifier.scopuseid_2-s2.0-85103759454-
dc.identifier.volume40-
dc.identifier.issue7-
dc.identifier.spage1911-
dc.identifier.epage1923-
dc.identifier.eissn1558-254X-
dc.identifier.isiWOS:000668842500015-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats