SV-RCNet: Workflow recognition from surgical videos using recurrent convolutional network

Jin, Yueming; Dou, Qi; Chen, Hao; Yu, Lequan; Qin, Jing; Fu, Chi Wing; Heng, Pheng Ann

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/TMI.2017.2787657
Scopus: eid_2-s2.0-85040080064
PMID: 29727275
WOS: WOS:000431544500004
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Statistics & Actuarial Science: Journal/Magazine Articles

Article: SV-RCNet: Workflow recognition from surgical videos using recurrent convolutional network

Title	SV-RCNet: Workflow recognition from surgical videos using recurrent convolutional network
Authors	Jin, Yueming Dou, Qi Chen, Hao Yu, Lequan Qin, Jing Fu, Chi Wing Heng, Pheng Ann
Keywords	long short-term memory joint learning of spatio-temporal features surgical workflow recognition very deep residual network Recurrent convolutional network
Issue Date	2018
Citation	IEEE Transactions on Medical Imaging, 2018, v. 37, n. 5, p. 1114-1126 How to Cite? DOI: http://dx.doi.org/10.1109/TMI.2017.2787657
Abstract	We propose an analysis of surgical videos that is based on a novel recurrent convolutional network (SV-RCNet), specifically for automatic workflow recognition from surgical videos online, which is a key component for developing the context-aware computer-assisted intervention systems. Different from previous methods which harness visual and temporal information separately, the proposed SV-RCNet seamlessly integrates a convolutional neural network (CNN) and a recurrent neural network (RNN) to form a novel recurrent convolutional architecture in order to take full advantages of the complementary information of visual and temporal features learned from surgical videos. We effectively train the SV-RCNet in an end-to-end manner so that the visual representations and sequential dynamics can be jointly optimized in the learning process. In order to produce more discriminative spatio-temporal features, we exploit a deep residual network (ResNet) and a long short term memory (LSTM) network, to extract visual features and temporal dependencies, respectively, and integrate them into the SV-RCNet. Moreover, based on the phase transition-sensitive predictions from the SV-RCNet, we propose a simple yet effective inference scheme, namely the prior knowledge inference (PKI), by leveraging the natural characteristic of surgical video. Such a strategy further improves the consistency of results and largely boosts the recognition performance. Extensive experiments have been conducted with the MICCAI 2016 Modeling and Monitoring of Computer Assisted Interventions Workflow Challenge dataset and Cholec80 dataset to validate SV-RCNet. Our approach not only achieves superior performance on these two datasets but also outperforms the state-of-the-art methods by a significant margin.
Persistent Identifier	http://hdl.handle.net/10722/299565
ISSN	0278-0062 2023 Impact Factor: 8.9 2023 SCImago Journal Rankings: 3.703
ISI Accession Number ID	WOS:000431544500004

DC Field	Value	Language
dc.contributor.author	Jin, Yueming	-
dc.contributor.author	Dou, Qi	-
dc.contributor.author	Chen, Hao	-
dc.contributor.author	Yu, Lequan	-
dc.contributor.author	Qin, Jing	-
dc.contributor.author	Fu, Chi Wing	-
dc.contributor.author	Heng, Pheng Ann	-
dc.date.accessioned	2021-05-21T03:34:41Z	-
dc.date.available	2021-05-21T03:34:41Z	-
dc.date.issued	2018	-
dc.identifier.citation	IEEE Transactions on Medical Imaging, 2018, v. 37, n. 5, p. 1114-1126	-
dc.identifier.issn	0278-0062	-
dc.identifier.uri	http://hdl.handle.net/10722/299565	-
dc.description.abstract	We propose an analysis of surgical videos that is based on a novel recurrent convolutional network (SV-RCNet), specifically for automatic workflow recognition from surgical videos online, which is a key component for developing the context-aware computer-assisted intervention systems. Different from previous methods which harness visual and temporal information separately, the proposed SV-RCNet seamlessly integrates a convolutional neural network (CNN) and a recurrent neural network (RNN) to form a novel recurrent convolutional architecture in order to take full advantages of the complementary information of visual and temporal features learned from surgical videos. We effectively train the SV-RCNet in an end-to-end manner so that the visual representations and sequential dynamics can be jointly optimized in the learning process. In order to produce more discriminative spatio-temporal features, we exploit a deep residual network (ResNet) and a long short term memory (LSTM) network, to extract visual features and temporal dependencies, respectively, and integrate them into the SV-RCNet. Moreover, based on the phase transition-sensitive predictions from the SV-RCNet, we propose a simple yet effective inference scheme, namely the prior knowledge inference (PKI), by leveraging the natural characteristic of surgical video. Such a strategy further improves the consistency of results and largely boosts the recognition performance. Extensive experiments have been conducted with the MICCAI 2016 Modeling and Monitoring of Computer Assisted Interventions Workflow Challenge dataset and Cholec80 dataset to validate SV-RCNet. Our approach not only achieves superior performance on these two datasets but also outperforms the state-of-the-art methods by a significant margin.	-
dc.language	eng	-
dc.relation.ispartof	IEEE Transactions on Medical Imaging	-
dc.subject	long short-term memory	-
dc.subject	joint learning of spatio-temporal features	-
dc.subject	surgical workflow recognition	-
dc.subject	very deep residual network	-
dc.subject	Recurrent convolutional network	-
dc.title	SV-RCNet: Workflow recognition from surgical videos using recurrent convolutional network	-
dc.type	Article	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1109/TMI.2017.2787657	-
dc.identifier.pmid	29727275	-
dc.identifier.scopus	eid_2-s2.0-85040080064	-
dc.identifier.volume	37	-
dc.identifier.issue	5	-
dc.identifier.spage	1114	-
dc.identifier.epage	1126	-
dc.identifier.eissn	1558-254X	-
dc.identifier.isi	WOS:000431544500004	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: SV-RCNet: Workflow recognition from surgical videos using recurrent convolutional network

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats