TY - JOUR
T1 - ARST
T2 - auto-regressive surgical transformer for phase recognition from laparoscopic videos
AU - Zou, Xiaoyang
AU - Liu, Wenyong
AU - Wang, Junchen
AU - Tao, Rong
AU - Zheng, Guoyan
N1 - Publisher Copyright:
© 2022 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2023
Y1 - 2023
N2 - Phase recognition plays an essential role for surgical workflow analysis in computer assisted intervention. Transformer, originally proposed for sequential data modelling in natural language processing, has been successfully applied to surgical phase recognition. Existing works based on transformer mainly focus on modeling attention dependency, without introducing auto-regression. In this work, an Auto-Regressive Surgical Transformer, referred as ARST, is first proposed for on-line surgical phase recognition from laparoscopic videos, modeling the inter-phase correlation implicitly by conditional probability distribution. To reduce inference bias and to enhance phase consistency, we further develop a consistency constraint inference strategy based on auto-regression. We conduct comprehensive validations on a well-known public dataset Cholec80. Experimental results show that our method outperforms the state-of-the-art methods both quantitatively and qualitatively, and achieves an inference rate of 66 frames per second (fps).
AB - Phase recognition plays an essential role for surgical workflow analysis in computer assisted intervention. Transformer, originally proposed for sequential data modelling in natural language processing, has been successfully applied to surgical phase recognition. Existing works based on transformer mainly focus on modeling attention dependency, without introducing auto-regression. In this work, an Auto-Regressive Surgical Transformer, referred as ARST, is first proposed for on-line surgical phase recognition from laparoscopic videos, modeling the inter-phase correlation implicitly by conditional probability distribution. To reduce inference bias and to enhance phase consistency, we further develop a consistency constraint inference strategy based on auto-regression. We conduct comprehensive validations on a well-known public dataset Cholec80. Experimental results show that our method outperforms the state-of-the-art methods both quantitatively and qualitatively, and achieves an inference rate of 66 frames per second (fps).
KW - Surgical workflow analysis
KW - auto-regression
KW - laparoscopic videos
KW - surgical phase recognition
KW - transformer
UR - https://www.scopus.com/pages/publications/85142190075
U2 - 10.1080/21681163.2022.2145238
DO - 10.1080/21681163.2022.2145238
M3 - 文章
AN - SCOPUS:85142190075
SN - 2168-1163
VL - 11
SP - 1012
EP - 1018
JO - Computer Methods in Biomechanics and Biomedical Engineering: Imaging and Visualization
JF - Computer Methods in Biomechanics and Biomedical Engineering: Imaging and Visualization
IS - 4
ER -