Single- and Multi-Task Architectures for Surgical Workflow Challenge at M2CAI 2016

by Andru P. Twinanda, et al.

The surgical workflow challenge at M2CAI 2016 consists of identifying 8 surgical phases in cholecystectomy procedures. We propose deep architectures based on our previous work, in which we presented several architectures for performing multiple recognition tasks on laparoscopic videos. In this technical report, we present phase recognition results using two architectures: (1) a single-task architecture designed solely for surgical phase recognition and (2) a multi-task architecture designed to jointly perform phase recognition and tool presence detection. On top of these architectures, we propose two approaches to enforce the temporal constraints of the surgical workflow: (1) an HMM-based pipeline and (2) an LSTM-based pipeline. The results show that the LSTM-based approach outperforms the HMM-based approach and properly enforces the temporal constraints in the recognition process.
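
To make the idea of enforcing temporal constraints concrete, the following is a minimal sketch of HMM-style Viterbi decoding over per-frame phase probabilities. It is not the pipeline used in the report: the function name `viterbi_smooth`, the `stay_prob` parameter, and the strict left-to-right transition model (each frame either stays in its phase or advances to the next one) are all assumptions made for illustration, and the per-frame probabilities are assumed to come from some frame-level classifier.

```python
import math


def viterbi_smooth(frame_probs, stay_prob=0.9):
    """Return the most likely phase index per frame.

    frame_probs: list of per-frame probability lists, one value per phase.
    stay_prob:   hypothetical probability of remaining in the current phase;
                 the rest of the mass goes to advancing by exactly one phase.
    Assumes (for illustration only) that the video starts in phase 0 and that
    phases never go backwards or skip ahead.
    """
    n_phases = len(frame_probs[0])
    log_stay = math.log(stay_prob)
    log_next = math.log(1.0 - stay_prob)
    eps = 1e-12  # avoids log(0) for zero-probability emissions
    neg_inf = float("-inf")

    # Initialisation: only phase 0 is reachable at the first frame.
    score = [neg_inf] * n_phases
    score[0] = math.log(frame_probs[0][0] + eps)
    back = []  # back-pointers, one list per frame after the first

    for probs in frame_probs[1:]:
        new_score = [neg_inf] * n_phases
        pointers = [0] * n_phases
        for p in range(n_phases):
            stay = score[p] + log_stay
            advance = score[p - 1] + log_next if p > 0 else neg_inf
            if advance > stay:
                best, pointers[p] = advance, p - 1
            else:
                best, pointers[p] = stay, p
            new_score[p] = best + math.log(probs[p] + eps)
        score = new_score
        back.append(pointers)

    # Backtrack from the best final phase to recover the full path.
    state = max(range(n_phases), key=lambda p: score[p])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path
```

Under these assumptions, a frame whose classifier output briefly jumps to a later phase is pulled back into the current phase, because the transition model has no way to return from a later phase to an earlier one.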


Single- and Multi-Task Architectures for Tool Presence Detection Challenge at M2CAI 2016

The tool presence detection challenge at M2CAI 2016 consists of identify...

DeepPhase: Surgical Phase Recognition in CATARACTS Videos

Automated surgical workflow analysis and understanding can assist surgeo...

EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos

Surgical workflow recognition has numerous potential medical application...

Surgical Workflow Recognition: from Analysis of Challenges to Architectural Study

Algorithmic surgical workflow recognition is an ongoing research field a...

The TUM LapChole dataset for the M2CAI 2016 workflow challenge

In this technical report we present our collected dataset of laparoscopi...

"Train one, Classify one, Teach one" – Cross-surgery transfer learning for surgical step recognition

Prior work demonstrated the ability of machine learning to automatically...

On the Pitfalls of Batch Normalization for End-to-End Video Learning: A Study on Surgical Workflow Analysis

Batch Normalization's (BN) unique property of depending on other samples...
