Latent Graph Representations for Critical View of Safety Assessment

by   Aditya Murali, et al.
Université de Strasbourg

Assessing the critical view of safety in laparoscopic cholecystectomy requires accurate identification and localization of key anatomical structures, reasoning about their geometric relationships to one another, and determining the quality of their exposure. In this work, we propose to capture each of these aspects by modeling the surgical scene with a disentangled latent scene graph representation, which we can then process using a graph neural network. Unlike previous approaches using graph representations, we explicitly encode in our graphs semantic information such as object locations and shapes, class probabilities and visual features. We also incorporate an auxiliary image reconstruction objective to help train the latent graph representations. We demonstrate the value of these components through comprehensive ablation studies and achieve state-of-the-art results for critical view of safety prediction across multiple experimental settings.


page 1

page 2

page 5

page 8


Learning Latent Scene-Graph Representations for Referring Relationships

Understanding the semantics of complex visual scenes often requires anal...

Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning

Recent approaches on visual scene understanding attempt to build a scene...

SurGNN: Explainable visual scene understanding and assessment of surgical skill using graph neural networks

This paper explores how graph neural networks (GNNs) can be used to enha...

GINet: Graph Interaction Network for Scene Parsing

Recently, context reasoning using image regions beyond local convolution...

3D-LatentMapper: View Agnostic Single-View Reconstruction of 3D Shapes

Computer graphics, 3D computer vision and robotics communities have prod...

Please sign up or login with your details

Forgot password? Click here to reset