Segment Any Point Cloud Sequences by Distilling Vision Foundation Models

06/15/2023
by   Youquan Liu, et al.
2

Recent advancements in vision foundation models (VFMs) have opened up new possibilities for versatile and efficient visual perception. In this work, we introduce Seal, a novel framework that harnesses VFMs for segmenting diverse automotive point cloud sequences. Seal exhibits three appealing properties: i) Scalability: VFMs are directly distilled into point clouds, eliminating the need for annotations in either 2D or 3D during pretraining. ii) Consistency: Spatial and temporal relationships are enforced at both the camera-to-LiDAR and point-to-segment stages, facilitating cross-modal representation learning. iii) Generalizability: Seal enables knowledge transfer in an off-the-shelf manner to downstream tasks involving diverse point clouds, including those from real/synthetic, low/high-resolution, large/small-scale, and clean/corrupted datasets. Extensive experiments conducted on eleven different point cloud datasets showcase the effectiveness and superiority of Seal. Notably, Seal achieves a remarkable 45.0 random initialization by 36.9 Moreover, Seal demonstrates significant performance gains over existing methods across 20 different few-shot fine-tuning tasks on all eleven tested point cloud datasets.

READ FULL TEXT

page 2

page 4

page 11

page 14

page 15

page 16

page 17

page 28

research
12/08/2022

Frozen CLIP Model is An Efficient Point Cloud Backbone

The pretraining-finetuning paradigm has demonstrated great success in NL...
research
01/25/2019

Dense 3D Point Cloud Reconstruction Using a Deep Pyramid Network

Reconstructing a high-resolution 3D model of an object is a challenging ...
research
06/11/2019

Few-Shot Point Cloud Region Annotation with Human in the Loop

We propose a point cloud annotation framework that employs human-in-loop...
research
05/29/2021

RPG: Learning Recursive Point Cloud Generation

In this paper we propose a novel point cloud generator that is able to r...
research
05/19/2023

PointGPT: Auto-regressively Generative Pre-training from Point Clouds

Large language models (LLMs) based on the generative pre-training transf...
research
06/14/2023

Explore In-Context Learning for 3D Point Cloud Understanding

With the rise of large-scale models trained on broad data, in-context le...
research
01/05/2021

CLOI: An Automated Benchmark Framework For Generating Geometric Digital Twins Of Industrial Facilities

This paper devises, implements and benchmarks a novel framework, named C...

Please sign up or login with your details

Forgot password? Click here to reset