LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action

07/10/2022
by   Dhruv Shah, et al.
0

Goal-conditioned policies for robotic navigation can be trained on large, unannotated datasets, providing for good generalization to real-world settings. However, particularly in vision-based settings where specifying goals requires an image, this makes for an unnatural interface. Language provides a more convenient modality for communication with robots, but contemporary methods typically require expensive supervision, in the form of trajectories annotated with language descriptions. We present a system, LM-Nav, for robotic navigation that enjoys the benefits of training on unannotated large datasets of trajectories, while still providing a high-level interface to the user. Instead of utilizing a labeled instruction following dataset, we show that such a system can be constructed entirely out of pre-trained models for navigation (ViNG), image-language association (CLIP), and language modeling (GPT-3), without requiring any fine-tuning or language-annotated robot data. We instantiate LM-Nav on a real-world mobile robot and demonstrate long-horizon navigation through complex, outdoor environments from natural language instructions. For videos of our experiments, code release, and an interactive Colab notebook that runs in your browser, please check out our project page https://sites.google.com/view/lmnav

READ FULL TEXT

page 2

page 6

page 7

page 17

research
10/07/2022

GNM: A General Navigation Model to Drive Any Robot

Learning provides a powerful tool for vision-based navigation, but the c...
research
06/26/2023

ViNT: A Foundation Model for Visual Navigation

General-purpose pre-trained models ("foundation models") have enabled pr...
research
08/04/2022

LaTTe: Language Trajectory TransformEr

Natural language is one of the most intuitive ways to express human inte...
research
10/12/2022

Interactive Language: Talking to Robots in Real Time

We present a framework for building interactive, real-time, natural lang...
research
10/14/2022

ExAug: Robot-Conditioned Navigation Policies via Geometric Experience Augmentation

Machine learning techniques rely on large and diverse datasets for gener...
research
04/12/2021

RECON: Rapid Exploration for Open-World Navigation with Latent Goal Models

We describe a robotic learning system for autonomous navigation in diver...
research
09/19/2023

Guide Your Agent with Adaptive Multimodal Rewards

Developing an agent capable of adapting to unseen environments remains a...

Please sign up or login with your details

Forgot password? Click here to reset