
Adaptive Zone-aware Hierarchical Planner for Vision-Language Navigation

  • Chen Gao
  • Xingyu Peng
  • Mi Yan
  • He Wang
  • Lirong Yang
  • Haibing Ren
  • Hongsheng Li
  • Si Liu*
  • *Corresponding author for this work
  • Beihang University
  • Peking University
  • Meituan
  • Chinese University of Hong Kong

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Peer-reviewed

Abstract

The task of Vision-Language Navigation (VLN) is for an embodied agent to reach the global goal according to the instruction. Essentially, during navigation, a series of sub-goals needs to be adaptively set and achieved, which is naturally a hierarchical navigation process. However, previous methods leverage a single-step planning scheme, i.e., directly performing a navigation action at each step, which is unsuitable for such a hierarchical navigation process. In this paper, we propose an Adaptive Zone-aware Hierarchical Planner (AZHP) that explicitly divides the navigation process into two heterogeneous phases, i.e., sub-goal setting via zone partition/selection (high-level action) and sub-goal executing (low-level action), for hierarchical planning. Specifically, AZHP asynchronously performs two levels of action via the designed State-Switcher Module (SSM). For high-level action, we devise a Scene-aware adaptive Zone Partition (SZP) method to adaptively divide the whole navigation area into different zones on the fly. Then the Goal-oriented Zone Selection (GZS) method is proposed to select a proper zone for the current sub-goal. For low-level action, the agent makes multi-step navigation decisions in the selected zone. Moreover, we design a Hierarchical RL (HRL) strategy and auxiliary losses with curriculum learning to train the AZHP framework, which provides effective supervision signals for each stage. Extensive experiments demonstrate the superiority of our proposed method, which achieves state-of-the-art performance on three VLN benchmarks (REVERIE, SOON, R2R).
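
As a rough illustration of the two-level control flow the abstract describes, the toy Python sketch below alternates between a high-level step (partition the unvisited nodes into zones, then pick one zone) and several low-level steps inside the chosen zone. It is not the authors' implementation. Everything in it is a hypothetical stand-in: `partition_zones` uses simple farthest-point seeding in place of the learned SZP, `select_zone` ranks zones by an assumed per-node relevance `score` in place of GZS, and the switch back to the high level after `max_low_steps` is a fixed rule in place of the learned SSM.

```python
from math import dist

HIGH, LOW = "high", "low"


def partition_zones(positions, num_zones):
    """Stand-in for zone partition: farthest-point seeds, nearest-seed assignment."""
    names = sorted(positions)
    seeds = [names[0]]
    while len(seeds) < min(num_zones, len(names)):
        rest = [n for n in names if n not in seeds]
        # Pick the node farthest from every seed chosen so far.
        seeds.append(max(rest, key=lambda n: min(dist(positions[n], positions[s])
                                                 for s in seeds)))
    zones = {s: [] for s in seeds}
    for n in names:
        nearest = min(seeds, key=lambda s: dist(positions[n], positions[s]))
        zones[nearest].append(n)
    return list(zones.values())


def select_zone(zones, score):
    """Stand-in for zone selection: take the zone holding the best-scoring node."""
    return max(zones, key=lambda zone: max(score[n] for n in zone))


def navigate(positions, score, start, goal_score=1.0,
             num_zones=2, max_low_steps=2, max_steps=20):
    """Alternate high-level zone choice with low-level moves inside the zone."""
    state, current, zone, low_steps = HIGH, start, [], 0
    visited, trace = {start}, []
    for _ in range(max_steps):
        if score[current] >= goal_score:  # assumed stopping rule
            break
        unvisited = {n: p for n, p in positions.items() if n not in visited}
        if not unvisited:
            break
        if state == HIGH:
            # High-level action: set a sub-goal by choosing a zone.
            zone = select_zone(partition_zones(unvisited, num_zones), score)
            trace.append((HIGH, tuple(zone)))
            state, low_steps = LOW, 0
        else:
            # Low-level action: move to the nearest unvisited node in the zone.
            candidates = [n for n in zone if n not in visited]
            current = min(candidates,
                          key=lambda n: dist(positions[current], positions[n]))
            visited.add(current)
            trace.append((LOW, current))
            low_steps += 1
            # Fixed-rule switcher: return to the high level when the step
            # budget is spent or the zone has been fully explored.
            if low_steps >= max_low_steps or all(n in visited for n in zone):
                state = HIGH
    return current, trace


if __name__ == "__main__":
    positions = {"a": (0, 0), "b": (1, 0), "c": (5, 0), "d": (6, 0)}
    score = {"a": 0.0, "b": 0.1, "c": 0.4, "d": 1.0}
    print(navigate(positions, score, start="a"))
```

In this sketch the low-level steps take place only inside the selected zone, and re-partitioning happens each time control returns to the high level, which mirrors the asynchronous two-level scheme in spirit only.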

Original language: English
Title of host publication: Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023
Publisher: IEEE Computer Society
Pages: 14911-14920
Number of pages: 10
ISBN (electronic): 9798350301298
DOI
Publication status: Published - 2023
Event: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 - Vancouver, Canada
Duration: 18 Jun 2023 to 22 Jun 2023

Publication series

Name: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Volume: 2023-June
ISSN (print): 1063-6919

Conference

Conference: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023
Country/Territory: Canada
City: Vancouver
Period: 18/06/23 to 22/06/23
