TY - JOUR
T1 - OpenIN
T2 - Open-Vocabulary Instance-Oriented Navigation in Dynamic Domestic Environments
AU - Tang, Yujie
AU - Wang, Meiling
AU - Deng, Yinan
AU - Zheng, Zibo
AU - Deng, Jingchuan
AU - Zuo, Sibo
AU - Yue, Yufeng
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2025
Y1 - 2025
N2 - In daily domestic settings, frequently used objects like cups often have unfixed positions and multiple instances within the same category, and their carriers also frequently change. As a result, it becomes challenging for a robot to efficiently navigate to a specific instance. To tackle this challenge, the robot must capture and update scene changes and plans continuously. However, current object-navigation approaches primarily focus on the semantic level and lack the ability to dynamically update the scene representation. In contrast, this paper captures the relationships between frequently used objects and their static carriers. It constructs an open-vocabulary Carrier-Relationship Scene Graph (CRSG) and updates the carrying status during robot navigation to reflect the dynamic changes of the scene. Based on CRSG, we further propose an instance navigation strategy that models the navigation process as a Markov Decision Process. At each step, decisions are informed by the Large Language Model's commonsense knowledge and visual-language feature similarity. We designed a series of long-horizon navigation tasks for frequently used everyday items in the Habitat simulator. The results demonstrate that by updating the CRSG, the robot can navigate efficiently to moved targets. Additionally, we conducted extensive experiments on a real robot, demonstrating the effectiveness of our method and exploring its limitations.
AB - In daily domestic settings, frequently used objects like cups often have unfixed positions and multiple instances within the same category, and their carriers also frequently change. As a result, it becomes challenging for a robot to efficiently navigate to a specific instance. To tackle this challenge, the robot must capture and update scene changes and plans continuously. However, current object-navigation approaches primarily focus on the semantic level and lack the ability to dynamically update the scene representation. In contrast, this paper captures the relationships between frequently used objects and their static carriers. It constructs an open-vocabulary Carrier-Relationship Scene Graph (CRSG) and updates the carrying status during robot navigation to reflect the dynamic changes of the scene. Based on CRSG, we further propose an instance navigation strategy that models the navigation process as a Markov Decision Process. At each step, decisions are informed by the Large Language Model's commonsense knowledge and visual-language feature similarity. We designed a series of long-horizon navigation tasks for frequently used everyday items in the Habitat simulator. The results demonstrate that by updating the CRSG, the robot can navigate efficiently to moved targets. Additionally, we conducted extensive experiments on a real robot, demonstrating the effectiveness of our method and exploring its limitations.
KW - Carrier-Relationship Scene Graph
KW - Dynamic Scenes
KW - Instance Navigation
UR - http://www.scopus.com/pages/publications/105011836542
U2 - 10.1109/LRA.2025.3592071
DO - 10.1109/LRA.2025.3592071
M3 - Article
AN - SCOPUS:105011836542
SN - 2377-3766
VL - 10
SP - 9256
EP - 9263
JO - IEEE Robotics and Automation Letters
JF - IEEE Robotics and Automation Letters
IS - 9
ER -