TY - JOUR
T1 - Intention-Guided Heuristic Partially Observable Monte Carlo Planning for Off-Ramp Decision-Making of Autonomous Vehicles
AU - Chen, Yanbo
AU - Yan, Guofu
AU - Yu, Huilong
AU - Xi, Junqiang
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Partially Observable Monte Carlo Planning (POMCP) leverages Monte Carlo Tree Search (MCTS) and Particle Filtering (PF) to improve computational efficiency in solving large-scale Partially Observable Markov Decision Processes (POMDPs), enabling belief-state updates and effective adaptation to evolving uncertainties; it has therefore been widely studied in autonomous driving. However, this approach faces two limitations when applied to planning for autonomous vehicles: chaotic branch expansion in the belief tree reduces computational efficiency, and particle deprivation hinders accurate estimation of the dynamic intentions of surrounding vehicles. To this end, an intention-guided Partially Observable Monte Carlo Planning with Heuristic-based Double Progressive Widening (POMCP-HDPW) approach is proposed to facilitate efficient decision-making for autonomous vehicles. We propose an enhanced PF resampling method that accounts for the driving intentions of surrounding vehicles, maintaining particle diversity and thereby improving estimation accuracy. Additionally, we prune the action and observation spaces by leveraging human driving experience and collision risk assessment, enabling the expansion and exploration of high-value belief nodes and preventing chaotic expansion. Three different methods are employed to drive the motion of surrounding vehicles, validating the robustness of the proposed model: Intelligent Driver Model (IDM) control, offline replay of exiD trajectories, and driver-in-the-loop validation. Notably, experimental results on the exiD dataset demonstrate a success rate of 96.88% in off-ramp scenarios.
AB - Partially Observable Monte Carlo Planning (POMCP) leverages Monte Carlo Tree Search (MCTS) and Particle Filtering (PF) to improve computational efficiency in solving large-scale Partially Observable Markov Decision Processes (POMDPs), enabling belief-state updates and effective adaptation to evolving uncertainties; it has therefore been widely studied in autonomous driving. However, this approach faces two limitations when applied to planning for autonomous vehicles: chaotic branch expansion in the belief tree reduces computational efficiency, and particle deprivation hinders accurate estimation of the dynamic intentions of surrounding vehicles. To this end, an intention-guided Partially Observable Monte Carlo Planning with Heuristic-based Double Progressive Widening (POMCP-HDPW) approach is proposed to facilitate efficient decision-making for autonomous vehicles. We propose an enhanced PF resampling method that accounts for the driving intentions of surrounding vehicles, maintaining particle diversity and thereby improving estimation accuracy. Additionally, we prune the action and observation spaces by leveraging human driving experience and collision risk assessment, enabling the expansion and exploration of high-value belief nodes and preventing chaotic expansion. Three different methods are employed to drive the motion of surrounding vehicles, validating the robustness of the proposed model: Intelligent Driver Model (IDM) control, offline replay of exiD trajectories, and driver-in-the-loop validation. Notably, experimental results on the exiD dataset demonstrate a success rate of 96.88% in off-ramp scenarios.
KW - Autonomous driving
KW - Partially Observable Markov Decision Process
KW - Decision-making
UR - http://www.scopus.com/pages/publications/105000168907
U2 - 10.1109/TITS.2025.3547906
DO - 10.1109/TITS.2025.3547906
M3 - Article
AN - SCOPUS:105000168907
SN - 1524-9050
VL - 26
SP - 10834
EP - 10849
JO - IEEE Transactions on Intelligent Transportation Systems
JF - IEEE Transactions on Intelligent Transportation Systems
IS - 7
ER -