A reinforcement learning from human feedback based method for task allocation of human robot collaboration assembly considering human preference

Jingfei Wang; Yan Yan; Yaoguang Hu; Xiaonan Yang

doi:10.1016/j.aei.2025.103497

Jingfei Wang, Yan Yan, Yaoguang Hu, Xiaonan Yang^*

^*Corresponding author for this work

School of Mechanical Engineering

Beijing Institute of Technology

Research output: Contribution to journal › Article › Peer-reviewed

1 citation (Scopus)

Abstract

Human-robot collaboration is currently considered an important enabling technology for human-centered manufacturing in Industry 5.0. Reasonable task allocation and sequencing in the human-robot collaboration process are necessary to fully utilize the strengths of workers and robots and to improve workers’ performance and experience. Although current studies of task allocation consider many human factors, the complexity of the decision-making process makes it difficult for workers to provide preferred choices and feedback that directly affect the decisions. As a result, the solution may not suit an individual worker. To address this problem, this study proposes a task allocation method based on reinforcement learning from human feedback. In this method, multi-agent reinforcement learning is applied to pre-train the agent models to solve the task allocation and sequencing problem with multiple optimal objectives. An analytic hierarchy process-based method is used to analyze human action preferences and build a heuristic reward model. Furthermore, a preference training approach using knowledge distillation is proposed, in which agents are adjusted through preference rewards and pre-trained optimization experiences to learn a decision-making policy that suits worker preferences. The effectiveness of the method is verified in comparative and ablation experiments.

Original language	English
Article number	103497
Journal	Advanced Engineering Informatics
Volume	66
DOI	http://doi.org/10.1016/j.aei.2025.103497
Publication status	Published - Jul 2025

Cite this

@article{155aec7197ee4726bd86c26726d99225,

title = "A reinforcement learning from human feedback based method for task allocation of human robot collaboration assembly considering human preference",

abstract = "Currently, human-robot collaboration is considered as an important enabling technology in human-centered manufacturing of industry 5.0. Reasonable task allocation and sequencing of human-robot collaboration process are necessary to fully utilize the strengths of workers and robots to improve workers{\textquoteright} performance and experience. Although many human factors are considered in current studies of task allocation, it is difficult for workers to provide preferred choices and feedback to directly affect the decision-making due to the complexity of decision-making process, moreover, it may result in a solution that is not suitable for individual worker. To address this problem, a task allocation method based on human feedback reinforcement learning is proposed in this study. In this method, multi-agent reinforcement learning is applied to pre-train the agent models to solve the task allocation and sequencing problem with multiple optimal objectives. An analytic hierarchy process-based method is utilized to analyze human action preferences to build a heuristic reward model. Furthermore, a preference training approach using knowledge distillation is proposed, and agents are adjusted through preference rewards and pre-trained optimization experiences to learn a decision-making policy that suits worker preferences. The effectiveness of the method is verified in comparative and ablation experiments.",

keywords = "Human robot collaboration, Reinforcement learning, Reinforcement learning from human feedback, Task allocation and sequencing",

author = "Jingfei Wang and Yan Yan and Yaoguang Hu and Xiaonan Yang",

note = "Publisher Copyright: {\textcopyright} 2025",

year = "2025",

month = jul,

doi = "10.1016/j.aei.2025.103497",

language = "English",

volume = "66",

journal = "Advanced Engineering Informatics",

issn = "1474-0346",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - A reinforcement learning from human feedback based method for task allocation of human robot collaboration assembly considering human preference

AU - Wang, Jingfei

AU - Yan, Yan

AU - Hu, Yaoguang

AU - Yang, Xiaonan

PY - 2025/7

Y1 - 2025/7

N2 - Currently, human-robot collaboration is considered as an important enabling technology in human-centered manufacturing of industry 5.0. Reasonable task allocation and sequencing of human-robot collaboration process are necessary to fully utilize the strengths of workers and robots to improve workers’ performance and experience. Although many human factors are considered in current studies of task allocation, it is difficult for workers to provide preferred choices and feedback to directly affect the decision-making due to the complexity of decision-making process, moreover, it may result in a solution that is not suitable for individual worker. To address this problem, a task allocation method based on human feedback reinforcement learning is proposed in this study. In this method, multi-agent reinforcement learning is applied to pre-train the agent models to solve the task allocation and sequencing problem with multiple optimal objectives. An analytic hierarchy process-based method is utilized to analyze human action preferences to build a heuristic reward model. Furthermore, a preference training approach using knowledge distillation is proposed, and agents are adjusted through preference rewards and pre-trained optimization experiences to learn a decision-making policy that suits worker preferences. The effectiveness of the method is verified in comparative and ablation experiments.

AB - Currently, human-robot collaboration is considered as an important enabling technology in human-centered manufacturing of industry 5.0. Reasonable task allocation and sequencing of human-robot collaboration process are necessary to fully utilize the strengths of workers and robots to improve workers’ performance and experience. Although many human factors are considered in current studies of task allocation, it is difficult for workers to provide preferred choices and feedback to directly affect the decision-making due to the complexity of decision-making process, moreover, it may result in a solution that is not suitable for individual worker. To address this problem, a task allocation method based on human feedback reinforcement learning is proposed in this study. In this method, multi-agent reinforcement learning is applied to pre-train the agent models to solve the task allocation and sequencing problem with multiple optimal objectives. An analytic hierarchy process-based method is utilized to analyze human action preferences to build a heuristic reward model. Furthermore, a preference training approach using knowledge distillation is proposed, and agents are adjusted through preference rewards and pre-trained optimization experiences to learn a decision-making policy that suits worker preferences. The effectiveness of the method is verified in comparative and ablation experiments.

KW - Human robot collaboration

KW - Reinforcement learning

KW - Reinforcement learning from human feedback

KW - Task allocation and sequencing

UR - http://www.scopus.com/pages/publications/105005866575

U2 - 10.1016/j.aei.2025.103497

DO - 10.1016/j.aei.2025.103497

M3 - Article

AN - SCOPUS:105005866575

SN - 1474-0346

VL - 66

JO - Advanced Engineering Informatics

JF - Advanced Engineering Informatics

M1 - 103497

ER -
