基于深度強化學(xué)習的移動(dòng)機器人動(dòng)態(tài)路徑規劃算法

首頁(yè) > 過(guò)刊瀏覽>2023年第31卷第1期 >153-159

基于深度強化學(xué)習的移動(dòng)機器人動(dòng)態(tài)路徑規劃算法
DOI:
                        
                    
CSTR:
                        [cstr]
                    
作者:
                        
                        
                    
作者單位:浙江工業(yè)大學(xué)
作者簡(jiǎn)介:
通訊作者:
中圖分類(lèi)號:
基金項目:國家自然科學(xué)基金項目 （61973275）

Dynamic path planning algorithm of mobile robot based on deep reinforcement learning

Author:

Affiliation:

Fund Project:

摘要

圖/表

訪(fǎng)問(wèn)統計

參考文獻

相似文獻

引證文獻

資源附件

文章評論

摘要:

為了在復雜舞臺環(huán)境下使用移動(dòng)機器人實(shí)現物品搬運或者載人演出，提出了一種基于深度強化學(xué)習的動(dòng)態(tài)路徑規劃算法。首先通過(guò)構建全局地圖獲取移動(dòng)機器人周?chē)恼系K物信息，將演員和舞臺道具分別分類(lèi)成動(dòng)態(tài)障礙物和靜態(tài)障礙物。然后建立局部地圖，通過(guò)LSTM網(wǎng)絡(luò )編碼動(dòng)態(tài)障礙物信息，使用社會(huì )注意力機制計算每個(gè)動(dòng)態(tài)障礙物的重要性來(lái)實(shí)現更好的避障效果。通過(guò)構建新的獎勵函數來(lái)實(shí)現對動(dòng)靜態(tài)障礙物的不同躲避情況。最后通過(guò)模仿學(xué)習和優(yōu)先級經(jīng)驗回放技術(shù)來(lái)提高網(wǎng)絡(luò )的收斂速度，從而實(shí)現在舞臺復雜環(huán)境下的移動(dòng)機器人的動(dòng)態(tài)路徑規劃。實(shí)驗結果表明，該網(wǎng)絡(luò )的收斂速度明顯提高，在不同障礙物環(huán)境下都能夠表現出好的動(dòng)態(tài)避障效果。

Abstract:

A dynamic path planning algorithm based on deep reinforcement learning is proposed in order to use mobile robots to carry goods or perform manned performances in complex stage environment. Firstly, the obstacle information around the mobile robot is obtained by constructing a global map, and the actors and stage props are classified into dynamic obstacles and static obstacles respectively. Then establish a local map, encode the dynamic obstacle information through LSTM network, and calculate the importance of each dynamic obstacle through social attention mechanism to achieve better obstacle avoidance effect. By constructing a new reward function, different avoidance situations of dynamic and static obstacles are realized. Finally, simulation learning and priority experience playback technology are used to improve the convergence speed of the network, so as to realize the dynamic path planning of mobile robot in the complex stage environment. The experimental results show that the convergence speed of the network is significantly improved, and it can show good dynamic obstacle avoidance effect in different obstacle environments.

參考文獻

相似文獻

引證文獻

引用本文

張柏鑫,楊毅鑌,朱華中,劉安東,倪洪杰.基于深度強化學(xué)習的移動(dòng)機器人動(dòng)態(tài)路徑規劃算法計算機測量與控制[J].,2023,31(1):153-159.

復制

文章指標

點(diǎn)擊次數:
下載次數:
HTML閱讀次數:
引用次數:

歷史

收稿日期:2022-06-11
最后修改日期:2022-07-09
錄用日期:2022-07-11
在線(xiàn)發(fā)布日期: 2023-01-16
出版日期:

国产欧美精品一区二区,中文字幕专区在线亚洲,国产精品美女网站在线观看,艾秋果冻传媒2021精品,在线免费一区二区,久久久久久青草大香综合精品,日韩美aaa特级毛片,欧美成人精品午夜免费影视

引用本文

分享

文章指標

歷史

文章二維碼