基于Dueling Double DQN的交通信號控制方法

首頁(yè) > 過(guò)刊瀏覽>2024年第32卷第7期 >154-161

基于Dueling Double DQN的交通信號控制方法
DOI:
                        
                    
CSTR:
                        [cstr]
                    
作者:
                        
                        
                    
作者單位:浙江理工大學(xué) 信息科學(xué)與工程學(xué)院
作者簡(jiǎn)介:
通訊作者:
中圖分類(lèi)號:TP
基金項目:國家自然科學(xué)基金青年科學(xué)基金項目（No. 61603154），浙江省自然科學(xué)基金探索項目(No. LTGS23F030002)，嘉興市應用性基礎研究項目(2023AY11034)，工業(yè)控制技術(shù)國家重點(diǎn)實(shí)驗室開(kāi)放課題(No. ICT2022B52)

Traffic Signal Control Method based on Dueling Double DQN

Author:

Affiliation:

Fund Project:

摘要

圖/表

訪(fǎng)問(wèn)統計

參考文獻

相似文獻

引證文獻

資源附件

文章評論

摘要:

為了提高交叉口通行效率緩解交通擁堵，深入挖掘交通狀態(tài)信息中所包含的深層次隱含特征信息，提出了一種基于Dueling Double DQN (D3QN) 的單交叉口交通信號控制方法。構建了一個(gè)基于深度強化學(xué)習Double DQN(DDQN)的交通信號控制模型，對動(dòng)作-價(jià)值函數的估計值和目標值迭代運算過(guò)程進(jìn)行了優(yōu)化，克服基于深度強化學(xué)習DQN的交通信號控制模型存在收斂速度慢的問(wèn)題。設計了一個(gè)新的Dueling Network解耦交通狀態(tài)和相位動(dòng)作的價(jià)值，增強Double DQN (DDQN) 提取深層次特征信息的能力。基于微觀(guān)仿真平臺SUMO搭建了一個(gè)單交叉口模擬仿真框架和環(huán)境，開(kāi)展仿真測試。仿真測試結果表明，與傳統交通信號控制方法和基于深度強化學(xué)習DQN的交通信號控制方法相比，所提方法能夠有效減少車(chē)輛平均等待時(shí)間、車(chē)輛平均排隊長(cháng)度和車(chē)輛平均停車(chē)次數，明顯提升交叉口通行效率。

Abstract:

In order to improve the efficiency of intersection traffic and alleviate traffic congestion, a single intersection traffic signal control method based on Dueling Double DQN (D3QN) is proposed by deeply exploring the deep hidden feature information contained in traffic status information. A traffic signal control model based on deep reinforcement learning Double DQN (DDQN) was constructed, and the iterative operation process of the estimated value and target value of the action value function was optimized to overcome the problem of slow convergence speed in traffic signal control models based on deep reinforcement learning DQN. A new Dueling Network has been designed to decouple the value of traffic states and phase actions, enhancing the ability of Double DQN (DDQN) to extract deep level feature information. A single intersection simulation framework and environment were built based on the micro simulation platform simulation of urban mobility（SUMO）for simulation testing. The simulation test results show that compared with traditional traffic signal control methods and traffic signal control methods based on deep reinforcement learning DQN, the proposed method can effectively reduce the average waiting time, average queue length, and average number of stops of vehicles, significantly improving the efficiency of intersection traffic.

參考文獻

相似文獻

引證文獻

引用本文

葉寶林,陳棟,劉春元,陳濱,吳維敏.基于Dueling Double DQN的交通信號控制方法計算機測量與控制[J].,2024,32(7):154-161.

復制

文章指標

點(diǎn)擊次數:
下載次數:
HTML閱讀次數:
引用次數:

歷史

收稿日期:2023-11-26
最后修改日期:2024-01-09
錄用日期:2024-01-10
在線(xiàn)發(fā)布日期: 2024-08-02
出版日期:

国产欧美精品一区二区,中文字幕专区在线亚洲,国产精品美女网站在线观看,艾秋果冻传媒2021精品,在线免费一区二区,久久久久久青草大香综合精品,日韩美aaa特级毛片,欧美成人精品午夜免费影视

引用本文

分享

文章指標

歷史

文章二維碼