强化学习在城市交通信号灯控制方法中的应用

刘义; 何均宏

doi:10.3981/j.issn.1000-7857.2019.06.011

科技导报 >

2019 , Vol. 37 >Issue 6: 84 - 90

DOI: https://doi.org/10.3981/j.issn.1000-7857.2019.06.011

专题：智能交通

强化学习在城市交通信号灯控制方法中的应用

刘义 ,
何均宏

展开

1. 深圳市公安局交通警察局, 深圳 518035;
2. 华为技术有限公司, 深圳 518080

刘义,工程师,研究方向为交通管理,电子信箱:liuyi@stc.gov.cn

收稿日期: 2019-01-14

修回日期: 2019-01-29

网络出版日期: 2019-04-09

收起

A survey of the application of reinforcement learning in urban traffic signal control methods

LIU Yi ,
HE Junhong

Expand

1. Shenzhen Traffic Police, Shenzhen 518035, China;
2. Huawei Technologies Co., Ltd., Shenzhen 518080, China

Received date: 2019-01-14

Revised date: 2019-01-29

Online published: 2019-04-09

Fold

摘要

悉尼自适应交通控制系统（SCATS）、绿信比-周期-相位差优化技术（SCOOT）及Smooth采用自适应交通信号灯控制方法，对城市道路口的交通信号灯进行了有效控制。随着深圳城市交通流量急剧增长，深圳交警在自主研发Smooth信号控制式基础上，提出实时、分布式、自适应调控要求，联合创新了人工信号控制方案TrafficGo，探索基于深度神经网络的强化学习，通过在线学习各种流量负荷，实时推理计算信控时段、相位、相序、信号周期、绿信比、相位差，进一步优化了交通信号灯的控制模式。介绍了在交通信号灯控制中运用的强化学习模型，实地测评表明，其取得了一定改进效果。

关键词： 交通信号控制; 强化学习; 人工智能; 通行效率

本文引用格式

刘义 , 何均宏 . 强化学习在城市交通信号灯控制方法中的应用[J]. 科技导报, 2019 , 37(6) : 84 -90 . DOI: 10.3981/j.issn.1000-7857.2019.06.011

Abstract

The adaptive traffic signal control method is adopted to effectively control the traffic lights at the urban road junctions, with the rapid growth of the traffic flow in Shenzhen. Shenzhen traffic police asked for a real-time, distributed and adaptive control on the basis of the self-developed smooth signal control. Joint innovation has developed the reinforcement learning based on the deep neural network. Through online learning of various traffic loads, and the real-time reasoning, the information control period, phase, phase sequence, signal cycle, split and phase difference are calculated. This paper reviews the reinforcement learning model used in the traffic signal control, and makes an evaluation on the spot.

Key words： traffic signal control; reinforcement learning; artificial intelligence; pass efficiency

参考文献

[1] 陆化普. 大数据及其在城市智能交通系统中的应用综述[J]. 交通运输系统工程与信息, 2015(10):45-51.Lu Huapu. Big data and its applications in urban intelligent transportation system[J]. Journal of Transportation Systems Engineering and Information Technology, 2015(10):45-51.
[2] 杨文臣, 张轮, Zhu Feng. 多智能体强化学习在城市交通网络信号控制方法中的应用综述[J]. 计算机应用研究, 2018, 35(6):101-114. Yang Wenchen, Zhang Lun, Zhu Feng. Multi-agent reinforcement learning based traffic signal control for integrated urban network:Survey of state of art[J]. Application Research of Computers, 2018, 35(6):101-114.
[3] Li L, Lv Y S, Wang F Y. Traffic signal timing via deep reinforcement learning[J]. Acta Automatica Sinica, 2016, 3(3):247-254.
[4] Hamilton A, Waterson B, Cherrett T, et al. The evolution of urban traffic control:Changing policy and technology[J]. Transportation Planning & Technology, 2013, 36(1):24-43.
[5] Zhang J, Wang F Y, Wang K, et al. Data-driven intelligent transportation systems:A survey[J]. IEEE Transactions on Intelligent Transportation Systems, 2011, 12(4):1624-1639.
[6] Wu X, Liu H X. Using high-resolution event-based data for traffic modeling and control:An overview[J]. Transportation Research Part C, 2014, 42(2):28-43.
[7] Yau K L A, Qadir J, Khoo H L, et al. A Survey on reinforcement learning models and algorithms for traffic signal control[J]. ACM Computing Surveys, 2017, 50(3):1-38.
[8] Azimirad E, Pariz N, Sistani M B N. A novel fuzzy model and control of single intersection at urban traffic network[J]. IEEE Systems Journal, 2010, 4(1):107-111.
[9] Balaji P G, German X, Srinivasan D. Urban traffic signal control using reinforcement learning agents[J]. IET Intelligent Transport Systems, 2010, 4(3):177-188.
[10] Sutton R S, Barto A G. Reinforcement learning:An introduction[J]. IEEE Transactions on Neural Networks, 1998, 9(5):1054.
[11] Watkins C J C H, Dayan P. Q-learning[J]. Machine Learning, 1992, 8(3/4):279-292.
[12] Lecun Y, Bengio Y, Hinton G. Deep learning[J]. Nature, 2015, 521(7553):436-444.
[13] Mnih V, Kavukcuoglu K, Silver D, et al. Human-level control through deep reinforcement learning[J]. Nature, 2015, 518(7540):529-533.
[14] Genders W, Razavi S. Using a deep reinforcement learning agent for traffic signal control[J]. arXiv preprint, 2016, arXiv:1611.01142.
[15] Tran D, Toulis P, Airoldi E M. Stochastic gradient descent methods for estimation with large data sets[J]. arXiv preprint, 2015, arXiv:1509.06459.
[16] Lillicrap T P, Hunt J J, Pritzel A, et al. Continuous control with deep reinforcement learning[J]. arXiv preprint, 2016, arXiv:1509.02971.

Options

文章导航

摘要

本文引用格式

Abstract

参考文献

联系我们

访问统计

模态框（Modal）标题

摘要

本文引用格式

Abstract

参考文献

联系我们

访问统计