运筹学学报 ›› 2023, Vol. 27 ›› Issue (2): 49-62.doi: 10.15960/j.cnki.issn.1007-6093.2023.02.003

•   • 上一篇    下一篇

面向城市交通信号优化的多智能体强化学习综述

华贇1, 王祥丰1,*(), 金博1   

  1. 1. 华东师范大学计算机科学与技术学院, 上海 200062
  • 收稿日期:2022-06-12 出版日期:2023-06-15 发布日期:2023-06-13
  • 通讯作者: 王祥丰 E-mail:xfwang@cs.ecnu.edu.cn
  • 作者简介:王祥丰, E-mail: xfwang@cs.ecnu.edu.cn
  • 基金资助:
    国家重点研发计划(2021YFA1000300);国家重点研发计划(2021YFA1000302);国家自然科学基金(12071145);国家自然科学基金(11971216)

Multi-agent deep reinforcement learning-based urban traffic signal management

Yun HUA1, Xiangfeng WANG1,*(), Bo JIN1   

  1. 1. School of Computer Science and Technology, East China Normal University, Shanghai 200062, China
  • Received:2022-06-12 Online:2023-06-15 Published:2023-06-13
  • Contact: Xiangfeng WANG E-mail:xfwang@cs.ecnu.edu.cn

摘要:

随着近年来国民经济水平的快速提高, 人民的出行需求快速增长, 给当前由传统非智能信号控制主导的道路交通信号系统带来了日趋严峻的压力。交通路网复杂程度的显著提升促使交通信号控制从单点问题向系统工程问题发展, 而人工智能技术的兴起, 使得城市交通信号优化有了更多的处理手段。以多智能体强化学习为代表的群体智能方法在最近几年被广泛应用于交通信号控制与优化, 其中包括交通信号灯控制、自动驾驶、车路协同等。多智能体强化学习方法相比于传统方法, 可以赋予交通信号系统智能化的同时实现大规模交通信号系统协作, 以提升城市交通运行效率。未来智慧城市交通愿景下, 参与城市交通的各个部分互相协作是至关重要的, 多智能体强化学习在城市交通信号优化具有极大研究价值。本文将系统介绍面向城市交通信号优化的多智能体强化学习的基本理论及其应用于城市交通信号优化领域的现状, 从智能体协作的角度对已有方法进行归纳, 并分析各类方法优缺点。此外, 本文总结多智能体强化学习方法在城市交通信号优化领域所面临的挑战, 并指出该领域未来潜在研究方向, 以促进多智能体强化学习方法在智能城市交通信号优化领域的发展。

关键词: 多智能体强化学习, 智能交通, 交通信号控制, 自动驾驶

Abstract:

With the rapid improvement of the national economy in recent years, people's travel demand has increased, bringing increasingly severe pressure on the current urban traffic signal system relying on traditional non-intelligent traffic lights. The significant increase in the complexity of the traffic network has led to the development of traffic signal control from a single-point problem to a system engineering problem, and the development of artificial intelligence technology brings more methods to dealing with urban traffic signal control. Swarm intelligence methods, represented by multi-agent reinforcement learning, have been widely used in traffic signal control and optimization, including traffic light control, autonomous driving, and vehicle-road collaboration. Compared to traditional methods, multi-agent reinforcement learning can empower the intelligence of traffic signal systems while implementing large-scale traffic signal system collaboration to improve the efficiency of urban traffic operations. The various components involved in urban transportation must collaborate in the vision of intelligent urban traffic. Multi-agent reinforcement learning is of great research value in urban traffic signal control and optimization. This paper will systematically introduce the basic theory of multi-agent deep reinforcement learning and its use in urban traffic signal optimization, summarize the existing approaches and analyze the drawbacks of each method. In addition, this paper will outline the challenges of multi-agent reinforcement learning methods for urban traffic signal optimization. Then the paper points out possible future research directions to promote the development of multi-agent reinforcement learning methods in urban traffic signal optimization.

Key words: multi-agent reinforcement learning, intelligent traffic, traffic signal control, autonomous driving

中图分类号: