跳到主要导航 跳到搜索 跳到主要内容

CLlight: Enhancing representation of multi-agent reinforcement learning with contrastive learning for cooperative traffic signal control

  • Beihang University
  • Zhongguancun Laboratory
  • Ltd.
  • Key Laboratory of Urban ITS Technology Optimization and Integration Ministry of Public Security People's Republic of China
  • Key Laboratory of intelligent transportation in Anhui Province

科研成果: 期刊稿件文章同行评审

摘要

Multi-agent reinforcement learning has shown great potential for coordinating multi-intersection traffic signals due to its powerful adaptive capabilities, treating each intersection as an agent. However, in the real world, different intersections possess differentiating characteristics such as unique vehicle distributions and traffic patterns. Most existing methods directly add neighboring intersection states to local intersections and optimize the cooperative policy network based on synthesized global features. This indirect optimization approach makes it difficult to thoroughly explore the mutual interactions among different intersection agents, preventing agents from truly learning features with cooperative awareness. To resolve these challenges, we introduce contrastive learning as representation task to the multi-intersection traffic signal control approach named CLlight for two-stage policy network updating. In the first stage, we utilize policy-based or actor-critic-based reinforcement learning methods such as A2C, SAC, and PPO to train policy networks with certain representational capabilities. In the second stage, by extracting pre- and post-masked features and reconstructing the post-masked features, the agents are encouraged to learn the similarities and differences between different intersection policies, which in turn enhances the cooperative and individual representation capabilities of the policy network. To the best of our knowledge, this is the first application of contrastive learning in the field of traffic signal control. Experimental results demonstrate, compared to other state-of-the-art traffic signal control methods, superior average travel time and average waiting time performance under various scenarios, tested on synthetic and real-world datasets.

源语言英语
文章编号125578
期刊Expert Systems with Applications
262
DOI
出版状态已出版 - 1 3月 2025

指纹

探究 'CLlight: Enhancing representation of multi-agent reinforcement learning with contrastive learning for cooperative traffic signal control' 的科研主题。它们共同构成独一无二的指纹。

引用此