Welcome to the National Transport Library Catalogue

Normal view MARC view

Evaluating Green-Extension Policies with Reinforcement Learning and Markovian Traffic State Estimation Adam, Zain M ; Abbas, Montasir M ; Li, Pengfei

By: Contributor(s): Series: ; 2128Publication details: Washington DC Transportation Research Record: Journal of the Transportation Research Board, 2009Description: s. 217-225ISBN:
  • 9780309142601
Subject(s): Bibl.nr: VTI P8167:2128Location: Abstract: Several protection algorithms strive to reduce the number of vehicles trapped in the dilemma zone. These algorithms use some arbitrary policies such as terminating the green when only one vehicle is present in the dilemma zone and the dilemma zone has not cleared after a certain period of time. The research proposes a control agent that is able to develop and adapt an optimal policy by learning from the environment. The agent incorporates a Markovian traffic state estimation into its learning process. A novel approach is presented for controlling traffic signals so that the number of vehicles trapped in the dilemma zone is reduced in an optimal fashion according to changes in traffic states. A comparison between the proposed optimal policy and the emerging detection-control system two-stage policy was conducted, and it was found that the policy based on reinforcement learning reduced the number of vehicles caught in the dilemma zone by up to 32%.
Item type: Reports, conferences, monographs
Holdings
Current library Status
Statens väg- och transportforskningsinstitut Available

Several protection algorithms strive to reduce the number of vehicles trapped in the dilemma zone. These algorithms use some arbitrary policies such as terminating the green when only one vehicle is present in the dilemma zone and the dilemma zone has not cleared after a certain period of time. The research proposes a control agent that is able to develop and adapt an optimal policy by learning from the environment. The agent incorporates a Markovian traffic state estimation into its learning process. A novel approach is presented for controlling traffic signals so that the number of vehicles trapped in the dilemma zone is reduced in an optimal fashion according to changes in traffic states. A comparison between the proposed optimal policy and the emerging detection-control system two-stage policy was conducted, and it was found that the policy based on reinforcement learning reduced the number of vehicles caught in the dilemma zone by up to 32%.