WIT Press

An Adaptive Learning Algorithm For A Route Choice Problem In Uncertain Traffic Environments


Free (open access)





Page Range

43 - 52










T. Miyagi


User equilibrium in a congested network has conventionally been formulated using mathematical optimization procedures. These approaches assume that drivers act with complete information, in the sense that each driver knows the other drivers’ behaviours and their payoff functions. If a driver does not know the other drivers’ strategies, he cannot optimize his own. In such a situation, an adaptive heuristic may be a suitable way to reach a better solution. To establish behavioural rules of route choice under incomplete information, we use smooth fictitious play and a regret matching model, both developed in game theory, and combine the two approaches. We also propose a new algorithm that is applicable to complex situations in traffic environments.

Keywords: smooth fictitious play, regret-based strategy, ε-Hannan consistency, reinforcement learning.

1 Introduction

Consider a dynamic environment where drivers choose their routes repeatedly every day. Each driver is equipped with a route guidance system that receives signals from the traffic control centre and can be used to collect information about the travel times of the routes he might choose. Each driver knows his own payoff function, but not those of the other drivers. Moreover, each driver can learn in hindsight the vector of payoffs he would have obtained had he chosen any of his possible actions. We are interested not only in such informed drivers, but also in naive drivers, who cannot use the route guidance system. Their knowledge about travel is far more restricted: a naive driver does not

