Distributed W-Learning: Multi-Policy Optimization in Self-Organizing Systems

12 years 8 months ago
Distributed W-Learning: Multi-Policy Optimization in Self-Organizing Systems
—Large-scale agent-based systems are required to self-optimize towards multiple, potentially conflicting, policies of varying spatial and temporal scope. As a result, not all agents may be implementing all policies at all times, resulting in agent heterogeneity. As agents share their operating environment, significant dependencies can arise between agents and therefore between policy implementations. To address self-optimization in the presence of agent heterogeneity, policy dependency and the lack of global knowledge that is inherent in large-scale decentralized environments, we propose Distributed W-Learning (DWL). DWL is a reinforcement learning (RL)-based algorithm for collaborative agent-based self-optimization towards multiple policies, which relies only on local interactions and learning. We have evaluated the DWL algorithm in a simulation of a selforganizing urban traffic control (UTC) system and show that using DWL can improve the performance of multiple policies deployed...
Ivana Dusparic, Vinny Cahill
Added 21 May 2010
Updated 21 May 2010
Type Conference
Year 2009
Where SASO
Authors Ivana Dusparic, Vinny Cahill
Comments (0)