S. Cohen, O. Maimon, and E. Khmelnitsky (Israel)
Distributed reinforcement-learning, factored MDPs, representation abstraction
This paper addresses the decision-making mechanism, for controlling Markov Decision Processes (MDPs) with factored action-spaces. The paper proposes a simple decision-making approach, which decomposes a multi dimensional action-space into its dimensions and distributes the dimensions amongst multiple autonomous agents. A distinct Q-function is defined for each agent, and a Reinforcement Learning (RL) algorithm, which uses the decision-making approach, is proposed. The algorithm maintains estimations that converge to the agents' Q-functions. It is shown that by distributing the decision-making, and due to the factored action-space, new opportunities for representation abstractions emerge. The paper's contribution is illustrated, using a navigation task that involves three robots.
Important Links:
Go Back