Strategic dialogue management via deep reinforcement learning

Cuayahuitl, Heriberto, Keizer, Simon and Lemon, Oliver (2015) Strategic dialogue management via deep reinforcement learning. In: NIPS Workshop on Deep Reinforcement Learning, 11 December 2015, Montreal, Canada.

Full content URL: http://arxiv.org/abs/1611.08675

Documents
25994 1511.08099v1.pdf
[img]
[Download]
[img]
Preview
PDF
25994 1511.08099v1.pdf - Whole Document

867kB
Item Type:Conference or Workshop contribution (Paper)
Item Status:Live Archive

Abstract

Artificially intelligent agents equipped with strategic skills that can negotiate during their interactions with other natural or artificial agents are still underdeveloped. This paper describes a successful application of Deep Reinforcement Learning (DRL) for training intelligent agents with strategic conversational skills, in a situated dialogue setting. Previous studies have modelled the behaviour of strategic agents using supervised learning and traditional reinforcement learning techniques, the latter using tabular representations or learning with linear function approximation. In this study, we apply DRL with a high-dimensional state space to the strategic board game of Settlers of Catan---where players can offer resources in exchange for others and they can also reply to offers made by other players. Our experimental results report that the DRL-based learnt policies significantly outperformed several baselines including random, rule-based, and supervised-based behaviours. The DRL-based policy has a 53% win rate versus 3 automated players (`bots'), whereas a supervised player trained on a dialogue corpus in this setting achieved only 27%, versus the same 3 bots. This result supports the claim that DRL is a promising framework for training dialogue systems, and strategic agents with negotiation abilities.

Keywords:Deep Reinforcement Learning, (Spoken) Dialogue Systems
Subjects:G Mathematical and Computer Sciences > G700 Artificial Intelligence
Divisions:College of Science > School of Computer Science
Related URLs:
ID Code:25994
Deposited On:02 Feb 2017 14:51

Repository Staff Only: item control page