A Data-Efficient Deep Learning Approach for Deployable Multimodal Social Robots

Cuayahuitl, Heriberto (2020) A Data-Efficient Deep Learning Approach for Deployable Multimodal Social Robots. Neurocomputing, 396 . pp. 587-598. ISSN 0925-2312

Full content URL: https://doi.org/10.1016/j.neucom.2018.09.104

This is the latest version of this item.

Documents
A Data-Efficient Deep Learning Approach for Deployable Multimodal Social Robots
Author's Original Manuscript
[img]
[Download]
A Data-Efficient Deep Learning Approach for Deployable Multimodal Social Robots
Accepted Manuscript
[img]
[Download]
[img]
Preview
PDF
main.pdf - Whole Document

11MB
[img] PDF
main.pdf - Whole Document
Available under License Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International.

11MB
Item Type:Article
Item Status:Live Archive

Abstract

The deep supervised and reinforcement learning paradigms (among others) have the potential to endow interactive multimodal social robots with the ability of acquiring skills autonomously. But it is still not very clear yet how they can be best deployed in real world applications. As a step in this direction, we propose a deep learning-based approach for efficiently training a humanoid robot to play multimodal games---and use the game of `Noughts \& Crosses' with two variants as a case study. Its minimum requirements for learning to perceive and interact are based on a few hundred example images, a few example multimodal dialogues and physical demonstrations of robot manipulation, and automatic simulations. In addition, we propose novel algorithms for robust visual game tracking and for competitive policy learning with high winning rates, which substantially outperform DQN-based baselines. While an automatic evaluation shows evidence that the proposed approach can be easily extended to new games with competitive robot behaviours, a human evaluation with 130 humans playing with the {\it Pepper} robot confirms that highly accurate visual perception is required for successful game play.

Additional Information:The final published version of this article can be accessed online at https://www.journals.elsevier.com/neurocomputing/
Keywords:Deep Reinforcement Learning, Deep Supervised Learning, Interactive Robots, Multimodal Perception and Interaction, Board Games
Subjects:H Engineering > H671 Robotics
G Mathematical and Computer Sciences > G700 Artificial Intelligence
G Mathematical and Computer Sciences > G730 Neural Computing
Divisions:College of Science > School of Computer Science
Related URLs:
ID Code:42805
Deposited On:18 Nov 2020 09:43

Available Versions of this Item

Repository Staff Only: item control page