Contextual policy search for generalizing a parameterized biped walking controller

Abdolmaleki, A. and Lau, N. and Reis, L. P. and Peters, J. and Neumann, G. (2015) Contextual policy search for generalizing a parameterized biped walking controller. In: IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC), 8 - 10 April 2015, Vila Real, Portugal.

Documents
25698 07101605.pdf

Request a copy
[img] PDF
25698 07101605.pdf - Whole Document
Restricted to Repository staff only

372kB
Item Type:Conference or Workshop contribution (Paper)
Item Status:Live Archive

Abstract

We investigate learning of flexible Robot locomotion controller, i.e., the controllers should be applicable for multiple contexts, for example different walking speeds, various slopes of the terrain or other physical properties of the robot. In our experiments, contexts are desired walking linear speed and the direction of the gait. Current approaches for learning control parameters of biped locomotion controllers are typically only applicable for a single context. They can be used for a particular context, for example to learn a gait with highest speed, lowest energy consumption or a combination of both. The question of our research is, how can we obtain a flexible walking controller that controls the robot (near) optimally for many different contexts? We achieve the desired flexibility of the controller by applying the recently developed contextual relative entropy policy search(REPS) method. With such a contextual policy search algorithm, we can generalize the robot walking controller for different contexts, where a context is described by a real valued vector. In this paper we also extend the contextual REPS algorithm to learn a non-linear policy instead of a linear one over the contexts. In order to validate our method, we perform a simulation experiment using a simulated NAO humanoid robot. The robot now learns a policy to choose the controller parameters for a continuous set of walking speeds and directions.

Keywords:Robotics, Legged locomotion, Context, Foot, Trajectory, Springs, Joints
Subjects:H Engineering > H670 Robotics and Cybernetics
G Mathematical and Computer Sciences > G440 Human-computer Interaction
Divisions:College of Science > School of Computer Science
Related URLs:
ID Code:25698
Deposited On:19 May 2017 15:00

Repository Staff Only: item control page