Abstract
We propose a sequential learning algorithm with a focus on robot
control. It is initialised by a teacher who directs the robot
through a series of example solutions of a problem. Left alone, the
control chooses its next action by prediction based on a variable
order Markov chain model selected to minimise a MDL criterion based
on generalised code length La of the past robot-environment
interaction. The user specifies the parameter a and as a
result, the robot can be directed towards exploratory behaviour if
confidence in the teacher is low (a<0), and towards
goal-seeking exploitive behaviour if confidence in the teacher is
high (a>0). The novelty of the proposed method lies in the
use of generalised code length in the MDL model selection criterion.
control. It is initialised by a teacher who directs the robot
through a series of example solutions of a problem. Left alone, the
control chooses its next action by prediction based on a variable
order Markov chain model selected to minimise a MDL criterion based
on generalised code length La of the past robot-environment
interaction. The user specifies the parameter a and as a
result, the robot can be directed towards exploratory behaviour if
confidence in the teacher is low (a<0), and towards
goal-seeking exploitive behaviour if confidence in the teacher is
high (a>0). The novelty of the proposed method lies in the
use of generalised code length in the MDL model selection criterion.
Original language | English |
---|---|
Title of host publication | RoboMat 07 : Coimbra, Portugal, 17-19 September, 2007 |
Editors | Helder Araujo, Maria Isabel Ribeiro |
Number of pages | 6 |
Publisher | CIM (Centro Internacional de Matematica) |
Publication date | 2007 |
Pages | 51-57 |
ISBN (Print) | 9789899501133 |
Publication status | Published - 2007 |
Event | Workshop of Robotics and Mathematics (RoboMat 2007) - Coimbra, Portugal Duration: 17 Sept 2007 → 19 Sept 2007 |
Conference
Conference | Workshop of Robotics and Mathematics (RoboMat 2007) |
---|---|
Country/Territory | Portugal |
City | Coimbra |
Period | 17/09/2007 → 19/09/2007 |