Abstract
We propose a sequential learning algorithm with a focus on robot
control. It is initialised by a teacher who directs the robot
through a series of example solutions of a problem. Left alone, the
control chooses its next action by prediction based on a variable
order Markov chain model selected to minimise a MDL criterion based
on generalised code length La of the past robot-environment
interaction. The user specifies the parameter a and as a
result, the robot can be directed towards exploratory behaviour if
confidence in the teacher is low (a<0), and towards
goal-seeking exploitive behaviour if confidence in the teacher is
high (a>0). The novelty of the proposed method lies in the
use of generalised code length in the MDL model selection criterion.
control. It is initialised by a teacher who directs the robot
through a series of example solutions of a problem. Left alone, the
control chooses its next action by prediction based on a variable
order Markov chain model selected to minimise a MDL criterion based
on generalised code length La of the past robot-environment
interaction. The user specifies the parameter a and as a
result, the robot can be directed towards exploratory behaviour if
confidence in the teacher is low (a<0), and towards
goal-seeking exploitive behaviour if confidence in the teacher is
high (a>0). The novelty of the proposed method lies in the
use of generalised code length in the MDL model selection criterion.
Originalsprog | Engelsk |
---|---|
Titel | RoboMat 07 : Coimbra, Portugal, 17-19 September, 2007 |
Redaktører | Helder Araujo, Maria Isabel Ribeiro |
Antal sider | 6 |
Forlag | CIM (Centro Internacional de Matematica) |
Publikationsdato | 2007 |
Sider | 51-57 |
ISBN (Trykt) | 9789899501133 |
Status | Udgivet - 2007 |
Begivenhed | Workshop of Robotics and Mathematics (RoboMat 2007) - Coimbra, Portugal Varighed: 17 sep. 2007 → 19 sep. 2007 |
Konference
Konference | Workshop of Robotics and Mathematics (RoboMat 2007) |
---|---|
Land/Område | Portugal |
By | Coimbra |
Periode | 17/09/2007 → 19/09/2007 |