A curious robot: An explorative-exploitive inference algorithm

Kim Steenstrup Pedersen, Peter Johansen

Abstract

We propose a sequential learning algorithm with a focus on robot
control. It is initialised by a teacher who directs the robot
through a series of example solutions of a problem. Left alone, the
control chooses its next action by prediction based on a variable
order Markov chain model selected to minimise a MDL criterion based
on generalised code length La of the past robot-environment
interaction. The user specifies the parameter a and as a
result, the robot can be directed towards exploratory behaviour if
confidence in the teacher is low (a<0), and towards
goal-seeking exploitive behaviour if confidence in the teacher is
high (a>0). The novelty of the proposed method lies in the
use of generalised code length in the MDL model selection criterion.
Original languageEnglish
Title of host publicationRoboMat 07 : Coimbra, Portugal, 17-19 September, 2007
EditorsHelder Araujo, Maria Isabel Ribeiro
Number of pages6
PublisherCIM (Centro Internacional de Matematica)
Publication date2007
Pages51-57
ISBN (Print)9789899501133
Publication statusPublished - 2007
EventWorkshop of Robotics and Mathematics (RoboMat 2007) - Coimbra, Portugal
Duration: 17 Sept 200719 Sept 2007

Conference

ConferenceWorkshop of Robotics and Mathematics (RoboMat 2007)
Country/TerritoryPortugal
CityCoimbra
Period17/09/200719/09/2007

Fingerprint

Dive into the research topics of 'A curious robot: An explorative-exploitive inference algorithm'. Together they form a unique fingerprint.

Cite this