Optimal learning with Bernstein online aggregation

Olivier Wintenberger*

*Corresponding author for this work
7 Citations (Scopus)

Abstract

We introduce a new recursive aggregation procedure called Bernstein Online Aggregation (BOA). Its exponential weights include a second order refinement. The procedure is optimal for the model selection aggregation problem in the bounded iid setting for the square loss: the excess of risk of its batch version achieves the fast rate of convergence log (M) / n in deviation. The BOA procedure is the first online algorithm that satisfies this optimal fast rate. The second order refinement is required to achieve the optimality in deviation as the classical exponential weights cannot be optimal, see Audibert (Advances in neural information processing systems. MIT Press, Cambridge, MA, 2007). This refinement is settled thanks to a new stochastic conversion that estimates the cumulative predictive risk in any stochastic environment with observable second order terms. The observable second order term is shown to be sufficiently small to assert the fast rate in the iid setting when the loss is Lipschitz and strongly convex. We also introduce a multiple learning rates version of BOA. This fully adaptive BOA procedure is also optimal, up to a log log (n) factor.

Original languageEnglish
JournalMachine Learning
Volume106
Issue number1
Pages (from-to)119-141
Number of pages23
ISSN0885-6125
DOIs
Publication statusPublished - 1 Jan 2017

Keywords

  • Exponential weighted averages
  • Individual sequences
  • Learning theory

Fingerprint

Dive into the research topics of 'Optimal learning with Bernstein online aggregation'. Together they form a unique fingerprint.

Cite this