PAC-Bayesian inequalities for martingales

Yevgeny Seldin; François Laviolette; Nicolò Cesa-Bianchi; John Shawe-Taylor; Peter Auer

PAC-Bayesian inequalities for martingales

Yevgeny Seldin, François Laviolette, Nicolò Cesa-Bianchi, John Shawe-Taylor, Peter Auer

29 Citations (Scopus)

Abstract

We present a set of high-probability inequalities that control the concentration of weighted averages of multiple (possibly uncountably many) simultaneously evolving and interdependent martingales. Our results extend the PAC-Bayesian (probably approximately correct) analysis in learning theory from the i.i.d. setting to martingales opening the way for its application to importance weighted sampling, reinforcement learning, and other interactive learning domains, as well as many other domains in probability theory and statistics, where martingales are encountered. We also present a comparison inequality that bounds the expectation of a convex function of a martingale difference sequence shifted to the [0, 1] interval by the expectation of the same function of independent Bernoulli random variables. This inequality is applied to derive a tighter analog of Hoeffding-Azuma's inequality.

Original language	English
Journal	I E E E Transactions on Information Theory
Volume	58
Issue number	12
ISSN	0018-9448
Publication status	Published - 2012
Externally published	Yes

Access to Document

http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6257492

Cite this

@article{93049c6d310649ec9a2fa691492bfce6,

title = "PAC-Bayesian inequalities for martingales",

abstract = "We present a set of high-probability inequalities that control the concentration of weighted averages of multiple (possibly uncountably many) simultaneously evolving and interdependent martingales. Our results extend the PAC-Bayesian (probably approximately correct) analysis in learning theory from the i.i.d. setting to martingales opening the way for its application to importance weighted sampling, reinforcement learning, and other interactive learning domains, as well as many other domains in probability theory and statistics, where martingales are encountered. We also present a comparison inequality that bounds the expectation of a convex function of a martingale difference sequence shifted to the [0, 1] interval by the expectation of the same function of independent Bernoulli random variables. This inequality is applied to derive a tighter analog of Hoeffding-Azuma's inequality.",

author = "Yevgeny Seldin and Fran{\c c}ois Laviolette and Nicol{\`o} Cesa-Bianchi and John Shawe-Taylor and Peter Auer",

year = "2012",

language = "English",

volume = "58",

journal = "I E E E Transactions on Information Theory",

issn = "0018-9448",

publisher = "Institute of Electrical and Electronics Engineers",

number = "12",

}

TY - JOUR

T1 - PAC-Bayesian inequalities for martingales

AU - Seldin, Yevgeny

AU - Laviolette, François

AU - Cesa-Bianchi, Nicolò

AU - Shawe-Taylor, John

AU - Auer, Peter

PY - 2012

Y1 - 2012

N2 - We present a set of high-probability inequalities that control the concentration of weighted averages of multiple (possibly uncountably many) simultaneously evolving and interdependent martingales. Our results extend the PAC-Bayesian (probably approximately correct) analysis in learning theory from the i.i.d. setting to martingales opening the way for its application to importance weighted sampling, reinforcement learning, and other interactive learning domains, as well as many other domains in probability theory and statistics, where martingales are encountered. We also present a comparison inequality that bounds the expectation of a convex function of a martingale difference sequence shifted to the [0, 1] interval by the expectation of the same function of independent Bernoulli random variables. This inequality is applied to derive a tighter analog of Hoeffding-Azuma's inequality.

AB - We present a set of high-probability inequalities that control the concentration of weighted averages of multiple (possibly uncountably many) simultaneously evolving and interdependent martingales. Our results extend the PAC-Bayesian (probably approximately correct) analysis in learning theory from the i.i.d. setting to martingales opening the way for its application to importance weighted sampling, reinforcement learning, and other interactive learning domains, as well as many other domains in probability theory and statistics, where martingales are encountered. We also present a comparison inequality that bounds the expectation of a convex function of a martingale difference sequence shifted to the [0, 1] interval by the expectation of the same function of independent Bernoulli random variables. This inequality is applied to derive a tighter analog of Hoeffding-Azuma's inequality.

M3 - Journal article

SN - 0018-9448

VL - 58

JO - I E E E Transactions on Information Theory

JF - I E E E Transactions on Information Theory

IS - 12

ER -

PAC-Bayesian inequalities for martingales

Abstract

Access to Document

Fingerprint

Cite this