Bounding the bias of contrastive divergence learning

Anja Fischer; Christian Igel

doi:10.1162/NECO_a_00085

Bounding the bias of contrastive divergence learning

Anja Fischer, Christian Igel

Department of Computer Science

34 Citations (Scopus)

Abstract

Optimization based on k-step contrastive divergence (CD) has become a common way to train restricted Boltzmann machines (RBMs). The k-step CD is a biased estimator of the log-likelihood gradient relying on Gibbs sampling. We derive a new upper bound for this bias. Its magnitude depends on k, the number of variables in the RBM, and the maximum change in energy that can be produced by changing a single variable. The last reflects the dependence on the absolute values of the RBM parameters. The magnitude of the bias is also affected by the distance in variation between the modeled distribution and the starting distribution of the Gibbs chain.

Original language	English
Journal	Neural Computation
Volume	23
Issue number	3
Pages (from-to)	664-673
Number of pages	10
ISSN	0899-7667
DOIs	https://doi.org/10.1162/NECO_a_00085
Publication status	Published - Mar 2011

Access to Document

10.1162/NECO_a_00085

Cite this

@article{7adec40587054251b315a2e516ec75fb,

title = "Bounding the bias of contrastive divergence learning",

abstract = "Optimization based on k-step contrastive divergence (CD) has become a common way to train restricted Boltzmann machines (RBMs). The k-step CD is a biased estimator of the log-likelihood gradient relying on Gibbs sampling. We derive a new upper bound for this bias. Its magnitude depends on k, the number of variables in the RBM, and the maximum change in energy that can be produced by changing a single variable. The last reflects the dependence on the absolute values of the RBM parameters. The magnitude of the bias is also affected by the distance in variation between the modeled distribution and the starting distribution of the Gibbs chain. ",

author = "Anja Fischer and Christian Igel",

year = "2011",

month = mar,

doi = "10.1162/NECO_a_00085",

language = "English",

volume = "23",

pages = "664--673",

journal = "Neural Computation",

issn = "0899-7667",

publisher = "M I T Press",

number = "3",

}

TY - JOUR

T1 - Bounding the bias of contrastive divergence learning

AU - Fischer, Anja

AU - Igel, Christian

PY - 2011/3

Y1 - 2011/3

N2 - Optimization based on k-step contrastive divergence (CD) has become a common way to train restricted Boltzmann machines (RBMs). The k-step CD is a biased estimator of the log-likelihood gradient relying on Gibbs sampling. We derive a new upper bound for this bias. Its magnitude depends on k, the number of variables in the RBM, and the maximum change in energy that can be produced by changing a single variable. The last reflects the dependence on the absolute values of the RBM parameters. The magnitude of the bias is also affected by the distance in variation between the modeled distribution and the starting distribution of the Gibbs chain.

AB - Optimization based on k-step contrastive divergence (CD) has become a common way to train restricted Boltzmann machines (RBMs). The k-step CD is a biased estimator of the log-likelihood gradient relying on Gibbs sampling. We derive a new upper bound for this bias. Its magnitude depends on k, the number of variables in the RBM, and the maximum change in energy that can be produced by changing a single variable. The last reflects the dependence on the absolute values of the RBM parameters. The magnitude of the bias is also affected by the distance in variation between the modeled distribution and the starting distribution of the Gibbs chain.

U2 - 10.1162/NECO_a_00085

DO - 10.1162/NECO_a_00085

M3 - Journal article

C2 - 21162669

SN - 0899-7667

VL - 23

SP - 664

EP - 673

JO - Neural Computation

JF - Neural Computation

IS - 3

ER -

Bounding the bias of contrastive divergence learning

Abstract

Access to Document

Fingerprint

Cite this