Approximation properties of DBNs with binary hidden units and real-valued visible units

Oswin Krause; Asja Fischer; Tobias Glasmachers; Christian Igel

Approximation properties of DBNs with binary hidden units and real-valued visible units

Oswin Krause, Asja Fischer, Tobias Glasmachers, Christian Igel

Department of Computer Science

7 Citations (Scopus)

Abstract

Deep belief networks (DBNs) can approximate any distribution over fixed-length binary vectors. However, DBNs are frequently applied to model real-valued data, and so far little is known about their representational power in this case. We analyze the approximation properties of DBNs with two layers of binary hidden units and visible units with conditional distributions from the exponential family. It is shown that these DBNs can, under mild assumptions, model any additive mixture of distributions from the exponential family with independent variables. An arbitrarily good approximation in terms of Kullback-Leibler divergence of an m-dimensional mixture distribution with n components can be achieved by a DBN with m visible variables and n and n + 1 hidden variables in the first and second hidden layer, respectively. Furthermore, relevant infinite mixtures can be approximated arbitrarily well by a DBN with a finite number of neurons. This includes the important special case of an infinite mixture of Gaussian distributions with fixed variance restricted to a compact domain, which in turn can approximate any strictly positive density over this domain.

Original language	English
Title of host publication	Proceedings of the 30th International Conference on Machine Learning
Editors	Sanjoy Dasgupta, David McAllester
Number of pages	8
Publication date	2013
Pages	419-426
Publication status	Published - 2013
Event	30th International Conference on Machine Learning - Atlanta, United States Duration: 16 Jun 2013 → 21 Jun 2013 Conference number: 30

Conference

Conference	30th International Conference on Machine Learning
Number	30
Country/Territory	United States
City	Atlanta
Period	16/06/2013 → 21/06/2013

Series	JMLR: Workshop and Conference Proceedings
Volume	28

Access to Document

Approximation properties of DBNs with binary hidden units and real-valued visible unitsFinal published version, 349 KB

http://jmlr.org/proceedings/papers/v28/krause13.pdf

Cite this

Approximation properties of DBNs with binary hidden units and real-valued visible units. / Krause, Oswin; Fischer, Asja; Glasmachers, Tobias et al.
Proceedings of the 30th International Conference on Machine Learning. ed. / Sanjoy Dasgupta; David McAllester. 2013. p. 419-426 (JMLR: Workshop and Conference Proceedings, Vol. 28).

Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review

Krause, O, Fischer, A, Glasmachers, T & Igel, C 2013, Approximation properties of DBNs with binary hidden units and real-valued visible units. in S Dasgupta & D McAllester (eds), Proceedings of the 30th International Conference on Machine Learning. JMLR: Workshop and Conference Proceedings, vol. 28, pp. 419-426, 30th International Conference on Machine Learning, Atlanta, United States, 16/06/2013. <http://jmlr.org/proceedings/papers/v28/krause13.pdf>

@inproceedings{2a4804ed808b4af1a53a640168b2ac96,

title = "Approximation properties of DBNs with binary hidden units and real-valued visible units",

abstract = "Deep belief networks (DBNs) can approximate any distribution over fixed-length binary vectors. However, DBNs are frequently applied to model real-valued data, and so far little is known about their representational power in this case. We analyze the approximation properties of DBNs with two layers of binary hidden units and visible units with conditional distributions from the exponential family. It is shown that these DBNs can, under mild assumptions, model any additive mixture of distributions from the exponential family with independent variables. An arbitrarily good approximation in terms of Kullback-Leibler divergence of an m-dimensional mixture distribution with n components can be achieved by a DBN with m visible variables and n and n + 1 hidden variables in the first and second hidden layer, respectively. Furthermore, relevant infinite mixtures can be approximated arbitrarily well by a DBN with a finite number of neurons. This includes the important special case of an infinite mixture of Gaussian distributions with fixed variance restricted to a compact domain, which in turn can approximate any strictly positive density over this domain.",

author = "Oswin Krause and Asja Fischer and Tobias Glasmachers and Christian Igel",

year = "2013",

language = "English",

series = "JMLR: Workshop and Conference Proceedings",

publisher = "Microtome Publishing",

pages = "419--426",

editor = "Sanjoy Dasgupta and David McAllester",

booktitle = "Proceedings of the 30th International Conference on Machine Learning",

note = "30th International Conference on Machine Learning ; Conference date: 16-06-2013 Through 21-06-2013",

}

TY - GEN

T1 - Approximation properties of DBNs with binary hidden units and real-valued visible units

AU - Krause, Oswin

AU - Fischer, Asja

AU - Glasmachers, Tobias

AU - Igel, Christian

N1 - Conference code: 30

PY - 2013

Y1 - 2013

N2 - Deep belief networks (DBNs) can approximate any distribution over fixed-length binary vectors. However, DBNs are frequently applied to model real-valued data, and so far little is known about their representational power in this case. We analyze the approximation properties of DBNs with two layers of binary hidden units and visible units with conditional distributions from the exponential family. It is shown that these DBNs can, under mild assumptions, model any additive mixture of distributions from the exponential family with independent variables. An arbitrarily good approximation in terms of Kullback-Leibler divergence of an m-dimensional mixture distribution with n components can be achieved by a DBN with m visible variables and n and n + 1 hidden variables in the first and second hidden layer, respectively. Furthermore, relevant infinite mixtures can be approximated arbitrarily well by a DBN with a finite number of neurons. This includes the important special case of an infinite mixture of Gaussian distributions with fixed variance restricted to a compact domain, which in turn can approximate any strictly positive density over this domain.

AB - Deep belief networks (DBNs) can approximate any distribution over fixed-length binary vectors. However, DBNs are frequently applied to model real-valued data, and so far little is known about their representational power in this case. We analyze the approximation properties of DBNs with two layers of binary hidden units and visible units with conditional distributions from the exponential family. It is shown that these DBNs can, under mild assumptions, model any additive mixture of distributions from the exponential family with independent variables. An arbitrarily good approximation in terms of Kullback-Leibler divergence of an m-dimensional mixture distribution with n components can be achieved by a DBN with m visible variables and n and n + 1 hidden variables in the first and second hidden layer, respectively. Furthermore, relevant infinite mixtures can be approximated arbitrarily well by a DBN with a finite number of neurons. This includes the important special case of an infinite mixture of Gaussian distributions with fixed variance restricted to a compact domain, which in turn can approximate any strictly positive density over this domain.

M3 - Article in proceedings

T3 - JMLR: Workshop and Conference Proceedings

SP - 419

EP - 426

BT - Proceedings of the 30th International Conference on Machine Learning

A2 - Dasgupta, Sanjoy

A2 - McAllester, David

T2 - 30th International Conference on Machine Learning

Y2 - 16 June 2013 through 21 June 2013

ER -

Approximation properties of DBNs with binary hidden units and real-valued visible units

Abstract

Conference

Access to Document

Fingerprint

Cite this