The author's site: http://web.mit.edu/~rsalakhu/www/index.html
Chapter 2 is about RBM and DBN.
Restricted Boltzmann Machine:
Two-layer architecture, visible binary units, v, and hidden binary units, h.
dimension of v is D and dimension of h is F.
The energy of state {v, h} is:
W is the symmetric weights, b is the visible bias and a is the hidden bias.
The joint distribution over the visible and hidden units is defined by:
Z(\theta) is know as the partition function for normalization.
The probability that the model assigns to the visible vector v is:
and the hidden units could be explicitly marginalized out:
The conditional probabilities:
From the energy based model theory: http://deeplearning.net/tutorial/rbm.html
Free energy is defined as:
data:image/s3,"s3://crabby-images/c0f66/c0f660a18d27833c5df554a705640b24a45daea5" alt=""
then:
data:image/s3,"s3://crabby-images/c82f3/c82f3b2ee8b901134202ae270e87326aa74d794f" alt=""
P(x) is actually P(v; \theta) above.
For RBM, the free energy is:
Fro RBMs with binary visible units and binary hidden units, we obtain:
No comments:
Post a Comment