6.4 Statistical mechanics of RBM
It is hopeless to provide answers to these questions in full generality for a given RBM with parameters fitted from real data. However, statistical physics methods and concepts allow us to study the typical energy landscape and properties of RBM drawn from appropriate random ensembles. We follow this approach hereafter, using the replica method [86]. We define the Random-RBM ensemble model for ReLU hidden units as follows, see also drawing in Fig. 28,
• N binary visible units, M ReLU hidden units, with N, M → ∞ and α = M N is finite.
• uniform visible layer fields, i.e. gi = g, ∀i.
• uniform hidden layer thresholds, i.e. θµ = θ, ∀µ.
• a random weight matrix wiµ = √ ξiµ N , where each ’pattern’ ξiµ is drawn independently, taking values +1, −1 with probabilities p 2 and 0 with probability 1 − p. The degree of sparsity p is the fraction of non-zero weights.