However, the model depends heavily on the magnitude of the visible units. I'm finding that even obvious patterns, ones a human can easily detect, are not captured by RBM learning if the visible units are scaled down by some reasonable factor. This may be because the energy contributions of many desirable patterns become too small, so the model is only able to learn the few bases that still give strong signals.
In these cases, it helps to scale up the lower-layer weights.
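To make the effect concrete, here is a minimal sketch, assuming a standard binary-hidden RBM where P(h_j = 1 | v) = sigmoid(b_j + sum_i v_i W_ij). The weights, biases, pattern, and the 0.1 scale factor are all made-up toy values, and the helper `hidden_probs` is hypothetical, not taken from any library. It only illustrates how much signal reaches the hidden units, not the learning dynamics themselves.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def hidden_probs(v, W, b):
    """P(h_j = 1 | v) for a binary-hidden RBM: sigmoid(b_j + sum_i v_i * W[i][j])."""
    return [
        sigmoid(b[j] + sum(v[i] * W[i][j] for i in range(len(v))))
        for j in range(len(b))
    ]

# Toy (assumed) parameters: 4 visible units, 2 hidden units.
# Hidden unit 0 responds to the first two visible units, hidden unit 1 to the last two.
W = [[ 2.0, -2.0],
     [ 2.0, -2.0],
     [-2.0,  2.0],
     [-2.0,  2.0]]
b = [0.0, 0.0]
pattern = [1.0, 1.0, 0.0, 0.0]

# Scale the visible units down by an assumed factor.
scale = 0.1
scaled_pattern = [scale * x for x in pattern]

# One way to compensate: scale the weights up by the inverse factor.
compensated_W = [[w / scale for w in row] for row in W]

p_full = hidden_probs(pattern, W, b)                       # strong, decisive response
p_scaled = hidden_probs(scaled_pattern, W, b)              # response drifts toward 0.5
p_fixed = hidden_probs(scaled_pattern, compensated_W, b)   # response restored

print("full scale:   ", [round(p, 3) for p in p_full])
print("scaled down:  ", [round(p, 3) for p in p_scaled])
print("weights fixed:", [round(p, 3) for p in p_fixed])
```

With the scaled-down input, both hidden units sit close to 0.5, so the pattern barely distinguishes them. Multiplying the weights by the inverse of the scale factor brings back the original response, which is one reading of why scaling up the lower-layer weights helps.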