Generalized softmax networks for non-linear component extraction.
In: MarquesDeSa, J and Alexandre, LA and Duch, W and Mandic, DP, (eds.)
Artificial Neural Networks - ICANN 2007, Pt 1, Proceedings.
(pp. 657 - 667).
We develop a probabilistic interpretation of non-linear component extraction in neural networks that activate their hidden units according to a softmaxlike mechanism. On the basis of a generative model that combines hidden causes using the max-function, we show how the extraction of input components in such networks can be interpreted as maximum likelihood parameter optimization. A simple and neurally plausible Hebbian A-rule is derived. For approximatelyoptimal learning, the activity of the hidden neural units is described by a generalized softmax function and the classical softmax is recovered for very sparse input. We use the bars benchmark test to numerically verify our analytical results and to show competitiveness of the derived learning algorithms.
|Title:||Generalized softmax networks for non-linear component extraction|
|Event:||17th International Conference on Artificial Neural Networks (ICANN 2007)|
|Dates:||2007-09-09 - 2007-09-13|
|UCL classification:||UCL > School of Life and Medical Sciences
UCL > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > School of Life and Medical Sciences > Faculty of Life Sciences > Gatsby Computational Neuroscience Unit
Archive Staff Only