Analyzing steady state Variance in Hebbian Learning: A Moment Closure Approach
Keywords:
Ojas rule, Hebbian learning, learning rate, neural networks, moment closure approximationAbstract
Hebbian learning, an important concept in neural networks, is the basis for various learning algorithms that model the adaptation of neural connections, also known as synapses. Among these models, Oja’s rule stands out as an important example, giving valuable insights into the dynamics of unsupervised learning algorithms. The fact that the final steady-state solution of a single-layer network that learns using Oja’s rule equals the solution of Principal component analysis is well known. However, the way in which the learning rate can affect the variance of the final parameters is less explored. In this paper, we investigate how different learning rates can influence the variance of parameters in Oja’s rule, utilizing the moment closure approximation. By focusing on the variance, we offer new perspectives on the behavior of Oja’s rule under varying conditions. We derive a closed-form equation that connects the parameter variance with the learning rate and shows that the relationship between these is linear. This gives valuable insights that may help to optimize the learning process of Hebbian models.
References
D. O. Hebb, The Organization of Behavior: A Neuropsychological Theory. Wiley, New York, 1949.
S. Nabavi, R. Fox, C. D. Proulx, J. Y. Lin, R. Y. Tsien, and R. Malinow, “Engineering a memory with LTD and LTP,” Nature, vol. 511, pp. 348–352, 2014.
E. Kuriscak, P. Marsalek, J. Stroffek, and P. G. Toth, “Biological context of Hebb learning in artificial neural networks: A review,” Neurocomputing, vol. 152, pp. 27–35, 2015.
G. Amato et al., “Hebbian learning meets deep convolutional neural networks,” in Image Analysis and Processing – ICIAP 2019, E. Ricci et al., Eds. Springer International Publishing, Cham, pp. 324–334, 2019.
J. P. Johansen et al., “Hebbian and neuromodulatory mechanisms interact to trigger associative memory formation,” Proceedings of the National Academy of Sciences, vol. 111, no. 51, pp. E5584–E5592, 2014.
T. J. Sejnowski and G. Tesauro, “The Hebb rule for synaptic plasticity: Algorithms and implementations,” in Neural Models of Plasticity, J. H. Byrne and W. O. Berry, Eds. Academic Press, pp. 94–103, 1989.
E. L. Bienenstock et al., “Theory for the development of neuron selectivity: Orientation specificity and binocular interaction in visual cortex,” Journal of Neuroscience, vol. 2, no. 1, pp. 32–48, 1982.
T. J. Sejnowski, “Storing covariance with nonlinearly interacting neurons,” Journal of Mathematical Biology, vol. 4, no. 4, pp. 303–321, 1977.
E. Oja, “A simplified neuron model as a principal component analyzer,” Journal of Mathematical Biology, vol. 15, no. 3, pp. 267–273, 1982.
L. N. Smith, “Cyclical learning rates for training neural networks,” in Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 464–472, 2017.
E. Vardanyan and D. B. Saakian, “The analytical dynamics of the finite population evolution games,” Physica A: Statistical Mechanics and its Applications, vol. 553, p. 124233, 2020.
E. Vardanyan, E. Koonin, and D. B. Saakian, "Analysis of finite population evolution models using a moment closure approximation," Journal of the Physical Society of Japan, vol. 90, no. 1, p. 014801, 2021.
V. Galstyan and D. B. Saakian, "Quantifying the stochasticity of policy parameters in reinforcement learning problems," Physical Review E, vol. 107, p. 034112, 2023.
E. Oja, "Principal components, minor components, and linear neural networks," Neural Networks, vol. 5, no. 6, pp. 927–935, 1992.
A. Hyvärinen and E. Oja, "Independent component analysis: Algorithms and applications," Neural Networks, vol. 13, no. 4–5, pp. 411–430, 2000.
W. Gerstner, W. M. Kistler, R. Naud, and L. Paninski, Neuronal Dynamics: From Single Neurons to Networks and Models of Cognition, Cambridge University Press, 2014.
L. Isserlis, "On a formula for the product-moment coefficient of any order of a normal frequency distribution in any number of variables," Biometrika, vol. 12, no. 1–2, pp. 134–139, 1918.
L. Isserlis, "On certain probable errors and correlation coefficients of multiple frequency distributions with skew regression," Biometrika, vol. 11, no. 3, pp. 185–190, 1916.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Edgar A. Vardanyan
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.