|
Intelligent systems. Theory and applications, 2022, Volume 26, Issue 3, Pages 64–73
(Mi ista481)
|
|
|
|
Part 2. Special Issues in Intellectual Systems Theory
Residual network with recurrent structures
L. Jianga, Zh. Cuib, Z. Wanga a Lomonosov Moscow State University, Faculty of Mechanics and Mathematics
b Lomonosov Moscow State University, Faculty of Computational Mathematics and Cybernetics
Abstract:
We introduce a recurrent structure (spatially) on residual networks, which can improve the performance of the network while saving parameters. We investigate the behaviour of recurrent structures in residual networks based on Riemannian manifolds, introducing curvature as a metric for neural networks. We also experimentally verify that the gain due to the recurrent structure is related to the curvature, and demonstrate the generality of the recurrent structure as a method to improve the performance of the network.
Keywords:
Neural Netrowks, Riemannian geometry, Recurrent structures, Manifold, Transformers.
Citation:
L. Jiang, Zh. Cui, Z. Wang, “Residual network with recurrent structures”, Intelligent systems. Theory and applications, 26:3 (2022), 64–73
Linking options:
https://www.mathnet.ru/eng/ista481 https://www.mathnet.ru/eng/ista/v26/i3/p64
|
Statistics & downloads: |
Abstract page: | 43 | Full-text PDF : | 47 | References: | 12 |
|