We prove a quantitative functional central limit theorem for one-hidden-layer neural networks with generic activation function. Our rates of convergence depend heavily on the smoothness of the activation function, and they range from logarithmic for nondifferentiable nonlinearities such as the ReLu to √n for highly regular activations. Our main tools are based on functional versions of the Stein–Malliavin method; in particular, we rely on a quantitative functional central limit theorem which has been recently established by Bourguin and Campese.
Cammarota, V., Marinucci, D., Salvi, M., Vigogna, S. (2024). A quantitative functional central limit theorem for shallow neural networks. MODERN STOCHASTICS: THEORY AND APPLICATIONS, 11(1), 85-108 [10.15559/23-VMSTA238].
A quantitative functional central limit theorem for shallow neural networks
Marinucci D.;Salvi M.
;Vigogna S.
2024-01-01
Abstract
We prove a quantitative functional central limit theorem for one-hidden-layer neural networks with generic activation function. Our rates of convergence depend heavily on the smoothness of the activation function, and they range from logarithmic for nondifferentiable nonlinearities such as the ReLu to √n for highly regular activations. Our main tools are based on functional versions of the Stein–Malliavin method; in particular, we rely on a quantitative functional central limit theorem which has been recently established by Bourguin and Campese.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.