The paradox of over-parameterization in learning algorithm

Authors

  • Fabrizio Morlando Independent Researcher, Italy

DOI:

https://doi.org/10.56947/amcs.v34.824

Keywords:

Algorithmic Stability, Over-parameterization, Generalization Gap

Abstract

Traditional learning theory fails to explain why over-parameterized networks generalize. We propose a physical synthesis treating learning as a dissipative process. By integrating Differential Geometry, Dynamical Systems, and Algorithmic Stability, we show generalization depends on dynamical stability rather than architecture. Utilizing the Polyak-Lojasiewicz condition and Restricted Strong Convexity, we establish that gradient flow acts as a contractive operator, preventing trajectory divergence. Finally, McDiarmid’s inequalities convert this contraction into rigorous probabilistic bounds. Our framework proves that over-parameterization facilitates geometric properties that inherently regularize the learning process, ensuring robust generalization despite high model capacity.

Downloads

Download data is not yet available.

Downloads

Published

2026-05-24

Issue

Section

Articles