High-Dimensional Scaling Limits of Online Stochastic Gradient Descent in Single-Index Models
dc.contributor.author | Rangriz, Parsa | |
dc.date.accessioned | 2025-04-25T13:40:33Z | |
dc.date.available | 2025-04-25T13:40:33Z | |
dc.date.issued | 2025-04-25 | |
dc.date.submitted | 2025-04-15 | |
dc.description.abstract | We analyze the scaling limits of stochastic gradient descent (SGD) with a constant step size in the high-dimensional regime in single-index models. Specifically, we prove limit theorems for the trajectories of finite-dimensional summary statistics of SGD as the dimension tends to infinity. These scaling limits enable the analysis of both ballistic dynamics, described by a system of ordinary differential equations (ODEs), and diffusive dynamics, captured by a system of stochastic differential equations (SDEs). Additionally, we analyze a critical step-size scaling regime where, below this threshold, the effective ballistic dynamics align with the gradient flow of the population loss. In contrast, a new diffusive correction term appears at the threshold due to fluctuations around the fixed points. Furthermore, we discuss nearly sharp thresholds for the number of samples required for consistent estimation, which depend solely on an intrinsic property of the activation function known as the information exponent. Our main contribution is demonstrating that if a single-index model has an information exponent greater than two, the deterministic scaling limit, corresponding to the ballistic phase, or so-called dynamical mean-field theory in statistical physics, fails to achieve consistent estimation in high-dimensional inference problems. This shows the necessity of diffusive correction terms to accurately describe the dynamics of online SGD in single-index models via SDEs such as an Ornstein-Uhlenbeck process. | |
dc.identifier.uri | https://hdl.handle.net/10012/21642 | |
dc.language.iso | en | |
dc.pending | false | |
dc.publisher | University of Waterloo | en |
dc.title | High-Dimensional Scaling Limits of Online Stochastic Gradient Descent in Single-Index Models | |
dc.type | Master Thesis | |
uws-etd.degree | Master of Mathematics | |
uws-etd.degree.department | Statistics and Actuarial Science | |
uws-etd.degree.discipline | Statistics | |
uws-etd.degree.grantor | University of Waterloo | en |
uws-etd.embargo.terms | 0 | |
uws.contributor.advisor | Jagannath, Aukosh | |
uws.contributor.affiliation1 | Faculty of Mathematics | |
uws.peerReviewStatus | Unreviewed | en |
uws.published.city | Waterloo | en |
uws.published.country | Canada | en |
uws.published.province | Ontario | en |
uws.scholarLevel | Graduate | en |
uws.typeOfResource | Text | en |