On Early Stopping in Gradient Descent Learning

Overview of attention for article published in Constructive Approximation, April 2007

Altmetric Badge

About this Attention Score

In the top 25% of all research outputs scored by Altmetric
High Attention Score compared to outputs of the same age (83rd percentile)

Mentioned by

twitter: 2 X users
patent: 2 patents

wikipedia: 2 Wikipedia pages

Citations

dimensions_citation: 602 Dimensions

Readers on

mendeley: 343 Mendeley
citeulike: 1 CiteULike

Summary X Patents Wikipedia Dimensions citations

So far, Altmetric has seen 2 X posts from 2 X users, with an upper bound of 6,262 followers.

@deepcohen It should behave more similar to early stopping in kernel regression. This is of course similar to ridge regularization, but can lead to better convergence rates if the target function is smoother than the kernel: https://t.co/AfGFASXIRI

29 Jan 2023

Reply Repost Favourite

@ogrisel Nice paper, but early stopping to prevent overfitting in *infinite dimensional spaces* has been analyzed also long time ago https://t.co/YzZrHpXhhv by @lrntzrsc I also did my part showing that in the same setting a special SGD does not need early

16 Jul 2018

Reply Repost Favourite