Gradient Descent as the Euler Method: From Discrete Optimization to Continuous Dynamics

Last updated