Optimization for training deep models: Cosmos — All that is, or was, or ever will be

cosmos 18th March 2019 at 4:36pm

Deep learning theory

Loss surface of neural networks

A Convergence Theory for Deep Learning via Over-Parameterization

On the Convergence Rate of Training Recurrent Neural Networks