Our "delta-GCLip" is the *only* known adaptive gradient algorithm that provably trains deep-nets AND is practically competitive. That's the message of our recently accepted #TMLR paper, which is also my 4th TMLR publication.
openreview.net/pdf?id=ABT1X...
#optimization #deeplearningtheory