Maksym Andriushchenko

Paper_weight_decay

October 9, 2023

2023

Our new paper Why Do We Need Weight Decay in Modern Deep Learning? is available online. Also check out our new preprint on layer-wise linear mode connectivity.