torch/core/optim/adam.py · 494774d3a545f807d483fd9e6e4563cedec6dda5 · SeetaResearch / Dragon

Summary:
This commit fuses the weight decay and mixed precision conversion
into update kernels to get lower training latency.

committed Dec 31, 2021

adam.py 4.59 KB