test_nn.py
43 KB
-
Optimize training update operators · 494774d3
Summary: This commit fuses the weight decay and mixed precision conversion into update kernels to get lower training latency.
Ting PAN committed
Summary: This commit fuses the weight decay and mixed precision conversion into update kernels to get lower training latency.