iterator.py
6.72 KB
Fix cuBLAS FP32 downcast issue on Ampere devices · ac051717
Summary: This commit stops enabling the cuBLAS tensor core math mode by default when CUDA >= 11.0 is used on Ampere devices, so that FP32 math is not downcast.
Ting PAN committed