test_torch.py
5.96 KB
-
Fix cuBLAS fp32 downcast issue on ampere devices · ac051717
Summary: This commit removes the default cuBLAS tensor core math mode when CUDA >= 11.0 on ampere devices to avoid the FP32 downcast math.
Ting PAN committed