activation.py
19.2 KB
-
Instantiate dispatch template by value for crucial CUDA kernels · 46feba80
Summary: This commit instantiates CUDA kernels by using constant dimensions to enable the optimization during compiler-time.
Ting PAN committed