context_cuda.h
13 KB
-
Optimize Concat && Split Operator · f76c693e
Summary: This commit uses CopyMatrix to implement concat and split generically instead of specialized kernels.
Ting PAN committed
Summary: This commit uses CopyMatrix to implement concat and split generically instead of specialized kernels.