tl_cuda 0.4.1

CUDA GPU tensor library for TL