Used to perform calculations with an OpenCL-capable device.
To make new calculations invocable, a trait providing new operations should be implemented for CLDevice.
A CPU is used to perform calculations on the host CPU.
To make new operations invocable, a trait providing new functions should be implemented for CPU.
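
The extension pattern described above can be sketched as follows. This is a minimal illustration and not the library's actual API: `Cpu` is a hypothetical stand-in for the real CPU device type, and the `AddOp` trait and its `add` method are assumed names invented for this example.

```rust
// Hypothetical stand-in for the library's CPU device; not the real type.
struct Cpu;

// Hypothetical operation trait (assumed name): element-wise addition.
trait AddOp<T> {
    fn add(&self, lhs: &[T], rhs: &[T]) -> Vec<T>;
}

// Implementing the trait for the device makes the new operation
// invocable as a method on that device.
impl<T: Copy + std::ops::Add<Output = T>> AddOp<T> for Cpu {
    fn add(&self, lhs: &[T], rhs: &[T]) -> Vec<T> {
        assert_eq!(lhs.len(), rhs.len(), "operands must have equal length");
        lhs.iter().zip(rhs).map(|(&a, &b)| a + b).collect()
    }
}
```

With this in place, a call such as `Cpu.add(&a[..], &b[..])` runs the operation on the host. The same trait could be implemented for other device types to offer the operation there as well.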
Used to perform calculations with a CUDA-capable device.
To make new calculations invocable, a trait providing new operations should be implemented for CudaDevice.
Device is another representation of a compute device.
It stores the type of the device and a pointer to the device from which this Device originates.
This is used instead of another “device” generic for Buffer.
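
A rough sketch of what such a type-erased handle might look like is given below. All names here (`DeviceType`, its variants, `HostDevice`, `as_device`, and the field names) are assumptions made for illustration; the library's real definitions may differ.

```rust
use std::ffi::c_void;

// Hypothetical device kinds; the variants are assumptions.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum DeviceType {
    Cpu,
    Cuda,
    OpenCl,
}

// Hypothetical concrete device that a handle can originate from.
struct HostDevice {
    name: &'static str,
}

// Type-erased handle: the device kind plus a raw pointer to the
// originating device. The pointer is only valid while that device lives.
#[derive(Debug, Clone, Copy)]
struct Device {
    device_type: DeviceType,
    device: *const c_void,
}

impl HostDevice {
    // Builds a handle that points back at `self`.
    fn as_device(&self) -> Device {
        Device {
            device_type: DeviceType::Cpu,
            device: self as *const HostDevice as *const c_void,
        }
    }
}

// The buffer stores a `Device` value, so it needs no extra
// generic parameter for the device type.
struct Buffer<T> {
    data: Vec<T>,
    device: Device,
}
```

The trade-off in this sketch is that `Buffer<T>` stays generic over the element type only, at the cost of carrying a raw pointer whose validity the caller must guarantee.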
Adds a Buffer to the “cache chain”.
Subsequent calls return this Buffer
if the corresponding internal count matches the id used in the cache.
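
The count-based lookup can be illustrated with a small sketch. It assumes a cache keyed by a running counter and uses plain `Vec<f32>` values in place of real buffers; `Cache`, `count`, `buffers`, and `get` are hypothetical names, not the library's API.

```rust
use std::collections::HashMap;

// Hypothetical cache: buffers are keyed by the value of a running counter.
struct Cache {
    count: usize,
    buffers: HashMap<usize, Vec<f32>>,
}

impl Cache {
    fn new() -> Self {
        Cache { count: 0, buffers: HashMap::new() }
    }

    // Returns the buffer stored under the current count, allocating a
    // zeroed one on a miss, and advances the count for the next call.
    fn get(&mut self, len: usize) -> &mut Vec<f32> {
        let id = self.count;
        self.count += 1;
        self.buffers.entry(id).or_insert_with(|| vec![0.0; len])
    }
}
```

In this sketch, a call only returns a previously stored buffer when the counter has the same value it had when that buffer was added, which is why the counter has to return to its starting value before a sequence of operations is repeated.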
Compute operations on the CPU even though the matrix was created with an OpenCL device.
Some optimizations for unified memory architectures are implemented.
inclusive range
Used to reset the cache count in loops. Every operation increases the cache count, which would break the “cache cycle” if the count were not reset.
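
One way such a resetting range could work is sketched below. This is an assumption-laden illustration: the thread-local `COUNT`, the `bump_count` helper, and the `CountingRange` type with its `range` constructor are hypothetical names, and the library's real implementation may differ.

```rust
use std::cell::Cell;
use std::ops::RangeInclusive;

thread_local! {
    // Hypothetical per-thread cache count.
    static COUNT: Cell<usize> = Cell::new(0);
}

// Stands in for an operation: returns the current count as its
// cache id and increases the count by one.
fn bump_count() -> usize {
    COUNT.with(|c| {
        let id = c.get();
        c.set(id + 1);
        id
    })
}

// Wraps an inclusive range and remembers the count at creation time.
struct CountingRange {
    inner: RangeInclusive<usize>,
    start_count: usize,
}

fn range(inner: RangeInclusive<usize>) -> CountingRange {
    CountingRange { inner, start_count: COUNT.with(|c| c.get()) }
}

impl Iterator for CountingRange {
    type Item = usize;

    // Resets the count before every iteration, so each pass through
    // the loop body sees the same sequence of cache ids.
    fn next(&mut self) -> Option<usize> {
        COUNT.with(|c| c.set(self.start_count));
        self.inner.next()
    }
}
```

Used as `for _ in range(0..=2) { ... }`, every iteration of the loop body starts from the same count, so operations inside the loop would map to the same cache entries on each pass.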