Cyclops is a parallel (distributed-memory) numerical library for multidimensional arrays (tensors) in C++ and Python. Quick documentation links: C++ and Python. Broadly, Cyclops provides tensor ...
The given task was to implement the Jacobi method in several versions: a serial CPU function, an un-optimized CUDA kernel, and an optimized version of the CUDA kernel. The Jacobi method was ...