CUDA + MPI based framework for parallel data aggregation
docker cmake hpc mpi cuda high-performance-computing prefix-sum gpu-computing openmpi mpich parallel-programming performance-benchmarking cpp20 scan-algorithm nvidia-nsight cuda-gdb
-
Updated
May 20, 2026 - C++