Skip to content

Add CUDA version#1

Closed
ipatix wants to merge 6 commits into
HPC-Dwarfs:mainfrom
ipatix:main
Closed

Add CUDA version#1
ipatix wants to merge 6 commits into
HPC-Dwarfs:mainfrom
ipatix:main

Conversation

@ipatix
Copy link
Copy Markdown

@ipatix ipatix commented May 7, 2026

This PR adds a CUDA version for RabbitCT. The implemention is fairly naive and based on the original LolaBunny.c.

ipatix added 6 commits May 6, 2026 15:21
Although double is not really a problem on the datacenter GPUs, the
register spilling by the large values increases register pressure. Using
singles reduces the required register count to <= 32, thus gives a
better occupancy.
This one uses CUDA arrays to use texture fetching hardware.
Precision is not as good, but the performance is better.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants