Skip to content

Support size_per_head=112#660

Merged
byshiue merged 3 commits into
NVIDIA:mainfrom
dskhudia:support_112
Jun 29, 2023
Merged

Support size_per_head=112#660
byshiue merged 3 commits into
NVIDIA:mainfrom
dskhudia:support_112

add support for size_per_head=112 for gpt decoder

7a71b18
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs