Skip to content

Add TurboQuant3/4 modes to quantized_scaled_dot_product_attention#3453

Closed
dedalien wants to merge 23 commits into
ml-explore:mainfrom
dedalien:turboq/integrate-generic-quant-sdpa
Closed

Add TurboQuant3/4 modes to quantized_scaled_dot_product_attention#3453
dedalien wants to merge 23 commits into
ml-explore:mainfrom
dedalien:turboq/integrate-generic-quant-sdpa

tests: add test_quantized_sdpa_turbo for turbo3/turbo4

2a12f86
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs