Skip to content

feat: add mx.fast.turboquant_attention for compressed KV cache#3340

Closed
yzamari wants to merge 8 commits into
ml-explore:mainfrom
yzamari:feature/turboquant-attention
Closed

feat: add mx.fast.turboquant_attention for compressed KV cache#3340
yzamari wants to merge 8 commits into
ml-explore:mainfrom
yzamari:feature/turboquant-attention

feat: add 4-bit quantization support to turboquant_attention

08d7fe6
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs