perf(rms_norm): use fused reduce_l2_norm path (~48× faster) #1
Closed
sbryngelson wants to merge 2 commits into