|2024.11| 🔥🔥[**SkipCache**] Accelerating Vision Diffusion Transformers with Skip Branches(@SJTU)|[[pdf]](https://arxiv.org/pdf/2411.17616)|[[Skip-DiT]](https://github.com/OpenSparseLLMs/Skip-DiT)|⭐️⭐️ |
|2024.12| 🔥🔥[**DuCa**] Accelerating Diffusion Transformers with Dual Feature Caching(@SJTU)|[[pdf]](https://arxiv.org/pdf/2412.18911)|[[DuCa]](https://github.com/Shenyi-Z/DuCa)|⭐️⭐️ |
|2025.01| 🔥🔥[**FBCache**] Fastest HunyuanVideo Inference with Context Parallelism and First Block Cache on NVIDIA L20 GPUs(@chengzeyi)|[[docs]](https://github.com/chengzeyi/ParaAttention/blob/main/doc/fastest_hunyuan_video.md)|[[ParaAttention]](https://github.com/chengzeyi/ParaAttention)|⭐️⭐️ |
## 📙Awesome Diffusion Distributed Inference with Multi-GPUs