Commit 72e34aa

Add Quantization topic (#26)
1 parent ed823d3 commit 72e34aa

1 file changed

Lines changed: 19 additions & 15 deletions

README.md

@@ -5,8 +5,8 @@
 
 <div align='center'>
 <img src=https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg >
-<img src=https://img.shields.io/github/forks/DefTruth/Awesome-SD-Inference.svg?style=social >
-<img src=https://img.shields.io/github/stars/DefTruth/Awesome-SD-Inference.svg?style=social >
+<img src=https://img.shields.io/github/forks/xlite-dev/Awesome-Diffusion-Inference.svg?style=social >
+<img src=https://img.shields.io/github/stars/xlite-dev/Awesome-Diffusion-Inference.svg?style=social >
 <img src=https://img.shields.io/badge/Release-v0.5-brightgreen.svg >
 <img src=https://img.shields.io/badge/License-GPLv3.0-turquoise.svg >
 </div>
@@ -15,10 +15,10 @@
 
 ## 🤖Contents
 
-- [📙Awesome Diffusion Inference with Sampling](#Sampling)
-- [📙Awesome Diffusion Inference with Caching](#Caching)
-- [📙Awesome Diffusion Inference with Multi-GPUs](#Distributed)
-- [📙Other Awesome Diffusion Inference Paper with codes](#Others)
+- [📙Sampling](#Sampling)
+- [📙Caching](#Caching)
+- [📙Multi-GPUs](#Distributed)
+- [📙Quantization](#Quantization)
 
 
 ## ©️Citations
@@ -34,7 +34,7 @@
 ```
 
 
-## 📙Awesome Diffusion Inference with Sampling
+## 📙 Sampling
 
 <div id="Sampling"></div>
 
@@ -54,7 +54,7 @@
 |2024.08| 🔥[**StepSaver**] StepSaver: Predicting Minimum Denoising Steps for Diffusion Model Image Generation(@intel) | [[pdf]](https://arxiv.org/pdf/2408.02054) |⚠️|⭐️⭐️ |
 |2024.09| 🔥[**DC-Solver**] DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation(@Tsinghua University)| [[pdf]](https://arxiv.org/pdf/2409.03755v1) | [[DC-Solver]](https://github.com/wl-zhao/DC-Solver) ![](https://img.shields.io/github/stars/wl-zhao/DC-Solver.svg?style=social) |⭐️⭐️ |
 
-## 📙Awesome Diffusion Inference with Caching
+## 📙 Caching
 
 <div id="Caching"></div>
 
@@ -95,7 +95,7 @@
 |2025.01| 🔥🔥[**Token Pruning**] Token Pruning for Caching Better: 9× Acceleration on Stable Diffusion for Free(@SJTU) | [[pdf]](https://arxiv.org/pdf/2501.00375) | [[DaTo]](https://github.com/EvelynZhang-epiclab/DaTo) ![](https://img.shields.io/github/stars/EvelynZhang-epiclab/DaTo.svg?style=social)|⭐️⭐️ |
 |2025.04| 🔥🔥[**AB-Cache**] AB-Cache: Training-Free Acceleration of Diffusion Models via Adams-Bashforth Cached Feature Reuse(@USTC) | [[pdf]](https://arxiv.org/pdf/2504.10540) | ⚠️|⭐️⭐️ |
 
-## 📙Awesome Diffusion Distributed Inference with Multi-GPUs
+## 📙 Multi-GPUs
 
 <div id="Distributed"></div>
 
@@ -116,9 +116,10 @@
 |2024.05 | 🔥🔥[**TensorRT-LLM SDXL**] SDXL Distributed Inference with TensorRT-LLM and synchronous comm(@Zars19) | [[pdf]](https://arxiv.org/abs/2402.19481) | [[SDXL-TensorRT-LLM]](https://github.com/NVIDIA/TensorRT-LLM/pull/1514) ![](https://img.shields.io/github/stars/NVIDIA/TensorRT-LLM.svg?style=social)| ⭐️⭐️ |
 |2024.06| 🔥🔥[**Clip Parallelism**] Video-Infinity: Distributed Long Video Generation(@nus.edu)|[[pdf]](https://arxiv.org/pdf/2406.16260) | [[Video-Infinity]](https://github.com/Yuanshi9815/Video-Infinity) ![](https://img.shields.io/github/stars/Yuanshi9815/Video-Infinity.svg?style=social)|⭐️⭐️ |
 |2024.05| 🔥🔥[**FIFO-Diffusion**] FIFO-Diffusion: Generating Infinite Videos from Text without Training(@Seoul National University)|[[pdf]](https://arxiv.org/pdf/2405.11473) | [[FIFO-Diffusion]](https://github.com/jjihwan/FIFO-Diffusion_public) ![](https://img.shields.io/github/stars/jjihwan/FIFO-Diffusion_public.svg?style=social) |⭐️⭐️ |
+|2025.01| 🔥🔥[**ParaAttention**] Context parallel attention that accelerates DiT model inference with dynamic caching(@chengzeyi)| [[docs]](https://github.com/chengzeyi/ParaAttention) | [[ParaAttention]](https://github.com/chengzeyi/ParaAttention) ![](https://img.shields.io/github/stars/chengzeyi/ParaAttention.svg?style=social)|⭐️⭐️ |
 
-## 📙Other Awesome Diffusion Inference Paper with codes
-<div id="Others"></div>
+## 📙 Quantization
+<div id="Quantization"></div>
 
 |Date|Title|Paper|Code|Recom|
 |:---:|:---:|:---:|:---:|:---:|
@@ -130,6 +131,9 @@
 |2024.08| 🔥[**Diffusion-RWKV**] Diffusion-RWKV: Scaling RWKV-Like Architectures for Diffusion Models(@Zhengcong Fei)|[[pdf]](https://arxiv.org/pdf/2404.04478) | [[Diffusion-RWKV]](https://github.com/feizc/Diffusion-RWKV) ![](https://img.shields.io/github/stars/feizc/Diffusion-RWKV.svg?style=social)|⭐️⭐️ |
 |2024.09| 🔥[**LinFusion**] LINFUSION: 1 GPU, 1 MINUTE, 16K IMAGE(@NUS)|[[pdf]](https://arxiv.org/pdf/2409.02097) | [[LinFusion]](https://github.com/Huage001/LinFusion) ![](https://img.shields.io/github/stars/Huage001/LinFusion.svg?style=social)|⭐️⭐️ |
 |2024.11| 🔥🔥[**SVDQuant**] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models|[[pdf]](https://arxiv.org/pdf/2411.05007) | [[nunchaku]](https://github.com/mit-han-lab/nunchaku) ![](https://img.shields.io/github/stars/mit-han-lab/nunchaku.svg?style=social)|⭐️⭐️ |
+|2024.10|🔥🔥[**SageAttention**] SAGEATTENTION: ACCURATE 8-BIT ATTENTION FOR PLUG-AND-PLAY INFERENCE ACCELERATION(@thu-ml)|[[pdf]](https://arxiv.org/pdf/2410.02367)|[[SageAttention]](https://github.com/thu-ml/SageAttention) ![](https://img.shields.io/github/stars/thu-ml/SageAttention) | ⭐️⭐️ |
+|2024.11|🔥🔥[**SageAttention-2**] SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization(@thu-ml)|[[pdf]](https://arxiv.org/pdf/2411.10958)|[[SageAttention]](https://github.com/thu-ml/SageAttention) ![](https://img.shields.io/github/stars/thu-ml/SageAttention) | ⭐️⭐️ |
+|2025.03|🔥🔥[**SpargeAttention**] SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference(@thu-ml)|[[pdf]](https://arxiv.org/pdf/2502.18137)|[[SpargeAttn]](https://github.com/thu-ml/SpargeAttn) ![](https://img.shields.io/github/stars/thu-ml/SpargeAttn) | ⭐️⭐️ |
 
 ## ©️License
 
@@ -140,11 +144,11 @@ GNU General Public License v3.0
 Welcome to star & submit a PR to this repo!
 
 <div align='center'>
-<a href="https://star-history.com/#xlite-dev/Awesome-SD-Inference&Date">
+<a href="https://star-history.com/#xlite-dev/Awesome-Diffusion-Inference&Date">
 <picture align='center'>
-<source media="(prefers-color-scheme: dark)" srcset="https://api.star-history.com/svg?repos=xlite-dev/Awesome-SD-Inference&type=Date&theme=dark" />
-<source media="(prefers-color-scheme: light)" srcset="https://api.star-history.com/svg?repos=xlite-dev/Awesome-SD-Inference&type=Date" />
-<img width="350" height="250" alt="Star History Chart" src="https://api.star-history.com/svg?repos=xlite-dev/Awesome-SD-Inference&type=Date" />
+<source media="(prefers-color-scheme: dark)" srcset="https://api.star-history.com/svg?repos=xlite-dev/Awesome-Diffusion-Inference&type=Date&theme=dark" />
+<source media="(prefers-color-scheme: light)" srcset="https://api.star-history.com/svg?repos=xlite-dev/Awesome-Diffusion-Inference&type=Date" />
+<img width="350" height="250" alt="Star History Chart" src="https://api.star-history.com/svg?repos=xlite-dev/Awesome-Diffusion-Inference&type=Date" />
 </picture>
 </a>
 </div>
