@@ -15,17 +15,18 @@ But both the stages of training use VQVAE only. One can easily change that to va
 For diffusion part, as of now it only implements DDPM with linear schedule.
 
 
-## Stable Diffusion Tutorial Video
-### Unconditional
+## Stable Diffusion Tutorial Videos
 <a href="https://www.youtube.com/watch?v=1BkzNb3ejK4">
   <img alt="Stable Diffusion Tutorial" src="https://github.com/explainingai-code/StableDiffusion-PyTorch/assets/144267687/7a24d114-38bd-43a8-9819-3afa112f39ab"
    width="400">
 </a>
-
-### Conditional
-
+<a href="https://www.youtube.com/watch?v=hEJjg7VUA8g">
+  <img alt="Stable Diffusion Conditioning Tutorial" src="https://github.com/explainingai-code/StableDiffusion-PyTorch/assets/144267687/0b03cae4-4009-4bd7-8b02-328b0c5f33a3"
+   width="400">
+</a>
 ___
 
+
 ## Sample Output for Autoencoder on CelebHQ
 Image - Top, Reconstructions - Below
 
@@ -37,9 +38,22 @@ Image - Top, Reconstructions - Below
 
 ## Sample Output for Conditional LDM
 ### Sample Output for Class Conditioned on MNIST
-### Sample Output for Text Conditioned on CelebHQ (not converged)
-### Sample Output for Mask Conditioned on CelebHQ (not converged)
-### Sample Output for Text and Mask Conditioned on CelebHQ (not converged)
+![50](https://github.com/explainingai-code/StableDiffusion-PyTorch/assets/144267687/46a38d36-3770-4f40-895a-95a16dc6462a)
+![50](https://github.com/explainingai-code/StableDiffusion-PyTorch/assets/144267687/1562c41d-e6ff-41cf-8d1e-6909a4240a04)
+![50](https://github.com/explainingai-code/StableDiffusion-PyTorch/assets/144267687/0cde44a6-746b-4f05-9422-9604f9436d91)
+![50](https://github.com/explainingai-code/StableDiffusion-PyTorch/assets/144267687/7d6b8db2-dab4-4a17-9fe6-570d938669f6)
+![50](https://github.com/explainingai-code/StableDiffusion-PyTorch/assets/144267687/6ecc3c61-3668-4305-aa4a-0f0e3cf815a0)
+
+### Sample Output for Text(using CLIP) and Mask Conditioned on CelebHQ (not converged)
+<img src="https://github.com/explainingai-code/StableDiffusion-PyTorch/assets/144267687/78aa92bb-655e-46f6-92e9-a0c59787d700" width="100">
+
+Text - She is a woman with blond hair
+
+<img src="https://github.com/explainingai-code/StableDiffusion-PyTorch/assets/144267687/9fcf66fb-65d7-4c2c-9fc4-9bbe428de11f" width="100">
+
+Text - She is a woman with black hair
+
+<img src="https://github.com/explainingai-code/StableDiffusion-PyTorch/assets/144267687/44ad67c8-c1e1-4ade-936f-18da92202e55" width="100">
 
 ___
 