sd3_vae_anime_ft / README.md
metadata
datasets:
  - Disty0/danbooru_curated-jxl_lossless_4mp
base_model:
  - stabilityai/stable-diffusion-3.5-medium
  - stabilityai/stable-diffusion-3-medium-diffusers
pipeline_tag: text-to-image
library_name: diffusers

This VAE was finetuned on PNG-only anime illustrations for 512 steps with MAE (mean absolute error, i.e. L1) loss.
Training used fp32 weights with fp16 mixed precision, a learning rate of 4e-6, and an effective batch size of 16.
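
To make the two training terms above concrete, here is a small dependency-free sketch of what MAE loss and "effective batch size" mean. It is not the training code used for this model. The helper names, the 4 x 4 split of the batch size, and the reading of "steps" as optimizer steps are all assumptions made for illustration.

```python
def mae_loss(reconstructed, original):
    """Mean absolute error between two equal-length sequences of pixel values."""
    if len(reconstructed) != len(original):
        raise ValueError("inputs must have the same length")
    total = sum(abs(r - o) for r, o in zip(reconstructed, original))
    return total / len(original)


def effective_batch_size(per_step_batch, grad_accum_steps, num_devices=1):
    """Number of samples that contribute to one optimizer update."""
    return per_step_batch * grad_accum_steps * num_devices


def samples_seen(optimizer_steps, batch_size):
    """Total samples processed, assuming one full effective batch per step."""
    return optimizer_steps * batch_size


# Hypothetical split: 4 images per forward pass, 4 accumulation steps, 1 GPU.
# Only the product (16) comes from this model card; the split is assumed.
batch = effective_batch_size(per_step_batch=4, grad_accum_steps=4)
total_images = samples_seen(optimizer_steps=512, batch_size=batch)
```

Under these assumptions the run would have processed 512 x 16 = 8192 image samples in total, which is a very short finetune.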

This training run was meant to test my VAE decoder training code, but the 512-step model turned out better than I expected and fixes the color shifting issues of the original SD3 VAE pretty well.
I stopped messing with SD3 after a while, but I decided to release this VAE finetune instead of deleting it.

Original Image:

orig.png

Original SD3 VAE:

vae_orig.png

Anime VAE Finetune:

vae_ft.png
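
Usage sketch:

Since the metadata lists `diffusers` as the library, a minimal way to try this VAE might be to swap it into an SD3 pipeline, as sketched below. The repo id `Disty0/sd3_vae_anime_ft` is inferred from the page title and author name, and the code assumes the VAE config and weights sit at the repo root. Both are assumptions, as are the helper function names, the prompt, and the sampler settings. The base model is gated, so it may require accepting its license and logging in first.

```python
VAE_REPO = "Disty0/sd3_vae_anime_ft"  # assumed repo id, not confirmed by the card
BASE_REPO = "stabilityai/stable-diffusion-3.5-medium"  # listed under base_model


def load_pipeline_with_vae(vae_repo=VAE_REPO, base_repo=BASE_REPO, device="cuda"):
    """Load an SD3 pipeline whose VAE is replaced by the finetuned one."""
    # Imported lazily so the module can be inspected without these packages.
    import torch
    from diffusers import AutoencoderKL, StableDiffusion3Pipeline

    # Assumes config.json and the weights are at the root of the VAE repo.
    vae = AutoencoderKL.from_pretrained(vae_repo, torch_dtype=torch.float16)
    pipe = StableDiffusion3Pipeline.from_pretrained(
        base_repo, vae=vae, torch_dtype=torch.float16
    )
    return pipe.to(device)


def generate(prompt, out_path="sample.png", steps=28):
    """Generate one image and save it; prompt and step count are illustrative."""
    pipe = load_pipeline_with_vae()
    image = pipe(prompt, num_inference_steps=steps).images[0]
    image.save(out_path)
    return out_path
```

Calling `generate("an anime illustration of a girl under cherry blossoms")` would download both repos on first use. Because only the VAE is swapped, the same prompt and seed can be rerun with the stock VAE to reproduce a comparison like the one above.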