Trained on PonyDiffusion-SDXL_v6 with ~200 images.
Training Config (kohya):
lr 2e-4
frozen text encoder
batch 2
GA 4
lr scheduler cosine
dim 16
alpha 8
conv_dim 8
conv_alpha 4
tag dropout 15%
network dropout 30%
res 1024 bucket max 1024
shuffle captions
flip
ip_noise_gamma 0.02
min_snr_gamma 4
Trained on PonyDiffusion-SDXL_v6 with ~200 images.
Training Config (kohya):
lr 2e-4
frozen text encoder
batch 2
GA 4
lr scheduler cosine
dim 16
alpha 8
conv_dim 8
conv_alpha 4
tag dropout 15%
network dropout 30%
res 1024 bucket max 1024
shuffle captions
flip
ip_noise_gamma 0.02
min_snr_gamma 4