So I ran into an interesting development. I trained without the text encoder, and it got BETTER results. Maybe Kohya is training the dual text encoders wrong? Who knows.