The total dataset is made of 46 pictures. V2 was trained on Stable diffusion 2.1 768. I used StableTuner to do the training, using full caption on the pictures with almost no recurring word outside the main concept. 6 epochs of 40 repeats on LR 1e-6 were used, with prior preservation.
The total dataset is made of 46 pictures. V2 was trained on Stable diffusion 2.1 768. I used StableTuner to do the training, using full caption on the pictures with almost no recurring word outside the main concept. 6 epochs of 40 repeats on LR 1e-6 were used, with prior preservation.