Trained the UNet and CLIP separately.
Less focus on a cinematic look and more focus on anatomy and prompt comprehension.