One-sentence update summary: HelloWorld 7.0 is an iterative refinement with the most accurate body rendering in the series so far, plus a broader concept scope and richer detail.
Update details:
By adding negative training images, strengthening pose training, and optimizing the CLIP model, the accuracy of limbs and hands has been improved over previous versions. The recommended negative prompt is: "bad hand, bad anatomy, worst quality, ai generated images, low quality, average quality".
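A minimal sketch of how the recommended negative prompt could be assembled in a generation script. The helper name `build_negative_prompt` and its `extra_terms` parameter are hypothetical and not part of any HelloWorld tooling; only the six terms themselves come from the notes above.

```python
# Recommended negative prompt terms, copied from the release notes.
RECOMMENDED_NEGATIVE_TERMS = [
    "bad hand",
    "bad anatomy",
    "worst quality",
    "ai generated images",
    "low quality",
    "average quality",
]


def build_negative_prompt(extra_terms=()):
    """Join the recommended terms with optional extra terms.

    Hypothetical helper for illustration: terms are lower-cased,
    stripped, and de-duplicated while keeping their original order.
    """
    seen = set()
    terms = []
    for term in list(RECOMMENDED_NEGATIVE_TERMS) + list(extra_terms):
        cleaned = term.strip().lower()
        if cleaned and cleaned not in seen:
            seen.add(cleaned)
            terms.append(cleaned)
    return ", ".join(terms)


negative_prompt = build_negative_prompt()
```

The resulting string can be passed to whatever negative-prompt field your UI or pipeline exposes; extra terms of your own are appended after the recommended ones.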
Extracted the fine-tuned LoRA from the official SPO model and merged it into HelloWorld 7.0. SPO is a further improvement on the DPO method, and the SPO base model performs better than both the DPO XL base model and the original SDXL base model. The SPO LoRA enhances image detail and contrast and makes images more aesthetically pleasing. Thanks to the technical team behind SPO.
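For readers unfamiliar with what merging a LoRA into a checkpoint means: the usual approach adds a scaled low-rank product to each affected weight matrix, W' = W + scale * (up @ down). The notes do not say which tool or merge strength was used for HelloWorld 7.0, so the following is only a toy sketch of that general idea; the function names, the 2x2 matrices, and the `scale` value are all illustrative assumptions.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def merge_lora(weight, lora_up, lora_down, scale=1.0):
    """Return weight + scale * (lora_up @ lora_down) as a new matrix.

    Illustrative only: real checkpoints hold many such matrices and
    are merged with tensor libraries, not nested lists.
    """
    delta = matmul(lora_up, lora_down)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


# Toy 2x2 layer with a rank-1 LoRA (made-up numbers).
weight = [[1.0, 0.0], [0.0, 1.0]]
lora_up = [[1.0], [2.0]]      # shape (2, 1)
lora_down = [[0.5, -0.5]]     # shape (1, 2)

merged = merge_lora(weight, lora_up, lora_down, scale=0.5)
```

Because the update is baked into the weights, the merged model needs no separate LoRA file at inference time.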
Continued to expand the concept scope of the training set while pruning and streamlining it (fine-tuning on a large training set is too expensive, H800s have been hard to rent recently, and local training takes more time than I can afford). The training set now totals 20,821 images. The training set resolution distribution is as follows; for output, it is recommended to use the resolutions that have the most training images:
Used GPT-4o to re-label the entire dataset. This time a structured labeling format was used: "one-sentence summary description + multiple image element tags + inspired by XXX + aesthetic quality description words", where the aesthetic quality description is one of five levels: worst quality, low quality, average quality, best quality, and masterpiece. A typical labeling example is as follows:
conceptual art featuring a human hand wrapped in red and beige ribbons, isolated against a plain, light background, realistic style, minimalist color scheme, smooth textures, elongated and surreal aesthetic, inspired by salvador dalí's surrealist works, masterpiece
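The labeling structure described above can be sketched as a small caption builder. This is not the author's labeling pipeline; the function name `build_caption`, its parameters, and the way the example caption is split into summary and tags are assumptions made for illustration.

```python
# The five aesthetic quality levels named in the release notes.
QUALITY_LEVELS = (
    "worst quality",
    "low quality",
    "average quality",
    "best quality",
    "masterpiece",
)


def build_caption(summary, tags, inspired_by, quality):
    """Assemble: summary + element tags + "inspired by XXX" + quality level.

    Hypothetical helper; raises ValueError for an unknown quality level.
    """
    if quality not in QUALITY_LEVELS:
        raise ValueError(f"unknown quality level: {quality!r}")
    parts = [summary.strip()]
    parts.extend(tag.strip() for tag in tags if tag.strip())
    if inspired_by:
        parts.append(f"inspired by {inspired_by.strip()}")
    parts.append(quality)
    return ", ".join(parts)


# Rebuilds the example caption above (the summary/tag split is a guess).
example = build_caption(
    summary="conceptual art featuring a human hand wrapped in red and beige "
            "ribbons, isolated against a plain, light background",
    tags=[
        "realistic style",
        "minimalist color scheme",
        "smooth textures",
        "elongated and surreal aesthetic",
    ],
    inspired_by="salvador dalí's surrealist works",
    quality="masterpiece",
)
```

When prompting, following the same order (description, element tags, an "inspired by" style reference, then a quality word) keeps prompts close to the format the model saw during training.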
The "High-Frequency Tagging Word List" and the "High-Frequency Art Style List" (the styles used in the "inspired by XXX" field) for HelloWorld 7.0 will be provided only to commercially licensed users. Partners who have previously purchased a license for the HelloWorld XL series models can get them for free; please contact me if you have not received them.
Everyone else can refer to the High-Frequency Tagging Word List of HelloWorld 6.0. In addition, I have provided 150+ high-quality HelloWorld 7.0 example images in the gallery, which can serve as a reference for your own outputs. Making models is not easy; thank you for your understanding and patience!
HelloWorld 7.0 Update - June 13, 2024