Tensor.Art
Create

playground-v2-512px-base-anime-finetune

CHECKPOINT
Reprint


Updated:

playground-v2-512px-base-anime-finetune

■This is an experimental fine tuning.

I trained using onetrainer.

Fine-tuning is performed on a 100,000-image dataset that mainly contains anime images, but also some realistic and AI images. The training resolution is 512px.

I would like to share the possibilities of playground v2 512px base with everyone.

It is the same as SDXL, so you can download it and use it immediately.

The advantage of this model is 512px, so I thought it would be ideal if you want to train SDXL architecture but have problems such as lack of VRAM.

I think this model may be a good choice for those who want to use the SDXL architecture but feel that the generated size of 1024px is too large or want to generate at 512px.

Fine tuning is done at 512px.An advantage is that there is no need to prepare a 1024px dataset. You can use the dataset that has been used in SD1.5 so far, so it is less burdensome.Training time can also be reduced.

1024px eats up training time, cache time, cache space, VRAM, hard disk, etc...

It's 4 times faster than 1024px. I'm sorry if my calculations are wrong... Learning is fast and fun since you can get the benefits of SDXL architecture even at low resolutions.

This model may have potential.

■Please be careful as sexual images are also generated.

There are cases where the look of something realistic or AI comes out strongly.

It might be a good idea to add "realistic" to the negative prompt.

"blush" This tag may be effective as it forces an anime style.

This is a very strong tag, so putting it near the beginning may be too strong.

On the other hand, it might be fun to try something other than anime.

New discoveries are made in areas that were not originally intended.

If you have created your favorite image, please share it with us!

It's okay not to expect perfection too much.This model is still immature.The broken results are more interesting!

It would be interesting to generate various tags using something that can automatically generate tags.

■The standard size for this model is 512px

A ratio like 512x768 like SD1.5 is suitable.

768px 1024px is not trained, so the result will be disastrous.

If you set it to a large size when doing i2i, it will fail.

The limit would be 1.5x magnification and denoise 0.5.

I like dpmpp_sde step:12 cfg:3-5. Euler a is also stable and good. The generation speed will also be faster.

i2i can raise the cfg as much as you want. At around cfg15, contrast and detail become more prominent.

■I am training with the danbooru tag.

A small number of tags will produce a disastrous result.The tags that are often used in danbooru and SD are the quality tags for this model.

We are only learning general tags such as 1gril, and we are not training artist or anime work tags.

I would be happy if you could give me your opinion on what datasets I would like to have if I continue training in the future.

The order of tags is important. Every tag has a unique image.

The more popular the tag, the better the quality may be, but the image will be reflected more strongly, so it is also effective to offset it with other tags or change the order to dilute it.

"Looking at viewer" can easily be of high quality.

I'm training without adding the "nsfw" tag, but I feel like it's effective for some reason...

■It's an incomplete and very difficult model, but if you're interested, please give it a try. I'm not very good with prompts, so if you can generate interesting results, please share them so I can make this model even stronger.

Your feedback will motivate us to train on a wider range of datasets.

There are still tags that have not yet been learned, so more diverse expressions will be possible.

■Great pre-trained model used for fine tuning.

https://huggingface.co/playgroundai/playground-v2-512px-base

If you have any questions, please feel free to ask!

日本語での質問も大丈夫ですのでご気軽にお声がけください~

Version Detail

Playground v2
■"Pruned Model fp16 (6.46 GB): for inference and merging"

Project Permissions

Model reprinted from : https://civitai.com/models/453991?modelVersionId=505439

Reprinted models are for communication and learning purposes only, not for commercial use. Original authors can contact us to transfer the models through our Discord channel --- #claim-models.

Comments

Related Posts

The image generated by the model, publishing a post will appear here
Describe the image you want to generate, then press Enter to send.