Tensor.Art
Create

TagGUI - captioning tool for model creators


Updated:

📥 Download | https://github.com/jhc13/taggui

Cross-platform desktop application for quickly adding and editing image tags and captions, aimed towards creators of image datasets for generative AI models like Stable Diffusion.

Features

  • Keyboard-friendly interface for fast tagging

  • Tag autocomplete based on your own most-used tags

  • Integrated Stable Diffusion token counter

  • Automatic caption and tag generation with models including CogVLM, LLaVA, WD Tagger, and many more

  • Batch tag operations for renaming, deleting, and sorting tags

  • Advanced image list filtering

Captioning parameters

Prompt: Instructions given to the captioning model. Prompt formats are handled automatically based on the selected model. You can use the following template variables to dynamically insert information about each image into the prompt:

  • {tags}: The tags of the image, separated by commas.

  • {name}: The file name of the image without the extension.

  • {directory} or {folder}: The name of the directory containing the image.

An example prompt using a template variable could be Describe the image using the following tags as context: {tags}. With this prompt, {tags} would be replaced with the existing tags of each image before the prompt is sent to the model.

Start caption with: Generated captions will start with this text.

Remove tag separators in caption: If checked, tag separators (commas by default) will be removed from the generated captions.

Discourage from caption: Words or phrases that should not be present in the generated captions. You can separate multiple words or phrases with commas (,). For example, you can put appears,seems,possibly to prevent the model from using an uncertain tone in the captions. The words may still be generated due to limitations related to tokenization.

Include in caption: Words or phrases that should be present somewhere in the generated captions. You can separate multiple words or phrases with commas (,). You can also allow the captioning model to choose from a group of words or phrases by separating them with |. For example, if you put cat,orange|white|black, the model will attempt to generate captions that contain the word cat and either orange, white, or black. It is not guaranteed that all of your specifications will be met.

Tags to exclude (WD Tagger models): Tags that should not be generated, separated by commas.

Many of the other generation parameters are described in the Hugging Face documentation.

0

Comments