diff --git a/README.MD b/README.MD index 78160c2..2f0fa86 100644 --- a/README.MD +++ b/README.MD @@ -24,6 +24,8 @@ See clip_rename.bat for an example to chain captioning and renaming together. [Training](https://github.com/victorchall/EveryDream-trainer) (separate repo) - Fine tuning with captioned training and ground truth data (needs 24GB GPU). +[Image Caption GUI and video frame extractors](./doc/CAPTION_GUI.md) and courtesy of [MStevenson](https://github.com/mstevenson/) + ## Install You can use conda or venv. This was developed on Python 3.10.5 but may work on older newer versions. diff --git a/doc/CAPTION_GUI.md b/doc/CAPTION_GUI.md new file mode 100644 index 0000000..0540b69 --- /dev/null +++ b/doc/CAPTION_GUI.md @@ -0,0 +1,16 @@ +# MSteven's tools + +## Caption GUI + + python scripts/image_caption_gui.py + +Python GUI tool to manually caption images for machine learning. + +A sidecar file is created for each image with the same name and a .txt extension. These are compatible with EveryDream Trainer. + +### Controls: +[control/command + o] to open a folder of images. +[page down] and [page up] to go to next and previous images. Hold shift to skip 10 images. +[shift + home] and [shift + end] to go to first and last images. +[shift + delete] to move the current image into a '_deleted' folder. +[escape] to exit the app. \ No newline at end of file