update gpu req readme for auto-caption

Victor Hall 2022-10-30 22:01:38 -04:00
parent 814440c288
commit 25cc3fdb3b
1 changed files with 1 additions and 1 deletions

@@ -2,7 +2,7 @@
 Automatic captioning uses Salesforce's BLIP to automatically create a clean sentence structure for captioning input images before training.
-This requires an Nvidia GPU with about 860MB of available VRAM. It should run fine on something like a 1050 2GB.
+This requires an Nvidia GPU, but is not terribly intensive work. It should run fine on something like a 1050 Ti 4GB.
 Images should be **square** (1:1 H:W ratio), but they can be any size. I suggest using [Birme](https://www.birme.net/?target_width=512&target_height=512&auto_focal=false&image_format=webp&quality_jpeg=95&quality_webp=99) to crop and resize first, but there are various tools out there for this. I strongly suggest making sure to crop well for training!
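
The README text in this hunk asks for square (1:1) images. As a rough illustration only, and not part of this commit or the repository, the following sketch computes a centered square crop box from an image's width and height. The function name `center_square_box` is hypothetical; the returned tuple uses the (left, upper, right, lower) ordering that Pillow's `Image.crop` accepts, in case you want to feed it to that library. A centered crop is an assumption here and will not always frame the subject well, so manual cropping may still be the better choice.

```python
def center_square_box(width: int, height: int) -> tuple[int, int, int, int]:
    """Return a (left, upper, right, lower) box for a centered 1:1 crop.

    Hypothetical helper for illustration; it only does the arithmetic
    and does not open or modify any image file.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")

    # The square's side is the shorter of the two dimensions.
    side = min(width, height)

    # Split the excess on the longer dimension evenly between both edges.
    left = (width - side) // 2
    upper = (height - side) // 2

    return (left, upper, left + side, upper + side)


if __name__ == "__main__":
    # A landscape 800x600 image loses 100 px from each side.
    print(center_square_box(800, 600))
```

An image that is already square comes back with a box covering the whole frame, so the helper can be applied to every file without checking the ratio first.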