update gpu req readme for auto-caption
This commit is contained in:
parent
814440c288
commit
25cc3fdb3b
|
@ -2,7 +2,7 @@
|
|||
|
||||
Automatic captioning uses Salesforce's BLIP to automatically create a clean sentence structure for captioning input images before training.
|
||||
|
||||
This requires an Nvidia GPU with about 860MB of available VRAM. It should run fine on something like a 1050 2GB.
|
||||
This requires an Nvidia GPU, but is not terribly intensive work. It should run fine on something like a 1050 Ti 4GB.
|
||||
|
||||
Images should be **square** (1:1 H:W ratio), but they can be any size. I suggest using [Birme](https://www.birme.net/?target_width=512&target_height=512&auto_focal=false&image_format=webp&quality_jpeg=95&quality_webp=99) to crop and resize first, but there are various tools out there for this. I strongly suggest making sure to crop well for training!
|
||||
|
||||
|
|
Loading…
Reference in New Issue