From 87bfe652ceb64faea2e18960df7f85a26d75c226 Mon Sep 17 00:00:00 2001 From: Victor Hall Date: Sun, 24 Mar 2024 10:00:00 -0400 Subject: [PATCH] caption cog notebook2 --- CaptionCog.ipynb | 14 ++++++++------ 1 file changed, 8 insertions(+), 6 deletions(-) diff --git a/CaptionCog.ipynb b/CaptionCog.ipynb index 29c9e01..a9c4912 100644 --- a/CaptionCog.ipynb +++ b/CaptionCog.ipynb @@ -6,14 +6,16 @@ "metadata": {}, "source": [ "# Cog Captioning\n", + "\"Open\n", "This notebook is an implementation of [CogVLM](https://github.com/THUDM/CogVLM) for image captioning. \n", "\n", - "This may require HIGH RAM shape on Google Colab, but T4 16gb is enough (even if slow).\n", + "Read [Docs](doc/CAPTION_COG.md) for basic usage guide. \n", "\n", - "1. Read [Docs](doc/CAPTION_COG.md) for basic usage guide. \n", - "2. Open in [Google Colab](https://colab.research.google.com/github/victorchall/EveryDream2trainer/blob/main/CaptionCog.ipynb) **OR** Runpod/Vast using the EveryDream2trainer docker container/template and open this notebook.\n", - "3. Run the cells below to install dependencies.\n", - "4. Place your images in \"input\" folder or change the data_root to point to a Gdrive folder." + "Open in [Google Colab](https://colab.research.google.com/github/victorchall/EveryDream2trainer/blob/main/CaptionCog.ipynb) **OR** use Runpod/Vast/Whatever using the EveryDream2trainer docker container/template and open this notebook.\n", + "\n", + "### Requirements\n", + "Ampere or newer GPU with 16GB+ VRAM (ex. A100, 3090, 4060 Ti 16GB, etc). \n", + "The 4bit quantization loading requires bfloat16 datatype support, which is not supported on older Turing GPUs like the T4 16GB, which rules out using free tier Google Colab.\n" ] }, { @@ -95,7 +97,7 @@ "## Run captions.\n", "\n", "Place your images in \"input\" folder, or you can change the image_dir to point to a Gdrive folder.\n", - "\n" + "Note: Colab may complain that you are running out of disk space, but it should still work.\n" ] }, {