From d6a3ca65cef6228a5bb6b4dcf6eba9d7430aa7ff Mon Sep 17 00:00:00 2001 From: Frederik Fix Date: Wed, 14 Sep 2022 20:09:08 +0200 Subject: [PATCH] fix extract command --- docs/en/training/dataset.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/docs/en/training/dataset.md b/docs/en/training/dataset.md index 8a525cd..373c684 100644 --- a/docs/en/training/dataset.md +++ b/docs/en/training/dataset.md @@ -91,11 +91,11 @@ rsync -r rsync://176.9.41.242:873/danbooru2021/metadata/posts000000000000.json . You should now have two folders named: 512px and metadata. ## Organizing the dataset -Although we have the dataset, the metadata that explains what the image is, is inside the JSON file. In order to extract the data into individual txt files, we are going to use the script inside `` /waifu-diffusion/scripts/danbooru21_extract.py`` +Although we have the dataset, the metadata that explains what the image is, is inside the JSON file. In order to extract the data into individual txt files, we are going to use the script inside ``danbooru_data/local/extractfromjson_danboo21.py`` Assuming you are in the same directory as metadata and 512px folder: ````bash -python /waifu-diffusion/scripts/danbooru21_extract.py +python danbooru_data/local/extractfromjson_danboo21.py -J metadata/posts000000000000.json -E dataset ```` Change "/waifu-diffusion" to the path of the cloned waifu-diffusion repository. This script will also change some tags such as "1girl" to "one girl", "2boys" to "two boys", and so on. It will also add "upoaded on Danbooru".