Training settings have approximately followed this curve as the dataset has grown.
Typical dreambooth training falls on the far left of this graph: the community has many examples of people using 10-80 images and 800-2500 steps for typical face/person training. As I've scaled the Final Fantasy 7 Remake model, I've found a somewhat inverted exponential curve in the steps required as I add data, but I suspect this will flatten out to linear as we zoom out.
Adding more ground truth data will also multiply the line along the Y axis (steps). For example, with 25% training data and 75% ground truth data, the training images are only a quarter of the total set, so I suspect steps/training time will increase by a factor of 4, while better preserving the character of the base model.
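To make that arithmetic concrete, here is a minimal sketch. It assumes the step count scales linearly with the total number of images, so the multiplier is simply 1 / (training fraction). That is my suspicion above, not a measured result. The function names `step_multiplier` and `scaled_steps` and the 2000-step baseline are hypothetical, picked only for illustration.

```python
def step_multiplier(training_fraction: float) -> float:
    """Factor by which steps grow when only `training_fraction` of the
    dataset is training data and the rest is ground truth data.

    Assumption: steps scale linearly with total image count.
    """
    if not 0.0 < training_fraction <= 1.0:
        raise ValueError("training_fraction must be in (0, 1]")
    return 1.0 / training_fraction


def scaled_steps(base_steps: int, training_fraction: float) -> int:
    """Estimated total steps after mixing in ground truth data.

    `base_steps` is the step count that worked with training data alone
    (a hypothetical baseline, not a recommended value).
    """
    return round(base_steps * step_multiplier(training_fraction))


if __name__ == "__main__":
    # 25% training data, 75% ground truth data
    print(step_multiplier(0.25))     # 4.0
    print(scaled_steps(2000, 0.25))  # 8000
```

Under that assumption, a run that needed 2000 steps on training data alone would need roughly 8000 steps at a 25/75 split. If the real curve is not linear, the multiplier will differ.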
My expectation is that the price of adding substantial ground truth data, namely the extra steps and training time, will be repaid in better retention of the base model's quality.