Replies: 5 comments 9 replies
-
Do not confuse diff models with fp16 models. The SafeTensors and 700MB models are all fp16 models. We support official models (5.3G for 1.0, 1.4G for 1.0, 1.4G for 1.1) and fp16 models (0.7G for 1.0, 0.7G for 1.1) in pth, ckpt, or safetensors format. You can still use non-diff models at 700MB and in SafeTensors format even if we remove support for diff models.

A diff model has nothing to do with model size or format. It is a model whose weights are stored as offsets from a base model. The "diff" means the difference between ControlNet and that base model. For example, if the base model is Stable Diffusion 1.5, then the diff is the difference between ControlNet and Stable Diffusion 1.5. If your model is then Realistic Vision, a diff model constructs a ControlNet by adding the diff to Realistic Vision. Many people do not know this, but every time you load a diff model, you are actually merging a new model from ControlNet and your base model.

You may ask what difference this makes and how it really works. The answer is that we do not know, and this is why we are going to end support for it: we cannot ensure its quality. (However, since this may conflict with our principle that all previous results can be reproduced, perhaps we will never really remove it.)

You can find non-diff versions of the 700MB ControlNets 1.0 or 1.1 everywhere on the internet, and they won't emit any logs.
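To make the merge described above concrete, here is a toy sketch of the arithmetic. It is not the extension's loading code: the key name, the tiny weight lists, and the choice to diff every key are assumptions made purely for illustration, and real checkpoints hold large tensors.

```python
# Toy illustration only. The key name and values are hypothetical, and
# treating every key as diffed is an assumption, not the extension's behavior.

def make_diff(controlnet, base):
    """Store each ControlNet weight as an offset from the base model."""
    return {k: [c - b for c, b in zip(controlnet[k], base[k])] for k in controlnet}

def apply_diff(diff, base):
    """Rebuild a ControlNet by adding the stored offsets to a base model."""
    return {k: [d + b for d, b in zip(diff[k], base[k])] for k in diff}

sd15 = {"block.weight": [1.0, 2.0, 3.0]}
controlnet_sd15 = {"block.weight": [1.5, 2.0, 2.0]}
realistic_vision = {"block.weight": [2.0, 2.0, 4.0]}

# The diff is ControlNet minus the base it was trained against.
diff = make_diff(controlnet_sd15, sd15)

# Adding the diff back to the same base recovers the original ControlNet.
restored = apply_diff(diff, sd15)

# Adding it to a different base produces a newly merged ControlNet.
merged = apply_diff(diff, realistic_vision)
```

In this sketch `restored` matches `controlnet_sd15`, while `merged` is a set of weights that was never trained as such, which is the quality concern raised above.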
-
Sigh, the diff model is just a mistake we made in the past, and we need to explain something super complicated every time we run into an issue. Perhaps this time I will write a bit longer so that all future questions can be answered by copying and pasting this.
-
We will begin to consider supporting diff models in the long term if any user succeeds in finding a reason why we should support them.
-
OK, so is the reason the default 1.4GiB models exist as they are that they support older graphics cards, while the fp16 models require newer (20XX?) cards? Does that mean that bf16 could be even smaller? On the diff models specifically, is the implication that an fp16 diff model would be halved again (~350MB)? That would be a selling point to me, given how many models there are and how many you sometimes end up applying in one run. IO to container services is often the biggest delay unless you are doing super long runs (video etc., perhaps). Lastly, excuse my ignorance, but is there any reason not to deliver all the models as safetensors now? I thought most of the GUIs out there now support it (and those that do not probably should, right?).
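A rough sketch of the size arithmetic behind these questions, assuming the 1.4G files store 32-bit floats (the parameter count below is back-derived from that figure, not an official number). Both fp16 and bf16 use 16 bits per value, and a diff stores one offset per weight, so neither would be expected to shrink the file beyond the fp16 size.

```python
# Bytes needed to store one parameter in each format.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2}

def file_size_gb(num_params, dtype):
    """Approximate weight-file size in GB, ignoring metadata overhead."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9

# Assumption: parameter count inferred as 1.4 GB / 4 bytes per fp32 value.
num_params = 350_000_000

sizes = {dtype: file_size_gb(num_params, dtype) for dtype in BYTES_PER_PARAM}
```

Under these assumptions `sizes` comes out to roughly 1.4 GB for fp32 and 0.7 GB for both fp16 and bf16, in line with the model sizes listed earlier in the thread.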
-
I got this message when I used the TemporalNet diff model. The thing about that model, which I find quite useful for my work, is that I don't have enough VRAM to use the regular model, which is 12GB. I noticed there was also something about 'stop using custom models' (I don't have the message in front of me atm). Are you going to block the use of non-standard models in the future? This is important, as all I use at the moment are non-standard models.
-
I saw that this message has started being emitted from a recent commit. Was there any discussion of this anywhere, or can there be here?
I have been using the diff models from when they became available. For me they have the following value:
I see lack of robustness is given as the driver, but I haven't had any issues with the diff models and will gladly accept any current level of imprecision I might be experiencing as an acceptable trade-off.
I know @lllyasviel made this change, so I am not sure if this conversation should be here or over in the main ControlNet repo, but given the message was added here I have gone with that.