Hello Junshen, I had a question about how the losses are computed. I saw in your code that you detach some variables (NeSVoR/nesvor/nesvor/models.py, lines 310 to 312 in 671c023). Is there a particular reason for this? Did you observe, for example, more stable training? Thanks for your help! Best,
Replies: 1 comment
Hi Thomas, Yes. We recently found that detaching the variables can stabilize training in some cases. For most of the data we evaluated, however, it doesn't change the performance much.
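For readers unfamiliar with what detaching does to a loss, here is a minimal PyTorch sketch. It is not the code from `models.py`: the tensors `pred`, `weight`, `target` and the helper `weighted_loss` are hypothetical names chosen for illustration. It only shows that a detached tensor is treated as a constant, so the loss term stops sending gradients into it.

```python
import torch

# Hypothetical toy setup, not taken from NeSVoR: `pred` plays the role of a
# model output and `weight` the role of a second learned quantity that
# scales the residual inside the loss.
pred = torch.tensor([1.0, 2.0], requires_grad=True)
weight = torch.tensor([0.5, 2.0], requires_grad=True)
target = torch.tensor([0.0, 0.0])


def weighted_loss(pred, weight, target, detach_weight):
    # With detach_weight=True the weight is treated as a constant, so this
    # loss term does not backpropagate into it.
    w = weight.detach() if detach_weight else weight
    return (w * (pred - target) ** 2).sum()


# Without detach: both tensors receive a gradient from the loss.
grad_pred, grad_weight = torch.autograd.grad(
    weighted_loss(pred, weight, target, detach_weight=False),
    [pred, weight],
)

# With detach: only `pred` receives a gradient; `weight` is unused in the
# graph, so autograd returns None for it (allow_unused=True permits this).
grad_pred_d, grad_weight_d = torch.autograd.grad(
    weighted_loss(pred, weight, target, detach_weight=True),
    [pred, weight],
    allow_unused=True,
)

print(grad_pred, grad_weight)
print(grad_pred_d, grad_weight_d)
```

In this toy case the gradient with respect to `pred` is the same either way. Detaching only removes the path through which the loss could also change `weight`, which is one plausible reason it could affect training stability in some cases.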