You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi @pruksmhc. Sorry for the delayed response. The compute_advantage_and_return function treats all the text in a batch as an episode. Therefore, technically, the batch size is not one.
In this line:
instructGOOSE/instruct_goose/trainer.py
Line 49 in 5b9ac3f
What is the purpose of iterating through in the 0th dimension, given the tensor is of size '1 b' (so it will only iterate once).
The text was updated successfully, but these errors were encountered: