You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It seems that some o1-journey reflection data publiced on Hugging Face are actually correcting correct reasoning steps. Is it possible that there are still room in how the reasoning tree is traversed and how reflection is constructed.
For example, I randomly sampled instances related to the keyword "wait". Every single checked reflection (rows 19, 39, 56, 75) have reflections that appeared to be unnecessary
It seems that some o1-journey reflection data publiced on Hugging Face are actually correcting correct reasoning steps. Is it possible that there are still room in how the reasoning tree is traversed and how reflection is constructed.
For example, I randomly sampled instances related to the keyword "wait". Every single checked reflection (rows 19, 39, 56, 75) have reflections that appeared to be unnecessary
https://huggingface.co/datasets/GAIR/o1-journey/
The text was updated successfully, but these errors were encountered: