Better construction of reflection data in o1-journey? #5

YuMS · 2024-10-17T03:35:51Z

It seems that some o1-journey reflection data publiced on Hugging Face are actually correcting correct reasoning steps. Is it possible that there are still room in how the reasoning tree is traversed and how reflection is constructed.

For example, I randomly sampled instances related to the keyword "wait". Every single checked reflection (rows 19, 39, 56, 75) have reflections that appeared to be unnecessary

https://huggingface.co/datasets/GAIR/o1-journey/

YuMS · 2024-10-17T03:42:11Z

BTW, Really appreciate your work. I think the construction of o1-journey data is an important first step.

codelion · 2024-11-01T10:37:28Z

You can use cot_reflection approach to construct data for cot with reflection using optillm - https://github.com/codelion/optillm

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better construction of reflection data in o1-journey? #5

Better construction of reflection data in o1-journey? #5

YuMS commented Oct 17, 2024

YuMS commented Oct 17, 2024

codelion commented Nov 1, 2024

Better construction of reflection data in o1-journey? #5

Better construction of reflection data in o1-journey? #5

Comments

YuMS commented Oct 17, 2024

YuMS commented Oct 17, 2024

codelion commented Nov 1, 2024