Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Better construction of reflection data in o1-journey? #5

Open
YuMS opened this issue Oct 17, 2024 · 2 comments
Open

Better construction of reflection data in o1-journey? #5

YuMS opened this issue Oct 17, 2024 · 2 comments

Comments

@YuMS
Copy link

YuMS commented Oct 17, 2024

It seems that some o1-journey reflection data publiced on Hugging Face are actually correcting correct reasoning steps. Is it possible that there are still room in how the reasoning tree is traversed and how reflection is constructed.

For example, I randomly sampled instances related to the keyword "wait". Every single checked reflection (rows 19, 39, 56, 75) have reflections that appeared to be unnecessary

https://huggingface.co/datasets/GAIR/o1-journey/

@YuMS
Copy link
Author

YuMS commented Oct 17, 2024

BTW, Really appreciate your work. I think the construction of o1-journey data is an important first step.

@codelion
Copy link

codelion commented Nov 1, 2024

You can use cot_reflection approach to construct data for cot with reflection using optillm - https://github.com/codelion/optillm

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants