Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

An example of math calculation #6

Open
GanjinZero opened this issue Oct 27, 2023 · 1 comment
Open

An example of math calculation #6

GanjinZero opened this issue Oct 27, 2023 · 1 comment

Comments

@GanjinZero
Copy link

[RequestOutput(request_id=0, prompt='[INST] Write critiques for a submitted response on a given user's query, and grade the response:\n \n[BEGIN DATA]\n***\n[Query]: 1+1=\n***\n[Response]: 3\n***\n[END DATA]\n\nWrite critiques for this response. After that, you should give a final rating for the response on a scale of 1 to 10 by strictly following this format: "[[rating]]", for example: "Rating: [[5]]". [/INST]', prompt_token_ids=[1, 518, 25580, 29962, 14350, 3994, 3783, 363, 263, 18397, 2933, 373, 263, 2183, 1404, 29915, 29879, 2346, 29892, 322, 19468, 278, 2933, 29901, 13, 259, 13, 29961, 29933, 17958, 360, 8254, 29962, 13, 17435, 13, 29961, 3010, 5387, 29871, 29896, 29974, 29896, 29922, 13, 17435, 13, 29961, 5103, 5387, 29871, 29941, 13, 17435, 13, 29961, 11794, 360, 8254, 29962, 13, 13, 6113, 3994, 3783, 363, 445, 2933, 29889, 2860, 393, 29892, 366, 881, 2367, 263, 2186, 21700, 363, 278, 2933, 373, 263, 6287, 310, 29871, 29896, 304, 29871, 29896, 29900, 491, 18719, 1494, 445, 3402, 29901, 376, 8999, 29741, 5262, 613, 363, 1342, 29901, 376, 29934, 1218, 29901, 5519, 29945, 5262, 1642, 518, 29914, 25580, 29962], prompt_logprobs=None, outputs=[CompletionOutput(index=0, text=' The response to the user's query, which simply asked for the sum of 1+1, is correct in its answer, stating that the result is 3. However, the response could be improved in terms of engagement and user interaction. The response could have been more informative and interactive by asking the user if they needed further assistance or if they had any other questions. This would have made the interaction more engaging and personal. Additionally, the response could have been more explicit in stating the mathematical operation being performed, such as saying "The sum of 1+1 is 3." This would have made the response more informative and clearer for the user. \n\nRating: [[7]] ', token_ids=[450, 2933, 304, 278, 1404, 29915, 29879, 2346, 29892, 607, 3763, 4433, 363, 278, 2533, 310, 29871, 29896, 29974, 29896, 29892, 338, 1959, 297, 967, 1234, 29892, 23659, 393, 278, 1121, 338, 29871, 29941, 29889, 2398, 29892, 278, 2933, 1033, 367, 16710, 297, 4958, 310, 3033, 5049, 322, 1404, 14881, 29889, 450, 2933, 1033, 505, 1063, 901, 1871, 1230, 322, 28923, 491, 6721, 278, 1404, 565, 896, 4312, 4340, 18872, 470, 565, 896, 750, 738, 916, 5155, 29889, 910, 723, 505, 1754, 278, 14881, 901, 3033, 6751, 322, 7333, 29889, 19814, 29892, 278, 2933, 1033, 505, 1063, 901, 6261, 297, 23659, 278, 19475, 5858, 1641, 8560, 29892, 1316, 408, 5934, 376, 1576, 2533, 310, 29871, 29896, 29974, 29896, 338, 29871, 29941, 1213, 910, 723, 505, 1754, 278, 2933, 901, 1871, 1230, 322, 26135, 363, 278, 1404, 29889, 29871, 13, 13, 29934, 1218, 29901, 5519, 29955, 5262, 29871, 2], cumulative_logprob=-22.662128049569425, logprobs=None, finish_reason=stop)], finished=True)]

@lockon-n
Copy link
Collaborator

Thanks for your valuable feedback!

We will keep on improving Auto-J's ability on math reasoning.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants