Replies: 1 comment 6 replies
-
Thanks for starting the discussion @vaibhavad. I believe a simpler approach than implementing the Otherwise, I believe the rest of the proposal sounds reasonable. |
Beta Was this translation helpful? Give feedback.
6 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am interested in adding Conversational Retrieval tasks to MTEB. To get started, the two datasets that I am interested in adding are
Conversational retrieval involves different challenges compared to other tasks in MTEB. In particular, the query is a list of utterances from conversation history, instead of a single sentence. Here is an example fro QReCC:
Furthermore, how to convert the conversation history to a single input string for the model is in itself an interesting and relevant research direction. The most basic way is to just concatenate all the turns. Other approaches have also explored using a separate query rewriting model to convert conversation history to a single decontextualized question.
Ideally, the submitters should implement the logic of converting conversation history to input string, however, this will introduce breaking changes. So, here is what I propose:
conv_history_to_text
.conv_history_to_text
function, the MTEB package uses that implementation to convert conversation history to text before callingencode
, otherwise it uses the default implementation.Does this sound reasonable? If so, I can get started on Conversation Retrieval task category.
Beta Was this translation helpful? Give feedback.
All reactions