Not supporting long passage exceeds 512 tokens. #1

KuRoti · 2023-01-19T02:12:18Z

Currently, we found out that bert-based models report the same probabilities for all options if the total length of the passage + question exceeds 512 tokens since it's their maximum token length. We found the issue, but after discussion, we concluded that the truncation for the 512 tokens is not a fundamental solution to this problem since the key information for question-solving may be mentioned in the cut-offed part of the passage. We will soon prepare a solution for longer passages.
We are now considering splitting the passage into smaller sentences, and returning the maximum KDA value among calculations over all pairs of split passage - questions.

KuRoti self-assigned this Jan 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Not supporting long passage exceeds 512 tokens. #1

Not supporting long passage exceeds 512 tokens. #1

KuRoti commented Jan 19, 2023

Not supporting long passage exceeds 512 tokens. #1

Not supporting long passage exceeds 512 tokens. #1

Comments

KuRoti commented Jan 19, 2023