How was the length extrapolation in Figure 5 performed? #415

SCZwangxiao · 2024-10-17T10:16:18Z

Figure 5 of the technical report illustrates the length extrapolation capability of Qwen2-VL-72B on Video-MME Medium Video, with the inference sequence length scaled up to 80K. However, max_position_embeddings in the config is only 32768.

Could the authors share how the length extrapolation was performed?
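
To make the gap concrete, here is a minimal sketch of the arithmetic. It assumes "80K" means 80 × 1024 tokens (the report may mean 80,000 instead), and the helper name `extrapolation_factor` is only illustrative, not something from the Qwen2-VL code.

```python
# Value quoted above from the model config.
MAX_POSITION_EMBEDDINGS = 32768

# Assumption: "80K" in Figure 5 means 80 * 1024 tokens; it could also mean 80,000.
FIGURE_5_MAX_TOKENS = 80 * 1024


def extrapolation_factor(seq_len: int, configured_len: int) -> float:
    """Return how many times longer seq_len is than configured_len."""
    return seq_len / configured_len


factor = extrapolation_factor(FIGURE_5_MAX_TOKENS, MAX_POSITION_EMBEDDINGS)
print(f"{FIGURE_5_MAX_TOKENS} tokens is {factor:.2f}x max_position_embeddings")
```

Under that assumption, the longest sequences in Figure 5 run at about 2.5 times the configured max_position_embeddings, which is why I am asking how this was handled.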
