
Why is sorting instances by sequence length in descending order needed? #7

mrgloom opened this issue Aug 5, 2019 · 1 comment


@mrgloom

mrgloom commented Aug 5, 2019

Why is the step of sorting instances by sequence length in descending order needed?

@rayryeng

rayryeng commented Nov 24, 2020

This was required in earlier versions of PyTorch so that the examples in a batch could be interleaved properly during the forward and backward passes. Knowing the length of each example before zero-padding, and sorting the sequences in descending length order, was crucial for the interleaving to work. However, this is no longer needed as of PyTorch 1.1.0: you can specify enforce_sorted=False and the sorting is done internally. Also, with newer versions of PyTorch you can use pack_sequence instead of pack_padded_sequence, so you no longer need to pad the sequences at all. You just provide a list of tensors, with each tensor being one sequence in the batch.
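For illustration, here is a minimal sketch of both options (the toy batch of random tensors and all variable names are hypothetical, not from this repo):

```python
import torch
from torch.nn.utils.rnn import pack_padded_sequence, pack_sequence, pad_sequence

# Three variable-length sequences of feature size 4, deliberately NOT sorted by length.
seqs = [torch.randn(5, 4), torch.randn(3, 4), torch.randn(7, 4)]

# Option 1: pad, then pack. With enforce_sorted=False (PyTorch >= 1.1.0),
# the batch no longer has to be pre-sorted in descending length order.
lengths = torch.tensor([s.size(0) for s in seqs])
padded = pad_sequence(seqs, batch_first=True)   # shape (batch, max_len, 4)
packed = pack_padded_sequence(padded, lengths, batch_first=True, enforce_sorted=False)

# Option 2: skip padding entirely and pack the list of tensors directly.
packed2 = pack_sequence(seqs, enforce_sorted=False)

# Either packed batch can be fed straight to an RNN.
rnn = torch.nn.LSTM(input_size=4, hidden_size=8, batch_first=True)
out, (h, c) = rnn(packed)
```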
