You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
During our early experiments we noticed several issues with our preprocessing pipeline which we
fixed for the later experiments. In particular, we noticed that some sacremoses command line flags were broken, and the out-of-the-box inference tool from FairSeq did not fully replicate the preprocessing pipeline used for training (punctuation normalization and vocabulary-aware subword segmentation). The original pipeline (called v1) was used for our baseline models. The later experiments used the fixed implementations of sacremoses and FairSeq (denoted by v2).
The text was updated successfully, but these errors were encountered:
The CLI flags and chaining though pipeline should be tested with a little more robustness than just the examples in README.md
Not sure if it is still the case, but I found this from https://aclanthology.org/2020.wmt-1.88.pdf
The text was updated successfully, but these errors were encountered: