Training data volume #4

abhishmitra · 2018-08-13T08:59:23Z

Hi, Great project btw!
So I'm looking to get slightly better variety in the volume of paraphrased sentences.
The 50m dataset has 30 million high quality paraphrasing pairs.

How many have you used to train the model? Also do you reckon it'll increase performance by a lot?

Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Training data volume #4

Training data volume #4

abhishmitra commented Aug 13, 2018

Training data volume #4

Training data volume #4

Comments

abhishmitra commented Aug 13, 2018