You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, when I got into the link of the RCV dataset, I found "404 not found", could you provide another link of the RCV dataset? If possible could you provide other datasets in your paper. It's a little hard for me to understand the code without the dataset. Thank you very much!
The text was updated successfully, but these errors were encountered:
You can know the format of the data by looking at the load_data method.
In the line, you see the data is pickle files containing four attributes (the last two are never used and can thus ignore).
[train, test, vocab, catgy] = pickle.load(fin)
Then looking at the load_data_and_labels method, you see the train/test data are a list of document dicts with key 'text' for the plain text document and 'catgy' for the label.
There's another closed issue providing a link to some other datasets used in the paper.
Hi, when I got into the link of the RCV dataset, I found "404 not found", could you provide another link of the RCV dataset? If possible could you provide other datasets in your paper. It's a little hard for me to understand the code without the dataset. Thank you very much!
The text was updated successfully, but these errors were encountered: