This is a refactored implementation of the RST discourse parser described in "A Two-stage Parsing Method for Text-level Discourse Analysis". Because of the license of the RST corpus, the training data is not included in the project folder. To reproduce the results in the paper, download the corpus from the LDC and preprocess it as described below.
-
Preprocess the data:
python3 preprocess.py RST_DATA_DIR RST_DEST_DIR
-
Train model:
python3 main.py --train --train_dir TRAIN_DIR
-
Evaluate model:
python3 main.py --eval --eval_dir EVAL_DIR
The code currently runs under Python 3.7. The models have been rewritten with scikit-learn (sklearn). See requirements.txt for the full list of dependencies.
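For readers who want to see how the flags in the commands above fit together, here is a minimal sketch of an argparse setup that accepts them. It is not taken from main.py: the function names build_parser and validate, the help strings, and the rule that each mode needs its matching directory are all assumptions made for illustration.

```python
import argparse


def build_parser():
    # Hypothetical parser mirroring the flags used in the commands above;
    # the project's real main.py may define them differently.
    parser = argparse.ArgumentParser(description="RST discourse parser (sketch)")
    parser.add_argument("--train", action="store_true", help="train a model")
    parser.add_argument("--train_dir", help="directory with preprocessed training data")
    parser.add_argument("--eval", action="store_true", help="evaluate a model")
    parser.add_argument("--eval_dir", help="directory with preprocessed evaluation data")
    return parser


def validate(args):
    # Assumed rule (not confirmed by the source): each mode needs its directory.
    if args.train and not args.train_dir:
        raise ValueError("--train requires --train_dir")
    if args.eval and not args.eval_dir:
        raise ValueError("--eval requires --eval_dir")
    if not (args.train or args.eval):
        raise ValueError("choose --train and/or --eval")
    return args


# Parse an explicit argument list equivalent to the training command above.
args = validate(build_parser().parse_args(["--train", "--train_dir", "TRAIN_DIR"]))
print(vars(args))
```

Checking the arguments up front, as this sketch does, would let a missing directory fail immediately with a clear message rather than partway through training or evaluation.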