forked from yizhongw/StageDP

A two-stage RST discourse parser


rknaebel/StageDP

 
 


Two-stage Discourse Parser

This is a refactoring of the implementation of the RST discourse parser described in A Two-stage Parsing Method for Text-level Discourse Analysis. Due to the license of the RST corpus, the training data is not included in the project folder. To reproduce the results in the paper, download the corpus from the LDC and preprocess it as described below.

Usage:

  1. Preprocess the data:

    python3 preprocess.py RST_DATA_DIR RST_DEST_DIR
    
  2. Train model:

    python3 main.py --train --train_dir TRAIN_DIR
    
  3. Evaluate model:

    python3 main.py --eval --eval_dir EVAL_DIR
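
The three steps above can also be chained from a small driver script. The following is a minimal sketch, not part of the repository: the helper names `build_commands`, `format_command`, and `run_pipeline` are hypothetical, and the directory paths are placeholders to replace with your own. It only assembles the commands exactly as documented above and, by default, prints them without running anything.

```python
import shlex
import subprocess


def build_commands(rst_data_dir, rst_dest_dir, train_dir, eval_dir, python="python3"):
    """Return the three documented commands as argument lists, in order."""
    return [
        [python, "preprocess.py", rst_data_dir, rst_dest_dir],
        [python, "main.py", "--train", "--train_dir", train_dir],
        [python, "main.py", "--eval", "--eval_dir", eval_dir],
    ]


def format_command(cmd):
    """Render an argument list as a shell-safe string (works on Python 3.7)."""
    return " ".join(shlex.quote(arg) for arg in cmd)


def run_pipeline(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(format_command(cmd))
        if not dry_run:
            # check=True stops the pipeline at the first failing step.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Placeholder paths (assumptions): substitute your own directories.
    commands = build_commands(
        "RST_DATA_DIR", "RST_DEST_DIR", "TRAIN_DIR", "EVAL_DIR"
    )
    run_pipeline(commands, dry_run=True)
```

Calling `run_pipeline(commands, dry_run=False)` from the repository root would execute the steps in sequence. Keeping the default as a dry run makes it easy to check the paths before starting a long training job.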
    

Requirements:

Currently runs under Python 3.7. The models are reimplemented in scikit-learn (sklearn). See requirements.txt for the full list of dependencies.
