This is the PyTorch implementation of the DSTM model. We propose DSTM (a Transformer model guided by dependency syntax information in alternative ways) for document-level relation extraction. The model effectively integrates dependency syntax information into the Transformer, improving its ability to process long texts by enhancing the attention between two tokens that share a dependency within a sentence. Specifically, a dependency syntax tree is built at the document level, and three alternative fusion methods (Direct Transformation, Biaffine Transformation, and Decomposed Linear Transformation) are designed to fuse the dependency syntax information into the pretrained model.
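For intuition, below is a minimal sketch (not the repo's actual code) of the general idea: a binary adjacency matrix derived from the dependency tree becomes an additive bias on the self-attention scores, so token pairs linked by a dependency arc attend to each other more strongly. All names here (`DependencyGuidedAttention`, `dep_adj`) are illustrative assumptions; the three fusion methods above differ in how the bias term is computed from the dependency information.

```python
import torch
import torch.nn as nn

class DependencyGuidedAttention(nn.Module):
    """Illustrative sketch, not the repo's implementation: self-attention
    whose scores are biased by a dependency adjacency matrix."""

    def __init__(self, hidden_size, num_heads):
        super().__init__()
        assert hidden_size % num_heads == 0
        self.num_heads = num_heads
        self.head_dim = hidden_size // num_heads
        self.qkv = nn.Linear(hidden_size, 3 * hidden_size)
        # One learnable scalar per head controlling how strongly a dependency
        # edge between two tokens boosts their attention score.
        self.dep_weight = nn.Parameter(torch.zeros(num_heads))

    def forward(self, x, dep_adj):
        # x: (batch, seq_len, hidden)
        # dep_adj: (batch, seq_len, seq_len), 1.0 where two tokens are linked
        # in the document-level dependency tree, 0.0 elsewhere.
        b, n, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        q = q.view(b, n, self.num_heads, self.head_dim).transpose(1, 2)
        k = k.view(b, n, self.num_heads, self.head_dim).transpose(1, 2)
        v = v.view(b, n, self.num_heads, self.head_dim).transpose(1, 2)
        scores = q @ k.transpose(-2, -1) / self.head_dim ** 0.5
        # Enhance attention between tokens with a dependency relation.
        scores = scores + self.dep_weight.view(1, -1, 1, 1) * dep_adj.unsqueeze(1)
        attn = scores.softmax(dim=-1)
        return (attn @ v).transpose(1, 2).reshape(b, n, -1)
```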
This implementation is adapted from huggingface transformers.
- Python 3.7.0, `transformers==2.7.0`
- This implementation is tested on a single 10 GB NVIDIA GeForce RTX 3080 GPU with CUDA 11.0.
- Download pretrained models into `./pretrained_lm`. For example, if you want to reproduce the results based on RoBERTa Base, download and keep the model files as:
```
pretrained_lm
└── roberta_base
    ├── pytorch_model.bin
    ├── vocab.json
    ├── config.json
    └── merges.txt
```
Note that these files should correspond to huggingface transformers version 2.7.0; otherwise, the code will automatically download the models from S3 into your `--cache_dir`.
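As a quick sanity check that the local files match the expected transformers 2.7.0 layout, they can be loaded directly (paths as in the tree above):

```python
from transformers import RobertaConfig, RobertaModel, RobertaTokenizer

# Load the locally stored RoBERTa Base files; if loading fails, the files
# likely do not match transformers version 2.7.0.
config = RobertaConfig.from_pretrained("./pretrained_lm/roberta_base")
tokenizer = RobertaTokenizer.from_pretrained("./pretrained_lm/roberta_base")
model = RobertaModel.from_pretrained("./pretrained_lm/roberta_base", config=config)
```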
- Download the DocRED dataset into `./data`, including `train_annotated.json`, `dev.json`, and `test.json`. Moreover, the adjacency matrices corresponding to the document-level dependency syntax trees for each dataset, i.e. the data in `data.rar`, can be unpacked into `./data` (see the illustrative sketch below for what these matrices encode).
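The released `data.rar` already contains these adjacency matrices, so nothing needs to be generated. Purely for illustration, the following sketch shows what such a matrix encodes, using spaCy as a stand-in parser (an assumption; the authors' preprocessing pipeline, tokenization, and how sentence trees are joined at the document level may differ):

```python
import numpy as np
import spacy

nlp = spacy.load("en_core_web_sm")

def build_dep_adjacency(text):
    """Illustrative only: a symmetric 0/1 matrix over all tokens of a
    document, with 1 where a dependency arc links two tokens (plus
    self-loops). Use the released data.rar files for actual training."""
    doc = nlp(text)
    n = len(doc)
    adj = np.eye(n, dtype=np.float32)   # self-loops
    for token in doc:
        if token.i != token.head.i:     # a sentence root is its own head
            adj[token.i, token.head.i] = 1.0
            adj[token.head.i, token.i] = 1.0
    return adj
```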
- Choose your model and configure the script: set `--model_type` to one of `[roberta, bert]`.
- Then run the training script:

```
sh train.sh
```
Checkpoints will be saved into `./checkpoints`, and the best threshold for relation prediction will be searched on the dev set and printed during evaluation.
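For intuition, the threshold search amounts to sweeping candidate thresholds over the dev-set relation scores and keeping the one that maximizes F1. A simplified sketch (the repo's actual search may differ in detail; `scores` and `labels` are hypothetical names):

```python
import numpy as np

def search_best_threshold(scores, labels):
    """Simplified sketch: `scores` are predicted relation probabilities and
    `labels` the 0/1 gold labels, both flattened over all candidates."""
    best_f1, best_thresh = 0.0, 0.5
    for thresh in np.unique(scores):
        preds = scores >= thresh
        tp = np.logical_and(preds, labels).sum()
        precision = tp / max(preds.sum(), 1)
        recall = tp / max(labels.sum(), 1)
        f1 = 2 * precision * recall / max(precision + recall, 1e-8)
        if f1 > best_f1:
            best_f1, best_thresh = f1, thresh
    return best_thresh, best_f1
```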
Set `--checkpoint` and `--predict_thresh`, then run the script:

```
sh predict.sh
```

The result will be saved as `${checkpoint}/result.json`.
You can compress and upload it to the official competition leaderboard at CodaLab:

```
zip result.zip result.json
```
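Each entry in `result.json` is expected to follow the DocRED submission format used by the official evaluation script (verify against the leaderboard instructions). A small sketch that loads the file as a sanity check and produces the same archive from Python:

```python
import json
import zipfile

# Each prediction is expected to look like (keys per the official DocRED
# evaluation script; double-check against the leaderboard instructions):
#   {"title": "...", "h_idx": 0, "t_idx": 2, "r": "P17"}
with open("result.json") as f:
    predictions = json.load(f)
print(f"{len(predictions)} predictions loaded")

# Equivalent to `zip result.zip result.json`.
with zipfile.ZipFile("result.zip", "w", zipfile.ZIP_DEFLATED) as zf:
    zf.write("result.json")
```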