Skip to content

Commit

Permalink
still WIP
Browse files Browse the repository at this point in the history
  • Loading branch information
Arne Neumann committed Feb 15, 2019
1 parent 5b10a7f commit adc5a88
Showing 1 changed file with 10 additions and 1 deletion.
11 changes: 10 additions & 1 deletion Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -33,7 +33,16 @@ RUN patch -p0 < ../../../patches/rst-ptb.train.patch
WORKDIR /opt/josydipa/dataset/rst/test
RUN patch -p0 < ../../../patches/rst-ptb.test.patch

# TODO: write/get a converter from .mrg to .cleangold files, cf. https://github.com/kaayy/josydipa/issues/2
WORKDIR /opt/josydipa/dataset/ptb
# RUN patch -p0 < ../../patches/ptb-rst.patch
RUN patch -p0 < ../../patches/ptb-rst.patch

WORKDIR /opt/josydipa
RUN python src/tokenize_rst.py --rst_path dataset/rst/train && \
python src/tokenize_rst.py --rst_path dataset/rst/test

RUN mkdir -p dataset/joint && \
python src/aligner.py --rst_path dataset/rst/train --const_path dataset/ptb > dataset/joint/train.txt && \
python src/aligner.py --rst_path dataset/rst/test --const_path dataset/ptb > dataset/joint/test.txt


0 comments on commit adc5a88

Please sign in to comment.