This is the implementation for the paper Dependency Induction Through the Lens of Visual Perception. We implemented based on the lpcfg Lexicalized-PCFG.
To generate the alignment, concatinate all caption and label pairs for all splits, and use the make_input
argument in align.py
under data/proc_data
directory
We provided the alignment input and output in the data/proc_data
folder. Periods are stripped.
To get the dice alignment scores:
python dice_alignment.py {alignment_file} > {file_prefix.out}
The dependencies (.conllx
files) for MSCOCO are generated using this repo
Dependencies folders should be placed in the lpcfg/data
with the folder name dep
to use during preprocess.py
Sample training scripts are provided in lpcfg/scripts
folder. They can be executed sh scripts/{script_you_want_to_run}.sh
in the lpcfg
directory. Evaluation requires setting the argument --mode test
when calling train.py
python preprocess.py --vocabsize 100000 --replace_num 1 --dep --outputfile {OUTPUT_PATH}
requires the input file for dice alignment, and the output file contains the alignment pairs with scores
python preprocess.py --vocabsize 100000 --replace_num 1 --dep --outputfile {OUTPUT_PATH} --align_input {PATH_TO_ALIGN_INPUT_FILE} --align_output {PATH_TO_ALIGN_OUTPUT_FILE}
requires the original file for each English word with it's corresponding concreteness scores
python preprocess.py --vocabsize 100000 --replace_num 1 --dep --outputfile {OUTPUT_PATH} --concrete_file {PATH_TO_CONCRETE_FILE}