[ACL'24] Chain of Thought (CoT) prompting plays a significant role in improving the reasoning abilities of large language models (LLMs). However, the correlation between the effectiveness of CoT and the length of reasoning steps in prompts remains largely unknown. To shed light on this, we have conducted several empirical experiments to explore the relationship.

The Impact of Reasoning Step Length on Large Language Models

Created by Mingyu Jin*, Qinkai Yu*, Dong Shu, Haiyan Zhao, Wenyue Hua, Yanda Meng†, Yongfeng Zhang†, Mengnan Du†. This repository contains the PyTorch implementation of "The Impact of Reasoning Step Length on Large Language Models".

The paper has been accepted to the Findings of ACL 2024.

[arXiv] https://arxiv.org/abs/2401.04925

[ACL] https://aclanthology.org/2024.findings-acl.108/

Installation

Make sure you have Python>=3.8 installed on your machine.

pip install torch==1.8.2+cu111 torchtext==0.9.2 -f https://download.pytorch.org/whl/lts/1.8/torch_lts.html
pip install -r requirements.txt
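
To confirm that the GPU build installed correctly, you can run a quick check (this one-liner is our addition, not one of the repository's scripts):

python -c "import torch; print(torch.__version__, torch.cuda.is_available())"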

Datasets

Download the datasets from the following:

https://github.com/kojima-takeshi188/zero_shot_cot/tree/main/dataset
https://github.com/kojima-takeshi188/zero_shot_cot/tree/main/log
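
One way to obtain them is to clone that repository and copy the two folders next to this one (the commands below are a suggestion, not a repository script; adjust the target paths to wherever run_inference.py expects them):

git clone https://github.com/kojima-takeshi188/zero_shot_cot.git
cp -r zero_shot_cot/dataset ./dataset
cp -r zero_shot_cot/log ./log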

Run inference

python run_inference.py --dataset multiarith --demo_path demos/multiarith --output_dir experiment/multiarith
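
Here --dataset selects the benchmark, --demo_path points to the few-shot demonstrations, and --output_dir is where results are written. To sweep several benchmarks at once, a simple shell loop works (dataset names other than multiarith are illustrative and assume matching demos exist under demos/):

for d in multiarith gsm8k; do
    python run_inference.py --dataset $d --demo_path demos/$d --output_dir experiment/$d
done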

Use demo

Relationship Between Steps and Accuracy

last_letter_1 represents adding one step to the demo.

last_letter_2 represents adding two steps to the demo.

...and so on.

python run_inference.py --dataset last_letter --demo_path demos/last_letter_1 --output_dir experiment/last_letters_1 # the trailing 1 indicates the number of added steps
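
To run the whole sweep in one go, you can loop over the number of added steps (this loop is our sketch; it assumes demo directories demos/last_letter_1 through demos/last_letter_4 exist, so adjust the range to match your demos/ folder):

for i in 1 2 3 4; do
    python run_inference.py --dataset last_letter --demo_path demos/last_letter_$i --output_dir experiment/last_letters_$i
done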

Effect of Prompt with Wrong Answer

python run_inference.py --dataset last_letter --demo_path demos/last_letters_false --output_dir experiment/last_letters_false

Create demo
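
The demo format is not documented in this README. As a rough sketch only: the codebase builds on the zero-shot-CoT/Auto-CoT pipeline, whose demo files are JSON with a top-level "demo" list of question/rationale/answer entries, so a new demo might be assembled as below. The schema here is an assumption; inspect an existing file under demos/ to confirm the exact field names before relying on it.

import json

# ASSUMED schema, mirroring Auto-CoT style demo files; verify against an existing file in demos/.
demo = {
    "demo": [
        {
            "question": "Q: Take the last letters of the words in 'Lady Gaga' and concatenate them.\nA:",
            "rationale": "Let's think step by step. The last letter of 'Lady' is 'y'. The last letter of 'Gaga' is 'a'. Concatenating them gives 'ya'.",
            "pred_ans": "ya",
        }
    ]
}

# Write the new demo next to the existing ones (the file name is illustrative).
with open("demos/last_letter_custom", "w") as f:
    json.dump(demo, f, indent=2)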

Citation

If you find this codebase helpful, please consider citing:

@article{jin2024impact,
  title={The Impact of Reasoning Step Length on Large Language Models},
  author={Jin, Mingyu and Yu, Qinkai and Shu, Dong and Zhao, Haiyan and Hua, Wenyue and Meng, Yanda and Zhang, Yongfeng and Du, Mengnan},
  journal={arXiv preprint arXiv:2401.04925},
  year={2024}
}

@inproceedings{jin-etal-2024-impact,
    title = "The Impact of Reasoning Step Length on Large Language Models",
    author = "Jin, Mingyu  and
      Yu, Qinkai  and
      Shu, Dong  and
      Zhao, Haiyan  and
      Hua, Wenyue  and
      Meng, Yanda  and
      Zhang, Yongfeng  and
      Du, Mengnan",
    booktitle = "Findings of the Association for Computational Linguistics ACL 2024",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand and virtual meeting",
    url = "https://aclanthology.org/2024.findings-acl.108",
    pages = "1830--1842",
}
