Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
data.py		data.py
deepspeed_config.json		deepspeed_config.json
tune.py		tune.py
tune_proofstep.sh		tune_proofstep.sh

README.md

Fine-tuning

Fine-tuning your own model is optional: by default, llmstep uses a model available on Huggingface that was fine-tuned with these scripts:

wellecks/llmstep-mathlib4-pythia2.8b

First download and format the data:

python data.py

Fine-tuning is then done using tune.py. See tune_proofstep.sh for an example command.
The command uses 8 GPUs with Deepspeed (tested on NVIDIA RTX A6000).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

train

train

README.md

Fine-tuning

Files

train

Directory actions

More options

Directory actions

More options

Latest commit

History

train

Folders and files

parent directory

README.md

Fine-tuning