Proper hyperparameter handling #53

Merged: 2 commits merged into main from feature/proper_param_handling on Jan 30, 2025
Conversation

@Flova Flova (Member) commented Jan 28, 2025

Proposed changes

Until now, all parameters related to model training and inference were hardcoded in the respective scripts. Now we have YAML config files for training. The parameters are subsequently stored in the model files themselves and are automatically loaded during inference to configure the model and dataset.
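
As an illustration, here is a minimal sketch of this flow, assuming PyTorch-style checkpoints; the config path, keys, and model below are placeholders rather than the repository's actual API:

```python
import yaml
import torch

# Training side: read hyperparameters from a YAML config file
# (path and keys are illustrative).
with open("config/train.yaml") as f:
    params = yaml.safe_load(f)  # e.g. {"lr": 1e-4, "batch_size": 32}

model = torch.nn.Linear(10, 2)  # stand-in for the real model

# Store the hyperparameters alongside the weights so inference can
# reconstruct the model and dataset without a separate config file.
torch.save(
    {"model_state_dict": model.state_dict(), "hyperparams": params},
    "model.pth",
)

# Inference side: the parameters travel with the checkpoint.
checkpoint = torch.load("model.pth")
params = checkpoint["hyperparams"]
model.load_state_dict(checkpoint["model_state_dict"])
```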

Additionally, we store the optimizer and lr_scheduler state, which enables effortless restarting of training runs.
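
A hedged sketch of what storing and restoring these states could look like with standard PyTorch APIs (the names and values are illustrative, not the project's actual code):

```python
import torch

model = torch.nn.Linear(10, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=100)

# Save everything needed to pick the run back up later.
torch.save(
    {
        "model_state_dict": model.state_dict(),
        "optimizer_state_dict": optimizer.state_dict(),
        "scheduler_state_dict": scheduler.state_dict(),
        "epoch": 42,
    },
    "checkpoint.pth",
)

# Restarting: restore all three states before continuing the training loop.
ckpt = torch.load("checkpoint.pth")
model.load_state_dict(ckpt["model_state_dict"])
optimizer.load_state_dict(ckpt["optimizer_state_dict"])
scheduler.load_state_dict(ckpt["scheduler_state_dict"])
start_epoch = ckpt["epoch"] + 1
```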

Old models were converted to the new checkpoint format using a small script.
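
Such a conversion could look roughly like the following, assuming the old checkpoints contained only a state dict and the previously hardcoded values are known (key names and values here are assumptions, not the project's actual script):

```python
import torch

# Values that the old training scripts hardcoded (illustrative).
LEGACY_HYPERPARAMS = {"lr": 1e-4, "batch_size": 32}

# Wrap the old weights-only checkpoint into the new format.
old_state_dict = torch.load("old_model.pth")
torch.save(
    {"model_state_dict": old_state_dict, "hyperparams": LEGACY_HYPERPARAMS},
    "old_model_converted.pth",
)
```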

Related issues

Closes #21

Checklist

  • Write documentation
  • Create issues for future work
  • This PR is on our DDLitLab project board

@Flova Flova requested review from texhnolyze and jaagut January 28, 2025 18:42
@Flova Flova merged commit d4fe368 into main Jan 30, 2025
5 checks passed
@Flova Flova deleted the feature/proper_param_handling branch January 30, 2025 10:51
Labels: none
Projects: Status: Done
Linked issues: Add hyperparameter configuration
Participants: 3