[Liger] liger DPO support #2568

kashif · 2025-01-14T13:14:28Z

What does this PR do?

Add support for the Liger-kernel DPO loss in the DPO trainer.

HuggingFaceDocBuilderDev · 2025-01-15T11:15:17Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec · 2025-01-17T16:23:36Z

tests/test_dpo_trainer.py

+        3. Loss values are reasonable and finite
+        4. Training works with both default and custom beta values
+        """
+        beta_values = [0.1, 0.5]  # Test multiple beta values


Can you use @parameterized.expand instead?

trl/trainer/dpo_trainer.py

qgallouedec · 2025-01-17T16:50:25Z

The Liger loss isn't compatible with precomputing the reference log probabilities, right? If so, we could add a warning or raise an error.
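
If the two options do turn out to be incompatible, a fail-fast check could look roughly like the sketch below. `ToyDPOConfig` is a hypothetical stand-in that carries only the two relevant flags (`use_liger_loss` from this PR and `precompute_ref_log_probs`); it is not the real `DPOConfig`, and the error message is an assumption:

```python
from dataclasses import dataclass


@dataclass
class ToyDPOConfig:
    """Hypothetical stand-in for DPOConfig with only the two flags in question."""

    use_liger_loss: bool = False
    precompute_ref_log_probs: bool = False

    def __post_init__(self):
        # Reject the combination at construction time rather than mid-training.
        if self.use_liger_loss and self.precompute_ref_log_probs:
            raise ValueError(
                "use_liger_loss=True cannot be combined with "
                "precompute_ref_log_probs=True."
            )
```

Raising at configuration time surfaces the problem before any model is loaded; a warning would instead let training continue with one of the options silently ignored.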

qgallouedec · 2025-01-17T16:57:26Z

docs/source/reducing_memory_usage.md

+
+## Liger for reducing peak memory usage
+
+[To complete]
+
+<hfoptions id="liger">
+<hfoption id="DPO">
+
+To use Liger for reducing peak memory usage, use the following code snippet:
+
+```python
+from trl import DPOConfig
+
+training_args = DPOConfig(..., use_liger_loss=True)
+```
+
+</hfoption>
+</hfoptions>


@kashif I've added this section to the new guide on reducing memory usage, in case you have some words to fill it in.

initial liger support

f50e74d

kashif mentioned this pull request Dec 22, 2024

[Tracking issue] Integrate native liger-kernel losses #2495

Open

6 tasks

kashif added 3 commits January 15, 2025 12:05

fix outputs

e3eebd3

fix config merge conflict

2d82b39

Merge branch 'main' into liger-dpo

50d341e

kashif added 2 commits January 15, 2025 12:19

fix comment

8ae06b1

fix peft training

cc2b7b9

qgallouedec reviewed Jan 17, 2025

qgallouedec added 3 commits January 17, 2025 16:31

use parametrized

03fd005

raise error as soon as dep is not met

5f4110f

move param to the right section

b22eb24

qgallouedec reviewed Jan 17, 2025

trl/trainer/dpo_trainer.py (outdated)

reducing memory doc

b8e6f8c

qgallouedec reviewed Jan 17, 2025

kashif added 4 commits January 21, 2025 14:57

use liger specifc method

6310dbd

Merge branch 'main' into liger-dpo

bdca4f1

Merge branch 'main' into liger-dpo

5efe4d0

update return signature

dbece54
