
Removed functions iterating over tensors from torch compilation process #224

Open: wants to merge 1 commit into habana-main

Conversation


@jczaja jczaja commented Sep 25, 2024

Problem:
Recent changes in the dependencies of the tgi-gaudi project eliminated some torch.compile graph breaks. This made some torch-compiled graphs much larger and more memory-consuming, which for some models could lead to device out-of-memory (OOM) errors.

Solution:
The torch-compiled graphs that caused the device OOM behaviour contained loops processing large numbers of tensors. The functions containing those loops were excluded from the torch compilation process.

@@ -92,7 +92,6 @@ def biggest_single_chunk(offset):
        return 0


-@torch_compile_for_eager
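The diff above drops the compile decorator from a function that loops over tensors. A minimal sketch of the effect, where `torch_compile_for_eager` is a hypothetical stand-in for the project's decorator (the real one lives in tgi-gaudi) and `sum_chunks` is an illustrative example, not the actual code:

```python
import torch

# Hypothetical stand-in for the project's decorator; assumed to wrap
# torch.compile (the "eager" debug backend keeps this sketch cheap).
def torch_compile_for_eager(fn):
    return torch.compile(fn, backend="eager")

# Before this PR the function carried the decorator; Dynamo unrolls the
# Python loop, so graph size grows with len(tensors). With the decorator
# removed (as in the PR), the function simply runs eagerly.
# @torch_compile_for_eager   # removed, as in the PR
def sum_chunks(tensors):
    total = torch.zeros_like(tensors[0])
    for t in tensors:  # plain eager loop, no large captured graph
        total = total + t
    return total

chunks = [torch.ones(4) for _ in range(8)]
result = sum_chunks(chunks)
```

The trade-off is that the excluded ops no longer benefit from compilation at all, which is exactly what the review discussion below probes.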


Why not use torch._dynamo.graph_break() in place of mark_step, instead of removing compilation of the graph entirely?
The same question applies to all the cases below.
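The reviewer's alternative would keep the function compiled but split the capture at an explicit break point. A hedged sketch under the same assumptions as above (`process` and the loop body are illustrative, not tgi-gaudi code):

```python
import torch

@torch.compile(backend="eager")  # "eager" debug backend keeps the sketch cheap
def process(tensors):
    out = torch.zeros_like(tensors[0])
    # Explicit break: Dynamo ends the current graph here, so the loop
    # below is traced into separate, smaller graphs rather than being
    # unrolled into one huge graph with the preceding ops.
    torch._dynamo.graph_break()
    for t in tensors:
        out = out + t
    return out

result = process([torch.ones(3), torch.ones(3)])
```

Whether this actually bounds graph size depends on what Dynamo captures after the break; the PR instead opts out of compilation for these functions altogether.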


Does the change cause the ops within the function to be executed eagerly?

Author

@jczaja jczaja Sep 26, 2024


> Does the change cause the ops within the function to be executed eagerly?

I assume (I have not run logging of fallback-to-eager events) that the functions excluded from torch.compile regions (as done in this PR) now run eagerly, i.e. PyTorch ops in code whose torch.compile decorator was removed are executed eagerly.

Author


Some internal testing revealed that excluding these functions from the torch.compile region (as in this PR) had no impact on performance or accuracy.

@mandy-li mandy-li requested review from schoi-habana and removed request for bsochack October 1, 2024 17:59
Collaborator

@schoi-habana schoi-habana left a comment


lgtm
