clustering_qr.kmeans_plusplus explicit tensors deletion #774

RobertoDF · 2024-09-03T15:40:52Z

Expand the reach of clear_cache to clustering_qr.kmeans_plusplus.
Somehow I still get OOM at

Line 202 in b2f5ded

vexp = 2 * Xg @ Xc.T - (Xc**2).sum(1)

in one particular session. With this explicit cache cleaning I manage to process the session.

The error happens with a specific Xg matrix that takes 5GB of GPU memory. I have 12GB of memory on my RTX 4070.

The explicit deletion of vexp and dexp will slow down the loop therefore happens only if the tensor is bigger than 4GB.

dexp and vexp are reassigned within each loop but somehow the GPU does not delete immediately the previous tensor. It is enough to delete the variables, it is not necessary to use directly torch.cuda.empty_cache(), that would further slow down the loop.

Related to #746, possibly #771

RobertoDF · 2024-09-04T08:42:06Z

oh! I thought drafts were only visible to me sorry. I still have the problem but I found a way better solution. Ill do another pull request soon.

RobertoDF added 2 commits September 3, 2024 17:00

explicit tensors deletion in clustering_qr.kmeans_plusplus

e6d58eb

Add condition dependent on tensor size

d30e519

RobertoDF closed this Sep 3, 2024

RobertoDF deleted the kmeans_plusplus_explicit_tensors_deletion branch September 4, 2024 08:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

clustering_qr.kmeans_plusplus explicit tensors deletion #774

clustering_qr.kmeans_plusplus explicit tensors deletion #774

RobertoDF commented Sep 3, 2024 •

edited

Loading

RobertoDF commented Sep 4, 2024

clustering_qr.kmeans_plusplus explicit tensors deletion #774

clustering_qr.kmeans_plusplus explicit tensors deletion #774

Conversation

RobertoDF commented Sep 3, 2024 • edited Loading

RobertoDF commented Sep 4, 2024

RobertoDF commented Sep 3, 2024 •

edited

Loading