clustering_qr.kmeans_plusplus explicit tensors deletion #773

RobertoDF · 2024-09-03T15:34:00Z

Extend the use of clear_cache to clustering_qr.kmeans_plusplus.
I still get an out-of-memory (OOM) error at

Line 202 in b2f5ded

vexp = 2 * Xg @ Xc.T - (Xc**2).sum(1)

in one particular session. With this explicit cache clearing I can process the session.

The error happens with a specific Xg matrix that takes 5GB of GPU memory. I have 12GB of memory on my RTX 4070.

Explicitly deleting vexp and dexp slows down the loop, so it happens only if the tensor is bigger than 4GB.

dexp and vexp are reassigned on each iteration of the loop, but somehow the GPU does not immediately free the previous tensor. Deleting the variables is enough; calling torch.cuda.empty_cache() directly is not necessary and would slow down the loop further.
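For illustration, here is a minimal sketch of the size-gated deletion described above. It is not the code from the commits. The names `tensor_nbytes`, `run_iterations`, `compute_step`, and `FOUR_GIB` are hypothetical, and the sketch assumes the size check is made on Xg and that 4GB means 4 * 1024**3 bytes. A likely reason the old tensors linger is that Python evaluates the right-hand side of an assignment before releasing the object the name previously referred to, so the old and new tensors briefly coexist unless the names are deleted first.

```python
FOUR_GIB = 4 * 1024**3  # assumed threshold, standing in for the "4GB" above


def tensor_nbytes(t):
    """Size of a tensor's data in bytes.

    Works for any object exposing element_size() and nelement(),
    which torch.Tensor does.
    """
    return t.element_size() * t.nelement()


def run_iterations(Xg, n_iters, compute_step, threshold_bytes=FOUR_GIB):
    """Hypothetical loop skeleton, not the actual kmeans_plusplus body.

    compute_step(Xg) stands in for the code that builds vexp and dexp.
    Returns how many iterations freed their tensors explicitly.
    """
    # Decide once, outside the loop, whether explicit deletion is worth its cost.
    free_each_iter = tensor_nbytes(Xg) > threshold_bytes
    n_freed = 0
    for _ in range(n_iters):
        vexp, dexp = compute_step(Xg)
        # ... the real loop would use vexp and dexp here ...
        if free_each_iter:
            # Drop the references now, so the memory can be reused before
            # the next iteration allocates new tensors.
            del vexp, dexp
            n_freed += 1
    return n_freed
```

With PyTorch, `Xg` would be the CUDA tensor passed to the clustering routine. For small inputs the check is false and the loop runs without the extra `del`.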

Related to #746, possibly #771

RobertoDF added 2 commits September 3, 2024 17:00

explicit tensors deletion in clustering_qr.kmeans_plusplus

e6d58eb

Add condition dependent on tensor size

d30e519

RobertoDF changed the title ~~Roberto df kmeans plusplus explicit tensors deletion~~ clustering_qr.kmeans_plusplus explicit tensors deletion Sep 3, 2024

RobertoDF closed this Sep 3, 2024

RobertoDF deleted the RobertoDF-kmeans_plusplus_explicit_tensors_deletion branch September 3, 2024 15:36
