Replies: 3 comments
-
I'm trying to replicate on a small example but I'm failing to reproduce. Can you share more about the environment you're working in? Python version, all library versions, OS, etc.? Just as a quick side note, you could try:
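A minimal sketch of that kind of memory-mapped read (the file path is a hypothetical placeholder; only the Python standard library is assumed):

```python
import mmap

# Mapping the file does not copy it into RAM; pages are only faulted in for
# the bytes that are actually read.
with open("model.safetensors", "rb") as f:
    mm = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
    # safetensors layout: the first 8 bytes are a little-endian u64 giving the
    # JSON header length, followed by the header and then the raw tensor data.
    header_len = int.from_bytes(mm[:8], "little")
    print("mapped", mm.size(), "bytes; JSON header is", header_len, "bytes")
    mm.close()
```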
This is what should happen under the hood, and there is zero issue even if the memory mapping exceeds (by far) the available RAM.
-
Wow, rechecked again because I had 2 reports; it seems it's
This forces torch to actually allocate the entire file.
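As a general illustration of the copy-vs-map distinction (hypothetical path; numpy and torch assumed, with np.memmap standing in for the memory-mapped file view):

```python
import numpy as np
import torch

view = np.memmap("model.safetensors", dtype=np.uint8, mode="r")  # mapped view, nothing read yet
lazy = torch.from_numpy(view)   # shares the mapping: no copy, pages fault in on access
eager = torch.tensor(view)      # copies: forces the entire file to be allocated in RAM
```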
-
My VM has 32GB RAM and 2 x Nvidia Tesla V100 32GB.
I'm not able to load a safetensors file larger than my 32GB of system RAM, even though I have 64GB of VRAM available. With a smaller model, it seems like the memory is first allocated in system RAM and the model is only loaded onto my GPUs afterwards.
Is there a way to allocate the memory directly on the GPUs? I want to load safetensors files >32GB and <64GB.
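A sketch of the kind of direct-to-GPU load being asked about (assuming the safetensors Python API; the file path and the naive half/half split across the two V100s are hypothetical):

```python
from safetensors import safe_open

# Sketch: read one tensor at a time and move it straight to a GPU, so host RAM
# only ever holds a single tensor rather than the whole >32GB file.
tensors = {}
with safe_open("model.safetensors", framework="pt", device="cpu") as f:
    names = list(f.keys())
    half = len(names) // 2
    for i, name in enumerate(names):
        target = "cuda:0" if i < half else "cuda:1"   # naive split across the two GPUs
        tensors[name] = f.get_tensor(name).to(target)
```

safe_open also accepts a CUDA device directly (e.g. device="cuda:0") when everything fits on a single card.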