
An ImportError when I run the program "FinGPT_Training_LoRA_with_ChatGLM2_6B_for_Beginners.ipynb" #165

Open
YRookieBoy opened this issue Mar 14, 2024 · 6 comments

YRookieBoy commented Mar 14, 2024

Hi,
When I try to run "FinGPT_Training_LoRA_with_ChatGLM2_6B_for_Beginners.ipynb" in Google Colab, I came across a problem.
The code is:

model_name = "THUDM/chatglm2-6b"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModel.from_pretrained(
    model_name,
    quantization_config=q_config,
    trust_remote_code=True,
    device='cuda'
)

and the error is:

ImportError: Using `load_in_8bit=True` requires Accelerate: `pip install accelerate` and the latest version of bitsandbytes: `pip install -i https://test.pypi.org/simple/ bitsandbytes` or `pip install bitsandbytes`
model = prepare_model_for_int8_training(model, use_gradient_checkpointing=True)

Lastly, I ran the code in Google Colab Pro and I am sure both packages are installed.
Please help me solve the problem, thank you so much!

@YRookieBoy YRookieBoy changed the title Hi, A ImportError when I run the program "FinGPT_Training_LoRA_with_ChatGLM2_6B_for_Beginners.ipynb" Mar 14, 2024
@YRookieBoy YRookieBoy reopened this Mar 14, 2024
@llk010502
Collaborator

Hi, based on my experience, you can try reinstalling these two packages when this error shows up, then restart your kernel and rerun your code. Hope this works.
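For reference, a minimal reinstall sequence in Colab might look like this (versions are not pinned here; adjust as needed):

# Force-reinstall the two packages the error complains about
!pip install --upgrade --force-reinstall accelerate bitsandbytes

# Then restart the runtime (Runtime -> Restart runtime in Colab)
# and rerun the notebook cells from the top.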

@YRookieBoy
Author

Thank you very much! I have already run the code successfully.

@Siddharth-Latthe-07

The error indicates that the necessary packages for 8-bit training, specifically accelerate and bitsandbytes, are either not installed correctly or not recognized by the environment. Here's how you can troubleshoot and resolve the issue:

  1. Ensure the packages are installed correctly.
  2. Restart the runtime: sometimes, after installing new packages, you need to restart the runtime for the changes to take effect.
  3. Check that you have the correct versions and import the packages before the model definition.

Sample snippet:
# Install the necessary packages
!pip install accelerate
!pip install -i https://test.pypi.org/simple/ bitsandbytes

# Restart runtime after installing the packages (manual step in the Colab interface)

# Import the required libraries
from transformers import AutoTokenizer, AutoModel
from accelerate import Accelerator

# Ensure the runtime is using GPU
import torch
device = 'cuda' if torch.cuda.is_available() else 'cpu'

# Load the model with the necessary configuration
model_name = "THUDM/chatglm2-6b"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModel.from_pretrained(
    model_name,
    quantization_config=q_config,
    trust_remote_code=True,
    device=device
)

# Prepare model for 8-bit training
from transformers import prepare_model_for_int8_training
model = prepare_model_for_int8_training(model, use_gradient_checkpointing=True)

Also check the GPU settings; a quick sanity check is shown below.
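For example, something along these lines (assuming a Colab GPU runtime, where nvidia-smi is available):

# Quick GPU sanity check
import torch

print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("Device name:", torch.cuda.get_device_name(0))

# In Colab you can also inspect the GPU with:
# !nvidia-smi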

Hope this helps, let me know of any further updates.
Thanks

@tducharme-brex

Hello @Siddharth-Latthe-07, I tried the above code and got the same error:

`low_cpu_mem_usage` was None, now set to True since model is quantized.
Loading checkpoint shards: 100% 7/7 [00:08<00:00, 1.16s/it]
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-21-3503f94a9a20> in <cell line: 0>()
      3 model_name = "THUDM/chatglm2-6b"
      4 tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
----> 5 model = AutoModel.from_pretrained(
      6         model_name,
      7         quantization_config=q_config,

3 frames
/usr/local/lib/python3.11/dist-packages/transformers/modeling_utils.py in to(self, *args, **kwargs)
   2772         # Checks if the model has been loaded in 8-bit
   2773         if getattr(self, "quantization_method", None) == QuantizationMethod.BITS_AND_BYTES:
-> 2774             raise ValueError(
   2775                 "`.to` is not supported for `4-bit` or `8-bit` bitsandbytes models. Please use the model as it is, since the"
   2776                 " model has already been set to the correct devices and casted to the correct `dtype`."

ValueError: `.to` is not supported for `4-bit` or `8-bit` bitsandbytes models. Please use the model as it is, since the model has already been set to the correct devices and casted to the correct `dtype`.

Any idea how to fix this? I restarted my runtime and it's still not working.

Thanks!

@Siddharth-Latthe-07

@tducharme-brex OK,
I guess the issue is that .to() is not supported for 4-bit or 8-bit bitsandbytes models. This happens when you explicitly try to move the model to a device (device='cuda') after loading it with quantization.

Try removing the explicit device=device argument in AutoModel.from_pretrained(), because BitsAndBytesConfig automatically handles device placement.

Try running this script and let me know:

# Install necessary packages
!pip install --upgrade bitsandbytes accelerate transformers

# Restart runtime after installation

# Import required libraries
from transformers import AutoTokenizer, AutoModel, BitsAndBytesConfig
import torch

# Ensure the runtime is using GPU
device = 'cuda' if torch.cuda.is_available() else 'cpu'
print(f"Using device: {device}")

# Define quantization config for int8
q_config = BitsAndBytesConfig(load_in_8bit=True)

# Load tokenizer
model_name = "THUDM/chatglm2-6b"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)

# Load model (DO NOT specify `device='cuda'` explicitly)
model = AutoModel.from_pretrained(
    model_name,
    quantization_config=q_config,  # This will automatically handle device placement
    trust_remote_code=True,
    device_map="auto"  # Use "auto" to let Hugging Face handle device allocation
)

# DO NOT manually move model to CUDA
# model.to(device)  <-- REMOVE THIS

# If you need to prepare for int8 training, use this
from transformers import prepare_model_for_int8_training
model = prepare_model_for_int8_training(model, use_gradient_checkpointing=True)

print("Model loaded successfully!")

@tducharme-brex

@Siddharth-Latthe-07 I had to change the import, but I got this block of code to work

`from transformers import prepare_model_for_int8_training` was raising an import error, and changing it to `from peft import prepare_model_for_int8_training` still threw an error.
Finally, this got it to work:

from peft import prepare_model_for_kbit_training
model = prepare_model_for_kbit_training(model, use_gradient_checkpointing=True)

Ultimately, I could only get it to work when I changed the model version from THUDM/chatglm2-6b to the newer THUDM/chatglm3-6b. Then everything ran smoothly.
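Putting the pieces that worked together, roughly (just a sketch of what ran for me; it assumes the pip installs from the earlier comments):

from transformers import AutoTokenizer, AutoModel, BitsAndBytesConfig
from peft import prepare_model_for_kbit_training

# 8-bit quantization config, as in the earlier snippet
q_config = BitsAndBytesConfig(load_in_8bit=True)

# Newer checkpoint that ended up working for me
model_name = "THUDM/chatglm3-6b"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModel.from_pretrained(
    model_name,
    quantization_config=q_config,
    trust_remote_code=True,
    device_map="auto",  # let Hugging Face handle device placement
)

# Use the peft helper instead of the removed transformers import
model = prepare_model_for_kbit_training(model, use_gradient_checkpointing=True)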

Thanks!
