Update quant_llama2_hqq_demo.py - remove old quantize()
mobicham authored Nov 21, 2023
1 parent 76619ae commit 7d36c42
Showing 1 changed file with 0 additions and 1 deletion.
1 change: 0 additions & 1 deletion code/llama2_benchmark/quant_llama2_hqq_demo.py
@@ -26,7 +26,6 @@
#quant_config = hqq_base_quant_config(nbits=2, group_size=16)
#quant_config = hqq_base_quant_config(nbits=2, group_size=16, quant_scale=True) #scale is quantized to 8-bit/g=128

-#quantize_model(model, quant_config=quant_config)
LlamaHQQ.quantize_model(model, quant_config=quant_config)

# #Evaluate the quantized model