Commit

docs: update granite3 model support (#441)
Signed-off-by: Anh Uong <[email protected]>
anhuong authored Jan 16, 2025
1 parent 28c3d38 commit d03072b
Showing 1 changed file with 5 additions and 1 deletion.
6 changes: 5 additions & 1 deletion README.md
@@ -200,6 +200,10 @@ For advanced data preprocessing support including mixing and custom preprocessin
Model Name & Size | Model Architecture | Full Finetuning | Low Rank Adaptation (i.e. LoRA) | qLoRA(quantized LoRA) |
-------------------- | ---------------- | --------------- | ------------------------------- | --------------------- |
Granite PowerLM 3B | GraniteForCausalLM | ✅* | ✅* | ✅* |
+Granite 3.1 1B | GraniteForCausalLM | ✔️* | ✔️* | ✔️* |
+Granite 3.1 2B | GraniteForCausalLM | ✔️* | ✔️* | ✔️* |
+Granite 3.1 3B | GraniteForCausalLM | ✔️* | ✔️* | ✔️* |
+Granite 3.1 8B | GraniteForCausalLM | ✔️* | ✔️* | ✔️* |
Granite 3.0 2B | GraniteForCausalLM | ✔️* | ✔️* | ✔️* |
Granite 3.0 8B | GraniteForCausalLM | ✅* | ✅* | ✔️ |
GraniteMoE 1B | GraniteMoeForCausalLM | ✅ | ✅** | ? |
@@ -219,7 +223,7 @@ Mixtral 8x7B | Mixtral | ✅ | ✅ | ✅ |
Mistral-7b | Mistral | ✅ | ✅ | ✅ |  
Mistral large | Mistral | 🚫 | 🚫 | 🚫 |

-(*) - Supported with `fms-hf-tuning` v2.0.1 or later
+(*) - Supported with `fms-hf-tuning` v2.4.0 or later.

(**) - Supported for q,k,v,o layers . `all-linear` target modules does not infer on vLLM yet.
