System Info
It appears trtllm-bench is missing support for moe_ep_size / moe_tp_size.
To evaluate a MoE model's expert parallelism, e.g. as in ref [1], could we get a roadmap update on trtllm-bench support for MoE parallelism?
Alternatively, please clarify how to obtain the tooling / process / scripts used in [1] below.
Thanks
[1]: https://developer.nvidia.com/blog/demystifying-ai-inference-deployments-for-trillion-parameter-large-language-models/
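As a possible interim workaround (this is my assumption based on the checkpoint-conversion flow used by the Mixtral example in the TensorRT-LLM repo, not on trtllm-bench itself), the MoE parallel sizes appear to be configurable at checkpoint-conversion time and the engine can then be built with trtllm-build. A minimal sketch, with paths, dtype, and parallel sizes purely illustrative:

# Workaround sketch (unverified): set MoE parallelism when converting the checkpoint,
# then build the engine with trtllm-build.
python convert_checkpoint.py \   # script from the Mixtral/LLaMA example in the TensorRT-LLM repo; path assumed
  --model_dir ./Mixtral-8x22B-v0.1 \
  --output_dir ./ckpt_mixtral_8gpu \
  --dtype float16 \
  --tp_size 4 \
  --moe_tp_size 1 \
  --moe_ep_size 4   # assuming tp_size must equal moe_tp_size * moe_ep_size
trtllm-build \
  --checkpoint_dir ./ckpt_mixtral_8gpu \
  --output_dir ./engine_mixtral_8gpu \
  --gemm_plugin float16

If trtllm-bench throughput can be pointed at a pre-built engine directory (worth confirming for 0.15.0), this would at least allow measuring the EP/TP split until the build subcommand gains these flags.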
Who can help?
@ncomly-nvidia
Information

Tasks
- An officially supported task in the examples folder (such as GLUE/SQuAD, ...)

Reproduction
trtllm-bench --model mistralai/Mixtral-8x22B-v0.1 build --moe_ep_size 4 --pp_size 2 --quantization FP8 --dataset /home/ubuntu/mistral-8x22b.data
Expected behavior
trtllm-bench supports MoE parallelisms, i.e. the build subcommand accepts --moe_ep_size / --moe_tp_size.
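For concreteness, a hypothetical invocation of the requested interface might look like the following; --moe_tp_size / --moe_ep_size do not exist in the build subcommand today (they are the feature being requested), and the tp_size = moe_tp_size * moe_ep_size relationship is my assumption, mirroring how MoE parallelism is specified elsewhere in TensorRT-LLM:

# Hypothetical desired interface; the MoE flags below are the requested addition, not existing options.
trtllm-bench --model mistralai/Mixtral-8x22B-v0.1 build \
  --tp_size 4 \
  --pp_size 2 \
  --moe_tp_size 1 \
  --moe_ep_size 4 \
  --quantization FP8 \
  --dataset /home/ubuntu/mistral-8x22b.data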
Actual behavior
[TensorRT-LLM] TensorRT-LLM version: 0.15.0
Usage: trtllm-bench build [OPTIONS]
Try 'trtllm-bench build --help' for help.
Error: No such option: --moe_ep_size (Possible options: --max_batch_size, --pp_size, --tp_size)
Additional notes
n/a