Skip to content

Actions: NVIDIA/TensorRT-LLM

auto-assign

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
140 workflow runs
140 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Are multimodal models supported by trtllm-serve?
auto-assign #143: Issue #2714 labeled by nv-guomingz
January 23, 2025 15:19 2s
January 23, 2025 15:19 2s
Are multimodal models supported by trtllm-serve?
auto-assign #142: Issue #2714 labeled by nv-guomingz
January 23, 2025 14:57 51s
January 23, 2025 14:57 51s
January 22, 2025 14:24 47s
January 22, 2025 14:21 2s
January 22, 2025 14:21 48s
how to compile deepseekv3 ?
auto-assign #138: Issue #2711 labeled by nv-guomingz
January 22, 2025 10:25 41s
January 22, 2025 10:25 41s
how to compile deepseekv3 ?
auto-assign #137: Issue #2711 labeled by nv-guomingz
January 22, 2025 10:25 2s
January 22, 2025 10:25 2s
Support for Blackwell and Thor
auto-assign #136: Issue #2710 labeled by nv-guomingz
January 22, 2025 02:31 2s
January 22, 2025 02:31 2s
NVILA support (Qwen2)
auto-assign #135: Issue #2707 labeled by danigarciaoca
January 21, 2025 13:13 2s
January 21, 2025 13:13 2s
What is execution context memory?
auto-assign #134: Issue #2698 labeled by nv-guomingz
January 21, 2025 03:05 2s
January 21, 2025 03:05 2s
convert NVILA with 0.16.0
auto-assign #133: Issue #2706 labeled by nv-guomingz
January 21, 2025 02:57 42s
January 21, 2025 02:57 42s
Wrong outputs with FP8 kv_cache reuse
auto-assign #132: Issue #2699 labeled by nv-guomingz
January 21, 2025 02:44 47s
January 21, 2025 02:44 47s
convert NVILA with 0.16.0
auto-assign #131: Issue #2706 labeled by dzy130120
January 20, 2025 12:25 3s
January 20, 2025 12:25 3s
quantized model using AWQ and lora weights
auto-assign #129: Issue #2703 labeled by nv-guomingz
January 20, 2025 05:27 43s
January 20, 2025 05:27 43s
Support for int2/int3 quantization
auto-assign #128: Issue #2704 labeled by nv-guomingz
January 20, 2025 05:26 43s
January 20, 2025 05:26 43s
SmoothQuant doesn't work with lora
auto-assign #127: Issue #2604 labeled by byshiue
January 20, 2025 03:42 2s
January 20, 2025 03:42 2s
Wrong outputs with FP8 kv_cache reuse
auto-assign #126: Issue #2699 labeled by lishicheng1996
January 16, 2025 11:58 3s
January 16, 2025 11:58 3s
Custom allreduce performance improvement
auto-assign #125: Issue #2696 labeled by nv-guomingz
January 16, 2025 04:05 42s
January 16, 2025 04:05 42s
Failed TensorRT-LLM Benchmark
auto-assign #124: Issue #2694 labeled by maulikmadhavi
January 15, 2025 10:41 3s
January 15, 2025 10:41 3s
0.16.0 Qwen2-72B-Struct SQ error
auto-assign #123: Issue #2693 labeled by gy0514020329
January 15, 2025 07:00 2s
January 15, 2025 07:00 2s
NotImplementedError: Cannot copy out of meta tensor; no data!
auto-assign #122: Issue #2692 labeled by chilljudaoren
January 15, 2025 03:05 3s
January 15, 2025 03:05 3s
January 14, 2025 14:35 1m 7s
internvl-2.5
auto-assign #120: Issue #2686 labeled by nv-guomingz
January 14, 2025 14:31 3s
January 14, 2025 14:31 3s
January 14, 2025 14:29 52s