Skip to content

Actions: NVIDIA/TensorRT-LLM

auto-assign

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
140 workflow runs
140 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

int8 slower than bf16 on A100
auto-assign #18: Issue #2553 labeled by nv-guomingz
December 10, 2024 06:21 55s
December 10, 2024 06:21 55s
lora doesn't work when kv_cache is disabled
auto-assign #16: Issue #2543 labeled by nv-guomingz
December 10, 2024 06:01 41s
December 10, 2024 06:01 41s
[bug] Medusa example fails with vicuna 33B
auto-assign #15: Issue #2478 labeled by nv-guomingz
December 10, 2024 05:52 42s
December 10, 2024 05:52 42s
[QST] How to get the prefill latency and TPOT resepectly when using C++ runtime
auto-assign #14: Issue #2500 labeled by hello-11
December 10, 2024 05:51 1s
December 10, 2024 05:51 1s
December 10, 2024 03:21 3s
int4 not faster than fp16 and fp8
auto-assign #12: Issue #2487 labeled by hello-11
December 10, 2024 03:20 3s
December 10, 2024 03:20 3s
int8 slower than bf16 on A100
auto-assign #11: Issue #2553 labeled by ShuaiShao93
December 9, 2024 23:01 3s
December 9, 2024 23:01 3s
Device CUDA update, caused model to stop running.
auto-assign #9: Issue #2455 labeled by hello-11
December 9, 2024 09:15 3s
December 9, 2024 09:15 3s
Performance issue with long context
auto-assign #8: Issue #2548 labeled by ShuaiShao93
December 6, 2024 21:36 2s
December 6, 2024 21:36 2s
trtllm-bench faild
auto-assign #6: Issue #2545 labeled by dingjingzhen
December 6, 2024 06:24 3s
December 6, 2024 06:24 3s
Encoding error in stream response from Triton server
auto-assign #5: Issue #2544 labeled by Wonder-donbury
December 6, 2024 05:11 3s
December 6, 2024 05:11 3s
lora doesn't work when kv_cache is disabled
auto-assign #4: Issue #2543 labeled by ShuaiShao93
December 5, 2024 20:14 3s
December 5, 2024 20:14 3s
trust_remote_code argument ignored in load_calib_dataset()
auto-assign #3: Issue #2537 labeled by hiroshi-matsuda-rit
December 5, 2024 04:19 2s
December 5, 2024 04:19 2s
Failed to run gptManagerBenchmark on tag v0.15.0
auto-assign #2: Issue #2536 labeled by zhangts20
December 5, 2024 02:20 3s
December 5, 2024 02:20 3s
Issue with converting custom encoder model
auto-assign #1: Issue #2535 labeled by AvivSham
December 4, 2024 21:03 4s
December 4, 2024 21:03 4s