auto-assign

Actions

auto-assign.yml

140 workflow runs

Are multimodal models supported by trtllm-serve? auto-assign #143: Issue #2714 labeled by nv-guomingz

January 23, 2025 15:19

Are multimodal models supported by trtllm-serve? auto-assign #142: Issue #2714 labeled by nv-guomingz

January 23, 2025 14:57

51s

[bug] Encountered an error in forwardAsync function: Assertion failed: mNextBlocks.empty() auto-assign #141: Issue #2708 labeled by nv-guomingz

January 22, 2025 14:24

47s

Speculative Decoding - Draft Target model approach - Having issue with Triton inference Server auto-assign #140: Issue #2709 labeled by nv-guomingz

January 22, 2025 14:21

Speculative Decoding - Draft Target model approach - Having issue with Triton inference Server auto-assign #139: Issue #2709 labeled by nv-guomingz

January 22, 2025 14:21

48s

how to compile deepseekv3 ? auto-assign #138: Issue #2711 labeled by nv-guomingz

January 22, 2025 10:25

41s

how to compile deepseekv3 ? auto-assign #137: Issue #2711 labeled by nv-guomingz

January 22, 2025 10:25

Support for Blackwell and Thor auto-assign #136: Issue #2710 labeled by nv-guomingz

January 22, 2025 02:31

NVILA support (Qwen2) auto-assign #135: Issue #2707 labeled by danigarciaoca

January 21, 2025 13:13

What is execution context memory? auto-assign #134: Issue #2698 labeled by nv-guomingz

January 21, 2025 03:05

convert NVILA with 0.16.0 auto-assign #133: Issue #2706 labeled by nv-guomingz

January 21, 2025 02:57

42s

Wrong outputs with FP8 kv_cache reuse auto-assign #132: Issue #2699 labeled by nv-guomingz

January 21, 2025 02:44

47s

convert NVILA with 0.16.0 auto-assign #131: Issue #2706 labeled by dzy130120

January 20, 2025 12:25

Compared with HF, trt-llm running Qwen forward speeds up only the context phase; generation shows no speedup auto-assign #130: Issue #2705 labeled by nickole2018

January 20, 2025 09:25

quantized model using AWQ and lora weights auto-assign #129: Issue #2703 labeled by nv-guomingz

January 20, 2025 05:27

43s

Support for int2/int3 quantization auto-assign #128: Issue #2704 labeled by nv-guomingz

January 20, 2025 05:26

43s

SmoothQuant doesn't work with lora auto-assign #127: Issue #2604 labeled by byshiue

January 20, 2025 03:42

Wrong outputs with FP8 kv_cache reuse auto-assign #126: Issue #2699 labeled by lishicheng1996

January 16, 2025 11:58

Custom allreduce performance improvement auto-assign #125: Issue #2696 labeled by nv-guomingz

January 16, 2025 04:05

42s

Failed TensorRT-LLM Benchmark auto-assign #124: Issue #2694 labeled by maulikmadhavi

January 15, 2025 10:41

0.16.0 Qwen2-72B-Struct SQ error auto-assign #123: Issue #2693 labeled by gy0514020329

January 15, 2025 07:00

NotImplementedError: Cannot copy out of meta tensor; no data! auto-assign #122: Issue #2692 labeled by chilljudaoren

January 15, 2025 03:05

Deepseek-v3 int4 weight only inference outputs garbage words with TP 8 on nvidia H20 GPU auto-assign #121: Issue #2683 labeled by nv-guomingz

January 14, 2025 14:35

1m 7s

internvl-2.5 auto-assign #120: Issue #2686 labeled by nv-guomingz

January 14, 2025 14:31

Multi-LoRA cpp inference error: Assertion failed: lora_weights has to few values for attn_k auto-assign #119: Issue #2687 labeled by nv-guomingz

January 14, 2025 14:29

52s

Actions: NVIDIA/TensorRT-LLM