Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

trtllm-serve without any output Qwne2.5-7b #2667

Open
1 of 4 tasks
Justin-12138 opened this issue Jan 8, 2025 · 1 comment
Open
1 of 4 tasks

trtllm-serve without any output Qwne2.5-7b #2667

Justin-12138 opened this issue Jan 8, 2025 · 1 comment
Labels
bug Something isn't working OpenAI API

Comments

@Justin-12138
Copy link

Justin-12138 commented Jan 8, 2025

System Info

-CPU: Intel Xeon Platinum 8352V (144) @ 3.500GHz X86
-Memory: 1031689MiB
-GPU:RTX-4090*8
-Librarys
tensorrt 10.7.0
tensorrt_cu12 10.7.0
tensorrt-cu12-bindings 10.7.0
tensorrt-cu12-libs 10.7.0
tensorrt-llm 0.16.0
nvidia driver version
Driver Version: 550.135 CUDA Version: 12.4
OS Ubuntu 22.04.5 LTS x86_64

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

I ran the trtllm-serve comman like:
trtllm-serve /home/lz/tensorrt/build/Qwen2.5-7B-Instructtrt_engines/weight_only/1-gpu
--tokenizer /home/lz/tensorrt/models/Qwen2.5-7B-Instruct
--max_batch_size 128 --max_num_tokens 4096 --max_seq_len 4096
--kv_cache_free_gpu_memory_fraction 0.95

But there is no output except the:
[TensorRT-LLM] TensorRT-LLM version: 0.16.0

No errors,no warnings no Port occupation
Image

But it ran well with the test:
python3 /home/lz/TensorRT-LLM/examples/run.py --input_text "你好,请问你叫什么?"
--max_output_len=50
--tokenizer_dir /home/lz/tensorrt/models/Qwen2.5-7B-Instruct
--engine_dir=/home/lz/tensorrt/build/Qwen2.5-7B-Instructtrt_engines/weight_only/1-gpu
Image

What Can I do to run an OpenAI API compatible server

Expected behavior

Does it should output somemore info?

actual behavior

Nnothing but version

additional notes

Is that a problem with Qwen2.5-7b
I 'd appreciate if you guys could give me some help

@Justin-12138 Justin-12138 added the bug Something isn't working label Jan 8, 2025
@nv-guomingz
Copy link
Collaborator

Hi @LinPoly would u please take a look this issue?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working OpenAI API
Projects
None yet
Development

No branches or pull requests

2 participants