Verification check for model - VLLM #10

anandhu-eng · 2024-07-17T09:28:39Z

For testing, vllm server was up with the command:

cm run script --tags=run,vllm-server --model=NousResearch/Hermes-2-Theta-Llama-3-8B --api_key=""

The client script:

cm run script --tags=run-mlperf,inference,_full --model=llama2-70b-99.9 --implementation=reference --device=cpu --quiet --api_server=http://0.0.0.0:8000 --adr.mlperf-implementation.tags=_repo.https://github.com/gateoverflow/inference --rerun

anandhu-eng added 2 commits July 17, 2024 14:52

Added model name verification with server

802374b

clean temp files

44ae1d9

arjunsuresh approved these changes Jul 17, 2024

View reviewed changes

arjunsuresh merged commit 93b5d64 into GATEOverflow:master Jul 17, 2024
1 check failed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verification check for model - VLLM #10

Verification check for model - VLLM #10

anandhu-eng commented Jul 17, 2024 •

edited

Loading

Verification check for model - VLLM #10

Verification check for model - VLLM #10

Conversation

anandhu-eng commented Jul 17, 2024 • edited Loading

anandhu-eng commented Jul 17, 2024 •

edited

Loading