Comparison benchmarks? #138

tripathiarpan20 · 2023-07-19T10:34:33Z

Hi,
Thanks for open-sourcing the code.

I was wondering how it compares in terms of throughput with existing inference frameworks like https://github.com/huggingface/text-generation-inference and https://github.com/vllm-project/vllm , do we have any benchmarks?

rkaplan · 2023-07-19T21:21:07Z

Thanks for the request — we will be sure to add some benchmarks. cc @yixu34

Under the hood, the inference serving component is handled by HF Text Generation Inference, so the inference throughput should be similar or equivalent to that library.

rkaplan assigned yixu34 Jul 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comparison benchmarks? #138

Comparison benchmarks? #138

tripathiarpan20 commented Jul 19, 2023

rkaplan commented Jul 19, 2023

Comparison benchmarks? #138

Comparison benchmarks? #138

Comments

tripathiarpan20 commented Jul 19, 2023

rkaplan commented Jul 19, 2023