-
Notifications
You must be signed in to change notification settings - Fork 10
Commits on Jul 1, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 54348ea - Browse repository at this point
Copy the full SHA 54348eaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5ae4430 - Browse repository at this point
Copy the full SHA 5ae4430View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1601d82 - Browse repository at this point
Copy the full SHA 1601d82View commit details -
Configuration menu - View commit details
-
Copy full SHA for 20faeb6 - Browse repository at this point
Copy the full SHA 20faeb6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 62ecd68 - Browse repository at this point
Copy the full SHA 62ecd68View commit details -
Configuration menu - View commit details
-
Copy full SHA for ca8bc83 - Browse repository at this point
Copy the full SHA ca8bc83View commit details -
[ci] Remove aws template (vllm-project#5757)
Signed-off-by: kevin <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 91b2d1d - Browse repository at this point
Copy the full SHA 91b2d1dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 21450bc - Browse repository at this point
Copy the full SHA 21450bcView commit details -
[Speculative Decoding] Support draft model on different tensor-parall…
…el size than target model (vllm-project#5414)
Configuration menu - View commit details
-
Copy full SHA for 1d55e23 - Browse repository at this point
Copy the full SHA 1d55e23View commit details -
Configuration menu - View commit details
-
Copy full SHA for 980c10b - Browse repository at this point
Copy the full SHA 980c10bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3b261da - Browse repository at this point
Copy the full SHA 3b261daView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8d6c12f - Browse repository at this point
Copy the full SHA 8d6c12fView commit details -
Configuration menu - View commit details
-
Copy full SHA for c3bc8c6 - Browse repository at this point
Copy the full SHA c3bc8c6View commit details -
Configuration menu - View commit details
-
Copy full SHA for a9e34b9 - Browse repository at this point
Copy the full SHA a9e34b9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 21f69d1 - Browse repository at this point
Copy the full SHA 21f69d1View commit details -
Configuration menu - View commit details
-
Copy full SHA for ece7c7f - Browse repository at this point
Copy the full SHA ece7c7fView commit details -
Configuration menu - View commit details
-
Copy full SHA for e6935bd - Browse repository at this point
Copy the full SHA e6935bdView commit details -
[CI/Build] Add E2E tests for MLPSpeculator (vllm-project#5791)
Signed-off-by: Thomas Parnell <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 81a21d2 - Browse repository at this point
Copy the full SHA 81a21d2View commit details -
Configuration menu - View commit details
-
Copy full SHA for f9775e9 - Browse repository at this point
Copy the full SHA f9775e9View commit details -
[Core] Refactor Worker and ModelRunner to consolidate control plane c…
…ommunication (vllm-project#5408) Signed-off-by: Stephanie Wang <[email protected]> Signed-off-by: Stephanie <[email protected]> Co-authored-by: Stephanie <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for fb41934 - Browse repository at this point
Copy the full SHA fb41934View commit details -
Configuration menu - View commit details
-
Copy full SHA for ce9da79 - Browse repository at this point
Copy the full SHA ce9da79View commit details -
Configuration menu - View commit details
-
Copy full SHA for cb364ef - Browse repository at this point
Copy the full SHA cb364efView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9744700 - Browse repository at this point
Copy the full SHA 9744700View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2f7eba7 - Browse repository at this point
Copy the full SHA 2f7eba7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 74952fd - Browse repository at this point
Copy the full SHA 74952fdView commit details -
[Kernel] Adding bias epilogue support for
cutlass_scaled_mm
(vllm-p……roject#5560) Co-authored-by: Chih-Chieh-Yang <[email protected]> Co-authored-by: Lucas Wilkinson <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1d1929b - Browse repository at this point
Copy the full SHA 1d1929bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5095252 - Browse repository at this point
Copy the full SHA 5095252View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1653293 - Browse repository at this point
Copy the full SHA 1653293View commit details -
Configuration menu - View commit details
-
Copy full SHA for e423b2c - Browse repository at this point
Copy the full SHA e423b2cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 698f968 - Browse repository at this point
Copy the full SHA 698f968View commit details -
Configuration menu - View commit details
-
Copy full SHA for 182cdaa - Browse repository at this point
Copy the full SHA 182cdaaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 750539c - Browse repository at this point
Copy the full SHA 750539cView commit details -
[BugFix] Fix cuda graph for MLPSpeculator (vllm-project#5875)
Co-authored-by: Abhinav Goyal <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 7823612 - Browse repository at this point
Copy the full SHA 7823612View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0844ba8 - Browse repository at this point
Copy the full SHA 0844ba8View commit details -
[VLM][Bugfix] Make sure that
multi_modal_kwargs
is broadcasted prop……erly (vllm-project#5880) Signed-off-by: Xiaowei Jiang <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5855a8e - Browse repository at this point
Copy the full SHA 5855a8eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2102a46 - Browse repository at this point
Copy the full SHA 2102a46View commit details -
Configuration menu - View commit details
-
Copy full SHA for f483510 - Browse repository at this point
Copy the full SHA f483510View commit details -
Configuration menu - View commit details
-
Copy full SHA for 684c441 - Browse repository at this point
Copy the full SHA 684c441View commit details -
Configuration menu - View commit details
-
Copy full SHA for dcb8246 - Browse repository at this point
Copy the full SHA dcb8246View commit details -
Configuration menu - View commit details
-
Copy full SHA for db62aa3 - Browse repository at this point
Copy the full SHA db62aa3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0c7ef70 - Browse repository at this point
Copy the full SHA 0c7ef70View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6e594ee - Browse repository at this point
Copy the full SHA 6e594eeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 81ddde3 - Browse repository at this point
Copy the full SHA 81ddde3View commit details -
Configuration menu - View commit details
-
Copy full SHA for c1d4964 - Browse repository at this point
Copy the full SHA c1d4964View commit details -
Configuration menu - View commit details
-
Copy full SHA for 209a147 - Browse repository at this point
Copy the full SHA 209a147View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4d5e0b9 - Browse repository at this point
Copy the full SHA 4d5e0b9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5f1316e - Browse repository at this point
Copy the full SHA 5f1316eView commit details -
[VLM][BugFix] Make sure that
multi_modal_kwargs
can broadcast prope……rly with ring buffer. (vllm-project#5905) Signed-off-by: Xiaowei Jiang <[email protected]> Co-authored-by: Roger Wang <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 74bf88f - Browse repository at this point
Copy the full SHA 74bf88fView commit details -
Configuration menu - View commit details
-
Copy full SHA for f177c04 - Browse repository at this point
Copy the full SHA f177c04View commit details -
[Core] Registry for processing model inputs (vllm-project#5214)
Co-authored-by: ywang96 <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 70af85d - Browse repository at this point
Copy the full SHA 70af85dView commit details -
Configuration menu - View commit details
-
Copy full SHA for fd59ff4 - Browse repository at this point
Copy the full SHA fd59ff4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2e67191 - Browse repository at this point
Copy the full SHA 2e67191View commit details -
[Bugfix] Better error message for MLPSpeculator when `num_speculative…
…_tokens` is set too high (vllm-project#5894) Signed-off-by: Thomas Parnell <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0d4c0c6 - Browse repository at this point
Copy the full SHA 0d4c0c6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1ce7d18 - Browse repository at this point
Copy the full SHA 1ce7d18View commit details -
[Distributed] Make it clear that % should not be in tensor dict keys. (…
…vllm-project#5927) Signed-off-by: Xiaowei Jiang <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 4b9894c - Browse repository at this point
Copy the full SHA 4b9894cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6664f2a - Browse repository at this point
Copy the full SHA 6664f2aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 42cdb40 - Browse repository at this point
Copy the full SHA 42cdb40View commit details -
[ Misc ] Remove
fp8_shard_indexer
from Col/Row Parallel Linear (Sim……plify Weight Loading) (vllm-project#5928) Co-authored-by: Robert Shaw <rshaw@neuralmagic>
Configuration menu - View commit details
-
Copy full SHA for 7c1515e - Browse repository at this point
Copy the full SHA 7c1515eView commit details -
[ Bugfix ] Enabling Loading Models With Fused QKV/MLP on Disk with FP8 (
vllm-project#5921) Co-authored-by: Robert Shaw <rshaw@neuralmagic>
Configuration menu - View commit details
-
Copy full SHA for 9598197 - Browse repository at this point
Copy the full SHA 9598197View commit details -
Support Deepseek-V2 (vllm-project#4650)
Co-authored-by: Philipp Moritz <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3441c30 - Browse repository at this point
Copy the full SHA 3441c30View commit details -
Configuration menu - View commit details
-
Copy full SHA for a5ef790 - Browse repository at this point
Copy the full SHA a5ef790View commit details -
Configuration menu - View commit details
-
Copy full SHA for ccd94db - Browse repository at this point
Copy the full SHA ccd94dbView commit details -
[Bugfix] Fix Engine Failing After Invalid Request - AsyncEngineDeadEr…
…ror (vllm-project#5963) Co-authored-by: Robert Shaw <rshaw@neuralmagic>
Configuration menu - View commit details
-
Copy full SHA for f49047a - Browse repository at this point
Copy the full SHA f49047aView commit details -
[Kernel] Flashinfer for prefill & decode, with Cudagraph support for …
…decode (vllm-project#4628) Co-authored-by: LiuXiaoxuanPKU <[email protected]>, bong-furiosa <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for eeb9d99 - Browse repository at this point
Copy the full SHA eeb9d99View commit details -
Configuration menu - View commit details
-
Copy full SHA for 026b28e - Browse repository at this point
Copy the full SHA 026b28eView commit details -
Configuration menu - View commit details
-
Copy full SHA for f281c2e - Browse repository at this point
Copy the full SHA f281c2eView commit details -
Configuration menu - View commit details
-
Copy full SHA for b89416e - Browse repository at this point
Copy the full SHA b89416eView commit details -
[Misc] Extend vLLM Metrics logging API (vllm-project#5925)
Co-authored-by: Antoni Baum <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for acf1f76 - Browse repository at this point
Copy the full SHA acf1f76View commit details -
[Kernel] Add punica dimensions for Granite 3b and 8b (vllm-project#5930)
Signed-off-by: Joe Runde <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b9acdae - Browse repository at this point
Copy the full SHA b9acdaeView commit details -
Configuration menu - View commit details
-
Copy full SHA for aa72bdc - Browse repository at this point
Copy the full SHA aa72bdcView commit details -
[Misc] Update Phi-3-Vision Example (vllm-project#5981)
Co-authored-by: Cyrus Leung <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 33fecd4 - Browse repository at this point
Copy the full SHA 33fecd4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 00f60d2 - Browse repository at this point
Copy the full SHA 00f60d2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 270105d - Browse repository at this point
Copy the full SHA 270105dView commit details -
[Kernel] Raise an exception in MoE kernel if the batch size is larger…
… then 65k (vllm-project#5939)
Configuration menu - View commit details
-
Copy full SHA for b22f1be - Browse repository at this point
Copy the full SHA b22f1beView commit details -
[ CI/Build ] Added E2E Test For Compressed Tensors (vllm-project#5839)
Co-authored-by: Michael Goin <[email protected]> Co-authored-by: Robert Shaw <rshaw@neuralmagic>
Configuration menu - View commit details
-
Copy full SHA for aa49ffe - Browse repository at this point
Copy the full SHA aa49ffeView commit details -
Configuration menu - View commit details
-
Copy full SHA for b481fe3 - Browse repository at this point
Copy the full SHA b481fe3View commit details -
[ CI/Build ] LM Eval Harness Based CI Testing (vllm-project#5838)
Co-authored-by: Robert Shaw <rshaw@neuralmagic>
Configuration menu - View commit details
-
Copy full SHA for 47407b7 - Browse repository at this point
Copy the full SHA 47407b7View commit details -
[Bugfix][CI/Build][Hardware][AMD] Install matching torchvision to fix…
… AMD tests (vllm-project#5949)
Configuration menu - View commit details
-
Copy full SHA for 3d215cc - Browse repository at this point
Copy the full SHA 3d215ccView commit details -
Configuration menu - View commit details
-
Copy full SHA for d0b7111 - Browse repository at this point
Copy the full SHA d0b7111View commit details -
Configuration menu - View commit details
-
Copy full SHA for 445b0d3 - Browse repository at this point
Copy the full SHA 445b0d3View commit details -
Configuration menu - View commit details
-
Copy full SHA for cea9f6b - Browse repository at this point
Copy the full SHA cea9f6bView commit details -
[ci][distributed] fix device count call
[ci][distributed] fix some cuda init that makes it necessary to use spawn (vllm-project#5991)
Configuration menu - View commit details
-
Copy full SHA for 4f7381a - Browse repository at this point
Copy the full SHA 4f7381aView commit details -
[Frontend]: Support base64 embedding (vllm-project#5935)
Co-authored-by: Cyrus Leung <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3ceed36 - Browse repository at this point
Copy the full SHA 3ceed36View commit details -
[Lora] Use safetensor keys instead of adapter_config.json to find une…
…xpected modules. (vllm-project#5909) Co-authored-by: sang <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 51f3e3f - Browse repository at this point
Copy the full SHA 51f3e3fView commit details -
[ CI ] Temporarily Disable Large LM-Eval Tests (vllm-project#6005)
Co-authored-by: [email protected] <rshaw@neuralmagic>
Configuration menu - View commit details
-
Copy full SHA for 9c74b00 - Browse repository at this point
Copy the full SHA 9c74b00View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4153e58 - Browse repository at this point
Copy the full SHA 4153e58View commit details -
[ Misc ] Refactor w8a8 to use
process_weights_after_load
(Simplify ……Weight Loading) (vllm-project#5940) Co-authored-by: Robert Shaw <rshaw@neuralmagic>
Configuration menu - View commit details
-
Copy full SHA for 27a711a - Browse repository at this point
Copy the full SHA 27a711aView commit details -
2
Configuration menu - View commit details
-
Copy full SHA for 53655b2 - Browse repository at this point
Copy the full SHA 53655b2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 07abe05 - Browse repository at this point
Copy the full SHA 07abe05View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0f0fec4 - Browse repository at this point
Copy the full SHA 0f0fec4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1cc7c46 - Browse repository at this point
Copy the full SHA 1cc7c46View commit details -
Configuration menu - View commit details
-
Copy full SHA for a699814 - Browse repository at this point
Copy the full SHA a699814View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9a4be7f - Browse repository at this point
Copy the full SHA 9a4be7fView commit details -
1
Configuration menu - View commit details
-
Copy full SHA for b4eec34 - Browse repository at this point
Copy the full SHA b4eec34View commit details -
Configuration menu - View commit details
-
Copy full SHA for 08dedd5 - Browse repository at this point
Copy the full SHA 08dedd5View commit details -
[Speculative Decoding 2/2 ] Integrate typical acceptance sampler into…
… Spec Decode Worker (vllm-project#5348)
Configuration menu - View commit details
-
Copy full SHA for dac4bb3 - Browse repository at this point
Copy the full SHA dac4bb3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 87a4288 - Browse repository at this point
Copy the full SHA 87a4288View commit details -
Configuration menu - View commit details
-
Copy full SHA for 81e1c3e - Browse repository at this point
Copy the full SHA 81e1c3eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2c3c43b - Browse repository at this point
Copy the full SHA 2c3c43bView commit details -
[doc][misc] further lower visibility of simple api server (vllm-proje…
…ct#6041) Co-authored-by: Simon Mo <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for cf4e758 - Browse repository at this point
Copy the full SHA cf4e758View commit details -
[Bugfix] Use RayActorError for older versions of Ray in RayTokenizerG…
…roupPool (vllm-project#6039)
Configuration menu - View commit details
-
Copy full SHA for 9c7608c - Browse repository at this point
Copy the full SHA 9c7608cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1b7245f - Browse repository at this point
Copy the full SHA 1b7245fView commit details -
Configuration menu - View commit details
-
Copy full SHA for fa05042 - Browse repository at this point
Copy the full SHA fa05042View commit details -
1
Configuration menu - View commit details
-
Copy full SHA for 484a2e3 - Browse repository at this point
Copy the full SHA 484a2e3View commit details
Commits on Jul 2, 2024
-
1
Configuration menu - View commit details
-
Copy full SHA for 99f1474 - Browse repository at this point
Copy the full SHA 99f1474View commit details -
2
Configuration menu - View commit details
-
Copy full SHA for afb93b9 - Browse repository at this point
Copy the full SHA afb93b9View commit details -
1
Configuration menu - View commit details
-
Copy full SHA for fcb4dd3 - Browse repository at this point
Copy the full SHA fcb4dd3View commit details -
Configuration menu - View commit details
-
Copy full SHA for ceaf019 - Browse repository at this point
Copy the full SHA ceaf019View commit details -
Configuration menu - View commit details
-
Copy full SHA for 655389d - Browse repository at this point
Copy the full SHA 655389dView commit details -
2
Configuration menu - View commit details
-
Copy full SHA for 206af82 - Browse repository at this point
Copy the full SHA 206af82View commit details
Commits on Jul 3, 2024
-
Configuration menu - View commit details
-
Copy full SHA for cd2aa72 - Browse repository at this point
Copy the full SHA cd2aa72View commit details -
Configuration menu - View commit details
-
Copy full SHA for f43cb06 - Browse repository at this point
Copy the full SHA f43cb06View commit details -
1
Configuration menu - View commit details
-
Copy full SHA for 7a45bfa - Browse repository at this point
Copy the full SHA 7a45bfaView commit details