This repository has been archived by the owner on Oct 11, 2024. It is now read-only.
forked from vllm-project/vllm
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Upstream sync 2024 06 08 (#288) - ties to v0.4.3 of vllm-upstream

SUMMARY:
* Merge commits from vllm-project@f68470e to vllm-project@1197e02
* Our GCP test instances do not have `gcc` or `clang` installed. All of the triton kernels rely on the `gcc` and `clang` to generate JITs. I disabled these for now, but we need to get these installed (cc @andy-neuma). All are marked with:
```python
@pytest.mark.skip("C compiler not installed in NM automation. "
                  "This codepath follows a triton pathway, which "
                  "JITs using clang or gcc. Since neither are installed "
                  "in our test instances, we need to skip this for now.")
```
* Cherry-picked in the changes associated with Fp8 weight format from @mgoin

Note that vllm-project@f68470e is NOT included in this merge.

COMPARE vs UPSTREAM:
* https://github.com/neuralmagic/nm-vllm/compare/upstream-sync-2024-06-08..vllm-project:vllm:v0.4.3

---------

Signed-off-by: kerthcet <[email protected]>
Signed-off-by: Muralidhar Andoorveedu <[email protected]>
Signed-off-by: pandyamarut <[email protected]>
Co-authored-by: Alexander Matveev <[email protected]>
Co-authored-by: Woosuk Kwon <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Wenwei Zhang <[email protected]>
Co-authored-by: Alexei-V-Ivanov-AMD <[email protected]>
Co-authored-by: Alexey Kondratiev <[email protected]>
Co-authored-by: Mor Zusman <[email protected]>
Co-authored-by: Mor Zusman <[email protected]>
Co-authored-by: Aurick Qiao <[email protected]>
Co-authored-by: Kuntai Du <[email protected]>
Co-authored-by: Antoni Baum <[email protected]>
Co-authored-by: HUANG Fei <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Simon Mo <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: Kante Yin <[email protected]>
Co-authored-by: sasha0552 <[email protected]>
Co-authored-by: SangBin Cho <[email protected]>
Co-authored-by: Tyler Michael Smith <[email protected]>
Co-authored-by: Cody Yu <[email protected]>
Co-authored-by: raywanb <[email protected]>
Co-authored-by: Nick Hill <[email protected]>
Co-authored-by: Philipp Moritz <[email protected]>
Co-authored-by: Letian Li <[email protected]>
Co-authored-by: Murali Andoorveedu <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Elisei Smirnov <[email protected]>
Co-authored-by: Elisei Smirnov <[email protected]>
Co-authored-by: youkaichao <[email protected]>
Co-authored-by: leiwen83 <[email protected]>
Co-authored-by: Lei Wen <[email protected]>
Co-authored-by: Eric Xihui Lin <[email protected]>
Co-authored-by: beagleski <[email protected]>
Co-authored-by: bapatra <[email protected]>
Co-authored-by: Barun Patra <[email protected]>
Co-authored-by: Lily Liu <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Co-authored-by: Zhuohan Li <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Michał Moskal <[email protected]>
Co-authored-by: Ruth Evans <[email protected]>
Co-authored-by: Divakar Verma <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Co-authored-by: Junichi Sato <[email protected]>
Co-authored-by: Marut Pandya <[email protected]>
Co-authored-by: afeldman-nm <[email protected]>
Co-authored-by: Ronen Schaffer <[email protected]>
Co-authored-by: Itay Etelis <[email protected]>
Co-authored-by: omkar kakarparthi <[email protected]>
Co-authored-by: Alexei V. Ivanov <[email protected]>
Co-authored-by: Breno Faria <[email protected]>
Co-authored-by: Breno Faria <[email protected]>
Co-authored-by: Hyunsung Lee <[email protected]>
Co-authored-by: Chansung Park <[email protected]>
Co-authored-by: SnowDist <[email protected]>
Co-authored-by: functionxu123 <[email protected]>
Co-authored-by: xuhao <[email protected]>
1 parent a7a385b commit db9ed90
Showing 290 changed files with 17,573 additions and 9,003 deletions.
@@ -1,7 +1,7 @@
 import os
 import zipfile

-MAX_SIZE_MB = 150
+MAX_SIZE_MB = 200


 def print_top_10_largest_files(zip_file):
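The hunk above raises the size limit from 150 MB to 200 MB. For readers who want a feel for what such a check might look like, here is a minimal sketch. Only `MAX_SIZE_MB` and the name `print_top_10_largest_files` come from the diff; the function bodies and the helpers `largest_files` and `check_wheel_size` are assumptions made for illustration, not the repository's actual script.

```python
import os
import zipfile

MAX_SIZE_MB = 200  # limit taken from the diff above


def largest_files(zip_file, n=10):
    """Return (name, uncompressed size in MB) for the n largest archive members.

    Hypothetical helper, not part of the original script.
    """
    with zipfile.ZipFile(zip_file) as z:
        sizes = [(info.filename, info.file_size / (1024 * 1024))
                 for info in z.infolist()]
    return sorted(sizes, key=lambda item: item[1], reverse=True)[:n]


def print_top_10_largest_files(zip_file):
    # The name appears in the diff; this body is an assumed implementation.
    for name, size_mb in largest_files(zip_file, n=10):
        print(f"{name}: {size_mb:.2f} MB")


def check_wheel_size(path, max_size_mb=MAX_SIZE_MB):
    """Return True if the file at `path` is no larger than `max_size_mb`.

    Hypothetical helper: compares the on-disk size of the archive to the limit
    and lists the largest members when the limit is exceeded.
    """
    size_mb = os.path.getsize(path) / (1024 * 1024)
    if size_mb > max_size_mb:
        print(f"{path} is {size_mb:.2f} MB, above the {max_size_mb} MB limit")
        print_top_10_largest_files(path)
        return False
    return True
```

A wheel is a zip archive, so `zipfile` can list its members directly; the limit is compared against the on-disk (compressed) size, while the per-member listing uses uncompressed sizes.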
@@ -0,0 +1,26 @@
+BasedOnStyle: Google
+UseTab: Never
+IndentWidth: 2
+ColumnLimit: 80
+
+# Force pointers to the type for C++.
+DerivePointerAlignment: false
+PointerAlignment: Left
+
+# Reordering #include statements can (and currently will) introduce errors
+SortIncludes: false
+
+# Style choices
+AlignConsecutiveAssignments: false
+AlignConsecutiveDeclarations: false
+IndentPPDirectives: BeforeHash
+
+IncludeCategories:
+  - Regex: '^<'
+    Priority: 4
+  - Regex: '^"(llvm|llvm-c|clang|clang-c|mlir|mlir-c)/'
+    Priority: 3
+  - Regex: '^"(qoda|\.\.)/'
+    Priority: 2
+  - Regex: '.*'
+    Priority: 1
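To illustrate how the `IncludeCategories` list above would classify includes, the sketch below mimics the lookup in Python. It assumes the behavior described in the clang-format documentation: the regex is matched against the include name together with its `<>` or quotes, and the first matching entry determines the priority. The function name `include_priority` and the sample include paths are hypothetical. Since the config sets `SortIncludes: false`, these categories presumably do not cause any reordering here.

```python
import re

# (regex, priority) pairs copied from the IncludeCategories list, in order.
INCLUDE_CATEGORIES = [
    (r'^<', 4),
    (r'^"(llvm|llvm-c|clang|clang-c|mlir|mlir-c)/', 3),
    (r'^"(qoda|\.\.)/', 2),
    (r'.*', 1),
]


def include_priority(include):
    """Return the priority of the first category whose regex matches.

    `include` is the include target with its delimiters, e.g. '<vector>'.
    Returns None if nothing matches (unreachable here because of '.*').
    """
    for pattern, priority in INCLUDE_CATEGORIES:
        if re.search(pattern, include):
            return priority
    return None


if __name__ == "__main__":
    # Sample include targets are made up for illustration.
    for inc in ['<vector>', '"llvm/ADT/StringRef.h"', '"../cache.h"', '"cache.h"']:
        print(inc, include_priority(inc))
```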
@@ -0,0 +1,42 @@
+name: clang-format
+
+on:
+  # Trigger the workflow on push or pull request,
+  # but only for the main branch
+  push:
+    branches:
+      - main
+  pull_request:
+    branches:
+      - main
+
+jobs:
+  clang-format:
+    runs-on: ubuntu-latest
+    strategy:
+      matrix:
+        python-version: ["3.11"]
+    steps:
+    - uses: actions/checkout@v2
+    - name: Set up Python ${{ matrix.python-version }}
+      uses: actions/setup-python@v2
+      with:
+        python-version: ${{ matrix.python-version }}
+    - name: Install dependencies
+      run: |
+        python -m pip install --upgrade pip
+        pip install clang-format==18.1.5
+    - name: Running clang-format
+      run: |
+        EXCLUDES=(
+            'csrc/moe/topk_softmax_kernels.cu'
+            'csrc/punica/bgmv/bgmv_bf16_bf16_bf16.cu'
+            'csrc/punica/bgmv/bgmv_config.h'
+            'csrc/punica/bgmv/bgmv_impl.cuh'
+            'csrc/punica/bgmv/vec_dtypes.cuh'
+            'csrc/punica/punica_ops.cu'
+            'csrc/punica/type_convert.h'
+        )
+        find csrc/ \( -name '*.h' -o -name '*.cpp' -o -name '*.cu' -o -name '*.cuh' \) -print \
+            | grep -vFf <(printf "%s\n" "${EXCLUDES[@]}") \
+            | xargs clang-format --dry-run --Werror
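The final step of the workflow above selects source files under `csrc/`, drops the excluded paths, and hands the rest to `clang-format --dry-run --Werror`. For anyone who wants to reproduce the selection locally without bash process substitution, here is a rough Python equivalent. The function names `files_to_check` and `run_clang_format` are hypothetical; the suffixes, exclusion list and clang-format flags are copied from the workflow. Like `grep -vF`, the filter treats each exclusion as a fixed string and drops any path that contains it.

```python
import subprocess
from pathlib import Path

SUFFIXES = {".h", ".cpp", ".cu", ".cuh"}

EXCLUDES = [
    "csrc/moe/topk_softmax_kernels.cu",
    "csrc/punica/bgmv/bgmv_bf16_bf16_bf16.cu",
    "csrc/punica/bgmv/bgmv_config.h",
    "csrc/punica/bgmv/bgmv_impl.cuh",
    "csrc/punica/bgmv/vec_dtypes.cuh",
    "csrc/punica/punica_ops.cu",
    "csrc/punica/type_convert.h",
]


def files_to_check(root="csrc"):
    """Return the paths under `root` that the workflow would format-check."""
    selected = []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file() or path.suffix not in SUFFIXES:
            continue
        text = path.as_posix()
        # Mirror `grep -vF`: skip paths containing any excluded fixed string.
        if any(excluded in text for excluded in EXCLUDES):
            continue
        selected.append(text)
    return selected


def run_clang_format(files):
    """Run clang-format in check-only mode; return its exit code.

    Requires clang-format on PATH (the workflow pins 18.1.5 via pip).
    """
    if not files:
        return 0
    result = subprocess.run(["clang-format", "--dry-run", "--Werror", *files])
    return result.returncode
```

Calling `run_clang_format(files_to_check())` from the repository root should then approximate the CI check: a nonzero exit code means at least one file is not formatted according to `.clang-format`.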