-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Code sync for upstream v0.14.0 #421
Code sync for upstream v0.14.0 #421
Commits on Jun 21, 2024
-
Propagate
trust_remote_code
flag throughout vLLM startup (kserve#3729)* propagate trc bool across vllm init Signed-off-by: Calvin Woo <[email protected]> Signed-off-by: calvin d. woo <[email protected]> * use args directly to avoid undefined var Signed-off-by: Calvin Woo <[email protected]> Signed-off-by: calvin d. woo <[email protected]> * Remove trailing space Signed-off-by: Dan Sun <[email protected]> Signed-off-by: calvin d. woo <[email protected]> * move params to newline Signed-off-by: calvin d. woo <[email protected]> --------- Signed-off-by: Calvin Woo <[email protected]> Signed-off-by: calvin d. woo <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 56a4e7d - Browse repository at this point
Copy the full SHA 56a4e7dView commit details
Commits on Jun 24, 2024
-
Fix dead links on PyPI (kserve#3754)
The KServe Python SDK README.md uses relative URLs that work well on GitHub but return a 404 error when visited on PyPI. This change updates the README.md to use absolute URLs that work well on both GitHub and PyPI. Signed-off-by: kevinbazira <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 352e030 - Browse repository at this point
Copy the full SHA 352e030View commit details -
Fix model is ready even if there is no model (kserve#3275)
check empty model final. Signed-off-by: HAO <[email protected]> Co-authored-by: koshino17 <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d19e310 - Browse repository at this point
Copy the full SHA d19e310View commit details
Commits on Jun 27, 2024
-
Fix No model ready error in multi model serving (kserve#3758)
* Fix No model ready error in multi model serving - Fixes the regression introduced by kserve#3275 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Mark transformer model ready in init method Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for fbd8778 - Browse repository at this point
Copy the full SHA fbd8778View commit details
Commits on Jun 29, 2024
-
Initial implementation of Inference client (kserve#3401)
* Initial implementation of inference client Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Use Inference client for e2e tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Upgrade pytest-asyncio to 0.23.4 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix mutable object initialization in default parameters Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix graph e2e tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix pmml test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add explain, support dict response, use inference client for internal requests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix inference graph test and grpc headers Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Remove v1 datamodels Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Introduce protocol in client config Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Support inference graph Signed-off-by: Sivanantham Chinnaiyan <[email protected]> remove logging configs Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Update default timeout to 60 seconds Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add retry config for grpc client Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix infer model_name parameter Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add tests for graph endpoint Signed-off-by: Sivanantham Chinnaiyan <[email protected]> debug Signed-off-by: Sivanantham Chinnaiyan <[email protected]> fix http client param mismatch Signed-off-by: Sivanantham Chinnaiyan <[email protected]> skip graph test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> fix timeout in grpc client Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix url construction Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix explain Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * configure logger for e2e tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix grpc retry config Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Increase request timeout Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * configure logger for e2e tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix grpc retry config Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Increase request timeout Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Use fixtures for rest client Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 74c9ff4 - Browse repository at this point
Copy the full SHA 74c9ff4View commit details -
Fix logprobs for vLLM (kserve#3738)
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d4349ea - Browse repository at this point
Copy the full SHA d4349eaView commit details -
Fix model name not properly parsed by inference graph (kserve#3746)
* Fix model name not properly parsed by inference graph Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Handle single string arg with excess whitespace Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Handle duplicate arguments Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 68b04ef - Browse repository at this point
Copy the full SHA 68b04efView commit details -
pillow - Buffer Overflow (kserve#3598)
empty commit Signed-off-by: Spolti <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for ca691f7 - Browse repository at this point
Copy the full SHA ca691f7View commit details
Commits on Jul 11, 2024
-
Use add_generation_prompt while creating chat template (kserve#3775)
Use add_generation_rompt for chat template Signed-off-by: Dattu Sharma <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 164ea83 - Browse repository at this point
Copy the full SHA 164ea83View commit details
Commits on Jul 12, 2024
-
Deduplicate the names for the additional domain names (kserve#3773)
* Deduplicate the names for the additional domain names Signed-off-by: Vincent Hou <[email protected]> * Refactoring the functions Signed-off-by: Vincent Hou <[email protected]> --------- Signed-off-by: Vincent Hou <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1a70e11 - Browse repository at this point
Copy the full SHA 1a70e11View commit details
Commits on Jul 15, 2024
-
Make Virtual Service case-insensitive (kserve#3779)
virtual service case insensitive Signed-off-by: Andrews Arokiam <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 29731be - Browse repository at this point
Copy the full SHA 29731beView commit details
Commits on Jul 18, 2024
-
Install packages needed for vllm model load (kserve#3802)
* Install packages needed for model load Signed-off-by: Gavrish Prabhu <[email protected]> * make all apt get into a single line Signed-off-by: Gavrish Prabhu <[email protected]> --------- Signed-off-by: Gavrish Prabhu <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 580a0f3 - Browse repository at this point
Copy the full SHA 580a0f3View commit details
Commits on Jul 20, 2024
-
Make gRPC max message length configurable (kserve#3741)
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 31d2395 - Browse repository at this point
Copy the full SHA 31d2395View commit details -
Add readiness probe for MLServer and Increase memory for pmml in CI (k…
…serve#3789) * Add readiness probe for mlserver in CI Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Increase memory limit for pmml test to prevent OOMKilled and read timeout error Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5736731 - Browse repository at this point
Copy the full SHA 5736731View commit details
Commits on Jul 21, 2024
-
Several bug fixes for vLLM completion endpoint (kserve#3788)
* Fix logprobs Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix a scenario where stream completion fails if echo is true and logprobs is nil Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix a scenario where completion fails if the prompt is token_ids and echo is set to true Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Respect tokenizer revision Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add workaround for adding None to token_logprobs and top_logprobs Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for bb9641f - Browse repository at this point
Copy the full SHA bb9641fView commit details
Commits on Jul 25, 2024
-
Increase timeout to make unit test stable (kserve#3808)
agent watcher unit test is always flaky so increase timeout to make it stable Signed-off-by: jooho lee <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for de96bd3 - Browse repository at this point
Copy the full SHA de96bd3View commit details
Commits on Jul 27, 2024
-
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 32d3fc0 - Browse repository at this point
Copy the full SHA 32d3fc0View commit details -
Add tests for vLLM (kserve#3771)
* Add tests for vLLM Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * resolve comments Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Uncomment tests for fixed bugs Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3862175 - Browse repository at this point
Copy the full SHA 3862175View commit details -
Bump python to 3.11 for serving runtime images and Bump poetry to 1.8…
….3 (kserve#3812) * Upgrade serving runtime python version to 3.11 and debian to bookworm Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Upgrade poetry to 1.8.3 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Upgrade openjdk to 17 for pmml Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix 'AS' casing warning Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix pmml server Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f9e7d5a - Browse repository at this point
Copy the full SHA f9e7d5aView commit details
Commits on Aug 2, 2024
-
Bump vLLM to 0.5.3.post1 (kserve#3828)
* Bump vLLM to 0.5.3.post1 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Update makefile Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * approx probability comparison Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Set multiprocessing method to spawn Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 96fb00e - Browse repository at this point
Copy the full SHA 96fb00eView commit details
Commits on Aug 6, 2024
-
Refactor the ModelServer to let uvicorn handle multiple workers and u…
…se 'spawn' for mutiprocessing (kserve#3757) * Refactor model server to let uvicorn handle multiple workers - Refactored the ModelServer to let uvicorn handle multiple workers. This will remove the bottleneck of using 'fork' for multiprocessing - Make FastAPI app instance easily accessible across the project so that users can easily add middlewares and custom exception handlers for custom models. - Use uvloop eventpolicy Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add middleware example Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add e2e test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove nest_asyncio in art explainer Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove uvloop Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix python tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * revert art explainer Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove monkeypatch Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove redundant future exception logging Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c080da5 - Browse repository at this point
Copy the full SHA c080da5View commit details
Commits on Aug 10, 2024
-
Update golang for docs/Dockerfile to 1.21 (kserve#3761)
Signed-off-by: Spolti <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d73f1ca - Browse repository at this point
Copy the full SHA d73f1caView commit details
Commits on Aug 11, 2024
-
Make ray an optional dependency (kserve#3834)
* Make ray serve an optional dependency Signed-off-by: Curtis Maddalozzo <[email protected]> Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Unify the log configuration using kserve logger (kserve#3577) * Configure logging for serving runtimes Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add pyyaml dependency Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * black format Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * fix pyproject.toml Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * cleanup logger for e2e Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Modify logger format to include func name Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Log model download time. Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Allow disabling logger configuration and deprecate logger related arg in model server Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Resolve comments Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * pyyaml=^6.0.0 to fix build failure Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove logger related parameters from model server Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * import model_server Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix lint Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix linting Signed-off-by: Curtis Maddalozzo <[email protected]> Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase, minor fixes and add e2e test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Curtis Maddalozzo <[email protected]> Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Co-authored-by: Curtis Maddalozzo <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9632a52 - Browse repository at this point
Copy the full SHA 9632a52View commit details -
Update aif example (kserve#3765)
* Update aif example chore: Update aif explainer example. - Bump KServer to 0.13.0, it will bring some library updates, plus, it fixes a few security alerts in this example. - update the scikit-learn package name Signed-off-by: Spolti <[email protected]> * move the local instructions to the README Signed-off-by: Spolti <[email protected]> * empty commit Signed-off-by: Spolti <[email protected]> --------- Signed-off-by: Spolti <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a6a061e - Browse repository at this point
Copy the full SHA a6a061eView commit details
Commits on Aug 12, 2024
-
Use helm for quick installation (kserve#3813)
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f9c73c2 - Browse repository at this point
Copy the full SHA f9c73c2View commit details
Commits on Aug 17, 2024
-
Allow KServe to have its own local gateways for Serverless mode (kser…
…ve#3737) These changes introduce the possibility to configure KServe with its own Istio local gateway, to partially decouple KServe from the Knative local gateway. Typically, it is OK to re-use the already configured Knative local gateway for KServe uses (as long as configs do not conflict). However, there are cases where having a dedicated local gateway for KServe is beneficial. Just to give some examples: * To have the ability to use strict mTLS in Istio * To reduce some pressure on the Knative local gateway by having a dedicated gateway deployment (it still would hit Knative gateway, but only once, rather than twice) * To be able to configure TLS on cluster-local hostnames (Knative support is still experimental) To have a dedicated Gateway in KServe, similar configurations to Knative are need to be done. At the very least, and if not having a dedicated gateway deployment, a v1/Service and an Istio Gateway resource need to be created for KServe. Such resources would need to be configured in _localGateway_ and _localGatewayService_. KServe still needs to rely on Knative routing for the KSVCs it creates. Thus, after handling an incoming request and resolving its target, it needs to be forwarded to be handled by Knative. This is the reason for introducing a new `knativeLocalGatewayService` in the ConfigMap. The removed `ingressService` seems to be unused. Apparently, it became unused when the v1alpa1 API of the InferenceServices was deprecated and removed. Signed-off-by: Edgar Hernández <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 2fae6e2 - Browse repository at this point
Copy the full SHA 2fae6e2View commit details -
Add support for Azure DNS zone endpoints (kserve#3819)
* Add support for Azure DNS zone endpoints Signed-off-by: tjandy98 <[email protected]> * Add test cases for Azure Blob and File Share URI pattern matching Signed-off-by: tjandy98 <[email protected]> * flake8 Signed-off-by: tjandy98 <[email protected]> * black Signed-off-by: tjandy98 <[email protected]> --------- Signed-off-by: tjandy98 <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 61b5e7e - Browse repository at this point
Copy the full SHA 61b5e7eView commit details
Commits on Aug 18, 2024
-
Fix failed build for knativeLocalGatewayService (kserve#3866)
Signed-off-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a16f2d7 - Browse repository at this point
Copy the full SHA a16f2d7View commit details
Commits on Aug 19, 2024
-
Add logging request feature for vLLM backend (kserve#3849)
* Add logging request feature for vLLM Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add log request feature for huggingface Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9292c88 - Browse repository at this point
Copy the full SHA 9292c88View commit details
Commits on Aug 21, 2024
-
Bump vLLM to 0.5.4 (kserve#3874)
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 656f65e - Browse repository at this point
Copy the full SHA 656f65eView commit details -
Fix: Add workaround for snyk image scan failure (kserve#3880)
Fix snyk image scan failure Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 99fa478 - Browse repository at this point
Copy the full SHA 99fa478View commit details -
Fix trust_remote_code not working with huggingface backend (kserve#3879)
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 87cf2cd - Browse repository at this point
Copy the full SHA 87cf2cdView commit details -
Update KServe 2024-2025 Roadmap (kserve#3810)
* Update ROADMAP.md Signed-off-by: Dan Sun <[email protected]> * Add llm gateway Signed-off-by: Dan Sun <[email protected]> * Update ROADMAP.md Signed-off-by: Dan Sun <[email protected]> * Update ROADMAP.md Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for e82beb6 - Browse repository at this point
Copy the full SHA e82beb6View commit details
Commits on Aug 24, 2024
-
Configurable image pull secrets in Helm charts (kserve#3838)
* Image pull secrets in helm charts Signed-off-by: [email protected] <[email protected]> * fix README Signed-off-by: [email protected] <[email protected]> --------- Signed-off-by: [email protected] <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c10f548 - Browse repository at this point
Copy the full SHA c10f548View commit details -
Fix issue with rolling update behavior (kserve#3786)
* fix for the rolling update strategy Signed-off-by: Andrews Arokiam <[email protected]> * codegen fix Signed-off-by: Andrews Arokiam <[email protected]> * codegen fix Signed-off-by: Andrews Arokiam <[email protected]> * codegen fix Signed-off-by: Andrews Arokiam <[email protected]> * go mod fix Signed-off-by: Andrews Arokiam <[email protected]> * fix for deployment patch Signed-off-by: Andrews Arokiam <[email protected]> * test for rolling update strategy Signed-off-by: Andrews Arokiam <[email protected]> --------- Signed-off-by: Andrews Arokiam <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 7dd4ae2 - Browse repository at this point
Copy the full SHA 7dd4ae2View commit details -
Fix the 'tokens exceeding model limit' error response in vllm server (k…
…serve#3886) fix tokens exceeding model limit error type in vllm server Signed-off-by: [email protected] <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b89c8c2 - Browse repository at this point
Copy the full SHA b89c8c2View commit details -
Add support for binary data extension protocol and FP16 datatype (kse…
…rve#3685) * Add support for binary data extension protocol and FP16 datatype support Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add FP16 validation Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Refactor, support request outputs and add tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add back numpy as data support Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add binary data extension support for inference client Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Increase timeout for gprc client Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Apply suggestions from code review Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 69cdca5 - Browse repository at this point
Copy the full SHA 69cdca5View commit details -
Protobuf version upgrade 4.25.4 (kserve#3881)
* protobuf version upgrade 4.25.4 Signed-off-by: Andrews Arokiam <[email protected]> * resolved merge conflicts on poetry Signed-off-by: Andrews Arokiam <[email protected]> --------- Signed-off-by: Andrews Arokiam <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0a6302b - Browse repository at this point
Copy the full SHA 0a6302bView commit details -
Adds optional labels and annotations to the controller (kserve#3366)
* Adds optional labels to the controller Signed-off-by: Souhaib Guitouni <[email protected]> * Add annotation and podAnnotation Signed-off-by: Dan Sun <[email protected]> * Update helm doc Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Souhaib Guitouni <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Souhaib Guitouni <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a9e967f - Browse repository at this point
Copy the full SHA a9e967fView commit details -
Enable Server-Side Apply for Kustomize Overlays in Test Environment (k…
…serve#3877) Signed-off-by: jooho lee <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f17d068 - Browse repository at this point
Copy the full SHA f17d068View commit details -
bufix: update image_transformer.py to handle changes in input structu…
…re (kserve#3830) "data": { "EventName": "s3:ObjectCreated:Put", "Key": "mnist/0.png", "Records": [ { "eventVersion": "2.0", "eventSource": "minio:s3", "awsRegion": "", "eventTime": "2024-07-31T06:20:44.731Z", "eventName": "s3:ObjectCreated:Put", "userIdentity": { "principalId": "minio" }, "s3": { "s3SchemaVersion": "1.0", "configurationId": "Config", "bucket": { "name": "mnist", "ownerIdentity": { "principalId": "minio" }, "arn": "arn:aws:s3:::mnist" }, "object": { "key": "0.png", "size": 324, "eTag": "ebed21f6f77b0a64673a3c96b0c623ba", "contentType": "image/png", " ... Co-authored-by: jongs <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 6978ab9 - Browse repository at this point
Copy the full SHA 6978ab9View commit details -
support text embedding task in hugging face server (kserve#3743)
* support text embedding task in huggingfaceserver Signed-off-by: Kevin Mingtarja <[email protected]> * fix lint errors Signed-off-by: Kevin Mingtarja <[email protected]> * format code Signed-off-by: Kevin Mingtarja <[email protected]> * minor refactor Signed-off-by: Kevin Mingtarja <[email protected]> * add e2e test Signed-off-by: Kevin Mingtarja <[email protected]> * specify task explicitly in e2e test Signed-off-by: Kevin Mingtarja <[email protected]> * use approx() for comparing text embedding output Signed-off-by: Kevin Mingtarja <[email protected]> * fix output Signed-off-by: Kevin Mingtarja <[email protected]> * specify relative tolerance to approx Signed-off-by: Kevin Mingtarja <[email protected]> * fix e2e test output Signed-off-by: Kevin Mingtarja <[email protected]> * Update to use predict_isvc in test Signed-off-by: Dan Sun <[email protected]> * Fix output data assert Signed-off-by: Dan Sun <[email protected]> * Add abs toleration Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Kevin Mingtarja <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b7431e2 - Browse repository at this point
Copy the full SHA b7431e2View commit details
Commits on Aug 25, 2024
-
Rename max_length parameter to max_model_len to be in sync with vLLM (k…
…serve#3827) * Rename max_length parameter to max_model_len to be in sync with vLLM Signed-off-by: Dattu Sharma <[email protected]> * Add max_length back which also writes to max_model_len Signed-off-by: Dattu Sharma <[email protected]> * Fixup CLI helper message Signed-off-by: Dattu Sharma <[email protected]> * Also support max_model_len Signed-off-by: datta0 <[email protected]> * Cleanup and refactor to max_model_len Signed-off-by: datta0 <[email protected]> --------- Signed-off-by: Dattu Sharma <[email protected]> Signed-off-by: datta0 <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1bd82fb - Browse repository at this point
Copy the full SHA 1bd82fbView commit details -
[Upstream] - Update-istio version based on go version 1.21 (kserve#3825)
Updated istio versions based on go version 1.21 Signed-off-by: mholder6 <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 74591b4 - Browse repository at this point
Copy the full SHA 74591b4View commit details
Commits on Aug 26, 2024
-
Enrich isvc NotReady events for failed conditions (kserve#3303)
* Enrich isvc NotReady events for failed conditions
Configuration menu - View commit details
-
Copy full SHA for 9bca4ba - Browse repository at this point
Copy the full SHA 9bca4baView commit details -
adding metadata on requests (kserve#3635)
* adding metadata on requests Signed-off-by: Gabe Cemaj <[email protected]> * fix tests Signed-off-by: Gabe Cemaj <[email protected]> * switch to allow list system instead of prefix Signed-off-by: Gabe Cemaj <[email protected]> * encode meta as json and fix tests Signed-off-by: Gabe Cemaj <[email protected]> * expose options top level Signed-off-by: Gabe Cemaj <[email protected]> * expose options top level Signed-off-by: Gabe Cemaj <[email protected]> * Fix cloud event error handling Signed-off-by: Dan Sun <[email protected]> * import missing packages Signed-off-by: Dan Sun <[email protected]> * Rename to metadata headers Signed-off-by: Dan Sun <[email protected]> * Fix golint Signed-off-by: Dan Sun <[email protected]> * Generate for metadata header field Signed-off-by: Dan Sun <[email protected]> * fix conflicting go.sun Signed-off-by: Gabe Cemaj <[email protected]> --------- Signed-off-by: Gabe Cemaj <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Gabe Cemaj <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c05d760 - Browse repository at this point
Copy the full SHA c05d760View commit details
Commits on Aug 27, 2024
-
Publish 0.14.0-rc0 release (kserve#3867)
* Update 0.14.0-rc0 release Signed-off-by: Dan Sun <[email protected]> * Add security context Signed-off-by: Dan Sun <[email protected]> * Update helm doc Signed-off-by: Dan Sun <[email protected]> * Update crd Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for ab251d6 - Browse repository at this point
Copy the full SHA ab251d6View commit details
Commits on Aug 28, 2024
-
Use API token for publishing package to PyPI (kserve#3896)
Use API tokens for publishing package PyPI Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for ddc56ca - Browse repository at this point
Copy the full SHA ddc56caView commit details -
Fix sdlc broken when kserve installed using helm (kserve#3890)
* Fix sdlc Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add option for only installing deps in quick install Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Increase cpu & memory request for controller Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix 0.14.0-rc0 release Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d2f90dc - Browse repository at this point
Copy the full SHA d2f90dcView commit details -
Add Security Context and Resources to RBAC Proxy (kserve#3898)
* add security context and resources to rbac proxy container Signed-off-by: Gavin Li <[email protected]> * feedback Signed-off-by: Gavin Li <[email protected]> --------- Signed-off-by: Gavin Li <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3a347c5 - Browse repository at this point
Copy the full SHA 3a347c5View commit details
Commits on Aug 30, 2024
-
Remove unwanted cluster scope secret permissions (kserve#3893)
Remove unwanted secret permissions Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for e879a39 - Browse repository at this point
Copy the full SHA e879a39View commit details
Commits on Sep 3, 2024
-
bump to vllm 0.5.5 (kserve#3911)
* bump to vllm 0.5.5 Signed-off-by: Lize Cai <[email protected]> * fix parse_and_batch_prompt import Signed-off-by: Lize Cai <[email protected]> --------- Signed-off-by: Lize Cai <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 67373ad - Browse repository at this point
Copy the full SHA 67373adView commit details
Commits on Sep 5, 2024
-
pin gosec to 2.20.0 (kserve#3921)
Signed-off-by: Jin Dong <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c80dae9 - Browse repository at this point
Copy the full SHA c80dae9View commit details
Commits on Sep 7, 2024
-
add a new doc 'common issues and solutions' (kserve#3878)
Signed-off-by: jooho lee <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c7e99d8 - Browse repository at this point
Copy the full SHA c7e99d8View commit details -
Implement health endpoint for vLLM backend (kserve#3850)
* Implement health endpoint for vLLM backend Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add openai health endpoint Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 305d393 - Browse repository at this point
Copy the full SHA 305d393View commit details -
Add security best practices for inferenceservice, inferencegraph, ser…
…vingruntimes (kserve#3917) * Add security context for runtimes Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add security context for runtimes helm Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add security best practices for ig Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Disable service account secret auto mount Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * fmt Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Update test cases Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add user id for tensorflow, triton and torchserve Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f54e27a - Browse repository at this point
Copy the full SHA f54e27aView commit details
Commits on Sep 11, 2024
-
* Bump go 1.22 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Update generate scripts Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix breaking changes Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix deepcopy, client generation and bump controller-gen to 0.16.2 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * generate files Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Bump go version for CI and dockerfiles Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Bump go version for qpext Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Use slices.Contains Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove clientv1alpha1 dir Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix lint Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * go mod tidy Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * set min go version to 1.22.7 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for e7353d1 - Browse repository at this point
Copy the full SHA e7353d1View commit details
Commits on Sep 13, 2024
-
bump to vllm 0.6.0 (kserve#3934)
* bump to vllm 0.6.0 Signed-off-by: yxia216 <[email protected]> * lock Signed-off-by: yxia216 <[email protected]> --------- Signed-off-by: yxia216 <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d779344 - Browse repository at this point
Copy the full SHA d779344View commit details
Commits on Sep 15, 2024
-
Set the volume mount's readonly annotation based on the ISVC annotati…
…on (kserve#3885) * Set the volume mount's readonly annotation based on the ISVC annotation Signed-off-by: Hannah DeFazio <[email protected]> * Add test case where readonly is unset, check values Signed-off-by: Hannah DeFazio <[email protected]> * Use StorageInitializerVolumeName constant Signed-off-by: Hannah DeFazio <[email protected]> * Set the readonly value for the storage-initializer Signed-off-by: Hannah DeFazio <[email protected]> * Add tests for direct pvc volume mount use case Signed-off-by: Hannah DeFazio <[email protected]> --------- Signed-off-by: Hannah DeFazio <[email protected]> Co-authored-by: Spolti <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for fef6c7a - Browse repository at this point
Copy the full SHA fef6c7aView commit details
Commits on Sep 16, 2024
-
mount /dev/shm volume to huggingfaceserver (kserve#3910)
* add /dev/shm volume to hfserver. Signed-off-by: Lize Cai <[email protected]> * update helm chart docs Signed-off-by: Lize Cai <[email protected]> * add flag to enable devshm. Signed-off-by: Lize Cai <[email protected]> --------- Signed-off-by: Lize Cai <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 391c8c6 - Browse repository at this point
Copy the full SHA 391c8c6View commit details -
Fix permission error in snyk scan (kserve#3889)
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9ec6842 - Browse repository at this point
Copy the full SHA 9ec6842View commit details
Commits on Sep 17, 2024
-
Cluster Local Model CR (kserve#3839)
* new model cache cr Signed-off-by: Jin Dong <[email protected]> * update crd Signed-off-by: Jin Dong <[email protected]> * Fix genereted python tests Signed-off-by: Jin Dong <[email protected]> * Fix test failure Signed-off-by: Jin Dong <[email protected]> * Make nodegroup a list field in model cache cr Signed-off-by: Jin Dong <[email protected]> * fix lint Signed-off-by: Jin Dong <[email protected]> * minor updates to model cache cr Signed-off-by: Jin Dong <[email protected]> * Add usecase field to cluster storage container Signed-off-by: Jin Dong <[email protected]> * Fix test failures Signed-off-by: Jin Dong <[email protected]> * Change variable name Signed-off-by: Jin Dong <[email protected]> * Fix lint Signed-off-by: Jin Dong <[email protected]> * Fix default storage container cr Signed-off-by: Jin Dong <[email protected]> * fix defualt.yaml Signed-off-by: Jin Dong <[email protected]> * Remove storagelimit field from node group Signed-off-by: Jin Dong <[email protected]> * Fix python code Signed-off-by: Jin Dong <[email protected]> * Change some fields Signed-off-by: Jin Dong <[email protected]> * Rename crd Signed-off-by: Jin Dong <[email protected]> * Fix lint error in python test files Signed-off-by: Jin Dong <[email protected]> * Rename CR Signed-off-by: Jin Dong <[email protected]> * Add status to local model node group Signed-off-by: Jin Dong <[email protected]> * Add missing node status Signed-off-by: Jin Dong <[email protected]> * Remove files related to ClusterLocalNodeGroup Signed-off-by: Jin Dong <[email protected]> * Add default value for workload type Signed-off-by: Jin Dong <[email protected]> * Fix StorageContainerSpec WorkloadType default value Signed-off-by: Jin Dong <[email protected]> * nodegroups -> nodegroup Signed-off-by: Jin Dong <[email protected]> * Add comments Signed-off-by: Jin Dong <[email protected]> * Add back storageLimit Signed-off-by: Jin Dong <[email protected]> * Update charts/kserve-crd/templates/serving.kserve.io_clusterstoragecontainers.yaml Signed-off-by: Jin Dong <[email protected]> --------- Signed-off-by: Jin Dong <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for acfc887 - Browse repository at this point
Copy the full SHA acfc887View commit details -
added http headers to inbound request (kserve#3895)
Signed-off-by: Andrews Arokiam <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 90a5e12 - Browse repository at this point
Copy the full SHA 90a5e12View commit details
Commits on Sep 19, 2024
-
Add prow-github-action (kserve#3888)
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a2f612c - Browse repository at this point
Copy the full SHA a2f612cView commit details -
Add TLS support for Inference Loggers (kserve#3863)
* Initial TLS bundle support Signed-off-by: Rui Vieira <[email protected]> * Move CA bundle volume and mount path to constants Signed-off-by: Rui Vieira <[email protected]> * Rename loggerConfigTls to loggerTLSConfig Signed-off-by: Rui Vieira <[email protected]> * Rename TlsCertName to CertName Signed-off-by: Rui Vieira <[email protected]> * Add Logger option skip TLS verification Also: - Fixed incorrect cert name argument name (`--log-tls-cert` is now `--logger-ca-cert-file`) Signed-off-by: Rui Vieira <[email protected]> * Correct case Change CABundle and CACertfile to caBundle and caCertFile. Signed-off-by: Rui Vieira <[email protected]> * Fix linting errors - Restore newline at the end of charts/kserve-resources/README.md - Remove import of github.com/kserve/kserve/pkg/constants from `worker.go` and replace with local constant for the CA mount path - `InsecureSkipVerify: logReq.TlsSkipVerify` was triggering gosec's G402 with "potential 'true' for `logReq.TlsSkipVerify`". Since this value is allowed to be true, this specific line was excluded from the checks and an explanatory comment added - Remove import of `k8s.io/utils/ptr` and replace with a pointer `&optionalVolume` Signed-off-by: Rui Vieira <[email protected]> * Fix import sort order on `worker.go` Signed-off-by: Rui Vieira <[email protected]> --------- Signed-off-by: Rui Vieira <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 02293ac - Browse repository at this point
Copy the full SHA 02293acView commit details -
Fix explainer endpoint not working with path based routing (kserve#3257)
* Fix explainer not working with path based routing Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add explainer e2e test for path based routing Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 536fc9b - Browse repository at this point
Copy the full SHA 536fc9bView commit details -
Fix ingress configuration for path based routing and update go mod (k…
…serve#3944) Fix broken ingress test and update go mod Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d97baa6 - Browse repository at this point
Copy the full SHA d97baa6View commit details
Commits on Sep 23, 2024
-
Add HostIPC field to ServingRuntimePodSpec (kserve#3943)
* Add HostIPC to ServingRuntimePodSpec Signed-off-by: Jin Dong <[email protected]> * Add hostIPC: true to huggingface server yaml Signed-off-by: Jin Dong <[email protected]> --------- Signed-off-by: Jin Dong <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a0a4d00 - Browse repository at this point
Copy the full SHA a0a4d00View commit details -
remove conversion wehbook part from self-signed-ca.sh (kserve#3941)
Signed-off-by: jooho lee <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for bf6fae8 - Browse repository at this point
Copy the full SHA bf6fae8View commit details -
update fluid kserve sample to use huggingface servingruntime (kserve#…
…3907) * update fluid kserve demo to use huggingface servingruntime and other model. Signed-off-by: Lize Cai <[email protected]> * fix lint Signed-off-by: Lize Cai <[email protected]> * explicitly set custom servingruntime, update devshm. Signed-off-by: Lize Cai <[email protected]> * update the args in custom kserve hfserver Signed-off-by: Lize Cai <[email protected]> * address comments Signed-off-by: Lize Cai <[email protected]> * add return of line Signed-off-by: Lize Cai <[email protected]> --------- Signed-off-by: Lize Cai <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 2376eeb - Browse repository at this point
Copy the full SHA 2376eebView commit details
Commits on Sep 24, 2024
-
bump to vLLM0.6.1post2 (kserve#3948)
* bump to vLLM0.6.1post2 Signed-off-by: yxia216 <[email protected]> * update kserve/poetry Signed-off-by: yxia216 <[email protected]> --------- Signed-off-by: yxia216 <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 80982c2 - Browse repository at this point
Copy the full SHA 80982c2View commit details
Commits on Sep 26, 2024
-
Add NodeDownloadPending status to ClusterLocalModel (kserve#3955)
Add NodeDownloadPending status Signed-off-by: Jin Dong <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 12c4039 - Browse repository at this point
Copy the full SHA 12c4039View commit details
Commits on Sep 27, 2024
-
add tags to rest server timing logs to differentiate cpu and wall time (
kserve#3954) Signed-off-by: Gregory Keith <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9380e99 - Browse repository at this point
Copy the full SHA 9380e99View commit details
Commits on Sep 29, 2024
-
Implement Huggingface model download in storage initializer (kserve#3584
) * initial commit for hugging face model download and load Signed-off-by: Andrews Arokiam <[email protected]> * bug fix on storage initializer Signed-off-by: Andrews Arokiam <[email protected]> * added hf_token and unittests Signed-off-by: Andrews Arokiam <[email protected]> * separate hf-storage-initializer image to reduce image size Signed-off-by: Andrews Arokiam <[email protected]> * review comment changes Signed-off-by: Andrews Arokiam <[email protected]> * snapshot download Signed-off-by: Andrews Arokiam <[email protected]> * use existing image for storage initializer Signed-off-by: Andrews Arokiam <[email protected]> * resolved merge conflicts Signed-off-by: Andrews Arokiam <[email protected]> * added hf storage uri validation Signed-off-by: Andrews Arokiam <[email protected]> * resolved merge conflicts Signed-off-by: Andrews Arokiam <[email protected]> --------- Signed-off-by: Andrews Arokiam <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1e28d4f - Browse repository at this point
Copy the full SHA 1e28d4fView commit details
Commits on Sep 30, 2024
-
Update OWNERS file (kserve#3966)
Signed-off-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b344a0a - Browse repository at this point
Copy the full SHA b344a0aView commit details
Commits on Oct 2, 2024
-
Cluster local model controller (kserve#3860)
* Consolidate into one commit Signed-off-by: Jin Dong <[email protected]> * Fix configmap format Signed-off-by: Jin Dong <[email protected]> * Fix configmap Signed-off-by: Jin Dong <[email protected]> * Log configmap read error Signed-off-by: Jin Dong <[email protected]> * fix naming Signed-off-by: Dan Sun <[email protected]> * Update comments Signed-off-by: Jin Dong <[email protected]> * Add enabled flag to configmap and avoid cluster resource check in isvc defaulter Signed-off-by: Jin Dong <[email protected]> * move client into the local model block Signed-off-by: Dan Sun <[email protected]> * Fix lint Signed-off-by: Jin Dong <[email protected]> --------- Signed-off-by: Jin Dong <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 20b6e29 - Browse repository at this point
Copy the full SHA 20b6e29View commit details
Commits on Oct 3, 2024
-
Prepare for 0.14.0-rc1release and automate sync process (kserve#3970)
* Sync helm chart with kustomize Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Update manifest generation script to sync helm charts Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Make kserve-addressable-resolver role optional Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Prepare for 0.14.0-rc1 release Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Update release process Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Comment out crd sync script in make Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix helm template syntax Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for a50fdc9 - Browse repository at this point
Copy the full SHA a50fdc9View commit details -
add a new API for multi-node/multi-gpu (kserve#3871)
* add a new API for multi-node/multi-gpu Signed-off-by: jooho lee <[email protected]> * fix gitaction Signed-off-by: jooho lee <[email protected]> * fix merging conflict Signed-off-by: jooho lee <[email protected]> * fix gitaction fail Signed-off-by: jooho lee <[email protected]> * regenerate codegen/manifests Signed-off-by: jooho lee <[email protected]> * Apply suggestions from code review Co-authored-by: Dan Sun <[email protected]> Signed-off-by: Jooho Lee <[email protected]> * remove unnecessary comment Signed-off-by: jooho lee <[email protected]> * change the type of workerSpec in isvc to PodSpec Signed-off-by: jooho lee <[email protected]> * update controller-gen version Signed-off-by: jooho lee <[email protected]> * remove replicas from workerSpec Signed-off-by: jooho lee <[email protected]> * fix conflict merging Signed-off-by: jooho lee <[email protected]> * added size(replicas) for workerSpec again Signed-off-by: jooho lee <[email protected]> * add WorkerSpec to inferenceService Signed-off-by: jooho lee <[email protected]> * fix go linter Signed-off-by: jooho lee <[email protected]> --------- Signed-off-by: jooho lee <[email protected]> Signed-off-by: Jooho Lee <[email protected]> Signed-off-by: Jooho Lee <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d5ed018 - Browse repository at this point
Copy the full SHA d5ed018View commit details
Commits on Oct 5, 2024
-
Fix update-openapigen.sh that can be executed from kserve dir (kserve…
…#3924) * fix openapigen.sh that can be executed from kserve dir Signed-off-by: jooho lee <[email protected]> * regenerate codegen/manifests Signed-off-by: jooho lee <[email protected]> * Update go.sum Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: jooho lee <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for ecc3729 - Browse repository at this point
Copy the full SHA ecc3729View commit details
Commits on Oct 6, 2024
-
Add python 3.12 support and remove python 3.8 support (kserve#3645)
* Support python 3.12 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Update dependencies Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Update deps to support 3.12 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove python 3.8 support Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove skip for infer client test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix port forward Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix sklearn pandas dep Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * skip pydantic v1 test for py 3.12 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add setuptools dep for pmml Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix lgb Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Include setuptools for paddle Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Include setuptools for huggingface Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 24d882b - Browse repository at this point
Copy the full SHA 24d882bView commit details
Commits on Oct 9, 2024
-
Fix openssl vulnerability CWE-1395 (kserve#3975)
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 17725be - Browse repository at this point
Copy the full SHA 17725beView commit details -
Fix Kubernetes Doc Links (kserve#3670)
* Bump version to 0.13.0-rc0 (kserve#3665) Signed-off-by: Curtis Maddalozzo <[email protected]> Signed-off-by: jordanyono <[email protected]> * fixing docs Signed-off-by: jordanyono <[email protected]> * fix spelling mistake Signed-off-by: jordanyono <[email protected]> --------- Signed-off-by: Curtis Maddalozzo <[email protected]> Signed-off-by: jordanyono <[email protected]> Co-authored-by: Curtis Maddalozzo <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d7d79e1 - Browse repository at this point
Copy the full SHA d7d79e1View commit details -
Fix kserve local testing env (kserve#3981)
* Fix local testing Signed-off-by: Dan Sun <[email protected]> * Fix codegen Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 70e12ff - Browse repository at this point
Copy the full SHA 70e12ffView commit details
Commits on Oct 10, 2024
-
Fix streaming response not working properly with logger (kserve#3847)
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for dec8c14 - Browse repository at this point
Copy the full SHA dec8c14View commit details -
Add a flag for automount serviceaccount token (kserve#3979)
* Add a flag for automount serviceaccount Signed-off-by: Jin Dong <[email protected]> * Set default to false Signed-off-by: Jin Dong <[email protected]> * Default to true Signed-off-by: Jin Dong <[email protected]> * Fix test error Signed-off-by: Jin Dong <[email protected]> * Update openapi generated.go Signed-off-by: Jin Dong <[email protected]> * Fix python lint Signed-off-by: Jin Dong <[email protected]> * Fix config loading Signed-off-by: Jin Dong <[email protected]> --------- Signed-off-by: Jin Dong <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 6ff2a2f - Browse repository at this point
Copy the full SHA 6ff2a2fView commit details
Commits on Oct 11, 2024
-
Do not set security context on the storage initializer from user cont…
…ainer (kserve#3985) * Do not set security context on the storage initializer from user container Signed-off-by: Jin Dong <[email protected]> * Add securityContext to the default storage container in the helm chart Signed-off-by: Jin Dong <[email protected]> --------- Signed-off-by: Jin Dong <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d2695a5 - Browse repository at this point
Copy the full SHA d2695a5View commit details -
Modelcar race condition mitigation with an init container (kserve#3932)
This adds the model container as an init-container to mitigate a race condition that would happen if the model container is not present on the cluster-node. The race condition happens if the cluster is able to fetch and start the runtime container before the modelcar is fetched. This would lead to the runtime to terminate with error. By configuring the model container as an init-container the runtime won't start until the modelcar is fetched. Although there is still the risk of a race condition when the cluster schedules the runtime container first, the pod should stabilize after a few restarts of the runtime container and should either prevent a CrashLoopBackOff event on the pod, or the crash event would finish quickly. This improves compatibility with the runtimes which can now stay agnostic to the modelcar implementation, until better techniques (like native sidecars, and oci volume mounts) become mature. Signed-off-by: Edgar Hernández <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 7796c44 - Browse repository at this point
Copy the full SHA 7796c44View commit details
Commits on Oct 12, 2024
-
Fix: Headers passing for v1/v2 endpoints (kserve#3669)
* Initial commit for headers passing issue Signed-off-by: Andrews Arokiam <[email protected]> * modifying the e2e test for rebase conflict Signed-off-by: Andrews Arokiam <[email protected]> * bug fix on unittest Signed-off-by: Andrews Arokiam <[email protected]> * review changes Signed-off-by: Andrews Arokiam <[email protected]> * fix for test failure Signed-off-by: Andrews Arokiam <[email protected]> * bug fix on e2e test Signed-off-by: Andrews Arokiam <[email protected]> * overridding the entrypoint of custom model images Signed-off-by: Andrews Arokiam <[email protected]> * custom response header Signed-off-by: Andrews Arokiam <[email protected]> * fix for unittest failure Signed-off-by: Andrews Arokiam <[email protected]> * added custom response headers in post process Signed-off-by: Andrews Arokiam <[email protected]> * added predict time latency in example response header Signed-off-by: Andrews Arokiam <[email protected]> * fix OOM --------- Signed-off-by: Andrews Arokiam <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for fc72a2b - Browse repository at this point
Copy the full SHA fc72a2bView commit details
Commits on Oct 14, 2024
-
Torchserve security update (kserve#3774)
* security update Signed-off-by: udai <[email protected]> * adding sign off Signed-off-by: udai <[email protected]> --------- Signed-off-by: udai <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for dbceef3 - Browse repository at this point
Copy the full SHA dbceef3View commit details -
Pin ubuntu 22.04 for minikube setup action (kserve#3994)
Signed-off-by: Jin Dong <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 82234c0 - Browse repository at this point
Copy the full SHA 82234c0View commit details -
KServe 0.14 Release (kserve#3988)
* temp commit Signed-off-by: Jin Dong <[email protected]> * python-release.sh Signed-off-by: Jin Dong <[email protected]> --------- Signed-off-by: Jin Dong <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 7e43642 - Browse repository at this point
Copy the full SHA 7e43642View commit details
Commits on Oct 25, 2024
-
Merge remote-tracking branch 'kserve/release-0.14' into j9436-kserve0…
…14-upgrade Code sync with upstream, up to v0.14. Signed-off-by: Edgar Hernández <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for befa4d6 - Browse repository at this point
Copy the full SHA befa4d6View commit details -
Fix poetry lockfiles after v0.14.0 code sync
Signed-off-by: Edgar Hernández <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 53de246 - Browse repository at this point
Copy the full SHA 53de246View commit details -
Restore compatibility with go v1.21
Signed-off-by: Edgar Hernández <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b7a868f - Browse repository at this point
Copy the full SHA b7a868fView commit details
Commits on Oct 28, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c9453bf - Browse repository at this point
Copy the full SHA c9453bfView commit details