[DEV TRACKER] Model Serving Requirements for Q4 #92
Labels: tracker (non-completable ticket; used for tracking work at a high level)
Comments
Closing, as we are now tracking work on Jira.
/close
@israel-hdez: Closing this issue.
From Req Document
Req 1: Model Storage
Users must be able to deploy a model stored in (d.) AWS
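A minimal sketch of what Req 1(d) could look like with KServe, assuming an InferenceService created through the kubernetes Python client. The bucket, namespace, and service-account names are placeholders, and the S3 credentials are assumed to be wired up separately (e.g., a secret/data connection bound to the service account); this is not the ODH dashboard flow, just an illustration of the underlying resource.

```python
# Hypothetical sketch: deploy a model stored in S3 by creating a KServe
# InferenceService. All names, namespaces, and paths are placeholders.
from kubernetes import client, config

config.load_kube_config()  # use load_incluster_config() when running in-cluster

inference_service = {
    "apiVersion": "serving.kserve.io/v1beta1",
    "kind": "InferenceService",
    "metadata": {"name": "example-sklearn", "namespace": "model-serving"},
    "spec": {
        "predictor": {
            "serviceAccountName": "s3-access-sa",  # SA carrying the S3 credentials
            "model": {
                "modelFormat": {"name": "sklearn"},
                "storageUri": "s3://example-bucket/models/iris/",
            },
        }
    },
}

client.CustomObjectsApi().create_namespaced_custom_object(
    group="serving.kserve.io",
    version="v1beta1",
    namespace="model-serving",
    plural="inferenceservices",
    body=inference_service,
)
```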
Req 2: Model Formats - Estimate: RHODS 1.36
Users must be able to serve models based on a variety of frameworks:
a. OOTB support for TensorFlow, PyTorch, and scikit-learn models
d. Users must be able to serve models from Hugging Face without any additional conversions or configurations
Req 7: Deployment Rollouts
a. Ability to deploy new model versions and route a percentage of traffic to the new version (canary rollout; see the sketch below this list)
b. Ability to do A/B testing on different model versions
c. Ability to test deployed endpoint directly in the product UI
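A hedged sketch of Req 7(a), assuming KServe's canaryTrafficPercent field and the same hypothetical InferenceService as in the Req 1 sketch; the percentage, names, and storage path are illustrative only.

```python
# Hypothetical canary rollout: point the existing InferenceService at a new model
# version and send only 10% of traffic to it. Names and paths are placeholders.
from kubernetes import client, config

config.load_kube_config()

canary_patch = {
    "spec": {
        "predictor": {
            "canaryTrafficPercent": 10,  # share of traffic for the new revision
            "model": {
                "modelFormat": {"name": "sklearn"},
                "storageUri": "s3://example-bucket/models/iris/v2/",  # new version
            },
        }
    }
}

client.CustomObjectsApi().patch_namespaced_custom_object(
    group="serving.kserve.io",
    version="v1beta1",
    namespace="model-serving",
    plural="inferenceservices",
    name="example-sklearn",
    body=canary_patch,
)
```

Promoting the canary would then be a second patch that raises canaryTrafficPercent to 100 (or removes it).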
Req 10: OOTB Deployed model performance metrics
Users must be able to access performance metrics for all deployed models:
e. CPU/GPU/memory utilization
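A hedged sketch for Req 10(e): reading CPU utilization for a model's pods from Prometheus. The Prometheus endpoint and the pod label selector are placeholders; in ODH/RHODS these numbers would normally be surfaced through the cluster monitoring stack and the dashboard rather than queried by hand.

```python
# Hypothetical sketch: query per-pod CPU usage for a deployed model from Prometheus.
# Endpoint, namespace, and pod-name pattern are placeholders.
import requests

PROMETHEUS_URL = "http://prometheus.example.svc:9090"  # placeholder endpoint

query = (
    'sum(rate(container_cpu_usage_seconds_total{'
    'namespace="model-serving", pod=~"example-sklearn-.*"}[5m])) by (pod)'
)

resp = requests.get(
    f"{PROMETHEUS_URL}/api/v1/query", params={"query": query}, timeout=10
)
resp.raise_for_status()
for series in resp.json()["data"]["result"]:
    print(series["metric"]["pod"], "cores:", series["value"][1])
```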
Req 14: Model Serving Runtimes
b. OOTB support for Caikit/TGIS
c. OOTB support for NVIDIA Triton Inference Server
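For Req 14, OOTB runtimes are typically registered as KServe ServingRuntime custom resources. Below is a hedged sketch of what a Triton ServingRuntime could look like, expressed as a Python dict; the image tag, supported formats, and arguments are placeholders and may differ from the runtime definitions actually shipped with ODH/RHODS.

```python
# Hypothetical ServingRuntime registering Triton as an available runtime.
# Image tag and args are placeholders.
triton_runtime = {
    "apiVersion": "serving.kserve.io/v1alpha1",
    "kind": "ServingRuntime",
    "metadata": {"name": "triton-example", "namespace": "model-serving"},
    "spec": {
        "supportedModelFormats": [
            {"name": "tensorflow", "version": "2", "autoSelect": True},
            {"name": "onnx", "version": "1", "autoSelect": True},
        ],
        "containers": [
            {
                "name": "kserve-container",
                "image": "nvcr.io/nvidia/tritonserver:<version>",  # placeholder tag
                "args": ["tritonserver", "--model-store=/mnt/models"],
            }
        ],
    },
}
```

It would be applied the same way as the InferenceService sketch under Req 1 (CustomObjectsApi, plural "servingruntimes").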
Req 15: Remote Deployment
a. Support deploying models to remote locations, i.e., locations other than the cluster where model deployment is initiated
Req 17: Support options for KServe and/or ModelMesh - Estimate: RHODS 1.36
Support KServe (one model per pod) and/or ModelMesh (multiple models per pod):
a. RHODS admins should be able to configure whether they want to use KServe (single-model serving plus additional functionality), ModelMesh, or both
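A hedged sketch for Req 17: once both stacks are installed, the serving.kserve.io/deploymentMode annotation on an InferenceService is one way to select which controller handles it (ModelMesh additionally expects the target namespace to be enabled for it, e.g., via the modelmesh-enabled label). Resource names and the storage path are placeholders, and how ODH/RHODS exposes this choice to admins may differ.

```python
# Hypothetical InferenceService that opts into ModelMesh (multiple models per pod).
# Omitting the annotation leaves the resource to KServe (one model per pod).
# All names and paths are placeholders.
inference_service = {
    "apiVersion": "serving.kserve.io/v1beta1",
    "kind": "InferenceService",
    "metadata": {
        "name": "example-modelmesh-model",
        "namespace": "model-serving",
        "annotations": {"serving.kserve.io/deploymentMode": "ModelMesh"},
    },
    "spec": {
        "predictor": {
            "model": {
                "modelFormat": {"name": "sklearn"},
                "storageUri": "s3://example-bucket/models/iris/",
            }
        }
    },
}
```

It would be created with the same CustomObjectsApi call as the Req 1 sketch.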
Other planned features
Other planned enhancements
Other planned bug fixes
Resources
Model Serving Phase 2 Requirements doc
Model Serving Phase 2 Requirement Mapping spreadsheet