Changelog

0.32.0-alpha (2024-10-23)

Features

run: add requester id in list model run response (#701) (8c22eec)

0.31.0-alpha (2024-10-22)

Features

run: run logging data list by requester API (#695) (b118026)
service: standardize the tag naming convention (#700) (e08be0d)

Bug Fixes

ray: fix upscale deployment failure (#698) (96563c9)
run: fix run logging update function (#699) (a753fe4)
run: not return minio error in list model run (#696) (cbeea05)

0.30.2-alpha (2024-10-08)

Bug Fixes

minio: fix nil logger causing panic (ef9645e)
test: switch to minimock and add tests (#687) (141da9d)

0.30.1-alpha (2024-09-13)

Bug Fixes

run: fix Runner missing issue (#689) (22f2aa0)

0.30.0-alpha (2024-09-10)

Features

task: support new AI Tasks spec (#655) (e34d4e1)

0.29.0-alpha (2024-08-29)

Features

minio: added minio connection (#651) (d17770f)
minio: added minio upload file bytes function (#656) (1f7b403)
minio: upgraded to minio v7 and return url and file size (#654) (4c34bf4)
model: support model statistics in model message (#660) (b460778)
model: support num_of_runs in list filter (#661) (b161512)
ray: add separate state for scaling (#659) (f519c82)
ray: support triggering upscale (#665) (441af13)
redis: replace I/O data retrieval from redis with minio (#667) (fc46ef8)
run: fix run logging owner view permission (#668) (5c07c9a)

Bug Fixes

ray: fix upscale request timeout (#675) (d24eea5)
redis: fix input value overwrite (#658) (a383380)
workflow: fix duplicate key when activity retry (#669) (97d9cd2)
workflow: fix state check (#673) (0514cae)

0.28.0-alpha (2024-07-31)

Features

mgmt: use explicit user_id and organization_id in mgmt request (#638) (4af37e6)
model: implement namespace endpoints (#639) (8c9acac)
pkg: implement fuzze search for namespace and model ID (#642) (30b11a8)
tag: allow custom tags for model (#643) (9c49db6)
version: embed version names in model struct (#645) (88f107b)

Bug Fixes

repository: fix missing tag query (#644) (f1f3229)
transpiler: fix expression (#649) (4680c92)
worker: added tests for trigger model activity (#630) (abe562e)

0.27.0-alpha (2024-07-16)

Features

credit: INS-5058 added check and collect for model trigger (#619) (5d55b5b)
credit: refactored check and collect for model trigger (#629) (0a14f9e)
ray: sync deployment configs when starting up (#631) (4a2fc8a)
service: check trigger permissions when impersonating an org (#635) (427fc45)

Bug Fixes

worker: unit test for TriggerModelActivity (#624) (dec7a60)

0.26.0-alpha (2024-07-02)

Features

model: support model version deletion (#616) (2dca40b)
repository: support case-insensitive search models (#621) (26c76b2)

Bug Fixes

redis: fix misconfigured ttl (f5da795)
worker: fix mishandled workflow not found (0189dd8)

0.25.1-alpha (2024-06-20)

Bug Fixes

schema: use camelCase for schema fields (5629b6a)

0.25.0-alpha (2024-06-18)

Features

endpoints: use camelCase for filter query string (#603) (23955e9)
handler: use camelCase for HTTP body (#599) (70f6d9a)
model: support model tag (#600) (ef87bc9)

0.24.0-alpha (2024-06-06)

⚠ BREAKING CHANGES

model: adopt containerized model serving (#542)

Features

handler: implement get latest operation (#589) (33d2395)
handler: support listing available regions for model deployment (#561) (52c2172)
handler: support model profile image (#566) (0c8dbba)
model: add permission field in model object (#576) (2d36a58)
model: add task schema in model struct (#578) (647069d)
model: adopt containerized model serving (#542) (3c80f39)
model: embed sample input/output in model proto message (#558) (5fba538)
model: support latest model version trigger (#580) (47cb36c)
model: support resource spec in model definition (#557) (fee6e4b)
model: support search/filter with list endpoints (#559) (7b17393)
model: support watch latest model and order_by for list endpoints (#586) (1a5e48c)
prediction: implement sync/async prediction records (#555) (8d58eda)
ray: support containerized model deployment (#529) (4dcab05)
ray: support custom accelerator type (#547) (f0cc0d7)

Bug Fixes

acl: fix wrong type name (#560) (89d09a5)
dockerfile: update deploy config yaml path (#590) (ee369e0)
model: fix missing package in test models (#552) (a28a21b)
ray: check CDI availability for model container (#538) (28bad42)
server: add missing message size option (#597) (d0a0aac)
service: fix list model version pagination (#569) (d8fb04a)
service: fix list model version return list size (#556) (9b69f9c)

0.23.0-alpha (2024-03-09)

Features

pkg: use mgmtPB.Owner to embed the owner information (#523) (37d5708)

Bug Fixes

handler,ray: fix reconciliation model status and namespace (#525) (62a30b6)
redis: delete redis key when errored (#526) (bb4e18d)

0.22.0-alpha (2024-02-20)

⚠ BREAKING CHANGES

triton: deprecate triton inference server (#512)

Features

acl,org: adopt ACL and add organization endpoints (#504) (13a1650)

Bug Fixes

cmd,pkg: refactor codebase to align with golanci-linter checks (#506) (b213812)
handler: fix multipart request (352a4ae)
pkg: fix isError and set maxBatchSize to 0 (2adfe5b)
pkg: fix org model namespace (#510) (f4be09c)
service: fix workflow retry when deleting (adcbde5)
service: remove org subscription check (76cd66f)
usage: add missing org usage collection (239d3f4)
worker: fix temporal cloud namespace init (#513) (17c5d68)

Code Refactoring

triton: deprecate triton inference server (#512) (f8a277d)

0.21.3-alpha (2024-01-30)

Bug Fixes

model: fix indexing error in text2img and img2img postprocessing (#501) (0ba505b)
model: fix missing field in ray while serving img2img task (#496) (f572f18)
payload: fix wrong form data key (#503) (4d69e5e)

0.21.2-alpha (2024-01-25)

Bug Fixes

main: fix misused return statement (5cbfc3d)

0.21.1-alpha (2024-01-02)

Features

service: support basic github pat to avoid rate-limit (#477) (45931ca)

Miscellaneous Chores

release: release v0.21.1-alpha (bd320b0)

0.21.0-alpha (2023-12-14)

Features

model: refactoring AI Tasks for Consistency Across Text and Image Generation (#461) (e827130)
redis: use redis for model state caching (#472) (3b6b977)

Bug Fixes

model: fix grpc message size limit issue (#474) (1ec7ae1)

0.20.0-alpha (2023-11-30)

Features

model: Enhancements for Image Inpu in Text 2 Image Task (#457) (eb604a1)
ray: use shared python executable (#455) (db9658b)

Bug Fixes

model: fix deployment reconciliation (#459) (bac1961)
ray: fix model file extension (#453) (424d632)

0.19.0-alpha (2023-11-11)

Features

model: Support New Fields for Multi-Modal Model In Text Generation Task and Refactor Existing Ones (#448) (49bdf5b)
ray: add ray serve as model serving backend (#445) (a9b4005)

Bug Fixes

predeploy: fix predeploy model missing triton models reference (3f296cd)
ray: fix model healthcheck causing scaling loop (#450) (4d8cdbf)
ray: fix unziping ray model (ca79411)
service: fix fail model deletion in state error (#449) (91125c0)

0.18.0-alpha (2023-10-26)

Features

model: Enhancements for Llava Model Support and Model Hub File Movement (#434) (58cb97c)
model: Support for LLM-like models in TRITON Inference Server (#432) (590eb0b)

Bug Fixes

Dockerfile: fix Python 3.11 using Debian base image (#438) (2ace6eb)
payload: fix incorrect conversion between integer types (#440) (32bffea)

0.17.2-alpha (2023-10-13)

Bug Fixes

model: fix init model namespace (77a35b3)

0.17.1-alpha (2023-09-30)

Bug Fixes

main: fix namespace error when deploying model (#423) (dd5badf)

0.17.0-alpha (2023-09-13)

Miscellaneous Chores

release: release v0.17.0-alpha (70172a2)

0.16.11-alpha (2023-08-19)

Miscellaneous Chores

release: release v0.16.11-alpha (5aba1ce)

0.16.10-alpha (2023-08-03)

Miscellaneous Chores

release: release v0.16.10-alpha (1cd7990)

0.16.9-alpha (2023-07-20)

Miscellaneous Chores

release: release v0.16.9-alpha (485a9fd)

0.16.8-alpha (2023-07-09)

Miscellaneous Chores

release: release v0.16.8-alpha (8251037)

0.16.7-alpha (2023-06-20)

Miscellaneous Chores

release: release 0.16.7-alpha (c8ef5c4)

0.16.6-alpha (2023-06-11)

Miscellaneous Chores

release: release v0.16.6-alpha (c1f57a9)

0.16.5-alpha (2023-06-02)

Miscellaneous Chores

release: release v0.16.5-alpha (b8ba368)

0.16.4-alpha (2023-05-11)

Miscellaneous Chores

release: release v0.16.4-alpha (ab8cf12)

0.16.3-alpha (2023-05-06)

Bug Fixes

create single triton client (#357) (8dedf5d)

0.16.2-alpha (2023-04-25)

Miscellaneous Chores

release: release v0.16.2-alpha (b735b17)

0.16.1-alpha (2023-04-24)

Bug Fixes

pass the context between package layers (#345) (e6e7f2f)

Miscellaneous Chores

release 0.16.1-alpha (1d497e0)

0.16.0-alpha (2023-04-15)

Features

add model initialization module (#332) (aa753a5)

0.15.0-alpha (2023-04-07)

Features

controller: add model state monitoring with controller (#323) (4397826)
remove model instance (#320) (15e1b62)
support model caching (#317) (d15ffba)

0.14.0-alpha (2023-03-26)

Features

add private endpoint and gRPC test cases (#306) (bb3c193)

Bug Fixes

config: use private port for mgmt-backend (#307) (3264e2b)
list models and model instances pagination (#304) (1f19ed4)

0.13.1-alpha (2023-02-26)

Bug Fixes

create a subfolder in model-repository if needed (#290) (7f8d78b)
fix creating subfolder (105a11a)
fix subfolder creation (#292) (0b6ec3f)
fix variable name (#293) (a7995dd)

0.13.0-alpha (2023-02-23)

Features

add support for text generation tasks (#252) (767ec45)

Bug Fixes

keep format for empty inference output (#258) (e2a2e48)

0.12.1-alpha (2023-02-12)

Bug Fixes

fix keypoint model payload parser (#249) (461d54a)

0.12.0-alpha (2023-02-10)

Features

add text to image task (#239) (421eb1a)

Bug Fixes

fix usage client nil issue when mgmt-backend not ready (#241) (4290159)

0.11.1-alpha (2023-01-20)

Bug Fixes

fix list long-run operation error (#220) (472696d)

0.11.0-alpha (2023-01-14)

Miscellaneous Chores

release 0.11.0-alpha (d592acb)

0.10.0-alpha (2022-12-23)

Features

support async deploy and undeploy model instance (#192) (ed36dc7)
support semantic segmentation (#203) (f22262c)

Bug Fixes

model instance state update to unspecified state (#206) (14c87d5)
panic error with nil object (#208) (a342113)

0.9.1-alpha (2022-11-28)

Bug Fixes

HuggingFace batching bug in preprocess model (b1582e8)

0.9.0-alpha (2022-10-19)

Features

support instance segmentation task (#183) (d28cfdc)

Bug Fixes

allow updating emtpy description for a model (#177) (100ec84)

0.8.1-alpha (2022-09-19)

Bug Fixes

update description for GitHub model from user input (#173) (821dab3)

0.8.0-alpha (2022-09-14)

Features

add confidence score for ocr output (#167) (e915452)

0.7.3-alpha (2022-09-07)

Features

handle oom (#163) (4db1c45)

Miscellaneous Chores

release 0.7.3-alpha (9033c50)

0.7.2-alpha (2022-08-22)

Miscellaneous Chores

release 0.7.2-alpha (17529d6)

0.7.1-alpha (2022-08-21)

Bug Fixes

post process ocr task (e387154)

0.7.0-alpha (2022-08-17)

Features

add release stage for model definition (#153) (4e13ba5)
support ocr task (#150) (7766c6f)

0.6.3-alpha (2022-07-19)

Bug Fixes

fix client stream server recv wrong file length interval (#143) (0e06f7c)
post process for unspecified task output (ad88068)
trigger image with 4 channel (#141) (7445f5f)

0.6.2-alpha (2022-07-12)

Miscellaneous Chores

release v0.6.2-alpha (4365f32)

0.6.1-alpha (2022-07-11)

Miscellaneous Chores

release v0.6.1-alpha (f18dc30)

0.6.0-alpha (2022-07-06)

Features

support creating a HuggingFace model (#113) (1577d87)

Bug Fixes

model definition in list model and missing zero in output (#121) (a90072d)

0.5.1-alpha (2022-06-27)

Miscellaneous Chores

release v0.5.1-alpha (895056d)

0.5.0-alpha (2022-06-26)

Features

add credential definition (#109) (92d3391)
support artivc (#102) (b8e21a4)

Bug Fixes

bug usage storage (#103) (975fdc1)
fix duration configuration bug (ee4a310)
init config before logger (9d3fb4a)
status code when deploy model error (#111) (31d3f11)
update model definitions and tasks in usage collection (#100) (c593087)
wrong logic when checking user account and service account (7058db6)

0.4.2-alpha (2022-05-31)

Bug Fixes

fix config path (a8cf2c0)
regexp zap logger with new protobuf package (8b9c463)

Miscellaneous Chores

release 0.4.2-alpha (fc5a14a)

0.4.1-alpha (2022-05-19)

Bug Fixes

add writeonly to description (f59d98f)
clone repository and make folder (ac79386)
model configuration response in integration test (0225c1e)
refactor JSON schema (f24db48)

0.4.0-alpha (2022-05-13)

Features

create model from GitHub (#61) (cf763cb)

Bug Fixes

refactor model definition and model JSON schema (#73) (0cce154)

0.3.2-alpha (2022-03-22)

Miscellaneous Chores

release 0.3.2-alpha (9f8cd91)

0.3.1-alpha (2022-03-21)

Bug Fixes

fix unload model issue causing Triton server OOM (#42) (fb4d1d1)
update version order when get model version list (#38) (83c054a)

0.3.0-alpha (2022-02-24)

Features

support url/base64 content prediction (#34) (a88ddfd)

Bug Fixes

correct version when making inference (#31) (c918e77)
update docker compose file for building dev image (#29) (83cba09)

0.2.0-alpha (2022-02-19)

Features

add gRPC Gateway and GetModel API (#7) (bff6fc9)
support model name when creating model (#25) (7d799b7)

Bug Fixes

fix build and go version (#9) (f8d4346)

0.1.0-alpha (2022-02-12)

Features

add codebase for model grpc service (4defa3e)

Bug Fixes

add link for guideline create Conda environment file (7ee8e06)
logic when essemble or not (ab8e7c1)
postgres host (a322165)
return list of models in list method (b88ebd7)
update db schema, protobuf generated files and create model, version in upload api (7573e54)
update predict for essemble model (016f11c)

Files

CHANGELOG.md

Latest commit

History

CHANGELOG.md

File metadata and controls

Changelog

0.32.0-alpha (2024-10-23)

Features

0.31.0-alpha (2024-10-22)

Features

Bug Fixes

0.30.2-alpha (2024-10-08)

Bug Fixes

0.30.1-alpha (2024-09-13)

Bug Fixes

0.30.0-alpha (2024-09-10)

Features

0.29.0-alpha (2024-08-29)

Features

Bug Fixes

0.28.0-alpha (2024-07-31)

Features

Bug Fixes

0.27.0-alpha (2024-07-16)

Features

Bug Fixes

0.26.0-alpha (2024-07-02)

Features

Bug Fixes

0.25.1-alpha (2024-06-20)

Bug Fixes

0.25.0-alpha (2024-06-18)

Features

0.24.0-alpha (2024-06-06)

⚠ BREAKING CHANGES

Features

Bug Fixes

0.23.0-alpha (2024-03-09)

Features

Bug Fixes

0.22.0-alpha (2024-02-20)

⚠ BREAKING CHANGES

Features

Bug Fixes

Code Refactoring

0.21.3-alpha (2024-01-30)

Bug Fixes

0.21.2-alpha (2024-01-25)

Bug Fixes

0.21.1-alpha (2024-01-02)

Features

Miscellaneous Chores

0.21.0-alpha (2023-12-14)

Features

Bug Fixes

0.20.0-alpha (2023-11-30)

Features

Bug Fixes

0.19.0-alpha (2023-11-11)

Features

Bug Fixes

0.18.0-alpha (2023-10-26)

Features

Bug Fixes

0.17.2-alpha (2023-10-13)

Bug Fixes

0.17.1-alpha (2023-09-30)

Bug Fixes

0.17.0-alpha (2023-09-13)

Miscellaneous Chores

0.16.11-alpha (2023-08-19)

Miscellaneous Chores

0.16.10-alpha (2023-08-03)

Miscellaneous Chores

0.16.9-alpha (2023-07-20)

Miscellaneous Chores

0.16.8-alpha (2023-07-09)

Miscellaneous Chores

0.16.7-alpha (2023-06-20)