[pull] main from caikit:main #19

pull · 2024-02-29T05:24:15Z

See Commits and Changes for more details.

Can you help keep this open source service alive? 💖 Please sponsor : )

Need to make sentence-transformers part of the default caikit-nlp runtime image. Previously this was an optional import requiring a special runtime image. * Make sentence-transformers one of the dependencies * Remove the just-for-testing no-deps support that was a work-around Signed-off-by: markstur <[email protected]>

Signed-off-by: markstur <[email protected]>

…runcation Wrapped model with custom encode params. Moves truncation into encode (tokenize) processing). Adds option to use dtype=bloat16 with autocast (adding this for IPEX testing). Use env var BFLOAT16 to enable the dtype for ipex and the autocast in encode. Signed-off-by: markstur <[email protected]>

Signed-off-by: waleedqk <[email protected]>

Signed-off-by: Shonda-Adena-Witherspoon <[email protected]>

…task in decorator Signed-off-by: Shonda-Adena-Witherspoon <[email protected]>

…e.py Signed-off-by: Shonda-Adena-Witherspoon <[email protected]>

* add tests and verify that we get the expected truncation errors * re-tokenize to recreate the token count on trucation errors * cleanup unused stuff Signed-off-by: markstur <[email protected]>

Signed-off-by: Shonda-Adena-Witherspoon <[email protected]>

Add sentence-transformers to dependencies for real

added tokenization tasks to serve tokenization requests

Signed-off-by: waleedqk <[email protected]>

git push origin $BRANCH e remote-tracking branch 'origin/main' into ResponseOptions

openshift-ci · 2024-02-29T05:24:21Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: pull[bot]

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

openshift-ci · 2024-02-29T05:24:26Z

Hi @pull[bot]. Thanks for your PR.

I'm waiting for a opendatahub-io member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Signed-off-by: markstur <[email protected]>

Signed-off-by: waleedqk <[email protected]>

* config introduces values that env vars wouldn't * make those checks more robust * make it more testable Signed-off-by: markstur <[email protected]>

* For doc/defaults Signed-off-by: markstur <[email protected]>

Signed-off-by: markstur <[email protected]>

Wrapped sentence transformer

Signed-off-by: Mynhardt Burger <[email protected]>

…in-results Include input token count in results

🥅 Allow int thresholds

* Adding missing params * Don't return unexpected tuple (with token count) unless asked * Adding check to not use our params if given an unwrapped model * Fixing some param position things Signed-off-by: Mark Sturdevant <[email protected]>

…tion behavior. * First draft could KeyError when deleting kwargs that don't exist. Tests added. * Adding a config option so the desired default behavior can be either: - Throw an error if truncation is happening implicitly, or - Nah. Just let it go. First was requested, so trunction does not happen quietly. This is common behavoir for some models. The second is more aligned with SentenceTransformers and is probably necessary for standard tests to run without errors. Signed-off-by: Mark Sturdevant <[email protected]>

Signed-off-by: Mark Sturdevant <[email protected]>

Make encode() in wrapped model compatible with super encode()

Signed-off-by: waleedqk <[email protected]>

Signed-off-by: Evaline Ju <[email protected]>

⚡ Tee stream instead of checking length

Signed-off-by: waleedqk <[email protected]>

Adding tgis params to caikit api

…h truncation * Attempting truncate_input_tokens=2 (or 1) was creating a strange error (or misbehaving) because it takes at least 3 tokens for [CLS] TOK [SEP] for meaningful results. * Now that truncate value generally means number of tokens not including begin/end. * On the max end the 2 special tokens will be allowed to consume 2 from the limit. * Batch embedding processing was returning odd/misordered results when combined with truncation. Added a re tokenize() call to avoid sending the overflow tokens as features to be processed. Signed-off-by: Mark Sturdevant <[email protected]>

Embeddings fix for truncation without room for begin/end and for batch truncation

* Test for 2 identical strings to return identical vectors * Also tests that batch vs single/loop are approx same * This test would have found a bug that was recently fixed Signed-off-by: Mark Sturdevant <[email protected]>

Embedding add a test that would have helped

Signed-off-by: Evaline Ju <[email protected]>

✨ Add tokenization task to generation modules

markstur and others added 16 commits February 7, 2024 13:26

Wrap SentenceTransformers to allow args

1907e07

Signed-off-by: markstur <[email protected]>

First Draft: Adding tgis params to caikit api

c4a8eb3

Signed-off-by: waleedqk <[email protected]>

Make similar changes for prompt tuned model to address failing tests

64b1e79

Signed-off-by: waleedqk <[email protected]>

add additional parameters to the caikit API from tgis

1b45f43

Signed-off-by: waleedqk <[email protected]>

add additional parameters to the caikit API from tgis

f670dbb

Signed-off-by: waleedqk <[email protected]>

added tokenization tasks to serve tokenization requests

dc8cbd3

Signed-off-by: Shonda-Adena-Witherspoon <[email protected]>

added unit test to modules, bumped caikit version, and declared multi…

6773656

…task in decorator Signed-off-by: Shonda-Adena-Witherspoon <[email protected]>

updated doc string, comments, and added unit test for peft_tgis_remot…

4542cde

…e.py Signed-off-by: Shonda-Adena-Witherspoon <[email protected]>

Fix for truncation error message.

ab21d84

* add tests and verify that we get the expected truncation errors * re-tokenize to recreate the token count on trucation errors * cleanup unused stuff Signed-off-by: markstur <[email protected]>

reformatted doc string for consistency

14e0c25

Signed-off-by: Shonda-Adena-Witherspoon <[email protected]>

Merge pull request #319 from markstur/sentence-transformers-dep

9c479e7

Add sentence-transformers to dependencies for real

Merge pull request #325 from swith004/input_token_count_136

2ab8abd

added tokenization tasks to serve tokenization requests

add tokens and input_tokens to response

057fa7a

Signed-off-by: waleedqk <[email protected]>

Mergt commit -m "Merge changes from main"

e0e58af

git push origin $BRANCH e remote-tracking branch 'origin/main' into ResponseOptions

openshift-ci bot added the needs-ok-to-test label Feb 29, 2024

pull bot added ⤵️ pull and removed needs-ok-to-test labels Feb 29, 2024

markstur and others added 9 commits February 29, 2024 00:48

Embeddings should use config to add scope to env setting.

6466a83

Signed-off-by: markstur <[email protected]>

remove comments

b4d44af

Signed-off-by: waleedqk <[email protected]>

add check conditions

765d682

Signed-off-by: waleedqk <[email protected]>

add check to stream case

e661fc4

Signed-off-by: waleedqk <[email protected]>

Fix using caikit config for deployment config settings

1c25484

* config introduces values that env vars wouldn't * make those checks more robust * make it more testable Signed-off-by: markstur <[email protected]>

Add embedding config defaults to caikit_nlp/config/config.yml

cc83ab0

* For doc/defaults Signed-off-by: markstur <[email protected]>

Doc strings

1a9ad04

Signed-off-by: markstur <[email protected]>

Merge pull request #328 from markstur/wrapped_sentence_transformer

8ca93c3

Wrapped sentence transformer

Add input_token_count to results

3c1f4b9

Signed-off-by: Mynhardt Burger <[email protected]>

mynhardtburger and others added 29 commits March 11, 2024 21:41

Remove #type: ignore

bdda232

Signed-off-by: Mynhardt Burger <[email protected]>

Merge pull request #334 from mynhardtburger/inlude-input_token_count-…

01526aa

…in-results Include input token count in results

Merge branch 'main' into int-threshold

f2f8380

Merge pull request #336 from evaline-ju/int-threshold

f55b082

🥅 Allow int thresholds

fmt fix

fc7d81e

Signed-off-by: Mark Sturdevant <[email protected]>

Merge pull request #337 from markstur/compatible_encode

ce34b1c

Make encode() in wrapped model compatible with super encode()

random change to restart build

7a747ec

Signed-off-by: waleedqk <[email protected]>

revert un-needed change

183c49c

Signed-off-by: waleedqk <[email protected]>

update validate test for stream and unary to check for new fields

5e5c5c4

Signed-off-by: waleedqk <[email protected]>

⚡ Tee stream instead of checking length

d97d00a

Signed-off-by: Evaline Ju <[email protected]>

🔧 Add tokenization import

c6e0986

Signed-off-by: Evaline Ju <[email protected]>

Merge pull request #341 from evaline-ju/no-stream-len

3bf2a13

⚡ Tee stream instead of checking length

add rank data to generated result

aca4538

Signed-off-by: waleedqk <[email protected]>

kick of build process

9b18700

Signed-off-by: waleedqk <[email protected]>

explicit caikit version to ensure the data model field is available

9b2d516

Signed-off-by: waleedqk <[email protected]>

Merge changes from main

8c16dc4

Merge pull request #324 from waleedqk/ResponseOptions

42c3075

Adding tgis params to caikit api

Merge pull request #343 from markstur/embed_trunc_fix

d34987a

Embeddings fix for truncation without room for begin/end and for batch truncation

Embedding add a test that would have helped

55b07f8

* Test for 2 identical strings to return identical vectors * Also tests that batch vs single/loop are approx same * This test would have found a bug that was recently fixed Signed-off-by: Mark Sturdevant <[email protected]>

Merge pull request #344 from markstur/embed_same_test

c12cb82

Embedding add a test that would have helped

✨ Add tokenization task to generation modules

6f231bf

Signed-off-by: Evaline Ju <[email protected]>

✅ Add unimplemented function tests

4cc6b88

Signed-off-by: Evaline Ju <[email protected]>

📌 Pin breaking import changes for torch 2.3.0

cb9a51a

Signed-off-by: Evaline Ju <[email protected]>

🐛 Change Std import to avoid torch pin

b608e75

Signed-off-by: Evaline Ju <[email protected]>

📌 Pin torch due to changed LaunchConfig args

79f20d8

Signed-off-by: Evaline Ju <[email protected]>

Merge pull request #351 from evaline-ju/llm-tok

d81f11e

✨ Add tokenization task to generation modules

dtrifiro merged commit 777cb2c into opendatahub-io:main May 13, 2024
3 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[pull] main from caikit:main #19

[pull] main from caikit:main #19

pull bot commented Feb 29, 2024 •

edited

Loading

openshift-ci bot commented Feb 29, 2024

openshift-ci bot commented Feb 29, 2024

[pull] main from caikit:main #19

[pull] main from caikit:main #19

Conversation

pull bot commented Feb 29, 2024 • edited Loading

openshift-ci bot commented Feb 29, 2024

openshift-ci bot commented Feb 29, 2024

pull bot commented Feb 29, 2024 •

edited

Loading