Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[pull] main from caikit:main #19

Merged
merged 71 commits into from
May 13, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
71 commits
Select commit Hold shift + click to select a range
007e32c
Add sentence-transformers to dependencies for real
markstur Feb 7, 2024
1907e07
Wrap SentenceTransformers to allow args
markstur Jan 13, 2024
d5f89b9
Wrap SentenceTransformer to allow encode with bfloat16/autocast and t…
markstur Feb 14, 2024
c4a8eb3
First Draft: Adding tgis params to caikit api
Feb 15, 2024
64b1e79
Make similar changes for prompt tuned model to address failing tests
Feb 15, 2024
1b45f43
add additional parameters to the caikit API from tgis
Feb 20, 2024
f670dbb
add additional parameters to the caikit API from tgis
Feb 20, 2024
dc8cbd3
added tokenization tasks to serve tokenization requests
Feb 21, 2024
6773656
added unit test to modules, bumped caikit version, and declared multi…
Feb 23, 2024
4542cde
updated doc string, comments, and added unit test for peft_tgis_remot…
Feb 26, 2024
ab21d84
Fix for truncation error message.
markstur Feb 27, 2024
14e0c25
reformatted doc string for consistency
Feb 27, 2024
9c479e7
Merge pull request #319 from markstur/sentence-transformers-dep
evaline-ju Feb 27, 2024
2ab8abd
Merge pull request #325 from swith004/input_token_count_136
evaline-ju Feb 27, 2024
057fa7a
add tokens and input_tokens to response
Feb 28, 2024
e0e58af
Mergt commit -m "Merge changes from main"
Feb 28, 2024
6466a83
Embeddings should use config to add scope to env setting.
markstur Feb 29, 2024
b4d44af
remove comments
Feb 29, 2024
765d682
add check conditions
Feb 29, 2024
e661fc4
add check to stream case
Feb 29, 2024
1c25484
Fix using caikit config for deployment config settings
markstur Feb 29, 2024
cc83ab0
Add embedding config defaults to caikit_nlp/config/config.yml
markstur Mar 1, 2024
1a9ad04
Doc strings
markstur Mar 1, 2024
8ca93c3
Merge pull request #328 from markstur/wrapped_sentence_transformer
evaline-ju Mar 1, 2024
3c1f4b9
Add input_token_count to results
mynhardtburger Mar 5, 2024
ef72725
Bug fixes
mynhardtburger Mar 5, 2024
14b298d
Add tests and bug fixes
mynhardtburger Mar 5, 2024
700a5d5
Review comments
mynhardtburger Mar 6, 2024
6547083
bump caikit dependency for datamodel updates
mynhardtburger Mar 7, 2024
827637f
:goal_net: Allow int thresholds
evaline-ju Mar 7, 2024
4c7fd20
add test for sort order
mynhardtburger Mar 8, 2024
df52040
Fix warnings
mynhardtburger Mar 8, 2024
0465796
refactor _truncate_input_tokens and sum_token_count
mynhardtburger Mar 8, 2024
09e66fc
Linting
mynhardtburger Mar 8, 2024
9cf0491
Add token count asserts for all endpoint tests
mynhardtburger Mar 8, 2024
092cb34
update Mock to see item is list for testing
Mar 8, 2024
4ef9dc1
Update tests/modules/text_embedding/test_embedding.py
mynhardtburger Mar 8, 2024
c8b2a8f
Fix docstring
mynhardtburger Mar 8, 2024
11d6135
Fix get_sample_start_indexes
mynhardtburger Mar 9, 2024
e89a84d
Add comments about token counts
mynhardtburger Mar 9, 2024
942826f
Readability updates for get_sample_start_indexes
mynhardtburger Mar 9, 2024
a2a7f02
update comments
Mar 11, 2024
bdda232
Remove #type: ignore
mynhardtburger Mar 12, 2024
01526aa
Merge pull request #334 from mynhardtburger/inlude-input_token_count-…
evaline-ju Mar 13, 2024
f2f8380
Merge branch 'main' into int-threshold
evaline-ju Mar 13, 2024
f55b082
Merge pull request #336 from evaline-ju/int-threshold
evaline-ju Mar 13, 2024
3c608b3
Make encode() in wrapped model compatible with super encode()
Mar 14, 2024
c6ed5b2
Catch KeyErrors before deleting keys. Add configurable default trunca…
markstur Mar 16, 2024
fc7d81e
fmt fix
markstur Mar 16, 2024
ce34b1c
Merge pull request #337 from markstur/compatible_encode
evaline-ju Mar 19, 2024
7a747ec
random change to restart build
Mar 26, 2024
183c49c
revert un-needed change
Mar 26, 2024
5e5c5c4
update validate test for stream and unary to check for new fields
Apr 1, 2024
d97d00a
:zap: Tee stream instead of checking length
evaline-ju Apr 1, 2024
c6e0986
:wrench: Add tokenization import
evaline-ju Apr 1, 2024
3bf2a13
Merge pull request #341 from evaline-ju/no-stream-len
evaline-ju Apr 2, 2024
aca4538
add rank data to generated result
Apr 2, 2024
9b18700
kick of build process
Apr 2, 2024
9b2d516
explicit caikit version to ensure the data model field is available
Apr 2, 2024
8c16dc4
Merge changes from main
Apr 2, 2024
42c3075
Merge pull request #324 from waleedqk/ResponseOptions
evaline-ju Apr 2, 2024
780ddfa
Embeddings fix for truncation without room for begin/end and for batc…
markstur Apr 3, 2024
d34987a
Merge pull request #343 from markstur/embed_trunc_fix
evaline-ju Apr 3, 2024
55b07f8
Embedding add a test that would have helped
markstur Apr 3, 2024
c12cb82
Merge pull request #344 from markstur/embed_same_test
evaline-ju Apr 4, 2024
6f231bf
:sparkles: Add tokenization task to generation modules
evaline-ju Apr 24, 2024
4cc6b88
:white_check_mark: Add unimplemented function tests
evaline-ju Apr 24, 2024
cb9a51a
:pushpin: Pin breaking import changes for torch 2.3.0
evaline-ju Apr 24, 2024
b608e75
:bug: Change Std import to avoid torch pin
evaline-ju Apr 24, 2024
79f20d8
:pushpin: Pin torch due to changed LaunchConfig args
evaline-ju Apr 24, 2024
d81f11e
Merge pull request #351 from evaline-ju/llm-tok
gkumbhat Apr 25, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
18 changes: 18 additions & 0 deletions caikit_nlp/config/config.yml
Original file line number Diff line number Diff line change
Expand Up @@ -36,5 +36,23 @@ training_data_limit:
6655831b-960a-4dc5-8df4-867026e2cd41:
add_model_name_here: 10000

# Config used only in EmbeddingModule. Set here or use env vars like EMBEDDING_RETRIES=32
embedding:
# Number of times to retry on error. Most deployments should use 0 retries.
retries: 0
# Batch size for encode() if <= 0 or invalid, the sentence-transformers default is used
batch_size: 0
# Should implicit truncation (with truncate_input_tokens=0) throw error for truncation (default) or disable this
implicit_truncation_errors: true
# Attempt to optimize with PyTorch compile()
pt2_compile: false
# Use IPEX optimize. Works best when used with autocast (bfloat16) below.
ipex: false
# Use autocast in encode with its default dtype (bfloat16)
autocast: false
# For testing, set device to "mps" on MacOS or "xpu" for IPEX GPU.
# Otherwise, the default does automatic checks for cuda GPU (else cpu).
device: ""

runtime:
library: caikit_nlp
8 changes: 7 additions & 1 deletion caikit_nlp/modules/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -12,4 +12,10 @@
# See the License for the specific language governing permissions and
# limitations under the License.
# Local
from . import text_classification, text_embedding, text_generation, token_classification
from . import (
text_classification,
text_embedding,
text_generation,
token_classification,
tokenization,
)
Loading
Loading