GitHub - stair-lab/embedder

Embedding function

This Python package tokenizes sentences and passes the tokens through a model to obtain a sentence embedding. It is designed to take its input as a dataloader.
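The sentence-to-tokens-to-embedding flow described above can be illustrated with a toy sketch. This is not the package's implementation: the whitespace tokenizer, the hash-based `token_vector` stand-in for a model, and the mean pooling step are all assumptions chosen so the example runs without downloading a model. The pooling strategy the package actually uses is not stated here.

```python
import hashlib


def tokenize(sentence):
    """Toy tokenizer (assumption): lowercase and split on whitespace."""
    return sentence.lower().split()


def token_vector(token, dim=8):
    """Deterministic stand-in for a model's per-token output vector."""
    digest = hashlib.sha256(token.encode("utf-8")).digest()
    # Map the first `dim` bytes (0..255) to floats in [-1, 1].
    return [b / 127.5 - 1.0 for b in digest[:dim]]


def embed_sentence(sentence, dim=8):
    """Mean-pool the token vectors into one fixed-size sentence vector."""
    vectors = [token_vector(tok, dim) for tok in tokenize(sentence)]
    if not vectors:
        # An empty sentence has no tokens; return a zero vector.
        return [0.0] * dim
    return [sum(column) / len(vectors) for column in zip(*vectors)]


embedding = embed_sentence("The quick brown fox")
print(len(embedding))
```

Whatever the sentence length, the result has a fixed size, which is the property that makes sentence embeddings comparable with one another.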

How to use:

First, within your environment, install the package.

pip install git+https://github.com/stair-lab/embedder.git

In your script, include the module:

from embed_text_package.embed_text import Embedder

Then you can initialize an embedder, load the model and call it:

NOTE: the load() function loads both the model and the embedder.

model_name = "<HF_repo>/<HF_model>"
embdr = Embedder()
embdr.load(model_name)
emb = embdr.get_embeddings(dataloader, model_name, cols_to_be_embded)

Here, dataloader is of type DataLoader and model_name is of type str. cols_to_be_embded is of type list and should contain the names of the columns of the dataloader's dataset that are to be embedded.
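To make the role of cols_to_be_embded concrete, here is a minimal sketch of column selection over batches. It assumes each batch is a dict mapping a column name to a list of values, which is the shape PyTorch's default collation produces for dict-style rows; whether the package relies on exactly this shape is an assumption. `make_batches` and `texts_per_column` are hypothetical helpers, not part of the package, and no model is involved.

```python
def make_batches(rows, batch_size):
    """Yield batches shaped like {column: [values, ...]} from dict rows."""
    for start in range(0, len(rows), batch_size):
        chunk = rows[start:start + batch_size]
        yield {col: [row[col] for row in chunk] for col in chunk[0]}


def texts_per_column(batches, cols_to_be_embded):
    """Collect, per requested column, the values that would be embedded."""
    collected = {col: [] for col in cols_to_be_embded}
    for batch in batches:
        for col in cols_to_be_embded:
            if col not in batch:
                raise KeyError(f"column {col!r} not found in batch")
            collected[col].extend(batch[col])
    return collected


# Hypothetical dataset rows; only "question" and "answer" are selected.
rows = [
    {"id": 1, "question": "What is an embedding?", "answer": "A vector."},
    {"id": 2, "question": "What is a token?", "answer": "A text unit."},
    {"id": 3, "question": "What is a batch?", "answer": "A group of rows."},
]
selected = texts_per_column(make_batches(rows, batch_size=2), ["question", "answer"])
print(selected["question"])
```

Columns that are not listed (here "id") are simply ignored, while a misspelled column name fails early instead of silently producing no embeddings.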

How to test:

First, within your environment, install the package pytest.

pip install pytest

Then, cd to the main folder of the package ("embedder") and run:

pytest

