Recently ModernBERT was launched with an 8k context length: https://huggingface.co/blog/modernbert
Does Flair support fine-tuning and inference of models with a context length higher than 512?
Thanks!
I was able to run it; you just need to set the `model_max_length` tokenizer parameter:
```python
from flair.embeddings import TransformerWordEmbeddings

embeddings = TransformerWordEmbeddings(
    model="answerdotai/ModernBERT-base",
    layers="all",
    subtoken_pooling="first",
    fine_tune=True,
    use_context=True,
    transformers_tokenizer_kwargs={"model_max_length": 8192},
)
```
Remember to install transformers from the main branch, plus flash-attn and triton:

```shell
pip install git+https://github.com/huggingface/transformers.git
pip install flash-attn --no-build-isolation
pip install triton
```
ModernBERT will be included in v4.48.0 of transformers. Until then, it requires installing transformers from main.
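For completeness, the embeddings above can then be plugged into a Flair `SequenceTagger` for fine-tuning. Here is a minimal sketch; the corpus (`CONLL_03`), the tagger settings, and the training hyperparameters are illustrative assumptions on my side, not something from this thread:

```python
# Sketch: fine-tuning a Flair SequenceTagger on top of ModernBERT embeddings.
# The corpus (CONLL_03) and all hyperparameters below are illustrative
# assumptions, not taken from the original thread.
from flair.datasets import CONLL_03
from flair.embeddings import TransformerWordEmbeddings
from flair.models import SequenceTagger
from flair.trainers import ModelTrainer

# Load a CoNLL-style NER corpus and build the label dictionary.
corpus = CONLL_03()
label_dict = corpus.make_label_dictionary(label_type="ner")

# ModernBERT embeddings with the extended 8k context, as in the snippet above.
embeddings = TransformerWordEmbeddings(
    model="answerdotai/ModernBERT-base",
    layers="-1",
    subtoken_pooling="first",
    fine_tune=True,
    use_context=True,
    transformers_tokenizer_kwargs={"model_max_length": 8192},
)

# A plain linear tagging head on top of the transformer (no CRF, no RNN),
# which is the usual setup when fine-tuning the transformer itself.
tagger = SequenceTagger(
    hidden_size=256,
    embeddings=embeddings,
    tag_dictionary=label_dict,
    tag_type="ner",
    use_crf=False,
    use_rnn=False,
    reproject_embeddings=False,
)

trainer = ModelTrainer(tagger, corpus)
trainer.fine_tune(
    "resources/modernbert-ner",
    learning_rate=5e-5,
    mini_batch_size=16,
)
```

Note that this downloads the corpus and the model weights on first run, and flash-attn support requires a compatible GPU.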
Hey @heukirne ,
if you plan to use ModernBERT for Token Classification tasks, please have a look at my NER repo:
https://github.com/stefan-it/modern-bert-ner
ModernBERT currently has a tokenizer issue, which (probably) does not affect text classification tasks...