🎯
Focusing
PhD student @ University of Warsaw, interested in large language models.
- University of Warsaw
- Warsaw, Poland
- https://syzymon.github.io
- @s_tworkowski
- https://scholar.google.com/citations?user=1V8AeXYAAAAJ&hl=en
Pinned
- CStanKonrad/long_llama (Public): LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
- young-geng/EasyLM (Public): Large language models (LLMs) made easy. EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.
- tensorflow/tensor2tensor (Public archive): Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
- paperswithcode/ai-deadlines (Public): ⏰ AI conference deadline countdowns