swiss-ai
Popular repositories
- nanotron Public Forked from huggingface/nanotron
Minimalistic large language model 3D-parallelism training
- lighteval-multilingual Public Forked from huggingface/lighteval
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
Python 2
- video2dataset Public Forked from iejMac/video2dataset
Easily create large video datasets from video URLs
Python 1
- Megatron-LLM Public Forked from epfLLM/Megatron-LLM
Distributed trainer for LLMs
Python
- data-PDF-pipeline Public
PDF pipeline for creating training corpora (mainly for LLM, multimodal and alignment horizontals)
Python
Repositories
- nanotron-multilingual Public Forked from swiss-ai/nanotron
A copy of nanotron for multilingual training
- lighteval-multilingual Public Forked from huggingface/lighteval
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
- video2dataset Public Forked from iejMac/video2dataset
Easily create large video datasets from video URLs
- ml-4m Public Forked from apple/ml-4m
4M: Massively Multimodal Masked Modeling (NeurIPS 2023 Spotlight)
- vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
- data-tooling Public Forked from huggingface/datatrove
Tool set for data preparation and selection in the context of Swiss-AI (forked from DataTrove)