Actions: TJ-Solergibert/nanotron

All workflows

Showing runs from all workflows
52 workflow runs

Bringing liger kernels back
Secret Leaks #49: Commit df3ef9d pushed by TJ-Solergibert
September 26, 2024 07:28 18s document_xattention
No more NaN losses
Secret Leaks #48: Commit 3969aa2 pushed by TJ-Solergibert
September 26, 2024 07:18 17s document_xattention
read datasets locally
Secret Leaks #46: Commit f3bf21d pushed by TJ-Solergibert
September 17, 2024 08:36 18s document_xattention
Fixed metadata issue
Secret Leaks #45: Commit 81fdb3a pushed by TJ-Solergibert
September 16, 2024 15:47 22s document_xattention
Only load model parameters on SFT
Secret Leaks #44: Commit cd81111 pushed by TJ-Solergibert
September 16, 2024 15:30 17s document_xattention
Merge branch 'main' into document_xattention
Secret Leaks #43: Commit ed51183 pushed by TJ-Solergibert
September 16, 2024 15:15 20s document_xattention
September 16, 2024 12:38 21s
Compatibility with llama.py checkpoints
Secret Leaks #41: Commit a7051d1 pushed by TJ-Solergibert
September 16, 2024 09:22 20s document_xattention
Added EP==0
Secret Leaks #40: Commit ef835e8 pushed by TJ-Solergibert
September 6, 2024 12:06 18s fix_resume_pp
Fix pp naming
Secret Leaks #39: Commit 4d61489 pushed by TJ-Solergibert
September 6, 2024 10:27 20s fix_resume_pp
Fix eval check
Secret Leaks #38: Commit 1969526 pushed by TJ-Solergibert
September 4, 2024 16:56 24s validation
Optional validation
Secret Leaks #37: Commit 8e6f8ab pushed by TJ-Solergibert
August 27, 2024 17:33 17s validation
Adding liger kernels and modifying conversion scripts
Secret Leaks #36: Commit a185c50 pushed by TJ-Solergibert
August 26, 2024 14:57 23s document_xattention
Lets move to todi
Secret Leaks #35: Commit 5157392 pushed by TJ-Solergibert
August 26, 2024 07:58 16s document_xattention
Little hack to fix first length
Secret Leaks #34: Commit efd168f pushed by TJ-Solergibert
August 22, 2024 20:02 17s document_xattention
first commit
Secret Leaks #33: Commit 71122d3 pushed by TJ-Solergibert
August 22, 2024 16:56 22s document_xattention
Last fixes
Secret Leaks #32: Commit ce068fd pushed by TJ-Solergibert
August 7, 2024 19:44 16s validation
Dont predict EOText token
Secret Leaks #31: Commit d5228bb pushed by TJ-Solergibert
August 2, 2024 09:01 19s sft
sft
August 2, 2024 07:35 15s
Added todi scripts
Secret Leaks #29: Commit c026422 pushed by TJ-Solergibert
July 30, 2024 20:42 18s sft
RCP Working
Secret Leaks #28: Commit a8f979d pushed by TJ-Solergibert
July 30, 2024 16:55 16s sft
Getting ready
Secret Leaks #27: Commit 06af8cf pushed by TJ-Solergibert
July 30, 2024 09:31 19s sft
Added masked LOSS check
Secret Leaks #26: Commit a66b0c6 pushed by TJ-Solergibert
July 29, 2024 14:30 18s sft
Added SFT generations check script
Secret Leaks #25: Commit c57533d pushed by TJ-Solergibert
July 29, 2024 09:43 18s sft