-
Notifications
You must be signed in to change notification settings - Fork 33
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
add esm2 checkpoint to pbss, refactor tests for xfail due to bug
Signed-off-by: Peter St. John <[email protected]>
- Loading branch information
Showing
2 changed files
with
58 additions
and
31 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -7,14 +7,14 @@ | |
description: > | ||
A pretrained 650M parameter ESM2 model. See https://ngc.nvidia.com/catalog/models/nvidia:clara:esm2nv650m. | ||
- tag: nv_3b:2.1 | ||
ngc: "nvidia/clara/esm2nv3b:2.1" | ||
- tag: nv_8m:2.0 | ||
ngc: "nvidia/clara/esm2nv8m:2.0" | ||
ngc_registry: model | ||
pbss: "s3://general-purpose/esm2/checkpoints/3b/esm2_3b_checkpoint.tar.gz" | ||
sha256: a79327a4054bf8d1d7075e1b3c961dbc503da02d72ed15f707d9cbbd49d181b6 # pragma: allowlist secret | ||
pbss: "s3://general-purpose/esm2/checkpoints/8m/esm2_8m_checkpoint.tar.gz" | ||
sha256: b4ea4d52eea8a25d2c2838617ff678f0da22d384cee195b0c192686816078dcd # pragma: allowlist secret | ||
owner: Peter St John <[email protected]> | ||
description: > | ||
An ESM-2 3B model pre-trained on NVIDIA's train/test data split. | ||
An ESM-2 8M model pre-trained on NVIDIA's train/test data split. | ||
- tag: nv_650m:2.1 | ||
ngc: "nvidia/clara/esm2nv650m:2.1" | ||
|
@@ -25,14 +25,23 @@ | |
description: > | ||
An ESM-2 650M model pre-trained on NVIDIA's train/test data split. | ||
- tag: nv_8m:2.0 | ||
ngc: "nvidia/clara/esm2nv8m:2.0" | ||
- tag: nv_3b:2.1 | ||
ngc: "nvidia/clara/esm2nv3b:2.1" | ||
ngc_registry: model | ||
pbss: "s3://general-purpose/esm2/checkpoints/8m/esm2_8m_checkpoint.tar.gz" | ||
sha256: b4ea4d52eea8a25d2c2838617ff678f0da22d384cee195b0c192686816078dcd # pragma: allowlist secret | ||
pbss: "s3://general-purpose/esm2/checkpoints/3b/esm2_3b_checkpoint.tar.gz" | ||
sha256: a79327a4054bf8d1d7075e1b3c961dbc503da02d72ed15f707d9cbbd49d181b6 # pragma: allowlist secret | ||
owner: Peter St John <[email protected]> | ||
description: > | ||
An ESM-2 8M model pre-trained on NVIDIA's train/test data split. | ||
An ESM-2 3B model pre-trained on NVIDIA's train/test data split. | ||
- tag: 8m:2.0 | ||
ngc: null | ||
ngc_registry: model | ||
pbss: s3://general-purpose/esm2/checkpoints/converted/8m/esm2_hf_converted_8m_checkpoint.tar.gz | ||
sha256: 2957b2c36d5978d0f595d6f1b72104b312621cf0329209086537b613c1c96d16 # pragma: allowlist secret | ||
owner: Peter St John <[email protected]> | ||
description: > | ||
A NeMo2 compatible checkpoint converted from the huggingface facebook/esm2_t6_8M_UR50D model. | ||
- tag: 650m:2.0 | ||
ngc: nvidia/clara/esm2nv650m:2.0 | ||
|
@@ -41,7 +50,7 @@ | |
sha256: 0798767e843e3d54315aef91934d28ae7d8e93c2849d5fcfbdf5fac242013997 # pragma: allowlist secret | ||
owner: Farhad Ramezanghorbani <[email protected]> | ||
description: > | ||
A pretrained 650M parameter ESM2 model. See https://ngc.nvidia.com/catalog/models/nvidia:clara:esm2nv650m. | ||
A NeMo2 compatible checkpoint converted from the huggingface facebook/esm2_t33_650M_UR50D model. | ||
- tag: 3b:2.0 | ||
ngc: nvidia/clara/esm2nv3b:2.0 | ||
|
@@ -50,28 +59,28 @@ | |
sha256: a2248cfed1ef39f83bd32a0e08b84c0a8f39325d383e2d92767022ff7f5260ed # pragma: allowlist secret | ||
owner: Farhad Ramezanghorbani <[email protected]> | ||
description: > | ||
A pretrained 3B parameter ESM2 model. See https://ngc.nvidia.com/catalog/models/nvidia:clara:esm2nv3b. | ||
A NeMo2 compatible checkpoint converted from the huggingface facebook/esm2_t36_3B_UR50D model. | ||
- tag: fulldata_esm2_pretrain:2.0 | ||
ngc: nvidia/clara/esm2_pretrain_nemo2_data:1.0 | ||
ngc_registry: resource | ||
pbss: "s3://general-purpose/esm2/pretrain/2024_03.tar.gz" | ||
sha256: 404d0ad8de58fa8aae96f8d9f54263a088bc7e4f7d668215afbe04c28416151b # pragma: allowlist secret | ||
sha256: 404d0ad8de58fa8aae96f8d9f54263a088bc7e4f7d668215afbe04c28416151b # pragma: allowlist secret | ||
owner: Peter St John <[email protected]> | ||
description: Full data for ESM2 pretraining. | ||
|
||
- tag: testdata_esm2_pretrain:2.0 | ||
ngc: nvidia/clara/esm2_pretrain_nemo2_testdata:1.0 | ||
ngc_registry: resource | ||
pbss: "s3://general-purpose/esm2/pretrain/2024_03_sanity.tar.gz" | ||
sha256: 006911f92bbc0ded7ea302bbdbfab4c694b409e699c32fd49de1c527a99dba3e # pragma: allowlist secret | ||
sha256: 006911f92bbc0ded7ea302bbdbfab4c694b409e699c32fd49de1c527a99dba3e # pragma: allowlist secret | ||
owner: Peter St John <[email protected]> | ||
description: Test data for ESM2 pretraining. | ||
|
||
- tag: esm2_inference_testdata:2.0 | ||
ngc: nvidia/clara/esm2_inference_testdata:2.0 # TODO: upload to NGC | ||
ngc: nvidia/clara/esm2_inference_testdata:2.0 # TODO: upload to NGC | ||
ngc_registry: resource | ||
pbss: "s3://bionemo-ci/test_data/esm2/artificial_protein_sequences.csv" | ||
sha256: 14ae3acfbf82218bc9e3e53d21a5b0594ba7c0369e169c9f1034e3fe4378d175 # pragma: allowlist secret | ||
sha256: 14ae3acfbf82218bc9e3e53d21a5b0594ba7c0369e169c9f1034e3fe4378d175 # pragma: allowlist secret | ||
owner: Farhad Ramezanghorbani <[email protected]> | ||
description: Test data for ESM2 inference. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters