Connection to a SambaNova node is a two-step process. The first step is to ssh to the login node. The second step is to log in to a SambaNova node from the login node.
Log in to the SambaNova login node from your local machine. This step uses a MobilePASS+ token, which is generated each time you log in to the system.
In the examples below, replace ALCFUserID with your ALCF user id.
ssh [email protected]
Password: < MobilePASS+ code >
Note: Use the ssh "-v" option in order to debug any ssh problems.
Once you are on the login node, ssh to one of the SambaNova compute nodes.
ssh sn30-r1-h1
It is also recommended to ssh to the other compute nodes, namely sn30-r1-h1, sn30-r1-h2, sn30-r2-h1, sn30-r2-h2, sn30-r3-h1, sn30-r3-h2, sn30-r4-h1, sn30-r4-h2. Note: This avoids all your jobs being queued up on the same node.
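One way to spread logins across the nodes listed above is to pick a node pseudo-randomly instead of always using the same one. The following is a minimal sketch, not an official tool: pick_node is a hypothetical helper name, and the script assumes it is run on the login node, where the node names resolve.

```shell
# Node names copied from the list above.
NODES="sn30-r1-h1 sn30-r1-h2 sn30-r2-h1 sn30-r2-h2 sn30-r3-h1 sn30-r3-h2 sn30-r4-h1 sn30-r4-h2"

# pick_node (hypothetical helper): print one node name chosen pseudo-randomly.
pick_node() {
    # $NODES is intentionally unquoted so that it splits into one name per line.
    printf '%s\n' $NODES |
        awk 'BEGIN { srand() } { n[NR] = $0 } END { print n[int(rand() * NR) + 1] }'
}

node=$(pick_node)
echo "Selected node: $node"
# Uncomment the next line when running on the login node.
# ssh "$node"
```

Because awk seeds srand() from the time of day, two calls within the same second may return the same node; that is usually acceptable for this purpose.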
The SambaNova software stack and the associated environment variables are set up automatically at login on an SN30 node.
Each of the samples or application examples provided by SambaNova has its own pre-built virtual environment, which can be used as is. These environments are located in the /opt/sambaflow/apps/ directory tree, within each application's directory. This directory tree contains all the models currently supported by the SambaNova software stack.
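To see which pre-built environments exist, one option is to search the applications tree for activation scripts. The sketch below assumes the environments follow the standard Python venv layout (a bin/activate file inside the environment directory); the actual layout on the system may differ. list_venvs is a hypothetical helper name.

```shell
# list_venvs (hypothetical helper): print the root of every directory under "$1"
# that contains bin/activate, the standard Python virtual environment layout.
list_venvs() {
    find "$1" -type f -path '*/bin/activate' 2>/dev/null | sed 's|/bin/activate$||'
}

APPS_DIR=/opt/sambaflow/apps/
if [ -d "$APPS_DIR" ]; then
    list_venvs "$APPS_DIR"
else
    echo "$APPS_DIR not found; run this on a SambaNova node" >&2
fi
```

An environment found this way can then be activated with `source <environment root>/bin/activate`.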
For the BERT example, study the flags used in the script. Change the value of the --ntasks flag and measure its effect on performance.
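The --ntasks experiment can be organized as a small loop that records the elapsed time for each value. This is only a sketch under stated assumptions: run_bert is a hypothetical stand-in that must be replaced with the actual launch command from your copy of the BERT script, and the values 1, 2, 4, 8 are arbitrary examples, not recommended settings.

```shell
# run_bert (hypothetical stand-in): replace the body with the real BERT launch
# command, passing "$1" as the value of the --ntasks flag.
run_bert() {
    echo "would launch BERT with --ntasks $1"
}

# time_ntasks: print "<ntasks> <elapsed seconds>" for one run.
time_ntasks() {
    start=$(date +%s)
    run_bert "$1" >/dev/null
    end=$(date +%s)
    echo "$1 $((end - start))"
}

# Example values only; choose values appropriate for the node.
for n in 1 2 4 8; do
    time_ntasks "$n"
done
```

If the launch command only submits a job to a queue and returns immediately, the elapsed time measured here reflects the submission rather than the run itself. In that case, compare the performance figures reported in each job's output instead.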
- ALCF Sambanova Documentation
- Sambanova Documentation
- SambaNova applications path: /opt/sambaflow/apps/
- SambaNova model scripts: /data/ANL/scripts/
- Important datasets: /software/sambanova/dataset/