To install the required packages with basic capability only:
pip install -r requirements.txt
pip install -r requirements-torchdep.txt
To run Bayesian hyperparameter optimization, additionally install:
pip install -r requirements-hyperopt.txt
If you plan to modify the code, include packages for formatting (black) and testing (pytest) the code:
pip install -r requirements-dev.txt
For serializing and storing the processed data for later sessions (pickle) and visualizing structure connections (igraph):
pip install -r requirements-other.txt
To install a specific PyTorch version on a CPU-only machine:
pip3 install torch==1.6.0+cpu -f https://download.pytorch.org/whl/torch_stable.html
pip3 install torch-scatter==latest+cpu -f https://pytorch-geometric.com/whl/torch-1.6.0.html
pip3 install torch-sparse==latest+cpu -f https://pytorch-geometric.com/whl/torch-1.6.0.html
pip3 install torch-cluster==latest+cpu -f https://pytorch-geometric.com/whl/torch-1.6.0.html
pip3 install torch-spline-conv==latest+cpu -f https://pytorch-geometric.com/whl/torch-1.6.0.html
pip3 install torch-geometric
To install on a machine that already has GPUs and PyTorch installed:
pip install --no-index torch-scatter -f https://pytorch-geometric.com/whl/torch-${TORCH}+${CUDA}.html
pip install --no-index torch-sparse -f https://pytorch-geometric.com/whl/torch-${TORCH}+${CUDA}.html
pip install --no-index torch-cluster -f https://pytorch-geometric.com/whl/torch-${TORCH}+${CUDA}.html
pip install --no-index torch-spline-conv -f https://pytorch-geometric.com/whl/torch-${TORCH}+${CUDA}.html
pip install torch-geometric
where ${CUDA} and ${TORCH} should be replaced by your specific CUDA version (cpu, cu92, cu101, cu102, cu110) and PyTorch version (1.4.0, 1.5.0, 1.6.0, 1.7.0).
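For example, for PyTorch 1.6.0 built against CUDA 10.2 the first command would be (adjust both version tags to match your own installation):
pip install --no-index torch-scatter -f https://pytorch-geometric.com/whl/torch-1.6.0+cu102.html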
Before committing and pushing changes, run the command black . from the main directory of the project (GCNN).
There are two options for running the code (example invocations are shown below):
- Training the model (main.py)
- Reproducing the results using existing models (test_trained_models.py)
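Both are plain Python scripts launched from the GCNN root directory, for example (illustrative invocations only; the exact arguments depend on which of the run options below you choose):
python main.py
python test_trained_models.py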
When training the model (main.py), there are three options:
- Run it with hyperparameter optimization
- Run it normally, passing the data and model parameters from the terminal
- Run it normally, but load the whole parameter configuration from a JSON file located at GCNN/utilities/configuration.json
This is an example configuration.json file:
{
"atom_features": [
0,
1,
2
],
"batch_size": 64,
"hidden_dim": 15,
"num_conv_layers": 16,
"learning_rate": 0.02,
"radius": 10,
"max_num_node_neighbours": 10,
"num_epoch": 200,
"perc_train": 0.7,
"predicted_value_option": 1,
"dataset_option": "FePt",
"model_type": "PNN"
}
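For reference, this file is plain JSON and can be read into a Python dictionary with the standard json module (a minimal sketch; main.py may load and validate the configuration differently):
import json

with open("GCNN/utilities/configuration.json") as f:
    config = json.load(f)

print(config["batch_size"], config["model_type"])  # 64 PNN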
Possible values:
- atom_features (0, 1, 2) where 0 = proton number, 1 = charge density, 2 = magnetic moment
- batch_size (int, >0) usually a power of 2
- hidden_dim (int, >0) size of a hidden layer
- num_conv_layers (int, >0) number of convolutional layers
- radius (int, >5) radius within which neighbours of a node are chosen; represents distance in 3D space
- max_num_node_neighbours (int, >0) maximum number of node neighbours
- num_epoch (int, >0) number of epochs to train the model
- perc_train (float, >0, <1) fraction of the dataset used for training; the rest (1 - perc_train) is split equally between validation and testing
- predicted_value_option (1, 2, 3, 4, 5, 6) where 1 = free energy, 2 = charge density, 3 = magnetic moment, 4 = free energy + charge density, 5 = free energy + magnetic moment, 6 = free energy + charge density + magnetic moment
- dataset_option (str) possible values: FePt, CuAu, FeSi, CuAu_FePt_SHUFFLE, CuAu_TRAIN_FePt_TEST, FePt_TRAIN_CuAu_TEST
- model_type (str) possible values: PNN, MFC, GIN, GAT
- subsample_percentage (float, <1) fraction of the original dataset; when set, the serialized dataset loader takes a subset of the original dataset using stratified sampling
Remark: keep in mind that predicted_value_option is used to compute the output dimension. Apart from free energy, which has a dimension of 1, the other combinations depend on the number of atoms in the dataset (example: free energy + charge density = 1 + number of atoms), and this was configured specifically for structures with 32 atoms. If you want to add datasets whose structures have a different number of atoms, you need to change the part of main.py where the output dimension is computed from a dictionary.
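As a rough picture of that dictionary, assuming one predicted value per atom for the per-atom quantities (an illustrative sketch only, not the actual code in main.py):
num_atoms = 32  # the currently supported structures all have 32 atoms

# Hypothetical lookup of the output dimension keyed by predicted_value_option
output_dim = {
    1: 1,                  # free energy (a single scalar)
    2: num_atoms,          # charge density, assumed one value per atom
    3: num_atoms,          # magnetic moment, assumed one value per atom
    4: 1 + num_atoms,      # free energy + charge density
    5: 1 + num_atoms,      # free energy + magnetic moment
    6: 1 + 2 * num_atoms,  # free energy + charge density + magnetic moment
}

print(output_dim[4])  # 33 for a 32-atom structure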