-
Notifications
You must be signed in to change notification settings - Fork 174
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] _dpgen autotest run error_<work_path> #1139
Comments
"model": "ZrO2-compressed.pb", This means the model is at "./ZrO2-compressed.pb", in the same directory as folder confs. Is error still coming out by putting the files in the above location? |
Here is the json file for dft calculation: { The structure of the confs directory is: Even when POTCAR and INCAR is in the ./confs/ZrO2-m/relaxation, the error still happens unless POTCAR and INCAR is in the same directory as folder confs. |
I went through the relevant code in common_equi.py and didn't find anything wrong yet. Could you please try setting "api_version": "0.9" and a different version of dpdispatcher may be used. |
Maybe it is related to the submission system part. It happens to both md and vasp calculation. Also I do not encounter the similar results using Bohrium. |
It looks that |
Summary
When I ran "dpgen autotest run relax.json machine.json", the error below occurs, unless INCAR or POTCAR is in the same directory as folder confs. Similar error also occurs when dp potential or in.lammps is not in the same directory as folder confs. This error seems to be related to the job submission system (slurm) and the working directory. The json files are attached.
_/home/jzhang/ZrO2/dpgen/autotest/dft --> Runing...
2023-02-12 20:01:01,643 - INFO : info:check_all_finished: False
2023-02-12 20:01:01,643 - INFO : remote path: /home/jzhang/ZrO2/dpgen/autotest/dft/2ec5f7ec0bed6a5e5297c6d0a924c779337ba5a2
Traceback (most recent call last):
File "/home/jzhang/anaconda3/envs/deepmd/bin/dpgen", line 8, in
sys.exit(main())
File "/home/jzhang/anaconda3/envs/deepmd/lib/python3.10/site-packages/dpgen/main.py", line 185, in main
args.func(args)
File "/home/jzhang/anaconda3/envs/deepmd/lib/python3.10/site-packages/dpgen/auto_test/run.py", line 57, in gen_test
run_task(args.TASK, args.PARAM, args.MACHINE)
File "/home/jzhang/anaconda3/envs/deepmd/lib/python3.10/site-packages/dpgen/auto_test/run.py", line 34, in run_task
run_equi(confs, inter_parameter, mdata)
File "/home/jzhang/anaconda3/envs/deepmd/lib/python3.10/site-packages/dpgen/auto_test/common_equi.py", line 209, in run_equi
submission.run_submission()
File "/home/jzhang/anaconda3/envs/deepmd/lib/python3.10/site-packages/dpdispatcher/submission.py", line 185, in run_submission
self.upload_jobs()
File "/home/jzhang/anaconda3/envs/deepmd/lib/python3.10/site-packages/dpdispatcher/submission.py", line 348, in upload_jobs
self.machine.context.upload(self)
File "/home/jzhang/anaconda3/envs/deepmd/lib/python3.10/site-packages/dpdispatcher/ssh_context.py", line 505, in upload
self._walk_directory(submission.forward_common_files, self.local_root, file_list, directory_list)
File "/home/jzhang/anaconda3/envs/deepmd/lib/python3.10/site-packages/dpdispatcher/ssh_context.py", line 474, in walk_directory
raise RuntimeError(f'cannot find upload file {work_path} {jj}')
RuntimeError: cannot find upload file /home/jzhang/ZrO2/dpgen/autotest/dft INCAR
Part of the directory structure is:
├── confs
│ ├── ZrO2-m
│ │ └── relaxation
│ │ ├── INCAR
│ │ ├── POTCAR
│ │ └── relax_task
│ │ ├── CONTCAR
│ │ ├── INCAR -> ../INCAR
│ │ ├── inter.json
│ │ ├── KPOINTS
│ │ ├── OSZICAR
│ │ ├── OUTCAR
│ │ ├── outlog
│ │ ├── POSCAR -> ../../POSCAR
│ │ ├── POTCAR -> ../POTCAR
│ │ ├── result.json
│ │ ├── task.json
│ │ └── XDATCAR
├── infiles
│ ├── INCAR_md
│ ├── INCAR_rlx
│ ├── INCAR_static
│ ├── POTCAR_Ce
│ ├── POTCAR_O
│ ├── POTCAR_Zr
│ ├── ZrO2_a_96.POSCAR
├── machine_raptor.json
├── nohup.out
├── property.json
├── relax_dft.json
└── submit.sh
DPGEN Version and Platform
DPGEN 0.11.0
LAMMPS 23 Jun 2022 - Update 1, Deepmd-kit v2.1.5
Job submission and computing cluster configuration
slurm-wlm 20.11.4
Linux version 5.10.0-11-amd64 ([email protected]) (gcc-10 (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU Binutils for Debian) 2.35.2) #1 SMP Debian 5.10.92-1 (2022-01-18)
Expected Behavior
Actual Behavior
Steps to Reproduce
Further Information, Files, and Links
setups.zip
The text was updated successfully, but these errors were encountered: