RuntimeError: status command squeue fails to execute.job_id:24743764 error message:slurm_load_jobs error: Unexpected message received #1763
Unanswered
SMZeng1997
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
When I run the dpgen task, I often encounter the following error:
**Traceback (most recent call last):
File "/home/dnlu/dat01/anaconda3/envs/deepmd-kit/bin/dpgen", line 8, in
sys.exit(main())
^^^^^^
File "/home/dnlu/dat01/anaconda3/envs/deepmd-kit/lib/python3.11/site-packages/dpgen/main.py", line 255, in main
args.func(args)
File "/home/dnlu/dat01/anaconda3/envs/deepmd-kit/lib/python3.11/site-packages/dpgen/generator/run.py", line 5394, in gen_run
run_iter(args.PARAM, args.MACHINE)
File "/home/dnlu/dat01/anaconda3/envs/deepmd-kit/lib/python3.11/site-packages/dpgen/generator/run.py", line 4746, in run_iter
run_fp(ii, jdata, mdata)
File "/home/dnlu/dat01/anaconda3/envs/deepmd-kit/lib/python3.11/site-packages/dpgen/generator/run.py", line 4036, in run_fp
run_fp_inner(
File "/home/dnlu/dat01/anaconda3/envs/deepmd-kit/lib/python3.11/site-packages/dpgen/generator/run.py", line 3945, in run_fp_inner
submission.run_submission()
File "/home/dnlu/dat01/anaconda3/envs/deepmd-kit/lib/python3.11/site-packages/dpdispatcher/submission.py", line 260, in run_submission
self.update_submission_state()
File "/home/dnlu/dat01/anaconda3/envs/deepmd-kit/lib/python3.11/site-packages/dpdispatcher/submission.py", line 345, in update_submission_state
job.get_job_state()
File "/home/dnlu/dat01/anaconda3/envs/deepmd-kit/lib/python3.11/site-packages/dpdispatcher/submission.py", line 831, in get_job_state
job_state = self.machine.check_status(self)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/dnlu/dat01/anaconda3/envs/deepmd-kit/lib/python3.11/site-packages/dpdispatcher/utils/utils.py", line 183, in wrapper
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "/home/dnlu/dat01/anaconda3/envs/deepmd-kit/lib/python3.11/site-packages/dpdispatcher/machines/slurm.py", line 153, in check_status
raise RuntimeError(
RuntimeError: status command squeue fails to execute.job_id:24743764
error message:slurm_load_jobs error: Unexpected message received
return code 1**
This error always occurs when performing fp calculations. I am performing fp calculations on another supercomputer account, connected remotely via SSH protocol.
Want to know how to solve this problem? Thank you very much.
Beta Was this translation helpful? Give feedback.
All reactions