Runing MG5 in cluster with error

Asked by Li on 2020-01-30

Hi,
    I am running MG5 in a cluster with setting like:

 cluster_type = slurm
 cluster_queue = sixhour

    The problem I met is, for exactly same process, sometimes it runs successfully sometimes it has error, like:

INFO: All jobs finished
INFO: Idle: 0, Running: 0, Completed: 1210 [ 1h 2m ]
INFO: Combining runs
Error when reading /panfs/pfs.local/scratch/physastro/l040h428/mg5_data/14TeV/2leps/pp2tt/500_600/SubProcesses/P4_gg_ttxg_t_wpb_wp_lvl_tx_wmbx_wm_lvl/G6.013d0/results.dat
Command "import command scr_run.ms" interrupted in sub-command:
"launch" with error:
IOError : [Errno 2] No such file or directory: '/panfs/pfs.local/scratch/physastro/l040h428/mg5_data/14TeV/2leps/pp2tt/500_600/SubProcesses/P4_gg_ttxg_t_wpb_wp_lvl_tx_wmbx_wm_lvl/G6.013d0/results.dat'
Please report this bug on https://bugs.launchpad.net/mg5amcnlo
More information is found in '/panfs/pfs.local/scratch/physastro/l040h428/mg5_data/14TeV/2leps/pp2tt/500_600/run_01_tag_1_debug.log'.
Please attach this file to your report.
INFO:

    After the madevent is almost done, error appears.

    Since the error occurs very randomly, I am totally confused and don't know how to solve it. May I ask if you know anything about these?

    Thank you!

Question information

Language:
English Edit question
Status:
Answered
For:
MadGraph5_aMC@NLO Edit question
Assignee:
No assignee Edit question
Last query:
2020-01-30
Last reply:
2020-01-30

Hi,

Please open a bug report and attach the debug file (you can not attach file for question).
If you are not using the latest version of MG5aMC, please also attach to your report the content of the following directory:;
/panfs/pfs.local/scratch/physastro/l040h428/mg5_data/14TeV/2leps/pp2tt/500_600/SubProcesses/P4_gg_ttxg_t_wpb_wp_lvl_tx_wmbx_wm_lvl/G6.013d0/

Thanks,

Olivier

Li (huangli-itp) said : #2

Hi Olivier,
    Good! Thank you!

Can you help with this problem?

Provide an answer of your own, or ask Li for more information if necessary.

To post a message you must log in.