Hello,
I am trying to run MG (MG5_aMC v3.4.1) on a Slurm cluster. Everything works for a small number of events (nevents = 500), but with a larger number (nevents = 50000) MG crashes. Just before the crash it prints the warning ‘cluster.get_job_identifier runs unexpectedly’. The traceback from the log file is attached below. My cluster settings in mg5_configuration.txt are:
run_mode = 1
cluster_type = slurm
cluster_queue = normal
cluster_size = 200
I use the following process:
generate p p > t t~
launch
set pdlabel lhapdf
set lhaid 27000
set ebeam1 6800
set ebeam2 6800
set nevents 500 (or 50000)
I also ran the exact same process without the cluster settings (with the mg5_configuration.txt lines above commented out), and then everything runs fine. Do you have any idea what causes the crash in cluster mode and how it might be solved?
Thanks and best,
Alex
generate_events 50000
Traceback (most recent call last):
  File "/home/a/a_feik02/MG5_aMC_v3_4_1/madgraph/interface/extended_cmd.py", line 1544, in onecmd
    return self.onecmd_orig(line, opt)
  File "/home/a/a_feik02/MG5_aMC_v3_4_1/madgraph/interface/extended_cmd.py", line 1493, in onecmd_orig
    return func(arg, opt)
  File "/home/a/a_feik02/MG5_aMC_v3_4_1/madgraph/interface/madevent_interface.py", line 2404, in do_generate_events
    self.run_generate_events(switch_mode, args)
  File "/home/a/a_feik02/MG5_aMC_v3_4_1/madgraph/interface/common_run_interface.py", line 7630, in new_fct
    original_fct(obj, args, *opts)
  File "/home/a/a_feik02/MG5_aMC_v3_4_1/madgraph/interface/madevent_interface.py", line 2643, in run_generate_events
    postcmd=False, printcmd=False)
  File "/home/a/a_feik02/MG5_aMC_v3_4_1/madgraph/interface/extended_cmd.py", line 1573, in exec_cmd
    stop = Cmd.onecmd_orig(current_interface, line, opt)
  File "/home/a/a_feik02/MG5_aMC_v3_4_1/madgraph/interface/extended_cmd.py", line 1493, in onecmd_orig
    return func(arg, opt)
  File "/home/a/a_feik02/MG5_aMC_v3_4_1/madgraph/interface/common_run_interface.py", line 1924, in do_systematics
    stdout='/dev/null'
  File "/home/a/a_feik02/MG5_aMC_v3_4_1/madgraph/various/cluster.py", line 212, in cluster_submit
    output_files, required_output, nb_submit)
  File "/home/a/a_feik02/MG5_aMC_v3_4_1/madgraph/various/cluster.py", line 75, in deco_f_store
    id = f(self, **args)
  File "/home/a/a_feik02/MG5_aMC_v3_4_1/madgraph/various/cluster.py", line 150, in submit2
    required_output=required_output, nb_submit=nb_submit)
  File "/home/a/a_feik02/MG5_aMC_v3_4_1/madgraph/various/misc.py", line 436, in deco_f_retry
    raise error.__class__('[Fail %i times] \n %s ' % (i+1, error))
UnboundLocalError: local variable 'error' referenced before assignment
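
A note on the last line of the traceback: the UnboundLocalError looks like a secondary error that hides whatever actually went wrong during job submission. One way this can happen in Python 3 is that a name bound with `except ... as error` is removed when the except block ends, so using it after a retry loop fails. The sketch below illustrates that mechanism only. It is not the MG source: `retry_buggy`, `retry_fixed`, and the failing `submit_*` functions are hypothetical names made up for the illustration, and the "sbatch failed" message is a placeholder, not something taken from my log.

```python
import functools


def retry_buggy(nb_try):
    """Hypothetical retry decorator that loses the original exception."""
    def deco(f):
        @functools.wraps(f)
        def wrapper(*args, **kwargs):
            for i in range(nb_try):
                try:
                    return f(*args, **kwargs)
                except Exception as error:
                    pass  # Python 3 unbinds `error` when this block ends
            # `error` is no longer bound here, so this raises UnboundLocalError
            raise error.__class__('[Fail %i times] \n %s ' % (i + 1, error))
        return wrapper
    return deco


def retry_fixed(nb_try):
    """Same idea, but keeps a reference to the last exception (nb_try >= 1)."""
    def deco(f):
        @functools.wraps(f)
        def wrapper(*args, **kwargs):
            last_error = None
            for _ in range(nb_try):
                try:
                    return f(*args, **kwargs)
                except Exception as error:
                    last_error = error  # survives the end of the except block
            raise type(last_error)('[Fail %i times] \n %s ' % (nb_try, last_error))
        return wrapper
    return deco


@retry_buggy(2)
def submit_buggy():
    # Placeholder for a job submission that always fails
    raise RuntimeError("sbatch failed")


@retry_fixed(2)
def submit_fixed():
    # Same placeholder failure, wrapped by the corrected decorator
    raise RuntimeError("sbatch failed")
```

Calling `submit_buggy()` ends in an UnboundLocalError, while `submit_fixed()` re-raises a RuntimeError that still carries the original message. If something similar happens here, the real submission error would be the one worth finding.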