Madevent problem at CERN lsf batch

Asked by Andre Sznajder on 2017-12-06

I am trying to run MadEvent at CERN lxplus using the LSF batch system.
Everything seems to run fine until it reaches the following point:

Error when reading /afs/cern.ch/work/s/sznajder/MG5_aMC_v2_6_0/VBS_WW_SS_QED4_QCD0/SubProcesses/P1_qq_wpwpqq/G2a0/results.dat
Command "generate_events " interrupted with error:
ValueError : empty string for float()

If I look into the contents of this file results.dat I get the following:

[sznajder@lxplus009 VBS_WW_SS_QED4_QCD0]$ more /afs/cern.ch/work/s/sznajder/MG5_aMC_v2_6_0/VBS_WW_SS_QED4_QCD0/SubProcesses/P1_qq_wpwpqq/G2a0/results.dat
end-code not correct 152

Any ideas on what is going on?
The jobs seem to complete properly: I receive the LSF job-completion emails and see no error messages.

Question information

Language: English
Status: Answered
For: MadGraph5_aMC@NLO
Assignee: No assignee
Last query: 2017-12-06
Last reply: 2017-12-06

Hi,

I have checked the code and cannot find any place where we would exit with return code 152.
So my guess is that the job was killed by the system.

If I understand this page correctly: http://tldp.org/LDP/abs/html/exitcodes.html
an exit code above 128 means that the process was killed by a signal, with signal number = exit code - 128.
Here that gives 152 - 128 = 24, which on Linux is normally SIGXCPU (CPU time limit exceeded).
For comparison, signal 22 is "SIGTTOU", which suspends a background process that writes to the terminal; see http://curiousthing.org/sigttin-sigttou-deep-dive-linux

According to that page, a process is not suspended if SIGTTOU is ignored, so I do not think SIGTTOU is the real problem here. A CPU time limit on the batch queue would fit the return code better, but this is only a guess.
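
The exit-code convention from the page linked above can be checked with a few lines of Python. This is only a sketch, not MadGraph code: `signal_from_exit_code` is a hypothetical helper, and it assumes that the number written to results.dat follows the shell convention of 128 + signal number. Signal numbers are platform dependent, so it should be run on the same kind of machine as the batch node; on a typical Linux system, 152 decodes to signal 24.

```python
import signal


def signal_from_exit_code(code):
    """Return the name of the signal implied by a shell exit code.

    Assumes the common shell convention: exit code = 128 + signal number.
    Returns None when the code does not look like a signal death.
    """
    if code <= 128:
        return None
    signum = code - 128
    try:
        # Signal names are looked up for the platform running this script.
        return signal.Signals(signum).name
    except ValueError:
        return f"unknown signal {signum}"


# Example: decode the value reported in results.dat.
print(signal_from_exit_code(152))
```

If the name printed is a resource-limit signal, the limits of the batch queue used for the jobs are the first thing to look at.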

Cheers,

Olivier

PS: My only suggestion would be to rerun that specific job:
cd /afs/cern.ch/work/s/sznajder/MG5_aMC_v2_6_0/VBS_WW_SS_QED4_QCD0/SubProcesses/P1_qq_wpwpqq/G2a0/
../madevent < input_app.txt # maybe input_sg.txt

If it goes through, then it is certainly a cluster-related problem.
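
To see whether other channels failed in the same way before rerunning them by hand, the results.dat files can be scanned for the error marker. The sketch below is an assumption-laden helper, not part of MadGraph: `find_failed_channels` is a hypothetical name, the marker string is taken from the results.dat content shown in the question, and the directory layout is assumed to match the path in the error message.

```python
from pathlib import Path


def find_failed_channels(subprocesses_dir, marker="end-code not correct"):
    """List channel directories whose results.dat contains the error marker.

    Returns a list of (channel_directory, file_content) tuples.
    """
    failed = []
    for results in sorted(Path(subprocesses_dir).rglob("results.dat")):
        text = results.read_text(errors="replace")
        if marker in text:
            failed.append((results.parent, text.strip()))
    return failed


# Usage (the path is an example; point it at your own SubProcesses directory):
# for channel, message in find_failed_channels("SubProcesses"):
#     print(channel, "->", message)
```

Each directory reported this way can then be rerun interactively with the commands above.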

> On Dec 6, 2017, at 15:27, Andre Sznajder <email address hidden> wrote:
>
> New question #661447 on MadGraph5_aMC@NLO:
> https://answers.launchpad.net/mg5amcnlo/+question/661447
