Trying to run Madgraph on condor on CMS Connect

Asked by Brandon Soubasis on 2018-08-10

I am trying to run madgraph on condor using CMS connect. I wanted to first try to generate something simple to see how running on condor will workout so i do the following:

./bin/mg5_aMC
tutorial
generate p p > t t~
add process p p > W+ j, W+ > l+ vl @2
output MY_FIRST_MG5_RUN
I DO NOT launch and instead type the command exit and go back to the main MadGraph directory. I then cd into MY_FIRST_MG5_RUN.
cd into this directory and then cd into the Cards directory. open the me5_configuration.txt file in the Cards directory

Uncomment run_mode
run_mode=1
Uncomment cluster_type and cluster_queue
cluster_type = condor
cluster_queue = None

Doing this will make MadGraph submit the job to Condor,

./bin/generate_events

 I am getting the following error messages:
INFO: Update the dependent parameter of the param_card.dat
Generating 10000 events with run name run_01
survey run_01
INFO: compile directory
Not able to open file /home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/crossx.html since no program configured.Please set one in ./input/mg5_configuration.txt
compile Source Directory
Using random number seed offset = 21
INFO: Running Survey
Creating Jobs
Working on SubProcesses
INFO: P1_gg_ttx
INFO: P1_qq_ttx
INFO: P2_gq_wpq_wp_lvl
INFO: P2_qq_wpg_wp_lvl
INFO: Idle: 4, Running: 0, Completed: 0 [ 0.14s ]
INFO: Idle: 4, Running: 0, Completed: 0 [ 0.27s ]
INFO: Idle: 0, Running: 4, Completed: 0 [ 30.4s ]
INFO: All jobs finished
INFO: Idle: 0, Running: 0, Completed: 4 [ 1m 0s ]
INFO: End survey
refine 10000
Creating Jobs
INFO: Refine results to 10000
/home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/bin/internal/gen_ximprove.py:806: DeprecationWarning: object.__new__() takes no parameters
  return super(gen_ximprove, cls).__new__(gen_ximprove_v4, cmd, opt)
/home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/bin/internal/gen_ximprove.py:812: DeprecationWarning: object.__init__() takes no parameters
  super(gen_ximprove, self).__init__(cmd, opt)
INFO: Generating 10000.0 unweigthed events.
INFO: Effective Luminosity 2.33720491327 pb^-1
INFO: need to improve 5 channels
Current estimate of cross-section: 5134.338 +- 75.8652574554
    P1_gg_ttx
    P1_qq_ttx
    P2_gq_wpq_wp_lvl
    P2_qq_wpg_wp_lvl
INFO: Idle: 14, Running: 0, Completed: 0 [ 0.12s ]
INFO: Idle: 14, Running: 0, Completed: 0 [ 0.24s ]
INFO: Idle: 14, Running: 0, Completed: 0 [ 30.9s ]
INFO: Idle: 14, Running: 0, Completed: 0 [ 1m 1s ]
INFO: Idle: 14, Running: 0, Completed: 0 [ 1m 31s ]
INFO: Idle: 0, Running: 1, Completed: 13 [ 2m 1s ]
INFO: All jobs finished
INFO: Idle: 0, Running: 0, Completed: 14 [ 2m 31s ]
INFO: Combining runs
Error when reading /home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/SubProcesses/P2_gq_wpq_wp_lvl/G1a0/results.dat
Command "generate_events " interrupted with error:
ValueError : empty string for float()
Please report this bug on https://bugs.launchpad.net/mg5amcnlo
More information is found in '/home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/run_01_tag_1_debug.log'.
Please attach this file to your report.
quit
INFO:

I looked into '/home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/SubProcesses/P2_gq_wpq_wp_lvl/G1a0/results.dat'
it read only one line: "end-code not correct 127"

after looking it uo on http://tldp.org/LDP/abs/html/exitcodes.html
"127 "command not found" illegal_command Possible problem with $PATH or a typo"

Please suggest how to get rid of this type of error ?

Regards,
Brandon Soubasis

Question information

Language:
English Edit question
Status:
Answered
For:
MadGraph5_aMC@NLO Edit question
Assignee:
No assignee Edit question
Last query:
2018-08-10
Last reply:
2018-08-10

Hi,

Well this sounds a node issue rather than a MG5aMC issue.
Out of your 18 job, 17 seems to went trough (at least the first four did for sure)
So it sounds unlikely that a wrong call to the executable occurs.

Especially since the code is secure against missing executable due to slow filesystem (or similar issue).

But can you check
> /home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/SubProcesses/P2_gq_wpq_wp_lvl/G1b0/results.d

and also check the content of the log in the following directory:
> /home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/SubProcesses/P2_gq_wpq_wp_lvl/G1a0/

Cheers,

Olivier

> On 10 Aug 2018, at 17:47, Brandon Soubasis <email address hidden> wrote:
>
> New question #671492 on MadGraph5_aMC@NLO:
> https://answers.launchpad.net/mg5amcnlo/+question/671492
>
> I am trying to run madgraph on condor using CMS connect. I wanted to first try to generate something simple to see how running on condor will workout so i do the following:
>
> ./bin/mg5_aMC
> tutorial
> generate p p > t t~
> add process p p > W+ j, W+ > l+ vl @2
> output MY_FIRST_MG5_RUN
> I DO NOT launch and instead type the command exit and go back to the main MadGraph directory. I then cd into MY_FIRST_MG5_RUN.
> cd into this directory and then cd into the Cards directory. open the me5_configuration.txt file in the Cards directory
>
> Uncomment run_mode
> run_mode=1
> Uncomment cluster_type and cluster_queue
> cluster_type = condor
> cluster_queue = None
>
> Doing this will make MadGraph submit the job to Condor,
>
> ./bin/generate_events
>
> I am getting the following error messages:
> INFO: Update the dependent parameter of the param_card.dat
> Generating 10000 events with run name run_01
> survey run_01
> INFO: compile directory
> Not able to open file /home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/crossx.html since no program configured.Please set one in ./input/mg5_configuration.txt
> compile Source Directory
> Using random number seed offset = 21
> INFO: Running Survey
> Creating Jobs
> Working on SubProcesses
> INFO: P1_gg_ttx
> INFO: P1_qq_ttx
> INFO: P2_gq_wpq_wp_lvl
> INFO: P2_qq_wpg_wp_lvl
> INFO: Idle: 4, Running: 0, Completed: 0 [ 0.14s ]
> INFO: Idle: 4, Running: 0, Completed: 0 [ 0.27s ]
> INFO: Idle: 0, Running: 4, Completed: 0 [ 30.4s ]
> INFO: All jobs finished
> INFO: Idle: 0, Running: 0, Completed: 4 [ 1m 0s ]
> INFO: End survey
> refine 10000
> Creating Jobs
> INFO: Refine results to 10000
> /home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/bin/internal/gen_ximprove.py:806: DeprecationWarning: object.__new__() takes no parameters
> return super(gen_ximprove, cls).__new__(gen_ximprove_v4, cmd, opt)
> /home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/bin/internal/gen_ximprove.py:812: DeprecationWarning: object.__init__() takes no parameters
> super(gen_ximprove, self).__init__(cmd, opt)
> INFO: Generating 10000.0 unweigthed events.
> INFO: Effective Luminosity 2.33720491327 pb^-1
> INFO: need to improve 5 channels
> Current estimate of cross-section: 5134.338 +- 75.8652574554
> P1_gg_ttx
> P1_qq_ttx
> P2_gq_wpq_wp_lvl
> P2_qq_wpg_wp_lvl
> INFO: Idle: 14, Running: 0, Completed: 0 [ 0.12s ]
> INFO: Idle: 14, Running: 0, Completed: 0 [ 0.24s ]
> INFO: Idle: 14, Running: 0, Completed: 0 [ 30.9s ]
> INFO: Idle: 14, Running: 0, Completed: 0 [ 1m 1s ]
> INFO: Idle: 14, Running: 0, Completed: 0 [ 1m 31s ]
> INFO: Idle: 0, Running: 1, Completed: 13 [ 2m 1s ]
> INFO: All jobs finished
> INFO: Idle: 0, Running: 0, Completed: 14 [ 2m 31s ]
> INFO: Combining runs
> Error when reading /home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/SubProcesses/P2_gq_wpq_wp_lvl/G1a0/results.dat
> Command "generate_events " interrupted with error:
> ValueError : empty string for float()
> Please report this bug on https://bugs.launchpad.net/mg5amcnlo
> More information is found in '/home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/run_01_tag_1_debug.log'.
> Please attach this file to your report.
> quit
> INFO:
>
>
> I looked into '/home/soubasis/MG5_aMC_v2_6_3_2/MG5_TEST_RUN/SubProcesses/P2_gq_wpq_wp_lvl/G1a0/results.dat'
> it read only one line: "end-code not correct 127"
>
> after looking it uo on http://tldp.org/LDP/abs/html/exitcodes.html
> "127 "command not found" illegal_command Possible problem with $PATH or a typo"
>
> Please suggest how to get rid of this type of error ?
>
> Regards,
> Brandon Soubasis
>
>
>
> --
> You received this question notification because you are an answer
> contact for MadGraph5_aMC@NLO.

Can you help with this problem?

Provide an answer of your own, or ask Brandon Soubasis for more information if necessary.

To post a message you must log in.