Problem running on PBS cluster

Asked by Andres Osorio

Hello,

This time, this is a real question ;-) All right, I have managed to configure MG5 (v1.4.8) to submit to a PBS cluster and one of its queues. However, my jobs don't run successfully (using qstat I saw a few jobs running, but they ended quickly).

In return I get the following message:

Generating 10000 events with run name run_01
survey run_01
compile directory
Using random number seed offset = 33
Running Survey
Creating Jobs
Working on SubProcesses
    P0_gg_t1t1x_t1_tn1_t_bwp_wp_qq_t1x_txn1_tx_bxwm_wm_lvl
 Idle: 3 Running: 0 Finish: 0
INFO: All jobs finished

"generate_events" with error:
IOError : [Errno 2] No such file or directory: '/fisica/aosorio/madgraphWD/MadGraph5_v1_4_8_4/GenStops02/SubProcesses/P0_gg_t1t1x_t1_tn1_t_bwp_wp_qq_t1x_txn1_tx_bxwm_wm_lvl/G1/results.dat'
Please report this bug on https://bugs.launchpad.net/madgraph5
More information is found in '/fisica/aosorio/madgraphWD/MadGraph5_v1_4_8_4/GenStops02/run_01_stops02_debug.log'.
Please attach this file to your report.

The log file basically shows the same error:

IOError: [Errno 2] No such file or directory: '/fisica/aosorio/madgraphWD/MadGraph5_v1_4_8_4/GenStops02/SubProcesses/P0_gg_t1t1x_t1_tn1_t_bwp_wp_qq_t1x_txn1_tx_bxwm_wm_lvl/G1/results.dat'

Value of current Options:
              web_browser : None
              text_editor : None
          pythia-pgs_path :
                  td_path :
             delphes_path :
             cluster_type : pbs
         madanalysis_path :
            cluster_queue : cms
       group_subprocesses : Auto
         fortran_compiler : None
                  nb_core : 4
      exrootanalysis_path :
               eps_viewer : None
                  timeout : 20
   automatic_html_opening : False
             cluster_mode : 1
             pythia8_path :
ignore_six_quark_processes : False
                 run_mode : 1

And indeed I don't see those SubProcesses/P0_XXX/Gx/ directories (which do appear when running locally in multicore mode).
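
For anyone hitting the same error, a quick way to confirm this observation is to scan the SubProcesses directory for Gx directories and for the results.dat files the run expects. The following is only a minimal sketch: the function name find_missing_results is made up for illustration, and the P*/G* layout is assumed from the path in the error message above.

```python
from pathlib import Path


def find_missing_results(subproc_dir):
    """Scan a SubProcesses directory (layout assumed from the error above).

    Returns (channel_dirs, missing):
      channel_dirs -- every P*/G* directory that exists
      missing      -- the G* directories that have no results.dat
    """
    root = Path(subproc_dir)
    # Only directories count; job scripts such as ajob1 are plain files.
    channel_dirs = sorted(p for p in root.glob("P*/G*") if p.is_dir())
    missing = [d for d in channel_dirs if not (d / "results.dat").is_file()]
    return channel_dirs, missing


# Hypothetical usage, run from the process directory (e.g. GenStops02):
#   dirs, missing = find_missing_results("SubProcesses")
#   An empty `dirs` list suggests the job scripts never started at all.
```

If no Gx directories are found at all, the problem is more likely in job submission than in the event generation itself.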

Do you have any suggestions on how I can find a solution to this problem?

Many thanks,

Andres

Question information

Language:
English
Status:
Solved
For:
MadGraph5_aMC@NLO
Assignee:
No assignee
Solved by:
Andres Osorio
Johan Alwall (johan-alwall) said :
#1

Hello Andres,

It sounds like your ajobN files are not run at all, if you don't even get any GN directories (the first thing the ajobN file does is a mkdir GN). Perhaps try submitting an ajobN file by hand using qsub -o log.ajob1 -e err.ajob1 -q cms -V ajob1, and check the log and err files?
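
The manual test could also be scripted, which helps when repeating it for several ajobN files. This is only a rough sketch using the same qsub flags as above; the helper names build_qsub_command, submit and read_logs are made up for illustration, and it assumes the script is run from the directory that holds ajob1.

```python
import shlex
import subprocess
from pathlib import Path


def build_qsub_command(job_script, queue, log_dir="."):
    """Build the qsub argument list for submitting one job script by hand."""
    name = Path(job_script).name
    log_dir = Path(log_dir)
    return [
        "qsub",
        "-o", str(log_dir / f"log.{name}"),  # file for the job's stdout
        "-e", str(log_dir / f"err.{name}"),  # file for the job's stderr
        "-q", queue,                         # queue to submit to
        "-V",                                # export the current environment
        str(job_script),
    ]


def submit(job_script, queue, cwd="."):
    """Run qsub from cwd and return whatever it prints (normally the job id)."""
    cmd = build_qsub_command(job_script, queue)
    print("running:", shlex.join(cmd))
    done = subprocess.run(cmd, cwd=cwd, capture_output=True, text=True, check=True)
    return done.stdout.strip()


def read_logs(job_script, log_dir="."):
    """Return the contents of the log/err files, or None if not yet written."""
    name = Path(job_script).name
    logs = {}
    for prefix in ("log", "err"):
        path = Path(log_dir) / f"{prefix}.{name}"
        logs[prefix] = path.read_text() if path.is_file() else None
    return logs


# Hypothetical usage, from the directory containing ajob1:
#   job_id = submit("ajob1", "cms")
#   ... wait for the job to leave the queue (check with qstat) ...
#   print(read_logs("ajob1"))
```

If qsub itself fails, check=True raises subprocess.CalledProcessError, whose stderr attribute holds the scheduler's own error message.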

Cheers,
Johan

Andres Osorio (osorio-af) said :
#2

Hi Johan,

There was a problem with the PBS server configuration. Thank you for your suggestion; it was key to debugging this issue.
Many thanks,

Andres