Hi,
I am running MG5_aMC_v2_1_1 and trying to generate 1,000,000 events for a simple process on our local PBS cluster, and I'm running into a very strange error. I submit 20 separate jobs, each an instance of MG running this same process. The jobs are identical: I simply submit the same 50,000-event generation run 20 times to end up with the desired 1,000,000 events. Some of these jobs run (at least, they are still running so far), but others stop almost immediately with an error. I've included the 1) output.log, 2) error.log, and 3) debug.log files below. Do you have any idea what could be causing such errors in only some of the runs?
Peter
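For reference, the job layout described above can be sketched roughly as follows. This is only an illustration, not my actual submission script: the `Job` class and `build_jobs` helper are made-up names, and I am assuming the jobs are numbered 1 to 20 with the seed and run name following the job index (the logs below only show this for job6, with seed 6 and run name run6).

```python
from dataclasses import dataclass

EVENTS_PER_JOB = 50_000   # events requested in each MG run
N_JOBS = 20               # number of identical jobs submitted


@dataclass
class Job:
    index: int      # job number (assumed to run from 1 to N_JOBS)
    seed: int       # random seed (assumed equal to the job number)
    run_name: str   # run name passed to generate_events
    command: str    # MadEvent command executed inside the job


def build_jobs(n_jobs=N_JOBS):
    """Build one Job description per submission (hypothetical helper)."""
    jobs = []
    for i in range(1, n_jobs + 1):
        run_name = f"run{i}"
        command = f"generate_events -f {run_name} --nb_core=1"
        jobs.append(Job(index=i, seed=i, run_name=run_name, command=command))
    return jobs


jobs = build_jobs()
total_events = len(jobs) * EVENTS_PER_JOB
print(total_events)  # 20 * 50,000 = 1000000
```

Apart from the seed and run name, every job is meant to be identical, which is why it is surprising that only some of them fail.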
** output.log file **
seed: 6
HOSTNAME=titan34
PBS_VERSION=TORQUE-2.3.7
SHELL=/bin/bash
HISTSIZE=1000
PBS_JOBNAME=job6
PBS_ENVIRONMENT=PBS_BATCH
QTDIR=/usr/lib64/qt-3.3
QTINC=/usr/lib64/qt-3.3/include
PBS_O_WORKDIR=/scratch/pwinslow/MGjobs/jjWWBG/job6
USER=pwinslow
PBS_TASKNUM=1
PBS_O_HOME=/home/pwinslow
PBS_MOMPORT=15003
VIRTUAL_ENV=/scratch/pwinslow/CosmoTransitions/User
PBS_O_QUEUE=feed
PATH=/scratch/pwinslow/CosmoTransitions/User/bin:/home/pwinslow/.local/bin:/home/pwinslow/ActivePython-2.7/bin:/home/pwinslow/bin:/usr/lib64/qt-3.3/bin:/bin:/usr/bin:/usr/local/bin:/usr/local/sbin:/usr/sbin:/sbin:/home/pwinslow/bin
PBS_O_LOGNAME=pwinslow
MAIL=/var/spool/mail/pwinslow
PBS_O_LANG=en_US.UTF-8
PBS_JOBCOOKIE=DA569D966BD5993EEFB5F6ACFE571D52
PWD=/home/pwinslow
LANG=en_US.UTF-8
PBS_NODENUM=0
MODULEPATH=/usr/share/Modules/modulefiles:/etc/modulefiles
LOADEDMODULES=
PBS_O_SHELL=/bin/bash
PBS_SERVER=titan.physics.umass.edu
PBS_JOBID=920893.titan.physics.umass.edu
ENVIRONMENT=BATCH
HISTCONTROL=ignoredups
SSH_ASKPASS=/usr/libexec/openssh/gnome-ssh-askpass
HOME=/home/pwinslow
SHLVL=2
PBS_O_HOST=titan.physics.umass.edu
PBS_VNODENUM=0
LOGNAME=pwinslow
CVS_RSH=ssh
QTLIB=/usr/lib64/qt-3.3/lib
PBS_QUEUE=express
MODULESHOME=/usr/share/Modules
PBS_O_MAIL=/var/spool/mail/pwinslow
LESSOPEN=|/usr/bin/lesspipe.sh %s
PBS_NODEFILE=/var/spool/torque/aux//920893.titan.physics.umass.edu
G_BROKEN_FILENAMES=1
PBS_O_PATH=/scratch/pwinslow/CosmoTransitions/User/bin:/home/pwinslow/.local/bin:/home/pwinslow/ActivePython-2.7/bin:/home/pwinslow/bin:/usr/kerberos/bin:/usr/local/bin:/bin:/usr/bin:/home/pwinslow/bin
module=() { eval `/usr/bin/modulecmd bash $*`
}
_=/usr/bin/printenv
************************************************************
* *
* W E L C O M E to *
* M A D G R A P H 5 _ a M C @ N L O *
* M A D E V E N T *
* *
* * * *
* * * * * *
* * * * * 5 * * * * *
* * * * * *
* * * *
* *
* VERSION 5.2.1.2 *
* *
* The MadGraph5_aMC@NLO Development Team - Find us at *
* https://server06.fynu.ucl.ac.be/projects/madgraph *
* *
* Type 'help' for in-line help. *
* *
************************************************************
INFO: load configuration from /nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/Cards/me5_configuration.txt
INFO: load configuration from /nfs/scratch/pwinslow/MG5_aMC_v2_1_1/input/mg5_configuration.txt
INFO: load configuration from /nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/Cards/me5_configuration.txt
Using default text editor "vi". Set another one in ./input/mg5_configuration.txt
Using default eps viewer "evince". Set another one in ./input/mg5_configuration.txt
Using default web browser "firefox". Set another one in ./input/mg5_configuration.txt
No valid Delphes path found
generate_events -f run6 --nb_core=1
Generating 50000 events with run name run6
survey run6
INFO: compile directory
Since icckw>0, We change the value of 'drjl' to 0
quit
INFO:
INFO:
launch in debug mode
** error.log file **
Error detected in "generate_events -f run6 --nb_core=1"
write debug file /nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/run6_tag_1_debug.log
If you need help with this issue please contact us on https://answers.launchpad.net/madgraph5
MadGraph5Error : A compilation Error occurs when trying to compile /nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/Source.
The compilation fails with the following output message:
gfortran -O -w -fbounds-check -ffixed-line-length-132 -c -o gen_ximprove.o gen_ximprove.f
gfortran -O -w -fbounds-check -ffixed-line-length-132 -o ../bin/internal/gen_ximprove gen_ximprove.o ranmar.o rw_routines.o open_file.o
ranmar.o: In function `rmarin_':
ranmar.f:(.text+0xb1): undefined reference to `_gfortran_copy_string'
ranmar.f:(.text+0x136): undefined reference to `_gfortran_copy_string'
ranmar.o: In function `get_base_':
ranmar.f:(.text+0x9bf): undefined reference to `_gfortran_copy_string'
ranmar.f:(.text+0xa8b): undefined reference to `_gfortran_copy_string'
ranmar.f:(.text+0xac9): undefined reference to `_gfortran_copy_string'
ranmar.o:ranmar.f:(.text+0xb8d): more undefined references to `_gfortran_copy_string' follow
rw_routines.o: In function `load_para_':
rw_routines.f:(.text+0xe22): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0xe6f): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0xe77): undefined reference to `_gfortran_internal_free'
rw_routines.f:(.text+0xe9a): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0xeca): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1050): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1078): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x10b2): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x10cc): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0x1114): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x111e): undefined reference to `_gfortran_internal_free'
rw_routines.f:(.text+0x1146): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1180): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1423): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x144b): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1485): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x149c): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0x14e6): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x14f0): undefined reference to `_gfortran_internal_free'
rw_routines.f:(.text+0x1518): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1552): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x16de): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1706): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1740): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x175b): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0x17a2): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x17ab): undefined reference to `_gfortran_internal_free'
rw_routines.f:(.text+0x17d3): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x180d): undefined reference to `_gfortran_copy_string'
rw_routines.o: In function `load_gridpack_para_':
rw_routines.f:(.text+0x1899): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x18b0): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1b18): undefined reference to `_gfortran_copy_string'
rw_routines.o:rw_routines.f:(.text+0x1b3c): more undefined references to `_gfortran_copy_string' follow
rw_routines.o: In function `load_gridpack_para_':
rw_routines.f:(.text+0x1b8f): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0x1bd7): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1be1): undefined reference to `_gfortran_internal_free'
rw_routines.f:(.text+0x1c05): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1c3b): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1dbf): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1de3): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1e19): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1e36): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0x1e7d): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1e87): undefined reference to `_gfortran_internal_free'
rw_routines.f:(.text+0x1eab): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1ee1): undefined reference to `_gfortran_copy_string'
open_file.o: In function `open_file_':
open_file.f:(.text+0x4b): undefined reference to `_gfortran_copy_string'
open_file.f:(.text+0x193): undefined reference to `_gfortran_internal_malloc'
open_file.f:(.text+0x1cc): undefined reference to `_gfortran_copy_string'
open_file.f:(.text+0x1d4): undefined reference to `_gfortran_internal_free'
open_file.f:(.text+0x282): undefined reference to `_gfortran_copy_string'
collect2: ld returned 1 exit status
make: *** [../bin/internal/gen_ximprove] Error 1
Please try to fix this compilations issue and retry.
Help might be found at https://answers.launchpad.net/madgraph5.
If you think that this is a bug, you can report this at https://bugs.launchpad.net/madgraph5
** debug.log file **
#************************************************************
#* MadGraph5_aMC@NLO/MadEvent *
#* *
#* * * *
#* * * * * *
#* * * * * 5 * * * * *
#* * * * * *
#* * * *
#* *
#* *
#* VERSION 5.2.1.2 *
#* *
#* The MadGraph5_aMC@NLO Development Team - Find us at *
#* https://server06.fynu.ucl.ac.be/projects/madgraph *
#* *
#************************************************************
#* *
#* Command File for MadEvent *
#* *
#* run as ./bin/madevent.py filename *
#* *
#************************************************************
generate_events -f run6 --nb_core=1
Traceback (most recent call last):
File "/nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/bin/internal/extended_cmd.py", line 879, in onecmd
return self.onecmd_orig(line, **opt)
File "/nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/bin/internal/extended_cmd.py", line 872, in onecmd_orig
return func(arg, **opt)
File "/nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/bin/internal/madevent_interface.py", line 2107, in do_generate_events
postcmd=False)
File "/nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/bin/internal/extended_cmd.py", line 919, in exec_cmd
stop = Cmd.onecmd_orig(current_interface, line, **opt)
File "/nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/bin/internal/extended_cmd.py", line 872, in onecmd_orig
return func(arg, **opt)
File "/nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/bin/internal/madevent_interface.py", line 2476, in do_survey
self.configure_directory()
File "/nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/bin/internal/madevent_interface.py", line 3570, in configure_directory
self.compile(arg=[name], cwd=os.path.join(self.me_dir, 'Source'))
File "/nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/bin/internal/extended_cmd.py", line 987, in compile
return misc.compile(nb_core=self.options['nb_core'], *args, **opts)
File "/nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/bin/internal/misc.py", line 293, in compile
raise MadGraph5Error, error_text
MadGraph5Error: A compilation Error occurs when trying to compile /nfs/scratch/pwinslow/MGjobs/jjWWBG/job6/MG5_aMC_v2_1_1/jjWWClusterBG/Source.
The compilation fails with the following output message:
gfortran -O -w -fbounds-check -ffixed-line-length-132 -c -o gen_ximprove.o gen_ximprove.f
gfortran -O -w -fbounds-check -ffixed-line-length-132 -o ../bin/internal/gen_ximprove gen_ximprove.o ranmar.o rw_routines.o open_file.o
ranmar.o: In function `rmarin_':
ranmar.f:(.text+0xb1): undefined reference to `_gfortran_copy_string'
ranmar.f:(.text+0x136): undefined reference to `_gfortran_copy_string'
ranmar.o: In function `get_base_':
ranmar.f:(.text+0x9bf): undefined reference to `_gfortran_copy_string'
ranmar.f:(.text+0xa8b): undefined reference to `_gfortran_copy_string'
ranmar.f:(.text+0xac9): undefined reference to `_gfortran_copy_string'
ranmar.o:ranmar.f:(.text+0xb8d): more undefined references to `_gfortran_copy_string' follow
rw_routines.o: In function `load_para_':
rw_routines.f:(.text+0xe22): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0xe6f): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0xe77): undefined reference to `_gfortran_internal_free'
rw_routines.f:(.text+0xe9a): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0xeca): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1050): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1078): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x10b2): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x10cc): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0x1114): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x111e): undefined reference to `_gfortran_internal_free'
rw_routines.f:(.text+0x1146): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1180): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1423): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x144b): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1485): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x149c): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0x14e6): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x14f0): undefined reference to `_gfortran_internal_free'
rw_routines.f:(.text+0x1518): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1552): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x16de): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1706): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1740): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x175b): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0x17a2): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x17ab): undefined reference to `_gfortran_internal_free'
rw_routines.f:(.text+0x17d3): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x180d): undefined reference to `_gfortran_copy_string'
rw_routines.o: In function `load_gridpack_para_':
rw_routines.f:(.text+0x1899): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x18b0): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1b18): undefined reference to `_gfortran_copy_string'
rw_routines.o:rw_routines.f:(.text+0x1b3c): more undefined references to `_gfortran_copy_string' follow
rw_routines.o: In function `load_gridpack_para_':
rw_routines.f:(.text+0x1b8f): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0x1bd7): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1be1): undefined reference to `_gfortran_internal_free'
rw_routines.f:(.text+0x1c05): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1c3b): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1dbf): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1de3): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1e19): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1e36): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0x1e7d): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1e87): undefined reference to `_gfortran_internal_free'
rw_routines.f:(.text+0x1eab): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0x1ee1): undefined reference to `_gfortran_copy_string'
open_file.o: In function `open_file_':
open_file.f:(.text+0x4b): undefined reference to `_gfortran_copy_string'
open_file.f:(.text+0x193): undefined reference to `_gfortran_internal_malloc'
open_file.f:(.text+0x1cc): undefined reference to `_gfortran_copy_string'
open_file.f:(.text+0x1d4): undefined reference to `_gfortran_internal_free'
open_file.f:(.text+0x282): undefined reference to `_gfortran_copy_string'
collect2: ld returned 1 exit status
make: *** [../bin/internal/gen_ximprove] Error 1
Please try to fix this compilations issue and retry.
Help might be found at https://answers.launchpad.net/madgraph5.
If you think that this is a bug, you can report this at https://bugs.launchpad.net/madgraph5
Run Options
-----------
stdout_level : None
MadEvent Options
----------------
automatic_html_opening : False (user set)
cluster_temp_path : None
cluster_queue : madgraph
cluster_memory : None
cluster_time : None
nb_core : 1 (user set)
run_mode : 2
Configuration Options
---------------------
text_editor : None
cluster_status_update : (600, 30)
pythia8_path : None (user set)
hwpp_path : None (user set)
pythia-pgs_path : /nfs/scratch/pwinslow/MG5_aMC_v2_1_1/pythia-pgs (user set)
td_path : None (user set)
delphes_path : None (user set)
thepeg_path : None (user set)
cluster_type : condor
exrootanalysis_path : /nfs/scratch/pwinslow/MG5_aMC_v2_1_1/ExRootAnalysis (user set)
eps_viewer : None
web_browser : None
syscalc_path : None (user set)
madanalysis_path : None (user set)
lhapdf : lhapdf-config
hepmc_path : None (user set)
cluster_retry_wait : 300
fortran_compiler : None
auto_update : 7 (user set)
cluster_nb_retry : 1
timeout : 60
cpp_compiler : None
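
Since the linker output above is long and repetitive, here is a small sketch for condensing it into a count of undefined symbols per source file. It is just a convenience for reading the log, not part of MadGraph: the function name `tally_undefined` and the regular expression are my own, and the sample text is a handful of lines copied from the error.log above.

```python
import re
from collections import Counter

# Matches linker lines such as:
#   ranmar.f:(.text+0xb1): undefined reference to `_gfortran_copy_string'
UNDEFINED_REF = re.compile(
    r"^(?P<src>[\w.]+):\(\.text\+0x[0-9a-f]+\): "
    r"undefined reference to `(?P<sym>\w+)'"
)


def tally_undefined(log_text):
    """Count undefined references per (source file, symbol) pair."""
    counts = Counter()
    for line in log_text.splitlines():
        match = UNDEFINED_REF.match(line.strip())
        if match:
            counts[(match.group("src"), match.group("sym"))] += 1
    return counts


# A few lines copied from the error.log above.
sample = """\
ranmar.f:(.text+0xb1): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0xe22): undefined reference to `_gfortran_internal_malloc'
rw_routines.f:(.text+0xe6f): undefined reference to `_gfortran_copy_string'
rw_routines.f:(.text+0xe77): undefined reference to `_gfortran_internal_free'
open_file.f:(.text+0x4b): undefined reference to `_gfortran_copy_string'
collect2: ld returned 1 exit status
"""

summary = tally_undefined(sample)
missing_symbols = sorted({sym for _, sym in summary})

for (src, sym), n in sorted(summary.items()):
    print(f"{src}: {sym} x{n}")
```

Run over the full log, this makes it easy to see that every failure involves the same three symbols (`_gfortran_copy_string`, `_gfortran_internal_malloc`, `_gfortran_internal_free`), and that they are referenced from ranmar.o, rw_routines.o and open_file.o.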