Generate events failing during re-weighting step

Asked by Andrew Wightman on 2018-02-01

Hi MG Experts,

I am running MG5_aMC v2.6.1 and it is failing during the re-weighting step. I'm able to run just fine on previous versions of MG (v2.3.3) or with fewer events (<20k). The code seems to run for awhile and even finishes some events, but then raises an error.:

reweight -from_cards
INFO: split the event file in bunch of 2500 events
INFO: Idle: 1, Running: 19, Completed: 0 [ current time: 09h29 ]
INFO: Idle: 0, Running: 19, Completed: 1 [ 1m 15s ]
INFO: Idle: 0, Running: 18, Completed: 2 [ 2m 53s ]
WARNING: program /cvmfs/cms.cern.ch/slc6_amd64_gcc530/cms/cmssw/CMSSW_8_1_0/external/slc6_amd64_gcc530/bin/python /afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/madevent_interface.py reweight /afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/Events/run_13/unweighted_events.lhe.gz_2.lhe -from_cards --multicore=wait launch ends with non zero status: 1. Stop all computation
Command "generate_events -f" interrupted with error:
Exception : program /cvmfs/cms.cern.ch/slc6_amd64_gcc530/cms/cmssw/CMSSW_8_1_0/external/slc6_amd64_gcc530/bin/python /afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/madevent_interface.py reweight /afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/Events/run_13/unweighted_events.lhe.gz_2.lhe -from_cards --multicore=wait launch ends with non zero status: 1. Stop all computation
Please report this bug on https://bugs.launchpad.net/mg5amcnlo
More information is found in '/afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/run_13_tag_1_debug.log'.

Below is the traceback from the corresponding log file:

generate_events -f
Traceback (most recent call last):
  File "/afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/extended_cmd.py", line 1467, in onecmd
    return self.onecmd_orig(line, **opt)
  File "/afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/extended_cmd.py", line 1421, in onecmd_orig
    return func(arg, **opt)
  File "/afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/madevent_interface.py", line 2550, in do_generate_events
    self.exec_cmd('reweight -from_cards', postcmd=False)
  File "/afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/extended_cmd.py", line 1494, in exec_cmd
    stop = Cmd.onecmd_orig(current_interface, line, **opt)
  File "/afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/extended_cmd.py", line 1421, in onecmd_orig
    return func(arg, **opt)
  File "/afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/common_run_interface.py", line 1985, in do_reweight
    mycluster.wait(self.me_dir,update_status)
  File "/afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/cluster.py", line 822, in wait
    raise Exception, self.fail_msg
Exception: program /cvmfs/cms.cern.ch/slc6_amd64_gcc530/cms/cmssw/CMSSW_8_1_0/external/slc6_amd64_gcc530/bin/python /afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/madevent_interface.py reweight /afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/Events/run_09/unweighted_events.lhe.gz_12.lhe -from_cards --multicore=wait launch ends with non zero status: 1. Stop all computation

Cheers,
Andrew

Question information

Language:
English Edit question
Status:
Answered
For:
MadGraph5_aMC@NLO Edit question
Assignee:
No assignee Edit question
Last query:
2018-02-01
Last reply:
2018-02-02

Can you run manually:

/cvmfs/cms.cern.ch/slc6_amd64_gcc530/cms/cmssw/CMSSW_8_1_0/external/slc6_amd64_gcc530/bin/python<http://cms.cern.ch/slc6_amd64_gcc530/cms/cmssw/CMSSW_8_1_0/external/slc6_amd64_gcc530/bin/python>/afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/madevent_interface.py<http://crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/madevent_interface.py> reweight /afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/Events/run_09/unweighted_events.lhe.gz_12.lhe<http://crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/Events/run_09/unweighted_events.lhe.gz_12.lhe> -from_cards --multicore=wait

to see what is the associate crash?

Cheers,

Olivier

On 1 Feb 2018, at 15:52, Andrew Wightman <<email address hidden><mailto:<email address hidden>>> wrote:

/cvmfs/cms.cern.ch/slc6_amd64_gcc530/cms/cmssw/CMSSW_8_1_0/external/slc6_amd64_gcc530/bin/python<http://cms.cern.ch/slc6_amd64_gcc530/cms/cmssw/CMSSW_8_1_0/external/slc6_amd64_gcc530/bin/python>/afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/madevent_interface.py<http://crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/madevent_interface.py> reweight /afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/Events/run_09/unweighted_events.lhe.gz_12.lhe<http://crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/Events/run_09/unweighted_events.lhe.gz_12.lhe> -from_cards --multicore=wait

Hi,

looks like launchpad tried to format the command line:
so let me retry:

/cvmfs/cms.cern.ch/slc6_amd64_gcc530/cms/cmssw/CMSSW_8_1_0/external/slc6_amd64_gcc530/bin/python /afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/bin/internal/madevent_interface.py reweight /afs/crc.nd.edu/user/a/awightma/CMSSW_Releases/CMSSW_8_1_0/src/NPFitProduction/NPFitProduction/test/reweight_MGv261/MG5_aMC_v2_6_1/processtmp/Events/run_09/unweighted_events.lhe.gz_12.lhe -from_cards --multicore=wait

Can you help with this problem?

Provide an answer of your own, or ask Andrew Wightman for more information if necessary.

To post a message you must log in.