results.dat not found

Bug #1774664 reported by Jack Y. Araz
This bug affects 4 people
Affects: MadGraph5_aMC@NLO
Status: Invalid
Importance: Undecided
Assigned to: Unassigned

Bug Description

Hi, I'm having the exact same problem as the one mentioned in ~/mg5amcnlo/+question/219819

The only difference is that I'm running on my workstation in multicore mode, not on a cluster. I have hit this error multiple times, for different processes.
"
Error when reading /home/jack/packages/MG5_aMC_v2_6_2/Generic_zttjj/SubProcesses/P1_qq_zttxqq/G1594.08/results.dat
Command "generate_events run_01" interrupted with error:
IOError : [Errno 2] No such file or directory: '/home/jack/packages/MG5_aMC_v2_6_2/Generic_zttjj/SubProcesses/P1_qq_zttxqq/G1594.08/results.dat'
Please report this bug on https://bugs.launchpad.net/mg5amcnlo
More information is found in '/home/jack/packages/MG5_aMC_v2_6_2/Generic_zttjj/run_01_tag_1_debug.log'.
Please attach this file to your report.
"
As mentioned in the link, the problem seems to be related to writing the HTML file, according to the debug file pasted below. Could you please help me solve the issue? Thanks.

Jack

---------------

generate_events run_01
Traceback (most recent call last):
  File "/home/jack/packages/MG5_aMC_v2_6_2/madgraph/interface/extended_cmd.py", line 1501, in onecmd
    return self.onecmd_orig(line, **opt)
  File "/home/jack/packages/MG5_aMC_v2_6_2/madgraph/interface/extended_cmd.py", line 1450, in onecmd_orig
    return func(arg, **opt)
  File "/home/jack/packages/MG5_aMC_v2_6_2/madgraph/interface/madevent_interface.py", line 2467, in do_generate_events
    self.run_generate_events(switch_mode, args)
  File "/home/jack/packages/MG5_aMC_v2_6_2/madgraph/interface/common_run_interface.py", line 6685, in new_fct
    original_fct(obj, *args, **opts)
  File "/home/jack/packages/MG5_aMC_v2_6_2/madgraph/interface/madevent_interface.py", line 2509, in run_generate_events
    self.exec_cmd('refine %s' % nb_event, postcmd=False)
  File "/home/jack/packages/MG5_aMC_v2_6_2/madgraph/interface/extended_cmd.py", line 1528, in exec_cmd
    stop = Cmd.onecmd_orig(current_interface, line, **opt)
  File "/home/jack/packages/MG5_aMC_v2_6_2/madgraph/interface/extended_cmd.py", line 1450, in onecmd_orig
    return func(arg, **opt)
  File "/home/jack/packages/MG5_aMC_v2_6_2/madgraph/interface/madevent_interface.py", line 3419, in do_refine
    x_improve.launch() # create the ajob for the refinment.
  File "/home/jack/packages/MG5_aMC_v2_6_2/madgraph/madevent/gen_ximprove.py", line 861, in launch
    main_dir=pjoin(self.cmd.me_dir,'SubProcesses')) #main_dir is for gridpack readonly mode
  File "/home/jack/packages/MG5_aMC_v2_6_2/madgraph/madevent/sum_html.py", line 728, in collect_result
    P_comb.add_results(os.path.basename(G), path, mfactors[G])
  File "/home/jack/packages/MG5_aMC_v2_6_2/madgraph/madevent/sum_html.py", line 412, in add_results
    oneresult.read_results(filepath)
  File "/home/jack/packages/MG5_aMC_v2_6_2/madgraph/madevent/sum_html.py", line 279, in read_results
    finput = open(filepath)
IOError: [Errno 2] No such file or directory: '/home/jack/packages/MG5_aMC_v2_6_2/Generic_zttjj/SubProcesses/P1_qq_zttxqq/G1594.08/results.dat'
Related File: /home/jack/packages/MG5_aMC_v2_6_2/Generic_zttjj/SubProcesses/P1_qq_zttxqq/G1594.08/results.dat
                              Run Options
                              -----------
               stdout_level : 20 (user set)

                         MadEvent Options
                         ----------------
     automatic_html_opening : False (user set)
        notification_center : True
          cluster_temp_path : None
             cluster_memory : None (user set)
               cluster_size : 100
              cluster_queue : None (user set)
                    nb_core : 8 (user set)
               cluster_time : 8 (user set)
                   run_mode : 2

                      Configuration Options
                      ---------------------
                text_editor : emacs -nw (user set)
         cluster_local_path : None
      cluster_status_update : (600, 30)
               pythia8_path : None (user set)
                  hwpp_path : None (user set)
            pythia-pgs_path : None (user set)
                    td_path : None (user set)
               delphes_path : None (user set)
                thepeg_path : None (user set)
               cluster_type : condor
          madanalysis5_path : None (user set)
           cluster_nb_retry : 1
                 eps_viewer : None
                web_browser : None
               syscalc_path : None (user set)
           madanalysis_path : None (user set)
                     lhapdf : lhapdf-config
              f2py_compiler : None
                 hepmc_path : None (user set)
         cluster_retry_wait : 300
           fortran_compiler : None
                auto_update : 7 (user set)
        exrootanalysis_path : None (user set)
                    timeout : 60
               cpp_compiler : None
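
The traceback above stops at the first channel directory whose results.dat cannot be opened. As a rough diagnostic, the following Python sketch lists every channel directory that lacks the file, which may help to see whether one job or many failed. It assumes the P*/G* layout visible in the path above (SubProcesses/P1_qq_zttxqq/G1594.08/results.dat); the function name find_missing_results is made up for this illustration and is not part of MadGraph.

```python
from pathlib import Path


def find_missing_results(subprocesses_dir):
    """Return the G* channel directories under P*/ that have no results.dat.

    Assumption: channels live in <subprocesses_dir>/P*/G*, as in the
    path reported by the error message.
    """
    root = Path(subprocesses_dir)
    missing = []
    for channel in sorted(root.glob("P*/G*")):
        # Only directories are channels; skip stray files matching G*.
        if channel.is_dir() and not (channel / "results.dat").is_file():
            missing.append(channel)
    return missing
```

For example, find_missing_results("Generic_zttjj/SubProcesses") would return the list of channel directories to inspect; the log files inside those directories are the ones worth reading first.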

Olivier Mattelaer (olivier-mattelaer) wrote :

Thanks,

I have added this on my todo list for my next working day. This would be on Wednesday (I'm not working for the physics department/community on Monday/Tuesday).

Cheers,

Olivier

Jack Y. Araz (jackaraz) wrote :

Dear Olivier

Additional info:

I got the very same error in a similar process. Here is how I produced it (please note that the error only appears at the very end, so it takes a while to reproduce).

define w = w- w+
generate p p > w t t~ j j QED<=5

Centre-of-mass energy is 14 TeV; all cuts are the defaults.

Thank you very much

Best regards

Olivier Mattelaer (olivier-mattelaer) wrote : Re: [Bug 1774664] results.dat not found

Hi,

Sorry I'm one day late on this thread.
The problem is linked to the dynamical scale choice.
So one easy way to fix the problem is to not use the default dynamical scale choice
but a simpler one (like sum HT/2/...)
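
A minimal sketch of that first workaround, assuming the scale is selected by a run_card.dat line of the form "-1 = dynamical_scale_choice ! ...", and assuming that the value 3 selects HT/2 (please check both against the comments in your own run_card.dat). The helper name set_dynamical_scale_choice is hypothetical; it only rewrites the text of the card.

```python
import re


def set_dynamical_scale_choice(run_card_text, value):
    """Return the run card text with dynamical_scale_choice set to `value`.

    Assumption: the card contains a line "<value> = dynamical_scale_choice".
    """
    pattern = re.compile(
        r"^([ \t]*)\S+([ \t]*=[ \t]*dynamical_scale_choice\b)",
        re.MULTILINE,
    )
    new_text, count = pattern.subn(
        lambda m: f"{m.group(1)}{value}{m.group(2)}", run_card_text
    )
    if count == 0:
        raise ValueError("no dynamical_scale_choice line found in run card")
    return new_text
```

Reading Cards/run_card.dat, passing its content through this function and writing it back before launching the run is equivalent to editing that one line by hand.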

Another solution is to use the following patch.
This is not yet fully tested (such a change can have a huge impact on MLM merging and I have to be careful about that), but it solves the problem for the directory that you pointed out.

Thanks a lot for such report,

Olivier

=== modified file 'Template/LO/SubProcesses/reweight.f'
--- Template/LO/SubProcesses/reweight.f 2018-03-11 12:29:51 +0000
+++ Template/LO/SubProcesses/reweight.f 2018-06-07 19:04:43 +0000
@@ -545,6 +545,7 @@
       logical chclusold,fail,increasecode
       save chclusold
       integer tmpindex
+     integer pdgm, pdgid1, pdgid2

       logical isqcd,isjet,isparton,cluster,isjetvx,is_octet
       integer ifsno
@@ -746,8 +747,17 @@
 c The ishft gives the FS particle corresponding to imocl
               if(.not.is_octet(ipdgcl(ishft(1,ipart(1,imocl(n))-1),igraphs(1),iproc)))then
                  ! split case for q a > q and for g > g h (with the gluon splitting into quark)
-                 if (ipart(2,imocl(n)).eq.0) then ! q a > q case
-                    iqjets(ipart(1,imocl(n)))=0
+c                ! also check for case of three scalar interaction (then do nothing)
+                 pdgm = ipdgcl(imocl(n),igraphs(1),iproc)
+                 pdgid1 = ipdgcl(idacl(n,1),igraphs(1),iproc)
+                 pdgid2 = ipdgcl(idacl(n,2),igraphs(1),iproc)
+
+                 if (.not.isqcd(pdgm).and..not.isqcd(pdgid1).and..not.isqcd(pdgid2)) then
+                    ! this is to avoid to do weird stuff for w+ w- z (or h h h)
+                    ! this fix an issue for qq_zttxqq G1594.08
+                    continue
+                 elseif (ipart(2,imocl(n)).eq.0) then ! q a > q case
+                    iqjets(ipart(1,imocl(n)))=0
                  else ! octet. want to be sure that both are tagged as jet before removing one
                     ! this prevent that both are removed in case of g > g h , g > q1 q2, q1 > a q1.
                     ! at least one of the two should be kept as jet


Changed in mg5amcnlo:
status: New → In Progress
Changed in mg5amcnlo:
status: In Progress → Fix Released
Jack Y. Araz (jackaraz) wrote :

Hi Olivier
I'm still having the same problem with the updated version. I'm attaching the run, proc and debug files. Thanks

Changed in mg5amcnlo:
status: Fix Released → In Progress
Olivier Mattelaer (olivier-mattelaer) wrote :

Hi,

I cannot reproduce the problem with your card/setup.
If you want me to look further into this issue, I would need you to tar your full process directory and attach it here, so that I can look at the details of all the log/result files and find out what happened.

Cheers,

Olivier

Changed in mg5amcnlo:
status: In Progress → Incomplete
Jack Y. Araz (jackaraz) wrote :

Hi Olivier

Please find it in the attachment. Hope it helps.

Thanks

Olivier Mattelaer (olivier-mattelaer) wrote :

Hi,

I cannot reproduce the problem when I run locally (just changing the compiler flag to match my machine).

Looking at the reason for the crash, it is reported as "code stop with end-code 2".
I'm not aware of any place in the code where we use that return code (and I did not find any line in the code that would stop with that number), so it is likely that those processes were stopped by your machine.

The default meaning of that end-code is "Misuse of shell builtins" (according to the Bash documentation).
The weird thing is that more than 80% of your computations finished normally.
So I have no idea what more I can do to help you here.
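
One way to gather more information on such a failure is to rerun one of the failing jobs by hand and look at how it ended. The Python sketch below is only an illustration under stated assumptions: describe_returncode is a hypothetical helper, and it relies on the documented behaviour of subprocess on POSIX systems, where a negative return code means the child was terminated by a signal and a positive one is the exit status chosen by the program or its runtime.

```python
import signal
import subprocess
import sys


def describe_returncode(returncode):
    """Translate a subprocess return code into a short description."""
    if returncode == 0:
        return "finished normally"
    if returncode < 0:
        # On POSIX, -N means the child was terminated by signal N.
        try:
            name = signal.Signals(-returncode).name
        except ValueError:
            name = str(-returncode)
        return f"killed by signal {name}"
    return f"exited on its own with status {returncode}"


def run_and_describe(command):
    """Run `command` (a list of arguments) and describe how it ended."""
    completed = subprocess.run(command)
    return describe_returncode(completed.returncode)
```

For instance, run_and_describe([sys.executable, "-c", "import sys; sys.exit(2)"]) reports an exit with status 2, whereas a job removed by the system with a signal would be reported as killed.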

Sorry,

Olivier

Checking the meaning of that error message online does not help much either.

Changed in mg5amcnlo:
status: Incomplete → Invalid
Ellen (ellenriefel)
information type: Public → Public Security
Olivier Mattelaer (olivier-mattelaer) wrote :

Please create your own dedicated bug report, since this is likely something different (and it would pollute this thread).

Please add the same information as was requested for this bug
(i.e. the various cards, the way you generated your matrix element (since EVA requires a dedicated option at generation time), and the log file of the crashing job).
Without them, I will not be able to help.

Cheers,

Olivier
