Error: Too many open files when running MG5 v2.5.4

Asked by ARUNPRASATH V

Dear MG5 team,
I was trying to generate events for p p > n1 x1+ j j , ( n2 > z n1 , (z > l+ l- )), ( x1+ > w+ n1 , (w+ > l+ vl )) in multicore mode. The process had about 12,000 diagrams. While madevent was running, event generation stopped abruptly with the following message:

sh: error while loading shared libraries. libtinfo.so.5: cannot open shared object file: Error 24
Error : too many open files.

The library libtinfo.so.5 was available in the /usr/lib64 directory.

Operating system: CentOS 6.8, Linux kernel version 2.6.32. The command ulimit -Hn gives 4096.

MadGraph version: 2.5.4, mode = multicore, max. number of cores = 24 (user set).
I would appreciate any help in resolving this issue.

I have not raised the ulimit, as I do not know whether that is a wise decision. There was no problem with processes that had a small number of diagrams.

Thank you.
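
Editor's note: ulimit -Hn reports the hard limit, while the limit actually enforced on a process is the soft one (ulimit -Sn), which is often lower. The following is a minimal Python sketch, not part of MG5, that reads both values and raises the soft limit to the hard limit for the current process and its children. The function names are hypothetical; only the standard `resource` module is used, and no root privileges are required as long as the hard limit itself is not exceeded.

```python
import resource


def nofile_limits():
    """Return the (soft, hard) limit on open file descriptors for this process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def raise_soft_limit_to_hard():
    """Raise the soft limit to the hard limit and return the new (soft, hard) pair.

    Nothing is changed when the hard limit is reported as unlimited, because
    some systems refuse an unlimited value for this particular resource.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    if hard != resource.RLIM_INFINITY and soft < hard:
        resource.setrlimit(resource.RLIMIT_NOFILE, (hard, hard))
    return resource.getrlimit(resource.RLIMIT_NOFILE)


if __name__ == "__main__":
    print("before:", nofile_limits())
    print("after: ", raise_soft_limit_to_hard())
```

The change only affects the Python process that makes the call and the processes it starts afterwards; it does not alter the limits of the login shell.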

Question information

Language: English
Status: Expired
For: MadGraph5_aMC@NLO
Assignee: No assignee

Olivier Mattelaer (olivier-mattelaer) said :
#1

Hi,

Did you run a scan, or something similar?
This type of problem is typically caused by a file that is opened but never closed properly. For it to become a problem, this usually has to happen very often (which is why I suspect a scan).

Can you show the full log, so that I can at least see at which stage of the computation the problem occurs?

Cheers,

Olivier
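
Editor's note: the failure mode described above (files opened repeatedly without being closed) can be illustrated with a small Python sketch. This is not MG5 code; the function names are hypothetical, and the descriptor count relies on /proc/self/fd, so it assumes Linux.

```python
import os
import tempfile


def count_open_fds():
    """Count this process's open file descriptors (Linux only: reads /proc/self/fd)."""
    return len(os.listdir("/proc/self/fd"))


def leaky_reads(path, n):
    """Open `path` n times and keep every handle alive, so descriptors pile up."""
    handles = []
    for _ in range(n):
        handles.append(open(path))  # stays open for as long as `handles` is alive
    return handles


def clean_reads(path, n):
    """Open `path` n times, closing each handle as soon as its `with` block exits."""
    for _ in range(n):
        with open(path) as fh:
            fh.read()


if __name__ == "__main__":
    with tempfile.NamedTemporaryFile() as tmp:
        start = count_open_fds()
        clean_reads(tmp.name, 100)
        print("after clean reads:", count_open_fds() - start, "extra descriptors")
        kept = leaky_reads(tmp.name, 100)
        print("after leaky reads:", count_open_fds() - start, "extra descriptors")
        for handle in kept:
            handle.close()
```

When such a leak happens inside a loop that runs thousands of times, the process eventually reaches its soft limit on open files and every further open, including the loading of shared libraries by a child shell, fails with errno 24.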

> On 27 Jul 2017, at 17:11, ARUNPRASATH V <email address hidden> wrote:
>
> New question #653782 on MadGraph5_aMC@NLO:
> https://answers.launchpad.net/mg5amcnlo/+question/653782

ARUNPRASATH V (arvhep) said :
#2

Dear Olivier,
Thank you for your prompt reply. I am pasting below the debug log file:
--------------------------------------------
launch
Traceback (most recent call last):
  File "/home/x/y/ppn2x1phwww/bin/internal/extended_cmd.py", line 1415, in onecmd
  File "/home/x/y/ppn2x1phwww/bin/internal/extended_cmd.py", line 1369, in onecmd_orig
  File "/home/x/y/ppn2x1phwww/bin/internal/madevent_interface.py", line 2201, in do_launch
  File "/home/x/y/ppn2x1phwww/bin/internal/madevent_interface.py", line 2068, in do_generate_events
  File "/home/x/y/ppn2x1phwww/bin/internal/extended_cmd.py", line 1442, in exec_cmd
  File "/home/x/y/ppn2x1phwww/bin/internal/extended_cmd.py", line 1369, in onecmd_orig
  File "/home/x/y/ppn2x1phwww/bin/internal/madevent_interface.py", line 3015, in do_refine
  File "/home/x/y/ppn2x1phwww/bin/internal/extended_cmd.py", line 1506, in compile
  File "/home/x/y/ppn2x1phwww/bin/internal/misc.py", line 439, in compile
MadGraph5Error: Impossible to compile /home/x/y/ppn2x1phwww/SubProcesses/P3_gsx_n2x1pgux_n2_hn1_h_wpwm_wp_mupvm_wm_mumvmx_x1p_wpn1_wp_mupvm directory
Trying to launch make command returns:
    [Errno 24] Too many open files
In general this means that your computer is not able to compile.
                              Run Options
                              -----------
               stdout_level : None

                         MadEvent Options
                         ----------------
     automatic_html_opening : False (user set)
        notification_center : True
          cluster_temp_path : None
             cluster_memory : None
               cluster_size : 100
              cluster_queue : None
                    nb_core : 24 (user set)
               cluster_time : None
                   run_mode : 2

                      Configuration Options
                      ---------------------
                text_editor : nano (user set)
         cluster_local_path : None
      cluster_status_update : (600, 30)
               pythia8_path : /home/x/y (user set)
                  hwpp_path : None (user set)
            pythia-pgs_path : None (user set)
                    td_path : None (user set)
               delphes_path : None (user set)
                thepeg_path : None (user set)
               cluster_type : condor
          madanalysis5_path : None (user set)
           cluster_nb_retry : 1
                 eps_viewer : None
                web_browser : None
               syscalc_path : None (user set)
           madanalysis_path : None (user set)
                     lhapdf : lhapdf-config
              f2py_compiler : None
                 hepmc_path : /home/x/y/lib (user set)
         cluster_retry_wait : 300
           fortran_compiler : None
                auto_update : 7 (user set)
        exrootanalysis_path : None (user set)
                    timeout : 60
               cpp_compiler : None
--------------------------------------------------------------
Thank you.
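
Editor's note: the "[Errno 24] Too many open files" in the log above is the operating system's EMFILE error. To find out what is holding the descriptors at the moment of failure, one option is to inspect /proc/<pid>/fd for the running process. The sketch below is an assumption-laden illustration, not an MG5 facility: the function names are hypothetical, it works only on Linux, and reading another user's process requires suitable permissions.

```python
import collections
import errno
import os


def is_fd_exhaustion(err):
    """Return True if `err` is an OSError carrying EMFILE ('Too many open files')."""
    return isinstance(err, OSError) and err.errno == errno.EMFILE


def open_fd_targets(pid="self"):
    """Count how many descriptors of process `pid` point at each target.

    Targets are whatever the /proc/<pid>/fd symlinks resolve to: regular
    files, pipes, sockets and so on.
    """
    fd_dir = "/proc/{}/fd".format(pid)
    counts = collections.Counter()
    for name in os.listdir(fd_dir):
        try:
            counts[os.readlink(os.path.join(fd_dir, name))] += 1
        except OSError:
            # The descriptor was closed between listdir() and readlink().
            pass
    return counts


if __name__ == "__main__":
    # Show the ten targets held open most often by the current process.
    for target, n in open_fd_targets().most_common(10):
        print(n, target)
```

Running `open_fd_targets` against the PID of the main madevent process while the job is in progress would show whether one file or pipe is held open many times, which helps narrow down the stage at which descriptors accumulate.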

Launchpad Janitor (janitor) said :
#3

This question was expired because it remained in the 'Open' state without activity for the last 15 days.