Crash when changing the number of SubDomains

Asked by Gonzalo Tancredi

I have a weird situation.
I am running a simulation of a falling box filled with 344752 particles.
If I use the following number of SubDomains
numSubdomainsX = 3
numSubdomainsY = 1
numSubdomainsZ = 3
the simulation runs smoothly for several days.
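For reference, the decompositions discussed in this thread differ mainly in how many MPI processes they require. Below is a minimal sketch, assuming the usual ESyS-Particle layout of one master process plus one worker per subdomain, so that the `-np` value passed to mpirun is the number of subdomains plus one. The helper name `mpi_process_count` is hypothetical and not part of ESyS-Particle.

```python
def mpi_process_count(nx, ny, nz):
    """Return (workers, total_processes) for an nx x ny x nz decomposition.

    Assumption: one worker per subdomain plus a single master process.
    """
    workers = nx * ny * nz
    return workers, workers + 1


NUM_PARTICLES = 344752  # particle count quoted in the question

for dims in [(3, 1, 3), (4, 1, 4)]:
    workers, total = mpi_process_count(*dims)
    # Rough average only; the real distribution depends on the geometry.
    per_worker = NUM_PARTICLES // workers
    print("%dx%dx%d: %d workers, mpirun -np %d, about %d particles per worker"
          % (dims[0], dims[1], dims[2], workers, total, per_worker))
```

If the larger process count exceeds the cores or memory available on the nodes, that is worth ruling out, although nothing in this thread confirms it as the cause.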

But when I try to increase the number of SubDomains, the simulation starts and after a while (typically more than an hour) it stops, and the only error message I get is

mpirun noticed that process rank 0 with PID 14310 on node master exited on signal 1 (Hangup).

The node name changes with different runs.

Any clues on what could be the problem?
Is there any way to obtain more information about the causes of the crash?

Gonzalo
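One way to get slightly more information about a "signal 1 (Hangup)" exit is to record when the signal arrives in the Python process. The following is a minimal sketch using only the Python standard library on a Unix system. It assumes the simulation script is an ordinary Python script; the function name `install_hangup_logger` and the log file name are hypothetical. Note that Python runs signal handlers only between bytecode instructions, so if the process is inside a long compiled call the log entry may be delayed, and installing a handler also replaces the default action of terminating the process.

```python
import os
import signal
import time
import traceback


def install_hangup_logger(log_path):
    """Append a timestamp and Python stack trace to log_path on SIGHUP."""
    def handler(signum, frame):
        with open(log_path, "a") as log:
            log.write("%s: received signal %d in PID %d\n"
                      % (time.ctime(), signum, os.getpid()))
            # frame is the Python frame that was interrupted by the signal.
            traceback.print_stack(frame, file=log)

    # Replaces the default SIGHUP action (process termination).
    signal.signal(signal.SIGHUP, handler)


# Hypothetical usage near the top of the simulation script:
# install_hangup_logger("hangup.log")
```

A hangup signal is commonly sent when the controlling terminal or remote login session closes, so comparing the logged time with session disconnects may help. Whether that applies to the crash described here is not established in this thread.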

Question information

Language: English
Status: Answered
For: ESyS-Particle
Assignee: No assignee

Dion Weatherley (d-weatherley) said :
#1

Hi Gonzalo,

I've experienced this previously on some hardware. Can you please send me your scripts and I'll try them myself. This may be a hardware issue though...

Cheers,

Dion

Gonzalo Tancredi (gonzalo) said :
#2

Dion

Thank you very much for your answer.

Attached to this mail you will find the files necessary to run one of the tests
that fail with 4x1x4 SubDomains.

The only runs that worked were those with 3x1x3.

Regards

Gonzalo

    Gonzalo Tancredi

Director Observatorio Astronomico Los Molinos - DICYT - MEC
Profesor Titular
Dpto. Astronomia Tel : (598-2) 2525 86 24/25/26 int. 319
Facultad Ciencias Fax : (598 2) 2525 05 80
Igua 4225 Email : <email address hidden>
11400 Montevideo - URUGUAY http://www.fisica.edu.uy/~gonzalo


Dion Weatherley (d-weatherley) said :
#3

Hi Gonzalo,

You will need to send me your scripts directly via email. Launchpad does not allow attachments to be uploaded.

Cheers,

Dion
