I've added two env vars to the configure script: one for the install directory for the source and one for the install directory of the objects.
A variable is changed without proper mutex locking.
Hang should be fixed now.
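For readers unfamiliar with this class of bug, here is a minimal sketch of what "changing a variable with proper mutex locking" looks like. It is not taken from the PSI sources: the `Counter` class and `run_workers` helper are hypothetical names used only for illustration, and Python's `threading.Lock` stands in for whatever mutex primitive the real code uses.

```python
import threading


class Counter:
    """Hypothetical shared state, for illustration only (not PSI code)."""

    def __init__(self):
        self._value = 0
        self._lock = threading.Lock()

    def increment(self):
        # Hold the lock for the whole read-modify-write, so that no other
        # thread can interleave between reading and writing the value.
        with self._lock:
            self._value += 1

    @property
    def value(self):
        with self._lock:
            return self._value


def run_workers(n_threads=8, n_increments=10_000):
    """Start n_threads threads that each increment one shared counter."""
    counter = Counter()

    def work():
        for _ in range(n_increments):
            counter.increment()

    threads = [threading.Thread(target=work) for _ in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return counter.value
```

With the lock in place, the final count always equals `n_threads * n_increments`. Without it, whether an update is lost depends on how the threads happen to be scheduled, which is why bugs of this kind tend to show up only intermittently.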
…add mpi PSI goalf easyconfig
…into psi
* 'psi' of https://github.com/boegel/easybuild-easyconfigs:
  remove respecifying default max_fail_ratio of 0.5 in NWChem easyconfig
  really fix typo in patch file
  fix typo in patch file
  complete mpi patch file for PSI-4.0b5
  align PSI easyconfigs, correct filename of -mt PSI goalf easyconfig, add mpi PSI goalf easyconfig
All merged, ready to go! (At least!)
I retested all of this on gengar, with PSI being installed on gengar scratch, which worked fine. I don't think the problem is in the build, however, so I won't hold this PR any longer. Just to clarify things: when you tested on raichu, on which filesystem was the target installation dir for PSI?
@boegel, I've tested on raichu, both on /tmp and on the scratch. In both cases everything worked fine. If you can reproduce those hangs, I'm still interested in the backtraces.
I'm seeing the hang again, this time with PSI 4.0b4 on delcatty using delcatty's scratch (so not related to using NFS or not, indeed). This might be because the It's hanging on: Here's the backtrace:
Yeah, this is with 99% certainty the same bug. Apply I'm puzzled that you can trigger this bug so easily. The threads have to follow a certain pattern for it to trigger: the only way I could reproduce it was on an NFS share, but I don't understand the correlation between NFS and the scheduler. Once loaded into memory, NFS should not make any difference.
This is the one to rule them all.
This PR replaces #443, #434, #439, #433 and #439.
Tested on raichu, and everything works.
Depends on easybuilders/easybuild-easyblocks#270