DAOS-18236 test: overhaul daos_racer usage#17980
DAOS-18236 test: overhaul daos_racer usage#17980daltonbohning wants to merge 1 commit intomasterfrom
Conversation
|
Ticket title is 'daos_racer/parallel.py:DaosRacerParallelTest.test_daos_racer_parallel - Failed to initialize step=2, rc=-1025' |
617a0e6 to
c467c98
Compare
|
Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17980/3/execution/node/735/log |
c467c98 to
dd01ab8
Compare
|
Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17980/4/execution/node/735/log |
dd01ab8 to
5ff7666
Compare
|
Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17980/5/execution/node/735/log |
ce16e52 to
e92f061
Compare
|
Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17980/8/execution/node/721/log |
e92f061 to
1d185e1
Compare
|
Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17980/9/execution/node/680/log |
1d185e1 to
6f20294
Compare
|
Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17980/10/execution/node/681/log |
6f20294 to
76d421b
Compare
|
Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17980/11/execution/node/681/log |
- Change clush_timeout to daos_racer_timeout to be clear - Removing debug loggin from daos_racer/parallel.py - Use ppn in daos_racer/parallel.py - Remove hardcoded envs and openmpi load from daos_racer_utils.py because this is done by the job manager - Adjust Orterun.assign_processes to accept ppn Test-tag: daos_racer OSAOnlineExtend OSAOnlineParallelTest OSAOnlineReintegration SoakSmoke test_daos_management Skip-unit-tests: true Skip-fault-injection-test: true Signed-off-by: Dalton Bohning <[email protected]>
76d421b to
b6fa1ab
Compare
|
Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17980/12/execution/node/764/log |
|
We can see this is actually using MPI now, where it was not before. Checking for |
because this is done by the job manager
Test-tag: daos_racer
Skip-unit-tests: true
Skip-fault-injection-test: true
Steps for the author:
After all prior steps are complete: