Skip to content

DAOS-18395 test: recovery/pool_list_consolidation.py test_lost_majori…#17965

Draft
shimizukko wants to merge 21 commits intomasterfrom
makito/DAOS-18395
Draft

DAOS-18395 test: recovery/pool_list_consolidation.py test_lost_majori…#17965
shimizukko wants to merge 21 commits intomasterfrom
makito/DAOS-18395

Conversation

@shimizukko
Copy link
Copy Markdown
Contributor

…ty_ps_replicas MD-on-SSD support

In MD-on-SSD, we need to load the pool dir to a new tmpfs because the pool dir will disappear when we
stop the servers, so load with prov_mem.

We need to use ddb prov_mem to remove rdb-pool, so add support in ddb_utils.py and the logic to determine the parameters for ddb prov_mem.

Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

…ty_ps_replicas MD-on-SSD support

In MD-on-SSD, we need to load the pool dir to a new
tmpfs because the pool dir will disappear when we
stop the servers, so load with prov_mem.

We need to use ddb prov_mem to remove rdb-pool, so
add support in ddb_utils.py and the logic to determine
the parameters for ddb prov_mem.

Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Signed-off-by: Makito Kano <[email protected]>
@github-actions
Copy link
Copy Markdown

Ticket title is 'CR Test Update - recovery/pool_list_consolidation.py test_lost_majority_ps_replicas MD-on-SSD Support'
Status is 'In Progress'
Labels: 'catastrophic_recovery'
https://daosio.atlassian.net/browse/DAOS-18395

Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Signed-off-by: Makito Kano <[email protected]>
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Signed-off-by: Makito Kano <[email protected]>
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Signed-off-by: Makito Kano <[email protected]>
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Signed-off-by: Makito Kano <[email protected]>
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Signed-off-by: Makito Kano <[email protected]>
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Signed-off-by: Makito Kano <[email protected]>
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Signed-off-by: Makito Kano <[email protected]>
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Signed-off-by: Makito Kano <[email protected]>
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Signed-off-by: Makito Kano <[email protected]>
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
@daosbuild3
Copy link
Copy Markdown
Collaborator

@daosbuild3
Copy link
Copy Markdown
Collaborator

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17965/11/execution/node/813/log

Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
@daosbuild3
Copy link
Copy Markdown
Collaborator

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17965/12/execution/node/769/log

…ng of --db_path

Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Signed-off-by: Makito Kano <[email protected]>
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Signed-off-by: Makito Kano <[email protected]>
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
@daosbuild3
Copy link
Copy Markdown
Collaborator

Test stage Functional Hardware Medium completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos//view/change-requests/job/PR-17965/15/testReport/

@daosbuild3
Copy link
Copy Markdown
Collaborator

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17965/15/execution/node/705/log

Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
@daosbuild3
Copy link
Copy Markdown
Collaborator

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17965/16/execution/node/705/log

Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
@daosbuild3
Copy link
Copy Markdown
Collaborator

Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium: false
Test-tag: test_lost_majority_ps_replicas DdbTest
@daosbuild3
Copy link
Copy Markdown
Collaborator

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17965/18/execution/node/750/log

@shimizukko
Copy link
Copy Markdown
Contributor Author

Engines aren't starting after dmg storage format for ddb.py tests. Possibly due to https://daosio.atlassian.net/browse/DAOS-18860
For some reason, pool_list_consolidation.py test is working. ddb.py fails in either PMEM or MD-on-SSD cluster. Same results after rerunning the tests in CI for a few times.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

2 participants