Skip to content

DAOS-16575 rebuild: set rgt DRS_COMPLETED for unretryable failure#18204

Open
liuxuezhao wants to merge 1 commit into
masterfrom
lxz/rb_test_refine
Open

DAOS-16575 rebuild: set rgt DRS_COMPLETED for unretryable failure#18204
liuxuezhao wants to merge 1 commit into
masterfrom
lxz/rb_test_refine

Conversation

@liuxuezhao
Copy link
Copy Markdown
Contributor

If rebuild failed with unretryable failure, set it as DRS_COMPLETED to avoid following pool query always report rebuild busy such as - Rebuild failing (state=busy, status=-2001)
after this patch, that kind of failed rebuild will report - Rebuild failed (state=done, status=-2001)
This can fix test_rebuild_31/REBUILD31 problem.

For test_rebuild_32/REBUILD32 add DAOS_POOL_TGT_UPDATE_SKIP_RF_CHECK to bypass DAOS_POOL_RF checking when changing pool map.

Fixes: DAOS-16575 DAOS-16576
Test-tag: pr test_rebuild_31 test_rebuild_32

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

@liuxuezhao liuxuezhao requested review from a team as code owners May 8, 2026 07:42
@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 8, 2026

Ticket title is 'daos_test/rebuild.py:DaosCoreTestRebuild.test_rebuild_31 and test_rebuild_32 failures'
Status is 'In Review'
Labels: 'ci_master_daily,daily_test'
https://daosio.atlassian.net/browse/DAOS-16575

@daosbuild3
Copy link
Copy Markdown
Collaborator

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-18204/1/execution/node/1343/log

If rebuild failed with unretryable failure, set it as DRS_COMPLETED
to avoid following pool query always report rebuild busy such as -
Rebuild failing (state=busy, status=-2001)
after this patch, that kind of failed rebuild will report -
Rebuild failed (state=done, status=-2001)
This can fix test_rebuild_31/REBUILD31 problem.

For test_rebuild_32/REBUILD32 add DAOS_POOL_TGT_UPDATE_SKIP_RF_CHECK to
bypass DAOS_POOL_RF checking when changing pool map.

Fixes: DAOS-16575 DAOS-16576
Test-tag: pr test_rebuild_31 test_rebuild_32

Signed-off-by: Xuezhao Liu <xuezhao.liu@hpe.com>
@liuxuezhao liuxuezhao force-pushed the lxz/rb_test_refine branch from 495d1f5 to f2dc2f6 Compare May 11, 2026 02:39
@daosbuild3
Copy link
Copy Markdown
Collaborator

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-18204/2/execution/node/1302/log

Copy link
Copy Markdown
Contributor

@daltonbohning daltonbohning left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ftest changes LGTM

@daosbuild3
Copy link
Copy Markdown
Collaborator

Test stage Functional Hardware Large MD on SSD completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos//view/change-requests/job/PR-18204/2/testReport/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

3 participants