Skip to content

Conversation

@Nasf-Fan
Copy link
Contributor

@Nasf-Fan Nasf-Fan commented Dec 30, 2025

Include the followings:

  1. When create CHK IV namespace, make the secondary group to be same as
    the primary group. Otherwise, CHK logic may hit DER_NONEXIST trouble
    when communicate via IV.

  2. Integrate CHK IV namespace create and destroy API, cleanup related
    logic, redefine the version.

  3. Get ranks list and IV namespace version from CHK leader when rejoin.
    Adjust CHK_REJOIN RPC for related changes.

  4. Remove unsupported functionality for checking the specified 'phase'.

  5. Add new test for case of lost some engine(s) before start checker.

Test-tag: recovery

Signed-off-by: Fan Yong fan.yong@hpe.com

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

@github-actions
Copy link

Ticket title is 'DAOS checker cannot completed on Aurora after some engines excluded'
Status is 'In Progress'
Labels: 'scrubbed_2.6.5'
https://daosio.atlassian.net/browse/DAOS-17535

@daosbuild3
Copy link
Collaborator

@Nasf-Fan Nasf-Fan changed the title DAOS-17535 chk: secondary group should be same as primary group for C… DAOS-17535 chk: misc improvements for CR logic Dec 31, 2025
@Nasf-Fan Nasf-Fan force-pushed the Nasf-Fan/DAOS-17535_7 branch from 8e4ad6a to 639a8ec Compare December 31, 2025 03:16
@daosbuild3
Copy link
Collaborator

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17329/1/execution/node/1388/log

@daosbuild3
Copy link
Collaborator

@Nasf-Fan Nasf-Fan force-pushed the Nasf-Fan/DAOS-17535_7 branch from 639a8ec to 78579dd Compare December 31, 2025 07:33
@daosbuild3
Copy link
Collaborator

@Nasf-Fan Nasf-Fan force-pushed the Nasf-Fan/DAOS-17535_7 branch 2 times, most recently from aa39da7 to 476d0f9 Compare January 1, 2026 03:09
@daosbuild3
Copy link
Collaborator

@daosbuild3
Copy link
Collaborator

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17329/5/execution/node/1324/log

Include the followings:

1. When create CHK IV namespace, make the secondary group to be same as
   the primary group. Otherwise, CHK logic may hit DER_NONEXIST trouble
   when communicate via IV.

2. Integrate CHK IV namespace create and destroy API, cleanup related
   logic, redefine the version.

3. Get ranks list and IV namespace version from CHK leader when rejoin.
   Adjust CHK_REJOIN RPC for related changes.

4. Remove unsupported functionality for checking the specified 'phase'.

5. Add new test for case of lost some engine(s) before start checker.

Test-tag: recovery

Signed-off-by: Fan Yong <fan.yong@hpe.com>
@Nasf-Fan Nasf-Fan force-pushed the Nasf-Fan/DAOS-17535_7 branch from 476d0f9 to 09aaf91 Compare January 4, 2026 02:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

3 participants