@junngo junngo commented Jan 8, 2026

If the IrrelevantDataRemoval strategy encounters an empty repository while cycling data, it currently skips the repository that follows it (https://bugzilla.mozilla.org/show_bug.cgi?id=2008333).
This PR updates the strategy so that it iterates through all candidate repositories without skipping any.
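
For readers unfamiliar with this class of bug, here is a minimal, hypothetical sketch of one common way an "empty item causes the next item to be skipped" problem can arise: removing entries from a list while iterating over it. This is not the actual Treeherder code, and the real cause may differ. The function names, the `is_empty` callback, and the repository names are all assumptions made for illustration.

```python
def clean_buggy(repos, is_empty):
    """Hypothetical buggy loop: mutates the list it is iterating over."""
    cleaned = []
    for repo in repos:
        if is_empty(repo):
            # Removing shifts later items left by one, so the loop's
            # internal index now points past the next repository.
            repos.remove(repo)
            continue
        cleaned.append(repo)
    return cleaned


def clean_fixed(repos, is_empty):
    """Hypothetical fix: iterate over a snapshot and never mutate it."""
    cleaned = []
    for repo in list(repos):
        if is_empty(repo):
            continue  # nothing to clean; move on to the very next repo
        cleaned.append(repo)
    return cleaned


def is_empty(repo):
    # Assumed stand-in for "this repository has no data to remove".
    return repo == "empty-repo"


print(clean_buggy(["repo-a", "empty-repo", "repo-b", "repo-c"], is_empty))
print(clean_fixed(["repo-a", "empty-repo", "repo-b", "repo-c"], is_empty))
```

In this sketch the buggy version never processes `repo-b`, the repository immediately after the empty one, while the fixed version visits every non-empty candidate.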

However, before landing this PR, we should first finish the work to improve data cycling performance. Once the skipping bug is fixed, the previously skipped repositories will start to be cleaned up. There are about 130 such repositories, and if each one takes around 2 to 3 minutes, the total runtime could increase by roughly 4.3 to 6.5 hours.

In the meantime, the skipped data does not accumulate permanently: it is eventually removed by MainRemovalStrategy, and the amount of data this strategy targets is relatively small compared to the others.
It therefore seems reasonable to delay landing this PR until the data cycling improvements are complete.
