
Conversation

@JoshuaGabriel
Collaborator

Reads the failure domain of a pool, then upmaps any backfill_toofull PG onto another OSD chosen by % utilization.

@JoshuaGabriel JoshuaGabriel requested a review from sam0044 November 4, 2025 06:59
Signed-off-by: Joshua Blanch <joshua.blanch@clyso.com>

Problem:
Usually when a node goes down or when draining capacity, some OSDs become nearfull, which can eventually lead to PGs entering the backfill_toofull warning state.
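The core selection step the description implies (pick a less-full OSD to upmap the stuck PG onto) can be sketched roughly as below. This is a hedged illustration, not the script itself: the function name `pick_target_osd` and its signature are hypothetical, and in a real script the utilization figures would come from `ceph osd df -f json` and the remap would be applied with `ceph osd pg-upmap-items <pgid> <from-osd> <to-osd>`.

```python
def pick_target_osd(full_osd, candidates, utilization, threshold=0.85):
    """Return the least-utilized candidate OSD below the threshold, or None.

    full_osd    -- the OSD id whose fullness is causing backfill_toofull
    candidates  -- OSD ids in the pool's failure domain that could host the PG
    utilization -- mapping of OSD id -> fraction used (e.g. from `ceph osd df`)
    threshold   -- fullness ratio above which an OSD is not a valid target

    (Illustrative sketch; names and threshold are assumptions, not the PR's code.)
    """
    eligible = [
        o for o in candidates
        if o != full_osd and utilization.get(o, 1.0) < threshold
    ]
    if not eligible:
        return None  # nothing safe to upmap onto in this failure domain
    return min(eligible, key=lambda o: utilization[o])
```

A real run would then issue one `ceph osd pg-upmap-items` call per affected PG, moving it off the full OSD onto the chosen target.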
Collaborator

"Usually when a node goes down or when draining capacity" seems broken. Should it be "Usually when a node goes down or when draining with limited capacity"?

Collaborator

@sam0044 left a comment


This is a nice script to have. I would rephrase the problem statement to be a bit clearer; outside of that, the logic looks pretty solid.

@JoshuaGabriel
Collaborator Author

Actually, I don't think this takes the CRUSH rule's device class into account; I've only tried it on an all-NVMe cluster. On a mixed HDD/SSD cluster it could create an upmap to an OSD outside the PG's device class.
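The device-class concern above could be addressed by filtering candidate OSDs to those sharing the full OSD's class. A minimal sketch, assuming input shaped like the `nodes` array from `ceph osd tree -f json` (where OSD entries carry `id`, `type`, and `device_class` fields); the helper name `same_class_candidates` is hypothetical, not part of the PR:

```python
def same_class_candidates(full_osd, osd_tree_nodes):
    """Restrict upmap candidates to OSDs sharing the full OSD's device class.

    osd_tree_nodes -- list of node dicts as found in `ceph osd tree -f json`;
    non-OSD nodes (hosts, racks, the root) are skipped.
    (Illustrative sketch of the suggested fix, not the merged code.)
    """
    classes = {
        n["id"]: n.get("device_class")
        for n in osd_tree_nodes
        if n.get("type") == "osd"
    }
    cls = classes.get(full_osd)
    return [o for o, c in classes.items() if c == cls and o != full_osd]
```

Composing this with the utilization check would keep the upmap inside both the failure domain and the device class the CRUSH rule expects.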
