Proposal of introducing a rebalance mechanism to actively trigger rescheduling of resource #4698

chaosi-zju · 2024-03-12T05:49:21Z

What type of PR is this?

/kind design
/kind documentation

What this PR does / why we need it:

Proposal of introducing a rebalance mechanism to actively trigger rescheduling of resource.

Assuming the user has propagated the workloads to member clusters, in some scenarios the current replicas distribution
is not the most expected, such as:

replicas migrated due to cluster failover, while now cluster recovered.
replicas migrated due to application-level failover, while now each cluster has sufficient resources to run the replicas.
as for Aggregated schedule strategy, replicas were initially distributed across multiple clusters due to resource
constraints, but now one cluster is enough to accommodate all replicas.

Therefore, the user desires for an approach to trigger rescheduling so that the replicas distribution can do a rebalance.

Which issue(s) this PR fixes:

Fixes part of #4840

Special notes for your reviewer:

Does this PR introduce a user-facing change?:

codecov-commenter · 2024-03-12T06:01:37Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 53.33%. Comparing base (5bc8c54) to head (0e1922c).
Report is 113 commits behind head on master.

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #4698      +/-   ##
==========================================
+ Coverage   53.12%   53.33%   +0.20%     
==========================================
  Files         251      252       +1     
  Lines       20417    20482      +65     
==========================================
+ Hits        10847    10924      +77     
+ Misses       8856     8836      -20     
- Partials      714      722       +8

Flag	Coverage Δ
unittests	`53.33% <ø> (+0.20%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

wu0407 · 2024-03-12T11:14:34Z

This Pr mixes fault self-healing and rescheduling. I think fault self-healing includes rescheduling, similar to when a node crashes, the workload corresponding to the pod on the node will regenerate the pod. This is completed by multiple controllers working together, including a scheduler. If the goal is self-healing, then multiple components need to be considered for coordination. If it is only rescheduling, then only the target of eviction and the conditions for stopping eviction need to be considered. Can we consider the design concept of the Descheduler project in the community

RainbowMango

/assign

docs/proposals/scheduling/reschedule-task/reschedule-task.md

chaosi-zju · 2024-05-09T12:21:41Z

I did a hard job to made a thorough improvement of this proposal, now everyone can go through it all over again, looking forward to your suggestions~

chaosi-zju · 2024-05-09T12:25:18Z

This Pr mixes fault self-healing and rescheduling.

@wu0407 Hello, I have updated this proposal. Actually, this proposal is about an entirely rescheduling, as for cluster failover is only a user story of it. For more imformation you can see in latest proposal, thank you for your comments~

docs/proposals/scheduling/workload-rebalancer/workload-rebalancer.md

…cheduling of resource. Signed-off-by: chaosi-zju <[email protected]>

RainbowMango

/lgtm
/approve

karmada-bot · 2024-05-24T01:09:09Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: RainbowMango

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [RainbowMango]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

karmada-bot added the kind/design Categorizes issue or PR as related to design. label Mar 12, 2024

karmada-bot requested review from Poor12 and Tingtal March 12, 2024 05:49

karmada-bot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Mar 12, 2024

chaosi-zju force-pushed the reschedule branch 2 times, most recently from c57f463 to edb362e Compare April 17, 2024 02:03

chaosi-zju mentioned this pull request Apr 17, 2024

Introduce a mechanism to actively trigger rescheduling #4840

Closed

chaosi-zju force-pushed the reschedule branch from edb362e to afd344f Compare April 17, 2024 10:54

RainbowMango reviewed Apr 17, 2024

View reviewed changes

karmada-bot assigned RainbowMango Apr 17, 2024

chaunceyjiang reviewed Apr 18, 2024

View reviewed changes

docs/proposals/scheduling/reschedule-task/reschedule-task.md Outdated Show resolved Hide resolved

chaosi-zju force-pushed the reschedule branch from afd344f to 2426c4b Compare April 18, 2024 10:18

RainbowMango mentioned this pull request Apr 18, 2024

Introduce a mechanism to scheduler to actively trigger rescheduling #4848

Merged

RainbowMango reviewed Apr 19, 2024

View reviewed changes

docs/proposals/scheduling/reschedule-task/reschedule-task.md Outdated Show resolved Hide resolved

docs/proposals/scheduling/reschedule-task/reschedule-task.md Outdated Show resolved Hide resolved

chaosi-zju force-pushed the reschedule branch from 2426c4b to 7fc9c12 Compare April 22, 2024 04:07

chaosi-zju force-pushed the reschedule branch from 7fc9c12 to cc77a57 Compare May 9, 2024 12:14

chaosi-zju changed the title ~~Introduce a mechanism to actively trigger rescheduling~~ Proposal of introducing a rebalance mechanism to actively trigger rescheduling of resource May 9, 2024

chaosi-zju force-pushed the reschedule branch 2 times, most recently from 1e4b127 to e7aff2a Compare May 22, 2024 09:51

RainbowMango reviewed May 23, 2024

View reviewed changes

docs/proposals/scheduling/workload-rebalancer/workload-rebalancer.md Outdated Show resolved Hide resolved

docs/proposals/scheduling/workload-rebalancer/workload-rebalancer.md Outdated Show resolved Hide resolved

Proposal of introducing a rebalance mechanism to actively trigger res…

0e1922c

…cheduling of resource. Signed-off-by: chaosi-zju <[email protected]>

chaosi-zju force-pushed the reschedule branch from e7aff2a to 0e1922c Compare May 23, 2024 11:42

RainbowMango approved these changes May 24, 2024

View reviewed changes

karmada-bot added the lgtm Indicates that a PR is ready to be merged. label May 24, 2024

karmada-bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 24, 2024

karmada-bot merged commit cee1c1a into karmada-io:master May 24, 2024
12 checks passed

weidalin mentioned this pull request Sep 11, 2024

If there are any plans for WorkloadRebalancer to support resourceSelectors, similar to what is supported in PropagationPolicy #5527

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal of introducing a rebalance mechanism to actively trigger rescheduling of resource #4698

Proposal of introducing a rebalance mechanism to actively trigger rescheduling of resource #4698

chaosi-zju commented Mar 12, 2024 •

edited

Loading

codecov-commenter commented Mar 12, 2024 •

edited

Loading

wu0407 commented Mar 12, 2024

RainbowMango left a comment

chaosi-zju commented May 9, 2024

chaosi-zju commented May 9, 2024

RainbowMango left a comment

karmada-bot commented May 24, 2024

Proposal of introducing a rebalance mechanism to actively trigger rescheduling of resource #4698

Proposal of introducing a rebalance mechanism to actively trigger rescheduling of resource #4698

Conversation

chaosi-zju commented Mar 12, 2024 • edited Loading

codecov-commenter commented Mar 12, 2024 • edited Loading

Codecov Report

wu0407 commented Mar 12, 2024

RainbowMango left a comment

Choose a reason for hiding this comment

chaosi-zju commented May 9, 2024

chaosi-zju commented May 9, 2024

RainbowMango left a comment

Choose a reason for hiding this comment

karmada-bot commented May 24, 2024

chaosi-zju commented Mar 12, 2024 •

edited

Loading

codecov-commenter commented Mar 12, 2024 •

edited

Loading