Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug] Flakes of inconsistency between RayService Object's nested RayCluster state and RayCluster Object state #2795

Open
1 of 2 tasks
weizhaowz opened this issue Jan 21, 2025 · 3 comments
Labels
bug Something isn't working triage

Comments

@weizhaowz
Copy link

Search before asking

  • I searched the issues and found no similar issues.

KubeRay Component

ray-operator

What happened + What you expected to happen

There are some flakes in e2e tests. After the creation of a ray cluster, there might be some inconsistencies between the RayService Object's nested RayCluster state and the RayCluster Object state, in details, the RayService.status.activeServiceStatus.rayClusterStatus.state is unset, but RayCluster.status.state is ready, and the application's status is ready as well.
While the the state field is replaced with Conditions, should we switch to Conditions in the e2e tests as well?

Reproduction script

This bug is a flakiness issue.

  1. create a RayService in an e2e test;
  2. query the values of ayService.status.activeServiceStatus.rayClusterStatus.state and RayCluster.status.state

Anything else

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!
@weizhaowz weizhaowz added bug Something isn't working triage labels Jan 21, 2025
@andrewsykim
Copy link
Collaborator

@kevin85421 @rueian do you know if this bug overlaps with any of the RayService refactorign we're doing now?

@rueian
Copy link
Contributor

rueian commented Jan 22, 2025

I believe we don't cover this issue in the current RayService refactoring because the refactoring is focusing on the RayService reconciliation while the only way to solve the inconsistency is to update RayService.status.activeServiceStatus.rayClusterStatus from the RayCluster reconciliation.

@weizhaowz
Copy link
Author

Do we switch to the field of Conditions or continue to us the field of state for a while?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working triage
Projects
None yet
Development

No branches or pull requests

3 participants