Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Kubelite not starting after power failure unless cgroups-per-qos=false #4378

Closed
AlexGustafsson opened this issue Jan 24, 2024 · 3 comments
Closed
Labels
inactive kind/bug Something isn't working

Comments

@AlexGustafsson
Copy link
Contributor

Summary

After the host had been shut down abruptly, microk8s (kubelite) would no longer start due to the following error:

Jan 24 19:02:35 bernd microk8s.daemon-kubelite[2373]: E0124 19:02:35.011772    2373 kubelet.go:1542] "Failed to start ContainerManager" err="failed to initialize top level QOS containers: root container [kubepods] doesn't exist"

After having applied the workaround mentioned by @neoaggelos in #4301 (comment), microk8s started.

Now microk8s cannot start without those changes.

What Should Happen Instead?

Microk8s should start without having to disable cgroups per qos.

Reproduction Steps

None.

Introspection Report

inspection-report-20240124_193532.tar.gz

@AlexGustafsson
Copy link
Contributor Author

During the last boot, before the power outage, the host had been running for a long time. microk8s had been updated from 1.26, through 1.27, 1.28 to 1.29 without a reboot. So the power cycle might just have exposed issues that would otherwise have shown.

I haven't found anything in the patch notes that suggest that there's some change in how cgroups works lately. The computer haven't been configured any different since it was working. So I'm unsure what would make cgroups misbehave (as suggested in #4301).

@neoaggelos
Copy link
Contributor

Hi @AlexGustafsson, thank you for raising this. This has been an issue we are seeing with MicroK8s 1.29 recently, see also #4361. I wonder if you are bumping into the same problem.

@neoaggelos neoaggelos added the kind/bug Something isn't working label Jan 25, 2024
Copy link

stale bot commented Dec 20, 2024

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the inactive label Dec 20, 2024
@stale stale bot closed this as completed Jan 19, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
inactive kind/bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants