-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug: Autoscaler stuck #1627
Comments
@bernardhalas Did builder restart ? There is a ~3hours difference based on the logs
|
Yes, the pod has been force-restarted. And the messages were the same. I tried to reproduce a few times, but I couldn't. I saw similar behavior once in down-sizing the nodepool by autoscaler. But that one occurred also just once. I'll spend more time on this if the situation allows, otherwise we'll close this down as unreproducible. |
I assume the following happened, the builder service was restarted whether by you or OOM killed #1512. When this happens The manifest will not be rescheduled again in 2 hours. Which I think is wrong, there has been an issue created for it long time ago #1316 Hard to say without logs of the crashed builder pod, though |
The |
Claudie 0.9.2
Current Behaviour
Autoscaler seems stuck after
terraformer
. No signs of action inkube-eleven
.Expected Behaviour
4 nodes are added to the cluster.
Steps To Reproduce
Simple deployment with
nginx
created:kubectl create deployment nginx --image=nginx
Added
resources.requests
ofcpu: 1
andmemory: 1 Gi
. And then upscaled to 6 instances:kubectl scale deployment/nginx --replicas=6
terraformer
logs create 4 new nodes successfully.builder
contains just:And
kube-eleven
has no indication of up-scale need after the initial cluster creation:Deleting
builder
andkube-eleven
seems like doesn't change the state of things.This was executed on the following
InputManifest
:The text was updated successfully, but these errors were encountered: