"No overlay network" -> "overlay network" migration does not work #213
Labels
area/networking
Networking related
kind/bug
Bug
lifecycle/rotten
Nobody worked on this for 12 months (final aging stage)
How to categorize this issue?
/area networking
/kind bug
What happened:
We discovered the following issue as part of the investigations about the impact of gardener/gardener-extension-provider-aws#621. In the context of this bug:
We discovered that after such migration for multi-zone clusters (let's say zone-1, and zone-2) the Pods running in zone-2 for example cannot reach the Pods running in zone-1.
In details:
The DNS resolution fails because the debug-pod is not able to reach the coredns Pods to DNS resolution.
After investigation we found that there is a
ippools.crd.projectcalico.org
resource. And theipipMode
field in this resource is set toNever
:ipipMode: Never
would mean "no overlay" network, but the cluster should already use overlay network. We can also verify from the control plane monitoring that thetunl0
device does not have any network traffic.We had to manually patch the
ipipMode
field to beAlways
. After that the Pod to Pod communication between zones worked again. The control plane monitoring started reporting network traffic for thetunl0
device.What you expected to happen:
The "no overlay network" -> "overlay network" migration to work without issues or a validation to be present to forbid such update or all known issues to be documented.
How to reproduce it (as minimally and precisely as possible):
See above.
Anything else we need to know?:
Environment:
v1.26.0
kubectl version
):The text was updated successfully, but these errors were encountered: