Office hours - Jun 8 (LFS242 / LFS258)

chrispokorni Posts: 2,372

June 2023 in Advanced Cloud Engineer IT Professional Program

Hello,

We had a visitor today share his experience with a known etcd bug. The bug has reportedly been filed on GitHub against etcd v3.4: the leader control plane nodes of a highly available Kubernetes cluster would randomly cordon themselves in production and become unavailable, forcing the remaining etcd instances into a leader election.

etcd v3.4 is part of Kubernetes release 1.22. One recommended fix is to upgrade only etcd to v3.5 or later while keeping the production cluster at v1.22. This calls, however, for careful compatibility testing between the more recent etcd release and the older kube-apiserver. Another recommendation is a full cluster upgrade to Kubernetes v1.23, which also installs etcd v3.5+, a release that no longer exhibits the known bug from etcd v3.4.
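For anyone wanting to check which etcd members would be candidates for such an upgrade, here is a minimal Python sketch. It assumes you have already collected each member's version string yourself (for example from the etcd pod image tag); the member names and versions below are hypothetical, and the 3.5 threshold is simply the one mentioned above, not something this snippet verifies.

```python
import re

# Assumption: v3.5 is the first minor release without the bug, as described above.
MIN_FIXED = (3, 5)


def parse_version(text):
    """Return (major, minor, patch) from strings like 'v3.4.13' or '3.5.1-0'."""
    match = re.match(r"v?(\d+)\.(\d+)(?:\.(\d+))?", text.strip())
    if match is None:
        raise ValueError(f"unrecognized version string: {text!r}")
    major, minor, patch = match.groups()
    return int(major), int(minor), int(patch or 0)


def needs_upgrade(etcd_version, minimum=MIN_FIXED):
    """True when the member's major.minor is below the minimum fixed release."""
    return parse_version(etcd_version)[:2] < minimum


# Hypothetical member names and versions, for illustration only.
members = {"cp1": "3.4.13", "cp2": "v3.4.13", "cp3": "3.5.1-0"}
for name, version in members.items():
    status = "upgrade candidate" if needs_upgrade(version) else "ok"
    print(f"{name}: etcd {version} -> {status}")
```

This only compares version numbers. It does not replace the compatibility testing between etcd and kube-apiserver mentioned above.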

While not entirely related to the LFS258 Kubernetes Fundamentals course, this seemed to be an interesting topic worth sharing.

Regards,
-Chris
