Hostname mismatch causing kubelet issue after restarting
Hello,
I just want to share an issue I had at step 3.2.11, after restarting the nodes. I was unable to reach the registry address. After a bit of troubleshooting, I noticed that both my nodes where "NotReady". When checking their status and that the kubelet service kept restarting:
01:21:42 student@cp ~ → journalctl -xeu kubelet --no-pager | tail -50 Feb 05 13:22:30 cp.<redacted>.internal kubelet[40281]: E0205 13:22:30.110490 40281 reconstruct.go:189] "Failed to get Node status to reconstruct device paths" err="nodes \"cp.us-central1-a.c.<redacted>.internal\" not found"
The issue was that the hostname is the full FQDN cp.us-central1-a.c..internal, while my node in Kubernetes is registered as just cp. After the reboot, kubelet is using the FQDN and can't find itself in the cluster.
Solution: Add hostname-override to kubelet
On the control plane node, run:
sudo vi /var/lib/kubelet/kubeadm-flags.env
Change this line:
KUBELET_KUBEADM_ARGS="--pod-infra-container-image=registry.k8s.io/pause:3.10.1"
To this:
KUBELET_KUBEADM_ARGS="--pod-infra-container-image=registry.k8s.io/pause:3.10.1 --hostname-override=cp"
Then restart kubelet:
sudo systemctl restart kubelet
Finally my nodes were "Ready" when running:kubectl get nodes
Not sure why this happened, my nodes have the proper hostname in GCE.
Categories
- All Categories
- 177 LFX Mentorship
- 177 LFX Mentorship: Linux Kernel
- 754 Linux Foundation IT Professional Programs
- 374 Cloud Engineer IT Professional Program
- 170 Advanced Cloud Engineer IT Professional Program
- 74 DevOps IT Professional Program - Discontinued
- 5 DevOps & GitOps IT Professional Program
- 100 Cloud Native Developer IT Professional Program
- 7.6K Training Courses & Learning Paths
- 2 AI & ML Training
- 1 Blockchain & Decentralized Identity Training
- 5 Cloud & Containers Training
- 1 Cybersecurity Training
- 2 DevOps & Site-Reliability Training
- 1 Linux Kernel Development Training
- 1 Networking Training
- 2 Open Source Best Practice Training
- 2 System Administration Training
- 1 System Engineering Training
- 1 Web & Application Development Training
- 794 Hardware
- 202 Drivers
- 68 I/O Devices
- 37 Monitors
- 95 Multimedia
- 173 Networking
- 91 Printers & Scanners
- 89 Storage
- 769 Linux Distributions
- 81 Debian
- 68 Fedora
- 22 Linux Mint
- 13 Mageia
- 24 openSUSE
- 150 Red Hat Enterprise
- 31 Slackware
- 13 SUSE Enterprise
- 356 Ubuntu
- 465 Linux System Administration
- 31 Cloud Computing
- 73 Command Line/Scripting
- Github systems admin projects
- 98 Linux Security
- 78 Network Management
- 101 System Management
- 46 Web Management
- 112 Mobile Computing
- 20 Android
- 77 Development
- 1.2K New to Linux
- 1K Getting Started with Linux
- 393 Off Topic
- 121 Introductions
- 182 Small Talk
- 29 Study Material
- 976 Programming and Development
- 310 Kernel Development
- 648 Software Development
- 990 Software
- 382 Applications
- 182 Command Line
- 5 Compiling/Installing
- 68 Games
- 317 Installation
- Archived
- 2 LFD140 Class Forum
- 1.4K LFS258 Class Forum
Upcoming Training
-
August 20, 2018
Kubernetes Administration (LFS458)
-
August 20, 2018
Linux System Administration (LFS301)
-
August 27, 2018
Open Source Virtualization (LFS462)
-
August 27, 2018
Linux Kernel Debugging and Security (LFD440)