Cilium problem: cni plugin not initialized
I'm at the end of Lab exercise 3.2 "Grow the cluster" and my worker node remains permanently NotReady.
My setup:
VirtualBox with network in promiscuous mode
VM with the control node: openSUSE Leap 15.5, hostname: opensuse
VM with the worker node: openSUSE MicroOS 20240229, hostname: microos

kubectl get nodes
NAME       STATUS     ROLES           AGE     VERSION
microos    NotReady   <none>          6d22h   v1.27.11
opensuse   Ready      control-plane   9d      v1.27.11
I've seen similar complaints in this forum, but they were caused by IP conflicts with Cilium's 192.168.0.0/16, which is not my case AFAICS. My /etc/hosts:
10.0.2.15 opensuse
10.0.2.4 microos
10.0.2.15 k8scp
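The overlap check mentioned above can be sketched in plain shell. This is a minimal sketch, assuming the pod range really is 192.168.0.0/16 as stated in this post; the function name in_pod_cidr is hypothetical and only illustrates the comparison.

```shell
#!/bin/sh
# Hypothetical helper: exit status 0 if the IPv4 address in $1 starts with
# 192.168. (i.e. lies inside 192.168.0.0/16), non-zero otherwise.
# Assumption: the pod range is the 192.168.0.0/16 quoted in this thread.
in_pod_cidr() {
  case "$1" in
    192.168.*) return 0 ;;
    *) return 1 ;;
  esac
}

# Node addresses taken from the /etc/hosts listing in this post.
for ip in 10.0.2.15 10.0.2.4; do
  if in_pod_cidr "$ip"; then
    echo "$ip overlaps with 192.168.0.0/16"
  else
    echo "$ip is outside 192.168.0.0/16"
  fi
done
```

A prefix match is enough here only because the range is a /16 on an octet boundary; a different prefix length would need real bit arithmetic.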
Cilium is working fine on the control node:

crictl ps
CONTAINER       IMAGE           CREATED        STATE     NAME                      ATTEMPT   POD ID          POD
0945fc48055a8   ead0a4a53df89   16 hours ago   Running   coredns                   2         7ea706d95fe31   coredns-5d78c9869d-gpc9r
3606309809790   ead0a4a53df89   16 hours ago   Running   coredns                   2         e3ceacba89bb8   coredns-5d78c9869d-ph479
4ae0666756199   33a5be5e9ebc0   16 hours ago   Running   cilium-agent              2         8c3fe5b19db2a   cilium-67wkh
a5e3701e6e70b   c961e5e7cae7b   16 hours ago   Running   cilium-operator           2         d069a8f4fbd28   cilium-operator-788c7d7585-nw9x7
a4e941333b15f   fbe39e5d66b6a   16 hours ago   Running   kube-proxy                2         896e8ef472115   kube-proxy-shcq8
e965fbad9f5fa   6468fa8f98696   16 hours ago   Running   kube-scheduler            3         f247867c4d25a   kube-scheduler-opensuse
71caba1690736   6f6e73fa8162b   16 hours ago   Running   kube-apiserver            3         5f7116c4e5188   kube-apiserver-opensuse
068bd43d6476e   c6b5118178229   16 hours ago   Running   kube-controller-manager   3         43b321f99b050   kube-controller-manager-opensuse
f30130109194b   a0eed15eed449   16 hours ago   Running   etcd                      3         fb63047307af7   etcd-opensuse
But on the worker node it is in an infinite restart loop:

crictl ps -a
CONTAINER       IMAGE           CREATED          STATE     NAME                      ATTEMPT   POD ID          POD
9add745e3a6eb   c961e5e7cae7b   16 seconds ago   Running   cilium-operator           41        ab11da22b98ef   cilium-operator-788c7d7585-z452c
a9f521243e050   33a5be5e9ebc0   2 minutes ago    Exited    install-cni-binaries      0         6d1137f51732e   cilium-9sjjt
28abb9123d681   33a5be5e9ebc0   2 minutes ago    Exited    clean-cilium-state        0         6d1137f51732e   cilium-9sjjt
8b2fecb2864c9   33a5be5e9ebc0   2 minutes ago    Exited    mount-bpf-fs              0         6d1137f51732e   cilium-9sjjt
47ae718fd829f   33a5be5e9ebc0   2 minutes ago    Exited    apply-sysctl-overwrites   0         6d1137f51732e   cilium-9sjjt
9dd78b138ad37   33a5be5e9ebc0   2 minutes ago    Exited    mount-cgroup              0         6d1137f51732e   cilium-9sjjt
ae5c70717123e   33a5be5e9ebc0   2 minutes ago    Exited    config                    42        6d1137f51732e   cilium-9sjjt
0dc297dd04fcc   fbe39e5d66b6a   2 minutes ago    Exited    kube-proxy                42        609c20e1296ad   kube-proxy-pm7g9
e7ab04008d4a5   33a5be5e9ebc0   3 minutes ago    Exited    cilium-agent              47        eb5b3b5cc0afe   cilium-9sjjt
b7e4a8902c7cf   c961e5e7cae7b   4 minutes ago    Exited    cilium-operator           40        e55810f0185e4   cilium-operator-788c7d7585-z452c
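To pick the restarting containers out of a listing like this, a small awk filter may help. This is a rough sketch, assuming the column layout shown in this post (ATTEMPT is the third field from the end, NAME the fourth, STATE the fifth); the function name restart_loopers and the default threshold of 5 are hypothetical choices, not part of crictl.

```shell
#!/bin/sh
# Hypothetical helper: reads `crictl ps -a` table output on stdin and prints
# "NAME ATTEMPT STATE" for every container whose ATTEMPT count is above $1
# (default 5). Fields are counted from the end of the line because the
# CREATED column contains spaces ("2 minutes ago").
restart_loopers() {
  awk -v min="${1:-5}" 'NR > 1 && $(NF-2) + 0 > min { print $(NF-3), $(NF-2), $(NF-4) }'
}

# Sample input: the header and two rows copied from the worker node listing.
restart_loopers 5 <<'EOF'
CONTAINER IMAGE CREATED STATE NAME ATTEMPT POD ID POD
a9f521243e050 33a5be5e9ebc0 2 minutes ago Exited install-cni-binaries 0 6d1137f51732e cilium-9sjjt
e7ab04008d4a5 33a5be5e9ebc0 3 minutes ago Exited cilium-agent 47 eb5b3b5cc0afe cilium-9sjjt
EOF
```

On a live node the same function would be fed from a pipe, for example crictl ps -a | restart_loopers 10. If the installed crictl prints different columns, the field offsets need adjusting.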
Comments
on the worker node:
journalctl -u kubelet -f
Mar 09 14:29:33 microos kubelet[6552]: E0309 14:29:33.566848 6552 kubelet.go:2760] "Container runtime network not ready" networkReady="NetworkReady=false reason:NetworkPluginNotReady message:Network plugin returns error: cni plugin not initialized"
Using crictl logs I've checked the logs of cilium-operator and they look fine until some point when it receives a "terminate" signal:
level=info msg="Establishing connection to apiserver" host="https://10.96.0.1:443" subsys=k8s-client
level=info msg="Connected to apiserver" subsys=k8s-client
...
level=info msg="attempting to acquire leader lease kube-system/cilium-operator-resource-lock..." subsys=klog
level=info msg="Leader re-election complete" newLeader=opensuse-lxMlRHFwMR operatorID=microos-HVDKTCgqWG subsys=cilium-operator-generic
level=info msg="Start hook executed" duration=6.257525ms function="*api.server.Start" subsys=hive
level=info msg="Signal received" signal=terminated subsys=hive
From cilium-agent:
level=info msg="Compiled new BPF template" ...
level=info msg="Rewrote endpoint BPF program" ...
level=info msg="Serving cilium health API at unix:///var/run/cilium/health.sock" subsys=health-server
level=info msg="Signal received" signal=terminated subsys=hive
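To confirm that a container log really contains nothing above info level before the terminate signal, the level= fields can be tallied. This is a minimal sketch that relies only on the level=... key visible in the log lines above; the function name count_levels is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: reads Cilium log lines on stdin and prints one line per
# log level in the form "level=<name> <count>".
count_levels() {
  grep -o 'level=[a-z]*' | sort | uniq -c | awk '{ print $2, $1 }'
}

# Sample input: two lines copied from the cilium-agent excerpt in this post.
count_levels <<'EOF'
level=info msg="Serving cilium health API at unix:///var/run/cilium/health.sock" subsys=health-server
level=info msg="Signal received" signal=terminated subsys=hive
EOF
```

On the node, the input would come from crictl logs with the container ID of interest; since the logs may go to stderr, redirecting with 2>&1 before the pipe is a reasonable precaution.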
Problem solved.
For anyone coming here later from search: I had to reboot the worker node after kubeadm join. Probably, to apply new sysctl settings from Cilium.
Hi @PetroKazmirchuk,
Glad you had it figured out. However, keep in mind that the lab material was written and tested on Ubuntu 20.04 LTS, to be in sync with the OS requirements of the CKA certification exam.
In addition, "k8scp" was intended to be an alias to the control plane node, not the actual hostname.
Regards,
-Chris
At work I need openSUSE, so my choice is deliberate, thanks.
And k8scp is indeed an alias.
Unfortunately, I've hit a new problem right away:
"Probe failed" for cilium-operator: Get "http://127.0.0.1:9234/healthz": dial tcp 127.0.0.1:9234: connect: connection refused
Looking into the logs of cilium-operator, I can't see any mention of a health monitoring endpoint (unlike cilium-agent, which does say "Serving cilium health API at unix:///var/run/cilium/health.sock"). I hope that's ok.
The kubelet logs show a loop of recreating cilium-agent, cilium-operator and kube-proxy, with no errors in the respective container logs. Something is sending them the "terminate" signal.
How can I troubleshoot this further?
It seems MicroOS is too exotic for Cilium. I've created a new worker node using the same openSUSE Leap as the control plane node, and Cilium started fine there.