Cilium problem: cni plugin not initialized
I'm at the end of Lab exercise 3.2 "Grow the cluster" and my working node remains permanently NotReady.
My setup:
VirtualBox with network in promiscuous mode
VM with the control node: openSUSE Leap 15.5, hostname: opensuse
VM with the worker node: openSUSE MicroOS 20240229, hostname: microoskubectl get nodes
NAME STATUS ROLES AGE VERSION microos NotReady <none> 6d22h v1.27.11 opensuse Ready control-plane 9d v1.27.11
I've seen similar complaints in this forum, but they were caused by IP conflicts with Cilium's 192.168.0.0/16 , which is not my case AFAICS. My /etc/hosts:
10.0.2.15 opensuse 10.0.2.4 microos 10.0.2.15 k8scp
Cilium is working fine on the control node:crictl ps
CONTAINER IMAGE CREATED STATE NAME ATTEMPT POD ID POD 0945fc48055a8 ead0a4a53df89 16 hours ago Running coredns 2 7ea706d95fe31 coredns-5d78c9869d-gpc9r 3606309809790 ead0a4a53df89 16 hours ago Running coredns 2 e3ceacba89bb8 coredns-5d78c9869d-ph479 4ae0666756199 33a5be5e9ebc0 16 hours ago Running cilium-agent 2 8c3fe5b19db2a cilium-67wkh a5e3701e6e70b c961e5e7cae7b 16 hours ago Running cilium-operator 2 d069a8f4fbd28 cilium-operator-788c7d7585-nw9x7 a4e941333b15f fbe39e5d66b6a 16 hours ago Running kube-proxy 2 896e8ef472115 kube-proxy-shcq8 e965fbad9f5fa 6468fa8f98696 16 hours ago Running kube-scheduler 3 f247867c4d25a kube-scheduler-opensuse 71caba1690736 6f6e73fa8162b 16 hours ago Running kube-apiserver 3 5f7116c4e5188 kube-apiserver-opensuse 068bd43d6476e c6b5118178229 16 hours ago Running kube-controller-manager 3 43b321f99b050 kube-controller-manager-opensuse f30130109194b a0eed15eed449 16 hours ago Running etcd 3 fb63047307af7 etcd-opensuse
But on the worker node it is in infinite restart loop:crictl ps -a
CONTAINER IMAGE CREATED STATE NAME ATTEMPT POD ID POD 9add745e3a6eb c961e5e7cae7b 16 seconds ago Running cilium-operator 41 ab11da22b98ef cilium-operator-788c7d7585-z452c a9f521243e050 33a5be5e9ebc0 2 minutes ago Exited install-cni-binaries 0 6d1137f51732e cilium-9sjjt 28abb9123d681 33a5be5e9ebc0 2 minutes ago Exited clean-cilium-state 0 6d1137f51732e cilium-9sjjt 8b2fecb2864c9 33a5be5e9ebc0 2 minutes ago Exited mount-bpf-fs 0 6d1137f51732e cilium-9sjjt 47ae718fd829f 33a5be5e9ebc0 2 minutes ago Exited apply-sysctl-overwrites 0 6d1137f51732e cilium-9sjjt 9dd78b138ad37 33a5be5e9ebc0 2 minutes ago Exited mount-cgroup 0 6d1137f51732e cilium-9sjjt ae5c70717123e 33a5be5e9ebc0 2 minutes ago Exited config 42 6d1137f51732e cilium-9sjjt 0dc297dd04fcc fbe39e5d66b6a 2 minutes ago Exited kube-proxy 42 609c20e1296ad kube-proxy-pm7g9 e7ab04008d4a5 33a5be5e9ebc0 3 minutes ago Exited cilium-agent 47 eb5b3b5cc0afe cilium-9sjjt b7e4a8902c7cf c961e5e7cae7b 4 minutes ago Exited cilium-operator 40 e55810f0185e4 cilium-operator-788c7d7585-z452c
Comments
-
on the worker node:
journalctl -u kubelet -f
Mar 09 14:29:33 microos kubelet[6552]: E0309 14:29:33.566848 6552 kubelet.go:2760] "Container runtime network not ready" networkReady="NetworkReady=false reason:NetworkPluginNotReady message:Network plugin returns error: cni plugin not initialized"
Using
crictl logs
I've checked the logs of cilium-operator and they look fine until some point when it receives a "terminate" signal:level=info msg="Establishing connection to apiserver" host="https://10.96.0.1:443" subsys=k8s-client level=info msg="Connected to apiserver" subsys=k8s-client ... level=info msg="attempting to acquire leader lease kube-system/cilium-operator-resource-lock..." subsys=klog level=info msg="Leader re-election complete" newLeader=opensuse-lxMlRHFwMR operatorID=microos-HVDKTCgqWG subsys=cilium-operator-generic level=info msg="Start hook executed" duration=6.257525ms function="*api.server.Start" subsys=hive level=info msg="Signal received" signal=terminated subsys=hive
from cilium-agent:
level=info msg="Compiled new BPF template" ... level=info msg="Rewrote endpoint BPF program" ... level=info msg="Serving cilium health API at unix:///var/run/cilium/health.sock" subsys=health-server level=info msg="Signal received" signal=terminated subsys=hive
0 -
Problem solved.
For anyone coming here later from search: I had to reboot the worker node afterkubeadm join
. Probably, to apply new sysctl settings from Cilium.0 -
Hi @PetroKazmirchuk,
Glad you had it figured out. However, keep in mind that the lab material was written and tested on Ubuntu 20.04 LTS, to be in sync with the OS requirements of the CKA certification exam.
In addition, "k8scp" was intended to be an alias to the control plane node, not the actual hostname.Regards,
-Chris0 -
at work I need OpenSUSE, so my choice is deliberate, thanks.
And k8scp is indeed an alias.
Unfortunately, I've hit the new problem right away:
"Probe failed" for cilium-operatorGet \"http://127.0.0.1:9234/healthz\": dial tcp 127.0.0.1:9234: connect: connection refused"
Looking into logs of cilium-operator, I can see any mention of a health monitoring endpoint (unlike cilium-agent that does say "Serving cilium health API at unix:///var/run/cilium/health.sock", I hope it's ok)
0 -
kubelet logs show the loop of recreating cilium-agent, cilium-operator and kube-proxy with no errors in the respective container logs. Somebody is sending them the "terminate" signal.
How can I troubleshoot this further?0 -
seems like MicroOS is too exotic for Cilium I've created a new worker node using the same OpenSUSE Leap as the cp node, and Cilium started fine there
0
Categories
- All Categories
- 207 LFX Mentorship
- 207 LFX Mentorship: Linux Kernel
- 735 Linux Foundation IT Professional Programs
- 339 Cloud Engineer IT Professional Program
- 167 Advanced Cloud Engineer IT Professional Program
- 66 DevOps Engineer IT Professional Program
- 132 Cloud Native Developer IT Professional Program
- 122 Express Training Courses
- 122 Express Courses - Discussion Forum
- 5.9K Training Courses
- 40 LFC110 Class Forum - Discontinued
- 66 LFC131 Class Forum
- 39 LFD102 Class Forum
- 222 LFD103 Class Forum
- 17 LFD110 Class Forum
- 34 LFD121 Class Forum
- 17 LFD133 Class Forum
- 6 LFD134 Class Forum
- 17 LFD137 Class Forum
- 70 LFD201 Class Forum
- 3 LFD210 Class Forum
- 2 LFD210-CN Class Forum
- 2 LFD213 Class Forum - Discontinued
- 128 LFD232 Class Forum - Discontinued
- 1 LFD233 Class Forum
- 3 LFD237 Class Forum
- 23 LFD254 Class Forum
- 689 LFD259 Class Forum
- 110 LFD272 Class Forum
- 3 LFD272-JP クラス フォーラム
- 10 LFD273 Class Forum
- 109 LFS101 Class Forum
- LFS111 Class Forum
- 2 LFS112 Class Forum
- 1 LFS116 Class Forum
- 3 LFS118 Class Forum
- 3 LFS142 Class Forum
- 3 LFS144 Class Forum
- 3 LFS145 Class Forum
- 1 LFS146 Class Forum
- 2 LFS147 Class Forum
- 8 LFS151 Class Forum
- 1 LFS157 Class Forum
- 17 LFS158 Class Forum
- 5 LFS162 Class Forum
- 1 LFS166 Class Forum
- 3 LFS167 Class Forum
- 1 LFS170 Class Forum
- 1 LFS171 Class Forum
- 2 LFS178 Class Forum
- 2 LFS180 Class Forum
- 1 LFS182 Class Forum
- 4 LFS183 Class Forum
- 30 LFS200 Class Forum
- 737 LFS201 Class Forum - Discontinued
- 2 LFS201-JP クラス フォーラム
- 17 LFS203 Class Forum
- 118 LFS207 Class Forum
- 1 LFS207-DE-Klassenforum
- LFS207-JP クラス フォーラム
- 301 LFS211 Class Forum
- 55 LFS216 Class Forum
- 50 LFS241 Class Forum
- 43 LFS242 Class Forum
- 37 LFS243 Class Forum
- 13 LFS244 Class Forum
- 1 LFS245 Class Forum
- 45 LFS250 Class Forum
- 1 LFS250-JP クラス フォーラム
- LFS251 Class Forum
- 145 LFS253 Class Forum
- LFS254 Class Forum
- LFS255 Class Forum
- 6 LFS256 Class Forum
- LFS257 Class Forum
- 1.2K LFS258 Class Forum
- 9 LFS258-JP クラス フォーラム
- 116 LFS260 Class Forum
- 156 LFS261 Class Forum
- 41 LFS262 Class Forum
- 82 LFS263 Class Forum - Discontinued
- 15 LFS264 Class Forum - Discontinued
- 11 LFS266 Class Forum - Discontinued
- 23 LFS267 Class Forum
- 18 LFS268 Class Forum
- 29 LFS269 Class Forum
- 200 LFS272 Class Forum
- 1 LFS272-JP クラス フォーラム
- LFS274 Class Forum
- 3 LFS281 Class Forum
- 7 LFW111 Class Forum
- 257 LFW211 Class Forum
- 179 LFW212 Class Forum
- 12 SKF100 Class Forum
- SKF200 Class Forum
- 791 Hardware
- 199 Drivers
- 68 I/O Devices
- 37 Monitors
- 98 Multimedia
- 174 Networking
- 91 Printers & Scanners
- 85 Storage
- 754 Linux Distributions
- 82 Debian
- 67 Fedora
- 16 Linux Mint
- 13 Mageia
- 23 openSUSE
- 147 Red Hat Enterprise
- 31 Slackware
- 13 SUSE Enterprise
- 351 Ubuntu
- 465 Linux System Administration
- 39 Cloud Computing
- 71 Command Line/Scripting
- Github systems admin projects
- 91 Linux Security
- 78 Network Management
- 101 System Management
- 47 Web Management
- 56 Mobile Computing
- 17 Android
- 28 Development
- 1.2K New to Linux
- 1K Getting Started with Linux
- 366 Off Topic
- 114 Introductions
- 171 Small Talk
- 20 Study Material
- 534 Programming and Development
- 293 Kernel Development
- 223 Software Development
- 1.1K Software
- 212 Applications
- 182 Command Line
- 3 Compiling/Installing
- 405 Games
- 311 Installation
- 79 All In Program
- 79 All In Forum
Upcoming Training
-
August 20, 2018
Kubernetes Administration (LFS458)
-
August 20, 2018
Linux System Administration (LFS301)
-
August 27, 2018
Open Source Virtualization (LFS462)
-
August 27, 2018
Linux Kernel Debugging and Security (LFD440)