coreDNS CrashLoopBackoff

I have set up the cluster without error and am running through Lab 2.1. I noticed that the pods for coreDNS are failing.
I am running the nodes on bare metal.
Debug info is:
**kubectl get pod -n kube-system
- NAME READY STATUS RESTARTS AGE
- calico-etcd-wr2cf 1/1 Running 3 13h
- calico-kube-controllers-57c8947c94-g2lbc 1/1 Running 3 13h
- calico-node-lsjm9 2/2 Running 17 13h
- calico-node-zhgnd 2/2 Running 9 13h
- coredns-576cbf47c7-56thg 0/1 CrashLoopBackOff 54 13h
- coredns-576cbf47c7-nmznf 0/1 CrashLoopBackOff 54 13h
- etcd-nuc1 1/1 Running 4 13h
- kube-apiserver-nuc1 1/1 Running 4 13h
- kube-controller-manager-nuc1 1/1 Running 3 13h
- kube-proxy-ct89j 1/1 Running 3 13h
- kube-proxy-lbdxr 1/1 Running 5 13h
- kube-scheduler-nuc1 1/1 Running 3 13h
kubectl describe pods -n kube-system coredns-576cbf47c7-56thg
- Name: coredns-576cbf47c7-56thg
- Namespace: kube-system
- Priority: 0
- PriorityClassName: <none>
- Node: nuc1/10.10.0.53
- Start Time: Sat, 29 Dec 2018 23:06:32 +1100
- Labels: k8s-app=kube-dns
- pod-template-hash=576cbf47c7
- Annotations: <none>
- Status: Running
- IP: 192.168.21.71
- Controlled By: ReplicaSet/coredns-576cbf47c7
- Containers:
- coredns:
- Container ID: docker://5491ac6a53be7f653036af7baaecfb318679882d3ad4b60c7c02b8846f3a4f9d
- Image: k8s.gcr.io/coredns:1.2.2
- Image ID: docker-pullable://k8s.gcr.io/coredns@sha256:3e2be1cec87aca0b74b7668bbe8c02964a95a402e45ceb51b2252629d608d03a
- Ports: 53/UDP, 53/TCP, 9153/TCP
- Host Ports: 0/UDP, 0/TCP, 0/TCP
- Args:
- -conf
- /etc/coredns/Corefile
- State: Waiting
- Reason: CrashLoopBackOff
- Last State: Terminated
- Reason: Error
- Exit Code: 1
- Started: Sun, 30 Dec 2018 12:39:49 +1100
- Finished: Sun, 30 Dec 2018 12:39:50 +1100
- Ready: False
- Restart Count: 54
- Limits:
- memory: 170Mi
- Requests:
- cpu: 100m
- memory: 70Mi
- Liveness: http-get http://:8080/health delay=60s timeout=5s period=10s #success=1 #failure=5
- Environment: <none>
- Mounts:
- /etc/coredns from config-volume (ro)
- /var/run/secrets/kubernetes.io/serviceaccount from coredns-token-zwdp6 (ro)
- Conditions:
- Type Status
- Initialized True
- Ready False
- ContainersReady False
- PodScheduled True
- Volumes:
- config-volume:
- Type: ConfigMap (a volume populated by a ConfigMap)
- Name: coredns
- Optional: false
- coredns-token-zwdp6:
- Type: Secret (a volume populated by a Secret)
- SecretName: coredns-token-zwdp6
- Optional: false
- QoS Class: Burstable
- Node-Selectors: <none>
- Tolerations: CriticalAddonsOnly
- node-role.kubernetes.io/master:NoSchedule
- node.kubernetes.io/not-ready:NoExecute for 300s
- node.kubernetes.io/unreachable:NoExecute for 300s
- Events:
- Type Reason Age From Message
- ---- ------ ---- ---- -------
- Normal Scheduled 13h default-scheduler Successfully assigned kube-system/coredns-576cbf47c7-56thg to nuc1
- Warning NetworkNotReady 13h (x8 over 13h) kubelet, nuc1 network is not ready: [runtime network not ready: NetworkReady=false reason:NetworkPluginNotReady message:docker: network plugin is not ready: cni config uninitialized]
- Normal Pulled 13h (x4 over 13h) kubelet, nuc1 Container image "k8s.gcr.io/coredns:1.2.2" already present on machine
- Normal Created 13h (x4 over 13h) kubelet, nuc1 Created container
- Normal Started 13h (x4 over 13h) kubelet, nuc1 Started container
- Warning BackOff 12h (x255 over 13h) kubelet, nuc1 Back-off restarting failed container
- Normal SandboxChanged 3h31m (x2 over 3h31m) kubelet, nuc1 Pod sandbox changed, it will be killed and re-created.
- Warning BackOff 3h31m (x3 over 3h31m) kubelet, nuc1 Back-off restarting failed container
- Normal Pulled 3h30m (x2 over 3h31m) kubelet, nuc1 Container image "k8s.gcr.io/coredns:1.2.2" already present on machine
- Normal Created 3h30m (x2 over 3h31m) kubelet, nuc1 Created container
- Normal Started 3h30m (x2 over 3h31m) kubelet, nuc1 Started container
- Normal Pulled 3h29m (x4 over 3h30m) kubelet, nuc1 Container image "k8s.gcr.io/coredns:1.2.2" already present on machine
- Normal Created 3h29m (x4 over 3h30m) kubelet, nuc1 Created container
- Normal Started 3h29m (x4 over 3h30m) kubelet, nuc1 Started container
- Warning BackOff 3h5m (x124 over 3h30m) kubelet, nuc1 Back-off restarting failed container
- Warning FailedMount 92m kubelet, nuc1 MountVolume.SetUp failed for volume "coredns-token-zwdp6" : couldn't propagate object cache: timed out waiting for the condition
- Normal SandboxChanged 91m (x2 over 92m) kubelet, nuc1 Pod sandbox changed, it will be killed and re-created.
- Normal Pulled 90m (x4 over 91m) kubelet, nuc1 Container image "k8s.gcr.io/coredns:1.2.2" already present on machine
- Normal Created 90m (x4 over 91m) kubelet, nuc1 Created container
- Normal Started 90m (x4 over 91m) kubelet, nuc1 Started container
- Warning BackOff 57m (x169 over 91m) kubelet, nuc1 Back-off restarting failed container
- Normal SandboxChanged 49m (x2 over 50m) kubelet, nuc1 Pod sandbox changed, it will be killed and re-created.
- Normal Pulled 47m (x4 over 49m) kubelet, nuc1 Container image "k8s.gcr.io/coredns:1.2.2" already present on machine
- Normal Created 47m (x4 over 49m) kubelet, nuc1 Created container
- Normal Started 47m (x4 over 49m) kubelet, nuc1 Started container
- Warning BackOff 4m49s (x214 over 49m) kubelet, nuc1 Back-off restarting failed container
Comments
-
@bryonbaker ,
You can try to delete the 2 coredns pods, and they will be re-created.
Are you in Lab 2.1 of LFD259?
Thanks,
-Chris0 -
Hi,
The issue is actually thoroughly documented in the CoreDNS web site. It is caused because CoreDNS is detecting a loopback and it terminates. It is expected behaviour.The solution is to change the DNS setting in /etc/resolv.conf. For those using Ubuntu I have documented what to do here as it can be tricky - especially with Ubuntu Desktop edition.
There are other ways to solve it but in the end I set up an external DNS server with bind9 for resolving hostnames. Overkill I know.1
Categories
- All Categories
- 138 LFX Mentorship
- 138 LFX Mentorship: Linux Kernel
- 815 Linux Foundation IT Professional Programs
- 366 Cloud Engineer IT Professional Program
- 184 Advanced Cloud Engineer IT Professional Program
- 83 DevOps Engineer IT Professional Program
- 151 Cloud Native Developer IT Professional Program
- 142 Express Training Courses & Microlearning
- 142 Express Courses - Discussion Forum
- Microlearning - Discussion Forum
- 6.6K Training Courses
- 48 LFC110 Class Forum - Discontinued
- 72 LFC131 Class Forum
- 49 LFD102 Class Forum
- 234 LFD103 Class Forum
- 21 LFD110 Class Forum
- 44 LFD121 Class Forum
- LFD123 Class Forum
- LFD125 Class Forum
- 18 LFD133 Class Forum
- 8 LFD134 Class Forum
- 18 LFD137 Class Forum
- 72 LFD201 Class Forum
- 5 LFD210 Class Forum
- 5 LFD210-CN Class Forum
- 2 LFD213 Class Forum - Discontinued
- 128 LFD232 Class Forum - Discontinued
- 2 LFD233 Class Forum
- 4 LFD237 Class Forum
- 24 LFD254 Class Forum
- 719 LFD259 Class Forum
- 111 LFD272 Class Forum - Discontinued
- 4 LFD272-JP クラス フォーラム
- 13 LFD273 Class Forum
- 238 LFS101 Class Forum
- 2 LFS111 Class Forum
- 3 LFS112 Class Forum
- 3 LFS116 Class Forum
- 7 LFS118 Class Forum
- LFS120 Class Forum
- 9 LFS142 Class Forum
- 8 LFS144 Class Forum
- 4 LFS145 Class Forum
- 4 LFS146 Class Forum
- 16 LFS148 Class Forum
- 15 LFS151 Class Forum
- 5 LFS157 Class Forum
- 65 LFS158 Class Forum
- LFS158-JP クラス フォーラム
- 11 LFS162 Class Forum
- 2 LFS166 Class Forum
- 7 LFS167 Class Forum
- 3 LFS170 Class Forum
- 2 LFS171 Class Forum
- 3 LFS178 Class Forum
- 3 LFS180 Class Forum
- 2 LFS182 Class Forum
- 5 LFS183 Class Forum
- 33 LFS200 Class Forum
- 737 LFS201 Class Forum - Discontinued
- 3 LFS201-JP クラス フォーラム - Discontinued
- 20 LFS203 Class Forum
- 135 LFS207 Class Forum
- 2 LFS207-DE-Klassenforum
- 2 LFS207-JP クラス フォーラム
- 302 LFS211 Class Forum
- 56 LFS216 Class Forum
- 53 LFS241 Class Forum
- 50 LFS242 Class Forum
- 38 LFS243 Class Forum
- 16 LFS244 Class Forum
- 5 LFS245 Class Forum
- LFS246 Class Forum
- LFS248 Class Forum
- 87 LFS250 Class Forum
- 2 LFS250-JP クラス フォーラム
- 1 LFS251 Class Forum
- 156 LFS253 Class Forum
- 1 LFS254 Class Forum
- 2 LFS255 Class Forum
- 12 LFS256 Class Forum
- 1 LFS257 Class Forum
- 1.3K LFS258 Class Forum
- 11 LFS258-JP クラス フォーラム
- 135 LFS260 Class Forum
- 161 LFS261 Class Forum
- 43 LFS262 Class Forum
- 82 LFS263 Class Forum - Discontinued
- 15 LFS264 Class Forum - Discontinued
- 11 LFS266 Class Forum - Discontinued
- 24 LFS267 Class Forum
- 25 LFS268 Class Forum
- 37 LFS269 Class Forum
- 7 LFS270 Class Forum
- 202 LFS272 Class Forum - Discontinued
- 2 LFS272-JP クラス フォーラム
- 4 LFS147 Class Forum
- 2 LFS274 Class Forum
- 4 LFS281 Class Forum
- 16 LFW111 Class Forum
- 262 LFW211 Class Forum
- 185 LFW212 Class Forum
- 15 SKF100 Class Forum
- 1 SKF200 Class Forum
- 2 SKF201 Class Forum
- 797 Hardware
- 199 Drivers
- 68 I/O Devices
- 37 Monitors
- 104 Multimedia
- 174 Networking
- 91 Printers & Scanners
- 85 Storage
- 761 Linux Distributions
- 82 Debian
- 67 Fedora
- 17 Linux Mint
- 13 Mageia
- 23 openSUSE
- 149 Red Hat Enterprise
- 31 Slackware
- 13 SUSE Enterprise
- 355 Ubuntu
- 470 Linux System Administration
- 39 Cloud Computing
- 71 Command Line/Scripting
- Github systems admin projects
- 95 Linux Security
- 78 Network Management
- 102 System Management
- 47 Web Management
- 69 Mobile Computing
- 18 Android
- 38 Development
- 1.2K New to Linux
- 1K Getting Started with Linux
- 379 Off Topic
- 115 Introductions
- 177 Small Talk
- 26 Study Material
- 808 Programming and Development
- 304 Kernel Development
- 486 Software Development
- 1.8K Software
- 263 Applications
- 183 Command Line
- 3 Compiling/Installing
- 988 Games
- 317 Installation
- 103 All In Program
- 103 All In Forum
Upcoming Training
-
August 20, 2018
Kubernetes Administration (LFS458)
-
August 20, 2018
Linux System Administration (LFS301)
-
August 27, 2018
Open Source Virtualization (LFS462)
-
August 27, 2018
Linux Kernel Debugging and Security (LFD440)