LAB 2.4 Adding fluentd to basic.yaml failing to start

Good day,
I am running my cp and worker nodes as VMs via Parallels Desktop on my MacBook Pro M1. After making some tweaks to the setup scripts I have my cp and worker nodes running on the Ubuntu 20.04 arm64 VMs. I've been able to expose the nginx pod to my host OS.
Now I've added the fdlogger container to my basic.yaml pod, but it is failing to start. I can only get 1/2 containers running in the pod, with kubectl get pod reporting:
- NAME READY STATUS RESTARTS AGE
- basicpod 1/2 CrashLoopBackOff 1 (4s ago) 6s
Here is my current basic.yaml:
apiVersion: v1
kind: Pod
metadata:
  name: basicpod
  labels:
    type: webserver
spec:
  containers:
  - name: webcont
    image: nginx
    ports:
    - containerPort: 80
  - name: fdlogger
    image: fluent/fluentd
Does anyone have some pointers on how I can further troubleshoot?
This is about all I've been able to find, but don't know how to look further:
- % kubectl logs --since=1h -c fdlogger basicpod
- exec /bin/entrypoint.sh: exec format error
Thank you for any assistance you can provide,
Phil
Best Answer
-
Hi @chrispokorni,
I was able to resolve my issue. It seems that nginx has put together a 'multi-platform' image, but fluentd still has multiple architecture-specific images. I appended the tag edge-debian-arm64 to the image declaration, restarted, and it worked. I'm not sure if that is the tag I ultimately want, but for now that should be fine.
I had tried edge-debian-armhf without success earlier, but missed the 'arm64' version until now.
I think the default behavior, when an image is specified without a tag, is that the pod pulls the image tagged latest. If this is indeed true, then I'm pulling a 4 year old image for amd64.
Thanks for your help.
Phil
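The default-tag behavior described above can be sketched in a few lines of shell. This is only an illustration: explicit_tag is a hypothetical helper, not part of kubectl or containerd, and it assumes the common rule that a reference with neither a tag nor a digest resolves to latest (the pull error later in this thread shows fluent/fluentd being resolved as docker.io/fluent/fluentd:latest).

```shell
#!/bin/sh
# Print an image reference with the implicit tag made explicit.
# Hypothetical helper for illustration; not part of kubectl or containerd.
explicit_tag() {
  ref="$1"
  case "$ref" in
    *@*) printf '%s\n' "$ref"; return ;;   # pinned by digest, no tag is implied
  esac
  last="${ref##*/}"                        # final path component, e.g. "fluentd"
  case "$last" in
    *:*) printf '%s\n' "$ref" ;;           # a tag is already present
    *)   printf '%s:latest\n' "$ref" ;;    # no tag: assume the "latest" default
  esac
}

explicit_tag fluent/fluentd                    # prints fluent/fluentd:latest
explicit_tag fluent/fluentd:edge-debian-arm64  # prints the reference unchanged
```

Only the last path component is checked for a colon, so a registry port such as localhost:5000/nginx is not mistaken for a tag.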
Answers
-
Hi @kelloggpa,
The logs command returns output if the container is running, and thus producing logs.
Would you be able to provide the outputs of the following commands?
kubectl describe pod basicpod
kubectl get pod -A -o wide
Regards,
-Chris
-
Hi @chrispokorni,
Thanks for your reply. Here is the information you requested
- % kubectl describe pod basicpod
- Name: basicpod
- Namespace: default
- Priority: 0
- Node: kube-worker/10.211.55.13
- Start Time: Sun, 06 Nov 2022 11:43:40 -0700
- Labels: type=webserver
- Annotations: cni.projectcalico.org/containerID: c626e7ff191220ef23aaf4699f6b6b1033e71234a2df3782a9ed1b3f261ea03a
- cni.projectcalico.org/podIP: 192.168.73.140/32
- cni.projectcalico.org/podIPs: 192.168.73.140/32
- Status: Running
- IP: 192.168.73.140
- IPs:
- IP: 192.168.73.140
- Containers:
- webcont:
- Container ID: containerd://797264fe2365e0235d4220d3ddb00b479d913bcec263d5d7aef9e3c985ddc8c3
- Image: nginx
- Image ID: docker.io/library/nginx@sha256:943c25b4b66b332184d5ba6bb18234273551593016c0e0ae906bab111548239f
- Port: 80/TCP
- Host Port: 0/TCP
- State: Running
- Started: Sun, 06 Nov 2022 11:43:41 -0700
- Ready: True
- Restart Count: 0
- Environment: <none>
- Mounts:
- /var/run/secrets/kubernetes.io/serviceaccount from kube-api-access-5wpkz (ro)
- fdlogger:
- Container ID: containerd://fefa39d4c3939f8e6a1bf2989749355f0cfa4c416204cce308ef01342074400d
- Image: fluent/fluentd
- Image ID: docker.io/fluent/fluentd@sha256:7eece00d1bc784ac1e9722b2580911cd3ead5afd740dad6594be945b3b1dd884
- Port: <none>
- Host Port: <none>
- State: Terminated
- Reason: Error
- Exit Code: 1
- Started: Sun, 06 Nov 2022 16:17:22 -0700
- Finished: Sun, 06 Nov 2022 16:17:22 -0700
- Last State: Terminated
- Reason: Error
- Exit Code: 1
- Started: Sun, 06 Nov 2022 15:50:35 -0700
- Finished: Sun, 06 Nov 2022 15:50:35 -0700
- Ready: False
- Restart Count: 23
- Environment: <none>
- Mounts:
- /var/run/secrets/kubernetes.io/serviceaccount from kube-api-access-5wpkz (ro)
- Conditions:
- Type Status
- Initialized True
- Ready False
- ContainersReady False
- PodScheduled True
- Volumes:
- kube-api-access-5wpkz:
- Type: Projected (a volume that contains injected data from multiple sources)
- TokenExpirationSeconds: 3607
- ConfigMapName: kube-root-ca.crt
- ConfigMapOptional: <nil>
- DownwardAPI: true
- QoS Class: BestEffort
- Node-Selectors: <none>
- Tolerations: node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
- node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
- Events:
- Type Reason Age From Message
- ---- ------ ---- ---- -------
- Warning BackOff 170m (x277 over 4h33m) kubelet Back-off restarting failed container
- Normal Pulling 15s (x25 over 4h33m) kubelet Pulling image "fluent/fluentd"
- % kubectl get pod -A -o wide
- NAMESPACE NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
- default basicpod 1/2 CrashLoopBackOff 23 (2m25s ago) 4h36m 192.168.73.140 kube-worker <none> <none>
- kube-system calico-kube-controllers-66bfd4dbc-9hxrg 1/1 Running 0 29h 192.168.55.131 kube-cp <none> <none>
- kube-system calico-node-5bwvx 1/1 Running 0 21h 10.211.55.13 kube-worker <none> <none>
- kube-system calico-node-rzrb4 1/1 Running 0 29h 10.211.55.12 kube-cp <none> <none>
- kube-system coredns-6d4b75cb6d-6hjb7 1/1 Running 0 29h 192.168.55.129 kube-cp <none> <none>
- kube-system coredns-6d4b75cb6d-lmrgf 1/1 Running 0 29h 192.168.55.130 kube-cp <none> <none>
- kube-system etcd-kube-cp 1/1 Running 0 29h 10.211.55.12 kube-cp <none> <none>
- kube-system kube-apiserver-kube-cp 1/1 Running 0 29h 10.211.55.12 kube-cp <none> <none>
- kube-system kube-controller-manager-kube-cp 1/1 Running 1 (22h ago) 29h 10.211.55.12 kube-cp <none> <none>
- kube-system kube-proxy-6h89h 1/1 Running 0 29h 10.211.55.12 kube-cp <none> <none>
- kube-system kube-proxy-hm87k 1/1 Running 0 21h 10.211.55.13 kube-worker <none> <none>
- kube-system kube-scheduler-kube-cp 1/1 Running 1 (22h ago) 29h 10.211.55.12 kube-cp <none> <none>
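On a longer listing, the pods that need attention can be picked out by comparing the two halves of the READY column. The following is a minimal sketch, assuming the column layout of kubectl get pod -A (NAMESPACE, NAME, READY, STATUS); not_ready is a hypothetical helper name, and the sample rows are trimmed copies of the output above so the snippet runs without a cluster.

```shell
#!/bin/sh
# List pods whose READY column shows fewer ready containers than total.
# Reads `kubectl get pod -A` style output on stdin and skips the header row.
not_ready() {
  awk 'NR > 1 { split($3, r, "/"); if (r[1] != r[2]) print $1 "/" $2, $3, $4 }'
}

# Sample rows trimmed from the output above.
not_ready <<'EOF'
NAMESPACE NAME READY STATUS RESTARTS AGE
default basicpod 1/2 CrashLoopBackOff 23 4h36m
kube-system etcd-kube-cp 1/1 Running 0 29h
EOF
# prints: default/basicpod 1/2 CrashLoopBackOff
```

Against a live cluster the same function would be used as kubectl get pod -A | not_ready.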
-
Same issue on my arm64 laptop. This entry saved me a few hours of googling. Thanks!
-
Same here. Without any tag, I get an image not found error. I am using Google Compute nodes with Ubuntu 20.04.
- Events:
- Type Reason Age From Message
- ---- ------ ---- ---- -------
- Normal Scheduled 17s default-scheduler Successfully assigned default/basicpod to worker
- Normal Pulling 16s kubelet Pulling image "nginx"
- Normal Pulled 16s kubelet Successfully pulled image "nginx" in 294.211729ms (294.23196ms including waiting)
- Normal Created 16s kubelet Created container webcont
- Normal Started 16s kubelet Started container webcont
- Normal Pulling 16s kubelet Pulling image "fluent/fluentd"
- Warning Failed 16s kubelet Failed to pull image "fluent/fluentd": rpc error: code = NotFound desc = failed to pull and unpack image "docker.io/fluent/fluentd:latest": failed to resolve reference "docker.io/fluent/fluentd:latest": docker.io/fluent/fluentd:latest: not found
- Warning Failed 16s kubelet Error: ErrImagePull
- Normal BackOff 14s (x2 over 15s) kubelet Back-off pulling image "fluent/fluentd"
- Warning Failed 14s (x2 over 15s) kubelet Error: ImagePullBackOff
And when using the above-mentioned tag edge-debian-arm64, it successfully pulls the image but is unable to start the flogger container:
- Events:
- Type Reason Age From Message
- ---- ------ ---- ---- -------
- Normal Scheduled 7m39s default-scheduler Successfully assigned default/basicpod to worker
- Normal Pulling 7m38s kubelet Pulling image "nginx"
- Normal Pulled 7m38s kubelet Successfully pulled image "nginx" in 355.034454ms (355.095081ms including waiting)
- Normal Created 7m38s kubelet Created container webcont
- Normal Started 7m38s kubelet Started container webcont
- Normal Pulling 7m38s kubelet Pulling image "fluent/fluentd:edge-debian-arm64"
- Normal Pulled 7m30s kubelet Successfully pulled image "fluent/fluentd:edge-debian-arm64" in 7.670395324s (7.670421793s including waiting)
- Normal Created 6m44s (x4 over 7m30s) kubelet Created container flogger
- Normal Started 6m44s (x4 over 7m30s) kubelet Started container flogger
- Normal Pulled 6m44s (x3 over 7m28s) kubelet Container image "fluent/fluentd:edge-debian-arm64" already present on machine
- Warning BackOff 2m31s (x24 over 7m27s) kubelet Back-off restarting failed container flogger in pod basicpod_default(a83c00d0-0c5b-4a13-a947-72d44cf0b4cd)
-
With fluent/fluentd:edge-debian-arm64 I get the same issue as @ashfaqahmed above:
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal Scheduled 83s default-scheduler Successfully assigned default/basicpod to worker
Normal Pulling 82s kubelet Pulling image "nginx"
Normal Pulled 82s kubelet Successfully pulled image "nginx" in 327.712348ms (327.721573ms including waiting)
Normal Created 82s kubelet Created container webcont
Normal Started 82s kubelet Started container webcont
Normal Pulling 82s kubelet Pulling image "fluent/fluentd:edge-debian-arm64"
Normal Pulled 75s kubelet Successfully pulled image "fluent/fluentd:edge-debian-arm64" in 6.607935264s (6.607946163s including waiting)
Normal Created 37s (x4 over 75s) kubelet Created container fdlogger
Normal Started 37s (x4 over 75s) kubelet Started container fdlogger
Normal Pulled 37s (x3 over 73s) kubelet Container image "fluent/fluentd:edge-debian-arm64" already present on machine
Warning BackOff 11s (x6 over 72s) kubelet Back-off restarting failed container fdlogger in pod basicpod_default(38cc413b-0e44-471a-949f-8a8d4062a513)
-
@chrispokorni - first, thank you for your input above. I would never have found an answer without those clues.
After a bit of poking around (and searching on Chris' tag), I found edge-debian. I am on GCE, where the architecture is x86-64, and honestly I was guessing a bit. But I tried it and it worked.
https://hub.docker.com/r/fluent/fluentd/
"v1.16.1-debian-1.0, v1.16-debian-1, edge-debian (multiarch image for arm64(AArch64) and amd64(x86_64))"
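For tags that are architecture-specific rather than multiarch, the suffix has to match the node's CPU. As a rough sketch, the machine name reported by uname -m on the node can be mapped to the suffixes mentioned in this thread. image_arch is a hypothetical helper, and the mapping covers only the names discussed here (armhf being the 32-bit ARM variant).

```shell
#!/bin/sh
# Map a `uname -m` machine name to the architecture suffix used in image tags.
# Hypothetical helper; covers only the architectures mentioned in this thread.
image_arch() {
  case "$1" in
    aarch64|arm64) echo arm64 ;;
    x86_64|amd64)  echo amd64 ;;
    armv7l)        echo armhf ;;    # 32-bit ARM, hard-float
    *)             echo unknown ;;
  esac
}

# Run on the node that will host the pod.
image_arch "$(uname -m)"
```

An arm64 VM reports aarch64, which is consistent with edge-debian-arm64 working where edge-debian-armhf did not; on a GCE x86-64 node the result would be amd64.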
-
Hi, I'm also having trouble running the fluentd.
I receive:
NAME READY STATUS RESTARTS AGE
basicpod 1/2 ImagePullBackOff 0 10s
What should I do?
-
Hi @ashfaqahmed, @mkevinmchugh, @zite,
It seems that on Docker Hub the Fluentd image repositories have been reorganized following the release of fluentd v1.
Please update basic.yaml with image: fluentd instead. It runs fluentd v0, the version tested in this lab exercise and in later lab exercise 5.3. It seems it is no longer maintained; however, the image: fluent/fluentd:edge-debian running fluentd v1 may not work in later lab exercise 5.3.
Regards,
-Chris
-
Setting fluent/fluentd:v1.16.2-1.0 resolved the issue for me:
$ cat <<'HERE' | kubectl create --filename -
apiVersion: v1
kind: Pod
metadata:
  name: basicpod
  labels:
    type: webserver
spec:
  containers:
  - name: webcont
    image: nginx
    ports:
    - containerPort: 80
  - name: fdlogger
    image: fluent/fluentd:v1.16.2-1.0
HERE
pod/basicpod created
$ kubectl get pod basicpod
NAME       READY   STATUS    RESTARTS   AGE
basicpod   2/2     Running   0          38s