Validation of cluster configuration
My system does not match what the lab shows, so I have split this into two parts to check whether I am off by a little or a lot.
(All on the Master server)
1 -
student@lfs458-node-1a0a:~$ kubectl get node
NAME               STATUS   ROLES    AGE   VERSION
lfs458-node-1a0a   Ready    master   28m   v1.14.1
lfs458-worker      Ready    <none>   50s   v1.14.1
I only have the master node, no worker
2 -
student@lfs458-node-1a0a:~$ kubectl get pods --all-namespaces
NAMESPACE     NAME                                       READY   STATUS    RESTARTS   AGE
kube-system   calico-etcd-jlgwr                          1/1     Running   0          6m
kube-system   calico-kube-controllers-74b888b647-wlqf5   1/1     Running   0          6m
kube-system   calico-node-tpvnr                          2/2     Running   0          6m
kube-system   coredns-78fcdf6894-nc5cn                   1/1     Running   0          17m
kube-system   coredns-78fcdf6894-xs96m                   1/1     Running   0          17m
I have many in the ContainerCreating state:
student@ubuntu:~$ kubectl get pods --all-namespaces
NAMESPACE     NAME                      READY   STATUS              RESTARTS   AGE
kube-system   calico-node-wgfmw         0/2     ContainerCreating   4          2d1h
kube-system   coredns-fb8b8dccf-7dvgf   0/1     ContainerCreating   322        2d1h
kube-system   coredns-fb8b8dccf-psbj2   0/1     ContainerCreating   318        2d1h
kube-system   etcd-ubuntu               0/1     Pending             0          0s
kube-system   kube-proxy-ncm5q          0/1     ContainerCreating   1          2d1h
Now I think something is wrong. The kubeadm join command worked, I generated a key and all those steps went as stated. So either I missed something or I did something wrong. Any suggestions? Or is this all fine and I need to delete the two coredns pods and let them get re-created?
Thank you.
Comments
-
Hi,
The ContainerCreating state indicates that the images cannot be pulled from the Docker repository, and the fact that you can't see your worker node indicates that the join did not work.
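To see why a particular pod is stuck, the Events section of `kubectl describe pod` usually names the step that is failing. The following is a minimal sketch, assuming bash and a working kubectl on the master; the helper name `stuck_pods` is hypothetical and only filters the listing shown above.

```shell
#!/usr/bin/env bash
# stuck_pods (hypothetical helper): read `kubectl get pods --all-namespaces`
# output on stdin and print "namespace name" for every pod whose STATUS
# column is neither Running nor Completed.
stuck_pods() {
  awk 'NR > 1 && $4 != "Running" && $4 != "Completed" { print $1, $2 }'
}

# Usage on the master (needs a configured kubectl, so it is left commented):
#   kubectl get pods --all-namespaces | stuck_pods |
#     while read -r ns name; do kubectl -n "$ns" describe pod "$name"; done
```

The filter relies on STATUS being the fourth column, which holds for the `--all-namespaces` listing; without that flag the NAMESPACE column is absent and the column index would need adjusting.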
Overall, this is a networking issue with your VMs/nodes. For Kubernetes to work properly, the infrastructure networking has to be set up correctly. Kubernetes expects that the nodes can see and communicate with each other, and each node should also have access to the internet. These are settings you need to review in VMware for each VM/node.
There may be a firewall enabled at the guest OS level in Ubuntu, which may again block some traffic. I recommend verifying that also.
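The three conditions above (node-to-node reachability, internet access, and no blocking firewall) can be spot-checked from the worker. This is only a sketch under assumptions: it uses bash's /dev/tcp feature, `MASTER_IP` is a placeholder for your master's address, 6443 is the default API server port for kubeadm, and k8s.gcr.io was the default kubeadm image registry around v1.14 (substitute whichever registry your cluster pulls from). The helper name `port_open` is hypothetical.

```shell
#!/usr/bin/env bash
# port_open HOST PORT (hypothetical helper): succeed if a TCP connection
# can be opened within 3 seconds, using bash's /dev/tcp pseudo-device.
port_open() {
  timeout 3 bash -c "exec 3<>/dev/tcp/$1/$2" 2>/dev/null
}

# Usage on the worker (left commented; MASTER_IP is a placeholder):
#   port_open "$MASTER_IP" 6443 && echo "API server reachable" \
#                               || echo "API server NOT reachable"
#   port_open k8s.gcr.io 443    && echo "image registry reachable" \
#                               || echo "image registry NOT reachable"
#   sudo ufw status             # reports whether Ubuntu's ufw firewall is active
```

A failed check against the master points at the VM network settings or a guest firewall; a failed check against the registry points at missing internet access from that node.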
Regards,
-Chris
-
Fair enough Chris. Thank you. I will work on that and retry the join. Thanks for the clarification on the docker part too.
-
On the worker node I got this message. I coded in the actual IP address, so I do not know why the message refers to localhost.
student@ubuntu:~$ kubectl get nodes
The connection to the server localhost:8080 was refused - did you specify the right host or port?
-
Hi:
Just to explain: this is not a real-time tech support forum. New students sometimes don't fully grok that.
You have 20 posts in the last few days, and the moderators have been rather gracious in spending the time to explain things. While this is their job, perhaps you could hold off on posting until you study the documentation more carefully and try various things, as well as search through the archives of the forum.
Of course, if you see things that are wrong in the course material, do not hesitate to post about that promptly, as it is gladly accepted.
Thanks for your consideration.
-
Thanks coop.
Yes, I assumed that on a forum the responses would not be real-time. I have been using the search feature before posting, hoping that someone has already asked my question and the answer is already there. And I do admit one of the issues was that I misread the prompt (a product of doing this after working hours and getting tired).
I also now have a teammate working on the course material starting soon. He is doing the cloud version whereas I am doing the VMWare version. Hopefully as he goes through this we can work together and solve our own issues.
As for reading documentation, I have gone through the class material up to where it says to do lab 3, and the labs up to lab 3.5. All the labs are self-documenting, so when something seemingly does not work it is difficult to know whether something is wrong, or information is missing, or something was simply assumed. (Many of the Kubernetes training programs I have been trying to get through make assumptions about environments, packages, knowledge of what should just happen, and differences between Linux distributions and Kubernetes products. I understand that this environment is changing quickly and continuously, which is also why I appreciate being pointed to other references.)
If I have over-stepped the usage of this forum, I do apologize and again thank the moderators for responding.
-
Hi @prenticect,
Please review the exercise to make sure that you ran that command on the correct node. Running it on the wrong node produces the "... connection ... refused ..." error output.
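The reference to localhost comes from kubectl itself: when it finds no kubeconfig file, it falls back to localhost:8080, regardless of any IP address used in the join command. kubeadm init prints the commands that copy /etc/kubernetes/admin.conf to $HOME/.kube/config on the master, and a worker normally has no such file. A minimal sketch of that check, assuming bash and kubectl's default config location; the helper name `has_kubeconfig` is hypothetical.

```shell
#!/usr/bin/env bash
# has_kubeconfig (hypothetical helper): succeed if kubectl would find a
# config file, either through the KUBECONFIG environment variable or at
# the default path ~/.kube/config.
has_kubeconfig() {
  if [ -n "${KUBECONFIG:-}" ]; then
    # KUBECONFIG may hold several colon-separated paths; accept any that exists.
    local IFS=: path
    for path in $KUBECONFIG; do
      [ -f "$path" ] && return 0
    done
    return 1
  fi
  [ -f "$HOME/.kube/config" ]
}

if has_kubeconfig; then
  echo "kubeconfig found: kubectl should talk to the configured API server"
else
  echo "no kubeconfig: kubectl will default to localhost:8080"
fi
```

If this reports no kubeconfig on the node where you ran `kubectl get nodes`, you are most likely on the worker, and the command should be run on the master instead.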
Renaming your nodes may also help in the future. Rather than having both named "ubuntu", one could be named "master" and the other "worker". When working on both nodes at the same time this will save you some headaches.
Regards,
-Chris
