kubeadm join ISSUE: [discovery] Failed to request cluster info, will try again:
Hi,
I've been trying to add a worker node to the cluster. I've followed the doc but I'm hitting this issue and I can't find a way past it. I've obviously missed something. Here is the command and error:
PART1
root@ip-172-31-18-206:~# kubeadm join --token od1wg1.a9wd79hstxz3ll4z 172.31.19.37:6443 --discovery-token-ca-cert-hash sha256:4aed0a78c329495d91e031a336668ccaf07528c84b7120f230f2f 161a98e7693 --v=2 I1018 15:46:36.761858 25485 join.go:367] [preflight] found NodeName empty; using OS hostname as NodeName I1018 15:46:36.761930 25485 initconfiguration.go:105] detected and using CRI socket: /var/run/dockershim.sock [preflight] Running pre-flight checks I1018 15:46:36.762004 25485 preflight.go:90] [preflight] Running general checks I1018 15:46:36.762037 25485 checks.go:254] validating the existence and emptiness of directory /etc/kubernetes/manifests I1018 15:46:36.762083 25485 checks.go:292] validating the existence of file /etc/kubernetes/kubelet.conf I1018 15:46:36.762124 25485 checks.go:292] validating the existence of file /etc/kubernetes/bootstrap-kubelet.conf I1018 15:46:36.762140 25485 checks.go:105] validating the container runtime I1018 15:46:36.806159 25485 checks.go:131] validating if the service is enabled and active [WARNING IsDockerSystemdCheck]: detected "cgroupfs" as the Docker cgroup driver. The recommended driver is "systemd". Please follow the guide at https://kubernetes.io/docs/setup/cri/ I1018 15:46:36.858635 25485 checks.go:341] validating the contents of file /proc/sys/net/bridge/bridge-nf-call-iptables I1018 15:46:36.858693 25485 checks.go:341] validating the contents of file /proc/sys/net/ipv4/ip_forward I1018 15:46:36.858729 25485 checks.go:653] validating whether swap is enabled or not I1018 15:46:36.858762 25485 checks.go:382] validating the presence of executable ip I1018 15:46:36.858793 25485 checks.go:382] validating the presence of executable iptables I1018 15:46:36.858813 25485 checks.go:382] validating the presence of executable mount I1018 15:46:36.858834 25485 checks.go:382] validating the presence of executable nsenter I1018 15:46:36.858851 25485 checks.go:382] validating the presence of executable ebtables I1018 15:46:36.858870 25485 checks.go:382] validating the presence of executable ethtool I1018 15:46:36.858891 25485 checks.go:382] validating the presence of executable socat I1018 15:46:36.858910 25485 checks.go:382] validating the presence of executable tc I1018 15:46:36.858927 25485 checks.go:382] validating the presence of executable touch I1018 15:46:36.858950 25485 checks.go:524] running all checks I1018 15:46:36.873553 25485 checks.go:412] checking whether the given node name is reachable using net.LookupHost I1018 15:46:36.882411 25485 checks.go:622] validating kubelet version I1018 15:46:36.937337 25485 checks.go:131] validating if the service is enabled and active I1018 15:46:36.943627 25485 checks.go:209] validating availability of port 10250 I1018 15:46:36.943778 25485 checks.go:292] validating the existence of file /etc/kubernetes/pki/ca.crt I1018 15:46:36.943797 25485 checks.go:439] validating if the connectivity type is via proxy or direct I1018 15:46:36.943826 25485 join.go:427] [preflight] Discovering cluster-info I1018 15:46:36.944224 25485 token.go:200] [discovery] Trying to connect to API Server "172.31.19.37:6443" I1018 15:46:36.944877 25485 token.go:75] [discovery] Created cluster-info discovery client, requesting info from "https://172.31.19.37:6443" I1018 15:47:06.945803 25485 token.go:83] [discovery] Failed to request cluster info, will try again: [Get https://172.31.19.37:6443/api/v1/namespaces/kube-public/configmaps/cluster-info: dial tcp 172.31.19.37:6443: i/o timeout] I1018 15:47:41.946445 25485 token.go:83] [discovery] Failed to request cluster info, will try again: [Get https://172.31.19.37:6443/api/v1/namespaces/kube-public/configmaps/cluster-info: dial tcp 172.31.19.37:6443: i/o timeout] ^C root@ip-172-31-18-206:~#
I'm able to telnet to port 22 from the worker to the master:
root@ip-172-31-18-206:~# telnet 172.31.19.37 22 Trying 172.31.19.37... Connected to 172.31.19.37. Escape character is '^]'. SSH-2.0-OpenSSH_7.2p2 Ubuntu-4ubuntu2.8 ^] telnet> quit Connection closed. root@ip-172-31-18-206:~# root@ip-172-31-18-206:~# telnet 172.31.19.37 6443 Trying 172.31.19.37...
PART2 with more details follows once it gets approved.
Answers
-
Hi @dmccuk,
Similar discussions have been posted recently in the forum, where a second node fails to join the cluster.
From your output, the failure is a timeout when accessing port 6443 on the master node.
Port 22 is irrelevant in this scenario since Kubernetes uses lots of different individual port numbers and port ranges - and 6443 is one of them.Read carefully the special instructions at the beginning of Lab exercise 3.1. These instructions are critical in setting up your infrastructure's networking (firewall rules) for inter-node communication.
Regards,
-Chris0 -
Hi Chris,
Thanks for your message. I worked out what I hadn't done. I'll write it here so others can benefit:
1) In AWS, create a new security group and open up all the ports.
2) Select one of your Kubernetes instances --> actions --> networking
3) Tick the new kubernetes group, adding it to your instance.
4) Repeat for all the other kubernetes instances.
5) Retry the failing command.I hope that helps.
Dennis
1 -
PART2:
The FW is off on both the worker and the master:
WORKER:
root@ip-172-31-18-206:~# sudo ufw status Status: inactive root@ip-172-31-18-206:~# root@ip-172-31-18-206:~# service ufw status ● ufw.service - Uncomplicated firewall Loaded: loaded (/lib/systemd/system/ufw.service; enabled; vendor preset: enabled) Active: inactive (dead) since Fri 2019-10-18 16:00:34 UTC; 1min 13s ago Process: 26404 ExecStop=/lib/ufw/ufw-init stop (code=exited, status=0/SUCCESS) Main PID: 396 (code=exited, status=0/SUCCESS) Oct 18 14:51:35 ubuntu systemd[1]: Started Uncomplicated firewall. Oct 18 16:00:34 ip-172-31-18-206 systemd[1]: Stopping Uncomplicated firewall... Oct 18 16:00:34 ip-172-31-18-206 ufw-init[26404]: Skip stopping firewall: ufw (not enabled) Oct 18 16:00:34 ip-172-31-18-206 systemd[1]: Stopped Uncomplicated firewall. Warning: Journal has been rotated since unit was started. Log output is incomplete or unavailable. root@ip-172-31-18-206:~#
MASTER:
ubuntu@ip-172-31-19-37:~$ sudo ufw status Status: inactive ubuntu@ip-172-31-19-37:~$ sudo service ufw status ● ufw.service - Uncomplicated firewall Loaded: loaded (/lib/systemd/system/ufw.service; enabled; vendor preset: enabled) Active: inactive (dead) since Fri 2019-10-18 16:02:49 UTC; 14s ago Process: 6637 ExecStop=/lib/ufw/ufw-init stop (code=exited, status=0/SUCCESS) Main PID: 379 (code=exited, status=0/SUCCESS) Oct 18 14:51:23 ubuntu systemd[1]: Started Uncomplicated firewall. Oct 18 16:02:49 ip-172-31-19-37 systemd[1]: Stopping Uncomplicated firewall... Oct 18 16:02:49 ip-172-31-19-37 ufw-init[6637]: Skip stopping firewall: ufw (not enabled) Oct 18 16:02:49 ip-172-31-19-37 systemd[1]: Stopped Uncomplicated firewall. Warning: Journal has been rotated since unit was started. Log output is incomplete or unavailable. ubuntu@ip-172-31-19-37:~$
Here is the master namespaces:
ubuntu@ip-172-31-19-37:~$ kubectl get pods --all-namespaces NAMESPACE NAME READY STATUS RESTARTS AGE kube-system calico-node-9zmmr 2/2 Running 0 63m kube-system coredns-fb8b8dccf-mbg2w 1/1 Running 0 65m kube-system coredns-fb8b8dccf-nbm88 1/1 Running 0 65m kube-system etcd-ip-172-31-19-37 1/1 Running 0 64m kube-system kube-apiserver-ip-172-31-19-37 1/1 Running 0 64m kube-system kube-controller-manager-ip-172-31-19-37 1/1 Running 0 64m kube-system kube-proxy-tztvb 1/1 Running 0 65m kube-system kube-scheduler-ip-172-31-19-37 1/1 Running 0 64m
I've been through this link and the steps I'm taking are identical:
https://kubernetes.io/docs/setup/production-environment/tools/kubeadm/create-cluster-kubeadm/#pod-networkThe TOKEN and openssl key I'm using in my join command:
ubuntu@ip-172-31-19-37:~$ kubeadm token list TOKEN TTL EXPIRES USAGES DESCRIPTION EXTRA GROUPS od1wg1.a9wd79hstxz3ll4z 22h 2019-10-19T14:58:11Z authentication,signing The default bootstrap token generated by 'kubeadm init'. system:bootstrappers:kubeadm:default-node-token ubuntu@ip-172-31-19-37:~$ ubuntu@ip-172-31-19-37:~$ openssl x509 -pubkey -in /etc/kubernetes/pki/ca.crt | openssl rsa -pubin -outform der 2>/dev/null | openssl dgst -sha256 -hex | sed 's/^.* //' 4aed0a78c329495d91e031a336668ccaf07528c84b7120f230f2f161a98e7693 ubuntu@ip-172-31-19-37:~$
NETSTAT from the master:
ubuntu@ip-172-31-19-37:~$ netstat -tnlp (Not all processes could be identified, non-owned process info will not be shown, you would have to be root to see it all.) Active Internet connections (only servers) Proto Recv-Q Send-Q Local Address Foreign Address State PID/Program name tcp 0 0 127.0.0.1:9099 0.0.0.0:* LISTEN - tcp 0 0 172.31.19.37:2379 0.0.0.0:* LISTEN - tcp 0 0 127.0.0.1:2379 0.0.0.0:* LISTEN - tcp 0 0 172.31.19.37:2380 0.0.0.0:* LISTEN - tcp 0 0 127.0.0.1:10257 0.0.0.0:* LISTEN - tcp 0 0 127.0.0.1:43122 0.0.0.0:* LISTEN - tcp 0 0 127.0.0.1:10259 0.0.0.0:* LISTEN - tcp 0 0 0.0.0.0:22 0.0.0.0:* LISTEN - tcp 0 0 127.0.0.1:42623 0.0.0.0:* LISTEN - tcp 0 0 127.0.0.1:10248 0.0.0.0:* LISTEN - tcp 0 0 127.0.0.1:10249 0.0.0.0:* LISTEN - tcp6 0 0 :::10250 :::* LISTEN - tcp6 0 0 :::10251 :::* LISTEN - tcp6 0 0 :::6443 :::* LISTEN - tcp6 0 0 :::10252 :::* LISTEN - tcp6 0 0 :::10256 :::* LISTEN - tcp6 0 0 :::22 :::* LISTEN -
I'm stuck! If anyone can help I would really appreciate it!
0
Categories
- All Categories
- 206 LFX Mentorship
- 206 LFX Mentorship: Linux Kernel
- 733 Linux Foundation IT Professional Programs
- 339 Cloud Engineer IT Professional Program
- 165 Advanced Cloud Engineer IT Professional Program
- 66 DevOps Engineer IT Professional Program
- 132 Cloud Native Developer IT Professional Program
- 119 Express Training Courses
- 119 Express Courses - Discussion Forum
- 5.9K Training Courses
- 40 LFC110 Class Forum - Discontinued
- 66 LFC131 Class Forum
- 39 LFD102 Class Forum
- 219 LFD103 Class Forum
- 17 LFD110 Class Forum
- 32 LFD121 Class Forum
- 17 LFD133 Class Forum
- 6 LFD134 Class Forum
- 17 LFD137 Class Forum
- 70 LFD201 Class Forum
- 3 LFD210 Class Forum
- 2 LFD210-CN Class Forum
- 2 LFD213 Class Forum - Discontinued
- 128 LFD232 Class Forum - Discontinued
- 1 LFD233 Class Forum
- 2 LFD237 Class Forum
- 23 LFD254 Class Forum
- 683 LFD259 Class Forum
- 109 LFD272 Class Forum
- 3 LFD272-JP クラス フォーラム
- 10 LFD273 Class Forum
- 95 LFS101 Class Forum
- LFS111 Class Forum
- 2 LFS112 Class Forum
- 1 LFS116 Class Forum
- 3 LFS118 Class Forum
- 2 LFS142 Class Forum
- 3 LFS144 Class Forum
- 3 LFS145 Class Forum
- 1 LFS146 Class Forum
- 2 LFS147 Class Forum
- 8 LFS151 Class Forum
- 1 LFS157 Class Forum
- 10 LFS158 Class Forum
- 4 LFS162 Class Forum
- 1 LFS166 Class Forum
- 3 LFS167 Class Forum
- 1 LFS170 Class Forum
- 1 LFS171 Class Forum
- 2 LFS178 Class Forum
- 2 LFS180 Class Forum
- 1 LFS182 Class Forum
- 4 LFS183 Class Forum
- 30 LFS200 Class Forum
- 737 LFS201 Class Forum - Discontinued
- 2 LFS201-JP クラス フォーラム
- 17 LFS203 Class Forum
- 112 LFS207 Class Forum
- 1 LFS207-DE-Klassenforum
- LFS207-JP クラス フォーラム
- 301 LFS211 Class Forum
- 55 LFS216 Class Forum
- 49 LFS241 Class Forum
- 43 LFS242 Class Forum
- 37 LFS243 Class Forum
- 13 LFS244 Class Forum
- 1 LFS245 Class Forum
- 45 LFS250 Class Forum
- 1 LFS250-JP クラス フォーラム
- LFS251 Class Forum
- 143 LFS253 Class Forum
- LFS254 Class Forum
- LFS255 Class Forum
- 6 LFS256 Class Forum
- LFS257 Class Forum
- 1.2K LFS258 Class Forum
- 9 LFS258-JP クラス フォーラム
- 114 LFS260 Class Forum
- 152 LFS261 Class Forum
- 41 LFS262 Class Forum
- 82 LFS263 Class Forum - Discontinued
- 15 LFS264 Class Forum - Discontinued
- 11 LFS266 Class Forum - Discontinued
- 23 LFS267 Class Forum
- 18 LFS268 Class Forum
- 29 LFS269 Class Forum
- 199 LFS272 Class Forum
- 1 LFS272-JP クラス フォーラム
- LFS274 Class Forum
- 3 LFS281 Class Forum
- 2 LFW111 Class Forum
- 257 LFW211 Class Forum
- 176 LFW212 Class Forum
- 12 SKF100 Class Forum
- SKF200 Class Forum
- 791 Hardware
- 199 Drivers
- 68 I/O Devices
- 37 Monitors
- 98 Multimedia
- 174 Networking
- 91 Printers & Scanners
- 85 Storage
- 754 Linux Distributions
- 82 Debian
- 67 Fedora
- 16 Linux Mint
- 13 Mageia
- 23 openSUSE
- 147 Red Hat Enterprise
- 31 Slackware
- 13 SUSE Enterprise
- 351 Ubuntu
- 464 Linux System Administration
- 39 Cloud Computing
- 70 Command Line/Scripting
- Github systems admin projects
- 91 Linux Security
- 78 Network Management
- 101 System Management
- 47 Web Management
- 56 Mobile Computing
- 17 Android
- 28 Development
- 1.2K New to Linux
- 1K Getting Started with Linux
- 365 Off Topic
- 113 Introductions
- 171 Small Talk
- 20 Study Material
- 522 Programming and Development
- 291 Kernel Development
- 213 Software Development
- 1.1K Software
- 212 Applications
- 180 Command Line
- 3 Compiling/Installing
- 405 Games
- 311 Installation
- 79 All In Program
- 79 All In Forum
Upcoming Training
-
August 20, 2018
Kubernetes Administration (LFS458)
-
August 20, 2018
Linux System Administration (LFS301)
-
August 27, 2018
Open Source Virtualization (LFS462)
-
August 27, 2018
Linux Kernel Debugging and Security (LFD440)