Welcome to the Linux Foundation Forum!

Worker node can't join

Hello,
The worker node gets a timeout when trying to join the cluster. The CP node seems up and running.
Here the worker output:

Failed to request cluster-info, will try again: 
Get "https://k8scp:6443/api/v1/namespaces/kube-public/configmaps/cluster-info?timeout=10s": 
x509: certificate is valid for dl-dt-03, kubernetes, kubernetes.default, kubernetes.default.svc, 
kubernetes.default.svc.cluster.local, not k8scp

Any idea what goes wrong? The full output is here: https://x0.at/M_i_.txt

Comments

  • On the CP node, the /etc/hosts file is:

    $ cat /etc/hosts
    127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
    ::1         localhost localhost.localdomain localhost6 localhost6.localdomain6
    172.168.1.6          LN-TW-01.resuniv.lan LN-TW-01
    172.168.1.7          HP-TW-01.resuniv.lan HP-TW-01
    172.167.1.254        DL-DT-03.resuniv.lan DL-DT-03
    172.167.1.254   k8scp
    

    there's a typo on the 167, I guess it was it

  • Fixed:

    # kubeadm reset
    # rm -rf /var/lib/etcd/
    # kubeadm init --config=kubeadm-config.yaml --upload-certs | tee kubeadm-init.out
    

Categories

Upcoming Training