Issue on 2019-11-05 version Lab 3.2 problem

The exercise covers the configuration of a local Docker registry.

I am doing the lab on a bare-metal cluster running Ubuntu 18.04, configured via the k8sMaster and k8sSecond scripts.
Two nodes, one of which is an untainted master.

The problem occurs at step 21, where we are supposed to access the "registry" service via curl directly from the master node's shell.
In my case the connection hangs until the timeout is reached. I have also disabled ufw, so there shouldn't be any firewall problem, as mentioned in the lab.
Since the "registry" service is of type ClusterIP, I was not surprised to be unable to access it directly from the shell, as I understood that ClusterIP services are accessible only from inside Kubernetes.
But from the description of step 21 it seems that we should be able to access the service's IP:port directly from the shell.
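
For reference, this is roughly what I am doing (the ClusterIP below is just an example from my cluster; I read the real one from kubectl):

    kubectl get svc registry             # shows the service's ClusterIP and port
    curl http://10.97.40.62:5000/v2/     # registry v2 API endpoint; in my case this hangs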

Am I doing anything wrong?
Could anyone confirm that a ClusterIP service's address and port should be directly reachable from the master machine's shell?

Comments

  • chrispokorni

    Hi @gfalasca,

You are correct: you should be able to reach a ClusterIP by curling from either node of your cluster, just as described in the exercise.

Did you encounter any issues in the previous Lab exercise 2.1, possibly at step 10, where you were also directed to curl a ClusterIP from the master node?

    Regards,
    -Chris

  • gfalasca

    Hi Chris, thanks a lot.
Yes, I had the problem there as well. I went forward anyway, but the issue was already present at step 7 of Lab 2.3, where there is a curl to the basic pod's main container listening on port 80.

I double-checked: AppArmor is down, SELinux is not installed, and ufw is also disabled.
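
    These are roughly the checks I ran (standard Ubuntu commands):

      sudo aa-status                                    # AppArmor profiles; fails if AppArmor is not running
      getenforce 2>/dev/null || echo "SELinux absent"   # the tool itself is missing when SELinux is not installed
      sudo ufw status                                   # reports "Status: inactive" when disabled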

Currently the status of the kube-system pods is OK, but I noticed a few things that may (or may not) be correlated with the problem I face:

1. After the installation, the calico-node pods were 0/1 ready. It seems the bird-ready readiness probe was failing, so I tried commenting it out at line 657 of calico.yaml. After that they started correctly, but sometimes one of them falls into an error state, even after hours of operation. Deleting and redeploying it seems to restore its readiness.

2. Even though it shouldn't affect the curl to the service, since I am using the ClusterIP address directly, the coredns pods also seem a bit unstable; sometimes I find one of them in CrashLoopBackOff state.

      coredns-5644d7b6d9-5r4x4 0/1 CrashLoopBackOff 20 2d

After a while the pod apparently becomes ready and running again.
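
    When that happens, these are the commands I use to look into it (pod names are from my cluster):

      kubectl -n kube-system logs coredns-5644d7b6d9-5r4x4 --previous   # logs of the crashed container instance
      kubectl -n kube-system describe pod calico-node-87vj7             # events and readiness probe failures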

    kubernetes@dev2:~$ k get pod -n kube-system
    
    NAME                                       READY   STATUS    RESTARTS   AGE
    calico-kube-controllers-6bbf58546b-btk9d   1/1     Running   1          23h
    calico-node-87vj7                          1/1     Running   4          18h
    calico-node-n856s                          1/1     Running   0          103s
    coredns-5644d7b6d9-5r4x4                   1/1     Running   19         2d
    coredns-5644d7b6d9-78jng                   1/1     Running   0          104s
    etcd-dev2                                  1/1     Running   12         2d
    kube-apiserver-dev2                        1/1     Running   13         2d
    kube-controller-manager-dev2               1/1     Running   12         2d
    kube-proxy-skjqf                           1/1     Running   1          2d
    kube-proxy-x62v9                           1/1     Running   16         2d
    kube-scheduler-dev2                        1/1     Running   12         2d
    

    Any idea about other checks I can do?
Maybe the machine where the master is installed is too small: it's a physical server with only 2 GB of RAM, but only the Kubernetes master runs on it.

  • chrispokorni

    You may have just answered your own question :)
It is safe to have 4 GB of memory for every 1 CPU, but the exercises will run faster with 2 CPUs and 8 GB of memory, according to the Overview section of Lab exercise 2.1.

Commenting out a readiness probe is just like flu medicine - it only masks the symptoms, while the flu is still there.

    Regards,
    -Chris

  • gfalasca

Thanks a lot, Chris. Strangely enough, everything works as expected from the worker node: I can curl both the basic pod and the registry service without problems.
From the master, instead, it doesn't work. I will try moving the master to a bigger machine as you suggested, but it seems weird that everything works from the worker node.

    Thanks and regards

  • gfalasca

Hi @chrispokorni, the problem I am having seems very similar to the issue reported by @rcougil in "Issue with worker node on Lab 3.2 Step 30 (worker pull from registry)".
    Before moving the master to a bigger machine I did some further analysis and I noticed the following:

• The IP address + container port of a pod's container seems to be reachable only from the node where the pod is deployed (per the -o wide output). From the other machine of my two-node cluster, curling that address:port hangs until the connection timeout is reached; sniffing the packets reveals only SYN packets.
• The ClusterIP + port of a service that has a single endpoint (like the basicservice and registry services of the lab) seems to be perfectly reachable, but only from the node where the pod is actually deployed. (See the quick checks after this list.)
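
    These are roughly the checks behind the two observations above (the pod IP below is just an example; I take the real one from the -o wide output):

      kubectl get pod -o wide                                      # shows each pod's IP and hosting node
      curl http://192.168.166.152/                                 # works from the hosting node, hangs from the other one
      sudo tcpdump -ni any host 192.168.166.152 and tcp port 80    # run on the failing node: only SYNs, no replies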

@rcougil, maybe you could verify whether in your case the ClusterIP services behave in the same way?

To me it still seems to be a network-related problem, as everything else seems to work (scaling deployments, creation of resources, ...).
In the next few days I will migrate the master to a bigger machine with at least 4 GB of RAM per core, as suggested by @chrispokorni, and will let you know if the issue disappears.

  • rcougil

Hi @gfalasca, yes, same issue. I'm pretty sure the lab instructions are wrong. A service of type ClusterIP cannot be reached from the worker node; it would only be accessible from a Pod running on the worker node, not from the node itself.

This K8s course is the crappiest e-learning course I've done in my entire life. It is extremely overpriced for its content (or the lack of it), and it is full of errata through to the end. Very disappointed; I encourage colleagues not to buy it under any circumstances.

  • serewicz

    Hello,

Thank you for your feedback. I was wondering if you used the setup videos when configuring your lab environment? I'd like to make sure the directions are clear and concise. Would you be so kind as to let me know what you are using for your lab environment, such as GCE, AWS, VirtualBox, etc.? Did you ensure that there are no firewalls for the nodes?

I have just run these steps again, and they worked. Would it help if I recorded the steps via screen capture so you could see the process? Perhaps that could help figure out why the steps are not working for you.

    Kind regards,

  • chrispokorni

    Hi @gfalasca @rcougil,

Please be aware that infrastructure networking configuration plays a key role in the behavior of your Kubernetes cluster. Disabling firewall services at the node OS level may not be sufficient. As you set up your VM instances, you have to ensure that infrastructure-level firewalls allow ingress traffic from all sources to all ports for all protocols.

If in the cloud, this requires one custom VPC network with one all-open ingress firewall rule (or an all-open Security Group on AWS).

    If on a local hypervisor, enabling traffic from all sources is achieved by configuring the networking settings at the hypervisor level.

Without any firewalls in place (at the infrastructure and OS levels), a Service ClusterIP is accessible from any node in the cluster. Pod IP addresses should also be reachable from any node, regardless of the node where the Pod is running.
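
    As an illustration only, on GCP such a setup could look roughly like this (names are placeholders):

      gcloud compute networks create lab-net --subnet-mode=auto
      gcloud compute firewall-rules create lab-net-allow-all \
          --network=lab-net --direction=INGRESS \
          --action=ALLOW --rules=all --source-ranges=0.0.0.0/0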

    Regards,
    -Chris

  • eugeneng

Hi, I have the same setup and problem as @gfalasca.

Wondering if there is a solution to this? Should I configure the service to use NodePort instead of ClusterIP?

    TIA!

  • bkclements

    Hi everyone,

I completed Lab 3.3 a few days ago and did not have any problems using the ClusterIP for registry access from both the master and worker 'hosts'.

I am running on bare metal: a 16 GB master and an 8 GB worker (though I started with 4 GB each), both running Ubuntu 18. These are Dell 9020Ms; I got six of them relatively cheap from the Dell Refurbished website. I see they have another batch on sale right now, and with a coupon code you can get around 40% off the listed price.

Anyway, in @gfalasca's post there is a large number of restarts, which I find concerning. In my case I ran the k8sMaster.sh setup 5 days ago, then had a power failure 2 days ago due to an ice storm.

I just restarted everything (swapoff -a; service kubelet start) on both nodes, and this is what I see from the master node (node2):

    root@node2:~/lfd259/LFD259/SOLUTIONS# kc get pods -A -o wide
    NAMESPACE     NAME                                       READY   STATUS    RESTARTS   AGE     IP                NODE    NOMINATED NODE   READINESS GATES
    default       nginx-595f85746d-p44hg                     1/1     Running   2          3d20h   192.168.166.152   node1   <none>           <none>
    default       registry-cbc9b4779-jt5hl                   1/1     Running   2          3d20h   192.168.166.153   node1   <none>           <none>
    kube-system   calico-kube-controllers-6bbf58546b-fh995   1/1     Running   2          5d19h   192.168.104.24    node2   <none>           <none>
    kube-system   calico-node-6kksh                          1/1     Running   2          5d19h   10.99.10.3        node2   <none>           <none>
    kube-system   calico-node-bjf77                          1/1     Running   2          5d19h   10.99.10.2        node1   <none>           <none>
    kube-system   coredns-5644d7b6d9-2hbk7                   1/1     Running   2          5d19h   192.168.104.22    node2   <none>           <none>
    kube-system   coredns-5644d7b6d9-zbndc                   1/1     Running   2          5d19h   192.168.104.19    node2   <none>           <none>
    kube-system   etcd-node2                                 1/1     Running   2          5d19h   10.99.10.3        node2   <none>           <none>
    kube-system   kube-apiserver-node2                       1/1     Running   2          5d19h   10.99.10.3        node2   <none>           <none>
    kube-system   kube-controller-manager-node2              1/1     Running   2          5d19h   10.99.10.3        node2   <none>           <none>
    kube-system   kube-proxy-9j6b2                           1/1     Running   2          5d19h   10.99.10.3        node2   <none>           <none>
    kube-system   kube-proxy-rdpnb                           1/1     Running   2          5d19h   10.99.10.2        node1   <none>           <none>
    kube-system   kube-scheduler-node2                       1/1     Running   2          5d19h   10.99.10.3        node2   <none>           <none>
    

In my case, the registry is running on the worker node (node1).

Here are the services:

    root@node2:~/lfd259/LFD259/SOLUTIONS# kc get service
    NAME         TYPE        CLUSTER-IP       EXTERNAL-IP   PORT(S)    AGE
    kubernetes   ClusterIP   10.96.0.1        <none>        443/TCP    5d20h
    nginx        ClusterIP   10.102.132.154   <none>        443/TCP    3d20h
    registry     ClusterIP   10.99.112.183    <none>        5000/TCP   3d20h
    

From the master node I can push an image to the registry that's running on the worker node, using the cluster IP:

    root@node2:~/lfd259/LFD259/SOLUTIONS# docker push 10.99.112.183:5000/simpleapp
    The push refers to repository [10.99.112.183:5000/simpleapp]
    d0348f584525: Pushed 
    a98ea9b99554: Pushed 
    03a3dc679282: Pushed 
    35fc403d4c4c: Pushed 
    c1fbc35a2660: Pushed 
    f63773c65620: Pushed 
    e6d60910d056: Pushed 
    b52c1c103fae: Pushed 
    6f1c84e6ec59: Pushed 
    dd5242c2dc8a: Pushed 
    latest: digest: sha256:46a671056aabf5c4a4ed0dc77e7fad5c209529037c8737f6e465d64aa01d01e9 size: 2428
    

As part of the lab, 'simpleapp' has already been run on both the worker and master nodes. If I switch to the worker node (node1) and pull from the registry using the cluster IP, the pull works and docker says the image hasn't changed:

    root@node1:~# docker pull 10.99.112.183:5000/simpleapp
    Using default tag: latest
    latest: Pulling from simpleapp
    Digest: sha256:46a671056aabf5c4a4ed0dc77e7fad5c209529037c8737f6e465d64aa01d01e9
    Status: Image is up to date for 10.99.112.183:5000/simpleapp:latest
    

The master node can reach the registry service running on the worker node, using the cluster IP, because kube-proxy has configured iptables to route the connection over the Calico network to the worker node.
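
    You can see those rules on any node; kube-proxy generates the chain names, so the easiest way is to grep for the addresses (a minimal sketch using the IPs from my cluster):

      sudo iptables -t nat -L KUBE-SERVICES -n | grep 10.99.112.183    # entry rule matching the service's cluster IP
      sudo iptables -t nat -L -n | grep 192.168.166.153                # the DNAT target: the registry pod's IP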

Here, on the master node, I try to ping all the cluster IP addresses listed above; all the pings fail:

    root@node2:~/lfd259/LFD259/SOLUTIONS# ping 10.96.0.1   
    PING 10.96.0.1 (10.96.0.1) 56(84) bytes of data.        
    ^C
    --- 10.96.0.1 ping statistics ---                                               
    2 packets transmitted, 0 received, 100% packet loss, time 1028ms                                                                                                         
    
    root@node2:~/lfd259/LFD259/SOLUTIONS# ping 10.102.132.154                                                                                       
    PING 10.102.132.154 (10.102.132.154) 56(84) bytes of data.                                                                                                                                    
    ^C                  
    --- 10.102.132.154 ping statistics ---                                                                  
    2 packets transmitted, 0 received, 100% packet loss, time 1011ms                                           
    
    root@node2:~/lfd259/LFD259/SOLUTIONS# ping 10.99.112.183                                                                                                                                     
    PING 10.99.112.183 (10.99.112.183) 56(84) bytes of data.
    ^C                                                                                                                           
    --- 10.99.112.183 ping statistics ---                                                                                                                                                         
    1 packets transmitted, 0 received, 100% packet loss, time 0ms                    
    

However, a TCP connection to 10.99.112.183 port 5000 does work, due to the iptables setup on the master node (and worker node).
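
    A quick way to check that without involving docker (nc comes from the netcat package):

      ping -c 2 10.99.112.183     # fails: nothing answers ICMP for a virtual service IP
      nc -vz 10.99.112.183 5000   # succeeds: the TCP connect is DNAT'ed to the registry pod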

@rcougil said that cluster IP services are only reachable from within a pod and not from the node itself, but I've shown above that it does work, and @chrispokorni also said it should work as long as no blocking is occurring at the infrastructure level.

This article partly explains why it works, though the diagrams don't clearly show that iptables does its work at the node level:

    https://medium.com/google-cloud/understanding-kubernetes-networking-services-f0cb48e4cc82

  • gfalasca

Hi everyone, I did some further investigation and came to a conclusion; can anyone confirm the following?

About the reachability of a ClusterIP, my understanding is identical to @rcougil's: in Kubernetes a ClusterIP shouldn't be reachable from outside the cluster, and the master and worker node shells should be considered outside the cluster.

In the lab installation scripts we installed the Calico network on top of Kubernetes (Calico v3.9.4 in my case), and as explained in the Project Calico documentation, "Calico v3.4 introduces the ability to advertise Kubernetes service cluster IP routes over BGP, making Kubernetes services accessible outside of the Kubernetes cluster without the need for a dedicated load balancer."

So my understanding is that in standard Kubernetes it shouldn't be possible to access a ClusterIP from the node machines or from outside the cluster, but Calico >= 3.4 provides this feature on its own.
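
    A couple of checks I found useful on the nodes (assuming the default Calico/BIRD setup from the lab scripts):

      ip route show proto bird    # routes that Calico's BIRD agent has programmed on this node
      sudo ss -tlnp | grep 179    # BIRD should be listening for BGP peers on TCP port 179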

Now, I don't know yet why the Calico installation on my bare-metal Ubuntu cluster doesn't work properly, but something is definitely wrong on the Calico side, as the restarts of the calico pods (also noticed by @bkclements) testify.

  • chrispokorni

    Hi @bkclements,
    Great work! Your detailed post is much appreciated!

    Regards,
    -Chris

  • chrispokorni

    Hi @gfalasca,

    You are correct about the reachability of a ClusterIP - it should not be reachable from outside the cluster.

However, nodes are "inside" the cluster, being Kubernetes API resources themselves - therefore both master and worker node shells are inside the cluster and should be able to reach any service in it.

    Regards,
    -Chris

  • chrispokorni

    Hi @eugeneng,

As mentioned earlier, that is not normal or expected behavior for a Kubernetes cluster. Such behavior is consistent with network-related issues between the Kubernetes cluster nodes - typically infrastructure network firewall rules or security groups. As confirmed in @bkclements' detailed post, the ClusterIP should be accessible from any node in the cluster.

    Regards,
    -Chris

  • gfalasca

Hi @eugeneng, all,
in the end I figured out what the problem was by digging into Calico.

As mentioned, I am using two bare-metal machines with Ubuntu 18.04. The master is small in my case (only 2 GB of RAM), but that doesn't seem to be a problem for now.

In my case the two calico-node pods were starting but not becoming ready, because Calico's bird readiness probe was failing.

I noticed that TCP port 179 (which must be open on every node for Calico to work properly) was open on the master node but not on the worker. Once I opened it on the worker, the calico-node pods reached the ready state and everything started working properly.
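
    In case it helps others, these are roughly the commands I used to confirm and fix it (the worker IP is illustrative; adjust to whichever firewall mechanism is actually active on your node):

      nc -vz 10.99.10.2 179                                  # from the master: test BGP reachability to the worker
      sudo iptables -I INPUT -p tcp --dport 179 -j ACCEPT    # on the worker: accept incoming BGP if iptables was blocking it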
